Attention – Page 7 – GrooveSquid.com

July 13, 2025

CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention by Han Li, Fei Liu, Zhi Zheng, Yu Zhang,…

July 13, 2025

Learner Attentiveness and Engagement Analysis in Online Education Using Computer Vision by Sharva Gogawale, Madhura Deshpande,…

July 13, 2025

MOSABench: Multi-Object Sentiment Analysis Benchmark for Evaluating Multimodal Large Language Models Understanding of Complex Image by…

July 13, 2025

Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers by Chancharik Mitra, Brandon Huang,…

July 13, 2025

SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation by Yuhan Pei, Ruoyu Wang, Yongqi…

July 13, 2025

MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement by Xiwei Deng, Xianchun He,…

July 13, 2025

DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models by Yudong Zhang, Ruobing Xie,…

July 13, 2025

Arabic-Nougat: Fine-Tuning Vision Transformers for Arabic OCR and Markdown Extraction by Mohamed Rashad. First submitted to arXiv…

July 13, 2025

An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition by…

July 13, 2025

Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors by Zhengfei Kuang, Tianyuan Zhang, Kai…