Summary of CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention, by Han Li et al.
CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention by Han Li, Fei Liu, Zhi Zheng, Yu Zhang,…
Learner Attentiveness and Engagement Analysis in Online Education Using Computer Vision by Sharva Gogawale, Madhura Deshpande,…
MOSABench: Multi-Object Sentiment Analysis Benchmark for Evaluating Multimodal Large Language Models Understanding of Complex Image by…
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers by Chancharik Mitra, Brandon Huang,…
SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation by Yuhan Pei, Ruoyu Wang, Yongqi…
MvKeTR: Chest CT Report Generation with Multi-View Perception and Knowledge Enhancement by Xiwei Deng, Xianchun He,…
DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models by Yudong Zhang, Ruobing Xie,…
Arabic-Nougat: Fine-Tuning Vision Transformers for Arabic OCR and Markdown Extraction by Mohamed Rashad. First submitted to arXiv…
An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition by…
Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors by Zhengfei Kuang, Tianyuan Zhang, Kai…