Summary of Vision Language Models Are Blind, by Pooyan Rahmanzadehgervi et al.
Vision language models are blindby Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti NguyenFirst…
Vision language models are blindby Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti NguyenFirst…
TVR-Ranking: A Dataset for Ranked Video Moment Retrieval with Imprecise Queriesby Renjie Liang, Li Li,…
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understandingby Wenhao Xu, Wenming Weng, Yueyi Zhang, Zhiwei…
Reasoning about unpredicted change and explicit timeby Florence Dupin de Saint-Cyr, Jérôme LangFirst submitted to…
SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-trainingby Nan He, Weichen…
Collaborative Design of AI-Enhanced Learning Activitiesby Margarida RomeroFirst submitted to arxiv on: 9 Jul 2024CategoriesMain:…
TriQXNet: Forecasting Dst Index from Solar Wind Data Using an Interpretable Parallel Classical-Quantum Framework with…
Games played by Exponential Weights Algorithmsby Maurizio d'Andrea, Fabien Gensbittel, Jérôme RenaultFirst submitted to arxiv…
Deep-Motion-Net: GNN-based volumetric organ shape reconstruction from single-view 2D projectionsby Isuru Wijesinghe, Michael Nix, Arezoo…
iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicineby Anastasia Krithara, Fotis Aisopos, Vassiliki Rentoumi,…