Summary of Image Retrieval Methods in the Dissimilarity Space, by Madhu Kiran et al.
Image Retrieval Methods in the Dissimilarity Spaceby Madhu Kiran, Kartikey Vishnu, Rafael M. O. Cruz,…
Image Retrieval Methods in the Dissimilarity Spaceby Madhu Kiran, Kartikey Vishnu, Rafael M. O. Cruz,…
Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Modelsby Vahid Balazadeh, Mohammadmehdi…
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigationby Mingfei Han, Liang Ma, Kamila Zhumakhanova, Ekaterina Radionova,…
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptionsby Jiarui Zhang, Ollie Liu, Tianyu Yu,…
In-Context Learning with Topological Information for Knowledge Graph Completionby Udari Madhushani Sehwag, Kassiani Papasotiriou, Jared…
Mobile Video Diffusionby Haitam Ben Yahia, Denis Korzhenkov, Ioannis Lelekas, Amir Ghodrati, Amirhossein HabibianFirst submitted…
Multimodal Contextualized Support for Enhancing Video Retrieval Systemby Quoc-Bao Nguyen-Le, Thanh-Huy Le-NguyenFirst submitted to arxiv…
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphsby Xiaqiang Tang, Jian…
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotationsby Linke Ouyang, Yuan Qu, Hongbin Zhou,…
Piece of Table: A Divide-and-Conquer Approach for Selecting Subtables in Table Question Answeringby Wonjin Lee,…