Summary of Llava-next-interleave: Tackling Multi-image, Video, and 3d in Large Multimodal Models, by Feng Li et al.
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Modelsby Feng Li, Renrui Zhang, Hao…
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Modelsby Feng Li, Renrui Zhang, Hao…
Deconstructing What Makes a Good Optimizer for Language Modelsby Rosie Zhao, Depen Morwani, David Brandfonbrener,…
Training on the Test Task Confounds Evaluation and Emergenceby Ricardo Dominguez-Olmedo, Florian E. Dorner, Moritz…
GLBench: A Comprehensive Benchmark for Graph with Large Language Modelsby Yuhan Li, Peisong Wang, Xiao…
MAN TruckScenes: A multimodal dataset for autonomous trucking in diverse conditionsby Felix Fent, Fabian Kuttenreich,…
Rigorous Probabilistic Guarantees for Robust Counterfactual Explanationsby Luca Marzari, Francesco Leofante, Ferdinando Cicalese, Alessandro FarinelliFirst…
Fine-Grained Classification for Poisonous Fungi Identification with Transfer Learningby Christopher Chiu, Maximilian Heil, Teresa Kim,…
CHILLI: A data context-aware perturbation method for XAIby Saif Anwar, Nathan Griffiths, Abhir Bhalerao, Thomas…
Machine Unlearning for Medical Imagingby Reza Nasirigerdeh, Nader Razmi, Julia A. Schnabel, Daniel Rueckert, Georgios…
MLRS-PDS: A Meta-learning recommendation of dynamic ensemble selection pipelinesby Hesam Jalalian, Rafael M. O. CruzFirst…