Multi modal – Page 52 – GrooveSquid.com

July 13, 2025

PathoTune: Adapting Visual Foundation Model to Pathological Specialistsby Jiaxuan Lu, Fang Yan, Xiaofan Zhang, Yue…

July 13, 2025

Few-Shot Adversarial Prompt Learning on Vision-Language Modelsby Yiwei Zhou, Xiaobo Xia, Zhiwei Lin, Bo Han,…

July 13, 2025

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?by Renrui Zhang,…

July 13, 2025

Multi-Modal Hallucination Control by Visual Information Groundingby Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary,…

July 13, 2025

RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognitionby Ziyu Liu, Zeyi Sun, Yuhang Zang,…

July 13, 2025

Bilevel Hypergraph Networks for Multi-Modal Alzheimer’s Diagnosisby Angelica I. Aviles-Rivero, Chun-Wun Cheng, Zhongying Deng, Zoe…

July 13, 2025

From Explainable to Interpretable Deep Learning for Natural Language Processing in Healthcare: How Far from…

July 13, 2025

Just Say the Name: Online Continual Learning with Category Names Only via Data Generationby Minhyuk…

July 13, 2025

Improving Medical Multi-modal Contrastive Learning with Expert Annotationsby Yogesh Kumar, Pekka MarttinenFirst submitted to arxiv…

July 13, 2025

Functional Graph Convolutional Networks: A unified multi-task and multi-modal learning framework to facilitate health and…