Summary of PathoTune: Adapting Visual Foundation Model to Pathological Specialists, by Jiaxuan Lu et al.
PathoTune: Adapting Visual Foundation Model to Pathological Specialists by Jiaxuan Lu, Fang Yan, Xiaofan Zhang, Yue…
Few-Shot Adversarial Prompt Learning on Vision-Language Models by Yiwei Zhou, Xiaobo Xia, Zhiwei Lin, Bo Han,…
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? by Renrui Zhang,…
Multi-Modal Hallucination Control by Visual Information Grounding by Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary,…
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition by Ziyu Liu, Zeyi Sun, Yuhang Zang,…
Bilevel Hypergraph Networks for Multi-Modal Alzheimer’s Diagnosis by Angelica I. Aviles-Rivero, Chun-Wun Cheng, Zhongying Deng, Zoe…
From Explainable to Interpretable Deep Learning for Natural Language Processing in Healthcare: How Far from…
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation by Minhyuk…
Improving Medical Multi-modal Contrastive Learning with Expert Annotations by Yogesh Kumar, Pekka Marttinen. First submitted to arXiv…
Functional Graph Convolutional Networks: A unified multi-task and multi-modal learning framework to facilitate health and…