Summary of Fake News Detection and Manipulation Reasoning via Large Vision-Language Models, by Ruihan Jin et al.
Fake News Detection and Manipulation Reasoning via Large Vision-Language Models by Ruihan Jin, Ruibo Fu, Zhengqi…
ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities by Chenming Zhu, Tai Wang, Wenwei Zhang, Kai…
Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving by Ran Tian,…
DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems by Kexiong Yu, Hang Zhao, Yuhang Huang,…
Few-Shot Medical Image Segmentation with High-Fidelity Prototypes by Song Tang, Shaxu Yan, Xiaozhi Qi, Jianxin Gao,…
MammothModa: Multi-Modal Large Language Model by Qi She, Junwen Pan, Xin Wan, Rui Zhang, Dawei Lu,…
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment by Jun Fu,…
Unveiling the Impact of Multi-Modal Interactions on User Engagement: A Comprehensive Evaluation in AI-driven Conversations by…
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models by…
Heterogeneous Graph Neural Networks with Post-hoc Explanations for Multi-modal and Explainable Land Use Inference by Xuehao…