Summary of Simplicity in Complexity : Explaining Visual Complexity Using Deep Segmentation Models, by Tingke Shen et al.
Simplicity in Complexity : Explaining Visual Complexity using Deep Segmentation Modelsby Tingke Shen, Surabhi S…
Simplicity in Complexity : Explaining Visual Complexity using Deep Segmentation Modelsby Tingke Shen, Surabhi S…
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPTby Yifang Xu, Yunzhuo Sun, Zien Xie, Benxiang…
Adversarial Testing for Visual Grounding via Image-Aware Property Reductionby Zhiyuan Chang, Mingyang Li, Junjie Wang,…
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignmentby Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding,…
How to Understand “Support”? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Groundingby Jiamin Luo,…
GROUNDHOG: Grounding Large Language Models to Holistic Segmentationby Yichi Zhang, Ziqiao Ma, Xiaofeng Gao, Suhaila…
Grounding from an AI and Cognitive Science Lensby Goonmeet Bajaj, Srinivasan Parthasarathy, Valerie L. Shalin,…
The Revolution of Multimodal Large Language Models: A Surveyby Davide Caffagni, Federico Cocchi, Luca Barsellotti,…
“Understanding AI”: Semantic Grounding in Large Language Modelsby Holger LyreFirst submitted to arxiv on: 16…
Grounding Language Model with Chunking-Free In-Context Retrievalby Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou,…