Grounding – Page 13 – GrooveSquid.com

July 13, 2025

Summary of ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling, by Siming Yan et al.

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling by Siming Yan,…

July 13, 2025

Summary of VIALM: A Survey and Benchmark of Visually Impaired Assistance with Large Models, by Yi Zhao et al.

VIALM: A Survey and Benchmark of Visually Impaired Assistance with Large Models by Yi Zhao, Yilin…

July 13, 2025

Summary of SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling, by Eileen Wang et al.

SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling by Eileen Wang, Soyeon Caren Han, Josiah Poon. First submitted…

July 13, 2025

Summary of A Decision Theoretic Framework for Measuring AI Reliance, by Ziyang Guo et al.

A Decision Theoretic Framework for Measuring AI Reliance by Ziyang Guo, Yifan Wu, Jason Hartline, Jessica…

July 13, 2025

Summary of LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering, by Yuhan Chen et al.

LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering by Yuhan Chen, Lumei Su, Lihua…

July 13, 2025

Summary of KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning, by Debjyoti Mondal et al.

KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning by Debjyoti Mondal, Suraj Modi, Subhadarshi Panda, Rituraj Singh, Godawari…

July 13, 2025

Summary of Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering, by Haibo Wang et al.

Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering by Haibo Wang,…

July 13, 2025

Summary of When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment, by Minrui Xu et al.

When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment by Minrui Xu, Dusit…

July 13, 2025

Summary of Interacted Object Grounding in Spatio-Temporal Human-Object Interactions, by Xiaoyang Liu et al.

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions by Xiaoyang Liu, Boran Wen, Xinpeng Liu, Zizheng Zhou,…

July 13, 2025

Summary of PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World, by Yanheng He et al.

PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World by Yanheng…