Grounding – Page 5 – GrooveSquid.com

Loading Now

July 13, 2025

Summary of Grounding 3d Scene Affordance From Egocentric Interactions, by Cuiyu Liu et al.

Grounding 3D Scene Affordance From Egocentric Interactionsby Cuiyu Liu, Wei Zhai, Yuhang Yang, Hongchen Luo,…

July 13, 2025

Summary of See Then Tell: Enhancing Key Information Extraction with Vision Grounding, by Shuhang Liu et al.

See then Tell: Enhancing Key Information Extraction with Vision Groundingby Shuhang Liu, Zhenrong Zhang, Pengfei…

July 13, 2025

Summary of Simvg: a Simple Framework For Visual Grounding with Decoupled Multi-modal Fusion, by Ming Dai et al.

SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusionby Ming Dai, Lingfeng Yang,…

July 13, 2025

Summary of Ltntorch: Pytorch Implementation Of Logic Tensor Networks, by Tommaso Carraro et al.

LTNtorch: PyTorch Implementation of Logic Tensor Networksby Tommaso Carraro, Luciano Serafini, Fabio AiolliFirst submitted to…

July 13, 2025

Summary of Mapper: Multimodal Prior-guided Parameter Efficient Tuning For Referring Expression Comprehension, by Ting Liu et al.

MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehensionby Ting Liu, Zunnan Xu, Yue…

July 13, 2025

Summary of Multi-document Grounded Multi-turn Synthetic Dialog Generation, by Young-suk Lee et al.

Multi-Document Grounded Multi-Turn Synthetic Dialog Generationby Young-Suk Lee, Chulaka Gunasekara, Danish Contractor, Ramón Fernandez Astudillo,…

July 13, 2025

Summary of Question-answering Dense Video Events, by Hangyu Qin et al.

Question-Answering Dense Video Eventsby Hangyu Qin, Junbin Xiao, Angela YaoFirst submitted to arxiv on: 6…

July 13, 2025

Summary of From Grounding to Planning: Benchmarking Bottlenecks in Web Agents, by Segev Shlomov et al.

From Grounding to Planning: Benchmarking Bottlenecks in Web Agentsby Segev Shlomov, Ben wiesel, Aviad Sela,…

July 13, 2025

Summary of Improving Apple Object Detection with Occlusion-enhanced Distillation, by Liang Geng

Improving Apple Object Detection with Occlusion-Enhanced Distillationby Liang GengFirst submitted to arxiv on: 3 Sep…

July 13, 2025

Summary of Unlocking the Wisdom Of Large Language Models: An Introduction to the Path to Artificial General Intelligence, by Edward Y. Chang

Unlocking the Wisdom of Large Language Models: An Introduction to The Path to Artificial General…