Grounding – Page 9 – GrooveSquid.com

July 13, 2025

OLIVE: Object Level In-Context Visual Embeddings by Timothy Ossowski, Junjie Hu. First submitted to arXiv on: 2…

July 13, 2025

Artemis: Towards Referential Understanding in Complex Videos by Jihao Qiu, Yuan Zhang, Xi Tang, Lingxi Xie,…

July 13, 2025

Don’t Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models by A. Bavaresco, A.…

July 13, 2025

LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding by Haoyu Zhao, Wenhang…

July 13, 2025

V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM by Abdur Rahman, Rajat…

July 13, 2025

Finite Groundings for ASP with Functions: A Journey through Consistency by Lukas Gerlach, David Carral, Markus…

July 13, 2025

Creativity and Markov Decision Processes by Joonas Lahikainen, Nadia M. Ady, Christian Guckelsberger. First submitted to arXiv…

July 13, 2025

WorldAfford: Affordance Grounding based on Natural Language Instructions by Changmao Chen, Yuren Cong, Zhen Kan. First submitted…

July 13, 2025

Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery by…

July 13, 2025

Prompt When the Animal is: Temporal Animal Behavior Grounding with Positional Recovery Training by Sheng Yan,…