Bounding box – Page 2 – GrooveSquid.com

July 13, 2025

A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language…

July 13, 2025

Artemis: Towards Referential Understanding in Complex Videosby Jihao Qiu, Yuan Zhang, Xi Tang, Lingxi Xie,…

July 13, 2025

AUG: A New Dataset and An Efficient Model for Aerial Image Urban Scene Graph Generationby…

July 13, 2025

Towards Two-Stream Foveation-based Active Vision Learningby Timur Ibrayev, Amitangshu Mukherjee, Sai Aparna Aketi, Kaushik RoyFirst…

July 13, 2025

Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representationsby Bhishma Dedhia, Niraj K. JhaFirst…

July 13, 2025

Theoretically Achieving Continuous Representation of Oriented Bounding Boxesby Zi-Kai Xiao, Guo-Ye Yang, Xue Yang, Tai-Jiang…

July 13, 2025

Jacquard V2: Refining Datasets using the Human In the Loop Data Correction Methodby Qiuhao Li,…

July 13, 2025

Boximator: Generating Rich and Controllable Motions for Video Synthesisby Jiawei Wang, Yuchen Zhang, Jiaxin Zou,…

July 13, 2025

Improving the Detection of Small Oriented Objects in Aerial Imagesby Chandler Timm C. Doloriel, Rhandley…

July 13, 2025

Improving Generalization Performance of YOLOv8 for Camera Trap Object Detectionby Aroj SubediFirst submitted to arxiv…