Artificial intelligence – Page 690

July 13, 2025

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithmsby Miaosen Zhang, Yixuan Wei,…

July 13, 2025

STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite…

July 13, 2025

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotationsby Ruiyuan Lyu, Tai Wang,…

July 13, 2025

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understandingby Fei Wang, Xingyu Fu, James Y. Huang,…

July 13, 2025

Pandora: Towards General World Model with Natural Language Actions and Video Statesby Jiannan Xiang, Guangyi…

July 13, 2025

Advancing High Resolution Vision-Language Models in Biomedicineby Zekai Chen, Arda Pekis, Kevin BrownFirst submitted to…

July 13, 2025

Updating CLIP to Prefer Descriptions Over Captionsby Amir Zur, Elisa Kreiss, Karel D'Oosterlinck, Christopher Potts,…

July 13, 2025

SViTT-Ego: A Sparse Video-Text Transformer for Egocentric Videoby Hector A. Valdez, Kyle Min, Subarna TripathiFirst…

July 13, 2025

GPT-ology, Computational Models, Silicon Sampling: How should we think about LLMs in Cognitive Science?by Desmond…

July 13, 2025

Talking Heads: Understanding Inter-layer Communication in Transformer Language Modelsby Jack Merullo, Carsten Eickhoff, Ellie PavlickFirst…