Artificial intelligence – Page 672

July 13, 2025

MedCalc-Bench: Evaluating Large Language Models for Medical Calculationsby Nikhil Khandekar, Qiao Jin, Guangzhi Xiong, Soren…

July 13, 2025

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Modelby Yongting Zhang, Lu Chen,…

July 13, 2025

Grade Score: Quantifying LLM Performance in Option Selectionby Dmitri IourovitskiFirst submitted to arxiv on: 17…

July 13, 2025

WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying…

July 13, 2025

MEDeA: Multi-view Efficient Depth Adjustmentby Mikhail Artemyev, Anna Vorontsova, Anna Sokolova, Alexander LimonovFirst submitted to…

July 13, 2025

When Reasoning Meets Information Aggregation: A Case Study with Sports Narrativesby Yebowen Hu, Kaiqiang Song,…

July 13, 2025

Conformance Checking of Fuzzy Logs against Declarative Temporal Specificationsby Ivan Donadello, Paolo Felli, Craig Innes,…

July 13, 2025

Who’s asking? User personas and the mechanics of latent misalignmentby Asma Ghandeharioun, Ann Yuan, Marius…

July 13, 2025

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Featuresby…

July 13, 2025

IDs for AI Systemsby Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schroeder de…