Artificial intelligence – Page 231

July 13, 2025

ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning by Xiaodong Yu, Ben Zhou, Hao Cheng,…

July 13, 2025

A Counterexample in Cross-Correlation Template Matching by Serap A. Savari. First submitted to arXiv on: 24 Oct…

July 13, 2025

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks by Lawrence Jang, Yinheng Li,…

July 13, 2025

A framework for GNSS-based solutions performance analysis in an ERTMS context by Juliette Marais, Quentin Mayolle,…

July 13, 2025

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent by Zhiwei Liu, Weiran Yao, Jianguo Zhang,…

July 13, 2025

LOGO – Long cOntext aliGnment via efficient preference Optimization by Zecheng Tang, Zechen Sun, Juntao Li,…

July 13, 2025

On Explaining with Attention Matrices by Omar Naim, Nicholas Asher. First submitted to arXiv on: 24 Oct…

July 13, 2025

Bielik 7B v0.1: A Polish Language Model – Development, Insights, and Evaluation by Krzysztof Ociepa, Łukasz…

July 13, 2025

Explainable News Summarization – Analysis and mitigation of Disagreement Problem by Seema Aswani, Sujala D. Shetty. First…

July 13, 2025

SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning by Shivam Adarsh, Kumar Shridhar, Caglar Gulcehre, Nicholas…