Summary of Reasonagain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning, by Xiaodong Yu et al.
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoningby Xiaodong Yu, Ben Zhou, Hao Cheng,…
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoningby Xiaodong Yu, Ben Zhou, Hao Cheng,…
A Counterexample in Cross-Correlation Template Matchingby Serap A. SavariFirst submitted to arxiv on: 24 Oct…
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasksby Lawrence Jang, Yinheng Li,…
A framework for GNSS-based solutions performance analysis in an ERTMS contextby Juliette Marais, Quentin Mayolle,…
PRACT: Optimizing Principled Reasoning and Acting of LLM Agentby Zhiwei Liu, Weiran Yao, Jianguo Zhang,…
LOGO – Long cOntext aliGnment via efficient preference Optimizationby Zecheng Tang, Zechen Sun, Juntao Li,…
On Explaining with Attention Matricesby Omar Naim, Nicholas AsherFirst submitted to arxiv on: 24 Oct…
Bielik 7B v0.1: A Polish Language Model – Development, Insights, and Evaluationby Krzysztof Ociepa, Łukasz…
Explainable News Summarization – Analysis and mitigation of Disagreement Problemby Seema Aswani, Sujala D. ShettyFirst…
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoningby Shivam Adarsh, Kumar Shridhar, Caglar Gulcehre, Nicholas…