Summary of Adaptive Circuit Behavior and Generalization in Mechanistic Interpretability, by Jatin Nainani et al.
Adaptive Circuit Behavior and Generalization in Mechanistic Interpretabilityby Jatin Nainani, Sankaran Vaidyanathan, AJ Yeung, Kartik…
Adaptive Circuit Behavior and Generalization in Mechanistic Interpretabilityby Jatin Nainani, Sankaran Vaidyanathan, AJ Yeung, Kartik…
MC-NEST – Enhancing Mathematical Reasoning in Large Language Models with a Monte Carlo Nash Equilibrium…
Improving Next Tokens via Second-to-Last Predictions with Generate and Refineby Johannes SchneiderFirst submitted to arxiv…
Development of Pre-Trained Transformer-based Models for the Nepali Languageby Prajwal Thapa, Jinu Nyachhyon, Mridul Sharma,…
LLM Online Spatial-temporal Signal Reconstruction Under Noiseby Yi Yan, Dayu Qin, Ercan Engin KuruogluFirst submitted…
Don’t Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation…
Understanding World or Predicting Future? A Comprehensive Survey of World Modelsby Jingtao Ding, Yunke Zhang,…
Testing Uncertainty of Large Language Models for Physics Knowledge and Reasoningby Elizaveta Reganova, Peter SteinbachFirst…
Evaluating the Robustness of Analogical Reasoning in Large Language Modelsby Martha Lewis, Melanie MitchellFirst submitted…
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMsby Akari Asai, Jacqueline He, Rulin Shao, Weijia Shi,…