Summary of Bayesian WeakS-to-Strong from Text Classification to Generation, by Ziyun Cui et al.
Bayesian WeakS-to-Strong from Text Classification to Generation by Ziyun Cui, Ziyang Zhang, Guangzhi Sun, Wen Wu,…
Prediction-powered Generalization of Causal Inferences by Ilker Demirel, Ahmed Alaa, Anthony Philippakis, David Sontag. First submitted to…
Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities by Wenyue Hua, Kaijie…
Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks by Hojoon Lee,…
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks by Tianyu…
DNCs Require More Planning Steps by Yara Shamshoum, Nitzan Hodos, Yuval Sieradzki, Assaf Schuster. First submitted to…
On the Limitations of Fractal Dimension as a Measure of Generalization by Charlie B. Tan, Inés…
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional…
Verifying the Generalization of Deep Learning to Out-of-Distribution Domains by Guy Amir, Osher Maayan, Tom Zelazny,…
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models by Marianna…