Summary of A Dynamical Model Of Neural Scaling Laws, by Blake Bordelon et al.
A Dynamical Model of Neural Scaling Laws by Blake Bordelon, Alexander Atanasov, Cengiz Pehlevan. First submitted to…
Towards Trustable Language Models: Investigating Information Quality of Large Language Models by Rick Rejeleene, Xiaowei Xu,…
Scaling Laws for Forgetting When Fine-Tuning Large Language Models by Damjan Kalajdzievski. First submitted to arXiv on:…
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism by DeepSeek-AI, Xiao Bi, Deli Chen, Guanting Chen,…
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws by Nikhil Sardana, Jacob Portes, Sasha…