Summary of Transformer Block Coupling and Its Correlation with Generalization in Llms, by Murdock Aubry et al.
Transformer Block Coupling and its Correlation with Generalization in LLMsby Murdock Aubry, Haoming Meng, Anton…
Transformer Block Coupling and its Correlation with Generalization in LLMsby Murdock Aubry, Haoming Meng, Anton…
The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Othersby Daniel…
When to Accept Automated Predictions and When to Defer to Human Judgment?by Daniel Sikar, Artur…
Estimating the stability number of a random graph using convolutional neural networksby Randy DavilaFirst submitted…
Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformersby Cody Wild, Jesper AndersonFirst submitted to arxiv…
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Trainingby Sami Jaghouar, Jack Min Ong, Johannes…
FACTS About Building Retrieval Augmented Generation-based Chatbotsby Rama Akkiraju, Anbang Xu, Deepak Bora, Tan Yu,…
Dynamical Measure Transport and Neural PDE Solvers for Samplingby Jingtong Sun, Julius Berner, Lorenz Richter,…
Toto: Time Series Optimized Transformer for Observabilityby Ben Cohen, Emaad Khwaja, Kan Wang, Charles Masson,…
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimizationby Junkang Wu, Yuexiang Xie,…