Summary of Some Best Practices in Operator Learning, by Dustin Enyeart and Guang Lin
Some Best Practices in Operator Learningby Dustin Enyeart, Guang LinFirst submitted to arxiv on: 9…
Some Best Practices in Operator Learningby Dustin Enyeart, Guang LinFirst submitted to arxiv on: 9…
BatchTopK Sparse Autoencodersby Bart Bussmann, Patrick Leask, Neel NandaFirst submitted to arxiv on: 9 Dec…
Nonmyopic Global Optimisation via Approximate Dynamic Programmingby Filippo Airaldi, Bart De Schutter, Azita DabiriFirst submitted…
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learningby Yiran Wang, Chenshu Liu, Yunfan Li, Sanae…
Hyperparameter Tuning Through Pessimistic Bilevel Optimizationby Meltem Apaydin Ustun, Liang Xu, Bo Zeng, Xiaoning QianFirst…
Beyond algorithm hyperparameters: on preprocessing hyperparameters and associated pitfalls in machine learning applicationsby Christina Sauer,…
Scaling Law for Language Models Training Considering Batch Sizeby Xian Shuai, Yiding Wang, Yimeng Wu,…
Explainable fault and severity classification for rolling element bearings using Kolmogorov-Arnold networksby Spyros Rigas, Michalis…
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuningby Kaustubh Ponkshe,…
Exponential Moving Average of Weights in Deep Learning: Dynamics and Benefitsby Daniel Morales-Brotons, Thijs Vogels,…