Summary of Bayesian Online Natural Gradient (bong), by Matt Jones et al.
Bayesian Online Natural Gradient (BONG)by Matt Jones, Peter Chang, Kevin MurphyFirst submitted to arxiv on:…
Bayesian Online Natural Gradient (BONG)by Matt Jones, Peter Chang, Kevin MurphyFirst submitted to arxiv on:…
LLM as a Complementary Optimizer to Gradient Descent: A Case Study in Prompt Tuningby Zixian…
Preference Learning Algorithms Do Not Learn Preference Rankingsby Angelica Chen, Sadhika Malladi, Lily H. Zhang,…
Robust Preference Optimization through Reward Model Distillationby Adam Fisch, Jacob Eisenstein, Vicky Zayats, Alekh Agarwal,…
MGDA Converges under Generalized Smoothness, Provablyby Qi Zhang, Peiyao Xiao, Shaofeng Zou, Kaiyi JiFirst submitted…
Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming Databy Xuxing Chen, Abhishek Roy, Yifan…
The Data Minimization Principle in Machine Learningby Prakhar Ganesh, Cuong Tran, Reza Shokri, Ferdinando FiorettoFirst…
Decentralized Optimization in Time-Varying Networks with Arbitrary Delaysby Tomas Ortega, Hamid JafarkhaniFirst submitted to arxiv…
Robust Entropy Search for Safe Efficient Bayesian Optimizationby Dorina Weichert, Alexander Kister, Sebastian Houben, Patrick…
OMPO: A Unified Framework for RL under Policy and Dynamics Shiftsby Yu Luo, Tianying Ji,…