Summary of Scaling Laws For Reward Model Overoptimization in Direct Alignment Algorithms, by Rafael Rafailov et al.
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithmsby Rafael Rafailov, Yaswanth Chittepu, Ryan…
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithmsby Rafael Rafailov, Yaswanth Chittepu, Ryan…
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsityby Wentao Guo, Jikai Long, Yimeng Zeng, Zirui Liu,…
Quantifying Task Priority for Multi-Task Optimizationby Wooseong Jeong, Kuk-Jin YoonFirst submitted to arxiv on: 5…
Adaptive Preference Scaling for Reinforcement Learning with Human Feedbackby Ilgee Hong, Zichong Li, Alexander Bukharin,…
Cyclic Sparse Training: Is it Enough?by Advait Gadhikar, Sree Harsha Nelaturu, Rebekka BurkholzFirst submitted to…
Building Socially-Equitable Public Modelsby Yejia Liu, Jianyi Yang, Pengfei Li, Tongxin Li, Shaolei RenFirst submitted…
Randomized Geometric Algebra Methods for Convex Neural Networksby Yifei Wang, Sungyoon Kim, Paul Chu, Indu…
You Only Accept Samples Once: Fast, Self-Correcting Stochastic Variational Inferenceby Dominic B. DaytaFirst submitted to…
Combinatorial Optimization with Automated Graph Neural Networksby Yang Liu, Peng Zhang, Yang Gao, Chuan Zhou,…
RoutePlacer: An End-to-End Routability-Aware Placer with Graph Neural Networkby Yunbo Hou, Haoran Ye, Yingxue Zhang,…