Summary of Jacobian Descent For Multi-objective Optimization, by Pierre Quinton et al.
Jacobian Descent for Multi-Objective Optimizationby Pierre Quinton, ValĂ©rian ReyFirst submitted to arxiv on: 23 Jun…
Jacobian Descent for Multi-Objective Optimizationby Pierre Quinton, ValĂ©rian ReyFirst submitted to arxiv on: 23 Jun…
Gradual Divergence for Seamless Adaptation: A Novel Domain Incremental Learning Methodby Kishaan Jeeveswaran, Elahe Arani,…
Preference Tuning For Toxicity Mitigation Generalizes Across Languagesby Xiaochen Li, Zheng-Xin Yong, Stephen H. BachFirst…
Position: Benchmarking is Limited in Reinforcement Learning Researchby Scott M. Jordan, Adam White, Bruno Castro…
An Optimal Tightness Bound for the Simulation Lemmaby Sam Lobel, Ronald ParrFirst submitted to arxiv…
Confidence Regulation Neurons in Language Modelsby Alessandro Stolfo, Ben Wu, Wes Gurnee, Yonatan Belinkov, Xingyi…
Graph-Augmented LLMs for Personalized Health Insights: A Case Study in Sleep Analysisby Ajan Subramanian, Zhongqi…
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagationby Yuchen Yang, Yingdong Shi, Cheems Wang,…
Uncertainty-Aware Reward-Free Exploration with General Function Approximationby Junkai Zhang, Weitong Zhang, Dongruo Zhou, Quanquan GuFirst…
Towards Scalable Exact Machine Unlearning Using Parameter-Efficient Fine-Tuningby Somnath Basu Roy Chowdhury, Krzysztof Choromanski, Arijit…