Summary of Optimization Without Retraction on the Random Generalized Stiefel Manifold, by Simon Vary et al.
Optimization without Retraction on the Random Generalized Stiefel Manifoldby Simon Vary, Pierre Ablin, Bin Gao,…
Optimization without Retraction on the Random Generalized Stiefel Manifoldby Simon Vary, Pierre Ablin, Bin Gao,…
Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Riskby Xinyi Ni, Lifeng LaiFirst submitted to arxiv on:…
Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimizationby Sam Reifenstein, Timothee Leleu, Yoshihisa YamamotoFirst submitted to…
Multivariate Bayesian Last Layer for Regression: Uncertainty Quantification and Disentanglementby Han Wang, Eiji Kawasaki, Guillaume…
Non-linear Welfare-Aware Strategic Learningby Tian Xie, Xueru ZhangFirst submitted to arxiv on: 3 May 2024CategoriesMain:…
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulationby Shangding…
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignmentby Gerald Shen, Zhilin Wang, Olivier Delalleau, Jiaqi Zeng,…
Common pitfalls to avoid while using multiobjective optimization in machine learningby Junaid Akhter, Paul David…
Customizing Text-to-Image Models with a Single Image Pairby Maxwell Jones, Sheng-Yu Wang, Nupur Kumari, David…
Boosting Jailbreak Attack with Momentumby Yihao Zhang, Zeming WeiFirst submitted to arxiv on: 2 May…