Optimization – Page 308 – GrooveSquid.com

July 13, 2025

RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language…

July 13, 2025

Multi-Fidelity Methods for Optimization: A Surveyby Ke Li, Fan LiFirst submitted to arxiv on: 15…

July 13, 2025

PAL: Proxy-Guided Black-Box Attack on Large Language Modelsby Chawin Sitawarin, Norman Mu, David Wagner, Alexandre…

July 13, 2025

Efficient Prompt Optimization Through the Lens of Best Arm Identificationby Chengshuai Shi, Kun Yang, Zihan…

July 13, 2025

UMOEA/D: A Multiobjective Evolutionary Algorithm for Uniform Pareto Objectives based on Decompositionby Xiaoyuan Zhang, Xi…

July 13, 2025

Layerwise Proximal Replay: A Proximal Point Method for Online Continual Learningby Jason Yoo, Yunpeng Liu,…

July 13, 2025

Exact, Fast and Expressive Poisson Point Processes via Squared Neural Familiesby Russell Tsuchida, Cheng Soon…

July 13, 2025

Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorizationby Idan Attias, Gintare Karolina…

July 13, 2025

Loss Shaping Constraints for Long-Term Time Series Forecastingby Ignacio Hounie, Javier Porras-Valenzuela, Alejandro RibeiroFirst submitted…

July 13, 2025

Reinforcement Learning from Human Feedback with Active Queriesby Kaixuan Ji, Jiafan He, Quanquan GuFirst submitted…