Summary of Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments, by Han Zhou et al.
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments, by Han Zhou, Xingchen Wan, Yinhong Liu,…
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models, by Shuo…
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms, by Vaneet Aggarwal, Washim Uddin…
How Neural Networks Learn the Support is an Implicit Regularization Effect of SGD, by Pierfrancesco Beneventano,…
Active search for Bifurcations, by Yorgos M. Psarellis, Themistoklis P. Sapsis, Ioannis G. Kevrekidis. First submitted to…
Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework, by Siyuan Yu,…
Learning Iterative Reasoning through Energy Diffusion, by Yilun Du, Jiayuan Mao, Joshua B. Tenenbaum. First submitted to…
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning, by Utsav Singh, Souradip Chakraborty, Wesley…
Bayesian Intervention Optimization for Causal Discovery, by Yuxuan Wang, Mingzhou Liu, Xinwei Sun, Wei Wang, Yizhou…
UniZero: Generalized and Efficient Planning with Scalable Latent World Models, by Yuan Pu, Yazhe Niu, Zhenjie…