Summary of Near-Optimal Algorithm for Non-Stationary Kernelized Bandits, by Shogo Iwazaki and Shion Takeno
Near-Optimal Algorithm for Non-Stationary Kernelized Bandits, by Shogo Iwazaki, Shion Takeno. First submitted to arXiv on: 21…
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics, by Thomas Robert, Mher Safaryan, Ionut-Vlad Modoranu, Dan Alistarh. First…
Enabling Asymmetric Knowledge Transfer in Multi-Task Learning with Self-Auxiliaries, by Olivier Graffeuille, Yun Sing Koh, Joerg…
Karush-Kuhn-Tucker Condition-Trained Neural Networks (KKT Nets), by Shreya Arvind, Rishabh Pomaje, Rajshekhar V Bhat. First submitted to…
Offline reinforcement learning for job-shop scheduling problems, by Imanol Echeverria, Maialen Murua, Roberto Santana. First submitted to…
Traffic Matrix Estimation based on Denoising Diffusion Probabilistic Model, by Xinyu Yuan, Yan Qiao, Pei Zhao,…
S-CFE: Simple Counterfactual Explanations, by Shpresim Sadiku, Moritz Wagner, Sai Ganesh Nagarajan, Sebastian Pokutta. First submitted to…
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications, by Wenyi Xiao, Zechuan…
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates, by Shicheng Liu, Minghui Zhu. First…
Improving Parallel Program Performance with LLM Optimizers via Agent-System Interface, by Anjiang Wei, Allen Nie, Thiago…