Summary of PickLLM: Context-Aware RL-Assisted Large Language Model Routing, by Dimitrios Sikeridis et al.
PickLLM: Context-Aware RL-Assisted Large Language Model Routing by Dimitrios Sikeridis, Dennis Ramdass, Pranay Pareek. First submitted to…
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps by Linfeng Zhao, Lawson…
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation by Eliot Xing, Vernon Luk, Jean Oh. First submitted to…
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization by Bhavya Sukhija, Stelian Coros, Andreas…
Hierarchical Meta-Reinforcement Learning via Automated Macro-Action Discovery by Minjae Cho, Chuangchuang Sun. First submitted to arXiv on:…
AlphaZero Neural Scaling and Zipf’s Law: a Tale of Board Games and Power Laws by Oren…
Generalized Bayesian deep reinforcement learning by Shreya Sinha Roy, Richard G. Everitt, Christian P. Robert, Ritabrata…
MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning by Xing Lei, Xuetao Zhang,…
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement by Junjie…
Auto-bidding in real-time auctions via Oracle Imitation Learning (OIL) by Alberto Silvio Chiappa, Briti Gangopadhyay, Zhao…