Summary of ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance, by Liwen Sun et al.
ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance by Liwen Sun, Abhineet Agarwal,…
Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop…
Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space by Sindre…
Align Your Intents: Offline Imitation Learning via Optimal Transport by Maksim Bobrin, Nazar Buzun, Dmitrii Krylov,…
Skill or Luck? Return Decomposition via Advantage Functions by Hsiao-Ru Pan, Bernhard Schölkopf. First submitted to arXiv…
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning by Junyan Liu, Yunfan Li, Ruosong Wang, Lin…
Offline Multi-task Transfer RL with Representational Penalization by Avinandan Bose, Simon Shaolei Du, Maryam Fazel. First submitted…
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs by Runlong Zhou, Simon S. Du, Beibin Li. First submitted…
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies by Xiangyu Liu, Chenghao Deng,…
In value-based deep reinforcement learning, a pruned network is a good network by Johan Obando-Ceron, Aaron…