Summary of Pareto Inverse Reinforcement Learning For Diverse Expert Policy Generation, by Woo Kyung Kim and Minjong Yoo and Honguk Woo
Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generationby Woo Kyung Kim, Minjong Yoo, Honguk…
Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generationby Woo Kyung Kim, Minjong Yoo, Honguk…
Pareto Merging: Multi-Objective Optimization for Preference-Aware Model Mergingby Weiyu Chen, James KwokFirst submitted to arxiv…
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewardsby Shresth Verma, Niclas Boehmer, Lingkai Kong,…
Critique-out-Loud Reward Modelsby Zachary Ankner, Mansheej Paul, Brandon Cui, Jonathan D. Chang, Prithviraj AmmanabroluFirst submitted…
Approaching Deep Learning through the Spectral Dynamics of Weightsby David Yunis, Kumar Kshitij Patel, Samuel…
LLM Pruning and Distillation in Practice: The Minitron Approachby Sharath Turuvekere Sreenivas, Saurav Muralidharan, Raviraj…
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstractionby Anthony GX-Chen, Kenneth Marino,…
FAKER: Full-body Anonymization with Human Keypoint Extraction for Real-time Video Deidentificationby Byunghyun Ban, Hyoseok LeeFirst…
Fast Training Dataset Attribution via In-Context Learningby Milad Fotouhi, Mohammad Taha Bahadori, Oluwaseyi Feyisetan, Payman…
MicroXercise: A Micro-Level Comparative and Explainable System for Remote Physical Therapyby Hanchen David Wang, Nibraas…