Summary of Exploring and Addressing Reward Confusion in Offline Preference Learning, by Xin Chen et al.
Exploring and Addressing Reward Confusion in Offline Preference Learningby Xin Chen, Sam Toyer, Florian ShkurtiFirst…
Exploring and Addressing Reward Confusion in Offline Preference Learningby Xin Chen, Sam Toyer, Florian ShkurtiFirst…
Transformer-based Capacity Prediction for Lithium-ion Batteries with Data Augmentationby Gift Modekwe, Saif Al-Wahaibi, Qiugang LuFirst…
Enhancing Temporal Understanding in LLMs for Semi-structured Tablesby Irwin Deng, Kushagra Dixit, Vivek Gupta, Dan…
STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replayby Yongcan Yu, Lijun Sheng, Ran He, Jian…
Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labelsby Zhuorui Ye, Stephanie Milani, Geoffrey…
Robust Mixture Learning when Outliers Overwhelm Small Groupsby Daniil Dmitriev, Rares-Darius Buhai, Stefan Tiegel, Alexander…
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budgetby Vikash Sehwag, Xianghao Kong, Jingtao…
CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learningby Emanuele Frascaroli, Aniello Panariello,…
Perceptions of Linguistic Uncertainty by Language Models and Humansby Catarina G Belem, Markelle Kelly, Mark…
HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioningby Eugene Valassakis, Guillermo Garcia-HernandoFirst submitted to…