Summary of Large Language Models Can Self-improve at Web Agent Tasks, by Ajay Patel et al.
Large Language Models Can Self-Improve At Web Agent Tasks by Ajay Patel, Markus Hofmarcher, Claudiu Leoveanu-Condrei,…
Exploring Diffusion Models’ Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks by Xiaoyu…
MM-Lego: Modular Biomedical Multimodal Models with Minimal Fine-Tuning by Konstantin Hemker, Nikola Simidjievski, Mateja Jamnik. First submitted…
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads by…
Preference Alignment with Flow Matching by Minu Kim, Yongsik Lee, Sehyeok Kang, Jihwan Oh, Song Chong,…
Is In-Context Learning Sufficient for Instruction Following in LLMs? by Hao Zhao, Maksym Andriushchenko, Francesco Croce,…
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models by Masatoshi Uehara, Yulai…
Stress-Testing Capability Elicitation With Password-Locked Models by Ryan Greenblatt, Fabien Roger, Dmitrii Krasheninnikov, David Krueger. First submitted…
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation by Junjie Zhang, Chenjia Bai,…
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors by Vijay Lingam, Atula Tejaswi, Aditya Vavre, Aneesh Shetty, Gautham…