Summary of Paligemma: a Versatile 3b Vlm For Transfer, by Lucas Beyer et al.
PaliGemma: A versatile 3B VLM for transferby Lucas Beyer, Andreas Steiner, AndrĂ© Susano Pinto, Alexander…
PaliGemma: A versatile 3B VLM for transferby Lucas Beyer, Andreas Steiner, AndrĂ© Susano Pinto, Alexander…
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimizationby Junkang Wu, Yuexiang Xie,…
Teaching Transformers Causal Reasoning through Axiomatic Trainingby Aniket Vashishtha, Abhinav Kumar, Abbavaram Gowtham Reddy, Vineeth…
Composable Interventions for Language Modelsby Arinbjorn Kolbeinsson, Kyle O'Brien, Tianjin Huang, Shanghua Gao, Shiwei Liu,…
Variational Best-of-N Alignmentby Afra Amini, Tim Vieira, Elliott Ash, Ryan CotterellFirst submitted to arxiv on:…
The Solution for the AIGC Inference Performance Optimization Competitionby Sishun Pan, Haonan Xu, Zhonghua Wan,…
PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Postsby Ana-Cristina Rogoz, Maria Ilinca…
Crafting Large Language Models for Enhanced Interpretabilityby Chung-En Sun, Tuomas Oikarinen, Tsui-Wei WengFirst submitted to…
AgentInstruct: Toward Generative Teaching with Agentic Flowsby Arindam Mitra, Luciano Del Corro, Guoqing Zheng, Shweti…
Efficient Training of Language Models with Compact and Consistent Next Token Distributionsby Ashutosh Sathe, Sunita…