Summary of Orpo: Monolithic Preference Optimization Without Reference Model, by Jiwoo Hong et al.
ORPO: Monolithic Preference Optimization without Reference Modelby Jiwoo Hong, Noah Lee, James ThorneFirst submitted to…
ORPO: Monolithic Preference Optimization without Reference Modelby Jiwoo Hong, Noah Lee, James ThorneFirst submitted to…
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuningby Peiyuan Liu, Hang Guo, Tao…
Verification-Aided Learning of Neural Network Barrier Functions with Termination Guaranteesby Shaoru Chen, Lekan Molu, Mahyar…
Knowledge Graph Large Language Model (KG-LLM) for Link Predictionby Dong Shu, Tianle Chen, Mingyu Jin,…
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of…
One Category One Prompt: Dataset Distillation using Diffusion Modelsby Ali Abbasi, Ashkan Shahbazi, Hamed Pirsiavash,…
Calibrating Multi-modal Representations: A Pursuit of Group Robustness without Annotationsby Chenyu You, Yifei Min, Weicheng…
On the Generalization Ability of Unsupervised Pretrainingby Yuyang Deng, Junyuan Hong, Jiayu Zhou, Mehrdad MahdaviFirst…
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Databy Jialu Li, Jaemin Cho, Yi-Lin…
Thread Detection and Response Generation using Transformers with Prompt Optimisationby Kevin Joshua T, Arnav Agarwal,…