Summary of How to Train Long-context Language Models (effectively), by Tianyu Gao et al.
How to Train Long-Context Language Models (Effectively)by Tianyu Gao, Alexander Wettig, Howard Yen, Danqi ChenFirst…
How to Train Long-Context Language Models (Effectively)by Tianyu Gao, Alexander Wettig, Howard Yen, Danqi ChenFirst…
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferencesby Genta Indra Winata, David Anugraha, Lucky…
SEAL: SEmantic-Augmented Imitation Learning via Language Modelby Chengyang Gu, Yuxin Pan, Haotian Bai, Hui Xiong,…
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networksby Siddharth Joshi, Jiayi…
Semi-Supervised Fine-Tuning of Vision Foundation Models with Content-Style Decompositionby Mariia Drozdova, Vitaliy Kinakh, Yury Belousov,…
Bayesian Binary Searchby Vikash Singh, Matthew Khanzadeh, Vincent Davis, Harrison Rush, Emanuele Rossi, Jesse Shrader,…
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Datasetby Xiao Wang,…
Machine Learning in Industrial Quality Control of Glass Bottle Printsby Maximilian Bundscherer, Thomas H. Schmitt,…
Calibrating Language Models with Adaptive Temperature Scalingby Johnathan Xie, Annie S. Chen, Yoonho Lee, Eric…
An Unbiased Risk Estimator for Partial Label Learning with Augmented Classesby Jiayu Hu, Senlin Shu,…