Summary of Improving Line Search Methods For Large Scale Neural Network Training, by Philip Kenneweg et al.
Improving Line Search Methods for Large Scale Neural Network Trainingby Philip Kenneweg, Tristan Kenneweg, Barbara…
Improving Line Search Methods for Large Scale Neural Network Trainingby Philip Kenneweg, Tristan Kenneweg, Barbara…
DORE: A Dataset For Portuguese Definition Generationby Anna Beatriz Dimas Furtado, Tharindu Ranasinghe, Frédéric Blain,…
ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Cachingby Youpeng Zhao, Di Wu, Jun…
Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flowsby Shujian Zhang, Lemeng Wu, Chengyue…
NSINA: A News Corpus for Sinhalaby Hansi Hettiarachchi, Damith Premasiri, Lasitha Uyangodage, Tharindu RanasingheFirst submitted…
SMART: Automatically Scaling Down Language Models with Accuracy Guarantees for Reduced Processing Feesby Saehan Jo,…
Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Textsby Sai Ashish Somayajula, Youwei…
Are LLMs Good Cryptic Crossword Solvers?by Abdelrahman Sadallah, Daria Kotova, Ekaterina KochmarFirst submitted to arxiv…
From Explainable to Interpretable Deep Learning for Natural Language Processing in Healthcare: How Far from…
Energy-Based Models with Applications to Speech and Language Processingby Zhijian OuFirst submitted to arxiv on:…