LLaMA – Page 44 – GrooveSquid.com

July 13, 2025

Unified Lexical Representation for Interpretable Visual-Language Alignmentby Yifan Li, Yikai Wang, Yanwei Fu, Dongyu Ru,…

July 13, 2025

Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balanceby Ao Shen, Qiang…

July 13, 2025

Lawma: The Power of Specialization for Legal Tasksby Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe, Stefan…

July 13, 2025

A deeper look at depth pruning of LLMsby Shoaib Ahmed Siddiqui, Xin Dong, Greg Heinrich,…

July 13, 2025

Attention Is All You Need But You Don’t Need All Of It For Inference of…

July 13, 2025

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilitiesby Peng…

July 13, 2025

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inferenceby Qichen Fu, Minsik Cho, Thomas…

July 13, 2025

MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMsby Quang H. Nguyen, Duy C.…

July 13, 2025

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Togetherby Dilara Soylu, Christopher Potts,…

July 13, 2025

Flash normalization: fast RMSNorm for LLMsby Nils Graef, Matthew Clapp, Andrew WasielewskiFirst submitted to arxiv…