Summary of ThinK: Thinner Key Cache by Query-Driven Pruning, by Yuhui Xu et al.
ThinK: Thinner Key Cache by Query-Driven Pruning by Yuhui Xu, Zhanming Jie, Hanze Dong, Lei Wang,…
Evaluating Large Language Models for automatic analysis of teacher simulations by David de-Fitero-Dominguez, Mariano Albaladejo-González, Antonio…
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge by Tianhao Wu, Weizhe Yuan, Olga Golovneva, Jing Xu,…
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic by Fakhraddin Alwajih, Gagan Bhatia, Muhammad Abdul-Mageed. First…
FLRT: Fluent Student-Teacher Redteaming by T. Ben Thompson, Michael Sklar. First submitted to arxiv on: 24 Jul…
Odyssey: Empowering Minecraft Agents with Open-World Skills by Shunyu Liu, Yaoru Li, Kongcheng Zhang, Zhenyu Cui,…
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? by Nemika Tyagi, Mihir Parmar, Mohith…
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression by Daniel Goldstein, Fares…
The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for…
MUSCLE: A Model Update Strategy for Compatible LLM Evolution by Jessica Echterhoff, Fartash Faghri, Raviteja Vemulapalli,…