Token – Page 62 – GrooveSquid.com

July 13, 2025

Transformer Normalisation Layers and the Independence of Semantic Subspaces by Stephen Menary, Samuel Kaski, Andre Freitas
First…

July 13, 2025

CaLMQA: Exploring culturally specific long-form question answering across 23 languages by Shane Arora, Marzena Karpinska, Hung-Ting…

July 13, 2025

Understanding and Mitigating Tokenization Bias in Language Models by Buu Phan, Marton Havasi, Matthew Muckley, Karen…

July 13, 2025

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models by Sean Welleck, Amanda Bertsch, Matthew…

July 13, 2025

Token-based Decision Criteria Are Suboptimal in In-context Learning by Hakaze Cho, Yoshihiro Sakai, Mariko Kato, Kenshiro…

July 13, 2025

Confidence Regulation Neurons in Language Models by Alessandro Stolfo, Ben Wu, Wes Gurnee, Yonatan Belinkov, Xingyi…

July 13, 2025

ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods by Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang,…

July 13, 2025

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs by Jannik Kossen, Jiatong Han, Muhammed…

July 13, 2025

SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention by Qianchao Zhu,…

July 13, 2025

Multi-View Empowered Structural Graph Wordification for Language Models by Zipeng Liu, Likang Wu, Ming He, Zhong…