Summary of Not All Heads Matter: a Head-level Kv Cache Compression Method with Integrated Retrieval and Reasoning, by Yu Fu et al.
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoningby…
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoningby…
Designing LLM-Agents with Personalities: A Psychometric Approachby Muhua Huang, Xijuan Zhang, Christopher Soto, James EvansFirst…
Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Modelby Reachsak…
Interleaving Text and Number Embeddings to Solve Mathemathics Problemsby Marvin Alberts, Gianmarco Gabrieli, Irina Espejo…
Engineering Trustworthy AI: A Developer Guide for Empirical Risk Minimizationby Diana Pfau, Alexander JungFirst submitted…
LArctan-SKAN: Simple and Efficient Single-Parameterized Kolmogorov-Arnold Networks using Learnable Trigonometric Functionby Zhijie Chen, Xinglin ZhangFirst…
Learning Neural Strategy-Proof Matching Mechanism from Examplesby Ryota Maruo, Koh Takeuchi, Hisashi KashimaFirst submitted to…
Investigating the Role of Prompting and External Tools in Hallucination Rates of Large Language Modelsby…
Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Modelsby Yige Li, Hanxun…
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Explorationby Hai Zhong, Xun…