Summary of Optimized Multi-Token Joint Decoding with Auxiliary Model for LLM Inference, by Zongyue Qin et al.
Optimized Multi-Token Joint Decoding with Auxiliary Model for LLM Inference by Zongyue Qin, Ziniu Hu, Zifan…
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation by Liqun Ma, Mingjie Sun,…
B’MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory by Luca Zancato,…
Just read twice: closing the recall gap for recurrent language models by Simran Arora, Aman Timalsina,…
SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking by Xingrun Xing,…
Learning to (Learn at Test Time): RNNs with Expressive Hidden States by Yu Sun, Xinhao Li,…
GPTQT: Quantize Large Language Models Twice to Push the Efficiency by Yipin Guo, Yilin Lang, Qinyuan…
LLMs Plagiarize: Ensuring Responsible Sourcing of Large Language Model Training Data Through Knowledge Graph Comparison by…
Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? by Wataru Hashimoto, Hidetaka…
Deep Image-to-Recipe Translation by Jiangqin Ma, Bilal Mawji, Franz Williams. First submitted to arXiv on: 1 Jul…