Summary of LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference, by Qichen Fu et al.
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference, by Qichen Fu, Minsik Cho, Thomas…
Forecasting GPU Performance for Deep Learning Training and Inference, by Seonho Lee, Amar Phanishayee, Divya Mahajan. First…
MeshFeat: Multi-Resolution Features for Neural Fields on Meshes, by Mihir Mahajan, Florian Hofherr, Daniel Cremers. First submitted…
Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning, by Ans Munir, Faisal Z. Qureshi,…
Mixture of Experts based Multi-task Supervise Learning from Crowds, by Tao Han, Huaixuan Shi, Xinyi Ding,…
Improving Out-of-Distribution Generalization of Trajectory Prediction for Autonomous Driving via Polynomial Representations, by Yue Yao, Shengchao…
A light-weight and efficient punctuation and word casing prediction model for on-device streaming ASR, by Jian…
Out-of-Distribution Detection through Soft Clustering with Non-Negative Kernel Regression, by Aryan Gulati, Xingjian Dong, Carlos Hurtado,…
A Resolution Independent Neural Operator, by Bahador Bahmani, Somdatta Goswami, Ioannis G. Kevrekidis, Michael D. Shields. First…
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore, by Rulin Shao, Jacqueline He, Akari Asai, Weijia…