Summary of DAGER: Exact Gradient Inversion for Large Language Models, by Ivo Petrov, Dimitar I. Dimitrov, et al.
DAGER: Exact Gradient Inversion for Large Language Models, by Ivo Petrov, Dimitar I. Dimitrov, Maximilian Baader, …