Summary of Accelerating Transformers with Spectrum-preserving Token Merging, by Hoai-chau Tran et al.
Accelerating Transformers with Spectrum-Preserving Token Mergingby Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen,…
Accelerating Transformers with Spectrum-Preserving Token Mergingby Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen,…
BDetCLIP: Multimodal Prompting Contrastive Test-Time Backdoor Detectionby Yuwei Niu, Shuo He, Qi Wei, Zongyu Wu,…
The Buffer Mechanism for Multi-Step Information Reasoning in Language Modelsby Zhiwei Wang, Yunji Wang, Zhongwang…
Quantifying the Gain in Weak-to-Strong Generalizationby Moses Charikar, Chirag Pabbaraju, Kirankumar ShiragurFirst submitted to arxiv…
AnalogCoder: Analog Circuit Design via Training-Free Code Generationby Yao Lai, Sungyoung Lee, Guojin Chen, Souradip…
Parameter-free Clipped Gradient Descent Meets Polyakby Yuki Takezawa, Han Bao, Ryoma Sato, Kenta Niwa, Makoto…
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Stepby Yuntian Deng,…
Not All Language Model Features Are One-Dimensionally Linearby Joshua Engels, Eric J. Michaud, Isaac Liao,…
AGILE: A Novel Reinforcement Learning Framework of LLM Agentsby Peiyuan Feng, Yichen He, Guanhua Huang,…
Evaluating Large Language Models for Public Health Classification and Extraction Tasksby Joshua Harris, Timothy Laurence,…