Summary of Token Turing Machines Are Efficient Vision Models, by Purvish Jajal et al.
Token Turing Machines are Efficient Vision Modelsby Purvish Jajal, Nick John Eliopoulos, Benjamin Shiue-Hal Chou,…
Token Turing Machines are Efficient Vision Modelsby Purvish Jajal, Nick John Eliopoulos, Benjamin Shiue-Hal Chou,…
Representation Tuningby Christopher M. AckermanFirst submitted to arxiv on: 11 Sep 2024CategoriesMain: Machine Learning (cs.LG)Secondary:…
Understanding Knowledge Drift in LLMs through Misinformationby Alina Fastowski, Gjergji KasneciFirst submitted to arxiv on:…
Alleviating Hallucinations in Large Language Models with Scepticism Modelingby Yetao Wu, Yihong Wang, Teng Chen,…
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Modelsby Maryam Akhavan Aghdam, Hongpeng Jin, Yanzhao WuFirst…
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generationby Yecheng Wu, Zhuoyang Zhang, Junyu…
Residual Stream Analysis with Multi-Layer SAEsby Tim Lawson, Lucy Farnik, Conor Houghton, Laurence AitchisonFirst submitted…
Preserving Empirical Probabilities in BERT for Small-sample Clinical Entity Recognitionby Abdul Rehman, Jian Jun Zhang,…
Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Samplingby Kaiwen Zheng,…
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMsby Ruoyu Wang, Xiaoxuan Li, Lina YaoFirst…