Summary of Fit: Flexible Vision Transformer For Diffusion Model, by Zeyu Lu et al.
FiT: Flexible Vision Transformer for Diffusion Modelby Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu,…
FiT: Flexible Vision Transformer for Diffusion Modelby Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu,…
Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPTby Zhengfu…
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Taskby Jannik Brinkmann,…
Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generationby Yuan Yuan, Chenyang Shao, Jingtao Ding, Depeng…
InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integrationby Fali Wang, Runxue…
A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of…
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chainsby Benjamin L. Edelman, Ezra Edelman,…
Measuring and Controlling Instruction (In)Stability in Language Model Dialogsby Kenneth Li, Tianle Liu, Naomi Bashkansky,…
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Missby Yuri…
An end-to-end attention-based approach for learning on graphsby David Buterez, Jon Paul Janet, Dino Oglic,…