Summary of Elliptical Attention, by Stefan K. Nielsen et al.
Elliptical Attentionby Stefan K. Nielsen, Laziz U. Abdullaev, Rachel S.Y. Teo, Tan M. NguyenFirst submitted…
Elliptical Attentionby Stefan K. Nielsen, Laziz U. Abdullaev, Rachel S.Y. Teo, Tan M. NguyenFirst submitted…
In-Context In-Context Learning with Transformer Neural Processesby Matthew Ashman, Cristiana Diaconu, Adrian Weller, Richard E.…
VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Modelsby Haowen Hou, Peigen Zeng, Fei Ma,…
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Modelsby Yili Wang, Kaixiong Zhou, Ninghao Liu, Ying…
M3T: Multi-Modal Medical Transformer to bridge Clinical Context with Visual Insights for Retinal Image Medical…
PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Modelby Sajib Acharjee Dip,…
How Out-of-Distribution Detection Learning Theory Enhances Transformer: Learnability and Reliabilityby Yijin Zhou, Yutang Ge, Xiaowen…
Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Modelsby Hengyi Wang, Shiwei Tan, Hao…
Translation Equivariant Transformer Neural Processesby Matthew Ashman, Cristiana Diaconu, Junhyuck Kim, Lakee Sivaraya, Stratis Markou,…
Exploring the Impact of a Transformer’s Latent Space Geometry on Downstream Task Performanceby Anna C.…