Summary of Algoformer: An Efficient Transformer Framework with Algorithmic Structures, by Yihang Gao et al.
AlgoFormer: An Efficient Transformer Framework with Algorithmic Structuresby Yihang Gao, Chuanyang Zheng, Enze Xie, Han…
AlgoFormer: An Efficient Transformer Framework with Algorithmic Structuresby Yihang Gao, Chuanyang Zheng, Enze Xie, Han…
Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformersby Joshua F. Cooper, Seung…
Transformer tricks: Precomputing the first layerby Nils GraefFirst submitted to arxiv on: 20 Feb 2024CategoriesMain:…
Conditional Logical Message Passing Transformer for Complex Query Answeringby Chongzhi Zhang, Zhiping Peng, Junhao Zheng,…
Backward Lens: Projecting Language Model Gradients into the Vocabulary Spaceby Shahar Katz, Yonatan Belinkov, Mor…
Chain of Thought Empowers Transformers to Solve Inherently Serial Problemsby Zhiyuan Li, Hong Liu, Denny…
An Equivariant Pretrained Transformer for Unified 3D Molecular Representation Learningby Rui Jiao, Xiangzhe Kong, Li…
Beyond Uniform Scaling: Exploring Depth Heterogeneity in Neural Architecturesby Akash Guna R.T, Arnav Chavan, Deepak…
Induced Model Matching: How Restricted Models Can Help Larger Onesby Usama Muneeb, Mesrob I. OhannessianFirst…
Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physicsby Siqi Miao, Zhiyuan Lu, Mia…