Summary of Cross-architecture Transfer Learning For Linear-cost Inference Transformers, by Sehyun Choi
Cross-Architecture Transfer Learning for Linear-Cost Inference Transformersby Sehyun ChoiFirst submitted to arxiv on: 3 Apr…
Cross-Architecture Transfer Learning for Linear-Cost Inference Transformersby Sehyun ChoiFirst submitted to arxiv on: 3 Apr…
Mixture-of-Depths: Dynamically allocating compute in transformer-based language modelsby David Raposo, Sam Ritter, Blake Richards, Timothy…
EGTR: Extracting Graph from Transformer for Scene Graph Generationby Jinbae Im, JeongYeon Nam, Nokyung Park,…
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spacesby Toshihiro OtaFirst submitted to…
Fourier or Wavelet bases as counterpart self-attention in spikformer for efficient visual classificationby Qingyu Wang,…
Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidanceby Donghoon Ahn, Hyoungwon Cho, Jaewon Min, Wooseok Jang, Jungwoo…
GTC: GNN-Transformer Co-contrastive Learning for Self-supervised Heterogeneous Graph Representationby Yundong Sun, Dongjie Zhu, Yansong Wang,…
A task of anomaly detection for a smart satellite Internet of things systemby Zilong ShaoFirst…
Divide-Conquer Transformer Learning for Predicting Electric Vehicle Charging Events Using Smart Meter Databy Fucai Ke,…
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEsby Md Ashiqur Rahman, Robert Joseph George,…