Summary of Grokformer: Graph Fourier Kolmogorov-arnold Transformers, by Guoguo Ai et al.
GrokFormer: Graph Fourier Kolmogorov-Arnold Transformersby Guoguo Ai, Guansong Pang, Hezhe Qiao, Yuan Gao, Hui YanFirst…
GrokFormer: Graph Fourier Kolmogorov-Arnold Transformersby Guoguo Ai, Guansong Pang, Hezhe Qiao, Yuan Gao, Hui YanFirst…
Star Attention: Efficient LLM Inference over Long Sequencesby Shantanu Acharya, Fei Jia, Boris GinsburgFirst submitted…
Fundamental Limits of Prompt Tuning Transformers: Universality, Capacity and Efficiencyby Jerry Yao-Chieh Hu, Wei-Po Wang,…
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillationby Ke Zhao,…
FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Accelerationby…
Investigating Graph Neural Networks and Classical Feature-Extraction Techniques in Activity-Cliff and Molecular Property Predictionby Markus…
Transformers with Sparse Attention for Granger Causalityby Riya Mahesh, Rahul Vashisht, Chandrashekar LakshminarayananFirst submitted to…
Selective Attention: Enhancing Transformer through Principled Context Controlby Xuechen Zhang, Xiangyu Chang, Mingchen Li, Amit…
ST-Tree with Interpretability for Multivariate Time Series Classificationby Mingsen Du, Yanxuan Wei, Yingxia Tang, Xiangwei…
Distributed solar generation forecasting using attention-based deep neural networks for cloud movement predictionby Maneesha Perera,…