Summary of On the Resurgence of Recurrent Models for Long Sequences – Survey and Research Opportunities in the Transformer Era, by Matteo Tiezzi et al.