Summary of Llm Maybe Longlm: Self-extend Llm Context Window Without Tuning, by Hongye Jin et al.
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuningby Hongye Jin, Xiaotian Han, Jingfeng Yang,…
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuningby Hongye Jin, Xiaotian Han, Jingfeng Yang,…
SwapTransformer: highway overtaking tactical planner model via imitation learning on OSHA datasetby Alireza Shamsoshoara, Safin…
MSGNet: Learning Multi-Scale Inter-Series Correlations for Multivariate Time Series Forecastingby Wanlin Cai, Yuxuan Liang, Xianggen…
Detecting out-of-distribution text using topological features of transformer-based language modelsby Andres Pollano, Anupam Chaudhuri, Anj…
De-SaTE: Denoising Self-attention Transformer Encoders for Li-ion Battery Health Prognosticsby Gaurav Shinde, Rohan Mohapatra, Pooja…
Attention-free Spikformer: Mixing Spike Sequences with Simple Linear Transformsby Qingyu Wang, Duzhen Zhang, Tielin Zhang,…
Attention Augmented Convolutional Networksby Irwan Bello, Barret Zoph, Ashish Vaswani, Jonathon Shlens, Quoc V. LeFirst…