Summary of xLSTM: Extended Long Short-Term Memory, by Maximilian Beck et al.
xLSTM: Extended Long Short-Term Memory, by Maximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova,…
NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts, by Shudan Zhang, Hanlin Zhao,…
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving, by Yujun Lin, Haotian Tang, Shang…
Continual Learning in the Presence of Repetition, by Hamed Hemati, Lorenzo Pellegrini, Xiaotian Duan, Zixuan Zhao,…
Adaptive Least Mean pth Power Graph Neural Networks, by Yi Yan, Changran Peng, Ercan E. Kuruoglu. First…
Acceleration Algorithms in GNNs: A Survey, by Lu Ma, Zeang Sheng, Xunkai Li, Xinyi Gao, Zhezheng…
Policy Learning with a Language Bottleneck, by Megha Srivastava, Cedric Colas, Dorsa Sadigh, Jacob Andreas. First submitted…
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning, by Karim Galliamov,…
Ranking-based Client Selection with Imitation Learning for Efficient Federated Learning, by Chunlin Tian, Zhan Shi, Xinpeng…
Geometry and Dynamics of LayerNorm, by Paul M. Riechers. First submitted to arXiv on: 7 May 2024. Categories: Main:…