Summary of "State Space Model for New-Generation Network Alternative to Transformers: A Survey", by Xiao Wang et al.
State Space Model for New-Generation Network Alternative to Transformers: A Survey
by Xiao Wang, Shiao Wang, …