Summary of Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration, by Yifan Shao
Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration, by Yifan Shao. First submitted to arXiv on: …
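Only the paper's title survives on this page, but the title itself names a concrete design: a local attention branch, a global attention branch, and an adaptive rule for mixing the two scales. As a rough illustration only (not the paper's actual architecture; the class, the windowing scheme, and the sigmoid gate below are all hypothetical stand-ins), a minimal PyTorch sketch of that general idea might look like this:

```python
# Hypothetical sketch of a local-global attention block. This is NOT the
# paper's implementation; it only illustrates the general pattern the title
# suggests: a local (windowed) attention branch, a global attention branch,
# and a learned per-token gate that adaptively fuses the two scales.
import torch
import torch.nn as nn


class LocalGlobalAttention(nn.Module):
    def __init__(self, dim: int, num_heads: int = 4, window: int = 8):
        super().__init__()
        self.window = window
        # Two standard multi-head attention branches (assumed, for illustration).
        self.local_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.global_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        # Per-token gate deciding how much local vs. global context to use.
        self.gate = nn.Sequential(nn.Linear(dim, 1), nn.Sigmoid())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, dim); seq_len is assumed divisible by `window`.
        b, n, d = x.shape
        # Local branch: attend only within non-overlapping windows.
        xw = x.reshape(b * n // self.window, self.window, d)
        local, _ = self.local_attn(xw, xw, xw)
        local = local.reshape(b, n, d)
        # Global branch: full attention over the whole sequence.
        glob, _ = self.global_attn(x, x, x)
        # Adaptive fusion: g in (0, 1) mixes the two scales per token.
        g = self.gate(x)
        return g * local + (1.0 - g) * glob


if __name__ == "__main__":
    x = torch.randn(2, 64, 32)           # (batch, seq_len, dim)
    out = LocalGlobalAttention(32)(x)
    print(out.shape)                      # torch.Size([2, 64, 32])
```

The sigmoid gate is the simplest conceivable "adaptive" fusion; the paper's actual mixing mechanism cannot be recovered from this page and may differ substantially.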