Summary of Small Language Models: Survey, Measurements, and Insights, by Zhenyan Lu et al.
Small Language Models: Survey, Measurements, and Insightsby Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi,…
Small Language Models: Survey, Measurements, and Insightsby Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi,…
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Expertsby Xiaoming Shi, Shiyu Wang, Yuqi…
Looped Transformers for Length Generalizationby Ying Fan, Yilun Du, Kannan Ramchandran, Kangwook LeeFirst submitted to…
Double-Path Adaptive-correlation Spatial-Temporal Inverted Transformer for Stock Time Series Forecastingby Wenbo Yan, Ying TanFirst submitted…
ASTE Transformer Modelling Dependencies in Aspect-Sentiment Triplet Extractionby Iwo Naglik, Mateusz LangoFirst submitted to arxiv…
Efficiently Dispatching Flash Attention For Partially Filled Attention Masksby Agniv Sharma, Jonas GeipingFirst submitted to…
HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learningby Naghmeh Shafiee Roudbari, Ursula…
EDSNet: Efficient-DSNet for Video Summarizationby Ashish Prasad, Pranav Jeevan, Amit SethiFirst submitted to arxiv on:…
Kriformer: A Novel Spatiotemporal Kriging Approach Based on Graph Transformersby Renbin Pan, Feng Xiao, Hegui…
Medical Concept Normalization in a Low-Resource Settingby Tim PatzeltFirst submitted to arxiv on: 6 Sep…