Summary of Small Language Models: Survey, Measurements, and Insights, by Zhenyan Lu et al.
Small Language Models: Survey, Measurements, and Insightsby Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi,…
Small Language Models: Survey, Measurements, and Insightsby Zhenyan Lu, Xiang Li, Dongqi Cai, Rongjie Yi,…
Looped Transformers for Length Generalizationby Ying Fan, Yilun Du, Kannan Ramchandran, Kangwook LeeFirst submitted to…
Double-Path Adaptive-correlation Spatial-Temporal Inverted Transformer for Stock Time Series Forecastingby Wenbo Yan, Ying TanFirst submitted…
ASTE Transformer Modelling Dependencies in Aspect-Sentiment Triplet Extractionby Iwo Naglik, Mateusz LangoFirst submitted to arxiv…
HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learningby Naghmeh Shafiee Roudbari, Ursula…
Efficiently Dispatching Flash Attention For Partially Filled Attention Masksby Agniv Sharma, Jonas GeipingFirst submitted to…
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Modelsby Hossein Rajabzadeh, Aref Jafari,…
EDSNet: Efficient-DSNet for Video Summarizationby Ashish Prasad, Pranav Jeevan, Amit SethiFirst submitted to arxiv on:…
Kriformer: A Novel Spatiotemporal Kriging Approach Based on Graph Transformersby Renbin Pan, Feng Xiao, Hegui…
Sparse Low-Ranked Self-Attention Transformer for Remaining Useful Lifetime Prediction of Optical Fiber Amplifiersby Dominic Schneider,…