Summary of Stablemask: Refining Causal Masking in Decoder-only Transformer, by Qingyu Yin et al.
StableMask: Refining Causal Masking in Decoder-only Transformerby Qingyu Yin, Xuzheng He, Xiang Zhuang, Yu Zhao,…
StableMask: Refining Causal Masking in Decoder-only Transformerby Qingyu Yin, Xuzheng He, Xiang Zhuang, Yu Zhao,…
Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuningby Ningyuan Tang, Minghao Fu, Ke Zhu, Jianxin WuFirst submitted…
Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairingby…
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learningby Zhaohui Jiang, Paul WengFirst submitted to arxiv…
Enhancing Cross-Modal Contextual Congruence for Crowdfunding Success using Knowledge-infused Learningby Trilok Padhi, Ugur Kursuncu, Yaman…
Benchmark for CEC 2024 Competition on Multiparty Multiobjective Optimizationby Wenjian Luo, Peilan Xu, Shengxiang Yang,…
MUSTAN: Multi-scale Temporal Context as Attention for Robust Video Foreground Segmentationby Praveen Kumar Pokala, Jaya…
LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognitionby Youbing Hu, Yun Cheng,…
Local Feature Matching Using Deep Learning: A Surveyby Shibiao Xu, Shunpeng Chen, Rongtao Xu, Changwei…
Navigating the OverKill in Large Language Modelsby Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao,…