Summary of Self-selected Attention Span For Accelerating Large Language Model Inference, by Tian Jin et al.
Self-Selected Attention Span for Accelerating Large Language Model Inferenceby Tian Jin, Wanzin Yazar, Zifei Xu,…
Self-Selected Attention Span for Accelerating Large Language Model Inferenceby Tian Jin, Wanzin Yazar, Zifei Xu,…
CATP: Cross-Attention Token Pruning for Accuracy Preserved Multimodal Model Inferenceby Ruqi Liao, Chuqing Zhao, Jin…
Optimal path for Biomedical Text Summarization Using Pointer GPTby Hyunkyung Han, Jaesik ChoiFirst submitted to…
Playing to Vision Foundation Model’s Strengths in Stereo Matchingby Chuang-Wei Liu, Qijun Chen, Rui FanFirst…
LLM2Vec: Large Language Models Are Secretly Powerful Text Encodersby Parishad BehnamGhader, Vaibhav Adlakha, Marius Mosbach,…
Band-Attention Modulated RetNet for Face Forgery Detectionby Zhida Zhang, Jie Cao, Wenkui Yang, Qihang Fan,…
Uncertainty-aware Evidential Fusion-based Learning for Semi-supervised Medical Image Segmentationby Yuanpeng He, Lijian LiFirst submitted to…
GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasetsby…
Spatio-Temporal Attention Graph Neural Network for Remaining Useful Life Predictionby Zhixin Huang, Yujiang He, Bernhard…