Summary of Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers, by Yuxin Wen et al.
Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers by Yuxin Wen, Qingqing Cao, Qichen…
SPIN: Self-Supervised Prompt INjection by Leon Zhou, Junfeng Yang, Chengzhi Mao. First submitted to arxiv on: 17…
Cerberus: Efficient Inference with Adaptive Parallel Decoding and Sequential Knowledge Enhancement by Yuxuan Liu, Wenyuan Li,…
Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism by Yimin Tang, Yurong Xu,…
Efficient Diffusion as Low Light Enhancer by Guanzhou Lan, Qianli Ma, Yuqi Yang, Zhigang Wang, Dong…
A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetic by Lennert De Smet, Pedro Zuidberg…
Dual-Model Distillation for Efficient Action Classification with Hybrid Edge-Cloud Solution by Timothy Wei, Hsien Xin Peng,…
Order-aware Interactive Segmentation by Bin Wang, Anwesa Choudhuri, Meng Zheng, Zhongpai Gao, Benjamin Planche, Andong Deng,…
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation by Xiaonan Jing, Srinivas…
Pyramid-Driven Alignment: Pyramid Principle Guided Integration of Large Language Models and Knowledge Graphs by Lei Sun,…