Summary of Fopru: Focal Pruning For Efficient Large Vision-language Models, by Lei Jiang et al.
FoPru: Focal Pruning for Efficient Large Vision-Language Modelsby Lei Jiang, Weizhe Huang, Tongxuan Liu, Yuting…
FoPru: Focal Pruning for Efficient Large Vision-Language Modelsby Lei Jiang, Weizhe Huang, Tongxuan Liu, Yuting…
Revisiting the Integration of Convolution and Attention for Vision Backboneby Lei Zhu, Xinjiang Wang, Wayne…
Mirror Target YOLO: An Improved YOLOv8 Method with Indirect Vision for Heritage Buildings Fire Detectionby…
Cross-Camera Distracted Driver Classification through Feature Disentanglement and Contrastive Learningby Simone Bianco, Luigi Celona, Paolo…
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulationby…
CCIS-Diff: A Generative Model with Stable Diffusion Prior for Controlled Colonoscopy Image Synthesisby Yifan Xie,…
Enhancing LLM Reasoning with Reward-guided Tree Searchby Jinhao Jiang, Zhipeng Chen, Yingqian Min, Jie Chen,…
ColorEdit: Training-free Image-Guided Color editing with diffusion modelby Xingxi Yin, Zhi Li, Jingfeng Zhang, Chenglin…
Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentationby Markus Karmann, Onay UrfaliogluFirst submitted to…
Vision Eagle Attention: a new lens for advancing image classificationby Mahmudul HasanFirst submitted to arxiv…