Summary of P4q: Learning to Prompt For Quantization in Visual-language Models, by Huixin Sun et al.
P4Q: Learning to Prompt for Quantization in Visual-language Modelsby Huixin Sun, Runqi Wang, Yanjing Li,…
P4Q: Learning to Prompt for Quantization in Visual-language Modelsby Huixin Sun, Runqi Wang, Yanjing Li,…
Intrapartum Ultrasound Image Segmentation of Pubic Symphysis and Fetal Head Using Dual Student-Teacher Framework with…
LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolutionby Jeongsoo Kim, Jongho Nang, Junsuk ChoeFirst submitted…
Equitable Skin Disease Prediction Using Transfer Learning and Domain Adaptationby Sajib Acharjee Dip, Kazi Hasan…
LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in Vision-Language Modelsby Jingyi Wang, Jianzhong Ju,…
Symmetric masking strategy enhances the performance of Masked Image Modelingby Khanh-Binh Nguyen, Chae Jung ParkFirst…
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Modelsby Kazi Hasan Ibn Arif,…
Pedestrian Attribute Recognition: A New Benchmark Dataset and A Large Language Model Augmented Frameworkby Jiandong…
Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitationby Weiqi Feng, Yangrui Chen, Shaoyu Wang,…
DeMansia: Mamba Never Forgets Any Tokensby Ricky FangFirst submitted to arxiv on: 4 Aug 2024CategoriesMain:…