Summary of On Speculative Decoding For Multimodal Large Language Models, by Mukul Gagrani et al.
On Speculative Decoding for Multimodal Large Language Models by Mukul Gagrani, Raghavv Goel, Wonseok Jeon, Junyoung…
Harnessing the Power of Large Vision Language Models for Synthetic Image Detection by Mamadou Keita, Wassim…
Bi-LORA: A Vision-Language Approach for Synthetic Image Detection by Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene,…
A Survey on Large Language Models from Concept to Implementation by Chen Wang, Jin Zhao, Jiaqi…
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction by Inhwan Bae, Junoh Lee, Hae-Gon Jeon. First…
Semi-Supervised Image Captioning Considering Wasserstein Graph Matching by Yang Yang. First submitted to arXiv on: 26 Mar…
VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning by Yongshuo Zong, Ondrej Bohdal,…
Differentially Private Representation Learning via Image Captioning by Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus,…
AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization by Jiyao Li, Mingze Ni, Yifei Dong, Tianqing…
PICS: Pipeline for Image Captioning and Search by Grant Rosario, David Noever. First submitted to arXiv on:…