Summary of Freeedit: Mask-free Reference-based Image Editing with Multi-modal Instruction, by Runze He et al.
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instructionby Runze He, Kai Ma, Linjiang Huang, Shaofei…
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instructionby Runze He, Kai Ma, Linjiang Huang, Shaofei…
HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detectionby Yuqi Ma, Mengyin…
AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Supportby Adil Bahaj, Mounir GhoghoFirst submitted…
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyondby Hong Chen, Xin Wang, Yuwei Zhou, Bin…
Brotherhood at WMT 2024: Leveraging LLM-Generated Contextual Conversations for Cross-Lingual Image Captioningby Siddharth Betala, Ishan…