Summary of UMBRAE: Unified Multimodal Brain Decoding, by Weihao Xia et al.
UMBRAE: Unified Multimodal Brain Decoding, by Weihao Xia, Raoul de Charette, Cengiz Öztireli, Jing-Hao Xue. First submitted…
DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space, by Jianxiang Xiang, Zhenhua Liu,…
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders, by Parishad BehnamGhader, Vaibhav Adlakha, Marius Mosbach,…
HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields, by Arnab Dey, Di Yang, Antitza Dantcheva,…
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion, by Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong…
ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale, by Jinbin Huang, Chen Chen,…
OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation, by Xiongwei Wu, Sicheng Yu, Ee-Peng…
Unleash the Potential of CLIP for Video Highlight Detection, by Donghoon Han, Seunghyeon Seo, Eunhwan Park,…
ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition, by Weidong Xie, Lun Luo, Nanfei Ye, Yi…
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models, by Yanwei Li, Yuechen Zhang, Chengyao Wang,…