Summary of Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning, by Yibing Wei et al.
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning by Yibing Wei, Abhinav Gupta, Pedro…
When Synthetic Traces Hide Real Content: Analysis of Stable Diffusion Image Laundering by Sara Mandelli, Paolo…
Video Occupancy Models by Manan Tomar, Philippe Hansen-Estruch, Philip Bachman, Alex Lamb, John Langford, Matthew E.…
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space by Yumeng Zhang,…
Image-Conditional Diffusion Transformer for Underwater Image Enhancement by Xingyang Nie, Su Pan, Xiaoyu Zhai, Shifei Tao,…
MARS: Paying more attention to visual attributes for text-based person search by Alex Ergasti, Tomaso Fontanini,…
Face Reconstruction Transfer Attack as Out-of-Distribution Generalization by Yoon Gyo Jung, Jaewoo Park, Xingbo Dong, Hojin…
Nomic Embed Vision: Expanding the Latent Space by Zach Nussbaum, Brandon Duderstadt, Andriy Mulyar. First submitted to…
Rethinking and Defending Protective Perturbation in Personalized Diffusion Models by Yixin Liu, Ruoxi Chen, Xun Chen,…
Aligning Diffusion Models with Noise-Conditioned Perception by Alexander Gambashidze, Anton Kulikov, Yuriy Sosnin, Ilya Makarov. First submitted…