Summary of Towards Latent Masked Image Modeling For Self-supervised Visual Representation Learning, by Yibing Wei et al.
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning, by Yibing Wei, Abhinav Gupta, Pedro…
When Synthetic Traces Hide Real Content: Analysis of Stable Diffusion Image Laundering, by Sara Mandelli, Paolo…
Video Occupancy Models, by Manan Tomar, Philippe Hansen-Estruch, Philip Bachman, Alex Lamb, John Langford, Matthew E.…
Image-Conditional Diffusion Transformer for Underwater Image Enhancement, by Xingyang Nie, Su Pan, Xiaoyu Zhai, Shifei Tao,…
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space, by Yumeng Zhang,…
MARS: Paying more attention to visual attributes for text-based person search, by Alex Ergasti, Tomaso Fontanini,…
Face Reconstruction Transfer Attack as Out-of-Distribution Generalization, by Yoon Gyo Jung, Jaewoo Park, Xingbo Dong, Hojin…
Rethinking and Defending Protective Perturbation in Personalized Diffusion Models, by Yixin Liu, Ruoxi Chen, Xun Chen,…
Nomic Embed Vision: Expanding the Latent Space, by Zach Nussbaum, Brandon Duderstadt, Andriy Mulyar. First submitted to…
Aligning Diffusion Models with Noise-Conditioned Perception, by Alexander Gambashidze, Anton Kulikov, Yuriy Sosnin, Ilya Makarov. First submitted…