Summary of An Intermediate Fusion Vit Enables Efficient Text-image Alignment in Diffusion Models, by Zizhao Hu et al.
An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Modelsby Zizhao Hu, Shaochong Jia,…
An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Modelsby Zizhao Hu, Shaochong Jia,…
Adversarial Guided Diffusion Models for Adversarial Purificationby Guang Lin, Zerui Tao, Jianhai Zhang, Toshihisa Tanaka,…
Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Planeby Han Yan, Yang Li, Zhennan Wu, Shenzhou…
An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous…
A Framework for Portrait Stylization with Skin-Tone Awareness and Nudity Identificationby Seungkwon Kim, Sangyeon Kim,…
Latent Diffusion Models for Attribute-Preserving Image Anonymizationby Luca Piano, Pietro Basci, Fabrizio Lamberti, Lia MorraFirst…
ACDG-VTON: Accurate and Contained Diffusion Generation for Virtual Try-Onby Jeffrey Zhang, Kedan Li, Shao-Yu Chang,…
Depth-guided NeRF Training via Earth Mover’s Distanceby Anita Rau, Josiah Aklilu, F. Christopher Holsinger, Serena…
S2DM: Sector-Shaped Diffusion Models for Video Generationby Haoran Lang, Yuxuan Ge, Zheng TianFirst submitted to…
AnimateDiff-Lightning: Cross-Model Diffusion Distillationby Shanchuan Lin, Xiao YangFirst submitted to arxiv on: 19 Mar 2024CategoriesMain:…