Summary of Wem-gan: Wavelet Transform Based Facial Expression Manipulation, by Dongya Sun et al.
WEM-GAN: Wavelet transform based facial expression manipulationby Dongya Sun, Yunfei Hu, Xianzhe Zhang, Yingsong HuFirst…
WEM-GAN: Wavelet transform based facial expression manipulationby Dongya Sun, Yunfei Hu, Xianzhe Zhang, Yingsong HuFirst…
ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?by Leixin Zhang, Steffen…
Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Modelsby Jungwon…
AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolationby Zhihang Lin, Mingbao Lin, Wengyi Zhan, Rongrong…
RandAR: Decoder-only Autoregressive Visual Generation in Random Ordersby Ziqi Pang, Tianyuan Zhang, Fujun Luan, Yunze…
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Modelsby Khaled Abud, Sergey…
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Costby Sen Xing, Muyan…
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generationby Zilyu Ye,…
CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Imagesby Jian…
Safety Alignment Backfires: Preventing the Re-emergence of Suppressed Concepts in Fine-tuned Text-to-Image Diffusion Modelsby Sanghyun…