Summary of Is One Gpu Enough? Pushing Image Generation at Higher-resolutions with Foundation Models, by Athanasios Tragakis et al.
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Modelsby Athanasios Tragakis, Marco…
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Modelsby Athanasios Tragakis, Marco…
Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learningby Amandeep Kumar, Muhammad Awais, Sanath Narayan,…
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Modelby Zhengang Li, Yan Kang, Yuchen…
CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsby Sijie Zhao, Yong Zhang, Xiaodong…
Exploring Alignment in Shared Cross-lingual Spacesby Basel Mousi, Nadir Durrani, Fahim Dalvi, Majd Hawasly, Ahmed…
NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolationby Chaokang Jiang, Dalong…
SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Spaceby Zeren Zhang, Haibo Qin,…
Long Tail Image Generation Through Feature Space Augmentation and Iterated Learningby Rafael Elberg, Denis Parra,…
Contextual Categorization Enhancement through LLMs Latent-Spaceby Zineddine Bettouche, Anas Safi, Andreas FischerFirst submitted to arxiv…
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?by Yuchi Wang, Shuhuai…