Summary of GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation, by Yushi Lan et al.
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation by Yushi Lan, Shangchen Zhou, Zhaoyang Lyu,…
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation by Zhenbin Wang, Lei Zhang,…
VQ-Map: Bird’s-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization by Yiwei Zhang, Jin…
Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy by Panagiota Gatoula,…
LPUWF-LDM: Enhanced Latent Diffusion Model for Precise Late-phase UWF-FA Generation on Limited Dataset by Zhaojie Fang,…
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations by Can Qin, Congying Xia, Krithika Ramakrishnan, Michael Ryoo,…
SentenceVAE: Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer…
FedVAE: Trajectory privacy preserving based on Federated Variational AutoEncoder by Yuchen Jiang, Ying Wu, Shiyao Zhang,…
V-VIPE: Variational View Invariant Pose Embedding by Mara Levy, Abhinav Shrivastava. First submitted to arXiv on: 9…
CV-VAE: A Compatible Video VAE for Latent Generative Video Models by Sijie Zhao, Yong Zhang, Xiaodong…