Summary of Gaussiananything: Interactive Point Cloud Latent Diffusion For 3d Generation, by Yushi Lan et al.
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generationby Yushi Lan, Shangchen Zhou, Zhaoyang Lyu,…
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generationby Yushi Lan, Shangchen Zhou, Zhaoyang Lyu,…
VQ-Map: Bird’s-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantizationby Yiwei Zhang, Jin…
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generationby Zhenbin Wang, Lei Zhang,…
Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopyby Panagiota Gatoula,…
LPUWF-LDM: Enhanced Latent Diffusion Model for Precise Late-phase UWF-FA Generation on Limited Datasetby Zhaojie Fang,…
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representationsby Can Qin, Congying Xia, Krithika Ramakrishnan, Michael Ryoo,…
SentenceVAE: Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer…
FedVAE: Trajectory privacy preserving based on Federated Variational AutoEncoderby Yuchen Jiang, Ying Wu, Shiyao Zhang,…
V-VIPE: Variational View Invariant Pose Embeddingby Mara Levy, Abhinav ShrivastavaFirst submitted to arxiv on: 9…
CV-VAE: A Compatible Video VAE for Latent Generative Video Modelsby Sijie Zhao, Yong Zhang, Xiaodong…