Summary of Efficient Long Video Tokenization Via Coordinate-based Patch Reconstruction, by Huiwon Jang et al.
Efficient Long Video Tokenization via Coordinate-based Patch Reconstructionby Huiwon Jang, Sihyun Yu, Jinwoo Shin, Pieter…
Efficient Long Video Tokenization via Coordinate-based Patch Reconstructionby Huiwon Jang, Sihyun Yu, Jinwoo Shin, Pieter…
Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translationby Jeongsol Kim, Beomsu Kim,…
Preference Alignment for Diffusion Model via Explicit Denoised Distribution Estimationby Dingyuan Shi, Yong Wang, Hangyu…
Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddingsby Pura Peetathawatchai, Wei-Ning Chen, Berivan…
Stable Flow: Vital Layers for Training-Free Image Editingby Omri Avrahami, Or Patashnik, Ohad Fried, Egor…
Dealing with Synthetic Data Contamination in Online Continual Learningby Maorong Wang, Nicolas Michel, Jiafeng Mao,…
Non-Linear Outlier Synthesis for Out-of-Distribution Detectionby Lars Doorenbos, Raphael Sznitman, Pablo Márquez-NeilaFirst submitted to arxiv…
PoM: Efficient Image and Video Generation with the Polynomial Mixerby David Picard, Nicolas DufourFirst submitted…
UrbanDiT: A Foundation Model for Open-World Urban Spatio-Temporal Learningby Yuan Yuan, Chonghua Han, Jingtao Ding,…
Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testingby Haiping Ma, Aoqing Xia, Changqian…