Summary of OminiControl: Minimal and Universal Control for Diffusion Transformer, by Zhenxiong Tan et al.
OminiControl: Minimal and Universal Control for Diffusion Transformer, by Zhenxiong Tan, Songhua Liu, Xingyi Yang, Qiaochu…
What You See is Not What You Get: Neural Partial Differential Equations and The Illusion…
Context-Aware Multimodal Pretraining, by Karsten Roth, Zeynep Akata, Dima Damen, Ivana Balažević, Olivier J. Hénaff. First submitted…
Grid and Road Expressions Are Complementary for Trajectory Representation Learning, by Silin Zhou, Shuo Shang, Lisi…
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers, by Hongbo Liu. First submitted to…
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections, by Youwei Zhou, Tianyang…
Continual SFT Matches Multimodal RLHF with Negative Supervision, by Ke Zhu, Yu Wang, Yanpeng Sun, Qiang…
Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension, by Luca Parolari, Elena Izzo, Lamberto…
Facial Features Matter: a Dynamic Watermark based Proactive Deepfake Detection Approach, by Shulin Lan, Kanlin Liu,…
High-Resolution Image Synthesis via Next-Token Prediction, by Dengsheng Chen, Jie Hu, Tiezhu Yue, Xiaoming Wei, Enhua…