Summary of Swifttry: Fast and Consistent Video Virtual Try-on with Diffusion Models, by Hung Nguyen et al.
SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Modelsby Hung Nguyen, Quang Qui-Vinh Nguyen,…
SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Modelsby Hung Nguyen, Quang Qui-Vinh Nguyen,…
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understandingby Zhiyu Wu, Xiaokang Chen, Zizheng Pan, Xingchao…
Semi-IIN: Semi-supervised Intra-inter modal Interaction Learning Network for Multimodal Sentiment Analysisby Jinhao Lin, Yifei Wang,…
Is Contrastive Distillation Enough for Learning Comprehensive 3D Representations?by Yifan Zhang, Junhui HouFirst submitted to…
Accurate Water Level Monitoring in AWD Rice Cultivation Using Convolutional Neural Networksby Ahmed Rafi Hasan,…
BENet: A Cross-domain Robust Network for Detecting Face Forgeries via Bias Expansion and Latent-space Attentionby…
ACQ: A Unified Framework for Automated Programmatic Creativity in Online Advertisingby Ruizhi Wang, Kai Liu,…
AnomalyControl: Learning Cross-modal Semantic Features for Controllable Anomaly Synthesisby Shidan He, Lei Liu, Shen ZhaoFirst…
A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentationby Ruoxin Wang, Tianyi Tang, Haiming Du,…
UMSPU: Universal Multi-Size Phase Unwrapping via Mutual Self-Distillation and Adaptive Boosting Ensemble Segmentersby Lintong Du,…