Summary of From Noise to Nuance: Advances in Deep Generative Image Models, by Benji Peng et al.
From Noise to Nuance: Advances in Deep Generative Image Modelsby Benji Peng, Chia Xin Liang,…
From Noise to Nuance: Advances in Deep Generative Image Modelsby Benji Peng, Chia Xin Liang,…
Is Contrastive Distillation Enough for Learning Comprehensive 3D Representations?by Yifan Zhang, Junhui HouFirst submitted to…
MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agentsby Yun Xing, Nhat Chung,…
ContRail: A Framework for Realistic Railway Image Synthesis using ControlNetby Andrei-Robert Alexandrescu, Razvan-Gabriel Petec, Alexandru…
TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraftby Qian Long, Zhi Li, Ran Gong,…
Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Expertsby Chenyang Zhu,…
MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Modelby Yunhe Pang, Bo Chen,…
SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactionsby Bufang…
ProtDAT: A Unified Framework for Protein Sequence Design from Any Protein Text Descriptionby Xiao-Yu Guo,…
BodyMetric: Evaluating the Realism of Human Bodies in Text-to-Image Generationby Nefeli Andreou, Varsha Vivek, Ying…