Summary of Catvton: Concatenation Is All You Need For Virtual Try-on with Diffusion Models, by Zheng Chong et al.
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Modelsby Zheng Chong, Xiao…
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Modelsby Zheng Chong, Xiao…
SwinSF: Image Reconstruction from Spatial-Temporal Spike Streamsby Liangyan Jiang, Chuang Zhu, Yanxu ChenFirst submitted to…
DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layerby Jinfeng Wei, Xiaofeng ZhangFirst submitted…
ReAttention: Training-Free Infinite Context with Finite Attention Scopeby Xiaoran Liu, Ruixiao Li, Qipeng Guo, Zhigeng…
Towards Robust Vision Transformer via Masked Adaptive Ensembleby Fudong Lin, Jiadong Lou, Xu Yuan, Nian-Feng…
Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questionsby Sarah Wiegreffe, Oyvind Tafjord, Yonatan…
Training-free Composite Scene Generation for Layout-to-Image Synthesisby Jiaqi Liu, Tao Huang, Chang XuFirst submitted to…
Linear-Complexity Self-Supervised Learning for Speech Processingby Shucong Zhang, Titouan Parcollet, Rogier van Dalen, Sourav BhattacharyaFirst…
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusionby Huiguo He, Huan Yang, Zixi Tuo,…
CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inferenceby Mohammad Erfan Sadeghi, Arash…