Summary of Diffava: Personalized Text-to-audio Generation with Visual Alignment, by Shentong Mo et al.
DiffAVA: Personalized Text-to-Audio Generation with Visual Alignmentby Shentong Mo, Jing Shi, Yapeng TianFirst submitted to…
DiffAVA: Personalized Text-to-Audio Generation with Visual Alignmentby Shentong Mo, Jing Shi, Yapeng TianFirst submitted to…
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learningby Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek…
On-Device Training Under 256KB Memoryby Ji Lin, Ligeng Zhu, Wei-Ming Chen, Wei-Chen Wang, Chuang Gan,…
LPT: Long-tailed Prompt Tuning for Image Classificationby Bowen Dong, Pan Zhou, Shuicheng Yan, Wangmeng ZuoFirst…