Summary of MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models, by Sanjoy Chowdhury et al.
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models by Sanjoy Chowdhury, Sayan Nag,…
RU-AI: A Large Multimodal Dataset for Machine-Generated Content Detection by Liting Huang, Zhihao Zhang, Yiran Zhang,…
FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages by Bernardo Leite, Tomás Freitas…
ELFS: Label-Free Coreset Selection with Proxy Training Dynamics by Haizhong Zheng, Elisa Tsai, Yifu Lu, Jiachen…
SocialNLP Fake-EmoReact 2021 Challenge Overview: Predicting Fake Tweets from Their Replies and GIFs by Chien-Kun Huang,…
Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning by Amandeep Kumar, Muhammad Awais, Sanath Narayan,…
Promoting the Responsible Development of Speech Datasets for Mental Health and Neurological Disorders Research by Eleonora…
Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection by Qutub Syed,…
Are language models rational? The case of coherence norms and belief revision by Thomas Hofweber, Peter…
PuFace: Defending against Facial Cloaking Attacks for Facial Recognition Models by Jing Wen. First submitted to arXiv…