Summary of Av-emodialog: Chat with Audio-visual Users Leveraging Emotional Cues, by Se Jin Park et al.
AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cuesby Se Jin Park, Yeonju Kim, Hyeongseop Rha,…
AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cuesby Se Jin Park, Yeonju Kim, Hyeongseop Rha,…
Let’s Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversationby Se Jin Park, Chae Won…
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languagesby…