Summary of Audiobert: Audio Knowledge Augmented Language Model, by Hyunjong Ok et al.
AudioBERT: Audio Knowledge Augmented Language Modelby Hyunjong Ok, Suho Yoo, Jaeho LeeFirst submitted to arxiv…
AudioBERT: Audio Knowledge Augmented Language Modelby Hyunjong Ok, Suho Yoo, Jaeho LeeFirst submitted to arxiv…
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sourcesby Alisia Lupidi, Carlos Gemmell,…
LT3SD: Latent Trees for 3D Scene Diffusionby Quan Meng, Lei Li, Matthias Nießner, Angela DaiFirst…
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scaleby Rogerio Bonatti, Dan Zhao, Francesco Bonacci,…
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimallyby Qiuhong Shen, Xingyi Yang, Xinchao WangFirst…
IFAdapter: Instance Feature Control for Grounded Text-to-Image Generationby Yinwei Wu, Xianpan Zhou, Bing Ma, Xuefeng…
360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translationby Hai Wang, Jing-Hao XueFirst submitted to arxiv on: 12…
Bayesian Inverse Graphics for Few-Shot Concept Learningby Octavio Arriaga, Jichen Guo, Rebecca Adam, Sebastian Houben,…
When Context Leads but Parametric Memory Follows in Large Language Modelsby Yufei Tao, Adam Hiatt,…
Knowledge Tagging with Large Language Model based Multi-Agent Systemby Hang Li, Tianlong Xu, Ethan Chang,…