Summary of Mardini: Masked Autoregressive Diffusion For Video Generation at Scale, by Haozhe Liu et al.
MarDini: Masked Autoregressive Diffusion for Video Generation at Scaleby Haozhe Liu, Shikun Liu, Zijian Zhou,…
MarDini: Masked Autoregressive Diffusion for Video Generation at Scaleby Haozhe Liu, Shikun Liu, Zijian Zhou,…
Deep Learning Based Dense Retrieval: A Comparative Studyby Ming Zhong, Zhizhi Wu, Nanako HondaFirst submitted…
Effective Instruction Parsing Plugin for Complex Logical Query Answering on Knowledge Graphsby Xingrui Zhuo, Jiapu…
Peter Parker or Spiderman? Disambiguating Multiple Class Labelsby Nuthan Mummani, Simran Ketha, Venkatakrishnan RamaswamyFirst submitted…
Shared Control with Black Box Agents using Oracle Queriesby Inbal Avraham, Reuth MirskyFirst submitted to…
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimizationby Hongliang He, Wenlin…
Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Drivingby Liu Yunhao, Ding Hong, Zhang…
Knowledge Graph Enhanced Language Agents for Recommendationby Taicheng Guo, Chaochun Liu, Hai Wang, Varun Mannam,…
VARS: Vision-based Assessment of Risk in Security Systemsby Pranav Gupta, Pratham Gohil, Sridhar SFirst submitted…
AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMsby Clemencia Siro,…