Summary of How Far Is Video Generation From World Model: a Physical Law Perspective, by Bingyi Kang et al.
How Far is Video Generation from World Model: A Physical Law Perspectiveby Bingyi Kang, Yang…
How Far is Video Generation from World Model: A Physical Law Perspectiveby Bingyi Kang, Yang…
RS-MoE: A Vision-Language Model with Mixture of Experts for Remote Sensing Image Captioning and Visual…
OSAD: Open-Set Aircraft Detection in SAR Imagesby Xiayang Xiao, Zhuoxuan Li, Haipeng WangFirst submitted to…
Shortcut Learning in In-Context Learning: A Surveyby Rui Song, Yingji Li, Lida Shi, Fausto Giunchiglia,…
Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Trackingby Christopher Richardson, Roshan Sharma, Neeraj…
Graph Learning for Numeric Planningby Dillon Z. Chen, Sylvie ThiébauxFirst submitted to arxiv on: 31…
Effective Guidance for Model Attention with Simple Yes-no Annotationsby Seongmin Lee, Ali Payani, Duen Horng…
Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Modelby Yiming Ji, Yang Liu,…
Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanismsby Feifei Zhao, Hui Feng,…
ADAM: An Embodied Causal Agent in Open-World Environmentsby Shu Yu, Chaochao LuFirst submitted to arxiv…