Summary of Bevworld: a Multimodal World Model For Autonomous Driving Via Unified Bev Latent Space, by Yumeng Zhang et al.
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Spaceby Yumeng Zhang,…
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Spaceby Yumeng Zhang,…
Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognitionby Yuxiang…
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instructby Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang,…
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activationsby Bowen Shen, Zheng Lin,…
Fine-Grained Multi-View Hand Reconstruction Using Inverse Renderingby Qijun Gan, Wentong Li, Jinwei Ren, Jianke ZhuFirst…
Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation…
Fast and Continual Knowledge Graph Embedding via Incremental LoRAby Jiajun Liu, Wenjun Ke, Peng Wang,…
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devicesby Jianwen Jiang, Gaojie Lin, Zhengkun Rong,…
TransMA: an explainable multi-modal deep learning model for predicting properties of ionizable lipid nanoparticles in…
Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learningby Jakob…