Summary of Doe-1: Closed-loop Autonomous Driving with Large World Model, by Wenzhao Zheng et al.
Doe-1: Closed-Loop Autonomous Driving with Large World Modelby Wenzhao Zheng, Zetian Xia, Yuanhui Huang, Sicheng…
Doe-1: Closed-Loop Autonomous Driving with Large World Modelby Wenzhao Zheng, Zetian Xia, Yuanhui Huang, Sicheng…
Multi-Scale Heterogeneous Text-Attributed Graph Datasets From Diverse Domainsby Yunhui Liu, Qizhuo Xie, Jinwei Shi, Jiaxu…
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Surveyby Yayun Qi, Hongxi Li, Yiqi…
Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Lossesby Jiayun Luo, Mir Rayat…
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Modelsby Quang-Hung Le, Long Hoang Dang,…
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answeringby Amirhossein Abaskohi, Spandana…
SplaXBERT: Leveraging Mixed Precision Training and Context Splitting for Question Answeringby Zhu Yufan, Hao Zeyu,…
EACO: Enhancing Alignment in Multimodal LLMs via Critical Observationby Yongxin Wang, Meng Cao, Haokun Lin,…
SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extractionby Ethan Bradley, Muhammad…
Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challengesby Minghao Shao, Abdul Basit,…