Summary of On Domain-specific Post-training For Multimodal Large Language Models, by Daixuan Cheng et al.
On Domain-Specific Post-Training for Multimodal Large Language Modelsby Daixuan Cheng, Shaohan Huang, Ziyu Zhu, Xintong…
On Domain-Specific Post-Training for Multimodal Large Language Modelsby Daixuan Cheng, Shaohan Huang, Ziyu Zhu, Xintong…
Free-form Generation Enhances Challenging Clothed Human Modelingby Hang Ye, Xiaoxuan Ma, Hai Ci, Wentao Zhu,…
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capabilityby Zicheng Lin, Tian Liang, Jiahao…
Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmarkby Joseph Heyward, João Carreira,…
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillationby Zhiqiang Shen, Ammar Sherif, Zeyuan Yin,…
AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videosby Yuze He, Wang…
Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentationby Shukang Yin, Chaoyou Fu, Sirui Zhao, Yunhang Shen, Chunjiang…
Creating Hierarchical Dispositions of Needs in an Agentby Tofara MoyoFirst submitted to arxiv on: 23…
Partitioning Message Passing for Graph Fraud Detectionby Wei Zhuo, Zemin Liu, Bryan Hooi, Bingsheng He,…
Mapping waterways worldwide with deep learningby Matthew Pierson, Zia MehrabiFirst submitted to arxiv on: 24…