Loading Now

Summary of The Synergy Between Data and Multi-modal Large Language Models: a Survey From Co-development Perspective, by Zhen Qin et al.


The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

by Zhen Qin, Daoyuan Chen, Wenhao Zhang, Liuyi Yao, Yilun Huang, Bolin Ding, Yaliang Li, Shuiguang Deng

First submitted to arxiv on: 11 Jul 2024

Categories

  • Main: Artificial Intelligence (cs.AI)
  • Secondary: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The paper investigates the development of large language models (LLMs) and multi-modal LLMs (MLLMs), which extend their capabilities beyond text to various domains. MLLMs rely on vast amounts of data, highlighting the importance of data in achieving emergent capabilities. Recent works find that model development and data development are interconnected, with better data leading to improved MLLM performance and MLLMs facilitating data development. The paper reviews existing works related to MLLMs from a data-model co-development perspective.
Low GrooveSquid.com (original content) Low Difficulty Summary
This paper explores how large language models (LLMs) can be extended to handle various types of data. It looks at the connection between LLMs and multi-modal LLMs, which can process images, audio, or other forms of data. The authors find that these models rely on huge amounts of information to work well. They also discover that developing better models and better data go hand in hand.

Keywords

* Artificial intelligence  * Multi modal