Summary of Aligning Cyber Space with Physical World: a Comprehensive Survey on Embodied Ai, by Yang Liu et al.
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
by Yang Liu, Weixing Chen, Yongjie Bai, Xiaodan Liang, Guanbin Li, Wen Gao, Liang Lin
First submitted to arxiv on: 9 Jul 2024
Categories
- Main: Computer Vision and Pattern Recognition (cs.CV)
- Secondary: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Robotics (cs.RO)
GrooveSquid.com Paper Summaries
GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!
Summary difficulty | Written by | Summary |
---|---|---|
High | Paper authors | High Difficulty Summary Read the original abstract here |
Medium | GrooveSquid.com (original content) | Medium Difficulty Summary This survey provides a comprehensive exploration of the latest advancements in Embodied Artificial Intelligence (Embodied AI), a crucial step towards achieving Artificial General Intelligence (AGI) and enabling various applications that bridge cyberspace and the physical world. The emergence of Multi-modal Large Models (MLMs) and World Models (WMs) has attracted significant attention due to their remarkable perception, interaction, and reasoning capabilities, making them a promising architecture for the brain of embodied agents. This survey navigates through representative works of embodied robots and simulators, analyzing four main research targets: embodied perception, embodied interaction, embodied agent, and sim-to-real adaptation. The study covers state-of-the-art methods, essential paradigms, and comprehensive datasets, highlighting the significance of MLMs in facilitating interactions in dynamic digital and physical environments. |
Low | GrooveSquid.com (original content) | Low Difficulty Summary Embodied Artificial Intelligence is a way to make computers work like humans do. It’s an important step towards making super smart computers that can understand and interact with the world around them. The paper looks at what people have been doing to try and achieve this goal, and how they’re using special kinds of computer models called Multi-modal Large Models (MLMs) to help them. These models are really good at learning and understanding things, which is important for making computers that can work in the real world. |
Keywords
* Artificial intelligence * Attention * Multi modal