Loading Now

Summary of Aligning Cyber Space with Physical World: a Comprehensive Survey on Embodied Ai, by Yang Liu et al.


Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

by Yang Liu, Weixing Chen, Yongjie Bai, Xiaodan Liang, Guanbin Li, Wen Gao, Liang Lin

First submitted to arxiv on: 9 Jul 2024

Categories

  • Main: Computer Vision and Pattern Recognition (cs.CV)
  • Secondary: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Robotics (cs.RO)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
This survey provides a comprehensive exploration of the latest advancements in Embodied Artificial Intelligence (Embodied AI), a crucial step towards achieving Artificial General Intelligence (AGI) and enabling various applications that bridge cyberspace and the physical world. The emergence of Multi-modal Large Models (MLMs) and World Models (WMs) has attracted significant attention due to their remarkable perception, interaction, and reasoning capabilities, making them a promising architecture for the brain of embodied agents. This survey navigates through representative works of embodied robots and simulators, analyzing four main research targets: embodied perception, embodied interaction, embodied agent, and sim-to-real adaptation. The study covers state-of-the-art methods, essential paradigms, and comprehensive datasets, highlighting the significance of MLMs in facilitating interactions in dynamic digital and physical environments.
Low GrooveSquid.com (original content) Low Difficulty Summary
Embodied Artificial Intelligence is a way to make computers work like humans do. It’s an important step towards making super smart computers that can understand and interact with the world around them. The paper looks at what people have been doing to try and achieve this goal, and how they’re using special kinds of computer models called Multi-modal Large Models (MLMs) to help them. These models are really good at learning and understanding things, which is important for making computers that can work in the real world.

Keywords

* Artificial intelligence  * Attention  * Multi modal