Loading Now

Summary of Hapfi: History-aware Planning Based on Fused Information, by Sujin Jeon et al.


HAPFI: History-Aware Planning based on Fused Information

by Sujin Jeon, Suyeon Shin, Byoung-Tak Zhang

First submitted to arxiv on: 23 Jul 2024

Categories

  • Main: Artificial Intelligence (cs.AI)
  • Secondary: Robotics (cs.RO)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The paper introduces a task called Embodied Instruction Following (EIF), which involves planning a sequence of sub-goals based on high-level natural language instructions. The authors argue that an agent must consider its past experiences when making decisions in each step. They propose History-Aware Planning based on Fused Information (HAPFI) to leverage historical data from diverse modalities, including RGB observations, bounding boxes, and high-level instructions. HAPFI integrates these modalities using a Mutually Attentive Fusion method. The authors compare their approach with others that neglect historical data and demonstrate its superiority in action planning capability. They also provide qualitative evidence highlighting the importance of leveraging historical multi-modal data.
Low GrooveSquid.com (original content) Low Difficulty Summary
Embodied Instruction Following is a way for machines to follow instructions given in natural language. Imagine you’re trying to make a salad, and someone tells you what to do step by step: “Rinse a slice of lettuce…” The authors of this paper think that machines should also consider their past experiences when following these kinds of instructions. They propose a new way for machines to use information from the past to make better decisions in each step. This approach combines different types of data, like what’s happening around them and what they’re supposed to do. The results show that this method is better than others at planning the right actions. It also helps machines recover better if they make a mistake.

Keywords

* Artificial intelligence  * Multi modal