Summary of Bootstrapping Linear Models For Fast Online Adaptation in Human-agent Collaboration, by Benjamin a Newman et al.

Bootstrapping Linear Models for Fast Online Adaptation in Human-Agent Collaboration

by Benjamin A Newman, Chris Paxton, Kris Kitani, Henny Admoni

First submitted to arxiv on: 16 Apr 2024

GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty	Written by	Summary
High	Paper authors	High Difficulty Summary Read the original abstract here
Medium	GrooveSquid.com (original content)	Medium Difficulty Summary This paper proposes a novel approach to initializing policies for agents that assist people, allowing them to adapt quickly to align with their partners’ reward functions. The method, called BLR-HAC (Bootstrapped Logistic Regression for Human Agent Collaboration), combines the benefits of large nonlinear models and low-capacity models to achieve efficient online learning. By bootstrapping large nonlinear models to learn the parameters of a low-capacity model, BLR-HAC enables rapid inference and fine-tuning updates during collaboration. The approach is tested in a simulated surface rearrangement task, demonstrating higher zero-shot accuracy than shallow methods while requiring less computation to adapt online.
Low	GrooveSquid.com (original content)	Low Difficulty Summary This paper helps machines work better with people by learning how to quickly adjust to the person’s goals. The researchers created a new method called BLR-HAC that combines two different types of models to make this happen. It starts with a big model and then adjusts it using a smaller model, which allows it to learn quickly and accurately while still being efficient with its computations.

Keywords

* Artificial intelligence * Bootstrapping * Fine tuning * Inference * Logistic regression * Online learning * Zero shot

Bootstrapping Linear Models for Fast Online Adaptation in Human-Agent Collaboration

by Benjamin A Newman, Chris Paxton, Kris Kitani, Henny Admoni

Categories

GrooveSquid.com Paper Summaries

Keywords

Summary of Into the Fog: Evaluating Robustness Of Multiple Object Tracking, by Nadezda Kirillova et al.

Summary of Binder: Hierarchical Concept Representation Through Order Embedding Of Binary Vectors, by Croix Gyurek and Niloy Talukder and Mohammad Al Hasan

Related Posts