Loading Now

Summary of Comparing Bottom-up and Top-down Steering Approaches on In-context Learning Tasks, by Madeline Brumley et al.


Comparing Bottom-Up and Top-Down Steering Approaches on In-Context Learning Tasks

by Madeline Brumley, Joe Kwon, David Krueger, Dmitrii Krasheninnikov, Usman Anwar

First submitted to arxiv on: 11 Nov 2024

Categories

  • Main: Machine Learning (cs.LG)
  • Secondary: None

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
This paper presents a case study comparing two distinct approaches to interpretability in large language models (LLMs): “bottom-up” function vectors (FV) and “top-down” in-context vectors (ICV). The authors evaluate the effectiveness of these methods for steering LLMs toward desired behaviors, highlighting their strengths and limitations. Specifically, they find that ICVs outperform FVs in behavioral shifting tasks, while FVs excel in precision-based tasks.
Low GrooveSquid.com (original content) Low Difficulty Summary
This study compares two ways to make large language models behave in a certain way. The researchers test two methods: function vectors (FV) which are like building blocks for the model, and in-context vectors (ICV) which are more about how the model is used. They find that one method works better than the other depending on what task they’re doing. For example, one method is good at making the model change its behavior suddenly, while the other method is better at making it be very precise.

Keywords

* Artificial intelligence  * Precision