Loading Now

Summary of Empirical Influence Functions to Understand the Logic Of Fine-tuning, by Jordan K. Matelsky et al.


Empirical influence functions to understand the logic of fine-tuning

by Jordan K. Matelsky, Lyle Ungar, Konrad P. Kording

First submitted to arxiv on: 1 Jun 2024

Categories

  • Main: Machine Learning (cs.LG)
  • Secondary: Artificial Intelligence (cs.AI)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
A novel approach is proposed to understand the process of learning in neural networks, enabling improved performance and behavior interpretation. By analyzing the influence of fine-tuning on a new training sample, the paper identifies desiderata for such influences, including semantic distance, sparseness, noise invariance, transitive causality, and logical consistency. Empirical influence measured using fine-tuning is used to demonstrate how individual training samples affect outputs, revealing violations of these desiderata for both simple convolutional networks and modern Large Language Models (LLMs). The study also shows that prompting can partially rescue this failure. The paper presents a practical method for quantifying neural network learning from fine-tuning stimuli, suggesting popular models lack generalization capabilities and logical reasoning abilities.
Low GrooveSquid.com (original content) Low Difficulty Summary
Neural networks are super smart computers that learn from data, but we don’t fully understand how they do it. This research tries to figure out what happens when these networks learn from new information. The scientists want to know if the network’s output changes in a way that makes sense and is consistent with our understanding of language and logic. They found that most popular models can’t even perform simple logical tasks, which is surprising! The researchers also showed that by using special prompts, they could partially fix this problem. This study helps us understand how neural networks learn and what they’re capable of.

Keywords

» Artificial intelligence  » Fine tuning  » Generalization  » Neural network  » Prompting