Summary of Empirical Influence Functions to Understand the Logic Of Fine-tuning, by Jordan K. Matelsky et al.

Empirical influence functions to understand the logic of fine-tuning

by Jordan K. Matelsky, Lyle Ungar, Konrad P. Kording

First submitted to arxiv on: 1 Jun 2024

GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty	Written by	Summary
High	Paper authors	High Difficulty Summary Read the original abstract here
Medium	GrooveSquid.com (original content)	Medium Difficulty Summary A novel approach is proposed to understand the process of learning in neural networks, enabling improved performance and behavior interpretation. By analyzing the influence of fine-tuning on a new training sample, the paper identifies desiderata for such influences, including semantic distance, sparseness, noise invariance, transitive causality, and logical consistency. Empirical influence measured using fine-tuning is used to demonstrate how individual training samples affect outputs, revealing violations of these desiderata for both simple convolutional networks and modern Large Language Models (LLMs). The study also shows that prompting can partially rescue this failure. The paper presents a practical method for quantifying neural network learning from fine-tuning stimuli, suggesting popular models lack generalization capabilities and logical reasoning abilities.
Low	GrooveSquid.com (original content)	Low Difficulty Summary Neural networks are super smart computers that learn from data, but we don’t fully understand how they do it. This research tries to figure out what happens when these networks learn from new information. The scientists want to know if the network’s output changes in a way that makes sense and is consistent with our understanding of language and logic. They found that most popular models can’t even perform simple logical tasks, which is surprising! The researchers also showed that by using special prompts, they could partially fix this problem. This study helps us understand how neural networks learn and what they’re capable of.

Keywords

» Artificial intelligence » Fine tuning » Generalization » Neural network » Prompting

Empirical influence functions to understand the logic of fine-tuning

by Jordan K. Matelsky, Lyle Ungar, Konrad P. Kording

Categories

GrooveSquid.com Paper Summaries

Keywords

Summary of Stochastic Resetting Mitigates Latent Gradient Bias Of Sgd From Label Noise, by Youngkyoung Bae et al.

Summary of Learning Discrete Concepts in Latent Hierarchical Models, by Lingjing Kong et al.

Related Posts