Loading Now

Summary of A Pseudo-semantic Loss For Autoregressive Models with Logical Constraints, by Kareem Ahmed et al.


A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints

by Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck

First submitted to arxiv on: 6 Dec 2023

Categories

  • Main: Machine Learning (cs.LG)
  • Secondary: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The proposed approach in this paper bridges the gap between symbolic and neural learning by optimizing the likelihood of a symbolic constraint under an autoregressive output distribution. Traditional methods assume fully-factorized distributions, limiting their applicability to expressive autoregressive models like transformers. To address this, the authors introduce a pseudolikelihood-based approximation centered around a model sample, which is factorized and allows for the reuse of solutions to sub-problems. This approach exhibits low entropy and KL-divergence around the model sample, improving the prediction of logically-consistent outputs in tasks such as Sudoku and shortest-path prediction. Additionally, the authors demonstrate state-of-the-art detoxification results on large language models by steering their outputs away from toxic generations.
Low GrooveSquid.com (original content) Low Difficulty Summary
This paper introduces a new approach to combining symbolic and neural learning, which can be used to predict logical outcomes in problems like Sudoku or shortest-path prediction. The method works by optimizing the likelihood of certain rules under an output distribution that is similar to how people think. This helps the model generate more realistic and sensible outputs. The authors tested their approach on some example tasks and found it was very effective.

Keywords

* Artificial intelligence  * Autoregressive  * Likelihood