Loading Now

Summary of Scanner: Knowledge-enhanced Approach For Robust Multi-modal Named Entity Recognition Of Unseen Entities, by Hyunjong Ok et al.


SCANNER: Knowledge-Enhanced Approach for Robust Multi-modal Named Entity Recognition of Unseen Entities

by Hyunjong Ok, Taeho Kil, Sukmin Seo, Jaeho Lee

First submitted to arxiv on: 2 Apr 2024

Categories

  • Main: Computation and Language (cs.CL)
  • Secondary: Artificial Intelligence (cs.AI)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The paper proposes SCANNER, a two-stage model for named entity recognition (NER) that can handle multiple variants, including multi-modal NER and grounded MNER. The model extracts entity candidates in the first stage and uses them as queries to retrieve knowledge from various sources. This approach allows it to generalize to unseen entities and address noisy annotations in training data. Additionally, the paper introduces a novel self-distillation method that enhances the robustness and accuracy of the model. SCANNER demonstrates competitive performance on NER benchmarks and surpasses existing methods on multi-modal and grounded MNER benchmarks.
Low GrooveSquid.com (original content) Low Difficulty Summary
The paper is about a new way to find named entities in text and images. It’s like a detective trying to figure out who or what is being talked about. The new method, called SCANNER, can handle different types of information, like words, pictures, and sounds. It does this by looking at possible entity candidates and then using that information to learn more about the entities. This helps it find entities it hasn’t seen before, even if the training data has mistakes. The new approach works well on tests and is better than other methods for certain types of recognition.

Keywords

» Artificial intelligence  » Distillation  » Multi modal  » Named entity recognition  » Ner