Loading Now

Summary of Automated Neural Patent Landscaping in the Small Data Regime, by Tisa Islam Erana and Mark A. Finlayson


Automated Neural Patent Landscaping in the Small Data Regime

by Tisa Islam Erana, Mark A. Finlayson

First submitted to arxiv on: 10 Jul 2024

Categories

  • Main: Computation and Language (cs.CL)
  • Secondary: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
This paper presents an automated neural patent landscaping system that improves performance on difficult examples while requiring less training data. The system demonstrates significant advancements over previous methods, achieving 0.69 F1 score on ‘hard’ examples and 0.75 F1 score with as few as 24 labeled examples. The authors also introduce a higher-quality training data generation procedure by merging the “seed/anti-seed” approach with active learning to collect difficult labeled examples near the decision boundary. This new dataset of labeled AI patents is released for others to build upon, along with baseline comparisons and code.
Low GrooveSquid.com (original content) Low Difficulty Summary
The paper describes an automated way to create a map of all patents related to a specific technology area. This is important because it helps us understand intellectual property better. The system is faster and more accurate than previous methods and only needs a small amount of labeled examples to train. It also generates better training data by combining two existing approaches with active learning. This new approach is released for others to use.

Keywords

» Artificial intelligence  » Active learning  » F1 score