Loading Now

Summary of Interpretable Solutions For Breast Cancer Diagnosis with Grammatical Evolution and Data Augmentation, by Yumnah Hasan et al.


Interpretable Solutions for Breast Cancer Diagnosis with Grammatical Evolution and Data Augmentation

by Yumnah Hasan, Allan de Lima, Fatemeh Amerehi, Darian Reyes Fernandez de Bulnes, Patrick Healy, Conor Ryan

First submitted to arxiv on: 25 Jan 2024

Categories

  • Main: Machine Learning (cs.LG)
  • Secondary: Neural and Evolutionary Computing (cs.NE)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The paper presents a novel approach to medical imaging diagnosis using Machine Learning (ML) models that are inherently understandable and address issues with imbalanced datasets. The authors utilize a synthetic data generation technique called STEM, which combines Synthetic Minority Oversampling Technique (SMOTE), Edited Nearest Neighbour (ENN), and Mixup, to produce data for training Grammatical Evolution (GE) models. These models demonstrate better Area Under the Curve (AUC) results compared to standard ML classifiers while maintaining interpretability. The technique is tested on the Digital Database for Screening Mammography (DDSM) and Wisconsin Breast Cancer (WBC) datasets.
Low GrooveSquid.com (original content) Low Difficulty Summary
This paper helps doctors make better decisions about medical images by using special machine learning models that are easy to understand. These models can learn from small amounts of training data, which is important because many medical imaging datasets are very imbalanced. The researchers use a new way to create synthetic data called STEM, which combines different techniques to make the data more balanced and realistic. They show that these new models work better than other machine learning models on two important medical image datasets.

Keywords

* Artificial intelligence  * Auc  * Machine learning  * Synthetic data