Summary of Interpretable Solutions For Breast Cancer Diagnosis with Grammatical Evolution and Data Augmentation, by Yumnah Hasan et al.

Interpretable Solutions for Breast Cancer Diagnosis with Grammatical Evolution and Data Augmentation

by Yumnah Hasan, Allan de Lima, Fatemeh Amerehi, Darian Reyes Fernandez de Bulnes, Patrick Healy, Conor Ryan

First submitted to arxiv on: 25 Jan 2024

GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty	Written by	Summary
High	Paper authors	High Difficulty Summary Read the original abstract here
Medium	GrooveSquid.com (original content)	Medium Difficulty Summary The paper presents a novel approach to medical imaging diagnosis using Machine Learning (ML) models that are inherently understandable and address issues with imbalanced datasets. The authors utilize a synthetic data generation technique called STEM, which combines Synthetic Minority Oversampling Technique (SMOTE), Edited Nearest Neighbour (ENN), and Mixup, to produce data for training Grammatical Evolution (GE) models. These models demonstrate better Area Under the Curve (AUC) results compared to standard ML classifiers while maintaining interpretability. The technique is tested on the Digital Database for Screening Mammography (DDSM) and Wisconsin Breast Cancer (WBC) datasets.
Low	GrooveSquid.com (original content)	Low Difficulty Summary This paper helps doctors make better decisions about medical images by using special machine learning models that are easy to understand. These models can learn from small amounts of training data, which is important because many medical imaging datasets are very imbalanced. The researchers use a new way to create synthetic data called STEM, which combines different techniques to make the data more balanced and realistic. They show that these new models work better than other machine learning models on two important medical image datasets.

Keywords

* Artificial intelligence * Auc * Machine learning * Synthetic data

Interpretable Solutions for Breast Cancer Diagnosis with Grammatical Evolution and Data Augmentation

by Yumnah Hasan, Allan de Lima, Fatemeh Amerehi, Darian Reyes Fernandez de Bulnes, Patrick Healy, Conor Ryan

Categories

GrooveSquid.com Paper Summaries

Keywords

Summary of Enhanced Labeling Technique For Reddit Text and Fine-tuned Longformer Models For Classifying Depression Severity in English and Luganda, by Richard Kimera et al.

Summary of Producing Plankton Classifiers That Are Robust to Dataset Shift, by Cheng Chen et al.

Related Posts