Loading Now

Summary of A Data Balancing Approach Towards Design Of An Expert System For Heart Disease Prediction, by Rahul Karmakar et al.


A data balancing approach towards design of an expert system for Heart Disease Prediction

by Rahul Karmakar, Udita Ghosh, Arpita Pal, Sattwiki Dey, Debraj Malik, Priyabrata Sain

First submitted to arxiv on: 26 Jul 2024

Categories

  • Main: Machine Learning (cs.LG)
  • Secondary: None

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The proposed study leverages machine learning techniques to enhance early detection and precise prediction of cardiac diseases. Researchers employed five ML methods – Decision Tree (DT), Random Forest (RF), Linear Discriminant Analysis, Extra Tree Classifier, and AdaBoost – to analyze the “Heart disease health indicators” dataset. To optimize model performance, various feature selection techniques were applied, including Sequential Forward FS, Sequential Backward FS, Correlation Matrix, and Chi2. Additionally, K means SMOTE oversampling was employed to enable further analysis. The findings indicate that ensemble approaches, particularly random forests, outperformed individual classifiers in predicting heart disease. Key predictors included smoking, blood pressure, cholesterol, and physical inactivity. Notably, the Random Forest and Decision Tree models achieved an accuracy of 99.83%. This study demonstrates the potential of machine learning models to improve heart disease prediction, particularly when incorporating ensemble methodologies.
Low GrooveSquid.com (original content) Low Difficulty Summary
Researchers used machine learning to help predict heart disease earlier and more accurately. They tested five different models on a big dataset about heart health. To make the models better, they tried four different ways to choose which features were most important. Then, they used something called oversampling to get even more accurate results. The best model was a type of forest that combined lots of smaller models together. It did really well at predicting heart disease and found out that smoking, high blood pressure, high cholesterol, and not being physically active were all big risk factors.

Keywords

» Artificial intelligence  » Decision tree  » Feature selection  » K means  » Machine learning  » Random forest