Summary of A Data Balancing Approach Towards Design Of An Expert System For Heart Disease Prediction, by Rahul Karmakar et al.

A data balancing approach towards design of an expert system for Heart Disease Prediction

by Rahul Karmakar, Udita Ghosh, Arpita Pal, Sattwiki Dey, Debraj Malik, Priyabrata Sain

First submitted to arxiv on: 26 Jul 2024

GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty	Written by	Summary
High	Paper authors	High Difficulty Summary Read the original abstract here
Medium	GrooveSquid.com (original content)	Medium Difficulty Summary The proposed study leverages machine learning techniques to enhance early detection and precise prediction of cardiac diseases. Researchers employed five ML methods – Decision Tree (DT), Random Forest (RF), Linear Discriminant Analysis, Extra Tree Classifier, and AdaBoost – to analyze the “Heart disease health indicators” dataset. To optimize model performance, various feature selection techniques were applied, including Sequential Forward FS, Sequential Backward FS, Correlation Matrix, and Chi2. Additionally, K means SMOTE oversampling was employed to enable further analysis. The findings indicate that ensemble approaches, particularly random forests, outperformed individual classifiers in predicting heart disease. Key predictors included smoking, blood pressure, cholesterol, and physical inactivity. Notably, the Random Forest and Decision Tree models achieved an accuracy of 99.83%. This study demonstrates the potential of machine learning models to improve heart disease prediction, particularly when incorporating ensemble methodologies.
Low	GrooveSquid.com (original content)	Low Difficulty Summary Researchers used machine learning to help predict heart disease earlier and more accurately. They tested five different models on a big dataset about heart health. To make the models better, they tried four different ways to choose which features were most important. Then, they used something called oversampling to get even more accurate results. The best model was a type of forest that combined lots of smaller models together. It did really well at predicting heart disease and found out that smoking, high blood pressure, high cholesterol, and not being physically active were all big risk factors.

Keywords

* Artificial intelligence * Decision tree * Feature selection * K means * Machine learning * Random forest

A data balancing approach towards design of an expert system for Heart Disease Prediction

by Rahul Karmakar, Udita Ghosh, Arpita Pal, Sattwiki Dey, Debraj Malik, Priyabrata Sain

Categories

GrooveSquid.com Paper Summaries

Keywords

Summary of Practical Attribution Guidance For Rashomon Sets, by Sichao Li et al.

Summary of Contrastive Learning Of Asset Embeddings From Financial Time Series, by Rian Dolphin et al.

Related Posts