Phase 1 Project Report
Phase 1 Project Report
Phase 1 Project Report
Submitted by
PAVITHRA S 73772254101
of
M.TECH
in
DATA SCIENCE
NOVEMBER 2023
ii
BONAFIDE CERTIFICATE
SIGNATURE SIGNATURE
Dr.R.POONKUZHALI, M.E., Ph.D., Dr.R.POONKUZHALI, M.E., Ph.D.,
HEAD OF THE DEPARTMENT SUPERVISOR
Professsor Professsor
Department of Information Technology Department of Information Technology
K.S. Rangasamy College of Technology K.S. Rangasamy College of Technology
Tiruchengode - 637 215 Tiruchengode - 637 215
DECLARATION
Signature
____________________
PAVITHRA S
Place: Tiruchengode
Date:
iv
ACKNOWLEDGEMENT
We are highly indebted to provide our heart full thanks to our supervisor
Dr.R.POONKUZHALI, M.E., Ph.D., for her valuable ideas, encouragement and
supportive guidance throughout the project.
We wish to extend our sincere thanks to all faculty members of our Data
Science Department for their valuable suggestions, kind co-operation and constant
encouragement for successful completion of this project.
ABSTRACT
In the realm of predictive healthcare, the quest for accurate and timely
identification of potential cardiac issues has become paramount. Heart disease, a leading
global health concern, necessitates innovative solutions for early detection and proactive
intervention. This project undertakes a pioneering initiative in the domain of heart disease
prediction, employing advanced machine learning algorithms, specifically the Warm and
Naive Bayes models. This project presents a comprehensive approach to heart disease
prediction utilizing machine learning algorithms, Warm and Naive Bayes (NB).
Leveraging a diverse dataset encompassing essential health parameters, including age,
gender, blood pressure, and cholesterol levels, the study addresses data preprocessing
challenges such as missing values and outliers. The models are trained on this refined
dataset, and their predictive performance is rigorously evaluated using standard metrics.
The investigation includes an in-depth comparison of the Warm and Naive Bayes (NB),
shedding light on their respective strengths and limitations in the context of heart disease
prediction.
vi
TABLE OF CONTENT
ABSTRACT v
LIST OF TABLES vii
LIST OF FIGURES viii
LIST OF ABBREVIATIONS ix
1 INTRODUCTION 1
1.1 HEART DISEASE 1
1.2 MACHINE LEARNING 2
1.3 PREDICTIVE MODELING 3
1.4 MEDICAL HISTORY 3
1.5 OBJECTIVES 4
2 LITERATURE SURVEY 5
Summary Of Literature Survey 16
3 SYSTEM ANALYSIS 26
3.1 EXISTING SYSTEM 26
3.1.1 DRAWBACKS 26
3.2 PROPOSED SYSTEM 27
3.2.1 ADVANTAGES 27
3.3 FEASIBILITY STUDY 28
3.3.1 TECHNICAL FEASIBILITY 28
3.3.2 OPERATIONAL FEASIBILITY 29
3.3.3 ECONOMICAL FEASIBILITY 29
4 SYSTEM DESIGN 30
4.1 PROBLEM DEFINITION 30
4.2 MODULE DESCRIPTION 30
4.2.1 LOAD DATA 30
4.2.2 DATA PREPROCESSING 30
4.3 SYSTEM FLOW DIAGRAM 21
4.4 INPUT DESIGN 21
4.5 OUTPUT DESIGN 22
ENHANCEMENT
REFERENCES 23
LIST OF TABLES
LIST OF FIGURES
FIGURE NO FIGURE NAME PAGE NO.
1 Heart Disease 2
2 Machine Learning 4
3 System Flow Diagram 20
ix
ABBREVATIONS
NB - Naive Bayes
SVM - Support Vector Machines
IOT - Internet of Things
HRFLM - Hybrid Random Forest with Linear Model
CVDs - cardiovascular diseases
LASSO - Least Absolute Shrinkage and Selection Operator
DTBM - Decision Tree Bagging Method
KNN - K-nearest neighbours
HD - Heart Disease
ML - Machine learning
1
CHAPTER 1
INTRODUCTION
machine learning. By uncovering hidden patterns and correlations within diverse datasets,
machine learning models can offer a more nuanced understanding of an individual's risk,
paving the way for personalized interventions and ultimately contributing to a paradigm
shift in the prevention and management of heart disease. This intersection of cutting-edge
technology and cardiovascular health heralds a new era in healthcare, where data-driven
insights may hold the key to reducing the global burden of heart-related ailments.
but also holds the potential to revolutionize industries, driving efficiency, and uncovering
insights that were once elusive through conventional computational methods.
Medical history, a narrative intricately woven from the threads of a patient's past
health experiences, stands as a foundational pillar in healthcare. It serves as a
comprehensive record, chronicling the nuances of an individual's health journey—ranging
4
from ailments and treatments to lifestyle choices and genetic predispositions. This
historical tapestry is a crucial roadmap for healthcare professionals, offering invaluable
insights into the patient's health trajectory. By delving into medical history, clinicians can
decipher patterns, identify risk factors, and make informed decisions about diagnosis,
treatment, and preventive measures. In essence, a thorough understanding of medical
history not only informs current medical practices but also facilitates a personalized and
holistic approach to patient care. As healthcare continues to advance, the significance of
mining the rich repository of a patient's medical history becomes increasingly pronounced,
underscoring its role as an indispensable tool in the pursuit of optimal health outcomes.
1.5 OBJECTIVES
1. To develop a novel feature selection algorithm, Warm, that identifies the most
relevant features for heart disease prediction.
CHAPTER 2
LITERATURE REVIEW
Sean C The integration of machine learning into the healthcare sector holds
immense promise, particularly in the early detection and prediction of various medical
conditions. In the context of heart health, the application of machine learning algorithms
becomes particularly crucial. The ability to predict potential heart conditions in advance
provides a valuable advantage in offering timely interventions and personalized treatment
strategies. In this research project, the focus is on evaluating and comparing the
performance of diverse machine learning classifiers for predicting heart conditions. The
classifiers under scrutiny include Decision Tree, Naive Bayes, Logistic Regression,
Support Vector Machines (SVM), and Random Forest. Each of these classifiers brings its
unique strengths and characteristics to the table, and a comparative analysis is essential to
identify which one excels in the specific context of heart health prediction. Furthermore,
the project introduces an innovative approach by proposing an ensemble classifier. This
classifier goes beyond the conventional single-model approach by combining the strengths
of both strong and weak classifiers. The rationale behind this hybrid classification strategy
is rooted in the capacity to harness a large number of training and validation samples
effectively. The ensemble classifier aims to enhance the overall predictive accuracy and
robustness of the model, providing a more reliable tool for early identification of potential
heart conditions. Machine learning, with its ability to analyse vast datasets and discern
complex patterns, empowers healthcare professionals with valuable insights. By leveraging
these algorithms, doctors can receive early warnings about locomotor abnormalities,
cardiac issues, and other health conditions. This proactive approach enables physicians to
tailor their diagnostic and treatment approaches on a per-patient basis, optimizing the
delivery of healthcare services. As the project unfolds, it not only contributes to the
advancement of predictive modelling in the healthcare domain but also underscores the
importance of a holistic evaluation of various machine learning techniques. The findings
from this research have the potential to significantly impact clinical practices, paving the
way for more accurate and timely detection of heart conditions, ultimately improving
patient outcomes and quality of care.
6
prevention of cardiovascular diseases, ultimately saving lives and improving the overall
well-being of individuals worldwide.
Shu Jiang Diving deeper into the global impact of cardiovascular diseases (CVDs),
it is evident that this health crisis affects a staggering number of individuals, making it the
leading cause of death worldwide, surpassing all other causes. The World Health
Organization reports that in 2016 alone, an estimated 17.9 million people succumbed to
CVDs, constituting 31% of all global deaths. The majority of these deaths—85%—resulted
from heart attacks and strokes. This grim reality not only places an immense emotional
burden on affected families but also poses a substantial financial challenge, given the high
mortality rates and the considerable costs associated with cardiovascular surgeries. The
situation is particularly dire in economically disadvantaged regions, where heart disease
emerges as a significant and sometimes insurmountable threat. Therefore, the imperative to
analyse the intricate relationships between various human attributes and the likelihood of
developing heart disease becomes paramount. Developing a robust predictive model
becomes not just a statistical endeavour but a critical tool in anticipating and preventing
heart-related issues. In this context, the application of machine learning emerges as a
powerful ally in the fight against cardiovascular diseases. Machine learning, intricately
linked with computational statistics, harnesses mathematical optimization to provide
methods and theories that address real-world problems in medicine, industry, social
sciences, and business domains. The versatility of machine learning is reflected in its two
broad categories: supervised learning and unsupervised learning. For the specific goal of
predicting heart disease based on physiological attributes, supervised learning becomes the
natural choice. In supervised learning, the algorithm constructs a mathematical model
using a dataset that includes both inputs (attributes) and desired outputs (labels). This
aligns seamlessly with the objective of predicting the likelihood of individuals having heart
disease based on specific physical characteristics. This thesis takes a methodical approach
by employing four distinct models within the realm of supervised learning: logistic
regression, random forest, extreme gradient boosting, and neural networks. Each of these
models brings its unique strengths to the table, offering diverse perspectives on the
complex interplay between various attributes and the probability of developing heart
8
disease. The utilization of a combination of these models enhances the predictive accuracy
and robustness of the analysis, providing a nuanced understanding of the factors
contributing to heart disease susceptibility. As the research unfolds, it not only contributes
to the growing body of knowledge on predictive modelling in healthcare but also holds
promise in guiding preventive measures and interventions. By understanding and
harnessing the power of machine learning, this thesis aims to pave the way for more
effective strategies in identifying and supporting individuals at risk of heart disease,
ultimately contributing to a global effort to reduce the staggering mortality rates associated
with cardiovascular conditions.
traditional classifiers with bagging and boosting methods during the training process. This
amalgamation aims to capitalize on the strengths of both approaches, enhancing the overall
predictive power of the model. To rigorously evaluate the performance of the proposed
model, a range of machine learning algorithms is employed to calculate key metrics such
as Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE), F1 Score (F1),
Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate
(FNR). The comprehensive analysis of these metrics allows for a nuanced understanding of
the model's strengths and areas for potential improvement. The culmination of these efforts
and analyses reveals that the proposed model, particularly when employing the Random
Forest Bagging Method (RFBM) in conjunction with Relief feature selection, achieves an
impressive accuracy of 99.05%. This result underscores the efficacy of the hybrid
classifiers and the meticulous approach to feature selection, suggesting the potential of this
model as a robust tool for predicting heart disease with a high level of precision and
reliability.
Harshit Jindal in the face of the escalating incidence of heart diseases, the
imperative to predict and diagnose these conditions in advance has become increasingly
crucial. The complexity of this diagnostic task demands precision and efficiency,
necessitating the exploration of innovative approaches. The research paper under
discussion addresses precisely this challenge—determining which patients are more likely
to be afflicted by heart disease based on a myriad of medical attributes. To tackle this
formidable task, the researchers have developed a heart disease prediction system that
leverages the rich medical history of patients. This system aims to predict whether a given
patient is prone to heart disease, thereby providing a proactive means of intervention. The
utilization of various machine learning algorithms, such as logistic regression and K-
nearest neighbours (KNN), underscores the versatility of modern computational techniques
in the realm of healthcare diagnostics. One notable feature of the research is its focus on
enhancing the accuracy of heart disease predictions. The authors have employed a
thoughtful approach to fine-tuning the model, seeking to optimize its performance and
reliability. The combination of KNN and logistic regression as predictive tools yielded
satisfactory results, showcasing an improvement in accuracy compared to previously
employed classifiers like naive Bayes. This augmentation in predictive accuracy is
10
such as Random Forest, Support Vector Machine (SVM), Naive Bayes, and Decision Tree,
the study aims to develop a robust predictive model. An intriguing aspect of this research is
the exploration of correlations between various attributes within the dataset. By employing
standard machine learning methods, the study systematically identifies and analyses these
correlations, seeking to harness them effectively in predicting the likelihood of heart
disease. This nuanced approach goes beyond mere model development, providing a deeper
understanding of the interplay between different parameters. Results from the research
underscore the efficiency of Random Forest compared to other machine learning
techniques, showcasing superior accuracy within a shorter timeframe. The findings suggest
that Random Forest stands out as a promising choice for heart disease prediction, offering
a balance between precision and computational efficiency. This insight is invaluable for
medical practitioners, as it not only enhances predictive accuracy but also optimizes the
time required for decision-making. The implications of this model extend beyond research
realms, with potential applications in real-world medical settings. Positioning the
developed model as a decision support system for medical practitioners, it offers a valuable
tool in clinics for assessing the likelihood of heart disease in patients. As technology
continues to reshape healthcare landscapes, endeavours like this contribute not only to
predictive analytics but also to the practical integration of machine learning in improving
patient outcomes.
Leveraging an existing dataset from the Cleveland database of the UCI repository, which
encompasses information on 303 instances and 76 attributes related to heart disease
patients, the study strategically narrows its focus to 14 key attributes for testing. This
selective approach ensures a thorough evaluation of the performance of different
algorithms. The primary objective of the research is to envisage the probability of
developing heart disease in patients, a task of paramount importance in preventive
healthcare. By employing various supervised learning algorithms, the study aims to discern
patterns and relationships within the dataset that can contribute to accurate predictions.
This nuanced exploration allows for a comprehensive understanding of which algorithm
performs optimally in predicting the likelihood of heart disease. In the course of the
research, the results highlight that the K-Nearest Neighbours (KNN) algorithm achieves
the highest accuracy score. This outcome underscores the efficacy of KNN in this specific
context and suggests its potential as a preferred model for heart disease prediction. As the
findings contribute to the growing body of knowledge in predictive analytics for heart
diseases, the research provides valuable insights into which machine learning algorithms
may offer the most reliable predictions. The implications extend beyond the research
domain, offering potential applications in clinical settings for early intervention and
personalized healthcare strategies.
The wealth of stored information becomes a valuable resource for predicting the likelihood
of future diseases, especially those related to the heart. Several data mining and machine
learning techniques have been employed to predict heart diseases, and this research project
delves into the application of Artificial Neural Network (ANN), Random Forest, and
Support Vector Machine (SVM). The significance of these predictive models is
underscored by the challenging landscape faced by healthcare professionals in India and
worldwide. The urgency to reduce the scale of deaths attributed to heart diseases
necessitates the discovery of quick and efficient detection techniques, and machine
learning proves to be a pivotal tool in this quest. The researchers are actively engaged in
accelerating their efforts to develop software that utilizes machine learning algorithms.
This software aims to support doctors in both predicting and diagnosing heart diseases. The
main objective of this research project is to harness the power of machine learning to
enhance the accuracy and efficiency of heart disease prediction, ultimately contributing to
more effective healthcare interventions and improved patient outcomes. As advancements
in technology continue to reshape the healthcare landscape, the fusion of machine learning
and medical expertise holds tremendous promise in addressing the global burden of
cardiovascular diseases.
Amin Ul Haq The early detection of Heart Disease (HD) is paramount for effective
treatment and recovery. Machine learning (ML) models have proven to be instrumental in
this regard, providing a powerful tool for physicians to identify and classify individuals
with heart-related issues. This study contributes to the field by proposing an identification
system that employs ML models to distinguish between individuals with heart disease and
healthy subjects. A notable feature of this research lies in the use of Sequential Backward
Selection (SBS) algorithm for feature selection. This algorithm systematically identifies
and selects the most relevant features, contributing to increased classification accuracy and
a reduction in computational time for the predictive system. The choice of features is a
critical aspect of ML models, and the SBS algorithm enhances the efficiency of the
classification process. The Cleveland heart disease dataset serves as the basis for
evaluating the proposed system. The dataset is partitioned, with 70% allocated for training
15
the model and the remaining 30% for validation. This division ensures a robust evaluation
of the system's performance on unseen data. The study's outcomes are measured using
evaluation metrics, shedding light on the effectiveness of the proposed identification
system. The experimental results demonstrate that the Sequential Backward Selection
algorithm adeptly selects relevant features. These selected features, in turn, contribute to an
enhanced accuracy when employing the K-Nearest Neighbour supervised machine learning
classifier. The promising accuracy achieved in this study suggests that the proposed model
holds potential for effectively identifying individuals with heart disease and distinguishing
them from healthy subjects. The incorporation of Sequential Backward Selection for
feature selection is a key contributor to the success of the system, showcasing the
significance of thoughtful feature engineering in ML applications for medical diagnostics.
As the field of ML continues to evolve, such studies contribute not only to the
advancement of predictive modelling in healthcare but also to the development of more
efficient and accurate diagnostic tools. The proposed model, with its emphasis on feature
selection and classification accuracy, stands as a valuable addition to the ongoing efforts to
enhance the early detection of heart disease, ultimately improving patient outcomes.
TITLE OF HARDWARE/
S.NO TITLE OF YEAR OBJECTIVE PROS CONS REMARKS
THE SOFTWARE
THE PAPER OF THE
JOURNAL DETAILS
PAPER
1. Dependenc
HEART Journal of 2023 Predicting Machine Early Utilizing a
e on the
DISEASE Medical Heart Learning detection combination
quality and
PREDICTIO Informatics Conditions Algorithms of heart of classifiers
quantity of
N USING using ML (Decision conditions through an
training
MACHINE Algorithms Tree, Naive through ensemble
data
LEARNING Bayes, predictive approach can
Logistic analytics enhance
Regression, prediction
SVM, accuracy and
Random robustness
Forest),
Ensemble
2.1 SUMMARY OF THE LITERATURE SURVEY
Classifier
16
TITLE OF TITLE OF HARDWARE/
S.NO YEAR OBJECTIVE OF PROS CONS REMARKS
THE
THE SOFTWARE
THE PAPER
PAPER JOURNAL DETAILS
2. Enhanced
Effective Heart Health 2020 Improve accuracy Hybrid RF Limited Successful
performance
Disease Prediction Informatics in CVD prediction with Linear informatio application
(88.7%
using Hybrid Model n on in CVD
accuracy)
Machine Learning hardware/s prediction
Techniques oftware
17
TITLE OF HARDWARE/
S.NO TITLE OF YEAR OBJECTIVE OF PROS CONS REMARKS
THE SOFTWARE
THE THE PAPER
JOURNAL DETAILS
PAPER
6. Enhancing Journal of 2022 Improve accuracy in GridsearchC High Soft Novel use of
Heart Medicine early identification of V, Five-fold accuracy voting GridSearchC
Disease cardiovascular issues Cross- achieved in ensemble V,
Prediction using six ML validation Cleveland further evaluation
Accuracy algorithms (Random dataset with improved with
through Forest, K-NN, Logistic accuracy accuracy and
Machine Logistic Regression, Regression to 93.44% negative log
Learning Naïve Bayes, (90.16%) (Cleveland loss metrics,
Techniques Gradient Boosting, and in IEEE ) and 95% and
and AdaBoost) and Dataport (IEEE comparison
Optimizatio ensemble classifier. dataset with Dataport). to existing
n AdaBoost studies.
(90%).
21
TITLE OF HARDWARE/
S.NO TITLE OF YEAR OBJECTIVE OF PROS CONS REMARKS
THE SOFTWARE
THE THE PAPER
JOURNAL DETAILS
PAPER
9. Heart Journal of 2023 To evaluate the Python with High Requires a Novel
Disease Medical performance of scikit-learn accuracy in large approach in
Prediction Informatics Artificial Neural library predicting amount of leveraging
Network (ANN) heart labeled data machine
diseases learning for
prediction
24
TITLE OF HARDWARE/
S.NO TITLE OF YEAR OBJECTIVE PROS CONS REMARKS
THE SOFTWARE
THE OF THE
JOURNAL DETAILS
PAPER PAPER
10. Heart Journal of 2020 To review and Implemented Provides a Lacks Useful for
Disease Medical Systems compare various in Python comprehensi specific gaining
Prediction ML techniques using scikit- ve overview hardware/so insights into
System for heart disease learn library of existing ftware the variety of
Using Model prediction ML details ML
Of Machine approaches. techniques
Learning and - Highlights used in this
Sequential the strengths domain.
Backward and
Selection weaknesses
Algorithm of each
for Features technique.
Selection
25
26
CHAPTER 3
SYSTEM ANALYSIS
Predicting cardiac disease is considered one of the most challenging tasks in the
medical field. It takes a lot of time and effort to figure out what’s causing this, especially
for doctors and other medical experts. In this paper, various Machine Learning algorithms
such as LR, KNN, SVM, and GBC, together with the GridSearchCV, predict cardiac
disease. The system uses a 5-fold cross-validation technique for verification. A
comparative study is given for these four methodologies. The Datasets for both Cleveland,
Hungary, Switzerland, and Long Beach V and UCI Kaggle are used to analyse the models’
performance. It is found in the analysis that the Extreme Gradient Boosting Classifier with
GridSearchCV gives the highest and nearly comparable testing and training accuracies as
100% and 99.03% for both the datasets (Hungary, Switzerland & Long Beach V and UCI
Kaggle). Moreover, it is found in the analysis that XGBoost Classifier without
GridSearchCV gives the highest and nearly comparable testing and training accuracies as
98.05% and 100% for both the datasets (Hungary, Switzerland & Long Beach V and UCI
Kaggle). Furthermore, the analytical results of the proposed technique are compared with
previous heart disease prediction studies. It is evident that amongst the proposed approach,
the Extreme Gradient Boosting Classifier with GridSearchCV is producing the best hyper
parameter for testing accuracy. The primary aim of this paper is to develop a unique
model-creation technique for solving real-world problems.
3.1.1 DRAWBACKS
This is sensitive to outliers in the data. This means that outliers can have a large
impact on the model's predictions. To reduce the impact of outliers, it is important
to remove outliers from the data before training the model.
3.2.1 ADVANTAGES
1. Develop a heart disease prediction model using the Warm and Naive Bayes (NB).
2. Evaluate and compare the predictive performance of the algorithms based on key
metrics.
4. Explore optional hyper parameter tuning to optimize the models for enhanced
accuracy.
5. Emphasize ethical considerations in the use of health data throughout the system
development and deployment process.
28
Technical Feasibility
Operation Feasibility
Economical Feasibility
The technical issue usually raised during the feasibility stage of the investigation
includes the following:
bandwidth exists for providing a fast feedback to the users irrespective of the number of
users using the system.
Proposed projects are beneficial only if they can be turned out into information
system. That will meet the organization’s operating requirements. Operational feasibility
aspects of the project are to be taken as an important part of the project implementation.
Some of the important issues raised are to test the operational feasibility of a project
includes the following: -
The well-planned design would ensure the optimal utilization of the computer
resources and would help in the improvement of performance status.
A system can be developed technically and that will be used if installed must still
be a good investment for the organization. In the economic feasibility, the development
cost in creating the system is evaluated against the ultimate benefit derived from the new
systems. Financial benefits must equal or exceed the costs.
The system is economically feasible. It does not require any addition hardware or
software. Since the interface for this system is developed using the existing resources and
technologies available at NIC, there is nominal expenditure and economical feasibility for
certain.
30
CHAPTER 4
SYSTEM DESIGN
Accurately predicting cardiac disease is a critical challenge in the medical field, as it can
lead to timely interventions and improved patient outcomes. However, traditional methods
for predicting cardiac disease, such as relying solely on medical expertise and historical
data, can be time-consuming and may not capture the complex relationships between
various factors that contribute to heart disease. Therefore, there is a need for a more
efficient and accurate approach to predicting cardiac disease.
Designing the input for a heart disease prediction system involves defining the data that the
model will use to make predictions. Here are key aspects of the input design:
1. Feature Selection:
Identify and select relevant features for heart disease prediction, such as
age, gender, blood pressure, cholesterol levels, and other health indicators.
Specify the data types and formats for each selected feature (e.g., numerical,
categorical) to ensure compatibility with the algorithms.
4. Normalization or Standardization:
The output of the proposed system for heart disease prediction consists of the following:
CHAPTER 5
CONCLUSION
FUTURE ENHANCEMENT
Future work in this domain could explore the integration of advanced feature
engineering techniques and deep learning architectures to enhance the predictive
capabilities of heart disease models. Investigating the impact of additional health
parameters and incorporating real-time monitoring data could provide a more
comprehensive understanding of the dynamic nature of cardiovascular health. Furthermore,
efforts should be directed towards developing interpretable models to enhance
transparency and trust in the predictions, especially in critical healthcare decision-making
scenarios.
34
REFERENCES
[1]. K. Agarwal and t. Kumar, “heart disease prediction using machine learning,” in 2nd
international conference on intelligent computing and control systems(iciccs). Ieee,
2018.
[2]. S. Rajput and a. Arora, “effective heart disease prediction using hybrid machine
learning techniques,” international journal of computer applications, vol. 75, no. 10, pp.
6–12, 2013.
[3]. M. Mohamad and a. Selamat, “heart disease prediction using machine learning
algorithms,” in international conference on computer, communications, and control
technology (i4ct). Ieee, 2015, pp. 227–231.
[4]. J. Ramos et al., “efficient prediction of cardiovascular disease using machine learning
algorithms with relief and lasso feature selection techniques,” in proceedings of the
first instructional conference on machine learning, vol. 242. Piscataway, nj, 2003, pp.
133–142.
[6]. H. Kaur and s. Ajay, “enhancing heart disease prediction accuracy through machine
learning techniques and optimization,” next generation computing technologies(ngct),
pp. 516–521, 2016.
[7]. K. Toutanova and c. Cherry, “heart disease prediction using machine learning and svm
techniques,” in proceedings of the joint conference of the 47th annual meeting of the
acl and the 4th international joint conference on natural language processing of the
afnlp: volume 1- volume 1. Association for computational linguistics, 2009, pp. 486–
494.
[9]. T. Mikolov and g. Zweig, “cardiovascular diseases (cvds) using machine learning,” in
2012 ieee spoken language technology workshop (slt). Ieee, 2012, pp. 234–239.
[10]. Rizky, w. M., ristu, s., afrizal, d. “heart disease prediction system using model of
machine learning and sequential backward selection algorithm for features selection”.
Scientific journal of informatics, vol. 3(2), p. 41-50, nov. 2020.