Finance Project Proposal
Finance Project Proposal
Finance Project Proposal
PROJE
CARDIO VASCULAR DISEASE PREDICTION USING MACHINE LEARNING
CT
TITLE
DOMAIN HEALTH CARE
OR FIELD
TEAM
SHAISTA FATHIMA M (21BBTCS213) ,
MEMBE
RS SUPRIYA PARASAPPA JALAGERI (21BBTCS235)
I. Cardiovascular diseases (CVDs) are a group of disorders affecting the heart and blood
INTRODUCTI vessels, and are a leading cause of mortality and morbidity worldwide. Understanding the
ON TO factors that contribute to the development and progression of CVDs is crucial in order to
DOMAIN AND develop effective prevention and treatment strategies. In this proposal, we aim to explore
PROBLEM the use of machine learning techniques to predict the occurrence of cardiovascular disease
IDENTIFIED events, with the goal of improving early detection and intervention. Event prediction has
been the cornerstone of cardiovascular epidemiology as exemplified by the Framingham
study and other prospective studies that function as pillars for much of what comprises
current cardiovascular medicine. A fundamental goal of such efforts has been event
prediction over relatively long periods of time such as 10 years or a lifetime. These efforts
have allowed us to characterize subclinical disease processes and target key risk factors
for modification (eg, smoking cessation, statin therapy, blood pressure control).
Epidemiological studies used to derive such predictive models frequently contain
hundreds or thousands of variables. It is in this context that machine learning methods
might be useful as a means to identify the best predictors of outcomes from among
millions of phenotypic data points.
III. OBJECTIVES 1. To develop a robust and accurate predictive model that can forecast the likelihood
of cardiovascular disease events, such as heart attacks and strokes, based on patient
data.
2. To identify the key risk factors and variables that have the most significant impact
on the development of cardiovascular disease.
4. This involves splitting the dataset into multiple folds, training the models on a
subset of the data and evaluating them on the remaining unseen data.
Model Selection: The first step in the model selection process is to identify the
most appropriate machine learning algorithms for the task of cardiovascular
disease event prediction. Based on the nature of the problem and the available
data, we will consider a range of supervised learning models, including logistic
regression, decision trees, random forests, and gradient boosting algorithm.
Lloyd-Jones DM, Wilson PW, Larson MG, Beiser A, Leip EP, D’Agostino RB,
Levy D. Framingham risk score and prediction of lifetime risk for coronary heart
disease.Am J Cardiol. 2004
Cox DR. Regression models and life-tables.J R Stat Soc Series B. 1972