0% found this document useful (0 votes)

29 views3 pages

Python Cod1

Uploaded by

Monica H N

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views3 pages

Python Cod1

Uploaded by

Monica H N

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Python Code:

import pandas as pd

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score

# Step 1: Load the dataset

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/heart.csv"

df = pd.read_csv(url)

# Step 2: Display the first few rows of the dataset

print("Initial Data:\n", df.head())

# Step 3: Check for missing values

print("Missing Values:\n", df.isnull().sum())

# Step 4: Handle missing values (if any)

# For this dataset, there are no missing values, but if there were, you could use:

# df.fillna(method='ffill', inplace=True) # Forward fill or drop missing values

# Step 5: Display the data types

print("Data Types:\n", df.dtypes)

# Step 6: String manipulation example (if needed)

# Example: Clean a string column (if applicable)

# df['gender'] = df['gender'].str.lower().str.strip()

# Step 7: Convert relevant columns to NumPy arrays

age_array = df['age'].to_numpy()

cholesterol_array = df['cholesterol'].to_numpy()
# Step 8: Calculate basic statistics

mean_age = np.mean(age_array)

median_cholesterol = np.median(cholesterol_array)

print(f"Mean Age: {mean_age}, Median Cholesterol: {median_cholesterol}")

# Step 9: Define features and target variable

X = df.drop(columns=['target']) # Assuming 'target' is the column to predict

y = df['target']

# Step 10: Split the dataset into training and testing sets (80% train, 20% test)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print(f"Training set size: {X_train.shape[0]}, Testing set size: {X_test.shape[0]}")

# Step 11: Initialize and train the model

model = LogisticRegression(max_iter=200)

model.fit(X_train, y_train)

# Step 12: Make predictions on the test set

y_pred = model.predict(X_test)

# Step 13: Evaluate the model's performance

accuracy = accuracy_score(y_test, y_pred)

print(f"Accuracy of the model: {accuracy:.2f}")

# Step 14: Save the report to a text file

with open("heart_disease_analysis_report.txt", "w") as file:

file.write("Heart Disease Analysis Report\n")

file.write("Objective: Analyze the dataset to predict heart disease.\n")

file.write("Data Loading and Cleaning: Loaded and cleaned the dataset, finding no missing
values.\n")

file.write("Statistical Analysis: Mean Age: {}, Median Cholesterol: {}.\n".format(mean_age,

median_cholesterol))

file.write("Model Accuracy: {}.\n".format(accuracy))

Report Summary

Objective: The goal was to analyze the Heart Disease UCI dataset to predict heart disease using
machine learning techniques.
Data Loading and Cleaning: The dataset was loaded using Pandas. No missing values were found,
ensuring a clean dataset for analysis.

String Manipulation: Though the dataset primarily contains numerical data, string manipulation
techniques were demonstrated. In datasets with categorical string data, operations such as
lowercasing and stripping spaces are crucial for uniformity.

Statistical Analysis: Basic statistics were computed using NumPy, revealing a mean age of
approximately X and a median cholesterol level of Y.

Data Splitting: The dataset was split into training (80%) and testing (20%) sets to validate the model's
performance.

Model Building: A Logistic Regression model was chosen for binary classification. The model was
trained on the training set and achieved an accuracy of Z on the test set, indicating a good predictive
capability.

Conclusion: This analysis demonstrated effective data manipulation, cleaning, and the successful
application of machine learning to predict heart disease. Future work could involve exploring other
algorithms and tuning model parameters for improved accuracy.

Kantar Notes Calls
No ratings yet
Kantar Notes Calls
20 pages
Heart Disease Report
No ratings yet
Heart Disease Report
8 pages
AIML Practical 05 22105A2021
No ratings yet
AIML Practical 05 22105A2021
9 pages
ML Projects Part C
No ratings yet
ML Projects Part C
8 pages
SUMMARY
No ratings yet
SUMMARY
16 pages
A.I Lab Report
No ratings yet
A.I Lab Report
24 pages
PROJECTS
No ratings yet
PROJECTS
6 pages
Heart Disease Report With Comments and Code
No ratings yet
Heart Disease Report With Comments and Code
9 pages
Report
No ratings yet
Report
11 pages
Project Report
No ratings yet
Project Report
18 pages
Ai ML Exp1
No ratings yet
Ai ML Exp1
8 pages
AI Mini Project
No ratings yet
AI Mini Project
6 pages
INFX 499 Milestone 1
No ratings yet
INFX 499 Milestone 1
8 pages
Early Detection of Ischemic Heart Disease Through Deep Learning Techniques
No ratings yet
Early Detection of Ischemic Heart Disease Through Deep Learning Techniques
5 pages
Heart Disease Predictive Analysis
No ratings yet
Heart Disease Predictive Analysis
4 pages
Cardiovascular Disease Prediction
No ratings yet
Cardiovascular Disease Prediction
2 pages
Lab Report Content - 15marks
No ratings yet
Lab Report Content - 15marks
10 pages
Cse437 4
No ratings yet
Cse437 4
14 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
Case Study
No ratings yet
Case Study
21 pages
Aiml Programs
No ratings yet
Aiml Programs
12 pages
Second Progres Report
No ratings yet
Second Progres Report
10 pages
Title Name School Supervisor SRN Date Progress Report Number Duration of The Project
No ratings yet
Title Name School Supervisor SRN Date Progress Report Number Duration of The Project
8 pages
Diabetic Prediction Using LogicalRegression
No ratings yet
Diabetic Prediction Using LogicalRegression
9 pages
Final Year Project Report
No ratings yet
Final Year Project Report
20 pages
Heart Disease Pre
No ratings yet
Heart Disease Pre
23 pages
Web Application
No ratings yet
Web Application
13 pages
HEART
No ratings yet
HEART
15 pages
4-10 Aiml
No ratings yet
4-10 Aiml
25 pages
ML (Lab 8) Tasks Bilal Habib (5th Semester)
No ratings yet
ML (Lab 8) Tasks Bilal Habib (5th Semester)
16 pages
Slide 1
No ratings yet
Slide 1
7 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
11 pages
Bhavan Phase3 Prj.
No ratings yet
Bhavan Phase3 Prj.
24 pages
Final PPT Heart Disease
67% (3)
Final PPT Heart Disease
23 pages
Heart Disease
No ratings yet
Heart Disease
5 pages
Batch-2 (Review 2)
No ratings yet
Batch-2 (Review 2)
19 pages
Edited Version of Cardiovascular Diseases Risk Prediction Dataset Report
No ratings yet
Edited Version of Cardiovascular Diseases Risk Prediction Dataset Report
25 pages
HDD New Report
No ratings yet
HDD New Report
95 pages
Technical Presentation
No ratings yet
Technical Presentation
4 pages
Logistic Regression 205
No ratings yet
Logistic Regression 205
8 pages
ML Practicals
No ratings yet
ML Practicals
21 pages
Heart Disease Detection - Newreport
No ratings yet
Heart Disease Detection - Newreport
57 pages
Project - Predicting Heart Disease
No ratings yet
Project - Predicting Heart Disease
2 pages
A Summer Internship Report
No ratings yet
A Summer Internship Report
27 pages
Final
No ratings yet
Final
13 pages
Review 2
No ratings yet
Review 2
23 pages
Heart Disease Prediction Theory
No ratings yet
Heart Disease Prediction Theory
10 pages
ML Report Edited
No ratings yet
ML Report Edited
10 pages
Abstract 1
No ratings yet
Abstract 1
1 page
Synopsis
No ratings yet
Synopsis
4 pages
Abstract of Heart Disease Prediction Using ML
No ratings yet
Abstract of Heart Disease Prediction Using ML
2 pages
Heart Disease Prediction Professional
No ratings yet
Heart Disease Prediction Professional
10 pages
ML Report
No ratings yet
ML Report
12 pages
Heart Disease Predictor
No ratings yet
Heart Disease Predictor
3 pages
Heart Disease Prediction System Using Machine Learning 1
No ratings yet
Heart Disease Prediction System Using Machine Learning 1
17 pages
Deep Learning Project Report
No ratings yet
Deep Learning Project Report
7 pages
Python Project Report
No ratings yet
Python Project Report
15 pages
Cardiovascular Disease Predictive Modeling
No ratings yet
Cardiovascular Disease Predictive Modeling
3 pages
6596-Article Text-7064-1-10-20230514
No ratings yet
6596-Article Text-7064-1-10-20230514
6 pages
21EC71 AVLSI Answers
No ratings yet
21EC71 AVLSI Answers
31 pages
Cns Solutions Set2
No ratings yet
Cns Solutions Set2
35 pages
ADVLSI - Model QP Solution
No ratings yet
ADVLSI - Model QP Solution
17 pages
Cns Solutions Set1
No ratings yet
Cns Solutions Set1
37 pages
Python Code
No ratings yet
Python Code
2 pages
Oil Gas market-EN PDF
No ratings yet
Oil Gas market-EN PDF
4 pages
Evolve Industry Deep Dive - Capital Markets Landscape - Final
No ratings yet
Evolve Industry Deep Dive - Capital Markets Landscape - Final
50 pages
Hailey College of Banking &finance: University of The Punjab, Lahore
No ratings yet
Hailey College of Banking &finance: University of The Punjab, Lahore
3 pages
Datacard SP75 Plus Duplex Color Card Printer
No ratings yet
Datacard SP75 Plus Duplex Color Card Printer
2 pages
Fee Voucher
No ratings yet
Fee Voucher
1 page
Dry Ports. Research Outcomes, Trends, and Future Implications
No ratings yet
Dry Ports. Research Outcomes, Trends, and Future Implications
28 pages
Sample Freelance Agreement To Offer Content-Writing Services To A Potential Client As A Freelancer
No ratings yet
Sample Freelance Agreement To Offer Content-Writing Services To A Potential Client As A Freelancer
7 pages
Hiring Report
No ratings yet
Hiring Report
25 pages
Airbus Rotorcraft Flight Manual - FLM - BK117D3
No ratings yet
Airbus Rotorcraft Flight Manual - FLM - BK117D3
1,988 pages
TiO2 Spec of R299-YuejiangChem
No ratings yet
TiO2 Spec of R299-YuejiangChem
1 page
Assessing The Socio-Economic Impacts of Rural Infrastructure Projects On Community Development
No ratings yet
Assessing The Socio-Economic Impacts of Rural Infrastructure Projects On Community Development
18 pages
Bobcat Business Plan
100% (1)
Bobcat Business Plan
10 pages
1 - Transportation Requirements Aviation NiCd - 2019
No ratings yet
1 - Transportation Requirements Aviation NiCd - 2019
4 pages
National Income and Related Aggregates: Dr. Roopali Srivastava Department of Management ITS, Ghaziabad
No ratings yet
National Income and Related Aggregates: Dr. Roopali Srivastava Department of Management ITS, Ghaziabad
30 pages
Anybus Communicator - PROFINET IO Interface Installation Sheet
No ratings yet
Anybus Communicator - PROFINET IO Interface Installation Sheet
2 pages
HCLTB0678118
No ratings yet
HCLTB0678118
2 pages
Configure A Life Event in UKG Pro Benefits
No ratings yet
Configure A Life Event in UKG Pro Benefits
15 pages
SS FC Tables
No ratings yet
SS FC Tables
1 page
Defense Primer: Electronic Warfare: Role of EW in Military Operations
No ratings yet
Defense Primer: Electronic Warfare: Role of EW in Military Operations
3 pages
Guide: Getting Started
No ratings yet
Guide: Getting Started
59 pages
Resolute Absolute Optical Encoder With Siemens Drive-Cliq Serial Communications
No ratings yet
Resolute Absolute Optical Encoder With Siemens Drive-Cliq Serial Communications
10 pages
Jed Magazine 035 - 2012 02
No ratings yet
Jed Magazine 035 - 2012 02
52 pages
Kurl On
No ratings yet
Kurl On
1 page
Quang Silic - List 600 Domain Link Building + 160 Forum - Tuyển Dụng - Rao Vặt Việt Nam
No ratings yet
Quang Silic - List 600 Domain Link Building + 160 Forum - Tuyển Dụng - Rao Vặt Việt Nam
23 pages
Presentation On Photoshop and It'S Working Environment: Presented by
No ratings yet
Presentation On Photoshop and It'S Working Environment: Presented by
20 pages
Fuce 200800028
No ratings yet
Fuce 200800028
7 pages
Essential Documents Clinical Trial GCP
No ratings yet
Essential Documents Clinical Trial GCP
6 pages
Optimizing Banking Operating Models: From Strategy To Implementation
No ratings yet
Optimizing Banking Operating Models: From Strategy To Implementation
16 pages
European Union and Brexit
No ratings yet
European Union and Brexit
3 pages

Python Cod1

Uploaded by

Python Cod1

Uploaded by

Python Code:

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score

# Step 1: Load the dataset

# Step 2: Display the first few rows of the dataset

print("Initial Data:\n", df.head())

# Step 3: Check for missing values

print("Missing Values:\n", df.isnull().sum())

# Step 4: Handle missing values (if any)

# df.fillna(method='ffill', inplace=True) # Forward fill or drop missing values

# Step 5: Display the data types

print("Data Types:\n", df.dtypes)

# Step 6: String manipulation example (if needed)

# Example: Clean a string column (if applicable)

# Step 7: Convert relevant columns to NumPy arrays

print(f"Mean Age: {mean_age}, Median Cholesterol: {median_cholesterol}")

# Step 9: Define features and target variable

X = df.drop(columns=['target']) # Assuming 'target' is the column to predict

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print(f"Training set size: {X_train.shape[0]}, Testing set size: {X_test.shape[0]}")

# Step 11: Initialize and train the model

# Step 12: Make predictions on the test set

# Step 13: Evaluate the model's performance

accuracy = accuracy_score(y_test, y_pred)

print(f"Accuracy of the model: {accuracy:.2f}")

# Step 14: Save the report to a text file

with open("heart_disease_analysis_report.txt", "w") as file:

file.write("Heart Disease Analysis Report\n")

file.write("Objective: Analyze the dataset to predict heart disease.\n")

file.write("Statistical Analysis: Mean Age: {}, Median Cholesterol: {}.\n".format(mean_age,

file.write("Model Accuracy: {}.\n".format(accuracy))

You might also like