0% found this document useful (0 votes)

13 views5 pages

Experiment 2

The document loads and cleans student academic performance data from a CSV file. It performs various operations like handling missing values, calculating descriptive statistics, and adding new columns with derived values.

Uploaded by

MR. GAMER

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views5 pages

Experiment 2

Uploaded by

MR. GAMER

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

In [1]: import numpy as np

import matplotlib.pyplot as plt

%matplotlib inline

In [2]: import pandas as pd

In [3]: df = pd.read_csv(r"D:\College\TE\SEM-2\Practical\DSBDA\2\AcademicPerformance.csv")

In [4]: print(df)

gender race/ethnicity parental level of education lunch \

0 female group B bachelor's degree standard
1 female group C some college standard
2 female group B master's degree standard
3 male group A associate's degree free/reduced
4 male group C some college standard
... ... ... ... ...
2235 NaN NaN NaN NaN
2236 NaN NaN NaN NaN
2237 NaN NaN NaN NaN
2238 NaN NaN NaN NaN
2239 NaN NaN NaN NaN

test preparation course Year_Birth math score reading score \

0 none 1970.0 72.0 72
1 completed 1961.0 NaN na
2 none 1958.0 90.0 95
3 none 1967.0 NaN NaN
4 none 1989.0 76.0 78
... ... ... ... ...
2235 NaN NaN NaN NaN
2236 NaN NaN NaN NaN
2237 NaN NaN NaN NaN
2238 NaN NaN NaN NaN
2239 NaN NaN NaN NaN

writing score Dt_Admission College_Fees

0 74 6/16/14 $84,835.00
1 A 6/15/14 $57,091.00
2 93 5/13/14 $67,267.00
3 44 05-11-2014 $32,474.00
4 75 04-08-2014 $21,474.00
... ... ... ...
2235 NaN NaN NaN
2236 NaN NaN NaN
2237 NaN NaN NaN
2238 NaN NaN NaN
2239 NaN NaN NaN

[2240 rows x 11 columns]

In [5]: print(df['math score'])

0 72.0
1 NaN
2 90.0
3 NaN
4 76.0
...
2235 NaN
2236 NaN
2237 NaN
2238 NaN
2239 NaN
Name: math score, Length: 2240, dtype: float64

In [6]: print(df['math score'].isnull())

0 False
1 True
2 False
3 True
4 False
...
2235 True
2236 True
2237 True
2238 True
2239 True
Name: math score, Length: 2240, dtype: bool

In [7]: print(df['reading score'])

0 72
1 na
2 95
3 NaN
4 78
...
2235 NaN
2236 NaN
2237 NaN
2238 NaN
2239 NaN
Name: reading score, Length: 2240, dtype: object

In [8]: print(df['reading score'].isnull())

0 False
1 False
2 False
3 True
4 False
...
2235 True
2236 True
2237 True
2238 True
2239 True
Name: reading score, Length: 2240, dtype: bool
In [9]: missing_values = ["n/a", "na", "--"]
df = pd.read_csv(r"D:\College\TE\SEM-2\Practical\DSBDA\2\AcademicPerformance.csv", na_v

In [10]: print(df['reading score'])

0 72.0
1 NaN
2 95.0
3 NaN
4 78.0
...
2235 NaN
2236 NaN
2237 NaN
2238 NaN
2239 NaN
Name: reading score, Length: 2240, dtype: float64

In [11]: print(df['reading score'].isnull())

0 False
1 True
2 False
3 True
4 False
...
2235 True
2236 True
2237 True
2238 True
2239 True
Name: reading score, Length: 2240, dtype: bool

In [12]: dataset = [11,41,20,3,101,55,68,97,99,6]

In [13]: sorted(dataset)

Out[13]: [3, 6, 11, 20, 41, 55, 68, 97, 99, 101]

In [14]: quantile1, quantile3 = np.percentile(dataset, [25,75])

In [15]: print(quantile1, quantile3)

13.25 89.75

In [16]: iqr_value = (quantile3 - quantile1)

In [17]: print(iqr_value)

76.5

In [18]: lower_bound_value = quantile1 - (1.5*iqr_value)

In [19]: upper_bound_value = quantile3 + (1.5*iqr_value)

In [20]: print(lower_bound_value, upper_bound_value)

-101.5 204.5

In [21]: from datetime import date

df['age'] = date.today().year - df['Year_Birth']

In [22]: df['Year'] = pd.DatetimeIndex(df['Dt_Admission']).year

df['E_L'] = date.today().year - df['Year']

In [23]: df.head(5)

Out[23]:
parental test
math reading writing
gender race/ethnicity level of lunch preparation Year_Birth Dt_Admi
score score score
education course

bachelor's
0 female group B standard none 1970.0 72.0 72.0 74 6/
degree

some
1 female group C standard completed 1961.0 NaN NaN A 6/
college

master's
2 female group B standard none 1958.0 90.0 95.0 93 5/
degree

associate's
3 male group A free/reduced none 1967.0 NaN NaN 44 05-11
degree

some
4 male group C standard none 1989.0 76.0 78.0 75 04-08
college

In [24]: df['Fees$'] = df['College_Fees'].str.replace(',', '').str.replace('$', '').str.replace(

df['Fees_M$'] = df['Fees$'].apply(lambda X:round(X/1000000))

In [25]: df.head(5)

Out[25]:
parental test
math reading writing
gender race/ethnicity level of lunch preparation Year_Birth Dt_Admi
score score score
education course

bachelor's
0 female group B standard none 1970.0 72.0 72.0 74 6/
degree

some
1 female group C standard completed 1961.0 NaN NaN A 6/
college

master's
2 female group B standard none 1958.0 90.0 95.0 93 5/
degree

associate's
3 male group A free/reduced none 1967.0 NaN NaN 44 05-11
degree

some
4 male group C standard none 1989.0 76.0 78.0 75 04-08
college

In [ ]:

Crown Catalog (2012)
100% (1)
Crown Catalog (2012)
466 pages
Data Preprocessing
No ratings yet
Data Preprocessing
27 pages
01 Working With CSV Files
No ratings yet
01 Working With CSV Files
27 pages
Rajat DM
No ratings yet
Rajat DM
54 pages
B "Hello, World!" Print (B (2:5) ) Llo
No ratings yet
B "Hello, World!" Print (B (2:5) ) Llo
52 pages
DS&BDA 1-14
No ratings yet
DS&BDA 1-14
95 pages
Student Dropout
No ratings yet
Student Dropout
38 pages
Vantika Kamra's Practical File 12 Diamond (26600872)
No ratings yet
Vantika Kamra's Practical File 12 Diamond (26600872)
46 pages
Chapter 18 Matrix Analysis of Beams and Frames by The Direct Stiffness Method
100% (1)
Chapter 18 Matrix Analysis of Beams and Frames by The Direct Stiffness Method
38 pages
Week 5 LAB
No ratings yet
Week 5 LAB
23 pages
Data Analysis Process
No ratings yet
Data Analysis Process
95 pages
1723524625270_Data_Frame_Notes3
No ratings yet
1723524625270_Data_Frame_Notes3
39 pages
DW 14
No ratings yet
DW 14
14 pages
Panda Merged
No ratings yet
Panda Merged
19 pages
Import: Sys - Executable - M Pip Install
No ratings yet
Import: Sys - Executable - M Pip Install
23 pages
student analysis
No ratings yet
student analysis
16 pages
Jamboree_Case_Study
No ratings yet
Jamboree_Case_Study
24 pages
Data Science Practical Problems
No ratings yet
Data Science Practical Problems
40 pages
Tutorial 6
No ratings yet
Tutorial 6
13 pages
12 Pandas
100% (1)
12 Pandas
21 pages
Info Practical
No ratings yet
Info Practical
56 pages
Data Preprocessing - Ipynb - Colaboratory
No ratings yet
Data Preprocessing - Ipynb - Colaboratory
7 pages
e 6222002
No ratings yet
e 6222002
33 pages
Data Manipulation With Python Pandas 1700003764
No ratings yet
Data Manipulation With Python Pandas 1700003764
10 pages
vertopal.com_student_performance_analysis (1)
No ratings yet
vertopal.com_student_performance_analysis (1)
22 pages
vertopal.com_homework1
No ratings yet
vertopal.com_homework1
17 pages
ML Cops
No ratings yet
ML Cops
17 pages
Assignment 5
No ratings yet
Assignment 5
14 pages
Vertopal.com_457 Labs
No ratings yet
Vertopal.com_457 Labs
19 pages
DSGTChap 5 Algebraic Systems
No ratings yet
DSGTChap 5 Algebraic Systems
68 pages
Document (4)
No ratings yet
Document (4)
15 pages
Lambda Functions & Alternative Methods in Python
No ratings yet
Lambda Functions & Alternative Methods in Python
8 pages
Student Notebook HR Analysis
No ratings yet
Student Notebook HR Analysis
11 pages
230103-ECON209_S2025__Lab_2.ipynb-Colab
No ratings yet
230103-ECON209_S2025__Lab_2.ipynb-Colab
10 pages
Samarth Raghav
No ratings yet
Samarth Raghav
15 pages
Data Preprocessing
No ratings yet
Data Preprocessing
5 pages
4ems
No ratings yet
4ems
38 pages
Handling Missing Data in Pandas by Jaume Boguñá
No ratings yet
Handling Missing Data in Pandas by Jaume Boguñá
17 pages
_payal_2_practical (1)_edited
No ratings yet
_payal_2_practical (1)_edited
9 pages
DALab Part-B BCU&BU
No ratings yet
DALab Part-B BCU&BU
12 pages
02. Python Pandas - 2 2020-21
No ratings yet
02. Python Pandas - 2 2020-21
21 pages
Ch24 Testbank
100% (1)
Ch24 Testbank
40 pages
DSBDA02
No ratings yet
DSBDA02
8 pages
00 - Project - Your First Data Science Project - Jupyter Notebook
No ratings yet
00 - Project - Your First Data Science Project - Jupyter Notebook
8 pages
Group Theory (MIlne)
No ratings yet
Group Theory (MIlne)
133 pages
Lab 09
No ratings yet
Lab 09
3 pages
Predictive+Modelling+-+Logistic+Regression+-+Student+Version-New2.3.ipynb - Colaboratory
No ratings yet
Predictive+Modelling+-+Logistic+Regression+-+Student+Version-New2.3.ipynb - Colaboratory
12 pages
List of Practical Ip065 Xii Session 2025 Ckc Academy
No ratings yet
List of Practical Ip065 Xii Session 2025 Ckc Academy
19 pages
Apex Financial Services Loan Data Automation
No ratings yet
Apex Financial Services Loan Data Automation
18 pages
vertopal.com_IBA Practical Set A 14th Dec
No ratings yet
vertopal.com_IBA Practical Set A 14th Dec
3 pages
Project_Prog
No ratings yet
Project_Prog
6 pages
Assignment
No ratings yet
Assignment
2 pages
07 Instruction Manual - P626
100% (4)
07 Instruction Manual - P626
233 pages
Practical List 2022-23
100% (1)
Practical List 2022-23
4 pages
Data Wrangling- Jupyter Notebook
No ratings yet
Data Wrangling- Jupyter Notebook
5 pages
Comprehensive Trigonometry For IIT JEE Main and Advanced Rejaul Makshud McGraw Hill (PDFDrive) - 51-100
No ratings yet
Comprehensive Trigonometry For IIT JEE Main and Advanced Rejaul Makshud McGraw Hill (PDFDrive) - 51-100
50 pages
Credit Card Default
No ratings yet
Credit Card Default
5 pages
Experiment 1
No ratings yet
Experiment 1
5 pages
Article_Gabo_Ergonomics
No ratings yet
Article_Gabo_Ergonomics
20 pages
DA Lab Manual r22
No ratings yet
DA Lab Manual r22
31 pages
00 - Lesson - Data Science Workflow - Jupyter Notebook
No ratings yet
00 - Lesson - Data Science Workflow - Jupyter Notebook
6 pages
Chapter-4_Humanoid , Forward and Inverse Kinematics
No ratings yet
Chapter-4_Humanoid , Forward and Inverse Kinematics
22 pages
Unit3_3) Pandas.ipynb - Colab
No ratings yet
Unit3_3) Pandas.ipynb - Colab
11 pages
DSBDA_prac2
No ratings yet
DSBDA_prac2
2 pages
2503.07885v1
No ratings yet
2503.07885v1
23 pages
Lab3.ipynb - Colaboratory
No ratings yet
Lab3.ipynb - Colaboratory
7 pages
Pandas Revision1
No ratings yet
Pandas Revision1
2 pages
codealpha_studentseda
No ratings yet
codealpha_studentseda
2 pages
4 PLC Hardware and Logic Gates (1)
No ratings yet
4 PLC Hardware and Logic Gates (1)
18 pages
Practical File Questions With Answers
No ratings yet
Practical File Questions With Answers
7 pages
Manual DO 2720
No ratings yet
Manual DO 2720
12 pages
Beckman Industrial UC10 Operators Manual
No ratings yet
Beckman Industrial UC10 Operators Manual
18 pages
Eaton 5px Ups Upgrade Instructions
No ratings yet
Eaton 5px Ups Upgrade Instructions
14 pages
Math6 Q3 Module5 Week5
0% (1)
Math6 Q3 Module5 Week5
4 pages
Core Topics HL Chapters Summaries
No ratings yet
Core Topics HL Chapters Summaries
8 pages
Diffie-Hellman Key Exchange
No ratings yet
Diffie-Hellman Key Exchange
6 pages
Exhaustive Aircraft Design Syllabus Fixed
No ratings yet
Exhaustive Aircraft Design Syllabus Fixed
4 pages
Releasing Module: Standard Features
No ratings yet
Releasing Module: Standard Features
10 pages
Features and Applications Trenching Dimensions
0% (1)
Features and Applications Trenching Dimensions
2 pages
Calibration Curves: Area (2-Me-1-Buoh) Compou ND PPM (M/M) Area (1-Proh) Area (Etoac) 95% Confiden Ce
No ratings yet
Calibration Curves: Area (2-Me-1-Buoh) Compou ND PPM (M/M) Area (1-Proh) Area (Etoac) 95% Confiden Ce
4 pages
Summative Test in Grade Math 2nd Quarter Copy Copy 2
No ratings yet
Summative Test in Grade Math 2nd Quarter Copy Copy 2
4 pages
Pump VFD ES Estimator
No ratings yet
Pump VFD ES Estimator
2 pages
@ Belt Flex Tester
No ratings yet
@ Belt Flex Tester
2 pages
Concave Function
No ratings yet
Concave Function
3 pages
Mock 1
No ratings yet
Mock 1
12 pages
6 Lessons To Help You Find Trading Opportunities in Any Market
No ratings yet
6 Lessons To Help You Find Trading Opportunities in Any Market
9 pages
Excel Shortcut Keys
No ratings yet
Excel Shortcut Keys
4 pages
Embedded System Syllabus
No ratings yet
Embedded System Syllabus
3 pages
Aw 50-40
No ratings yet
Aw 50-40
0 pages
FreeCAD 0.20 Black Book
From Everand
FreeCAD 0.20 Black Book
Gaurav Verma
5/5 (1)