0% found this document useful (0 votes)

141 views

Python Pandas

This document provides an overview of Python Pandas concepts including data structures, accessing data, working with CSV files, indexing dataframes, data cleaning, aggregation, and merging data. It includes code examples to demonstrate these Pandas techniques on student height and weight data. The document also invites readers to join a Telegram channel for more Pandas hands-on lessons.

Uploaded by

Pushpendra Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

141 views

Python Pandas

Uploaded by

Pushpendra Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Join our channel if you haven’t joined yet https://t.

me/fresco_milestone ( @fresco_milestone )

Python Pandas HandsOns

1. Pandas Data Structures

import pandas as pd
importnumpy as np

heights_A= pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']
print(heights_A.shape)

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']
print(weights_A.dtypes)

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A
print(df_A.shape)

my_mean = 170.0
my_std = 25.0
np.random.seed(100)
heights_B= pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
heights_B.index = ['s1', 's2', 's3', 's4','s5']

my_mean1 = 75.0
my_std1 = 12.0
weights_B =pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
weights_B.index = ['s1', 's2', 's3', 's4','s5']
print(heights_B.mean())

df_B = pd.DataFrame()
df_B['Student_height'] = heights_B
df_B['Student_weight'] = weights_B
print(df_B.columns)

2. Accessing Pandas Data Structures

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']
print(heights_A[1])
print(heights_A[[1,2,3]])

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

height = df_A['Student_height']
print(type(height))

df_s1s2 = df_A[df_A.index.isin(['s1','s2'])]
print(df_s1s2)

df_s2s5s1 = df_A[df_A.index.isin(['s1','s2','s5'])]
df_s2s5s1 = df_s2s5s1.reindex(['s2', 's5', 's1'])
print(df_s2s5s1)

df_s1s4 = df_A[df_A.index.isin(['s1','s4'])]
print(df_s1s4)

3. Working with CSV files

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A.to_csv('classA.csv')

df_A2 = pd.read_csv('classA.csv')
print(df_A2)

df_A3 = pd.read_csv('classA.csv',index_col='Unnamed: 0')

print(df_A3)

my_mean = 170.0
my_std = 25.0
np.random.seed(100)
heights_B = pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

heights_B.index = ['s1', 's2', 's3', 's4','s5']

my_mean1 = 75.0
my_std1 = 12.0
np.random.seed(100)
weights_B = pd.Series(np.random.normal(loc=my_mean1, scale=my_std1, size=5))
weights_B.index = ['s1', 's2', 's3', 's4','s5']

df_B = pd.DataFrame()
df_B['Student_height'] = heights_B
df_B['Student_weight'] = weights_B

df_B.to_csv('classB.csv', index=False)

df_B2 = pd.read_csv('classB.csv')
print(df_B2)

df_B3 = pd.read_csv('classB.csv',header=None)
print(df_B3)

df_B4 = pd.read_csv('classB.csv',header=None,skiprows=2)
print(df_B4)

4. Indexing Dataframes

#Write your code here

import pandas as pd
import numpy as np

DatetimeIndex = pd.date_range(start='09/1/2017', end='09/15/2017')

print(DatetimeIndex[2])

datelist = ['14-Sep-2017', '9-Sep-2017']

dates_to_be_searched = pd.to_datetime(datelist)

print(dates_to_be_searched)

print(dates_to_be_searched.isin(DatetimeIndex))

arraylist = [['classA']5 + ['classB']5, ['s1', 's2', 's3','s4', 's5']*2]

mi_index = pd.MultiIndex.from_product(arraylist, names=['First Level','Second Level'])
print(mi_index.levels)
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

5. Data Cleaning

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A.loc['s3'] = np.nan
df_A.loc['s5'][1] = np.nan

df_A2 = df_A.dropna(how ='any')

print(df_A2)

6. Data Aggregation

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A_filter1 = df_A[(df_A.Student_height > 160.0) & (df_A.Student_weight < 80.0)]

print(df_A_filter1)

df_A_filter2 = df_A[df_A.index.isin(['s5'])]
print(df_A_filter2)
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

df_A['Gender'] = ['M', 'F', 'M', 'M', 'F']

df_groups = df_A.groupby('Gender')
print(df_groups.mean())

7. Data Merge 1

#Write your code here

import pandas as pd
import numpy as np

heights_A = pd.Series([176.2, 158.4, 167.6, 156.2, 161.4])

heights_A.index = ['s1', 's2', 's3', 's4','s5']

weights_A = pd.Series([85.1, 90.2, 76.8, 80.4 , 78.9])

weights_A.index = ['s1', 's2', 's3', 's4','s5']

df_A = pd.DataFrame()
df_A['Student_height'] = heights_A
df_A['Student_weight'] = weights_A

df_A['Gender'] = ['M', 'F', 'M', 'M', 'F']

s = pd.Series([165.4, 82.7, 'F'],index=['Student_height', 'Student_weight', 'Gender'],name='s6')

df_AA = df_A.append(s)
print(df_AA)

my_mean = 170.0
my_std = 25.0
np.random.seed(100)
heights_B = pd.Series(np.random.normal(loc=my_mean, scale=my_std, size=5))
heights_B.index = ['s1', 's2', 's3', 's4','s5']

my_mean1 = 75.0
my_std1 = 12.0
np.random.seed(100)
weights_B = pd.Series(np.random.normal(loc=my_mean1, scale=my_std1, size=5))
weights_B.index = ['s1', 's2', 's3', 's4','s5']

df_B = pd.DataFrame()
df_B['Student_height'] = heights_B
df_B['Student_weight'] = weights_B
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

df_B.index = [ 's7', 's8', 's9', 's10', 's11']

df_B['Gender'] = ['F', 'M', 'F', 'F', 'M']

df = pd.concat([df_AA,df_B])
print(df)

8. Data Merge – 2

#Write your code here

import pandas as pd
import numpy as np

nameid = pd.Series(range(101, 111))

name = pd.Series(['person' + str(i) for i in range(1, 11)])
master = pd.DataFrame()
master['nameid'] = nameid
master['name'] = name

transaction = pd.DataFrame({'nameid':[108, 108, 108,103], 'product':['iPhone', 'Nokia', 'Micromax', 'Viv

o']})

mdf = pd.merge(master,transaction,on='nameid')
print(mdf)

Wings1 T1 Nodejs APIs (62637)
50% (2)
Wings1 T1 Nodejs APIs (62637)
4 pages
Final Project Report
50% (4)
Final Project Report
52 pages
CS501 Quiz 1 Solved by VU Answer
No ratings yet
CS501 Quiz 1 Solved by VU Answer
11 pages
Assignment 61
100% (2)
Assignment 61
4 pages
Error Codes KM C2525e C3232e C4035e SM Uk Ref 03
No ratings yet
Error Codes KM C2525e C3232e C4035e SM Uk Ref 03
28 pages
A0006 DAG1000-4s4owithElastixServerSetup
No ratings yet
A0006 DAG1000-4s4owithElastixServerSetup
16 pages
Student Management System
No ratings yet
Student Management System
9 pages
Block 1-Data Handling Using Pandas DataFrame
No ratings yet
Block 1-Data Handling Using Pandas DataFrame
17 pages
Pandas Guide
No ratings yet
Pandas Guide
64 pages
DS+C25 PGDDS+Masters
No ratings yet
DS+C25 PGDDS+Masters
13 pages
UpGrad Python Practice Question
No ratings yet
UpGrad Python Practice Question
2 pages
Data Science Presentation
100% (3)
Data Science Presentation
113 pages
International Indian School, Riyadh WORKSHEET (2020-2021) Grade - Xii - Informatics Practices - Second Term
No ratings yet
International Indian School, Riyadh WORKSHEET (2020-2021) Grade - Xii - Informatics Practices - Second Term
9 pages
Record 2022-23
No ratings yet
Record 2022-23
92 pages
Exploratory Data Analysis (Eda) With Pandas: (Cheatsheet)
No ratings yet
Exploratory Data Analysis (Eda) With Pandas: (Cheatsheet)
7 pages
Class 12 Ip Practical Programs 2024-25
No ratings yet
Class 12 Ip Practical Programs 2024-25
37 pages
SQL Database Notes
No ratings yet
SQL Database Notes
8 pages
Pandas in Python 16sept2022
No ratings yet
Pandas in Python 16sept2022
8 pages
12 Ip
No ratings yet
12 Ip
5 pages
Data Visualization
No ratings yet
Data Visualization
9 pages
Data Analytics With Python-1
No ratings yet
Data Analytics With Python-1
12 pages
Data Science PPT Module 1
100% (1)
Data Science PPT Module 1
24 pages
On Data Handling Using Pandas-I
100% (2)
On Data Handling Using Pandas-I
64 pages
Python - Module 3
No ratings yet
Python - Module 3
86 pages
IPL DATA ANLYSIS (1)
No ratings yet
IPL DATA ANLYSIS (1)
20 pages
Data Visualization and Matplot
No ratings yet
Data Visualization and Matplot
11 pages
LMRS Ip 2020 21
No ratings yet
LMRS Ip 2020 21
21 pages
Classification Error: Training Errors Generalization Errors
No ratings yet
Classification Error: Training Errors Generalization Errors
39 pages
Data Generalization
No ratings yet
Data Generalization
3 pages
Python Pandas-Series-neww
100% (1)
Python Pandas-Series-neww
80 pages
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
No ratings yet
Pandas - Basics - Practice: Consider The Following Python Dictionary Data and Python List Labels
6 pages
Salary Prediction LinearRegression
100% (1)
Salary Prediction LinearRegression
7 pages
Introduction To Data Visualization in Python
No ratings yet
Introduction To Data Visualization in Python
16 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
48 pages
Project
No ratings yet
Project
18 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
25 pages
Pandas Dataframe Assignment No 3 - Answerkey
No ratings yet
Pandas Dataframe Assignment No 3 - Answerkey
10 pages
Tuple in Python PDF
No ratings yet
Tuple in Python PDF
20 pages
SQL Python Connectivity
No ratings yet
SQL Python Connectivity
61 pages
Poly
100% (1)
Poly
108 pages
Yashica IP Practical
No ratings yet
Yashica IP Practical
51 pages
Artificial Neural Networks Kluniversity Course Handout
No ratings yet
Artificial Neural Networks Kluniversity Course Handout
18 pages
Untitled5 - Jupyter Notebook
100% (2)
Untitled5 - Jupyter Notebook
11 pages
Data Mining
100% (4)
Data Mining
9 pages
Sales Management System Report File - 4
No ratings yet
Sales Management System Report File - 4
23 pages
Python Data Analysis Visualization
No ratings yet
Python Data Analysis Visualization
34 pages
Machine Learning Lab Manual 7
100% (1)
Machine Learning Lab Manual 7
8 pages
SQL Quiz
No ratings yet
SQL Quiz
4 pages
12cs Ernakulam SQP 2223 Solved QP
No ratings yet
12cs Ernakulam SQP 2223 Solved QP
68 pages
Python Pandas2 PDF
No ratings yet
Python Pandas2 PDF
38 pages
Pandas Practice Questions
No ratings yet
Pandas Practice Questions
2 pages
SQL Syntax
No ratings yet
SQL Syntax
321 pages
EDA Assignment
No ratings yet
EDA Assignment
15 pages
Research Paper Presentation Pandas Moshiul Arefin
No ratings yet
Research Paper Presentation Pandas Moshiul Arefin
30 pages
Fds Unit - III
No ratings yet
Fds Unit - III
58 pages
Data Wrangling
No ratings yet
Data Wrangling
13 pages
Tutorial 2 - Clustering
100% (2)
Tutorial 2 - Clustering
6 pages
Pandas
100% (1)
Pandas
1,131 pages
Employee Management
100% (1)
Employee Management
12 pages
Class XII Data Handlinng Using PandasI
No ratings yet
Class XII Data Handlinng Using PandasI
46 pages
Data Analysis and Visualisation With Python
No ratings yet
Data Analysis and Visualisation With Python
75 pages
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
No ratings yet
Cleaning Dirty Data With Pandas & Python - DevelopIntelligence Blog PDF
8 pages
Interface Python With MySQL
0% (1)
Interface Python With MySQL
2 pages
Python Pandas Handson
No ratings yet
Python Pandas Handson
6 pages
Pandas
No ratings yet
Pandas
4 pages
Image Classification Handson-Image - Test
No ratings yet
Image Classification Handson-Image - Test
5 pages
Question Wise Details
No ratings yet
Question Wise Details
4 pages
Wa0002
No ratings yet
Wa0002
11 pages
Image Classification Hands-On
100% (1)
Image Classification Hands-On
1 page
Gradle
No ratings yet
Gradle
6 pages
Automation Anywhere - Latest1
No ratings yet
Automation Anywhere - Latest1
4 pages
Bitbucket
No ratings yet
Bitbucket
2 pages
Infrastructre As Code
No ratings yet
Infrastructre As Code
2 pages
Mobile Primer Course
No ratings yet
Mobile Primer Course
4 pages
Ansible - Automation Sibelius
No ratings yet
Ansible - Automation Sibelius
4 pages
MILESTONE CHALLENGE Not Cleared Contigency Using Data Visualization
No ratings yet
MILESTONE CHALLENGE Not Cleared Contigency Using Data Visualization
2 pages
Magento
No ratings yet
Magento
10 pages
Kafka Remanere
No ratings yet
Kafka Remanere
7 pages
Fresco Points Details
No ratings yet
Fresco Points Details
6 pages
Asset Procedure
No ratings yet
Asset Procedure
2 pages
Wings1 T1 Full-Stack Application (62638)
No ratings yet
Wings1 T1 Full-Stack Application (62638)
6 pages
Arrowhead PDF
No ratings yet
Arrowhead PDF
31 pages
Uccx - B - 125getting Started Ip Ivr
No ratings yet
Uccx - B - 125getting Started Ip Ivr
110 pages
Full Prep Ms-102
No ratings yet
Full Prep Ms-102
373 pages
Bashsploit Level 8
No ratings yet
Bashsploit Level 8
3 pages
Assignment No1 of System Analysis and Design: Submitted To Submitted by
No ratings yet
Assignment No1 of System Analysis and Design: Submitted To Submitted by
7 pages
Dinakaran S: Register No.: Student Name
No ratings yet
Dinakaran S: Register No.: Student Name
2 pages
Azure ShortNotes
No ratings yet
Azure ShortNotes
2 pages
Jadwal Wfh-Wfo Qcs 4-25 Jan 2021 Rev.1
No ratings yet
Jadwal Wfh-Wfo Qcs 4-25 Jan 2021 Rev.1
1 page
Đề 3
No ratings yet
Đề 3
4 pages
SmartGuard 600 Controller - CompactLogix - Chingon
No ratings yet
SmartGuard 600 Controller - CompactLogix - Chingon
28 pages
Form 4 EOT 3 Exam June 2024
No ratings yet
Form 4 EOT 3 Exam June 2024
15 pages
BS7 2022 End of Term 3 Computing Paper 2
No ratings yet
BS7 2022 End of Term 3 Computing Paper 2
4 pages
SVERKER-900 CT-testing AN en V02
No ratings yet
SVERKER-900 CT-testing AN en V02
12 pages
TROM
No ratings yet
TROM
5 pages
ProStream 9100 Release
100% (1)
ProStream 9100 Release
79 pages
Listado de Videos Del Curso Power System Protection
No ratings yet
Listado de Videos Del Curso Power System Protection
5 pages
CCNA Cyber Ops Version 11 Chapter 2 Exam Answers Full
No ratings yet
CCNA Cyber Ops Version 11 Chapter 2 Exam Answers Full
13 pages
Python Interview Questions and Answers For 2019 - Intellipaat
No ratings yet
Python Interview Questions and Answers For 2019 - Intellipaat
25 pages
CANNON Vixia HF r10
No ratings yet
CANNON Vixia HF r10
184 pages
Library System Thesis PDF
100% (3)
Library System Thesis PDF
4 pages
XII CS practical (1-17)Session 2024-25
No ratings yet
XII CS practical (1-17)Session 2024-25
46 pages
Whitepaper 01
No ratings yet
Whitepaper 01
13 pages
Dbms Lab Manual r20 Syllabus
No ratings yet
Dbms Lab Manual r20 Syllabus
48 pages
Developing Maturity Models for IT Management, A Procedure Model and its Application
No ratings yet
Developing Maturity Models for IT Management, A Procedure Model and its Application
11 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Debate On Hospital Information System and Manual Documentation
No ratings yet
Debate On Hospital Information System and Manual Documentation
16 pages