Python Data Analysis: Exploratory Data Analysis

This document is a cheat sheet for exploratory data analysis using Python, detailing various methods and their corresponding code examples. It covers techniques such as correlation matrices, scatter plots, regression plots, box plots, grouping by attributes, group by statements, pivot tables, pseudocolor plots, and calculating the Pearson coefficient and p-value. Each method is accompanied by a brief description and a code snippet for implementation.

Uploaded by

w123lucy




Data Analysis with Python

Cheat Sheet: Exploratory Data Analysis

Package/Method Description Code Example

Complete dataframe correlation
Description: Correlation matrix created using all the attributes of the dataset.
Code example: df.corr()

Specific Attribute correlation
Description: Correlation matrix created using specific attributes of the dataset.
Code example: df[['attribute1','attribute2',...]].corr()
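The two correlation entries above can be tried on a small table. This is a minimal sketch, assuming a hypothetical toy DataFrame whose column names (engine_size, price, mpg, make) are illustrative and not from the cheat sheet. Because recent pandas versions may raise an error when df.corr() meets text columns, the sketch first keeps only the numeric columns with select_dtypes.

```python
import pandas as pd

# Hypothetical toy data; column names are illustrative only.
df = pd.DataFrame({
    "engine_size": [1.0, 2.0, 3.0, 4.0],
    "price": [10.0, 20.0, 30.0, 40.0],
    "mpg": [40.0, 30.0, 20.0, 10.0],
    "make": ["a", "b", "c", "d"],  # text column, excluded below
})

# Correlation matrix over every numeric attribute.
full_corr = df.select_dtypes(include="number").corr()

# Correlation matrix over a chosen pair of attributes.
pair_corr = df[["engine_size", "price"]].corr()

print(full_corr)
print(pair_corr)
```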

Scatter Plot
Description: Create a scatter plot using the data points of the independent variable along the x-axis and the dependent variable along the y-axis.
Code example:
    from matplotlib import pyplot as plt
    plt.scatter(df['attribute_1'], df['attribute_2'])
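A scatter plot along these lines might look like the sketch below. The data and column names are hypothetical, and the Agg backend is selected only so the script runs without a display; in a notebook or interactive session that line can be dropped and plt.show() called instead.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
from matplotlib import pyplot as plt
import pandas as pd

# Hypothetical toy data; column names are illustrative only.
df = pd.DataFrame({
    "engine_size": [1.0, 2.0, 3.0, 4.0],
    "price": [11.0, 19.0, 32.0, 41.0],
})

fig, ax = plt.subplots()
# First argument goes on the x-axis, second on the y-axis.
points = ax.scatter(df["engine_size"], df["price"])
ax.set_xlabel("engine_size")
ax.set_ylabel("price")
```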

Regression Plot
Description: Uses the dependent and independent variables in a Pandas data frame to create a scatter plot with a generated linear regression line for the data.
Code example:
    import seaborn as sns
    sns.regplot(x='attribute_1', y='attribute_2', data=df)
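As a rough illustration of the regression plot, the sketch below calls sns.regplot on hypothetical toy data. The column names are assumptions, and the Agg backend is chosen only so the code runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
import pandas as pd
import seaborn as sns

# Hypothetical toy data; column names are illustrative only.
df = pd.DataFrame({
    "engine_size": [1.0, 2.0, 3.0, 4.0, 5.0],
    "price": [12.0, 19.0, 33.0, 38.0, 52.0],
})

# regplot draws the scatter points plus a fitted linear regression line
# and returns the matplotlib Axes it drew on.
ax = sns.regplot(x="engine_size", y="price", data=df)
```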

Box plot
Description: Create a box-and-whisker plot that uses the pandas dataframe, the dependent, and the independent variables.
Code example:
    import seaborn as sns
    sns.boxplot(x='attribute_1', y='attribute_2', data=df)
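Box plots are typically most useful when the x attribute is categorical. The sketch below assumes a hypothetical categorical column drive and a numeric column price, neither of which comes from the cheat sheet.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
import pandas as pd
import seaborn as sns

# Hypothetical toy data: one categorical and one numeric column.
df = pd.DataFrame({
    "drive": ["fwd", "fwd", "fwd", "rwd", "rwd", "rwd"],
    "price": [10.0, 12.0, 15.0, 30.0, 34.0, 50.0],
})

# One box per category of "drive", summarizing the spread of "price".
ax = sns.boxplot(x="drive", y="price", data=df)
```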

Grouping by attributes
Description: Select a group of attributes from a dataset to create a subset of the data.
Code example: df_group = df[['attribute_1','attribute_2',...]]

GroupBy statements
Description:
    a. Group the data by different categories of an attribute, displaying the average value of numerical attributes with the same category.
    b. Group the data by different categories of multiple attributes, displaying the average value of numerical attributes with the same category.
Code example:
    a) df_group = df_group.groupby(['attribute_1'], as_index=False).mean()
    b) df_group = df_group.groupby(['attribute_1', 'attribute_2'], as_index=False).mean()
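Both grouping variants can be checked by hand on a tiny table. The sketch below uses hypothetical columns drive, body, and price. In variant (a) the subset is limited to the grouping column and the numeric column, since recent pandas versions may refuse to average a text column.

```python
import pandas as pd

# Hypothetical toy data; column names are illustrative only.
df = pd.DataFrame({
    "drive": ["fwd", "fwd", "rwd", "rwd"],
    "body": ["sedan", "wagon", "sedan", "sedan"],
    "price": [10.0, 20.0, 30.0, 50.0],
})

# a) Average price per category of a single attribute.
by_drive = df[["drive", "price"]].groupby(["drive"], as_index=False).mean()

# b) Average price per combination of two attributes.
df_group = df[["drive", "body", "price"]]
by_both = df_group.groupby(["drive", "body"], as_index=False).mean()

print(by_drive)
print(by_both)
```

With as_index=False the grouping keys stay as ordinary columns, which keeps the result ready for a later pivot.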

Pivot Tables
Description: Create Pivot tables for better representation of data based on parameters.
Code example: grouped_pivot = df_group.pivot(index='attribute_1', columns='attribute_2')
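To see what the pivot produces, the sketch below reshapes a small grouped table. The table and its column names are hypothetical. Combinations that do not occur in the data appear as NaN in the result.

```python
import pandas as pd

# Hypothetical grouped data: one row per (drive, body) combination.
df_group = pd.DataFrame({
    "drive": ["fwd", "fwd", "rwd"],
    "body": ["sedan", "wagon", "sedan"],
    "price": [10.0, 20.0, 40.0],
})

# Rows become "drive" categories, columns become "body" categories.
grouped_pivot = df_group.pivot(index="drive", columns="body")

print(grouped_pivot)
```

Because no values column is named, the remaining column (price) ends up as the top level of a two-level column index, so a cell is addressed as grouped_pivot.loc["fwd", ("price", "wagon")].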

Pseudocolor plot
Description: Create a heatmap image using a pseudocolor plot (or pcolor) using the pivot table as data.
Code example:
    from matplotlib import pyplot as plt
    plt.pcolor(grouped_pivot, cmap='RdBu')
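A heatmap of a pivot table could be sketched as follows. The table here is a hypothetical stand-in for grouped_pivot, converted to a NumPy array before plotting; the tick labels are added by hand because pcolor itself only sees numbers. The Agg backend is an assumption for running without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
from matplotlib import pyplot as plt
import pandas as pd

# Hypothetical stand-in for a pivot table: rows are one attribute's
# categories, columns are another's, cells are average values.
grouped_pivot = pd.DataFrame(
    [[10.0, 20.0], [40.0, 35.0]],
    index=["fwd", "rwd"],
    columns=["sedan", "wagon"],
)

fig, ax = plt.subplots()
mesh = ax.pcolor(grouped_pivot.to_numpy(), cmap="RdBu")
fig.colorbar(mesh, ax=ax)

# Center each tick on its cell and label it with the category name.
ax.set_xticks([i + 0.5 for i in range(grouped_pivot.shape[1])])
ax.set_xticklabels(list(grouped_pivot.columns))
ax.set_yticks([i + 0.5 for i in range(grouped_pivot.shape[0])])
ax.set_yticklabels(list(grouped_pivot.index))
```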

Pearson Coefficient and p-value
Description: Calculate the Pearson Coefficient and p-value of a pair of attributes.
Code example:
    from scipy import stats
    pearson_coef, p_value = stats.pearsonr(df['attribute_1'], df['attribute_2'])
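The Pearson calculation can be run on a handful of points to get a feel for the two returned values. The data below is hypothetical and chosen to be nearly linear, so the coefficient should come out close to 1 with a small p-value.

```python
import pandas as pd
from scipy import stats

# Hypothetical toy data with a strong, nearly linear relationship.
df = pd.DataFrame({
    "engine_size": [1.0, 2.0, 3.0, 4.0, 5.0],
    "price": [12.0, 19.0, 33.0, 38.0, 52.0],
})

# pearsonr returns the correlation coefficient and the two-sided p-value.
pearson_coef, p_value = stats.pearsonr(df["engine_size"], df["price"])

print(f"Pearson coefficient: {pearson_coef:.4f}, p-value: {p_value:.4f}")
```

A coefficient near +1 or -1 indicates a strong linear relationship, and a small p-value suggests the correlation is unlikely to be due to chance alone.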

