Python Data Analysis: Exploratory Data Analysis
Python Data Analysis: Exploratory Data Analysis
df.corr()
Complete dataframe correlation Correlation matrix created using all the attributes of the dataset.
df[['attribute1','attribute2',...]].corr()
Specific Attribute correlation Correlation matrix created using specific attributes of the dataset.
Create a scatter plot using the data points of the dependent from matlplotlib import pyplot as
Scatter Plot variable along the x-axis and the independent variable along the plt plt.scatter(df[['attribute_1']],df[['attribute_2']])
y-axis.
Uses the dependent and independent variables in a Pandas data import seaborn as sns
Regression Plot frame to create a scatter plot with a generated linear regression sns.regplot(x='attribute_1',y='attribute_2', data=df)
line for the data.
Create a box-and-whisker plot that uses the pandas dataframe, import seaborn as sns
Box plot sns.boxplot(x='attribute_1',y='attribute_2', data=df)
the dependent, and the independent variables.
Create Pivot tables for better representation of data based on grouped_pivot = df_group.pivot(index='attribute_1',columns='attribute_2')
Pivot Tables
parameters
Create a heatmap image using a PsuedoColor plot (or pcolor) from matlplotlib import pyplot as plt
Pseudocolor plot plt.pcolor(grouped_pivot, cmap='RdBu')
using the pivot table as data.
about:blank 1/1