Data Analytics Using Python Libraries, Pandas
and Matplotlib
We’ll use a car.csv dataset and
perform exploratory data analysis using Pandas
and Matplotlib library functions to manipulate and
visualize the data and find insights.
1. Import the libraries
2. . Load the dataset using
pandas read_csv() function.
3. Display the head of the dataset using
the head() function.
4. Display the bottom 5 rows from the dataset
using the tail() function.
5. Print summary statistics of the dataset using
the describe() function.
6.Plot a histogram for all the variables.
7. Box plot to visualize the relationship between
vehicle size and engine hp.
8. Build a pair plot using the seaborn library
9. Drop irrelevant columns from the dataset
using drop() function.
10. Use rename() function to rename the columns.
11. Print the total number of duplicate rows.
12. Remove the duplicate rows using
the drop_duplicates() function.
13. Drop the missing values from the dataset.
14. Plot a histogram to find the number of cars per
brand.
15. Draw a correlation plot between the variables.