0% found this document useful (0 votes)
7 views

1.1. Data Analysis with Python- importing Datasets 1

Uploaded by

namdudotran1
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

1.1. Data Analysis with Python- importing Datasets 1

Uploaded by

namdudotran1
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 14

Importing Datasets

Objectives
 Analyze Python data using a dataset
 Identify three Python libraries and describe their
uses

Chapter1- Importing Datasets 2


The Problem
 Why Data Analysis:
 Data is collected everywhere around us:
 Collected manually by scientists or collected digitally,

 Every time you click on a website, or your mobile device.

 Data does NOT mean information.


 Data analysis (data science) help us unlock the information
and insights from raw data to answer our questions.
 Data analysis plays an important role:
 Discover useful information from the data.

 Answer questions.

 Predict the future or the unknown.

3
Chapter1- Importing Datasets
The Problem
 Analysis the scenario- How can we help Mr. Hoa
determine the best price for his car?

Hoa

 Give your opinions: estimate used car prices

Chapter1- Importing Datasets 4


The Problem
 Let's say we have a friend named Hoa. And Hoa
wants to sell his car. But the problem is he doesn't
know how much he should sell his car for.
 Hoa wants to sell his car for as much as he can.
 But he also wants to set the price reasonably, so
someone would want to purchase it.
 So the price he sets should represent the value of
the car.

Chapter1- Importing Datasets 5


The Problem
 Let's think like data scientists and clearly define some
of his problems:
 Is there data on the prices of other cars and their characteristics?
 What features of cars affect their prices?
 Color, Brand?
 Does horsepower also effect the selling price, or perhaps
something else?

 As a data analyst or data scientist, these are some of


the questions we can start thinking about. To answer
these questions, we're going to need some data.

Chapter1- Importing Datasets 6


Understanding the Data
 We'll be looking at the dataset on used car prices.
 The dataset used in this course is an open dataset by Jeffrey
C. Schlemmer.

 https://archive.ics.uci.edu/ml/machine-learning-databases/aut
os/

Chapter1- Importing Datasets 7


Understanding the Data
 Each of the attributes in the datasets

 https://archive.ics.uci.edu/ml/machine-learning-
databases/autos/imports-85.names

Chapter1- Importing Datasets 8


Python Packages for Data
Science
 A Python library is a collection of functions and methods:
perform lots of actions without writing any code.
 The libraries usually contain built in modules providing
different functionalities which you can use directly.

 The Python data analysis libraries is divided three groups.


 Scientific computing libraries
 Visualization libraries
 Algorithmic libraries

Chapter1- Importing Datasets 9


Python Packages for Data
Science
 Scientific computing libraries

Chapter1- Importing Datasets 10


Python Packages for Data
Science
 Visualization libraries

Chapter1- Importing Datasets 11


Python Packages for Data
Science
 Algorithmic libraries

Chapter1- Importing Datasets 12


Summary
 Analyze Python data using a dataset
 Identify three Python libraries and describe their uses

Chapter1- Importing Datasets 13


Q&A

Chapter1- Importing Datasets 14

You might also like