chennojivamshi03@gmail.
com
TSK9XIC207
Uber Data Analysis with Python
Sampriti Chatterjee (Great Learning)
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Agenda
1 Why do we need data science? 7 Basic Syntax for Python
2 What is Data science?
8 What is machine Learning?
3 Life cycle of Data science
chennojivamshi03@gmail.com 9 Supervised Learning: Linear
TSK9XIC207
Regression
4 Why Python is so popular? 10
Uber data analysis to predict price
5 Install python
6 Statistical visualization on Python user
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Why do we need Data Science?
• In the past, we used to have data in a structured
format but now as the volume of the data is increasing,
How data science is so the number of structured data becomes very less,
effecting our
so to handle the massive amount of data we need data
everyday life?
science techniques
• Those data can be used to get the proper business
insights and the hidden trends from them.
chennojivamshi03@gmail.com
TSK9XIC207 • These insights helps the organization to predict the
Future
• Using data science decision making can be faster and
effective
• Helps to reduce the production cost
• Build model based on the data to give the ability to the
machine to predicts on its own
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
What is Data Science?
Data science is a process to get some meaningful
information from the massive amount of data. In simple
terms, read and study the data to get proper intuitive
chennojivamshi03@gmail.com
TSK9XIC207 insights. Data Science is a mixture of various tools,
algorithms, and machine learning and deep learning
concepts to discover hidden patterns from the raw and
unstructured data
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Life cycle of Data Science?
Understand the business
problem Data Acquisition Data Cleaning
chennojivamshi03@gmail.com
TSK9XIC207
Exploratory data Analysis
Deploy the model Predict your model accuracy Machine Learning Algorithm
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Most Popular Programming Languages For Data Science
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Introduction to Python
Python is a popular high level, object oriented and interpreted language
chennojivamshi03@gmail.com
TSK9XIC207
High level Interpreted
Object oriented
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
History of Python
But, why the Important Facts
language called as • Python is invented by Guido van Rossum
Python?
in 1989
• Rossum used to love watching comedy
movies from late seventies
chennojivamshi03@gmail.com • He needed a short, unique, and slightly
TSK9XIC207
mysterious name for his language
• In that time he was watching Monty
Inventor of Python Python’s Flying Circus and from that
series he decided to keep his language
name python.
• This how Python invented
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Why should you learn Python?
Python is simple and beginner
3 friendly language
1 Web development using
Python
chennojivamshi03@gmail.com
TSK9XIC207
5 Graphical user interface
4
2 Length of the program is
Mathematical computation
short
can be done easily
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Why Python is so popular?
1 Largest community for Learners and Collaborators 2 Open source
Easy to learn and usable flexibility
chennojivamshi03@gmail.com Huge numbers of Python libraries and
3
TSK9XIC207 4
Frame work
5 Supports Big Data, Machine Learning 6
Supports Automation
and Cloud computing
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Installing Python
This is the site to install Python -> https://www.python.org/downloads/
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Popular IDE for Python: Pycharm
Site to install Python ->
https://www.jetbrains.com/pycharm/download/#section=mac
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Popular IDE for Python: Anaconda
Anaconda installation site->
https://www.anaconda.com/products/individual
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Popular IDE for Python: Google colab
Google collaboratory link->
https://colab.research.google.com/notebooks/intro.ipynb
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Statistical measurement on Python user
chennojivamshi03@gmail.com
TSK9XIC207
In recent time it is prominent that Python is one of the most popular language because of
it’s simplicity
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
chennojivamshi03@gmail.com
TSK9XIC207
Uber Data Analysis
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Exploratory Data Analysis
Exploratory data analysis is basically a technique to understand your
data using statistics concept
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Feature Engineering
Feature engineering process is basically used to create new feature from
the existing data which helps to understand the data more deeply
chennojivamshi03@gmail.com
TSK9XIC207
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
What is Machine Learning?
Machine learning is a sub-set of artificial intelligence (AI) that allows the system to
automatically learn and improve from experience without being explicitly programmed
chennojivamshi03@gmail.com
TSK9XIC207
Training Data Model Building Testing Data
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Traditional Vs Machine Learning
Traditional Programming Machine Learning
chennojivamshi03@gmail.com
Data
TSK9XIC207
Data
Model
Output
Program Output
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
Types Of Machine Learning
Supervised Learning
Unsupervised
chennojivamshi03@gmail.com Learning
TSK9XIC207
Reinforcement
Learning
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
What is Supervised Learning?
Supervised learning works as a supervisor or teacher. Basically, In supervised learning, we teach or train the
machine with labeled data (that means data is already tagged with some predefined class). Then we test our
model with some unknown new set of data and predict the level for them
Learning from the labelled data and applying the knowledge to predict the
chennojivamshi03@gmail.com
label of the new data(test data), is known as Supervised Learning
TSK9XIC207
Types of Supervised Learning:
• Linear Regression
• Logistic regression
• Decision Tree
• Random Forest
• Naïve Bayes Classifier
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
What is Linear Regression?
Regression stands for to model a target value based on independent variables and Linear Regression
is used to find the relationship between dependent(y) and independent variable(x)
Linear regression is a supervised machine learning algorithm
chennojivamshi03@gmail.com
Always works with continuous value
TSK9XIC207
Formula: y = mx+c m=slope of line and c= intercept
Main target for linear regression to find the best value for X and Y
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited
chennojivamshi03@gmail.com
TSK9XIC207
Thank You
This file is meant for personal use by chennojivamshi03@gmail.com only.
Proprietary content.
Sharing or publishing the©Great Learning.
contents in part or full All Rights
is liable Reserved.
for legal action. Unauthorized use or distribution prohibited