Ch4 Slides Python Learn

This document provides an introduction to NumPy for data science applications in Python. It discusses how NumPy arrays can be used for fast, element-wise calculations on large datasets compared to native Python lists. NumPy allows performing common statistical operations like calculating the mean, median, and standard deviation over entire arrays efficiently. Two-dimensional NumPy arrays are also introduced, which allow storing and accessing multiple related datasets. NumPy's various subsetting capabilities make it easy to extract specific elements, rows or columns from large numeric datasets.

Uploaded by

Achal Bi

INTRO TO PYTHON FOR DATA SCIENCE

NumPy

Lists Recap
● Powerful
● Collection of values
● Hold different types
● Change, add, remove
● Need for Data Science
● Mathematical operations over collections
● Speed

Illustration
In [1]: height = [1.73, 1.68, 1.71, 1.89, 1.79]

In [2]: height
Out[2]: [1.73, 1.68, 1.71, 1.89, 1.79]

In [3]: weight = [65.4, 59.2, 63.6, 88.4, 68.7]

In [4]: weight
Out[4]: [65.4, 59.2, 63.6, 88.4, 68.7]

In [5]: weight / height ** 2

TypeError: unsupported operand type(s) for **: 'list' and 'int'
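
The error above arises because Python lists do not define arithmetic between whole collections. As a minimal sketch of what plain Python requires instead, the values can be paired up and processed one at a time. The name `bmi_list` is introduced here for illustration only and does not appear in the slides.

```python
# Pure-Python workaround: compute BMI one (weight, height) pair at a time.
height = [1.73, 1.68, 1.71, 1.89, 1.79]  # metres
weight = [65.4, 59.2, 63.6, 88.4, 68.7]  # kilograms

# zip pairs each weight with the height at the same position.
# bmi_list is a hypothetical name used only in this sketch.
bmi_list = [w / h ** 2 for w, h in zip(weight, height)]

# Round only for display; the stored values keep full precision.
print([round(b, 3) for b in bmi_list])
```

This works, but every calculation needs its own explicit loop or comprehension, which is the inconvenience the next slide addresses.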

Solution: NumPy
● Numeric Python
● Alternative to Python List: NumPy Array
● Calculations over entire arrays
● Easy and Fast
● Installation
● In the terminal: pip3 install numpy
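
The "Fast" claim can be checked informally with a small timing sketch. The array size, the repeat count, and the function names `bmi_with_lists` and `bmi_with_numpy` are all assumptions chosen for illustration; the actual timings depend on the machine, so no figures are quoted here.

```python
import timeit

import numpy as np

# Hypothetical test data: 100,000 identical height/weight pairs.
n = 100_000
heights = [1.75] * n
weights = [60.0] * n
np_heights = np.array(heights)
np_weights = np.array(weights)


def bmi_with_lists():
    # One Python-level operation per element.
    return [w / h ** 2 for w, h in zip(weights, heights)]


def bmi_with_numpy():
    # A single vectorised expression over the whole array.
    return np_weights / np_heights ** 2


# Run each version 10 times and report the total elapsed time.
list_time = timeit.timeit(bmi_with_lists, number=10)
numpy_time = timeit.timeit(bmi_with_numpy, number=10)
print(f"lists: {list_time:.4f} s, NumPy: {numpy_time:.4f} s")
```

On typical hardware the NumPy version is expected to finish noticeably sooner, but the exact ratio varies between systems.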

NumPy
In [6]: import numpy as np

In [7]: np_height = np.array(height)

In [8]: np_height
Out[8]: array([ 1.73, 1.68, 1.71, 1.89, 1.79])

In [9]: np_weight = np.array(weight)

In [10]: np_weight
Out[10]: array([ 65.4, 59.2, 63.6, 88.4, 68.7])

In [11]: bmi = np_weight / np_height ** 2

In [12]: bmi
Out[12]: array([ 21.852, 20.975, 21.75 , 24.747, 21.441])
● Element-wise calculations: the first entry of bmi is 65.4 / 1.73 ** 2

Comparison
In [13]: height = [1.73, 1.68, 1.71, 1.89, 1.79]

In [14]: weight = [65.4, 59.2, 63.6, 88.4, 68.7]

In [15]: weight / height ** 2

TypeError: unsupported operand type(s) for **: 'list' and 'int'

In [16]: np_height = np.array(height)

In [17]: np_weight = np.array(weight)

In [18]: np_weight / np_height ** 2

Out[18]: array([ 21.852, 20.975, 21.75 , 24.747, 21.441])
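
The comparison can be reproduced as a single script. This is a sketch rather than the course's own code: the `try`/`except` wrapper and the name `list_error` are additions that let the script continue after the list version fails.

```python
import numpy as np

height = [1.73, 1.68, 1.71, 1.89, 1.79]
weight = [65.4, 59.2, 63.6, 88.4, 68.7]

# list_error is a hypothetical name used to record the failure message.
list_error = None
try:
    weight / height ** 2  # lists do not support ** or /
except TypeError as err:
    list_error = str(err)
    print("Lists:", list_error)

# The same expression works element-wise once the data are NumPy arrays.
np_height = np.array(height)
np_weight = np.array(weight)
bmi = np_weight / np_height ** 2
print("Arrays:", np.round(bmi, 3))
```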

NumPy: remarks
In [19]: np.array([1.0, "is", True])
Out[19]:
array(['1.0', 'is', 'True'],
      dtype='<U32')
NumPy arrays: contain only one type

In [20]: python_list = [1, 2, 3]

In [21]: numpy_array = np.array([1, 2, 3])

Different types: different behavior!
In [22]: python_list + python_list
Out[22]: [1, 2, 3, 1, 2, 3]

In [23]: numpy_array + numpy_array

Out[23]: array([2, 4, 6])
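
Both remarks can be verified in a short script. The variable names `mixed`, `flags`, `concatenated`, and `summed` are illustrative additions; the extra `flags` example is an assumption added here to show that coercion also happens between booleans and integers, not only towards strings.

```python
import numpy as np

# Mixed input is coerced to one common type, here Unicode strings.
mixed = np.array([1.0, "is", True])
print(mixed, mixed.dtype.kind)  # kind 'U' stands for Unicode string

# Booleans mixed with integers are coerced to integers.
flags = np.array([True, 1, 2])
print(flags)

python_list = [1, 2, 3]
numpy_array = np.array([1, 2, 3])

# + concatenates lists but adds arrays element by element.
concatenated = python_list + python_list
summed = numpy_array + numpy_array
print(concatenated)
print(summed)
```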

NumPy Subsetting
In [24]: bmi
Out[24]: array([ 21.852, 20.975, 21.75 , 24.747, 21.441])

In [25]: bmi[1]
Out[25]: 20.975

In [26]: bmi > 23

Out[26]: array([False, False, False, True, False], dtype=bool)

In [27]: bmi[bmi > 23]

Out[27]: array([ 24.747])
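
The subsetting steps above can be run end to end as follows. This is a minimal sketch; the intermediate name `mask` is an addition for readability and does not appear in the slides.

```python
import numpy as np

np_height = np.array([1.73, 1.68, 1.71, 1.89, 1.79])
np_weight = np.array([65.4, 59.2, 63.6, 88.4, 68.7])
bmi = np_weight / np_height ** 2

# Integer index: a single element, counted from zero.
second = bmi[1]

# Comparison: one boolean per element (mask is a hypothetical name).
mask = bmi > 23

# Boolean index: keeps only the elements where the mask is True.
high = bmi[mask]

print(round(second, 3))
print(mask)
print(np.round(high, 3))
```

Storing the comparison in a variable first makes it easy to reuse the same condition on another array of equal length, for example `np_height[mask]`.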

Let’s practice!

2D NumPy Arrays

Type of NumPy Arrays

In [1]: import numpy as np

In [2]: np_height = np.array([1.73, 1.68, 1.71, 1.89, 1.79])

In [3]: np_weight = np.array([65.4, 59.2, 63.6, 88.4, 68.7])

In [4]: type(np_height)
Out[4]: numpy.ndarray
ndarray = N-dimensional array
In [5]: type(np_weight)
Out[5]: numpy.ndarray

2D NumPy Arrays
In [6]: np_2d = np.array([[1.73, 1.68, 1.71, 1.89, 1.79],
[65.4, 59.2, 63.6, 88.4, 68.7]])

In [7]: np_2d
Out[7]:
array([[ 1.73, 1.68, 1.71, 1.89, 1.79],
[ 65.4 , 59.2 , 63.6 , 88.4 , 68.7 ]])

In [8]: np_2d.shape
Out[8]: (2, 5)
2 rows, 5 columns

In [9]: np.array([[1.73, 1.68, 1.71, 1.89, 1.79],
                  [65.4, 59.2, 63.6, 88.4, "68.7"]])
Out[9]:
array([['1.73', '1.68', '1.71', '1.89', '1.79'],
       ['65.4', '59.2', '63.6', '88.4', '68.7']],
      dtype='<U32')
Single type!
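
A short script makes the shape and single-type behaviour easy to inspect. The names `as_text` and `restored` are illustrative, and the `astype(float)` step is an addition not covered in the slides: it is one way to recover numbers if a stray string has turned the whole array into text.

```python
import numpy as np

np_2d = np.array([[1.73, 1.68, 1.71, 1.89, 1.79],
                  [65.4, 59.2, 63.6, 88.4, 68.7]])
print(np_2d.shape)  # (rows, columns)
print(np_2d.dtype)

# A single string among the values turns every element into a string.
as_text = np.array([[1.73, 1.68, 1.71, 1.89, 1.79],
                    [65.4, 59.2, 63.6, 88.4, "68.7"]])
print(as_text.dtype.kind)  # 'U' stands for Unicode string

# Assumption: all strings are valid numbers, so they convert back cleanly.
restored = as_text.astype(float)
print(restored.dtype)
```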

Subsetting
          0      1      2      3      4      (column index)
array([[  1.73,  1.68,  1.71,  1.89,  1.79],      row 0
       [ 65.4 , 59.2 , 63.6 , 88.4 , 68.7 ]])     row 1

In [10]: np_2d[0]
Out[10]: array([ 1.73, 1.68, 1.71, 1.89, 1.79])

In [11]: np_2d[0][2]
Out[11]: 1.71

In [12]: np_2d[0,2]
Out[12]: 1.71

In [13]: np_2d[:,1:3]
Out[13]:
array([[ 1.68, 1.71],
[ 59.2 , 63.6 ]])

In [14]: np_2d[1,:]
Out[14]: array([ 65.4, 59.2, 63.6, 88.4, 68.7])
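
The 2D subsetting forms shown above can be collected into one runnable sketch. All variable names on the left-hand side are illustrative; the final BMI line is an addition that combines row selection with the element-wise arithmetic from the first part of the chapter.

```python
import numpy as np

np_2d = np.array([[1.73, 1.68, 1.71, 1.89, 1.79],   # row 0: heights
                  [65.4, 59.2, 63.6, 88.4, 68.7]])  # row 1: weights

first_row = np_2d[0]         # the whole first row
chained = np_2d[0][2]        # select row 0, then element 2 of that row
direct = np_2d[0, 2]         # same element using row, column in one step
middle_cols = np_2d[:, 1:3]  # all rows, columns 1 and 2 (3 is excluded)
weights = np_2d[1, :]        # the whole second row

# Rows are 1D arrays, so element-wise arithmetic applies to them directly.
bmi = np_2d[1] / np_2d[0] ** 2

print(first_row)
print(chained, direct)
print(middle_cols)
print(weights)
print(np.round(bmi, 3))
```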

Let’s practice!

NumPy: Basic Statistics


Data analysis
● Get to know your data
● Little data -> simply look at it
● Big data -> ?

City-wide survey
In [1]: import numpy as np

In [2]: np_city = ... # Implementation left out

In [3]: np_city
Out[3]:
array([[ 1.64, 71.78],
[ 1.37, 63.35],
[ 1.6 , 55.09],
...,
[ 2.04, 74.85],
[ 2.04, 68.72],
[ 2.01, 73.57]])

NumPy
In [4]: np.mean(np_city[:,0])
Out[4]: 1.7472

In [5]: np.median(np_city[:,0])
Out[5]: 1.75

In [6]: np.corrcoef(np_city[:,0], np_city[:,1])

Out[6]:
array([[ 1. , -0.01802],
[-0.01803, 1. ]])

In [7]: np.std(np_city[:,0])
Out[7]: 0.1992

● sum(), sort(), ...

● Enforce single data type: speed!
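
Because the survey data are not included in the slides, the summary functions can be tried on a small stand-in array instead. The array `np_small` below is a hypothetical substitute built from the five height and weight pairs used earlier, so its statistics differ from the survey figures shown above.

```python
import numpy as np

# Hypothetical stand-in for np_city: one row per person, (height, weight).
np_small = np.array([[1.73, 65.4],
                     [1.68, 59.2],
                     [1.71, 63.6],
                     [1.89, 88.4],
                     [1.79, 68.7]])

heights = np_small[:, 0]  # first column
weights = np_small[:, 1]  # second column

mean_h = np.mean(heights)
median_h = np.median(heights)
std_h = np.std(heights)  # population standard deviation (ddof=0) by default
corr = np.corrcoef(heights, weights)  # 2 x 2 correlation matrix

print(mean_h, median_h, std_h)
print(corr)
```

The off-diagonal entries of `corr` hold the correlation between height and weight; the diagonal entries are each variable's correlation with itself.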

Generate data
Arguments of np.random.normal: distribution mean, distribution standard deviation, number of samples

In [8]: height = np.round(np.random.normal(1.75, 0.20, 5000), 2)

In [9]: weight = np.round(np.random.normal(60.32, 15, 5000), 2)

In [10]: np_city = np.column_stack((height, weight))
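
Random data change on every run, which makes results hard to compare. As a sketch of one way to make the generation repeatable, the version below uses NumPy's `default_rng` generator with a fixed seed in place of the `np.random.normal` call from the slide. The seed value 42 is an arbitrary assumption.

```python
import numpy as np

# A seeded generator yields the same samples on every run.
# The seed 42 is arbitrary and chosen only for this sketch.
rng = np.random.default_rng(42)

# Arguments: distribution mean, standard deviation, number of samples.
height = np.round(rng.normal(1.75, 0.20, 5000), 2)
weight = np.round(rng.normal(60.32, 15, 5000), 2)

# Pair the two 1D arrays as columns of a single 5000 x 2 array.
np_city = np.column_stack((height, weight))

print(np_city.shape)
print(np.mean(np_city[:, 0]), np.std(np_city[:, 0]))
```

With 5000 samples, the sample mean and standard deviation should land close to the requested 1.75 and 0.20, though not exactly on them.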


Let’s practice!
