Chapter02 Mathematical-Building-Blocks

This is a companion notebook for the book Deep Learning with Python, Second Edition. For readability, it only contains runnable code blocks and section titles, and omits everything else in the book: text paragraphs, figures, and pseudocode. If you want to be able to follow what's going on, I recommend reading the notebook side by side with your copy of the book.

This notebook was generated for TensorFlow 2.6.

The mathematical building blocks of neural networks


A first look at a neural network
Loading the MNIST dataset in Keras
from tensorflow.keras.datasets import mnist
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()

train_images.shape  # (60000, 28, 28)

len(train_labels)  # 60000

train_labels  # array([5, 0, 4, ..., 5, 6, 8], dtype=uint8)

test_images.shape  # (10000, 28, 28)

len(test_labels)  # 10000

test_labels  # array([7, 2, 1, ..., 4, 5, 6], dtype=uint8)

The network architecture


from tensorflow import keras
from tensorflow.keras import layers
model = keras.Sequential([
    layers.Dense(512, activation="relu"),
    layers.Dense(10, activation="softmax")
])

The compilation step


model.compile(optimizer="rmsprop",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

Preparing the image data


train_images = train_images.reshape((60000, 28 * 28))
train_images = train_images.astype("float32") / 255
test_images = test_images.reshape((10000, 28 * 28))
test_images = test_images.astype("float32") / 255

"Fitting" the model

model.fit(train_images, train_labels, epochs=5, batch_size=128)

Using the model to make predictions


test_digits = test_images[0:10]
predictions = model.predict(test_digits)
predictions[0]

predictions[0].argmax()  # 7

predictions[0][7]  # the probability the model assigns to class 7

test_labels[0]  # 7

Evaluating the model on new data


test_loss, test_acc = model.evaluate(test_images, test_labels)
print(f"test_acc: {test_acc}")

Data representations for neural networks


Scalars (rank-0 tensors)
import numpy as np
x = np.array(12)
x

x.ndim

Vectors (rank-1 tensors)


x = np.array([12, 3, 6, 14, 7])
x

x.ndim

Matrices (rank-2 tensors)


x = np.array([[5, 78, 2, 34, 0],
              [6, 79, 3, 35, 1],
              [7, 80, 4, 36, 2]])
x.ndim

Rank-3 and higher-rank tensors


x = np.array([[[5, 78, 2, 34, 0],
               [6, 79, 3, 35, 1],
               [7, 80, 4, 36, 2]],
              [[5, 78, 2, 34, 0],
               [6, 79, 3, 35, 1],
               [7, 80, 4, 36, 2]],
              [[5, 78, 2, 34, 0],
               [6, 79, 3, 35, 1],
               [7, 80, 4, 36, 2]]])
x.ndim

Key attributes
from tensorflow.keras.datasets import mnist
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()

train_images.ndim  # 3

train_images.shape  # (60000, 28, 28)

train_images.dtype  # uint8

Displaying the fourth digit


import matplotlib.pyplot as plt
digit = train_images[4]
plt.imshow(digit, cmap=plt.cm.binary)
plt.show()

train_labels[4]

Manipulating tensors in NumPy


my_slice = train_images[10:100]
my_slice.shape

my_slice = train_images[10:100, :, :]
my_slice.shape

my_slice = train_images[10:100, 0:28, 0:28]


my_slice.shape

my_slice = train_images[:, 14:, 14:]  # the bottom-right 14 x 14 patch of every image

my_slice = train_images[:, 7:-7, 7:-7]  # the centered 14 x 14 patch of every image

The notion of data batches


batch = train_images[:128]  # the first batch

batch = train_images[128:256]  # the next batch

n = 3
batch = train_images[128 * n:128 * (n + 1)]  # the n-th batch

Real-world examples of data tensors

Vector data

Timeseries data or sequence data

Image data

Video data
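
The book covers these four cases in prose only. As a rough illustration (not part of the original notebook; the shapes are made up for the example), the corresponding tensors would look like this:

import numpy as np
vector_data = np.zeros((1000, 3))             # (samples, features)
timeseries_data = np.zeros((250, 390, 3))     # (samples, timesteps, features)
image_data = np.zeros((128, 256, 256, 3))     # (samples, height, width, channels)
video_data = np.zeros((4, 240, 144, 256, 3))  # (samples, frames, height, width, channels)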
The gears of neural networks: tensor operations
Element-wise operations
def naive_relu(x):
    assert len(x.shape) == 2  # x is a rank-2 NumPy tensor
    x = x.copy()  # avoid overwriting the input tensor
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            x[i, j] = max(x[i, j], 0)
    return x

def naive_add(x, y):
    assert len(x.shape) == 2  # x and y are rank-2 NumPy tensors
    assert x.shape == y.shape
    x = x.copy()  # avoid overwriting the input tensor
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            x[i, j] += y[i, j]
    return x

import time

x = np.random.random((20, 100))
y = np.random.random((20, 100))

t0 = time.time()
for _ in range(1000):
    z = x + y
    z = np.maximum(z, 0.)
print("Took: {0:.2f} s".format(time.time() - t0))

t0 = time.time()
for _ in range(1000):
    z = naive_add(x, y)
    z = naive_relu(z)
print("Took: {0:.2f} s".format(time.time() - t0))

Broadcasting
import numpy as np
X = np.random.random((32, 10))  # X is a random matrix with shape (32, 10)
y = np.random.random((10,))     # y is a random vector with shape (10,)

y = np.expand_dims(y, axis=0)   # the shape of y is now (1, 10)

Y = np.concatenate([y] * 32, axis=0)  # repeat y 32 times along axis 0 to obtain Y with shape (32, 10)

def naive_add_matrix_and_vector(x, y):
    assert len(x.shape) == 2  # x is a rank-2 NumPy tensor
    assert len(y.shape) == 1  # y is a NumPy vector
    assert x.shape[1] == y.shape[0]
    x = x.copy()  # avoid overwriting the input tensor
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            x[i, j] += y[j]
    return x

import numpy as np
x = np.random.random((64, 3, 32, 10))
y = np.random.random((32, 10))
z = np.maximum(x, y)  # y is broadcast over the first two axes of x; z has shape (64, 3, 32, 10)

Tensor product
x = np.random.random((32,))
y = np.random.random((32,))
z = np.dot(x, y)

def naive_vector_dot(x, y):
    assert len(x.shape) == 1  # x and y are NumPy vectors
    assert len(y.shape) == 1
    assert x.shape[0] == y.shape[0]
    z = 0.
    for i in range(x.shape[0]):
        z += x[i] * y[i]
    return z

def naive_matrix_vector_dot(x, y):
    assert len(x.shape) == 2  # x is a NumPy matrix
    assert len(y.shape) == 1  # y is a NumPy vector
    assert x.shape[1] == y.shape[0]  # the first dimension of x must match the 0th dimension of y
    z = np.zeros(x.shape[0])  # the result is a vector of 0s with as many entries as x has rows
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            z[i] += x[i, j] * y[j]
    return z

def naive_matrix_vector_dot(x, y):
    z = np.zeros(x.shape[0])
    for i in range(x.shape[0]):
        z[i] = naive_vector_dot(x[i, :], y)  # reuse the vector dot product, row by row
    return z

def naive_matrix_dot(x, y):
    assert len(x.shape) == 2  # x and y are NumPy matrices
    assert len(y.shape) == 2
    assert x.shape[1] == y.shape[0]  # the first dimension of x must match the 0th dimension of y
    z = np.zeros((x.shape[0], y.shape[1]))  # the result is a matrix of 0s with shape (x.shape[0], y.shape[1])
    for i in range(x.shape[0]):
        for j in range(y.shape[1]):
            row_x = x[i, :]
            column_y = y[:, j]
            z[i, j] = naive_vector_dot(row_x, column_y)
    return z

Tensor reshaping
train_images = train_images.reshape((60000, 28 * 28))

x = np.array([[0., 1.],
              [2., 3.],
              [4., 5.]])
x.shape

x = x.reshape((6, 1))
x

x = np.zeros((300, 20))
x = np.transpose(x)  # transposition exchanges rows and columns
x.shape  # (20, 300)

Geometric interpretation of tensor operations
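
The book develops this section with figures. As a minimal sketch of the idea (not in the original notebook), a counterclockwise rotation of a 2D point by an angle theta is just a dot product with a 2 x 2 rotation matrix:

import numpy as np
theta = np.pi / 4
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])  # rotation matrix for angle theta
point = np.array([1., 0.])
rotated_point = np.dot(R, point)  # approximately [0.707, 0.707]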

A geometric interpretation of deep learning

The engine of neural networks: gradient-based optimization


What's a derivative?
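
This section is prose in the book. As a small numeric sketch (not in the original notebook), the derivative of f at x can be approximated by the slope of f over a tiny interval eps around x:

def f(x):
    return x ** 2  # f'(x) = 2 * x

eps = 1e-6
x = 3.
approximate_derivative = (f(x + eps) - f(x)) / eps  # close to the analytic value 6.0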

Derivative of a tensor operation: the gradient

Stochastic gradient descent
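
As a minimal runnable sketch of the idea (not in the original notebook), gradient descent on the one-dimensional loss f(w) = (w - 3) ** 2 repeatedly nudges the parameter in the direction opposite its gradient:

w = 0.  # initial parameter value
learning_rate = 0.1
for step in range(50):
    gradient = 2 * (w - 3)  # analytic gradient of (w - 3) ** 2
    w -= learning_rate * gradient  # move against the gradient
w  # close to the minimum at 3.0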

Chaining derivatives: The Backpropagation algorithm

The chain rule
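
A quick numeric check of the chain rule (not in the original notebook): if y = f(g(x)), then dy/dx = f'(g(x)) * g'(x), which we can verify against a finite-difference approximation:

def g(x):
    return 3 * x + 1  # g'(x) = 3

def f(u):
    return u ** 2  # f'(u) = 2 * u

x = 2.
analytic = 2 * g(x) * 3  # f'(g(x)) * g'(x) = 42.0
eps = 1e-6
numeric = (f(g(x + eps)) - f(g(x))) / eps  # also close to 42.0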

Automatic differentiation with computation graphs

The gradient tape in TensorFlow


import tensorflow as tf
x = tf.Variable(0.)  # instantiate a scalar Variable with an initial value of 0
with tf.GradientTape() as tape:  # open a GradientTape scope
    y = 2 * x + 3  # inside the scope, apply some tensor operations to our variable
grad_of_y_wrt_x = tape.gradient(y, x)  # retrieve the gradient of y with respect to x

x = tf.Variable(tf.random.uniform((2, 2)))  # GradientTape works with tensor variables too
with tf.GradientTape() as tape:
    y = 2 * x + 3
grad_of_y_wrt_x = tape.gradient(y, x)  # a tensor of shape (2, 2), like x

W = tf.Variable(tf.random.uniform((2, 2)))
b = tf.Variable(tf.zeros((2,)))
x = tf.random.uniform((2, 2))
with tf.GradientTape() as tape:
    y = tf.matmul(x, W) + b
grad_of_y_wrt_W_and_b = tape.gradient(y, [W, b])  # a list of two tensors with the same shapes as W and b

Looking back at our first example
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()
train_images = train_images.reshape((60000, 28 * 28))
train_images = train_images.astype("float32") / 255
test_images = test_images.reshape((10000, 28 * 28))
test_images = test_images.astype("float32") / 255

model = keras.Sequential([
    layers.Dense(512, activation="relu"),
    layers.Dense(10, activation="softmax")
])

model.compile(optimizer="rmsprop",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

model.fit(train_images, train_labels, epochs=5, batch_size=128)

Reimplementing our first example from scratch in TensorFlow

A simple Dense class


import tensorflow as tf

class NaiveDense:
    def __init__(self, input_size, output_size, activation):
        self.activation = activation

        w_shape = (input_size, output_size)
        w_initial_value = tf.random.uniform(w_shape, minval=0, maxval=1e-1)
        self.W = tf.Variable(w_initial_value)  # a matrix W of shape (input_size, output_size), randomly initialized

        b_shape = (output_size,)
        b_initial_value = tf.zeros(b_shape)
        self.b = tf.Variable(b_initial_value)  # a vector b of shape (output_size,), initialized to zeros

    def __call__(self, inputs):
        return self.activation(tf.matmul(inputs, self.W) + self.b)  # apply the forward pass

    @property
    def weights(self):
        return [self.W, self.b]  # convenience method for retrieving the layer's weights

A simple Sequential class


class NaiveSequential:
    def __init__(self, layers):
        self.layers = layers

    def __call__(self, inputs):
        x = inputs
        for layer in self.layers:  # call the underlying layers in order
            x = layer(x)
        return x

    @property
    def weights(self):
        weights = []
        for layer in self.layers:
            weights += layer.weights
        return weights

model = NaiveSequential([
    NaiveDense(input_size=28 * 28, output_size=512, activation=tf.nn.relu),
    NaiveDense(input_size=512, output_size=10, activation=tf.nn.softmax)
])
assert len(model.weights) == 4  # two layers with a W and a b each

A batch generator
import math

class BatchGenerator:
    def __init__(self, images, labels, batch_size=128):
        assert len(images) == len(labels)
        self.index = 0
        self.images = images
        self.labels = labels
        self.batch_size = batch_size
        self.num_batches = math.ceil(len(images) / batch_size)

    def next(self):
        images = self.images[self.index : self.index + self.batch_size]
        labels = self.labels[self.index : self.index + self.batch_size]
        self.index += self.batch_size
        return images, labels
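
As a quick usage sketch (not part of the original notebook, and assuming the reshaped train_images and train_labels defined earlier), the generator walks through the data in consecutive slices:

generator = BatchGenerator(train_images, train_labels, batch_size=128)
images_batch, labels_batch = generator.next()
images_batch.shape  # (128, 784)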

Running one training step


def one_training_step(model, images_batch, labels_batch):
    with tf.GradientTape() as tape:  # run the forward pass under a GradientTape scope
        predictions = model(images_batch)
        per_sample_losses = tf.keras.losses.sparse_categorical_crossentropy(
            labels_batch, predictions)
        average_loss = tf.reduce_mean(per_sample_losses)
    gradients = tape.gradient(average_loss, model.weights)  # compute the gradient of the loss with regard to the weights
    update_weights(gradients, model.weights)  # update the weights using the gradients
    return average_loss

learning_rate = 1e-3

def update_weights(gradients, weights):
    for g, w in zip(gradients, weights):
        w.assign_sub(g * learning_rate)  # assign_sub is the equivalent of -= for TensorFlow variables

from tensorflow.keras import optimizers

optimizer = optimizers.SGD(learning_rate=1e-3)

def update_weights(gradients, weights):  # in practice, you would use a Keras Optimizer instead
    optimizer.apply_gradients(zip(gradients, weights))

The full training loop


def fit(model, images, labels, epochs, batch_size=128):
    for epoch_counter in range(epochs):
        print(f"Epoch {epoch_counter}")
        batch_generator = BatchGenerator(images, labels, batch_size)  # pass batch_size through to the generator
        for batch_counter in range(batch_generator.num_batches):
            images_batch, labels_batch = batch_generator.next()
            loss = one_training_step(model, images_batch, labels_batch)
            if batch_counter % 100 == 0:
                print(f"loss at batch {batch_counter}: {loss:.2f}")

from tensorflow.keras.datasets import mnist

(train_images, train_labels), (test_images, test_labels) = mnist.load_data()

train_images = train_images.reshape((60000, 28 * 28))
train_images = train_images.astype("float32") / 255
test_images = test_images.reshape((10000, 28 * 28))
test_images = test_images.astype("float32") / 255

fit(model, train_images, train_labels, epochs=10, batch_size=128)

Evaluating the model


predictions = model(test_images)
predictions = predictions.numpy()
predicted_labels = np.argmax(predictions, axis=1)
matches = predicted_labels == test_labels
print(f"accuracy: {matches.mean():.2f}")

Summary
