Lecture 6 – Computational Graphs; PyTorch and TensorFlow
DD2424
April 11, 2019
Outline
• First Part
• Computation Graphs
• TensorFlow
• PyTorch
• Notes
• Second Part
Frameworks
O’Reilly Poll: Most popular framework for machine learning
[Source: https://www.techrepublic.com/google-amp/article/most-popular-programming-language-frameworks-and-tools-for-machine-learning/]
What are computation graphs?
Computation Graph
• A DAG (directed acyclic graph)
• Nodes
• Variables
• Mathematical operations (ops)
• Edges
• Feed a variable, or the output of an op, as input to another op
Computation Graph
• $c = a + b$
Computation Graph
• $c = a + b * 2$, built as $z = b * 2$ and $c = a + z$
Computation Graph
• Tensors: multi-dimensional arrays
• $a = Wx + b$, built as $z = Wx$ and $a = z + b$
Computation Graph
• A feed-forward neural network
• One layer: $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$, with inputs $x$, $W_1$, $b_1$
Computation Graph
• A multi-layer feed-forward neural network
• Two layers: $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$; $z_2 = W_2 s_1$, $a_2 = z_2 + b_2$, $s_2 = \sigma(a_2)$
Python (NumPy)
• The same graph, $z = Wx$, $a = z + b$, implemented directly in NumPy
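A minimal NumPy sketch of this forward pass (the slide's own code is not reproduced here; shapes are chosen arbitrarily):

```python
import numpy as np

# Forward pass a = W x + b in plain NumPy (arbitrary small shapes)
np.random.seed(0)
W = np.random.randn(2, 3)
x = np.random.randn(3)
b = np.random.randn(2)

z = W.dot(x)   # z = W x
a = z + b      # a = z + b
print(a)
```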
PyTorch
• The same forward pass, $z = Wx$, $a = z + b$: the PyTorch code looks almost the same as the NumPy code
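A corresponding PyTorch sketch, to show how closely the Tensor API mirrors NumPy (again a reconstruction, not the slide's code):

```python
import torch

# The same forward pass with torch Tensors
torch.manual_seed(0)
W = torch.randn(2, 3)
x = torch.randn(3)
b = torch.randn(2)

z = W.matmul(x)   # z = W x  (matmul handles the matrix-vector case)
a = z + b         # a = z + b
print(a)
```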
PyTorch
• But the PyTorch and NumPy APIs do not always match!
PyTorch-NumPy
• Converting a Torch Tensor to a NumPy array and vice versa is a breeze.
• The Torch Tensor and the NumPy array share their underlying memory (for CPU tensors): changing one changes the other
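A small sketch of the conversion in both directions and of the shared memory (CPU tensors only):

```python
import numpy as np
import torch

t = torch.ones(5)
n = t.numpy()            # Tensor -> NumPy array, sharing the same memory
t.add_(1)                # in-place add on the tensor ...
print(n)                 # ... shows up in the array: [2. 2. 2. 2. 2.]

m = np.ones(3)
u = torch.from_numpy(m)  # NumPy array -> Tensor, also sharing memory
np.add(m, 1, out=m)
print(u)                 # tensor([2., 2., 2.], dtype=torch.float64)
```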
“Define by Run” Computation Graphs
This kind of computation graph is called “define by run”. It is also referred to as “dynamic”.
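A sketch of what “define by run” means in PyTorch: the graph is recorded while ordinary Python executes, so its structure can depend on the data:

```python
import torch

x = torch.randn(3, requires_grad=True)

# Ordinary Python control flow decides which graph gets built this time
if x.sum() > 0:
    y = (x * 2).sum()
else:
    y = (x ** 2).sum()

y.backward()      # backprop through whichever graph was actually run
print(x.grad)
```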
“Define and Run” Computation Graphs
• First define the graph structure
• Then run it by feeding in the (input) variables.
Define graph G: $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$
Run the graph G:
• Run G with $x_1, W_1, b_1$
• Run G with $x_2, W_2, b_2$
• …
Also known as “static graphs”
Define the graph once, then run the graph many times.
TensorFlow
• Static graph: the graph is defined once, and the data loop around the run calls sits outside the definition
• Dynamic graph: the graph is rebuilt inside the data loop, as the code runs
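A TensorFlow 1.x style sketch of “define and run” (placeholder names and shapes are chosen for illustration):

```python
import numpy as np
import tensorflow as tf   # TensorFlow 1.x API

# 1) Define the graph G once; nothing is computed yet
x = tf.placeholder(tf.float32, shape=[3, 1])
W = tf.placeholder(tf.float32, shape=[2, 3])
b = tf.placeholder(tf.float32, shape=[2, 1])
s1 = tf.sigmoid(tf.matmul(W, x) + b)

# 2) Run G many times, feeding in different values
with tf.Session() as sess:
    for _ in range(3):
        out = sess.run(s1, feed_dict={x: np.random.randn(3, 1),
                                      W: np.random.randn(2, 3),
                                      b: np.random.randn(2, 1)})
        print(out.ravel())
```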
Why computation graphs at all?!
Why computation graphs?
• In Lecture 3, you learnt how to do backprop by hand using the chain rule
Why computation graphs?
• Is it feasible to derive and implement all these gradients by hand for large networks?
Why computation graphs?
• Automatic chain rule
• Automatic backprop using the framework's implemented operations
• Each operation has its gradient already implemented
• If you want to use a novel operation, you have to provide its gradient w.r.t. its inputs and its learnable parameters (if any), as sketched below
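For example, in PyTorch a novel operation can be added by subclassing torch.autograd.Function and supplying both passes yourself (a standard sketch, here a hand-rolled ReLU):

```python
import torch

class MyReLU(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)      # remember the input for the backward pass
        return x.clamp(min=0)

    @staticmethod
    def backward(ctx, grad_output):   # you provide the gradient w.r.t. the input
        (x,) = ctx.saved_tensors
        grad_x = grad_output.clone()
        grad_x[x < 0] = 0
        return grad_x

x = torch.randn(4, requires_grad=True)
MyReLU.apply(x).sum().backward()
print(x.grad)                         # 1 where x > 0, 0 where x < 0
```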
Let’s look at examples in PyTorch and TensorFlow
Computation Graph
• A feed-forward neural network
• $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$
Computation Graph
• A feed-forward neural network with squared $L_2$ loss
• $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$, $l = \|s_1 - y\|^2$
Backprop in Computation Graph
• The learnable parameters are $W_1$ and $b_1$
• Graph: $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$, $l = \|s_1 - y\|^2$
Backprop in Computation Graph
• We want the gradients of the loss w.r.t. the learnable parameters: $\frac{\partial l}{\partial W_1}$ and $\frac{\partial l}{\partial b_1}$
• Graph: $z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$, $l = \|s_1 - y\|^2$
Backprop in Computation Graph
• Local gradients along the edges: $\frac{\partial z_1}{\partial W_1}$, $\frac{\partial a_1}{\partial z_1}$, $\frac{\partial s_1}{\partial a_1}$, $\frac{\partial l}{\partial s_1}$, and $\frac{\partial a_1}{\partial b_1}$
• The chain rule multiplies them backwards along each path: $\frac{\partial l}{\partial W_1} = \frac{\partial l}{\partial s_1}\frac{\partial s_1}{\partial a_1}\frac{\partial a_1}{\partial z_1}\frac{\partial z_1}{\partial W_1}$ and $\frac{\partial l}{\partial b_1} = \frac{\partial l}{\partial s_1}\frac{\partial s_1}{\partial a_1}\frac{\partial a_1}{\partial b_1}$
Backprop in Computation Graph
A deep learning framework automatically calculates the gradients of a graph's output variables w.r.t. its input variables.
Backprop in Computation Graph
• Addition Node
• Forward pass: $a = b + c$
• Backward pass: $\frac{\partial a}{\partial b} = 1$ and $\frac{\partial a}{\partial c} = 1$
Backprop in Computation Graph
• Max Node
• Forward pass: $a = \max(b, c)$
• Backward pass:
• If $b < c$: $\frac{\partial a}{\partial b} = 0$ and $\frac{\partial a}{\partial c} = 1$
• If $b > c$: $\frac{\partial a}{\partial b} = 1$ and $\frac{\partial a}{\partial c} = 0$
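These local gradients can be checked directly with autograd (a small PyTorch sketch):

```python
import torch

b = torch.tensor(2.0, requires_grad=True)
c = torch.tensor(5.0, requires_grad=True)

(b + c).backward()
print(b.grad, c.grad)        # tensor(1.) tensor(1.)

b.grad = None; c.grad = None
torch.max(b, c).backward()   # b < c, so the gradient flows only through c
print(b.grad, c.grad)        # tensor(0.) tensor(1.)
```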
Variables and Ops
• Ops
• Intermediate or final nodes
• Variables
• Intrinsic parameters of the model
• Inputs to the model
• In the example graph ($z_1 = W_1 x$, $a_1 = z_1 + b_1$, $s_1 = \sigma(a_1)$, $l = \|s_1 - y\|^2$), $W_1$, $b_1$, $x$, $y$ are variables and $z_1$, $a_1$, $s_1$, $l$ are ops
Variables and Ops
• Variables
• Intrinsic parameters of the model
• Inputs to the model
• TensorFlow distinguishes between the two
• Variables (tf.Variable) for the parameters
• Placeholders (tf.placeholder) for the inputs
• PyTorch
• Variables for both
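A TF 1.x sketch of the distinction, using the small network from the slides (initializers and shapes are illustrative):

```python
import tensorflow as tf   # TensorFlow 1.x API

x = tf.placeholder(tf.float32, shape=[3, 1])     # inputs: fed in at run time
y = tf.placeholder(tf.float32, shape=[2, 1])
W1 = tf.Variable(tf.random_normal([2, 3]))       # parameters: stored in the graph
b1 = tf.Variable(tf.zeros([2, 1]))

s1 = tf.sigmoid(tf.matmul(W1, x) + b1)
loss = tf.reduce_sum(tf.square(s1 - y))
```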
PyTorch Autograd
• Package: torch.autograd
• A Variable wraps:
• .data: the underlying Tensor
• .grad: the gradient w.r.t. this variable
• .grad_fn: the Function that created this variable
PyTorch Autograd
• Calculate gradient using backward() method of a Variable
• var.backward()
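A sketch of the whole autograd workflow on the one-layer network from the earlier slides (shapes are illustrative):

```python
import torch

x = torch.randn(3)
y = torch.randn(2)
W1 = torch.randn(2, 3, requires_grad=True)   # learnable parameters track gradients
b1 = torch.randn(2, requires_grad=True)

s1 = torch.sigmoid(W1 @ x + b1)              # each result remembers its creator in .grad_fn
l = ((s1 - y) ** 2).sum()
print(l.grad_fn)                             # e.g. <SumBackward0 ...>

l.backward()                                 # backprop through the recorded graph
print(W1.grad.shape, b1.grad.shape)          # dl/dW1: (2, 3), dl/db1: (2,)
```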
TensorFlow gradients
• Add gradient nodes to the graph where necessary using tf.gradients(ys, xs, grad_ys)
• Then evaluate them (e.g. with sess.run), like any other node
TensorFlow gradients
• Then update the parameters
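A TF 1.x sketch of the two previous steps, with the parameters themselves held as placeholders so the update happens on the Python side (names and shapes are illustrative):

```python
import numpy as np
import tensorflow as tf   # TensorFlow 1.x API

x = tf.placeholder(tf.float32, shape=[3, 1])
y = tf.placeholder(tf.float32, shape=[2, 1])
W = tf.placeholder(tf.float32, shape=[2, 3])
b = tf.placeholder(tf.float32, shape=[2, 1])
loss = tf.reduce_sum(tf.square(tf.sigmoid(tf.matmul(W, x) + b) - y))

grad_W, grad_b = tf.gradients(loss, [W, b])   # add gradient nodes to the graph

W_val = np.random.randn(2, 3).astype(np.float32)
b_val = np.zeros((2, 1), dtype=np.float32)
lr = 0.1
with tf.Session() as sess:
    for _ in range(100):
        feed = {x: np.random.randn(3, 1), y: np.random.rand(2, 1),
                W: W_val, b: b_val}
        gW, gb = sess.run([grad_W, grad_b], feed_dict=feed)   # evaluate the gradients
        W_val -= lr * gW                                      # then update the parameters
        b_val -= lr * gb
```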
TensorFlow gradient
• Use tf.Variable for the parameters instead, so the gradients and updates can be handled inside the graph (sketched below)
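With tf.Variable the gradient computation and the update both live inside the graph, e.g. via an optimizer op (a TF 1.x sketch):

```python
import numpy as np
import tensorflow as tf   # TensorFlow 1.x API

x = tf.placeholder(tf.float32, shape=[3, 1])
y = tf.placeholder(tf.float32, shape=[2, 1])
W = tf.Variable(tf.random_normal([2, 3]))
b = tf.Variable(tf.zeros([2, 1]))
loss = tf.reduce_sum(tf.square(tf.sigmoid(tf.matmul(W, x) + b) - y))

# minimize() adds both the gradient nodes and the update ops to the graph
train_op = tf.train.GradientDescentOptimizer(0.1).minimize(loss)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(100):
        sess.run(train_op, feed_dict={x: np.random.randn(3, 1),
                                      y: np.random.rand(2, 1)})
```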
How to use GPU?
PyTorch GPU
Turn a variable into a “GPU” variable with:
• var = var.cuda(#)   (# = the GPU index; omitting it uses the current device)
PyTorch GPU
Turn a variable back into a “CPU” variable with:
• var = var.cpu()
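A small sketch of moving tensors between devices (guarded so it also runs without a GPU):

```python
import torch

x = torch.randn(2, 3)
if torch.cuda.is_available():
    x = x.cuda(0)      # to GPU 0; x.cuda() without an index uses the current device

y = (x * 2).cpu()      # ops run wherever their inputs live; .cpu() brings the result back
print(x.is_cuda, y.is_cuda)
```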
TensorFlow GPU
• In TF, variables and operations can sit on a specific device:
• tf.device('/gpu:0')
• tf.device('/gpu:1')
• …
• tf.device('/cpu:0')
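A TF 1.x sketch of pinning ops to a device (note that the device string must be quoted):

```python
import tensorflow as tf   # TensorFlow 1.x API

with tf.device('/gpu:0'):          # everything created in this block sits on GPU 0
    a = tf.random_normal([1000, 1000])
    b = tf.random_normal([1000, 1000])
    c = tf.matmul(a, b)

with tf.Session(config=tf.ConfigProto(log_device_placement=True)) as sess:
    sess.run(c)                    # the log shows where each op was placed
```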
TensorFlow GPU
• In TF, variables and operations can sit on a specific device
tf.Session(config=tf.ConfigProto(log_device_placement=True))
MatMul: (MatMul): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508497: I tensorflow/core/common_runtime/placer.cc:874] MatMul: (MatMul)/job:localhost/replica:0/task:0/device:GPU:0
add: (Add): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508513: I tensorflow/core/common_runtime/placer.cc:874] add: (Add)/job:localhost/replica:0/task:0/device:GPU:0
Maximum: (Maximum): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508525: I tensorflow/core/common_runtime/placer.cc:874] Maximum: (Maximum)/job:localhost/replica:0/task:0/device:GPU:0
Maximum/y: (Const): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508537: I tensorflow/core/common_runtime/placer.cc:874] Maximum/y: (Const)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_2: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508548: I tensorflow/core/common_runtime/placer.cc:874] Placeholder_2: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_1: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508558: I tensorflow/core/common_runtime/placer.cc:874] Placeholder_1: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2018-04-10 12:59:09.508567: I tensorflow/core/common_runtime/placer.cc:874] Placeholder: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
TensorFlow GPU
• Some TF operations do not have a CUDA (GPU) implementation; allow_soft_placement=True lets TF fall back to a supported device:
tf.Session(config=tf.ConfigProto(
    allow_soft_placement=True, log_device_placement=True))
How to implement complicated models in practice?
PT High-Level Library
• PyTorch: the nn package and its Module base class (see the sketch below)
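A sketch of the two-layer network from the earlier slides written with nn.Module (layer sizes are illustrative):

```python
import torch
import torch.nn as nn

class TwoLayerNet(nn.Module):
    def __init__(self, d_in, d_hidden, d_out):
        super(TwoLayerNet, self).__init__()
        self.fc1 = nn.Linear(d_in, d_hidden)   # owns W1 and b1
        self.fc2 = nn.Linear(d_hidden, d_out)  # owns W2 and b2

    def forward(self, x):
        s1 = torch.sigmoid(self.fc1(x))
        return torch.sigmoid(self.fc2(s1))

net = TwoLayerNet(3, 5, 2)
out = net(torch.randn(4, 3))   # a batch of 4 inputs
print(out.shape)               # torch.Size([4, 2])
```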
TF High-Level Libraries
• Keras: highest abstraction
• SLIM: best pre-trained models
• TFLearn
• Sonnet
• Pretty Tensor
• …
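For comparison, a minimal Keras sketch of a similar network (tf.keras ships with TensorFlow; layer sizes are illustrative):

```python
import tensorflow as tf

model = tf.keras.models.Sequential([
    tf.keras.layers.Dense(5, activation='sigmoid', input_shape=(3,)),
    tf.keras.layers.Dense(2, activation='sigmoid'),
])
model.compile(optimizer='sgd', loss='mse')
model.summary()
```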
Data, storage, and loading
!!!Important!!!
• Always monitor CPU/GPU usage (Linux: nvidia-smi, top)
• Make storage more efficient (TFRecords, etc.)
• Make the reading pipeline more efficient (parallel readers, prefetching, etc.), e.g. as sketched below
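As one example of a more efficient reading pipeline, a PyTorch DataLoader sketch with parallel workers (the dataset and sizes are made up):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

dataset = TensorDataset(torch.randn(1000, 3), torch.randn(1000, 2))
loader = DataLoader(dataset, batch_size=64, shuffle=True,
                    num_workers=4, pin_memory=True)   # workers prepare batches in parallel

for x_batch, y_batch in loader:
    pass   # forward/backward pass would go here
```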
Use Visualization
• Always monitor the loss on the training and validation sets visually
• Monitor all other important scalars: learning rate, regularization loss, summaries of layer activations, how full your data queues are, …
• If you have an imbalanced classification problem, visualize the cross-entropy (CE) loss separately for each class
• If you work with images, visualize samples from the batch from time to time; if you do data augmentation, visualize the original sample as well as the augmented one
• TensorBoard for TF
• TensorBoardX, matplotlib, seaborn, … for PyTorch (a logging sketch follows below)
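A tensorboardX sketch of scalar logging for PyTorch (the logged values here are placeholders); view the curves with `tensorboard --logdir runs`:

```python
from tensorboardX import SummaryWriter   # pip install tensorboardX

writer = SummaryWriter('runs/experiment_1')
for step in range(100):
    train_loss, val_loss, lr = 1.0 / (step + 1), 1.2 / (step + 1), 0.01   # placeholder values
    writer.add_scalar('loss/train', train_loss, step)
    writer.add_scalar('loss/val', val_loss, step)
    writer.add_scalar('learning_rate', lr, step)
writer.close()
```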
Use Visualization
You can even have your run's configuration shown as text in TensorBoard!
Which one is better? PyTorch or TensorFlow?
Pros and cons
• PyTorch: easier for prototyping
• PyTorch: much easier to implement flexible graphs
• PyTorch: different structures in each iteration (dependent on data). This is possible with TF too, but is a pain.
• PyTorch: manipulating weight and gradients
• PyTorch: code-level debugging (breakpoints, imperative, tracing your own code instead of TF kernels)
• PyTorch: probably better abstractions for dataset, variable, parallelism, etc. but TF has many high-level wrappers with better abstractions
• Tie?!: faster run-time (NHWC vs. NCHW)
• TF: TensorBoard
• TF: research-level debugging (TensorBoard)
• TF: Windows support
• TF: distributed training (PyTorch has it now too, but seems not as developed as the TF version)
• TF: easier to distribute the code over multiple devices (GPUs/CPUs) (maybe not anymore)
• TF: online community is noticeably larger
• TF: data readers
• TF: supposedly more optimizations of the graph (done by the engine)
• TF: documentation and tutorials
• TF: more models available
• TF: Serialization, code and portability (saving and loading models for across platforms, or checkpoints)
• TF: Deployment: Server, Mobile, etc. (TensorFlow Serving, TensorFlow Lite)
• TF: Richer API (e.g. FFT)
• TF: Automatic shape inference
• TF has a MOOC: https://eu.udacity.com/course/deep-learning--ud730
TensorFlow Eager execution
• Eager Execution
• Dynamic!
• tf.enable_eager_execution()
• Considerably slower (being worked on)
• https://www.tensorflow.org/guide/eager
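A minimal sketch of eager execution in TF 1.x (the call must happen at program start-up):

```python
import tensorflow as tf   # TensorFlow 1.x API

tf.enable_eager_execution()        # from here on, ops run immediately, PyTorch-style

a = tf.constant([[1.0, 2.0], [3.0, 4.0]])
print(tf.matmul(a, a))             # prints the result right away, no Session needed
```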
Caffe(2)
• Portability is seamless (e.g. mobile apps)
• Simplest framework for fine-tuning or feature extraction
• Used to be fastest (Caffe)
Summary
• Don't take the following statements too seriously! It depends on many factors.
• If you want to use pretrained classic deep networks (AlexNet, VGG, ResNet, …) for feature extraction and/or fine-tuning → use Caffe and/or Caffe2
• If you have a mobile application in mind → use Caffe/Caffe2 or TensorFlow
• If you want something more Pythonic → use PyTorch
• If you are familiar with Matlab and don't need much flexibility or advanced layers → use MatConvNet
• If you don't need much flexibility and still want Python → use Keras
• If you are working on NLP applications or complicated RNNs → use PyTorch
• If you want large community support and sustainable learning of a framework → use TensorFlow
• If you want to work on bleeding-edge papers → see which framework has the original and/or cleanest implementation (most likely TensorFlow)
• If you want to prototype many different novel setups → use PyTorch or TF Eager