DL Lecture8 Autoencoder

• Autoencoders are neural networks trained to copy their input to their output. They have an encoder that maps the input to a hidden representation and a decoder that maps the hidden representation back to the output.
• Undercomplete autoencoders constrain the hidden representation to have smaller dimension than the input, forcing it to learn a compressed representation; with linear maps and mean squared error this reduces to learning the principal components of the data.
• Regularized autoencoders add regularization terms to encourage other properties, like sparsity of the hidden representation (sparse autoencoder) or robustness to noise (denoising autoencoder).


Deep Learning Basics

Lecture 8: Autoencoder & DBM


Princeton University COS 495
Instructor: Yingyu Liang
Autoencoder
Autoencoder
• Neural networks trained to attempt to copy their input to their output

• Contain two parts:


• Encoder: map the input to a hidden representation
• Decoder: map the hidden representation to the output
Autoencoder

[Figure: input 𝑥 → hidden representation ℎ (the code) → reconstruction 𝑟]
Autoencoder

[Figure: encoder 𝑓(⋅) maps 𝑥 to ℎ; decoder 𝑔(⋅) maps ℎ to 𝑟]

ℎ = 𝑓(𝑥),   𝑟 = 𝑔(ℎ) = 𝑔(𝑓(𝑥))
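As a concrete sketch of this structure (not part of the original slides; layer sizes and activation choices are arbitrary assumptions), a minimal PyTorch autoencoder might look like:

import torch
import torch.nn as nn

class Autoencoder(nn.Module):
    def __init__(self, input_dim=784, code_dim=32):
        super().__init__()
        # Encoder f: input x -> hidden representation (code) h
        self.encoder = nn.Sequential(nn.Linear(input_dim, code_dim), nn.ReLU())
        # Decoder g: code h -> reconstruction r
        self.decoder = nn.Sequential(nn.Linear(code_dim, input_dim), nn.Sigmoid())

    def forward(self, x):
        h = self.encoder(x)   # h = f(x)
        r = self.decoder(h)   # r = g(h) = g(f(x))
        return r

x = torch.rand(16, 784)       # a batch of fake inputs in [0, 1]
model = Autoencoder()
print(model(x).shape)         # torch.Size([16, 784])

Here the encoder and decoder are single layers; deeper encoders and decoders follow the same pattern.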
Why want to copy input to output
• We do not really care about the copying itself

• Interesting case: NOT able to copy exactly but strive to do so


• Autoencoder forced to select which aspects to preserve and thus
hopefully can learn useful properties of the data

• Historical note: goes back to (LeCun, 1987; Bourlard and Kamp, 1988;
Hinton and Zemel, 1994).
Undercomplete autoencoder
• Constrain the code to have smaller dimension than the input
• Training: minimize a loss function
𝐿(𝑥, 𝑟) = 𝐿(𝑥, 𝑔(𝑓(𝑥)))

[Figure: 𝑥 → ℎ → 𝑟]
Undercomplete autoencoder
• Constrain the code to have smaller dimension than the input
• Training: minimize a loss function
𝐿(𝑥, 𝑟) = 𝐿(𝑥, 𝑔(𝑓(𝑥)))

• Special case: 𝑓, 𝑔 linear, 𝐿 mean square error


• Reduces to Principal Component Analysis
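The following NumPy sketch (synthetic data, illustrative only) shows the PCA solution that an undercomplete linear autoencoder with mean squared error would recover:

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 20))      # 500 samples, input dimension 20 (synthetic)
X = X - X.mean(axis=0)              # center the data
k = 5                               # code dimension, smaller than the input

# Top-k principal directions from the SVD of the centered data matrix.
_, _, Vt = np.linalg.svd(X, full_matrices=False)
V = Vt[:k].T                        # shape (20, k)

H = X @ V                           # linear "encoder": h = V^T x
R = H @ V.T                         # linear "decoder": r = V h
print("mean squared reconstruction error:", np.mean((X - R) ** 2))
# A linear autoencoder trained with mean squared error converges to the same
# top-k principal subspace, hence the same minimal reconstruction error.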
Undercomplete autoencoder
• What about nonlinear encoder and decoder?

• Capacity should not be too large


• Suppose we are given data 𝑥1, 𝑥2, …, 𝑥𝑛
• Encoder maps 𝑥𝑖 to 𝑖
• Decoder maps 𝑖 to 𝑥𝑖
• A one-dimensional ℎ then suffices for perfect reconstruction (see the sketch below)
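The point can be made concrete with a deliberately degenerate example (purely illustrative): a lookup-table "autoencoder" with unlimited capacity reconstructs perfectly from a one-dimensional code yet captures no structure in the data.

# A degenerate "autoencoder" with unlimited capacity: the encoder maps x_i to
# the index i, the decoder maps i back to x_i. Reconstruction is perfect with a
# one-dimensional code, but nothing about the data is actually learned.
data = [(0.2, 0.7, 0.1), (0.9, 0.4, 0.5), (0.3, 0.3, 0.8)]   # x_1, ..., x_n

encode = {x: i for i, x in enumerate(data)}    # encoder: x_i -> i
decode = {i: x for i, x in enumerate(data)}    # decoder: i -> x_i

for x in data:
    assert decode[encode[x]] == x              # perfect reconstruction
print("perfect reconstruction from a 1-D code, no useful structure captured")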
Regularization
• Typically NOT enforced by
• Keeping the encoder/decoder shallow or
• Using a small code size

• Regularized autoencoders: add a regularization term that encourages the model to have other properties
• Sparsity of the representation (sparse autoencoder)
• Robustness to noise or to missing inputs (denoising autoencoder)
• Smallness of the derivative of the representation
Sparse autoencoder
• Constrain the code to have sparsity
• Training: minimize a loss function
𝐿𝑅 = 𝐿(𝑥, 𝑔(𝑓(𝑥))) + 𝑅(ℎ)

[Figure: 𝑥 → ℎ → 𝑟]
Probabilistic view of regularizing ℎ
• Suppose we have a probabilistic model 𝑝(ℎ, 𝑥)
• MLE on 𝑥
log 𝑝(𝑥) = log ∑ℎ′ 𝑝(ℎ′, 𝑥)

• Problem: hard to sum over ℎ′


Probabilistic view of regularizing ℎ
• Suppose we have a probabilistic model 𝑝(ℎ, 𝑥)
• MLE on 𝑥
max log 𝑝(𝑥) = max log ∑ℎ′ 𝑝(ℎ′, 𝑥)

• Approximation: suppose ℎ = 𝑓(𝑥) gives the most likely hidden representation, and ∑ℎ′ 𝑝(ℎ′, 𝑥) can be approximated by 𝑝(ℎ, 𝑥)
Probabilistic view of regularizing ℎ
• Suppose we have a probabilistic model 𝑝(ℎ, 𝑥)
• Approximate MLE on 𝑥, ℎ = 𝑓(𝑥)
max log 𝑝(ℎ, 𝑥) = max [log 𝑝(𝑥|ℎ) + log 𝑝(ℎ)]

where log 𝑝(𝑥|ℎ) corresponds to the (reconstruction) loss and log 𝑝(ℎ) to the regularization
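To make the loss/regularization correspondence concrete (an added worked step, not from the slides), assume a Gaussian likelihood for 𝑝(𝑥|ℎ) and a Laplace prior for 𝑝(ℎ), with constants absorbed into 𝜆:

% Sketch of the correspondence under assumed Gaussian/Laplace choices (requires amsmath).
\begin{gather*}
p(x \mid h) \propto \exp\!\big(-\tfrac{1}{2}\lVert x - g(h)\rVert_2^2\big), \qquad
p(h) \propto \exp\!\big(-\lambda \lVert h \rVert_1\big), \\
-\log p(h, x) = -\log p(x \mid h) - \log p(h)
= \tfrac{1}{2}\lVert x - g(h)\rVert_2^2 + \lambda \lVert h \rVert_1 + \mathrm{const}.
\end{gather*}

So maximizing log 𝑝(ℎ, 𝑥) amounts to minimizing a squared reconstruction loss plus an L1 sparsity penalty on the code, which is the sparse autoencoder objective on the next slide.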
Sparse autoencoder
• Constrain the code to have sparsity
• Laplacian prior: 𝑝(ℎ) = (𝜆/2) exp(−(𝜆/2)‖ℎ‖₁)

• Training: minimize a loss function


𝐿𝑅 = 𝐿(𝑥, 𝑔(𝑓(𝑥))) + 𝜆‖ℎ‖₁
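A minimal training-loop sketch for this objective (PyTorch; the sizes, 𝜆, and optimizer settings are arbitrary assumptions):

import torch
import torch.nn as nn

encoder = nn.Linear(784, 64)
decoder = nn.Linear(64, 784)
opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
lam = 1e-3                                    # sparsity weight λ (arbitrary)

x = torch.rand(32, 784)                       # stand-in batch
for step in range(100):
    h = torch.relu(encoder(x))                # code h = f(x)
    r = torch.sigmoid(decoder(h))             # reconstruction r = g(h)
    # L_R = L(x, g(f(x))) + λ‖h‖₁ (reconstruction loss plus L1 penalty on the code)
    loss = nn.functional.mse_loss(r, x) + lam * h.abs().sum(dim=1).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
print("final loss:", loss.item())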
Denoising autoencoder
• Traditional autoencoder: encouraged to learn 𝑔(𝑓(⋅)) to be the identity

• Denoising autoencoder: minimize a loss function


𝐿(𝑥, 𝑟) = 𝐿(𝑥, 𝑔(𝑓(𝑥̃)))
where 𝑥̃ is 𝑥 + noise
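A corresponding denoising sketch (PyTorch; the Gaussian noise level and layer sizes are arbitrary assumptions). Note that the loss compares the reconstruction with the clean 𝑥, not with 𝑥̃:

import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Linear(784, 64), nn.ReLU())
decoder = nn.Sequential(nn.Linear(64, 784), nn.Sigmoid())
opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)

x = torch.rand(32, 784)                       # clean inputs (stand-in batch)
for step in range(100):
    x_tilde = x + 0.3 * torch.randn_like(x)   # x̃ = x + noise
    r = decoder(encoder(x_tilde))             # reconstruct from the corrupted input
    loss = nn.functional.mse_loss(r, x)       # L(x, g(f(x̃))): target is the clean x
    opt.zero_grad()
    loss.backward()
    opt.step()
print("final loss:", loss.item())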
Boltzmann machine
Boltzmann machine
• Introduced by Ackley et al. (1985)

• General “connectionist” approach to learning arbitrary probability distributions over binary vectors
• Special case of energy model: 𝑝(𝑥) = exp(−𝐸(𝑥)) / 𝑍
Boltzmann machine
• Energy model:
𝑝(𝑥) = exp(−𝐸(𝑥)) / 𝑍
• Boltzmann machine: special case of energy model with
𝐸(𝑥) = −𝑥ᵀ𝑈𝑥 − 𝑏ᵀ𝑥
where 𝑈 is the weight matrix and 𝑏 is the bias parameter
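A small NumPy sketch (random parameters, illustrative only) of evaluating this energy and the unnormalized probability exp(−𝐸(𝑥)):

import numpy as np

rng = np.random.default_rng(0)
d = 5                                         # number of binary units
U = rng.normal(scale=0.1, size=(d, d))
U = (U + U.T) / 2                             # symmetric weight matrix
np.fill_diagonal(U, 0.0)                      # no self-connections
b = rng.normal(scale=0.1, size=d)             # bias parameter

def energy(x):
    # E(x) = -xᵀ U x - bᵀ x
    return -x @ U @ x - b @ x

x = rng.integers(0, 2, size=d)                # one binary configuration
print("E(x) =", energy(x))
print("unnormalized probability exp(-E(x)) =", np.exp(-energy(x)))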
Boltzmann machine with latent variables
• Some variables are not observed
𝑥 = (𝑥𝑣, 𝑥ℎ), with 𝑥𝑣 visible and 𝑥ℎ hidden
𝐸(𝑥) = −𝑥𝑣ᵀ𝑅𝑥𝑣 − 𝑥𝑣ᵀ𝑊𝑥ℎ − 𝑥ℎᵀ𝑆𝑥ℎ − 𝑏ᵀ𝑥𝑣 − 𝑐ᵀ𝑥ℎ

• Universal approximator of probability mass functions


Maximum likelihood
• Suppose we are given data 𝑋 = 𝑥𝑣1 , 𝑥𝑣2 , … , 𝑥𝑣𝑛
• Maximum likelihood is to maximize
log 𝑝(𝑋) = ∑𝑖 log 𝑝(𝑥𝑣𝑖)
where
𝑝(𝑥𝑣) = ∑𝑥ℎ 𝑝(𝑥𝑣, 𝑥ℎ) = (1/𝑍) ∑𝑥ℎ exp(−𝐸(𝑥𝑣, 𝑥ℎ))

• 𝑍 = ∑𝑥𝑣,𝑥ℎ exp(−𝐸(𝑥𝑣, 𝑥ℎ)): partition function, difficult to compute
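For a toy model the partition function can still be computed by brute force, which makes the difficulty explicit: the enumeration below ranges over all 2^(𝑛𝑣+𝑛ℎ) configurations (NumPy sketch, random parameters, illustrative only):

import itertools
import numpy as np

rng = np.random.default_rng(0)
n_v, n_h = 3, 2                               # tiny sizes so enumeration is feasible
R = rng.normal(scale=0.1, size=(n_v, n_v)); R = (R + R.T) / 2
S = rng.normal(scale=0.1, size=(n_h, n_h)); S = (S + S.T) / 2
W = rng.normal(scale=0.1, size=(n_v, n_h))
b = rng.normal(scale=0.1, size=n_v)
c = rng.normal(scale=0.1, size=n_h)

def energy(xv, xh):
    # E(x) = -xvᵀ R xv - xvᵀ W xh - xhᵀ S xh - bᵀ xv - cᵀ xh
    return -(xv @ R @ xv + xv @ W @ xh + xh @ S @ xh + b @ xv + c @ xh)

states_v = [np.array(s) for s in itertools.product([0, 1], repeat=n_v)]
states_h = [np.array(s) for s in itertools.product([0, 1], repeat=n_h)]
Z = sum(np.exp(-energy(v, h)) for v in states_v for h in states_h)

xv = np.array([1, 0, 1])
p_xv = sum(np.exp(-energy(xv, h)) for h in states_h) / Z   # marginal p(x_v)
print("Z =", Z, " p(x_v) =", p_xv)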


Restricted Boltzmann machine
• Invented under the name harmonium (Smolensky, 1986)
• Popularized by Hinton and collaborators as the Restricted Boltzmann machine
Restricted Boltzmann machine
• Special case of Boltzmann machine with latent variables:
𝑝(𝑣, ℎ) = exp(−𝐸(𝑣, ℎ)) / 𝑍
where the energy function is
𝐸(𝑣, ℎ) = −𝑣ᵀ𝑊ℎ − 𝑏ᵀ𝑣 − 𝑐ᵀℎ
with the weight matrix 𝑊 and the biases 𝑏, 𝑐
• Partition function
𝑍 = ∑𝑣 ∑ℎ exp(−𝐸(𝑣, ℎ))
Restricted Boltzmann machine

[Figure from Deep Learning, Goodfellow, Bengio and Courville]
Restricted Boltzmann machine
• Conditional distribution is factorial
𝑝(ℎ|𝑣) = 𝑝(𝑣, ℎ) / 𝑝(𝑣) = ∏𝑗 𝑝(ℎ𝑗|𝑣)
and
𝑝(ℎ𝑗 = 1|𝑣) = 𝜎(𝑐𝑗 + 𝑣ᵀ𝑊:,𝑗), where 𝜎 is the logistic function
Restricted Boltzmann machine
• Similarly,
𝑝(𝑣|ℎ) = 𝑝(𝑣, ℎ) / 𝑝(ℎ) = ∏𝑖 𝑝(𝑣𝑖|ℎ)
and
𝑝(𝑣𝑖 = 1|ℎ) = 𝜎(𝑏𝑖 + 𝑊𝑖,:ℎ), where 𝜎 is the logistic function
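These factorial conditionals make block Gibbs sampling straightforward; a NumPy sketch with random parameters (illustrative only):

import numpy as np

rng = np.random.default_rng(0)
n_v, n_h = 6, 3
W = rng.normal(scale=0.1, size=(n_v, n_h))    # weight matrix
b = rng.normal(scale=0.1, size=n_v)           # visible bias
c = rng.normal(scale=0.1, size=n_h)           # hidden bias

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

v = rng.integers(0, 2, size=n_v).astype(float)
for step in range(1000):
    # p(h_j = 1 | v) = σ(c_j + vᵀ W_{:,j}); the h_j are conditionally independent
    h = (rng.random(n_h) < sigmoid(c + v @ W)).astype(float)
    # p(v_i = 1 | h) = σ(b_i + W_{i,:} h); the v_i are conditionally independent
    v = (rng.random(n_v) < sigmoid(b + W @ h)).astype(float)
print("sample after Gibbs steps:", v, h)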
Deep Boltzmann machine
• Special case of energy model. Take 3 hidden layers and ignore bias:
𝑝(𝑣, ℎ¹, ℎ², ℎ³) = exp(−𝐸(𝑣, ℎ¹, ℎ², ℎ³)) / 𝑍
• Energy function
𝐸(𝑣, ℎ¹, ℎ², ℎ³) = −𝑣ᵀ𝑊¹ℎ¹ − (ℎ¹)ᵀ𝑊²ℎ² − (ℎ²)ᵀ𝑊³ℎ³
with the weight matrices 𝑊¹, 𝑊², 𝑊³
• Partition function
𝑍 = ∑𝑣,ℎ¹,ℎ²,ℎ³ exp(−𝐸(𝑣, ℎ¹, ℎ², ℎ³))
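A NumPy sketch (random weights, illustrative only) of evaluating this DBM energy:

import numpy as np

rng = np.random.default_rng(0)
sizes = [6, 5, 4, 3]                          # dimensions of v, h1, h2, h3
W1 = rng.normal(scale=0.1, size=(sizes[0], sizes[1]))
W2 = rng.normal(scale=0.1, size=(sizes[1], sizes[2]))
W3 = rng.normal(scale=0.1, size=(sizes[2], sizes[3]))

def energy(v, h1, h2, h3):
    # E = -vᵀ W¹ h¹ - (h¹)ᵀ W² h² - (h²)ᵀ W³ h³
    return -(v @ W1 @ h1 + h1 @ W2 @ h2 + h2 @ W3 @ h3)

v, h1, h2, h3 = (rng.integers(0, 2, size=n).astype(float) for n in sizes)
print("E(v, h1, h2, h3) =", energy(v, h1, h2, h3))
# The unnormalized probability is exp(-E); the partition function Z would again
# require summing over every joint configuration of v, h1, h2, h3.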
Deep Boltzmann machine

[Figure from Deep Learning, Goodfellow, Bengio and Courville]
