Deep Learning - IIT Ropar - Unit 12 - Week 9
Deep Learning - IIT Ropar - Unit 12 - Week 9
Deep Learning - IIT Ropar - Unit 12 - Week 9
(https://swayam.gov.in) (https://swayam.gov.in/nc_details/NPTEL)
vmcse09@gmail.com
2) You are given the one hot representation of two words below:
Week 7 ()
CAR= [1, 0, 0, 0, 0] , BUS= [0, 0, 0, 1, 0]
Week 8 ()
What is the Euclidean distance between CAR and BUS?
Week 9 () 1.4142
https://onlinecourses.nptel.ac.in/noc24_cs114/unit?unit=115&assessment=297 1/4
10/27/24, 1:13 PM Deep Learning - IIT Ropar - - Unit 12 - Week 9
(unit? Score: 1
unit=115&less Accepted Answers:
on=116) (Type: Range) 1.40,1.42
1 point
Distributed
Representatio 3) Suppose we are learning the representations of words using Glove representations. 1 point
ns of words If we observe that the cosine similarity between two representations vi and vj for words `i' and `
(unit?
j ' is very high. which of the following statements is true?( parameter bi = 0.02 and bj = 0.07)
unit=115&less
on=117)
X ij = 0.02
SVD for
learning word
X ij = 0.2
representation
s (unit?
X ij = 0.88
unit=115&less
on=118)
X ij = 0
(unit?
unit=115&less 4) Which of the following is an advantage of the CBOW model compared to the Skip- 1 point
on=119)
gram model?
Continuous
bag of words It is faster to train
model (unit? It requires less memory
unit=115&less
It performs better on rare words
on=120)
All of the above
Skip-gram
Yes, the answer is correct.
model (unit?
Score: 1
unit=115&less
Accepted Answers:
on=121)
It is faster to train
Skip-gram
model (Contd.) 5) Which of the following is true about the input representation in the CBOW model? 1 point
(unit?
unit=115&less Each word is represented as a one-hot vector
on=122)
Each word is represented as a continuous vector
Contrastive Each word is represented as a sequence of one-hot vectors
estimation
Each word is represented as a sequence of continuous vectors
(unit?
unit=115&less Yes, the answer is correct.
on=123) Score: 1
Accepted Answers:
Hierarchical Each word is represented as a one-hot vector
softmax (unit?
unit=115&less
6) What is the role of the softmax function in the skip-gram method? 1 point
on=124)
GloVe To calculate the dot product between the target word and the context words
representation To transform the dot product into a probability distribution
s (unit?
To calculate the distance between the target word and the context words
unit=115&less
on=125) To adjust the weights of the neural network during training
https://onlinecourses.nptel.ac.in/noc24_cs114/unit?unit=115&assessment=297 2/4
10/27/24, 1:13 PM Deep Learning - IIT Ropar - - Unit 12 - Week 9
Lecture X ij = 0.35.
Material for
Week 9 (unit? X ij = 0.
Quiz: Week 9
2
: Assignment O(n )
9
(assessment? O(nlogn)
name=297)
O(logn)
week 10 ()
Yes, the answer is correct.
Score: 1
Week 11 () Accepted Answers:
O(n)
Week 12 ()
9) How does Hierarchical Softmax reduce the computational complexity of computing 1 point
Download the softmax function?
Videos ()
It replaces the softmax function with a linear function
Books () It uses a binary tree to approximate the softmax function
It uses a heuristic to compute the softmax function faster
Text It does not reduce the computational complexity of computing the softmax function
Transcripts
() Yes, the answer is correct.
Score: 1
Accepted Answers:
Problem
It uses a binary tree to approximate the softmax function
Solving
Session -
10) What is the disadvantage of using Hierarchical Softmax? 1 point
July 2024 ()
It requires more memory to store the binary tree
It is slower than computing the softmax function directly
https://onlinecourses.nptel.ac.in/noc24_cs114/unit?unit=115&assessment=297 3/4
10/27/24, 1:13 PM Deep Learning - IIT Ropar - - Unit 12 - Week 9
https://onlinecourses.nptel.ac.in/noc24_cs114/unit?unit=115&assessment=297 4/4