Tutorial 3
Ans: The learning rate in gradient descent is a hyperparameter that controls the step size at
each iteration while moving toward the minimum of the loss function. It determines how
much the model's parameters (weights and biases) are adjusted during training.
Role of Learning Rate
• Small Learning Rate:
o Leads to smaller updates.
o Ensures more precise convergence but makes the training slower.
o May get stuck in local minima or saddle points.
• Large Learning Rate:
o Leads to larger updates.
o Speeds up training but risks overshooting the minimum or causing
instability (oscillations around the optimal value).
Learning Rate in Gradient Descent Update Rule
In gradient descent, the weights (w) are updated using:
w ← w + η (t − y) x
where t is the true label, y is the model's output, x is the input, and η is the learning rate.
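The following is a minimal sketch (not part of the tutorial) of how the learning rate controls step size in plain gradient descent; the quadratic loss L(w) = (w − 3)² and the η values are illustrative assumptions chosen to show slow convergence, good convergence, and divergence.

```python
import numpy as np

def gradient(w):
    return 2 * (w - 3)          # dL/dw for the toy loss L(w) = (w - 3)^2

def run_gradient_descent(eta, steps=25, w0=0.0):
    w = w0
    for _ in range(steps):
        w -= eta * gradient(w)  # update rule: w <- w - eta * dL/dw
    return w

print("small eta (0.01):", run_gradient_descent(0.01))  # slow, still far from the minimum at 3
print("moderate eta (0.1):", run_gradient_descent(0.1)) # converges close to 3
print("large eta (1.1):", run_gradient_descent(1.1))    # overshoots and diverges (oscillates)
```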
Advantages:
1. Simplicity: Easy to implement and understand.
2. Fast Training: Efficient for small and linearly separable datasets.
3. Foundation for Neural Networks: Forms the basis for more complex
architectures like multi-layer perceptrons.
Disadvantages:
1. Linear Separability: Can only solve problems where data is linearly separable.
2. No Probabilistic Outputs: Outputs are strictly 0 or 1, with no confidence
scores.
3. Limited Flexibility: Cannot handle complex, non-linear relationships.
The perceptron is best suited for simple tasks and serves as a stepping stone for understanding
advanced models.
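As a hedged illustration of the linear-separability limitation above, the sketch below trains a single perceptron with the rule w ← w + η (t − y) x on two toy problems; the dataset, learning rate, and epoch count are illustrative assumptions. It fits AND (linearly separable) but can never fit XOR.

```python
import numpy as np

def train_perceptron(X, T, eta=0.1, epochs=50):
    # Single perceptron with a hard threshold: output is strictly 0 or 1.
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        for x, t in zip(X, T):
            y = 1 if np.dot(w, x) + b > 0 else 0
            w += eta * (t - y) * x   # perceptron update, scaled by the learning rate
            b += eta * (t - y)
    return [(1 if np.dot(w, x) + b > 0 else 0) for x in X]

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
print("AND targets [0,0,0,1], predictions:", train_perceptron(X, np.array([0, 0, 0, 1])))
print("XOR targets [0,1,1,0], predictions:", train_perceptron(X, np.array([0, 1, 1, 0])))
```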
1. Sigmoid:
o Formula: σ(x) = 1 / (1 + e^(−x))
o Range: (0, 1)
o Characteristics: Smooth, but can cause vanishing gradients.
2. ReLU (Rectified Linear Unit):
o Formula: f(x) = max(0, x)
o Range: [0, ∞)
o Characteristics: Efficient, avoids vanishing gradients, but can suffer from
"dead neurons."
3. Tanh:
o Formula: tanh(x) = (e^x − e^(−x)) / (e^x + e^(−x))
o Range: (−1,1)
o Characteristics: Centred around zero; can still suffer from vanishing
gradients.
4. Softmax:
o Formula: softmax(x_i) = e^(x_i) / Σ_j e^(x_j)
o Range: (0, 1), with all outputs summing to 1
o Characteristics: Converts raw scores into a probability distribution; typically used in the output layer for multi-class classification.
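A minimal NumPy sketch of the four activation functions listed above, evaluated on a small example vector (the input values are illustrative assumptions):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))   # range (0, 1)

def relu(x):
    return np.maximum(0.0, x)         # range [0, inf)

def tanh(x):
    return np.tanh(x)                 # range (-1, 1)

def softmax(x):
    e = np.exp(x - np.max(x))         # shift inputs for numerical stability
    return e / e.sum()                # outputs in (0, 1) and sum to 1

z = np.array([-2.0, 0.0, 3.0])
print("sigmoid:", sigmoid(z))
print("relu:   ", relu(z))
print("tanh:   ", tanh(z))
print("softmax:", softmax(z), "sum =", softmax(z).sum())
```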
Role of Activation Functions
• Introduce Non-Linearity: Allows networks to model complex data relationships.
• Control Output: Adjusts the range and form of the output for different tasks.
• Facilitate Optimization: Helps in learning by enabling effective
backpropagation.
Fetches:
• Fetches refer to the process of retrieving the output(s) of one or more
operations from the computation graph.
• You specify the nodes (operations) to fetch the results from when running a
session.
How Fetches Work:
1. Specify Fetches: While running the session, you can specify the operations
whose results you want to fetch.
2. Retrieve Outputs: The session returns the results of these specified operations.
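The following is a minimal sketch of fetches using the TensorFlow 1.x-style session API; the small graph (two constants, an add node, and a multiply node) is an illustrative assumption, and tf.compat.v1 is used so the example also runs under TensorFlow 2.x.

```python
import tensorflow as tf

tf.compat.v1.disable_eager_execution()

a = tf.constant(3.0)
b = tf.constant(4.0)
sum_op = tf.add(a, b)         # one node in the computation graph
prod_op = tf.multiply(a, b)   # another node

with tf.compat.v1.Session() as sess:
    # The list passed to run() is the "fetches": the session evaluates the graph
    # and returns the results of exactly these specified operations.
    sum_val, prod_val = sess.run([sum_op, prod_op])
    print(sum_val, prod_val)  # 7.0 12.0
```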