CNN Apps

The document discusses convolutional neural networks for image recognition. It provides an overview of CNN structure and components like convolutional layers, pooling layers, and fully connected layers. It also describes popular CNN models like LeNet, AlexNet, GoogleNet, VGGNet, and ResNet and datasets like MNIST, CIFAR10, and ImageNet.

Uploaded by

asidharth157


Convolutional neural network for image recognition

Sanjay Sharma
Dense neural network and convolutional neural network
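As a rough illustration of why the two layer types differ, the sketch below counts trainable parameters for one dense layer and one convolutional layer. The input size (32x32 RGB), the 100 outputs and the 5x5 kernels are hypothetical values chosen for the example, not figures from the slides.

```python
def dense_params(n_in, n_out):
    """Weights plus biases of a fully connected layer."""
    return n_in * n_out + n_out


def conv_params(kernel, c_in, c_out):
    """Weights plus biases of a conv layer with square kernels."""
    return kernel * kernel * c_in * c_out + c_out


# Hypothetical sizes: a 32x32 RGB image, 100 dense units or 100 filters.
n_inputs = 32 * 32 * 3
dense_count = dense_params(n_inputs, 100)
conv_count = conv_params(5, 3, 100)

print(dense_count)  # 307300
print(conv_count)   # 7600
```

The convolutional layer reuses the same small kernel at every image position, so its parameter count does not grow with the image size.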
A simple CNN structure

CONV: Convolutional kernel layer
RELU: Activation function
POOL: Dimension reduction layer
FC: Fully connected layer
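The four stages can be chained in a few lines of NumPy. This is a minimal sketch, assuming a single-channel 8x8 input, one 3x3 kernel, 2x2 pooling and 10 output scores; all sizes and the random weights are illustrative assumptions.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

rng = np.random.default_rng(0)
image = rng.standard_normal((8, 8))        # hypothetical single-channel input
kernel = rng.standard_normal((3, 3))       # one convolutional kernel
fc_weights = rng.standard_normal((9, 10))  # 3*3 pooled features -> 10 scores

# CONV: slide the kernel over every 3x3 window (no padding, stride 1).
windows = sliding_window_view(image, (3, 3))      # shape (6, 6, 3, 3)
conv = np.einsum("ijkl,kl->ij", windows, kernel)  # shape (6, 6)

# RELU: set negative responses to zero.
relu = np.maximum(conv, 0.0)

# POOL: 2x2 max pooling with stride 2 halves each spatial dimension.
pool = relu.reshape(3, 2, 3, 2).max(axis=(1, 3))  # shape (3, 3)

# FC: flatten and apply a fully connected layer.
scores = pool.reshape(-1) @ fc_weights            # shape (10,)

print(conv.shape, pool.shape, scores.shape)
```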
Convolutional kernel
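What a convolutional kernel computes can be written with two explicit loops. The function name conv2d and the example values are assumptions made for this sketch; like most deep learning libraries, it computes a cross-correlation (the kernel is not flipped).

```python
import numpy as np


def conv2d(image, kernel, stride=1):
    """Slide `kernel` over `image` (no padding) and sum the products."""
    kh, kw = kernel.shape
    out_h = (image.shape[0] - kh) // stride + 1
    out_w = (image.shape[1] - kw) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            top, left = i * stride, j * stride
            patch = image[top:top + kh, left:left + kw]
            out[i, j] = np.sum(patch * kernel)
    return out


# 4x4 image holding 0..15; the kernel subtracts the lower-right
# neighbour from the upper-left pixel of each 2x2 patch.
image = np.arange(16, dtype=float).reshape(4, 4)
kernel = np.array([[1.0, 0.0],
                   [0.0, -1.0]])
result = conv2d(image, kernel)
print(result)  # every entry is -5.0
```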
Convolutional kernel

Padding the input volume with zeros in such a way
that the conv layer does not alter the spatial
dimensions of the input
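The amount of padding needed follows from the usual output-size relation, output = (input + 2 * padding - kernel) / stride + 1. The helper names below are assumptions for this sketch, and the "same" padding rule shown holds for stride 1 and odd kernel sizes.

```python
def conv_output_size(n, kernel, padding=0, stride=1):
    """Spatial size after a convolution along one dimension."""
    return (n + 2 * padding - kernel) // stride + 1


def same_padding(kernel):
    """Zero padding that keeps the size unchanged (stride 1, odd kernel)."""
    return (kernel - 1) // 2


unpadded = conv_output_size(32, 5)
padded = conv_output_size(32, 5, padding=same_padding(5))

print(unpadded)  # 28
print(padded)    # 32
```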
Rectified linear unit (ReLU)
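ReLU is the elementwise function max(0, x). A minimal NumPy sketch, with made-up input values:

```python
import numpy as np


def relu(x):
    """Elementwise max(0, x): negatives become zero, the rest pass through."""
    return np.maximum(x, 0.0)


x = np.array([-2.0, -0.5, 0.0, 1.5])
activated = relu(x)
print(activated)
```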
Pooling layer
Pooling
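Max pooling, the most common variant, keeps the largest value in each window. The sketch below assumes a 2x2 window with stride 2; the function name and the input values are illustrative.

```python
import numpy as np


def max_pool2d(x, size=2, stride=2):
    """Take the maximum of each size x size window."""
    out_h = (x.shape[0] - size) // stride + 1
    out_w = (x.shape[1] - size) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            top, left = i * stride, j * stride
            out[i, j] = x[top:top + size, left:left + size].max()
    return out


x = np.array([[1, 3, 2, 4],
              [5, 6, 7, 8],
              [9, 2, 1, 0],
              [3, 4, 5, 6]], dtype=float)
pooled = max_pool2d(x)
print(pooled)  # 2x2 result: [[6, 8], [9, 6]]
```

A 4x4 input becomes 2x2, so each spatial dimension is halved while the strongest response in every window is kept.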
MNIST dataset
The MNIST database of handwritten digits
has a training set of 60,000 examples
and a test set of 10,000 examples.
It is a subset of a larger set
available from NIST.
The digits have been size-normalized
and centered in a fixed-size image.
LeNet-5 for MNIST
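The slide's diagram is not reproduced here. As a sketch, the code below traces feature-map sizes through the layer layout usually described for LeNet-5 (5x5 convolutions without padding, 2x2 subsampling, 32x32 input obtained by zero-padding the 28x28 MNIST digits). The layer table is an assumption taken from that common description, not from the slides.

```python
def conv_out(size, kernel):
    """Output width of an unpadded, stride-1 convolution."""
    return size - kernel + 1


def pool_out(size, window):
    """Output width of non-overlapping pooling."""
    return size // window


# Assumed layout: (name, kind, kernel or window size, feature maps).
LAYERS = [
    ("C1", "conv", 5, 6),
    ("S2", "pool", 2, 6),
    ("C3", "conv", 5, 16),
    ("S4", "pool", 2, 16),
    ("C5", "conv", 5, 120),
]

size = 32  # 28x28 digit zero-padded to 32x32
trace = []
for name, kind, k, maps in LAYERS:
    size = conv_out(size, k) if kind == "conv" else pool_out(size, k)
    trace.append((name, maps, size))
    print(f"{name}: {maps} maps of {size}x{size}")
```

In that description, C5 is followed by a fully connected layer of 84 units and a 10-way output, one per digit.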
CIFAR10 dataset and state of the art
The CIFAR-10 dataset consists of 60000 32x32 color images in 10 classes,
with 6000 images per class. There are 50000 training images and 10000 test images.
ImageNet
• The ImageNet project is a large visual database designed
for use in visual object recognition software research. As
of 2016, over ten million image URLs have been hand-annotated
by ImageNet to indicate what objects are pictured; in at
least one million of the images, bounding boxes are also
provided.[1] The database of annotations of third-party
image URLs is freely available directly from ImageNet;
however, the actual images are not owned by ImageNet.[2]
Since 2010, the ImageNet project has run an annual software
contest, the ImageNet Large Scale Visual Recognition
Challenge (ILSVRC), in which software programs compete to
correctly classify and detect objects and scenes.
Case studies
• LeNet. The first successful applications of Convolutional
Networks were developed by Yann LeCun in the 1990s. Of these,
the best known is the LeNet architecture, which was used to read
zip codes, digits, etc.

• AlexNet. The first work that popularized Convolutional
Networks in Computer Vision was AlexNet, developed by
Alex Krizhevsky, Ilya Sutskever and Geoff Hinton.
AlexNet was submitted to the ImageNet ILSVRC challenge in
2012 and significantly outperformed the runner-up (top-5
error of 16% compared to 26% for the runner-up). The
network had an architecture very similar to LeNet, but was
deeper, bigger, and featured Convolutional Layers stacked on
top of each other (previously it was common to have only a
single CONV layer always immediately followed by a POOL
layer).
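The top-5 error quoted for AlexNet counts a prediction as correct when the true class is among the five highest-scoring classes. The sketch below shows one way to compute it; the function name and the tiny 6-class score matrix are made up for illustration.

```python
import numpy as np


def top_k_error(scores, labels, k=5):
    """Fraction of samples whose true label is not among the k best scores."""
    top_k = np.argsort(scores, axis=1)[:, -k:]       # indices of the k largest
    hits = (top_k == labels[:, None]).any(axis=1)    # label found in the top k?
    return 1.0 - hits.mean()


# Three samples, six classes (illustrative values only).
scores = np.array([[0.1, 0.2, 0.3, 0.4, 0.5, 0.6],
                   [0.6, 0.5, 0.4, 0.3, 0.2, 0.1],
                   [0.3, 0.1, 0.6, 0.2, 0.5, 0.4]])
labels = np.array([1, 0, 1])

top5 = top_k_error(scores, labels, k=5)
top1 = top_k_error(scores, labels, k=1)
print(top5, top1)
```

Only the third sample misses under the top-5 rule (its true class has the lowest score), while two of the three samples miss under the stricter top-1 rule.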
Case studies
• GoogLeNet. The ILSVRC 2014 winner was a
Convolutional Network from Szegedy et al. at Google.
Its main contribution was the development of an Inception
Module that dramatically reduced the number of
parameters in the network (4M, compared to AlexNet with
60M). Additionally, this paper uses Average Pooling
instead of Fully Connected layers at the top of the
ConvNet, eliminating a large number of parameters that do
not seem to matter much. There are also several follow-up
versions of GoogLeNet, most recently Inception-v4.
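To see why average pooling at the top of the network saves parameters, the sketch below compares a classifier applied to a flattened feature map with one applied after averaging each channel. The 1024x7x7 feature map and the 1000 classes are assumed sizes for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
features = rng.standard_normal((1024, 7, 7))  # assumed (channels, height, width)
num_classes = 1000

# Global average pooling: one number per channel.
pooled = features.mean(axis=(1, 2))           # shape (1024,)

# Weights needed by a classifier layer in each case (biases ignored).
weights_on_flattened = features.size * num_classes
weights_on_pooled = pooled.size * num_classes

print(weights_on_flattened)  # 50176000
print(weights_on_pooled)     # 1024000
```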
Case studies
• VGGNet. The runner-up in ILSVRC 2014 was the network
from Karen Simonyan and Andrew Zisserman that became
known as VGGNet. Its main contribution was in showing
that the depth of the network is a critical component for good
performance. Their final best network contains 16 CONV/FC
layers and, appealingly, features an extremely homogeneous
architecture that only performs 3x3 convolutions and 2x2
pooling from the beginning to the end. Their pretrained
model is available for plug-and-play use in Caffe. A downside
of VGGNet is that it is more expensive to evaluate and
uses a lot more memory and parameters (140M). Most of
these parameters are in the first fully connected layer, and it
has since been found that these FC layers can be removed with
no loss of performance, significantly reducing the number of
necessary parameters.
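The parameter figures above can be checked by counting. The sketch below assumes the commonly cited VGG-16 layout (3x3 convolutions with biases, 2x2 max pooling, a 224x224 RGB input and three FC layers of 4096, 4096 and 1000 units); that layout is an assumption brought in for the example, not something listed on the slides.

```python
# Assumed VGG-16 layout: numbers are output channels of 3x3 conv layers,
# "M" marks a 2x2 max pooling step that halves the spatial size.
CONV_CFG = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
            512, 512, 512, "M", 512, 512, 512, "M"]
FC_CFG = [4096, 4096, 1000]


def vgg16_param_counts(input_size=224, in_channels=3):
    """Return (total conv parameters, list of per-FC-layer parameters)."""
    size, channels = input_size, in_channels
    conv_total = 0
    for item in CONV_CFG:
        if item == "M":
            size //= 2
        else:
            conv_total += 3 * 3 * channels * item + item  # weights + biases
            channels = item
    fc_counts = []
    n_in = size * size * channels  # flattened 7x7x512 feature map
    for n_out in FC_CFG:
        fc_counts.append(n_in * n_out + n_out)
        n_in = n_out
    return conv_total, fc_counts


conv_total, fc_counts = vgg16_param_counts()
total = conv_total + sum(fc_counts)

print(total)          # 138357544
print(fc_counts[0])   # 102764544
```

Under these assumptions the first fully connected layer alone holds roughly three quarters of all parameters, which is consistent with the roughly 140M total quoted above.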
Case studies
• ResNet. The Residual Network developed by Kaiming He et al.
was the winner of ILSVRC 2015. It features special skip
connections and a heavy use of batch normalization. The
architecture also omits fully connected layers at the
end of the network. The reader is also referred to
Kaiming's presentation (video, slides), and some recent
experiments that reproduce these networks in Torch.
As of May 10, 2016, ResNets are by far the state-of-the-art
Convolutional Neural Network models and are the default
choice for using ConvNets in practice. In particular, see
also more recent developments that tweak the original
architecture: Kaiming He et al., Identity Mappings in Deep
Residual Networks (published March 2016).
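The idea of a skip connection can be sketched in a few lines: the block's input is added back to the output of its inner layers before the final activation. This simplified version uses two small dense transforms in place of the convolutions and batch normalization of a real ResNet block; the function name and sizes are assumptions for illustration.

```python
import numpy as np


def residual_block(x, w1, w2):
    """Compute relu(F(x) + x), where F is two linear maps with a ReLU between."""
    out = np.maximum(w1 @ x, 0.0)     # first transform + ReLU
    out = w2 @ out                    # second transform
    return np.maximum(out + x, 0.0)   # skip connection, then ReLU


x = np.array([1.0, 2.0, 3.0])
zeros = np.zeros((3, 3))

# With all-zero weights F(x) = 0, so the block passes x through unchanged.
y = residual_block(x, zeros, zeros)
print(y)
```

Because the block only has to learn a correction on top of the identity, stacking many such blocks does not force each one to relearn how to pass its input along.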
Architecture diagrams: VGG-16, GoogLeNet, ResNet
