CNN Apps
CNN Apps
Image recognition
Sanjay Sharma
Dense neural network and Convolutional
neural network
A simple CNN structure
Padding on the
input volume with
zeros in such way
that the conv layer
does not alter the
spatial dimensions
of the input
Rectified linear unit,ReLU
Pooling layer
Pooling
MNIST dataset
The MNIST database of handwritten
digits,
available from this page,
has a training set of 60,000
examples, and a test set of 10,000
examples.
It is a subset of a larger set
available from NIST.
The digits have been size-
normalized and centered in a fixed-
size image.
LeNet-5 for MNIST
CIFAR10 dataset and state of the art
The CIFAR-10 dataset consists of 60000 32x32 color images in 10 classes,
with 6000 images per class. There are 50000 training images and 10000 test images.
ImageNet
• The ImageNet project is a large visual database designed
for use in visual object recognition software research. As
of 2016, over ten million URLs of images have been hand-
annotated by ImageNet to indicate what objects are
pictured; in at least one million of the images, bounding
boxes are also provided.[1] The database of annotations of
third-party image URL's is freely available directly from
ImageNet; however, the actual images are not owned by
ImageNet.[2] Since 2010, the ImageNet project runs an
annual software contest, the ImageNet Large Scale Visual
Recognition Challenge (ILSVRC), where software
programs compete to correctly classify and detect objects
and scenes.
Case studies
• LeNet. The first successful applications of Convolutional
Networks were developed by Yann LeCun in 1990’s. Of these,
the best known is the LeNet architecture that was used to read
zip codes, digits, etc.