0% found this document useful (0 votes)

11 views53 pages

02 Semantic Segmentation 2024

The document discusses semantic segmentation in computer vision, focusing on the classification of each pixel in an image without differentiating between instances. It highlights the challenges in data collection, evaluation metrics, and various methods such as Fully Convolutional Networks and Mask R-CNN for effective segmentation. Additionally, it contrasts semantic segmentation with other tasks like object detection and instance segmentation.

Uploaded by

20221235littlebottle

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views53 pages

02 Semantic Segmentation 2024

Uploaded by

20221235littlebottle

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 53

Lecture 11:

Semantic Segmentation

1
Computer Vision Tasks

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

This image is CC0 public domain

2
So far: Image Classification

Class Scores
Cat: 0.9
Dog: 0.05
Fully-Connected:
Car: 0.01
Vector: 4096 to 1000
This image is CC0 public domain ...
4096

Figure copyright Alex Krizhevsky, Ilya Sutskever, and

Geoffrey Hinton, 2012. Reproduced with permission.

3
Convolutional Neural Networks
Feature maps

Normalization

Spatial pooling

Non-linearity

Convolution
(Learned)

Input Image
Convolutional Neural Networks
Feature maps

Normalization

Spatial pooling

Non-linearity
.
.
Convolution .
(Learned)

Input Feature Map

Input Image
Convolutional Neural Networks
Feature maps

Normalization

Spatial pooling

Non-linearity

Convolution
(Learned)

Input Image
Convolutional Neural Networks
Feature maps

Normalization
Max

Spatial pooling

Non-linearity

Convolution
(Learned)

Input Image
Convolutional Neural Networks
Feature maps

Normalization

Spatial pooling Feature Maps Feature Maps

After Contrast
Normalization
Non-linearity

Convolution
(Learned)

Input Image
Convolutional Neural Networks
Feature maps

Normalization
Convolutional filters are trained in a
supervised manner by back-propagating
Spatial pooling classification error

Non-linearity

Convolution
(Learned)

Input Image
Simplified architecture

Softmax layer:
exp(w c ⋅ x)
P(c | x) = C

∑ exp(w k ⋅ x)
k=1
Tasks: Semantic Segmentation

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

11
Semantic Segmentation
This image is CC0 public domain

Label each pixel in the image

with a category label

Don’t differentiate instances,

only care about pixels

s
Sky Sky

ee
Tr

Tr
ee
s
Cat Cow

Grass Grass

12
Evaluation metric

• Pixel classification!
• Accuracy?
• Heavily unbalanced
• Intersection over Union
• Average across classes
and images
• Per-class accuracy
• Average across classes
and images
Challenges in data collection
• Precise localization is hard to annotate

• Annotating every pixel leads to heavy tails

• Common solution: annotate few classes (often things),

mark rest as “Other”

• Common datasets: PASCAL VOC 2012 (~1500 images,

20 categories), COCO (~100k images, 20 categories)
Example: TextonBoost

Label Image Model

field parameters

Local data
term

Smoothing
term

J. Shotton, J. Winn, C. Rother, and A. Criminisi,

TextonBoost: Joint Appearance, Shape And Context Modeling For Multi-class Object
Recognition And Segmentation, ECCV 2006.
Example: SuperParsing

• CRF energy function is defined on superpixels

• Unaries are based on nearest neighbor retrieval
• Pairwise potentials capture class co-occurrence statistics

J. Tighe and S. Lazebnik, SuperParsing: Scalable Nonparametric Image Parsing with Superpixels,
ECCV 2010
Example: SuperParsing
• CRF energy function is defined on superpixels
• Unaries are based on nearest neighbor retrieval
• Pairwise potentials capture class co-occurrence statistics

Maximum likelihood
Original image labeling Edge penalties Final labeling
sky sky

road

tree
sea
sea
road
sand sand

J. Tighe and S. Lazebnik, SuperParsing: Scalable Nonparametric Image Parsing with Superpixels,
ECCV 2010
Semantic segmentation using
convolutional networks

person
bicycle
Segmentation: Sliding Window
Extract Classify center
patch pixel with CNN
Full image
Cow

Cow

Grass

Farabet et al, “Learning Hierarchical Features for Scene Labeling,” TPAMI 2013
Pinheiro and Collobert, “Recurrent Convolutional Neural Networks for Scene Labeling”, ICML 2014

21
Segmentation: Sliding Window
Extract Classify center
patch pixel with CNN
Full image
Cow

Cow

Grass
Problem: Very inefficient! Not
reusing shared features
between overlapping patches

Farabet et al, “Learning Hierarchical Features for Scene Labeling,” TPAMI 2013
Pinheiro and Collobert, “Recurrent Convolutional Neural Networks for Scene Labeling”, ICML 2014

22
Fully Convolutional Network

Design a network as a bunch of convolutional

layers to make predictions for pixels all at once!

Conv Conv Conv Conv argmax

Input:
3xHxW Scores: Predictions:
Convolutions: CxHxW HxW
DxHxW
Loss function: Per-Pixel cross-entropy

Long et al, “Fully convolutional networks for semantic segmentation”, CVPR 2015

23
Fully Convolutional Network

Design a network as a bunch of convolutional

layers to make predictions for pixels all at once!

Conv Conv Conv Conv argmax

Input: Problem #1: Effective receptive

3 x H x W field size is linear in number of
conv layers: With L 3x3 conv
layers, receptive field is 1+2L

Long et al, “Fully convolutional networks for semantic segmentation”, CVPR 2015

24
Fully Convolutional Network

Design a network as a bunch of convolutional

layers to make predictions for pixels all at once!

Conv Conv Conv Conv argmax

Input:
3xHxW
Problem #1: Effective receptive
field size is linear in number of Problem #2: Convolution on
conv layers: With L 3x3 conv high res images is expensive!
layers, receptive field is 1+2L

Long et al, “Fully convolutional networks for semantic segmentation”, CVPR 2015

25
Fully Convolutional Network
Design network as a bunch of convolutional layers, with
downsampling and upsampling inside the network!

Med-res: Med-res:
D2 x H/4 x W/4 D2 x H/4 x W/4

Low-res:
Input: D3 x H/4 x
3xHxW High-res: W/4 High-res: Predictions:
D1 x H/2 x W/2 D1 x H/2 x W/2 HxW
Downsampling:
Upsampling:
Pooling, strided
???
convolution
Long, Shelhamer, and Darrell, “Fully Convolutional Networks for Semantic Segmentation”, CVPR 2015
Noh et al, “Learning Deconvolution Network for Semantic Segmentation”, ICCV 2015

26
In-Network Upsampling: “Unpooling”

Bed of Nails Nearest Neighbor

1 0 2 0 1 1 2 2
1 2 0 0 0 0 1 2 1 1 2 2
3 4 3 0 4 0 3 4 3 3 4 4
0 0 0 0 3 3 4 4
Input Output Input Output
Cx2x2 Cx4x4 Cx2x2 Cx4x4

27
Upsampling: Bilinear Interpolation

1.00 1.25 1.75 2.00

1 2 1.50 1.75 2.25 2.50

2.50 2.75 3.25 3.50

3 4
3.00 3.25 3.75 4.00

Input: C x 2 x 2 Output: C x 4 x 4

Use two closest neighbors in x and

y to construct linear approximations

28
Transposed Convolution

29
Transposed Convolution

30
Skip Connection

31
32
33
35
36
37
38
39
40
41
42
43
44
Tasks: Object Detection

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

This image is CC0 public domain

45
Object Detection Progress
Faster R-CNN

Fast R-CNN
”Slow” R-CNN

Reproduced with permission.

46
Tasks: Instance Segmentation

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

47
Instance Segmentation

Instance Segmentation:
Detect all objects in the Cow
image, and identify the
pixels that belong to
each object Cow

This image is CC0 public domain

48
Instance Segmentation

Instance Segmentation:
Detect all objects in the Cow
image, and identify the
pixels that belong to
each object Cow

Approach: Perform
object detection, then
predict a segmentation
mask for each object!

This image is CC0 public domain

49
Object Detection: Faster R-CNN

Ren et al, “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, NeurIPS 2015

50
Instance Segmentation: Mask R-CNN
Mask
Prediction

He et al, “Mask R-CNN”, ICCV 2017

51
Mask R-CNN
Classification Scores: C
Box coordinates (per class):
4*C

CNN Conv Conv

RoI Align
+RPN
256 x 14 x 14 256 x 14 x 14
Predict a mask for
each of C classes:
C x 28 x 28

He et al, “Mask R-CNN”, ICCV 2017

52
Mask R-CNN: Very Good Results!

He et al, “Mask R-CNN”, ICCV 2017

53
Summary: Computer Vision Tasks

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

This image is CC0 public domain

12. Object Detection-compressed
No ratings yet
12. Object Detection-compressed
80 pages
Lect-7 Segmentation Localization
No ratings yet
Lect-7 Segmentation Localization
151 pages
AML - Lecture_10 - 15nov24
No ratings yet
AML - Lecture_10 - 15nov24
169 pages
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
No ratings yet
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
60 pages
deep-segmentation (3)
No ratings yet
deep-segmentation (3)
38 pages
Lecture 5
No ratings yet
Lecture 5
36 pages
8-Image Detection and Segmentation
No ratings yet
8-Image Detection and Segmentation
73 pages
lecture4
No ratings yet
lecture4
46 pages
Object Detyection Using CNN
No ratings yet
Object Detyection Using CNN
113 pages
05 CNN 2
No ratings yet
05 CNN 2
92 pages
5. Object Detection and Segmentation - part 2
No ratings yet
5. Object Detection and Segmentation - part 2
36 pages
L10-Lecture-Detection.Segmentation-v2.5
No ratings yet
L10-Lecture-Detection.Segmentation-v2.5
35 pages
Lecture 5 - CNNs For Detection and Segmentation
No ratings yet
Lecture 5 - CNNs For Detection and Segmentation
62 pages
DL UNIT 5
No ratings yet
DL UNIT 5
63 pages
Chapter 7 - Part 3 - DL For CV
No ratings yet
Chapter 7 - Part 3 - DL For CV
79 pages
Lecture-21-Semantic-Segmentation
No ratings yet
Lecture-21-Semantic-Segmentation
24 pages
Lecture Sematic-Segmentation
No ratings yet
Lecture Sematic-Segmentation
23 pages
Overview of semantic segmentation
No ratings yet
Overview of semantic segmentation
20 pages
NN 09
No ratings yet
NN 09
34 pages
CS60010_CNN 4
No ratings yet
CS60010_CNN 4
32 pages
REF-6-DeepLab_Semantic_Image_Segmentation_with_Deep_Convolutional_Nets_Atrous_Convolution_and_Fully_Connected_CRFs
No ratings yet
REF-6-DeepLab_Semantic_Image_Segmentation_with_Deep_Convolutional_Nets_Atrous_Convolution_and_Fully_Connected_CRFs
15 pages
Semantic Segmentation by Using Down-Sampling and S
No ratings yet
Semantic Segmentation by Using Down-Sampling and S
14 pages
He Mask R-CNN ICCV 2017 Paper PDF
No ratings yet
He Mask R-CNN ICCV 2017 Paper PDF
9 pages
2015 - DeepLab v1 - Semantic Image Segmentation With Deep Convolutional Nets and Fully Connected Crfs
No ratings yet
2015 - DeepLab v1 - Semantic Image Segmentation With Deep Convolutional Nets and Fully Connected Crfs
14 pages
Fully_Convolutional_Networks_for_Semantic_Segmentation
No ratings yet
Fully_Convolutional_Networks_for_Semantic_Segmentation
12 pages
He Mask R-CNN Iccv 2017 Paper
No ratings yet
He Mask R-CNN Iccv 2017 Paper
9 pages
Deconvolution Network ICCV 2015 Paper PDF
No ratings yet
Deconvolution Network ICCV 2015 Paper PDF
9 pages
segmentation_by_gan
No ratings yet
segmentation_by_gan
18 pages
Seggpt Paper
No ratings yet
Seggpt Paper
12 pages
Sensors: Depth Estimation and Semantic Segmentation From A Single RGB Image Using A Hybrid Convolutional Neural Network
No ratings yet
Sensors: Depth Estimation and Semantic Segmentation From A Single RGB Image Using A Hybrid Convolutional Neural Network
20 pages
2018 - Understanding Convolution For Semantic Segmentation
No ratings yet
2018 - Understanding Convolution For Semantic Segmentation
10 pages
1511.04377v3
No ratings yet
1511.04377v3
10 pages
Segmentation-Aware Convolutional Networks Using Local Attention Masks
No ratings yet
Segmentation-Aware Convolutional Networks Using Local Attention Masks
11 pages
Harley MSC Thesis Menos Especializadpo
No ratings yet
Harley MSC Thesis Menos Especializadpo
71 pages
Vision
No ratings yet
Vision
24 pages
A Beginner's Guide to Deep Learning Based Semantic Segmentation Using Keras _ Divam Gupta
No ratings yet
A Beginner's Guide to Deep Learning Based Semantic Segmentation Using Keras _ Divam Gupta
14 pages
Dlcv2017d3l1segmentation 170623173102
No ratings yet
Dlcv2017d3l1segmentation 170623173102
36 pages
Semantic Segmentation: Tingwu Wang Machine Learning Group, University of Toronto
No ratings yet
Semantic Segmentation: Tingwu Wang Machine Learning Group, University of Toronto
28 pages
Instance Segmentation
No ratings yet
Instance Segmentation
51 pages
He 2017
No ratings yet
He 2017
9 pages
NNDL Unit 5
No ratings yet
NNDL Unit 5
21 pages
Fully Convolutional Networks For Semantic Segmentation
No ratings yet
Fully Convolutional Networks For Semantic Segmentation
12 pages
Strudel Transformer Segmentation
No ratings yet
Strudel Transformer Segmentation
17 pages
Mask
No ratings yet
Mask
12 pages
Semantic Image Segmentation With Task-Specific Edge Detection Using Cnns and A Discriminatively Trained Domain Transform
No ratings yet
Semantic Image Segmentation With Task-Specific Edge Detection Using Cnns and A Discriminatively Trained Domain Transform
10 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Fully Convolutional Networks For Semantic Segmentation: Jonathan Long Evan Shelhamer Trevor Darrell UC Berkeley
No ratings yet
Fully Convolutional Networks For Semantic Segmentation: Jonathan Long Evan Shelhamer Trevor Darrell UC Berkeley
10 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
large kernel matters
No ratings yet
large kernel matters
11 pages
【全局卷积GAP】2017_Large_Kernel _Matters_Improve_Semantic_Segmentation_by_Global_Convolutional_Network
No ratings yet
【全局卷积GAP】2017_Large_Kernel _Matters_Improve_Semantic_Segmentation_by_Global_Convolutional_Network
9 pages
2018_ SeGAN_adversarial network with multi-scale l 1 loss for medical
No ratings yet
2018_ SeGAN_adversarial network with multi-scale l 1 loss for medical
10 pages
W-Net A Deep Model For Fully Unsupervised Image Segmentation
No ratings yet
W-Net A Deep Model For Fully Unsupervised Image Segmentation
13 pages
Image Segmentation Keras: Implementation of Segnet, FCN, Unet, Pspnet and Other Models in Keras
No ratings yet
Image Segmentation Keras: Implementation of Segnet, FCN, Unet, Pspnet and Other Models in Keras
5 pages
Exact Differential Equation
No ratings yet
Exact Differential Equation
12 pages
Segmentation Detection
100% (1)
Segmentation Detection
109 pages
Fully Convolutional Networks For Semantic Segmentation
No ratings yet
Fully Convolutional Networks For Semantic Segmentation
17 pages
Thesis AlexanderJaus BIBTEX
No ratings yet
Thesis AlexanderJaus BIBTEX
9 pages
Sensors: Semantic Segmentation With Transfer Learning For Off-Road Autonomous Driving
No ratings yet
Sensors: Semantic Segmentation With Transfer Learning For Off-Road Autonomous Driving
21 pages
Unit 2 - Source Coding-4
No ratings yet
Unit 2 - Source Coding-4
57 pages
Journal of Computer Science Research - Vol.5, Iss.3 July 2023
No ratings yet
Journal of Computer Science Research - Vol.5, Iss.3 July 2023
78 pages
A Comparative Study of Real-Time Semantic Segmentation For Autonomous Driving
No ratings yet
A Comparative Study of Real-Time Semantic Segmentation For Autonomous Driving
11 pages
IT5409 - Ch7 - Part3 - DL For CV-v2 - 4pages
No ratings yet
IT5409 - Ch7 - Part3 - DL For CV-v2 - 4pages
42 pages
Lab Based Project Report
No ratings yet
Lab Based Project Report
40 pages
Introduction To MATLAB For Engineers, Third Edition
No ratings yet
Introduction To MATLAB For Engineers, Third Edition
49 pages
unit-1-dm
No ratings yet
unit-1-dm
62 pages
Moment Distribution Method
100% (2)
Moment Distribution Method
62 pages
Examples - Naive Bayes - Baysian Network
No ratings yet
Examples - Naive Bayes - Baysian Network
24 pages
House Price Prediction
100% (1)
House Price Prediction
17 pages
SimulatedAnnealing pdf
No ratings yet
SimulatedAnnealing pdf
26 pages
Cubic: Qian HE (Steve) CS 577 - Prof. Bob Kinicki
No ratings yet
Cubic: Qian HE (Steve) CS 577 - Prof. Bob Kinicki
22 pages
Heart Rate Variability
No ratings yet
Heart Rate Variability
15 pages
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms For Dcts and Dsts
No ratings yet
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms For Dcts and Dsts
20 pages
Diya Basera
No ratings yet
Diya Basera
15 pages
2-3 Tree PDF
No ratings yet
2-3 Tree PDF
8 pages
Simplex Projection Walkthrough
No ratings yet
Simplex Projection Walkthrough
8 pages
Modelling and Simulation of DC Drive Using PI and PID Controller
No ratings yet
Modelling and Simulation of DC Drive Using PI and PID Controller
4 pages
Sure! Here's a 500-word lecture on
No ratings yet
Sure! Here's a 500-word lecture on
3 pages
Econometric Lecture1
No ratings yet
Econometric Lecture1
13 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Intelligent System Dessign-Approaches
No ratings yet
Intelligent System Dessign-Approaches
7 pages
Law 1 Stat q3 Week 1 2
No ratings yet
Law 1 Stat q3 Week 1 2
12 pages
Introduction To Automata Theory, Languages, and Computation: Solutions For Chapter 8
No ratings yet
Introduction To Automata Theory, Languages, and Computation: Solutions For Chapter 8
5 pages
Artificial Neural Networks in Pattern Recognition: Mohammadreza Yadollahi, Ale S Proch Azka
No ratings yet
Artificial Neural Networks in Pattern Recognition: Mohammadreza Yadollahi, Ale S Proch Azka
8 pages
Bayesian Workshop - Syllabus
No ratings yet
Bayesian Workshop - Syllabus
2 pages
Rubrics
No ratings yet
Rubrics
1 page
Assembly Line Balancing Methods-A Case Study: Vrittika V Pachghare, R. S. Dalu
No ratings yet
Assembly Line Balancing Methods-A Case Study: Vrittika V Pachghare, R. S. Dalu
5 pages
Thales Luna Network HSM
No ratings yet
Thales Luna Network HSM
3 pages
Gauss Seidel MethodRTF
No ratings yet
Gauss Seidel MethodRTF
5 pages
CE 383 Course Outline Spring 2017 2018
No ratings yet
CE 383 Course Outline Spring 2017 2018
2 pages
EECP 3375-Digital Control Systems
No ratings yet
EECP 3375-Digital Control Systems
1 page