EE 589/689 Foundations of computer vision: Lecture notes
Fall quarter 2006, OGI/OHSU
Miguel Á. Carreira-Perpiñán
Based mainly on:
David Forsyth and Jean Ponce: Computer Vision. A Modern Approach. Prentice-Hall, 2003.
Introduction to computer vision
Computer vision has been around since the 1960s. Recent developments:
• Increasing availability of cheap, powerful cameras (e.g. digital cameras, webcams) and other
sensors.
• Increasing availability of massive amounts of image and multimedia content on the web
(e.g. face databases, streaming video or image-based communication).
• Increasing availability of cheap, powerful computers (processor speed and memory capac-
ity).
• Introduction of techniques from machine learning and statistics (complex, data-driven mod-
els and algorithms).
Three related areas:
[Diagram: image processing maps 2D image(s) to 2D image(s); computer vision maps 2D image(s) to the 3D world; computer graphics maps the 3D world to 2D image(s).]
• Computer graphics: representation of a 3D scene in 2D image(s).
• Computer vision: recovery of information about the 3D world from 2D image(s); the inverse
problem of computer graphics.
• Image processing: operate on one image to produce another image (e.g. denoising, deblurring, enhancement, deconvolution, in particular in medical imaging).
Some problems of computer vision:
• Structure-from-motion (3D reconstruction from multiple views, stereo reconstruction)
• Shape-from-X (single image):
– shape-from-texture
– shape-from-shading
– shape-from-focus
• Segmentation
• Tracking
• Object recognition
A few applications of computer vision:
• Structure-from-motion:
– Throw away motion, keep structure: image-based rendering (e.g. 3D models of build-
ings, etc. for architecture or entertainment industry)
– Throw away structure, keep motion: mobile robot control (we know the structure but
not the robot location)
• Image collections:
– Image retrieval: find me pictures containing cars and trees
– Image annotation: textual description of objects in image
• Finding faces in a group picture, crowd, etc.
• Recovering articulated pose of a person from a video
• Medical applications:
– Image enhancement
– Segmentation of brain
– Image registration or alignment: compare brains of different people, or brains be-
fore/after lesion
– Blood vessels: track cells
– Unobtrusive patient monitoring
• HCI: track eye motion; recognize physical gestures (e.g. sign language)
Mean-shift clustering
Represent each pixel x_n, n = 1, . . . , N, by a feature vector as in spectral clustering, typically
position & intensity (i, j, I) or colour (i, j, L*, u*, v*).
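As a concrete illustration, here is a minimal NumPy sketch that builds (i, j, I) feature vectors from a greyscale image; the function name pixel_features and the intensity_scale weighting of the intensity channel against the pixel coordinates are illustrative choices (the weighting acts as a second, per-feature bandwidth).

    import numpy as np

    def pixel_features(image, intensity_scale=1.0):
        # One feature vector (i, j, I) per pixel of a 2-D greyscale image.
        # intensity_scale weights intensity against the pixel coordinates
        # (an illustrative choice, not fixed by the notes).
        rows, cols = image.shape
        i, j = np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij")
        return np.column_stack([i.ravel(), j.ravel(),
                                intensity_scale * image.ravel().astype(float)])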
Idea: define a function that represents the density of the data set {x_n}_{n=1}^N ⊂ R^D, then declare
each maximum as a cluster representative and assign each pixel to a maximum via the mean-shift
algorithm.
Kernel density estimate (smooth multivariate histogram) with bandwidth σ:
    p(x) = \sum_{n=1}^{N} p(n)\, p(x|n) = \frac{1}{N} \sum_{n=1}^{N} K\!\left( \frac{x - x_n}{\sigma} \right), \qquad x \in \mathbb{R}^D
The kernel K satisfies ∫ K(x) dx = 1 and K(x) ≥ 0 (so the kernel is a pdf). Typical kernels:
• Gaussian (infinite support): K((x − x_n)/σ) ∝ exp(−½ ‖(x − x_n)/σ‖²).
• Epanechnikov (finite support): K((x − x_n)/σ) ∝ 0 if ‖(x − x_n)/σ‖ > 1, and 1 − ‖(x − x_n)/σ‖² otherwise.
[Figure: a one-dimensional kernel density estimate p(x) whose modes define the clusters C1, C2, C3.]
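As a quick check of the definition, here is a small NumPy sketch that evaluates the Gaussian kernel density estimate at a query point x; the function name and interface are illustrative.

    import numpy as np

    def gaussian_kde(x, X, sigma):
        # p(x) = (1/N) sum_n K((x - x_n)/sigma) with the normalised Gaussian
        # kernel, for a data matrix X of shape (N, D) and query point x of shape (D,).
        N, D = X.shape
        sq = np.sum(((x - X) / sigma) ** 2, axis=1)      # ||(x - x_n)/sigma||^2
        return np.mean(np.exp(-0.5 * sq)) / (2 * np.pi * sigma ** 2) ** (D / 2)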
Mean-shift algorithm for Gaussian kde: maxima (also minima, saddle points) of p(x) satisfy
    0 = \nabla p(x) \propto -\frac{1}{N} \sum_{n=1}^{N} e^{-\frac{1}{2} \left\| \frac{x - x_n}{\sigma} \right\|^2} \frac{x - x_n}{\sigma^2} \propto p(x) \sum_{n=1}^{N} p(n|x)\,(x - x_n) \;\Longrightarrow\; x = \sum_{n=1}^{N} p(n|x)\, x_n = f(x)
with “shifts” x − xn (thus ∇p(x) ∝ mean shift) and posterior probabilities (by Bayes’ th.)
    p(n|x) = \frac{p(x|n)\, p(n)}{p(x)} = \frac{\exp\left( -\frac{1}{2} \left\| (x - x_n)/\sigma \right\|^2 \right)}{\sum_{n'=1}^{N} \exp\left( -\frac{1}{2} \left\| (x - x_{n'})/\sigma \right\|^2 \right)}.
This shows that the fixed points of f are stationary points of p, and suggests defining a fixed-point
iterative scheme by starting from a data point xn and iteratively applying f till convergence (and
repeating this for all pixels). It is possible to prove that this algorithm converges from nearly
any initial x to a maximum with linear convergence rate (in fact it is an EM algorithm).
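A minimal NumPy sketch of this fixed-point iteration from a single starting point; the tolerance tol and the iteration cap max_iter are illustrative stopping choices.

    import numpy as np

    def mean_shift_point(x0, X, sigma, tol=1e-5, max_iter=500):
        # Iterate x <- f(x) = sum_n p(n|x) x_n for the Gaussian kde of the data X (N x D).
        x = np.asarray(x0, dtype=float)
        for _ in range(max_iter):
            w = np.exp(-0.5 * np.sum(((x - X) / sigma) ** 2, axis=1))  # proportional to p(n|x)
            x_new = w @ X / w.sum()                                    # posterior mean f(x)
            if np.linalg.norm(x_new - x) < tol:
                return x_new
            x = x_new
        return x

Started from a data point x_n, the iterates climb the density and stop at the mode that x_n belongs to.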
Advantages:
• Nonparametric clustering: only need to set σ.
• No step size needed.
• Works well with clusters having complex shapes.
• The number of clusters is determined automatically by σ.
Disadvantages:
• The mean-shift iteration is slow.
• Large total computational cost: O(kN²), where k is the average number of mean-shift iterations
per pixel (k ≈ 20–100). Accelerations are possible that produce almost the same segmentation.
Gaussian mean-shift (GMS) algorithm:

    for n ∈ {1, . . . , N}                                        For each data point
        x ← x_n                                                   Starting point
        repeat                                                    Iteration loop
            ∀n: p(n|x) ← exp(−½‖(x − x_n)/σ‖²) / ∑_{n′=1}^N exp(−½‖(x − x_{n′})/σ‖²)
            x ← ∑_{n=1}^N p(n|x) x_n                              Update x
        until x’s update < tol
        z_n ← x                                                   Maximum
    end
    connected-components({z_n}_{n=1}^N)                           Clusters

Gaussian blurring mean-shift (GBMS) algorithm:

    repeat                                                        Iteration loop
        for m ∈ {1, . . . , N}                                    For each data point
            ∀n: p(n|x_m) ← exp(−½‖(x_m − x_n)/σ‖²) / ∑_{n′=1}^N exp(−½‖(x_m − x_{n′})/σ‖²)
            y_m ← ∑_{n=1}^N p(n|x_m) x_n                          One GMS step
        end
        ∀m: x_m ← y_m                                             Update whole data set
    until stop
    connected-components({x_n}_{n=1}^N)                           Clusters

Figure 1: Pseudocode for GMS and GBMS. The “connected-components” step collects all equivalent but numerically
slightly different points.
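The following compact NumPy sketch follows the GMS pseudocode, advancing all N iterates in lockstep for brevity and using a simple greedy grouping in place of the connected-components step; the merge threshold σ/10 and the stopping rule are illustrative choices.

    import numpy as np

    def gms_cluster(X, sigma, tol=1e-5, max_iter=500):
        # Gaussian mean-shift clustering of the rows of X (shape N x D).
        X = np.asarray(X, dtype=float)
        Z = X.copy()                                     # one iterate per data point
        for _ in range(max_iter):
            d2 = np.sum((Z[:, None, :] - X[None, :, :]) ** 2, axis=2) / sigma ** 2
            P = np.exp(-0.5 * d2)
            P /= P.sum(axis=1, keepdims=True)            # row m holds p(n | z_m)
            Z_new = P @ X                                # one mean-shift step per point
            done = np.max(np.linalg.norm(Z_new - Z, axis=1)) < tol
            Z = Z_new
            if done:
                break
        # greedy stand-in for connected-components: group numerically close modes
        labels, modes = np.full(len(X), -1, dtype=int), []
        for m in range(len(X)):
            for k, mode in enumerate(modes):
                if np.linalg.norm(Z[m] - mode) < sigma / 10:
                    labels[m] = k
                    break
            else:
                modes.append(Z[m])
                labels[m] = len(modes) - 1
        return labels, np.array(modes)

For an image, X would be the per-pixel feature matrix from the start of this section; each iteration costs O(N²), consistent with the total O(kN²) cost quoted above.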
Mean-shift blurring clustering
Like mean-shift clustering, but actually move the data points at each step. It obtains segmentations
very similar to those of mean-shift clustering but considerably faster (cubic convergence rate for
Gaussian clusters): the total computational cost is still O(kN²), but k is much smaller (k ≈ 5).
It is related to spectral clustering, since effectively the algorithm iterates (X ← PX, update P),
where X = (x_1, . . . , x_N), P = D^{−1}A is the stochastic, random-walk matrix,
A_{mn} = exp(−½‖(x_m − x_n)/σ‖²) are the Gaussian affinities and D = diag(∑_{n=1}^N A_{mn}) is the
degree matrix.
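The matrix form of the update lends itself to a direct NumPy sketch, with the data points stored as the rows of X so that X ← PX is a valid matrix product; the stopping rule based on the change in X is an illustrative choice.

    import numpy as np

    def gbms(X, sigma, tol=1e-5, max_iter=100):
        # Gaussian blurring mean-shift: repeatedly replace the whole data set by P X,
        # where P = D^{-1} A is the random-walk matrix of the Gaussian affinities,
        # recomputed at every step.
        X = np.asarray(X, dtype=float).copy()
        for _ in range(max_iter):
            d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2) / sigma ** 2
            A = np.exp(-0.5 * d2)                        # affinities A_mn
            P = A / A.sum(axis=1, keepdims=True)         # P = D^{-1} A
            X_new = P @ X                                # one blurring step: X <- P X
            if np.max(np.linalg.norm(X_new - X, axis=1)) < tol:
                return X_new
            X = X_new
        return X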
Exercises
1. Derive the mean-shift algorithm for a general kernel K.
2. Prove that the mean-shift algorithm is gradient ascent on p(x) with an adaptive step size.
3. Derive the mean-shift algorithm for the Gaussian kde with a bandwidth that is a full
covariance matrix Σn for each point.
Figure 2: Segmentation results with GMS for a 50 × 40 hand image: number of clusters and number of iterations as a function of the bandwidth σ.