SVM and Kernels

The document discusses kernels and the kernel trick in support vector machines. It explains that kernels allow computing the dot product of data points mapped into a higher dimensional feature space without explicitly performing the mapping. This is done through kernel functions that evaluate the similarity between pairs of data points. Common kernel functions include polynomial kernels and Gaussian radial basis function kernels. The kernel trick allows support vector machines to operate in infinite dimensional feature spaces while maintaining computational efficiency.


Kernels and the Kernel Trick

Martin Hofmann

Reading Club "Support Vector Machines"


Optimization Problem

maximize:

$W(\alpha) = \sum_{i=1}^{m} \alpha_i - \frac{1}{2} \sum_{i,j=1}^{m} \alpha_i \alpha_j y_i y_j \langle x_i \cdot x_j \rangle$

subject to $\alpha_i \geq 0$, $i = 1, \dots, m$ and $\sum_{i=1}^{m} \alpha_i y_i = 0$

data not linearly separable in input space

map into some feature space where the data is linearly separable
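As a minimal illustration (not from the slides), the dual objective can be evaluated directly against the matrix of inner products; the function name dual_objective and the numpy formulation are assumptions of this sketch.

```python
import numpy as np

def dual_objective(alpha, X, y):
    """W(alpha) = sum_i alpha_i - 1/2 * sum_{i,j} alpha_i alpha_j y_i y_j <x_i . x_j>."""
    G = X @ X.T                # matrix of plain inner products <x_i . x_j>
    v = alpha * y              # elementwise products alpha_i * y_i
    return alpha.sum() - 0.5 * v @ G @ v
```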


Mapping Example
map data points into feature space with some function $\Phi$
e.g.:

$\Phi : \mathbb{R}^2 \to \mathbb{R}^3$
$(x_1, x_2) \mapsto (z_1, z_2, z_3) := (x_1^2, \sqrt{2}\,x_1 x_2, x_2^2)$

hyperplane $\langle w \cdot z \rangle = 0$, as a function of $x$:

$w_1 x_1^2 + w_2 \sqrt{2}\,x_1 x_2 + w_3 x_2^2 = 0$
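A short sketch of this explicit mapping and of the hyperplane evaluated in input space; the names phi and hyperplane are illustrative, not from the slides.

```python
import numpy as np

def phi(x):
    """Explicit feature map (x1, x2) -> (x1^2, sqrt(2)*x1*x2, x2^2)."""
    x1, x2 = x
    return np.array([x1 ** 2, np.sqrt(2) * x1 * x2, x2 ** 2])

def hyperplane(w, x):
    """Evaluate w1*x1^2 + w2*sqrt(2)*x1*x2 + w3*x2^2; the decision surface is where this is 0."""
    return w @ phi(x)
```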


Kernel Trick
solve maximisation problem using mapped data points

$W(\alpha) = \sum_{i=1}^{m} \alpha_i - \frac{1}{2} \sum_{i,j=1}^{m} \alpha_i \alpha_j y_i y_j \langle \Phi(x_i) \cdot \Phi(x_j) \rangle$

Dual representation of the hyperplane (from the primal Lagrangian):

$f(x) = \langle w \cdot x \rangle + b = \sum_i \alpha_i y_i \langle x_i \cdot x \rangle + b$, with $w = \sum_i \alpha_i y_i x_i$

weight vector represented only by data points
only inner products of data points necessary, no coordinates
kernel function $K(x_i, x_j) = \langle \Phi(x_i) \cdot \Phi(x_j) \rangle$
explicit mapping $\Phi$ not necessary any more
possible to operate in any n-dimensional feature space
complexity independent of the feature space
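A sketch of the same dual objective and decision function with the inner products replaced by a kernel, so no feature-space coordinates are ever computed; the precomputed matrix K, the kernel argument, and the function names are assumptions of this sketch.

```python
import numpy as np

def dual_objective_kernel(alpha, K, y):
    """W(alpha) with <Phi(x_i) . Phi(x_j)> replaced by the precomputed kernel matrix K."""
    v = alpha * y
    return alpha.sum() - 0.5 * v @ K @ v

def decision_function(x, X, y, alpha, b, kernel):
    """f(x) = sum_i alpha_i y_i K(x_i, x) + b -- w and Phi never appear explicitly."""
    return sum(a * yi * kernel(xi, x) for a, yi, xi in zip(alpha, y, X)) + b
```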


Example Kernel Trick


$\vec{x} = (x_1, x_2)$
$\vec{z} = (z_1, z_2)$

$K(x, z) = \langle \vec{x} \cdot \vec{z} \rangle^2$

$K(x, z) = \langle \vec{x} \cdot \vec{z} \rangle^2$
$= (x_1 z_1 + x_2 z_2)^2$
$= x_1^2 z_1^2 + 2 x_1 z_1 x_2 z_2 + x_2^2 z_2^2$
$= \langle (x_1^2, \sqrt{2}\,x_1 x_2, x_2^2) \cdot (z_1^2, \sqrt{2}\,z_1 z_2, z_2^2) \rangle$
$= \langle \Phi(\vec{x}) \cdot \Phi(\vec{z}) \rangle$

mapping function fused in $K$

implicit $\Phi(\vec{x}) = (x_1^2, \sqrt{2}\,x_1 x_2, x_2^2)$
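A quick numerical check of this identity; the sample values are chosen only for illustration.

```python
import numpy as np

def phi(x):
    # implicit feature map recovered from expanding (x1*z1 + x2*z2)^2
    x1, x2 = x
    return np.array([x1 ** 2, np.sqrt(2) * x1 * x2, x2 ** 2])

x = np.array([1.0, 2.0])
z = np.array([3.0, -1.0])

lhs = (x @ z) ** 2           # kernel evaluated in input space
rhs = phi(x) @ phi(z)        # inner product in feature space
assert np.isclose(lhs, rhs)  # both equal 1.0: (1*3 + 2*(-1))^2 = 1
```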


Typical Kernels
Polynomial Kernel

$K(x, z) = (\langle x \cdot z \rangle + \theta)^d$, for $d \geq 0$

Radial Basis Function (Gaussian Kernel)

$K(x, z) = e^{-\frac{\|x - z\|^2}{2\sigma^2}}$, where $\|x\| := \sqrt{\langle x \cdot x \rangle}$

(Sigmoid Kernel)

$K(x, z) = \tanh(\kappa \langle x \cdot z \rangle + \theta)$

Inverse multi-quadric

$K(x, z) = \frac{1}{\sqrt{\|x - z\|^2 / (2\sigma^2) + c^2}}$
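The four kernels above written as plain numpy functions; this is a sketch, and the default hyperparameter values are arbitrary assumptions.

```python
import numpy as np

def poly_kernel(x, z, theta=1.0, d=2):
    return (x @ z + theta) ** d

def rbf_kernel(x, z, sigma=1.0):
    return np.exp(-np.sum((x - z) ** 2) / (2 * sigma ** 2))

def sigmoid_kernel(x, z, kappa=1.0, theta=0.0):
    return np.tanh(kappa * (x @ z) + theta)

def inv_multiquadric_kernel(x, z, sigma=1.0, c=1.0):
    return 1.0 / np.sqrt(np.sum((x - z) ** 2) / (2 * sigma ** 2) + c ** 2)
```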


Typical Kernels Cont.


Kernels for Sets: for two sets $A = \{x_1, \dots, x_N\}$ and $A' = \{x'_1, \dots, x'_{N'}\}$

$K_s(A, A') = \sum_{i=1}^{N} \sum_{j=1}^{N'} k(x_i, x'_j)$

where $k(x_i, x'_j)$ is a kernel on the elements of $A$ and $A'$
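A minimal sketch of this set kernel, summing an element-level kernel over all pairs; set_kernel and element_kernel are illustrative names, not from the slides.

```python
def set_kernel(A, B, element_kernel):
    """K_s(A, B) = sum over all pairs (a, b) of element_kernel(a, b)."""
    return sum(element_kernel(a, b) for a in A for b in B)

# e.g. set_kernel(A, B, rbf_kernel), with the RBF kernel sketched above
```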

Kernels for strings (Spectral Kernels) and trees

no one-fits-all kernel
model search and cross-validation in practice
a low-degree polynomial or an RBF kernel is a good initial try


Kernel Properties

Symmetry

$K(x, z) = \langle \Phi(x) \cdot \Phi(z) \rangle = \langle \Phi(z) \cdot \Phi(x) \rangle = K(z, x)$

Cauchy-Schwarz Inequality

$K(x, z)^2 = \langle \Phi(x) \cdot \Phi(z) \rangle^2 \leq \|\Phi(x)\|^2 \|\Phi(z)\|^2 = \langle \Phi(x) \cdot \Phi(x) \rangle \langle \Phi(z) \cdot \Phi(z) \rangle = K(x, x) K(z, z)$
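Both properties can be spot-checked numerically, e.g. for the Gaussian kernel; this is a sketch with arbitrary sample points.

```python
import numpy as np

def rbf(a, b, sigma=1.0):
    return np.exp(-np.sum((a - b) ** 2) / (2 * sigma ** 2))

rng = np.random.default_rng(0)
x, z = rng.normal(size=2), rng.normal(size=2)

assert np.isclose(rbf(x, z), rbf(z, x))                  # symmetry
assert rbf(x, z) ** 2 <= rbf(x, x) * rbf(z, z) + 1e-12   # Cauchy-Schwarz bound
```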


Making Kernels from Kernels

create complex Kernels by combining simpler ones


Closure Properties:

$K(x, z) = c \cdot K_1(x, z)$
$K(x, z) = c + K_1(x, z)$
$K(x, z) = K_1(x, z) + K_2(x, z)$
$K(x, z) = K_1(x, z) \cdot K_2(x, z)$
$K(x, z) = f(x)\,f(z)$

if $K_1$ and $K_2$ are kernels, $f : X \to \mathbb{R}$, and $c > 0$
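These closure rules translate directly into higher-order functions; a sketch in which all names are assumptions.

```python
def scale(k, c):
    """c * K1(x, z), valid for c > 0."""
    return lambda x, z: c * k(x, z)

def add(k1, k2):
    """K1(x, z) + K2(x, z)."""
    return lambda x, z: k1(x, z) + k2(x, z)

def multiply(k1, k2):
    """K1(x, z) * K2(x, z)."""
    return lambda x, z: k1(x, z) * k2(x, z)

def from_function(f):
    """f(x) * f(z) for any real-valued f."""
    return lambda x, z: f(x) * f(z)

# e.g. combined = add(scale(poly_kernel, 0.5), rbf_kernel)  # still a valid kernel
```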


Gram Matrix

Kernel function as similarity measure between input objects


Gram Matrix (Similarity/Kernel Matrix) represents the similarities between input vectors

let $V = \{\vec{v}_1, \dots, \vec{v}_n\}$ be a set of input vectors; then the Gram Matrix $K$ is defined as:

$K = \begin{pmatrix} \langle \Phi(\vec{v}_1) \cdot \Phi(\vec{v}_1) \rangle & \dots & \langle \Phi(\vec{v}_1) \cdot \Phi(\vec{v}_n) \rangle \\ \langle \Phi(\vec{v}_2) \cdot \Phi(\vec{v}_1) \rangle & \dots & \vdots \\ \vdots & & \vdots \\ \langle \Phi(\vec{v}_n) \cdot \Phi(\vec{v}_1) \rangle & \dots & \langle \Phi(\vec{v}_n) \cdot \Phi(\vec{v}_n) \rangle \end{pmatrix}$

$K$ is symmetric and positive semi-definite (non-negative eigenvalues)
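A sketch of building a Gram matrix and checking both properties; the kernel choice, data, and names are arbitrary assumptions.

```python
import numpy as np

def gram_matrix(X, kernel):
    """K[i, j] = kernel(v_i, v_j) for every pair of input vectors."""
    return np.array([[kernel(xi, xj) for xj in X] for xi in X])

X = np.random.default_rng(1).normal(size=(5, 2))
K = gram_matrix(X, lambda a, b: (a @ b + 1.0) ** 2)   # degree-2 polynomial kernel

assert np.allclose(K, K.T)                      # symmetric
assert np.all(np.linalg.eigvalsh(K) >= -1e-9)   # non-negative eigenvalues (PSD)
```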


Mercer's Theorem
assume:

finite input space $X = \{x_1, \dots, x_n\}$
symmetric function $K(x, z)$ on $X$
Gram Matrix $K = (K(x_i, x_j))_{i,j=1}^{n}$

since $K$ is symmetric, there exists an orthogonal matrix $V$ s.t. $K = V \Lambda V'$

$\Lambda$ diagonal, containing the eigenvalues $\lambda_t$ of $K$
eigenvectors $v_t = (v_{ti})_{i=1}^{n}$ as columns of $V$

assume all eigenvalues are non-negative and let the feature mapping be

$\Phi : x_i \mapsto \big( \sqrt{\lambda_t}\, v_{ti} \big)_{t=1}^{n} \in \mathbb{R}^n, \quad i = 1, \dots, n$

then

$\langle \Phi(x_i) \cdot \Phi(x_j) \rangle = \sum_{t=1}^{n} \lambda_t v_{ti} v_{tj} = (V \Lambda V')_{ij} = K_{ij} = K(x_i, x_j)$
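The construction can be replayed numerically: eigendecompose a Gram matrix and rebuild it from the induced feature vectors. A sketch; the polynomial kernel, data, and names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(4, 3))
K = (X @ X.T + 1.0) ** 2                  # Gram matrix of a degree-2 polynomial kernel

lam, V = np.linalg.eigh(K)                # eigenvalues lambda_t, eigenvectors as columns of V
Phi = V * np.sqrt(np.clip(lam, 0, None))  # row i is Phi(x_i) = (sqrt(lambda_t) * v_ti)_t

assert np.allclose(Phi @ Phi.T, K)        # <Phi(x_i) . Phi(x_j)> reproduces K_ij
```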


Mercer's Theorem Cont.

every Gram Matrix is symmetric and positive semi-definite

every symmetric positive semi-definite matrix can be regarded as a Kernel Matrix, i.e. as an inner product matrix in some space

a diagonal matrix satisfies Mercer's criteria, but is not a good Gram Matrix:
self-similarity dominates between-sample similarity
it represents mutually orthogonal samples

generalization exists for infinite input spaces

eigenvectors of the data in feature space can be used to detect directions of maximum variance: kernel principal component analysis
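A minimal sketch of kernel PCA along these lines: double-center the Gram matrix, eigendecompose, and keep the leading components. The function name and details are assumptions, not the slides' algorithm.

```python
import numpy as np

def kernel_pca(K, n_components=2):
    """Project samples onto the top principal directions in feature space, given Gram matrix K."""
    n = K.shape[0]
    one_n = np.full((n, n), 1.0 / n)
    Kc = K - one_n @ K - K @ one_n + one_n @ K @ one_n   # center the data in feature space
    lam, V = np.linalg.eigh(Kc)                          # eigenvalues in ascending order
    idx = np.argsort(lam)[::-1][:n_components]           # indices of the largest eigenvalues
    return V[:, idx] * np.sqrt(np.clip(lam[idx], 0.0, None))
```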


Summary

a kernel calculates the dot product of mapped data points without the explicit mapping function
represented by a symmetric, positive semi-definite Gram Matrix
fuses information about data and kernel
standard kernels chosen in practice via cross-validation
every similarity matrix satisfying Mercer's criteria can be used as a kernel
ongoing research on estimating the Kernel Matrix from available data
