Support Vector Machines: Theory and Applications
INTRODUCTION
Support Vector Machines (SVM) have recently been developed in the framework of statistical learning theory
(Vapnik, 1998; Cortes and Vapnik, 1995), and have been successfully applied to a number of problems, ranging
from time series prediction (Fernandez, 1999), to face recognition (Tefas et al., 1999), to biological data processing
for medical diagnosis (Veropoulos et al., 1999). Their theoretical foundations and their experimental success
encourage further research on their characteristics, as well as on their further use.
In this report we present a brief introduction to the theory and implementation of SVM, and we discuss the five
papers presented during the workshop. The report is organized as follows: section 2 presents the theoretical
foundations of SVM. A brief overview of statistical learning theory is given, drawing also on the discussion in
(Vayatis and Azencott, 1999). The mathematical formulation of SVM is then presented, and the theory behind the
implementation of SVM, as in (Trafalis, 1999), is briefly discussed. Section 3 summarizes the experimental work of
(Veropoulos et al., 1999), (Fernandez, 1999) and (Tefas et al., 1999) and the variations of the standard SVM
proposed in these papers. Finally, section 4 presents some conclusions and suggestions for future research.
with SVM classification. For more information we refer the reader to the literature.
For regression the loss function used is the so-called $\varepsilon$-insensitive loss function $|y - f(x)|_\varepsilon$, which is equal to
$|y - f(x)| - \varepsilon$ if $|y - f(x)| > \varepsilon$, and to 0 otherwise.
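For concreteness, the two loss functions used in the problems below can be written as short functions. The following is a minimal sketch in Python; the function names are ours, for illustration only:

```python
import numpy as np

def hinge_loss(y, f_x):
    """Soft-margin (hinge) loss |1 - y * f(x)|_+ used for SVM classification (labels y in {-1, +1})."""
    return np.maximum(0.0, 1.0 - y * f_x)

def eps_insensitive_loss(y, f_x, eps):
    """epsilon-insensitive loss |y - f(x)|_eps used for SVM regression."""
    return np.maximum(0.0, np.abs(y - f_x) - eps)
```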
To summarize, following the SLT ideas outlined above, for the given choices of the loss function and the
hypothesis spaces, SVM are learning machines that minimize the empirical error while taking into account the
"complexity" of the hypothesis space used, by also minimizing the RKHS norm of the solution $\|f\|_K^2$. In practice
SVM minimize a trade-off between empirical error and complexity of the hypothesis space. Formally this is done by
solving the following minimization problems:

SVM classification:

$$\min_{f} \ \|f\|_K^2 + C \sum_{i=1}^{l} |1 - y_i f(x_i)|_+ \qquad (1)$$
SVM regression:

$$\min_{f} \ \|f\|_K^2 + C \sum_{i=1}^{l} |y_i - f(x_i)|_\varepsilon \qquad (2)$$
where C is a so-called "regularization parameter" that controls the trade-off between empirical error and the
complexity of the hypothesis space used.
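To make the trade-off in (1) concrete: assuming, as is standard for SVM, that the solution is a kernel expansion $f(x) = \sum_j c_j K(x, x_j)$, so that $\|f\|_K^2 = c^\top K c$, the classification objective can be evaluated as in the sketch below (a minimal illustration under that assumption; variable names are ours). The minimization over f then reduces to a minimization over the finite-dimensional coefficient vector c.

```python
import numpy as np

def svm_classification_objective(c, K, y, C):
    """Value of objective (1) for f(x) = sum_j c_j K(x, x_j):
    f(x_i) = (K c)_i on the training points and ||f||_K^2 = c' K c."""
    f_x = K @ c                      # f evaluated at the l training points
    rkhs_norm_sq = c @ K @ c         # squared RKHS norm of the solution
    hinge = np.maximum(0.0, 1.0 - y * f_x)
    return rkhs_norm_sq + C * hinge.sum()
```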
Having discussed how SVM stem from the theory outlined above, we now turn to their actual implementation.
The next section briefly discusses how the minimization problems (1) and (2) can be solved, taking also into account
(Trafalis, 1999).
SVM IMPLEMENTATION
As mentioned above, training an SVM means solving problems (1) or (2). It turns out that both problems can be
rewritten as constrained quadratic programming (QP) problems. We present the QP formulation for SVM
classification, and regarding regression we refer the reader to the literature (Vapnik, 1998).
Problem (1) can be rewritten as follows:

SV classification:

$$\min_{f,\,\xi_i} \ \|f\|_K^2 + C \sum_{i=1}^{l} \xi_i \qquad (3)$$

subject to: $y_i f(x_i) \geq 1 - \xi_i$ and $\xi_i \geq 0$, for $i = 1, \ldots, l$.

This constrained QP problem is typically solved through its dual, in which one maximizes over the Lagrange multipliers $\alpha_i$:

$$\max_{\alpha} \ \sum_{i=1}^{l} \alpha_i - \frac{1}{2} \sum_{i,j=1}^{l} \alpha_i \alpha_j \, y_i y_j \, K(x_i, x_j) \qquad (4)$$

subject to: $0 \leq \alpha_i \leq C$ for $i = 1, \ldots, l$, and $\sum_{i=1}^{l} \alpha_i y_i = 0$.

The solution then has the form $f(x) = \sum_{i=1}^{l} \alpha_i y_i K(x, x_i) + b$, where only the training points with $\alpha_i > 0$ (the support vectors) contribute.
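As an illustration of what "training an SVM" amounts to computationally, the following is a minimal sketch that solves the dual QP (4) with a generic off-the-shelf solver (here the cvxopt package); this is our illustration, not the implementation used in any of the workshop papers:

```python
import numpy as np
from cvxopt import matrix, solvers

def train_svm_dual(K, y, C):
    """Solve the dual QP (4): maximize sum_i alpha_i - 1/2 alpha' (yy' * K) alpha
    subject to 0 <= alpha_i <= C and sum_i alpha_i y_i = 0.
    K is the l x l kernel matrix, y the labels in {-1, +1}."""
    l = len(y)
    P = matrix(np.outer(y, y) * K)                   # quadratic term of the (minimization form) QP
    q = matrix(-np.ones(l))                          # linear term; sign flipped to turn max into min
    G = matrix(np.vstack([-np.eye(l), np.eye(l)]))   # encodes the box constraints 0 <= alpha_i <= C
    h = matrix(np.hstack([np.zeros(l), C * np.ones(l)]))
    A = matrix(y.reshape(1, -1).astype(float))       # equality constraint sum_i alpha_i y_i = 0
    b = matrix(0.0)
    sol = solvers.qp(P, q, G, h, A, b)
    return np.ravel(sol['x'])                        # the optimal multipliers alpha_i
```

In practice, decomposition methods such as SMO (Platt, 1998) or the primal-dual interior point methods of (Trafalis, 1999) are preferred for large data sets, since forming and solving the dense l-by-l QP directly becomes expensive.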
Typically in the literature SVM are trained by solving the dual optimization problem (4) ((Osuna et al., 1997),
(Vapnik, 1998), (Burges, 1998)). Trafalis (Trafalis, 1999) instead proposes primal-dual interior point methods (IPM)
for SVM training, which differ from the ones typically used.
In (Trafalis, 1999) the IPM discussed are also used to train learning machines other than SVM. In particular,
(Trafalis, 1999) shows how the proposed primal-dual IPM can be used to train Artificial Neural Networks (ANN),
which are typically trained using backpropagation. One of the main differences between ANN and SVM is that, as
mentioned in (Trafalis, 1999), while for ANN there can be many locally optimal solutions, for SVM there is only one
optimal solution for problem (3), since SVM are trained by solving a QP problem which has one global optimal
solution. This is one practical "advantage" of SVM when compared with ANN.
To deal with unbalanced training data, or to bias the classifier towards one of the two classes, (Veropoulos et al.,
1999) use a different regularization parameter for each class, replacing problem (3) with:

$$\min_{f,\,\xi_i} \ \|f\|_K^2 + C_1 \sum_{i \in \text{class 1}} \xi_i + C_2 \sum_{i \in \text{class 2}} \xi_i \qquad (5)$$

subject to the same constraints as in (3).
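In the dual, (5) only changes the upper bound of the box constraint on each $\alpha_i$ to the regularization parameter of its class. A minimal sketch, modifying the generic QP setup above accordingly (again our illustration, not the code of (Veropoulos et al., 1999)):

```python
import numpy as np
from cvxopt import matrix, solvers

def train_weighted_svm_dual(K, y, C1, C2):
    """Dual of problem (5): identical to (4) except that the box constraint becomes
    0 <= alpha_i <= C1 for points of class 1 (here taken as y_i = +1) and
    0 <= alpha_i <= C2 for points of class 2 (y_i = -1)."""
    l = len(y)
    C_per_sample = np.where(y == 1, C1, C2).astype(float)
    P = matrix(np.outer(y, y) * K)
    q = matrix(-np.ones(l))
    G = matrix(np.vstack([-np.eye(l), np.eye(l)]))
    h = matrix(np.hstack([np.zeros(l), C_per_sample]))   # per-class upper bounds on alpha_i
    A = matrix(y.reshape(1, -1).astype(float))
    b = matrix(0.0)
    sol = solvers.qp(P, q, G, h, A, b)
    return np.ravel(sol['x'])
```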
CONCLUSIONS
The report presented an overview of the theory of SVM in parallel with a summary of the papers presented in the
ACAI 99 workshop on "Support Vector Machines: theory and applications". Some of the important conclusions of
this report as well as of the workshop are summarized below:
(i) SVM are motivated through statistical learning theory. The theory characterizes the performance of learning
machines using bounds on their ability to predict future data. One of the papers in the workshop (Vayatis and
Azencott, 1999) presented new bounds on the performance of learning machines, and suggested a method to use
them experimentally in order to better understand the learning machines (including SVM).
(ii) SVM are trained by solving a constrained quadratic optimization problem. Among others, this implies that
there is a unique optimal solution for each choice of the SVM parameters. This is unlike other learning machines,
such as standard Neural Networks trained using backpropagation.
(iii) Primal-dual interior point optimization methods may be used to efficiently train SVM on large data sets, as
described in (Trafalis, 1999).
(iv) Training many local SVMs instead of a single global one can lead to significant improvement in the
performance of a learning machine, as shown in (Fernandez, 1999).
(v) SVM have been successfully used for medical diagnosis (Veropoulos et al., 1999). Methods for dealing with
unbalanced training data, or for biasing the performance of an SVM towards one of the classes during classification
were suggested and used in (Veropoulos et al., 1999).
(vi) An SVM using a kernel motivated by the Fisher Linear Discriminant was shown to outperform the standard
linear SVM for a face recognition task in (Tefas et al., 1999).
The ideas presented in the papers and discussed in the workshop suggest a number of future research directions:
from tuning the basic statistical learning theory results, to developing efficient training methods for SVM, to
designing variations of the standard SVM for practical usage. Some of the main issues regarding the design and use
of SVMs are, among others, the choice of the kernel of the SVM (as (Tefas et al., 1999) showed), and the choice of
the regularization parameter (as (Veropoulos et al., 1999) discussed). On the other hand, significant improvements in
the performance of SVM may be achieved if ensembles of SVMs are used (as in (Fernandez, 1999)).
Acknowledgements: The authors would like to thank the organizers of ACAI 99, and the co-organizers of the
workshop: Constantine Papageorgiou, Tomaso Poggio, and Ioannis Pitas.
REFERENCES
Bartlett P. and Shawe-Taylor J., “Generalization performance of support vector machines and other pattern
classifiers”, in C. Burges and B. Scholkopf (eds.), “Advances in Kernel Methods -- Support Vector Learning”, MIT
Press, 1998.
Bottou L. and Vapnik V., “Local learning algorithms”, Neural Computation, 4(6): 888--900, November 1992.
Burges C., “A tutorial on support vector machines for pattern recognition”, Data Mining and Knowledge
Discovery, Vol. 2, Kluwer Academic Publishers, Boston, 1998.
Cortes C. and Vapnik V., “Support vector networks”, Machine Learning, 20:1--25, 1995.
Duda R. and Hart P., "Pattern Classification and Scene Analysis", Wiley, New York 1973.
Evgeniou T., Pontil M., and Poggio T., “A unified framework for regularization networks and support vector
machines” A.I. Memo No. 1654, Artificial Intelligence Laboratory, MIT, 1999.
Fernandez R., “Predicting time series with a local support vector regression machine", ACAI99.
Osuna E., Freund R., and Girosi F., “Support Vector Machines: Training and Applications”, A.I. Memo No. 1602,
Artificial Intelligence Laboratory, MIT, 1997.
Platt J., “Fast training of Support Vector Machines using sequential minimal optimization”, in C. Burges and
B. Scholkopf (eds.), “Advances in Kernel Methods -- Support Vector Learning”, MIT Press, 1998.
Pontil M., Mukherjee S., and Girosi F., “On the noise model of Support Vector Machine regression” A.I. Memo,
MIT Artificial Intelligence Laboratory, 1998.
Tefas A., Kotropoulos C., and Pitas I., "Enhancing the performance of elastic graph matching for face
authentications by using Support Vector Machines", ACAI99.
Trafalis T., "Primal-dual optimization methods in neural networks and support vector machines training", ACAI99.
Vapnik V., “Statistical Learning Theory”, Wiley, New York, 1998.
Vapnik V. and Chervonenkis A., “On the uniform convergence of relative frequencies of events to their
probabilities”, Theory of Probability and its Applications, 17(2): 264--280, 1971.
Vayatis N. and Azencott R., "How to estimate the Vapnik-Chervonenkis Dimension of Support Vector Machines
through simulations", ACAI99.
Veropoulos K., Cristianini N., and Campbell C., "The Application of Support Vector Machines to Medical Decision
Support: A Case Study", ACAI99.
Wahba G., “Spline Models for Observational Data”, Series in Applied Mathematics, Vol. 59, SIAM, 1990.