An Introduction To Particle Swarm Optimization
Matthew Settles
Department of Computer Science, University of Idaho,
Moscow, Idaho, U.S.A. 83844
November 7, 2005
1 Introduction
When the search space is too large to search exhaustively, population based searches may be a good alternative; however, population based search techniques cannot guarantee the optimal (best) solution. I will discuss one such population based search technique, Particle Swarm Optimization (PSO). The PSO algorithm shares many characteristics with the Genetic Algorithm, but the manner in which the two algorithms traverse the search space is fundamentally different.
Both Genetic Algorithms and Particle Swarm Optimizers share common elements:
1. Both initialize a population in a similar manner.
2. Both use an evaluation function to determine how fit (good) a potential solution is.
3. Both are generational, that is, both repeat the same set of processes for a predetermined amount of time.
2 Particle Swarm Optimization
Particle Swarm Optimization was first introduced by Dr. Russell C. Eberhart¹ and Dr. James Kennedy² in 1995.
As described by Eberhart and Kennedy, the PSO algorithm is an adaptive algorithm based on a social-psychological metaphor: a population of individuals (referred to as particles) adapts by returning stochastically toward previously successful regions [1].
Particle Swarm has two primary operators: velocity update and position update. During each generation each particle is accelerated toward the particle's previous best position and the global best position. At each iteration a new velocity value for each particle is calculated based on its current velocity, the distance from its previous best position, and the distance from the global best position. The new velocity value is then used to calculate the next position of the particle in the search space. This process is then iterated a set number of times, or until a minimum error is achieved.
The following fragment of the algorithm selects g, the index of the best performer in particle i's neighborhood (G is the evaluation function and pj is particle j's previous best position):

g = i                               (arbitrary initial choice)
for j = indexes of neighbors do
    if G(pj) > G(pg) then
        g = j                       (g is the index of the best performer in the neighborhood)
    end if
end for
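The same selection can be sketched in Python. This is only an illustration, assuming a fitness function G to be maximized and a list p of previous best positions; the names are mine, not the paper's:

def neighborhood_best(i, neighbors, p, G):
    # Return the index of the best performer among particle i's neighbors.
    g = i  # arbitrary initial choice
    for j in neighbors:
        if G(p[j]) > G(p[g]):
            g = j  # g now indexes the best performer seen so far
    return g

# Example: with G = sum and three one-dimensional previous bests,
# particle 0's best neighbor is chosen from indexes 1 and 2.
print(neighborhood_best(0, [1, 2], [[0.5], [0.9], [0.2]], sum))   # -> 1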
¹ Dr. Russell C. Eberhart is the Chair of the Department of Electrical and Computer Engineering, Professor of Electrical and Computer Engineering, and Adjunct Professor of Biomedical Engineering at the Purdue School of Engineering and Technology, Indiana University Purdue University Indianapolis (IUPUI).
² Dr. James Kennedy is a research psychologist at the Bureau of Labor Statistics in Washington, DC.
3 Definitions and Variables Used
PSO Particle Swarm Optimizer.
t the current time step; t − 1 is the previous time step.
Tmax the maximum number of time steps the swarm is allowed to search.
xid(t) the position of individual i in dimension d at time step t.
vid(t) the velocity of individual i in dimension d at time step t.
pid the best position found so far by individual i in dimension d.
pgd the best position found so far in dimension d by any member of individual i's neighborhood (or, in the global best topology, the whole population).
c1, c2 positive acceleration constants.
P(xid(t) = 1) the probability that individual i will choose 1 for the bit at the dth site on the bitstring.
ϕ1 a positive random number drawn from a uniform distribution between 0.0 and 1.0.
ϕ2 a positive random number drawn from a uniform distribution between 0.0 and 1.0.
ρid a positive random number drawn from a uniform distribution between 0.0 and 1.0 (Binary Particle Swarm).
wstart the starting inertia weight (w(0) = wstart). (Inertia Particle Swarm)
wend the ending inertia weight (w(Tmax) = wend). (Inertia Particle Swarm)
4 Binary Particle Swarm Optimizer

In the binary version of the particle swarm [2], each position component xid(t) is a bit, and the velocity vid(t) determines the probability that the bit takes the value 1.

Figure 1: The sigmoid function S(vid(t)) = 1/(1 + exp(−vid(t))), plotted over vid(t) ∈ [−10, 10].
The parameter vid(t), an individual's predisposition to make one or the other choice, will determine a probability threshold. If vid(t) is higher, the individual is more likely to choose 1, and lower values favor the 0 choice. Such a threshold needs to stay in the range [0.0, 1.0]. The sigmoidal function is a logical choice to do this, as it squashes the range of vid to [0.0, 1.0]:
s(vid) = 1 / (1 + exp(−vid))    (4.2)
Furthermore, we can limit vid so that s(vid) does not approach 0.0 or 1.0 too closely. This ensures that there is always some chance of a bit flipping. A constant parameter Vmax, set at the start of a trial to limit the range of vid, is often set at ±4.0, so that there is always at least a chance of s(−Vmax) ≈ 0.018 that a bit will change state. In this binary model, Vmax functions similarly to the mutation rate in genetic algorithms.
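Putting these pieces together, here is a minimal sketch of the binary update for a single bit, assuming the usual binary rule that the bit becomes 1 when ρid < s(vid) (this is how ρid from Section 3 is used), with Vmax = 4.0 as suggested above; the function names are illustrative:

import math
import random

V_MAX = 4.0  # suggested clamp from the text

def s(v):
    # Sigmoid squashing function, eq. (4.2).
    return 1.0 / (1.0 + math.exp(-v))

def update_bit(v_id):
    # Binary PSO position update for one bit of one particle.
    v_id = max(-V_MAX, min(V_MAX, v_id))  # keep vid within [-Vmax, Vmax]
    rho = random.random()                 # rho_id ~ U(0.0, 1.0)
    return 1 if rho < s(v_id) else 0

With vid pinned at −Vmax, s(−4.0) ≈ 0.018, so the bit still becomes 1 about 1.8% of the time; symmetrically, at +Vmax it still becomes 0 about 1.8% of the time.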
5 Standard Particle Swarm Optimizer
In real number space, the parameters of a function can be conceptualized as a point in space. Furthermore, the space in which the particles move is heterogeneous with respect to fitness; that is, some regions are better than others. A number of particles can be evaluated, and there is presumed to be some kind of preference or attraction for better regions of the search space.
The standard version of the PSO has a tendency to explode as oscillations become wider and wider, unless some method is applied for damping the velocity. The usual method for preventing explosion is simply to define a parameter Vmax and prevent the velocity from exceeding it on each dimension d for individual i. Typically Vmax is set to Xmax, the maximum initialization range of xid.
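As a sketch, this clamp is a single operation with numpy; the array names and sizes here are illustrative, not from the paper:

import numpy as np

V_MAX = 2.0                                      # assumed value; the text suggests Vmax = Xmax
v = np.random.uniform(-5.0, 5.0, size=(10, 3))   # example velocity array (10 particles, 3 dimensions)
v = np.clip(v, -V_MAX, V_MAX)                    # every vid now lies within [-Vmax, Vmax]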
Other methods have also been introduced to control the explosion of vid; the two most notable are Eberhart and Shi's PSO with inertia and Clerc's PSO with constriction.
6 Particle Swarm Optimizer with Inertia

xid(t) = f(w(t), xid(t − 1), vid(t − 1), pid, pgd)    (6.1)

vid(t) = w(t) ∗ vid(t − 1) + c1 ϕ1 (pid − xid(t − 1)) + c2 ϕ2 (pgd − xid(t − 1))
xid(t) = xid(t − 1) + vid(t)    (6.2)
Typically w(t) is reduced linearly from wstart to wend over the course of the run; a good starting point is to set wstart to 0.9 and wend to 0.4.
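The following is a self-contained sketch of this inertia version, under stated assumptions: the sphere function as a stand-in objective (minimized here, unlike the maximizing pseudocode in Section 2), the global best topology, Vmax = Xmax, and the parameter values suggested above. None of these names come from the paper:

import random

def fitness(x):
    # Stand-in objective: the sphere function, minimized at the origin.
    return sum(xi * xi for xi in x)

DIM, N, T_MAX = 3, 20, 1000     # dimensions, swarm size, time steps
X_MAX = 5.0                     # initialization range, also used as Vmax
W_START, W_END = 0.9, 0.4       # inertia schedule suggested in the text
C1 = C2 = 2.0                   # acceleration constants

x = [[random.uniform(-X_MAX, X_MAX) for _ in range(DIM)] for _ in range(N)]
v = [[0.0] * DIM for _ in range(N)]
p = [xi[:] for xi in x]                     # previous best positions
p_fit = [fitness(xi) for xi in x]           # previous best fitnesses
g = min(range(N), key=lambda i: p_fit[i])   # global best index

for t in range(T_MAX):
    w = W_START - (W_START - W_END) * t / T_MAX   # linearly decreasing w(t)
    for i in range(N):
        for d in range(DIM):
            phi1, phi2 = random.random(), random.random()
            v[i][d] = (w * v[i][d]
                       + C1 * phi1 * (p[i][d] - x[i][d])    # pull toward own best
                       + C2 * phi2 * (p[g][d] - x[i][d]))   # pull toward global best
            v[i][d] = max(-X_MAX, min(X_MAX, v[i][d]))      # clamp to Vmax
            x[i][d] += v[i][d]                              # position update, eq. (6.2)
        f = fitness(x[i])
        if f < p_fit[i]:                    # minimizing, so lower is better
            p[i], p_fit[i] = x[i][:], f
            if f < p_fit[g]:
                g = i

print("best fitness:", p_fit[g])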
7 Particle Swarm Optimizer with Constriction

xid(t) = f(χ, xid(t − 1), vid(t − 1), pid, pgd)    (7.1)

χ = 2k / |2 − ϕ − √(ϕ² − 4ϕ)|,  where ϕ = c1 + c2, ϕ > 4    (7.2)
Clerc [3] found that by modifying ϕ, the convergence characteristics of the system can be controlled. Typically k = 1 and c1 = c2 = 2.05, so that ϕ = 4.1 and thus χ ≈ 0.73.
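Equation (7.2) is easy to check numerically; a minimal sketch, with illustrative names:

import math

def chi(c1=2.05, c2=2.05, k=1.0):
    # Clerc's constriction coefficient, eq. (7.2).
    phi = c1 + c2   # must exceed 4 for the square root to be real
    return 2.0 * k / abs(2.0 - phi - math.sqrt(phi * phi - 4.0 * phi))

print(round(chi(), 2))   # -> 0.73, matching the value quoted above

In Clerc's constricted form, χ multiplies the entire velocity expression, vid(t) = χ (vid(t − 1) + c1 ϕ1 (pid − xid(t − 1)) + c2 ϕ2 (pgd − xid(t − 1))), which damps the oscillations without requiring an explicit Vmax.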
8 Neighborhood Topologies
There are 3 main neighborhood topologies used in PSO: circle, wheel, and star. The choice of neighborhood topology determines which individual to use for pgd. In the circle topology (see Figure 2), each individual is socially connected to its k nearest topological neighbors (pgd = best individual result among its k nearest neighbors; k typically equals 2).

Figure 2: Circle Topology

The wheel topology (see Figure 3) effectively isolates individuals from one another, as information has to be communicated through a focal individual, pfd (pgd = best{pfd, pid}). The star topology (see Figure 4) is better known as the global best topology; here every individual is connected to every other individual (pgd = best individual result in the population).
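As an illustration of the circle topology, the k = 2 neighborhood (one neighbor on each side of a ring) can be computed as follows and passed to the neighborhood_best sketch from Section 2; the helper name is mine:

def circle_neighbors(i, n, k=2):
    # Indexes of particle i's k nearest neighbors on a ring of n particles
    # (k = 2 means one neighbor on each side).
    half = k // 2
    return [(i + off) % n for off in range(-half, half + 1) if off != 0]

# Example: in a swarm of 5 particles, particle 0's neighbors are 4 and 1.
print(circle_neighbors(0, 5))   # -> [4, 1]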
References
[1] J. Kennedy and R. C. Eberhart. Swarm Intelligence. Morgan Kaufmann Publishers, Inc., San Francisco, CA, 2001.
[2] J. Kennedy and R. C. Eberhart. A discrete binary version of the particle swarm algorithm, 1997.
[3] M. Clerc. The swarm and the queen: Towards a deterministic and adaptive particle swarm optimization. In Congress on Evolutionary Computation (CEC99), pages 1951–1957, 1999.
Figure 3: Wheel Topology