Toolbox - Global Optimization PDF
Toolbox - Global Optimization PDF
Toolbox - Global Optimization PDF
User's Guide
R2017b
How to Contact MathWorks
Phone: 508-647-7000
Getting Started
v
Write Files for Optimization Functions
2
Compute Objective Functions . . . . . . . . . . . . . . . . . . . . . . 2-2
Objective (Fitness) Functions . . . . . . . . . . . . . . . . . . . . . 2-2
Write a Function File . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-2
Write a Vectorized Function . . . . . . . . . . . . . . . . . . . . . . 2-3
Gradients and Hessians . . . . . . . . . . . . . . . . . . . . . . . . . 2-4
vi Contents
Set Start Points for MultiStart . . . . . . . . . . . . . . . . . . . . . . . . 3-17
Four Ways to Set Start Points . . . . . . . . . . . . . . . . . . . . . . . . 3-17
Positive Integer for Start Points . . . . . . . . . . . . . . . . . . . . . . 3-17
RandomStartPointSet Object for Start Points . . . . . . . . . . . . 3-18
CustomStartPointSet Object for Start Points . . . . . . . . . . . . 3-18
Cell Array of Objects for Start Points . . . . . . . . . . . . . . . . . . 3-19
vii
Can You Certify a Solution Is Global? . . . . . . . . . . . . . . . . . . 3-58
No Guarantees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-58
Check if a Solution Is a Local Solution with patternsearch . . 3-58
Identify a Bounded Region That Contains a Global
Solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-59
Use MultiStart with More Start Points . . . . . . . . . . . . . . . . . 3-60
viii Contents
MultiStart Without Bounds, Widely Dispersed Start Points 3-101
MultiStart with a Regular Grid of Start Points . . . . . . . . . . 3-102
MultiStart with Regular Grid and Promising Start Points . 3-103
ix
Searching and Polling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-28
Definition of Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-28
How to Use a Search Method . . . . . . . . . . . . . . . . . . . . . . . . 4-30
Search Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-30
When to Use Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-31
x Contents
Example of Vectorized Objective and Constraints . . . . . . . . . 4-84
xi
Integer ga Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-37
Resume ga . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-60
Resuming ga From the Final Population . . . . . . . . . . . . . . . . 5-60
Resuming ga From a Previous Run . . . . . . . . . . . . . . . . . . . . 5-64
xii Contents
Vary Mutation and Crossover . . . . . . . . . . . . . . . . . . . . . . . . . 5-90
Setting the Amount of Mutation . . . . . . . . . . . . . . . . . . . . . . 5-90
Setting the Crossover Fraction . . . . . . . . . . . . . . . . . . . . . . . 5-92
Comparing Results for Varying Crossover Fractions . . . . . . . 5-97
xiii
Using Simulated Annealing
7
What Is Simulated Annealing? . . . . . . . . . . . . . . . . . . . . . . . . . . 7-2
Multiobjective Optimization
8
What Is Multiobjective Optimization? . . . . . . . . . . . . . . . . . . . 8-2
Multiobjective Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8-2
Problem Formulation in gamultiobj . . . . . . . . . . . . . . . . . . . . 8-4
xiv Contents
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8-14
Parallel Processing
9
How Solvers Compute in Parallel . . . . . . . . . . . . . . . . . . . . . . . 9-2
Parallel Processing Types in Global Optimization Toolbox . . . 9-2
How Toolbox Functions Distribute Processes . . . . . . . . . . . . . 9-3
Options Reference
10
GlobalSearch and MultiStart Properties (Options) . . . . . . . 10-2
How to Set Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10-2
Properties of Both Objects . . . . . . . . . . . . . . . . . . . . . . . . . . . 10-2
GlobalSearch Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10-6
MultiStart Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10-8
xv
Vectorized and Parallel Options (User Function
Evaluation) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10-24
Options Table for Pattern Search Algorithms . . . . . . . . . . . 10-26
xvi Contents
Options Changes in R2016a . . . . . . . . . . . . . . . . . . . . . . . . . . 10-76
Use optimoptions to Set Options . . . . . . . . . . . . . . . . . . . . . 10-76
Options that optimoptions Hides . . . . . . . . . . . . . . . . . . . . . 10-76
Table of Option Names in Legacy Order . . . . . . . . . . . . . . . 10-80
Table of Option Names in Current Order . . . . . . . . . . . . . . 10-82
xvii
Getting Started
Introducing Global Optimization Toolbox Introduces the algorithms and features.
Functions
(p. 1)
Write Files for Optimization Functions Explains how to write files that compute the
(p. 1) functions you want to optimize.
19
1
Global Optimization Toolbox provides functions that search for global solutions to
problems that contain multiple maxima or minima. The toolbox includes global search,
multistart, pattern search, genetic algorithm, multiobjective genetic algorithm,
simulated annealing, and particle swarm solvers. You can use these solvers to solve
optimization problems where the objective or constraint function is continuous,
discontinuous, stochastic, does not possess derivatives, or includes simulations or black-
box functions.
You can improve solver effectiveness by setting options and customizing creation, update,
and search functions. You can use custom data types with the genetic algorithm and
simulated annealing solvers to represent problems not easily expressed with standard
data types. The hybrid function option lets you improve a solution by applying a second
solver after the first.
Key Features
• Global search and multistart solvers for finding single or multiple global optima
• Genetic algorithm for linear, nonlinear, bound, and integer constraints with
customization by defining parent selection, crossover, and mutation functions
• Multiobjective genetic algorithm with Pareto-front identification, including linear,
nonlinear, and bound constraints
• Pattern search solver for linear, nonlinear, and bound constraints with customization
by defining polling, searching, and other functions
• Simulated annealing solver for bound constraints, with options for defining annealing
process, temperature schedule, and acceptance criteria
• Particle swarm solver for bound constraints with options for defining the initial
particles and swarm behavior
1-2
Solution Quality
Solution Quality
Global Optimization Toolbox solvers repeatedly attempt to locate a global solution.
However, no solver employs an algorithm that can certify a solution as global. For
strategies to check solution quality, see “Can You Certify a Solution Is Global?” on page
3-58.
You can extend the capabilities of Global Optimization Toolbox functions by writing your
own files, by using them in combination with other toolboxes, or with the MATLAB® or
Simulink® environments.
1-3
1 Introducing Global Optimization Toolbox Functions
Function to Optimize
This example shows how to minimize Rastrigin’s function with five solvers. Each solver
has its own characteristics. The characteristics lead to different solutions and run times.
The results, examined in “Compare Syntax and Solutions” on page 1-11, can help you
choose an appropriate solver for your own problems.
Rastrigin’s function has many local minima, with a global minimum at (0,0):
Ras( x) = 20 + x12 + x22 - 10 ( cos 2p x1 + cos 2 px2 ) .
Usually you don't know the location of the global minimum of your objective function. To
show how the solvers look for a global solution, this example starts all the solvers around
the point [20,30], which is far from the global minimum.
The rastriginsfcn.m file implements Rastrigin’s function. This file comes with Global
Optimization Toolbox software. This example employs a scaled version of Rastrigin’s
function with larger basins of attraction. For information, see “Basins of Attraction” on
page 1-14.
rf2 = @(x)rastriginsfcn(x/10);
1-4
Comparison of Five Solvers
This example minimizes rf2 using the default settings of fminunc (an Optimization
Toolbox™ solver), patternsearch, and GlobalSearch. It also uses ga and
particleswarm with a nondefault setting, to obtain an initial population around the
point [20,30].
1-5
1 Introducing Global Optimization Toolbox Functions
fminunc
To solve the optimization problem using the fminunc Optimization Toolbox solver, enter:
fminunc returns
xf =
19.8991 29.8486
ff =
12.9344
flf =
1
of =
iterations: 3
funcCount: 15
stepsize: 1.7776e-06
lssteplength: 1
firstorderopt: 5.9907e-09
algorithm: 'quasi-newton'
message: 'Local minimum found.…'
1-6
Comparison of Five Solvers
patternsearch
patternsearch returns
Optimization terminated: mesh size less than options.MeshTolerance.
xp =
19.8991 -9.9496
fp =
4.9748
flp =
1
op =
function: @(x)rastriginsfcn(x/10)
problemtype: 'unconstrained'
pollmethod: 'gpspositivebasis2n'
maxconstraint: []
searchmethod: []
iterations: 48
funccount: 174
meshsize: 9.5367e-07
rngstate: [1x1 struct]
message: 'Optimization terminated: mesh size less than options.MeshTolerance.'
ga
To solve the optimization problem using the ga Global Optimization Toolbox solver,
enter:
1-7
1 Introducing Global Optimization Toolbox Functions
initpop is a 20-by-2 matrix. Each row of initpop has mean [10,30], and each
element is normally distributed with standard deviation 10. The rows of initpop form
an initial population matrix for the ga solver.
ga uses random numbers, and produces a random result. In this case ga returns:
problemtype: 'unconstrained'
rngstate: [1x1 struct]
generations: 107
funccount: 5400
message: 'Optimization terminated: average change in the fitness value less...'
maxconstraint: []
1-8
Comparison of Five Solvers
particleswarm
xpso =
1.0e-06 *
-0.8839 0.3073
fpso =
1.7373e-12
flgpso =
opso =
1-9
1 Introducing Global Optimization Toolbox Functions
GlobalSearch
Note You must specify fmincon as the solver for GlobalSearch, even for unconstrained
problems.
gs is a default GlobalSearch object. The object contains options for solving the
problem. Calling run(gs,problem) runs problem from multiple start points. The start
points are random, so the following result is also random.
All 4 local solver runs converged with a positive local solver exit flag.
xg =
1.0e-07 *
-0.1405 -0.1405
fg =
0
flg =
1
og =
funcCount: 2178
localSolverTotal: 4
localSolverSuccess: 4
1-10
Comparison of Five Solvers
localSolverIncomplete: 0
localSolverNoSolution: 0
message: 'GlobalSearch stopped because it analyzed all the trial po...'
• fminunc quickly reaches the local solution within its starting basin, but does not
explore outside this basin at all. fminunc has simple calling syntax.
• patternsearch takes more function evaluations than fminunc, and searches
through several basins, arriving at a better solution than fminunc. patternsearch
calling syntax is the same as that of fminunc.
• ga takes many more function evaluations than patternsearch. By chance it arrived
at a better solution. In this case, ga found a point near the global optimum. ga is
stochastic, so its results change with every run. ga has simple calling syntax, but
there are extra steps to have an initial population near [20,30].
• particleswarm takes fewer function evaluations than ga, but more than
patternsearch. In this case, particleswarm found the global optimum.
particleswarm is stochastic, so its results change with every run. particleswarm
has simple calling syntax, but there are extra steps to have an initial population near
[20,30].
• GlobalSearch run takes the same order of magnitude of function evaluations as ga
and particleswarm, searches many basins, and arrives at a good solution. In this
1-11
1 Introducing Global Optimization Toolbox Functions
1-12
What Is Global Optimization?
• A local minimum of a function is a point where the function value is smaller than or
equal to the value at nearby points, but possibly greater than at a distant point.
• A global minimum is a point where the function value is smaller than or equal to the
value at all other feasible points.
Global minimum
Local minimum
Generally, Optimization Toolbox solvers find a local optimum. (This local optimum can be
a global optimum.) They find the optimum in the basin of attraction of the starting point.
For more information, see “Basins of Attraction” on page 1-14.
In contrast, Global Optimization Toolbox solvers are designed to search through more
than one basin of attraction. They search in various ways:
• GlobalSearch and MultiStart generate a number of starting points. They then use
a local solver to find the optima in the basins of attraction of the starting points.
• ga uses a set of starting points (called the population) and iteratively generates better
points from the population. As long as the initial population covers several basins, ga
can examine several basins.
• simulannealbnd performs a random search. Generally, simulannealbnd accepts a
point if it is better than the previous point. simulannealbnd occasionally accepts a
worse point, in order to reach a different basin.
1-13
1 Introducing Global Optimization Toolbox Functions
Basins of Attraction
If an objective function f(x) is smooth, the vector –∇f(x) points in the direction where f(x)
decreases most quickly. The equation of steepest descent, namely
d
x(t) = -— f ( x(t)),
dt
yields a path x(t) that goes to a local minimum as t gets large. Generally, initial values
x(0) that are close to each other give steepest descent paths that tend to the same
minimum point. The basin of attraction for steepest descent is the set of initial values
leading to the same local minimum.
The following figure shows two one-dimensional minima. The figure shows different
basins of attraction with different line styles, and it shows directions of steepest descent
with arrows. For this and subsequent figures, black dots represent local minima. Every
steepest descent path, starting at a point x(0), goes to the black dot in the basin
containing x(0).
f(x)
The following figure shows how steepest descent paths can be more complicated in more
dimensions.
1-14
What Is Global Optimization?
The following figure shows even more complicated paths and basins of attraction.
1-15
1 Introducing Global Optimization Toolbox Functions
Constraints can break up one basin of attraction into several pieces. For example,
consider minimizing y subject to:
• y ≥ |x|
• y ≥ 5 – 4(x–2)2.
The figure shows the two basins of attraction with the final points.
The steepest descent paths are straight lines down to the constraint boundaries. From
the constraint boundaries, the steepest descent paths travel down along the boundaries.
The final point is either (0,0) or (11/4,11/4), depending on whether the initial x-value is
above or below 2.
1-16
Optimization Workflow
Optimization Workflow
To solve an optimization problem:
1 Decide what type of problem you have, and whether you want a local or global
solution (see “Local vs. Global Optima” on page 1-13). Choose a solver per the
recommendations in “Table for Choosing a Solver” on page 1-18.
2 Write your objective function and, if applicable, constraint functions per the syntax
in “Compute Objective Functions” on page 2-2 and “Write Constraints” on page 2-
7.
3 Set appropriate options using optimoptions, or prepare a GlobalSearch or
MultiStart problem as described in “Workflow for GlobalSearch and MultiStart”
on page 3-3. For details, see “Pattern Search Options” on page 10-9, “Particle
Swarm Options” on page 10-59, “Genetic Algorithm Options” on page 10-29, or
“Simulated Annealing Options” on page 10-67.
4 Run the solver.
5 Examine the result. For information on the result, see “Solver Outputs and Iterative
Display” (Optimization Toolbox) or Examine Results for GlobalSearch or
MultiStart.
6 If the result is unsatisfactory, change options or start points or otherwise update
your optimization and rerun it. For information, see “Global Optimization Toolbox
Solver Characteristics” on page 1-20 or Improve Results. For information on
improving solutions that applies mainly to smooth problems, see “When the Solver
Fails” (Optimization Toolbox), “When the Solver Might Have Succeeded”
(Optimization Toolbox), or “When the Solver Succeeds” (Optimization Toolbox).
See Also
More About
• “Optimization Problem Setup”
• “What Is Global Optimization?” on page 1-13
1-17
1 Introducing Global Optimization Toolbox Functions
To start patternsearch at multiple points when you have finite bounds lb and ub on
every component, try
x0 = lb + rand(size(lb)).*(ub - lb);
There are many other solvers with different solution algorithms, including the genetic
algorithm solver ga and the particleswarm solver. Try some of them if the
recommended solvers do not perform well on your problem. For details, see “Global
Optimization Toolbox Solver Characteristics” on page 1-20.
See Also
Related Examples
• “Solver Behavior with a Nonsmooth Problem”
More About
• “Optimization Workflow” on page 1-17
1-18
See Also
1-19
1 Introducing Global Optimization Toolbox Functions
Solver Choices
To attempt to get better or faster solutions, first try tuning the recommended solvers on
page 1-18 by setting appropriate options or bounds. If the results are unsatisfactory, try
other solvers:
Desired Solution Smooth Objective and Constraints Nonsmooth Objective or Constraints
“Explanation of “Desired “Choosing Between Solvers for “Choosing Between Solvers for
Solution”” on page 1-21 Smooth Problems” on page 1-23 Nonsmooth Problems” on page 1-
24
Single local solution Optimization Toolbox functions; fminbnd, patternsearch,
see “Optimization Decision Table” fminsearch, ga,
(Optimization Toolbox) particleswarm,
simulannealbnd
Multiple local solutions GlobalSearch, MultiStart patternsearch, ga,
particleswarm, or
simulannealbnd started from
multiple initial points x0 or from
multiple initial populations
Single global solution GlobalSearch, MultiStart, patternsearch, ga,
patternsearch, particleswarm,
particleswarm, ga, simulannealbnd
simulannealbnd
Single local solution using MultiStart, Optimization patternsearch, ga,
parallel processing Toolbox functions particleswarm
1-20
Global Optimization Toolbox Solver Characteristics
f(x)=100x2(1–x)2–x,
1-21
1 Introducing Global Optimization Toolbox Functions
x2 = fminbnd(fun,0.9,1.1)
x2 =
1.0049
1-22
Global Optimization Toolbox Solver Characteristics
Term Meaning
Single local solution Find one local solution, a point x where the objective
function f(x) is a local minimum. For more details, see
“Local vs. Global Optima” on page 1-13. In the example,
both x1 and x2 are local solutions.
Multiple local solutions Find a set of local solutions. In the example, the complete
set of local solutions is {x1,x2}.
Single global solution Find the point x where the objective function f(x) is a
global minimum. In the example, the global solution is x2.
1 Try GlobalSearch first. It is most focused on finding a global solution, and has an
efficient local solver, fmincon.
2 Try MultiStart second. It has efficient local solvers, and can search a wide variety
of start points.
3 Try patternsearch third. It is less efficient, since it does not use gradients.
However, patternsearch is robust and is more efficient than the remaining local
solvers.
4 Try particleswarm fourth, if your problem is unconstrained or has only bound
constraints. Usually, particleswarm is more efficient than the remaining solvers,
and can be more efficient than patternsearch.
5 Try ga fifth. It can handle all types of constraints, and is usually more efficient than
simulannealbnd.
6 Try simulannealbnd last. It can handle problems with no constraints or bound
constraints. simulannealbnd is usually the least efficient solver. However, given a
slow enough cooling schedule, it can find a global solution.
1-23
1 Introducing Global Optimization Toolbox Functions
GlobalSearch and MultiStart both provide multiple local solutions. For the syntax to
obtain multiple solutions, see “Multiple Solutions” on page 3-27. GlobalSearch and
MultiStart differ in the following characteristics:
• MultiStart can find more local minima. This is because GlobalSearch rejects
many generated start points (initial points for local solution). Essentially,
GlobalSearch accepts a start point only when it determines that the point has a
good chance of obtaining a global minimum. In contrast, MultiStart passes all
generated start points to a local solver. For more information, see “GlobalSearch
Algorithm” on page 3-50.
• MultiStart offers a choice of local solver: fmincon, fminunc, lsqcurvefit, or
lsqnonlin. The GlobalSearch solver uses only fmincon as its local solver.
• GlobalSearch uses a scatter-search algorithm for generating start points. In
contrast, MultiStart generates points uniformly at random within bounds, or allows
you to provide your own points.
• MultiStart can run in parallel. See “How to Use Parallel Processing” on page 9-12.
1-24
Global Optimization Toolbox Solver Characteristics
6 Try simulannealbnd last for unbounded problems, or for problems with bounds.
simulannealbnd provably converges only for a logarithmic cooling schedule, which
is extremely slow. simulannealbnd takes only bound constraints, and is often less
efficient than ga.
Solver Characteristics
Solver Convergence Characteristics
GlobalSearch Fast convergence to local optima Deterministic iterates
for smooth problems. Gradient-based
Automatic stochastic start points
Removes many start points
heuristically
MultiStart Fast convergence to local optima Deterministic iterates
for smooth problems. Can run in parallel; see “How to Use
Parallel Processing” on page 9-12
Gradient-based
Stochastic or deterministic start
points, or combination of both
Automatic stochastic start points
Runs all start points
Choice of local solver: fmincon,
fminunc, lsqcurvefit, or
lsqnonlin
patternsearch Proven convergence to local Deterministic iterates
optimum, slower than gradient- Can run in parallel; see “How to Use
based solvers. Parallel Processing” on page 9-12
No gradients
User-supplied start point
particleswarm No convergence proof. Stochastic iterates
Can run in parallel; see “How to Use
Parallel Processing” on page 9-12
Population-based
1-25
1 Introducing Global Optimization Toolbox Functions
• Convergence — Solvers can fail to converge to any solution when started far from a
local minimum. When started near a local minimum, gradient-based solvers converge
to a local minimum quickly for smooth problems. patternsearch provably converges
for a wide range of problems, but the convergence is slower than gradient-based
solvers. Both ga and simulannealbnd can fail to converge in a reasonable amount of
time for some problems, although they are often effective.
• Iterates — Solvers iterate to find solutions. The steps in the iteration are iterates.
Some solvers have deterministic iterates. Others use random numbers and have
stochastic iterates.
1-26
Global Optimization Toolbox Solver Characteristics
1-27
1 Introducing Global Optimization Toolbox Functions
See Also
Related Examples
• “Solver Behavior with a Nonsmooth Problem”
More About
• “Optimization Workflow” on page 1-17
• “Table for Choosing a Solver” on page 1-18
1-28
2
The file that computes this function must accept a vector x of length 2, corresponding to
the variables x1 and x2, and return a scalar equal to the value of the function at x.
1 Select New > Script (Ctrl+N) from the MATLAB File menu. A new file opens in
the editor.
2 Enter the following two lines of code:
function z = my_fun(x)
z = x(1)^2 - 2*x(1)*x(2) + 6*x(1) + 4*x(2)^2 - 3*x(2);
3 Save the file in a folder on the MATLAB path.
2-2
Compute Objective Functions
my_fun([2 3])
ans =
31
For gamultiobj, suppose you have three objectives. Your objective function returns a
three-element vector consisting of the three objective function values:
function z = my_fun(x)
z = zeros(1,3); % allocate output
z(1) = x(1)^2 - 2*x(1)*x(2) + 6*x(1) + 4*x(2)^2 - 3*x(2);
z(2) = x(1)*x(2) + cos(3*x(2)/(2+x(1)));
z(3) = tanh(x(1) + x(2));
For example, to write the objective function of “Write a Function File” on page 2-2 in a
vectorized fashion,
2-3
2 Write Files for Optimization Functions
function z = my_fun(x)
z = x(:,1).^2 - 2*x(:,1).*x(:,2) + 6*x(:,1) + ...
4*x(:,2).^2 - 3*x(:,2);
options = optimoptions('patternsearch','UseCompletePoll',true,'UseVectorized',true);
[x fval] = patternsearch(@my_fun,[1 1],[],[],[],[],[],[],...
[],options);
options = optimoptions('ga','UseVectorized',true);
[x fval] = ga(@my_fun,2,[],[],[],[],[],[],[],options);
For gamultiobj,
function z = my_fun(x)
z = zeros(size(x,1),3); % allocate output
z(:,1) = x(:,1).^2 - 2*x(:,1).*x(:,2) + 6*x(:,1) + ...
4*x(:,2).^2 - 3*x(:,2);
z(:,2) = x(:,1).*x(:,2) + cos(3*x(:,2)./(2+x(:,1)));
z(:,3) = tanh(x(:,1) + x(:,2));
options = optimoptions('ga','UseVectorized',true);
[x fval] = gamultiobj(@my_fun,2,[],[],[],[],[],[],options);
For more information on writing vectorized functions for patternsearch, see “Vectorize
the Objective and Constraint Functions” on page 4-81. For more information on writing
vectorized functions for ga, see “Vectorize the Fitness Function” on page 5-112.
2-4
See Also
See Also
Related Examples
• “Vectorize the Objective and Constraint Functions” on page 4-81
• “Vectorize the Fitness Function” on page 5-112
• “Maximizing vs. Minimizing” on page 2-6
2-5
2 Write Files for Optimization Functions
If you want to maximize f(x), minimize –f(x), because the point at which the minimum of
–f(x) occurs is the same as the point at which the maximum of f(x) occurs.
See Also
Related Examples
• “Compute Objective Functions” on page 2-2
2-6
Write Constraints
Write Constraints
In this section...
“Consult Optimization Toolbox Documentation” on page 2-7
“Set Bounds” on page 2-7
“Ensure ga Options Maintain Feasibility” on page 2-8
“Gradients and Hessians” on page 2-8
“Vectorized Constraints” on page 2-8
Set Bounds
It is more important to set bounds for global solvers than for local solvers. Global solvers
use bounds in a variety of ways:
• GlobalSearch requires bounds for its scatter-search point generation. If you do not
provide bounds, GlobalSearch bounds each component below by -9999 and above
by 10001. However, these bounds can easily be inappropriate.
• If you do not provide bounds and do not provide custom start points, MultiStart
bounds each component below by -1000 and above by 1000. However, these bounds
can easily be inappropriate.
• ga uses bounds and linear constraints for its initial population generation. For
unbounded problems, ga uses a default of 0 as the lower bound and 1 as the upper
bound for each dimension for initial point generation. For bounded problems, and
problems with linear constraints, ga uses the bounds and constraints to make the
initial population.
2-7
2 Write Files for Optimization Functions
• simulannealbnd and patternsearch do not require bounds, although they can use
bounds.
However, you can set options that cause this feasibility to fail. For example if you set
MutationFcn to @mutationgaussian or @mutationuniform, the mutation function
does not respect constraints, and your population can become infeasible. Similarly, some
crossover functions can cause infeasible populations, although the default
gacreationlinearfeasible does respect bounds and linear constraints. Also, ga can
have infeasible points when using custom mutation or crossover functions.
To ensure feasibility, use the default crossover and mutation functions for ga. Be
especially careful that any custom functions maintain feasibility with respect to bounds
and linear constraints.
ga does not enforce linear constraints when there are integer constraints. Instead, ga
incorporates linear constraint violations into the penalty function. See “Integer ga
Algorithm” on page 5-37.
Vectorized Constraints
The ga and patternsearch solvers optionally compute the nonlinear constraint
functions of a collection of vectors in one function call. This method can take less time
than computing the objective functions of the vectors serially. This method is called a
vectorized function call.
For the solver to compute in a vectorized manner, you must vectorize both your objective
(fitness) function and nonlinear constraint function. For details, see “Vectorize the
Objective and Constraint Functions” on page 4-81.
2-8
Write Constraints
The following code gives these nonlinear constraints in a vectorized fashion, assuming
that the rows of your input matrix x are your population or input vectors:
function [c ceq] = nlinconst(x)
x =
0.2191 0.7500 12.1712
fval =
-148.7480
Using ga:
options = optimoptions('ga','UseVectorized',true);
[x fval] = ga(@vfun,3,[],[],[],[],[],[],@nlinconst,options)
Optimization terminated: maximum number of generations exceeded.
x =
-1.4098 -0.1216 11.6664
2-9
2 Write Files for Optimization Functions
fval =
-138.1066
For this problem patternsearch computes the solution far more quickly and
accurately.
2-10
Set and Change Options
For example, to set the ga maximum time to 300 seconds and set iterative display:
options = optimoptions('ga','MaxTime',300,'Display','iter');
options = optimoptions(options,'MaxTime',5e3);
Ensure that you pass options in your solver call. For example,
[x,fval] = ga(@objfun,2,[],[],[],[],lb,ub,@nonlcon,options);
To see the options you can change, consult the solver function reference pages. For option
details, see the options reference sections.
You can also set and change options using the “Optimization App” (Optimization
Toolbox), although the Optimization app warns that it will be removed in a future
release.
See Also
ga | particleswarm | patternsearch | simulannealbnd
More About
• “Pattern Search Options” on page 10-9
• “Particle Swarm Options” on page 10-59
• “Genetic Algorithm Options” on page 10-29
2-11
2 Write Files for Optimization Functions
2-12
View Options
View Options
optimoptions “hides” some options, meaning it does not display their values. For
example, it hides the patternsearch MaxMeshSize option.
options = optimoptions('patternsearch','MaxMeshSize',1e2)
options =
patternsearch options:
Set properties:
No options set.
Default properties:
AccelerateMesh: 0
ConstraintTolerance: 1.0000e-06
Display: 'final'
FunctionTolerance: 1.0000e-06
InitialMeshSize: 1
MaxFunctionEvaluations: '2000*numberOfVariables'
MaxIterations: '100*numberOfVariables'
MaxTime: Inf
MeshContractionFactor: 0.5000
MeshExpansionFactor: 2
MeshTolerance: 1.0000e-06
OutputFcn: []
PlotFcn: []
PollMethod: 'GPSPositiveBasis2N'
PollOrderAlgorithm: 'consecutive'
ScaleMesh: 1
SearchFcn: []
StepTolerance: 1.0000e-06
UseCompletePoll: 0
UseCompleteSearch: 0
UseParallel: 0
UseVectorized: 0
You can view the value of any option, including “hidden” options, by using dot notation.
For example,
options.MaxMeshSize
2-13
2 Write Files for Optimization Functions
ans =
100
• They are not useful. For example, the ga StallTest option allows you to choose a
stall test that does not work well. Therefore, this option is “hidden”.
• They are rarely used, or it is hard to know when to use them. For example, the
patternsearch and ga MaxMeshSize option is hard to choose, and so is “hidden”.
2-14
3
GlobalSearch and MultiStart work by starting a local solver, such as fmincon, from
a variety of start points. Generally the start points are random. However, for
MultiStart you can provide a set of start points. For more information, see “How
GlobalSearch and MultiStart Work” on page 3-48.
To find out how to use these solvers, see “Workflow for GlobalSearch and MultiStart” on
page 3-3.
3-2
Workflow for GlobalSearch and MultiStart
3-3
3 Using GlobalSearch and MultiStart
Resulting
Object or Problem Structure Solver Object Start Points Object
Structure
run
Results
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-4
Create Problem Structure
1 Define your objective function as a file or anonymous function. For details, see
“Compute Objective Functions” on page 2-2. If your solver is lsqcurvefit or
lsqnonlin, ensure the objective function returns a vector, not scalar.
2 If relevant, create your constraints, such as bounds and nonlinear constraint
functions. For details, see “Write Constraints” on page 2-7.
3 Create a start point. For example, to create a three-dimensional random start point
xstart:
xstart = randn(3,1);
4 (Optional) Create an options structure using optimoptions. For example,
options = optimoptions(@fmincon,'Algorithm','interior-point');
5 Enter
problem = createOptimProblem(solver,
3-5
3 Using GlobalSearch and MultiStart
• 'fmincon'
• 'fminunc'
• 'lsqcurvefit'
• 'lsqnonlin'
problem = createOptimProblem('fmincon','x0',xstart,
7 Include the function handle for your objective function in objective:
[x,fval,eflag,output] = fmincon(problem);
3-6
Create Problem Structure
This example minimizes the function from “Run the Solver” on page 3-21, subject to the
constraint x1 + 2x2 ≥ 4. The objective is
Use the interior-point algorithm of fmincon, and set the start point to [2;3].
objective: @(x)(4*x(1)^2-2.1*x(1)^4+x(1)^6/3+x(1)*x(2)-4*x(2)^2+4*x(2)^4)
x0: [2x1 double]
Aineq: [-1 -2]
bineq: -4
Aeq: []
beq: []
lb: []
ub: []
nonlcon: []
solver: 'fmincon'
options: [1x1 optim.options.Fmincon]
6 Best practice: validate the problem structure by running your solver on the structure:
[x,fval,eflag,output] = fmincon(problem);
3-7
3 Using GlobalSearch and MultiStart
1 Define your objective function as a file or anonymous function. For details, see
“Compute Objective Functions” on page 2-2. If your solver is lsqcurvefit or
lsqnonlin, ensure the objective function returns a vector, not scalar.
2 If relevant, create nonlinear constraint functions. For details, see “Nonlinear
Constraints” (Optimization Toolbox).
3 Create a start point. For example, to create a three-dimensional random start point
xstart:
xstart = randn(3,1);
4 Open the Optimization app by entering optimtool at the command line, or by
choosing the Optimization app from the Apps tab.
• fmincon (default)
• fminunc
• lsqcurvefit
• lsqnonlin
3-8
Create Problem Structure
9 Set bounds, linear constraints, or local Options. For details on constraints, see
“Write Constraints” (Optimization Toolbox).
10 Best practice: run the problem to verify the setup.
11 Choose File > Export to Workspace and select Export problem and options to
a MATLAB structure named
3-9
3 Using GlobalSearch and MultiStart
This example minimizes the function from “Run the Solver” on page 3-21, subject to the
constraint x1 + 2x2 ≥ 4. The objective is
Use the interior-point algorithm of fmincon, and set the start point to [2;3].
3-10
Create Problem Structure
3-11
3 Using GlobalSearch and MultiStart
6 Choose File > Export to Workspace and select Export problem and options to
a MATLAB structure named
See Also
Related Examples
• “Workflow for GlobalSearch and MultiStart” on page 3-3
3-12
Create Solver Object
You do not need to set any preferences. Create a GlobalSearch object named gs with
default settings as follows:
gs = GlobalSearch;
ms = MultiStart;
3-13
3 Using GlobalSearch and MultiStart
3-14
Create Solver Object
• Consider local solutions identical if they are within 0.01 of each other and the
function values are within the default FunctionTolerance tolerance.
• Spend no more than 2000 seconds on the computation.
gs = GlobalSearch('XTolerance',0.01,'MaxTime',2000);
• You consider local solutions identical if they are within 0.01 of each other and the
function values are within the default FunctionTolerance tolerance.
3-15
3 Using GlobalSearch and MultiStart
ms = MultiStart('XTolerance',0.01,'MaxTime',2000);
See Also
Related Examples
• “Workflow for GlobalSearch and MultiStart” on page 3-3
3-16
Set Start Points for MultiStart
Note You can control whether MultiStart uses all start points, or only those points
that satisfy bounds or other inequality constraints. For more information, see “Filter
Start Points (Optional)” on page 3-55.
[xmin,fmin,flag,outpt,allmins] = run(ms,problem,k);
The positive integer k specifies the number of start points MultiStart uses.
MultiStart generates random start points using the dimension of the problem and
3-17
3 Using GlobalSearch and MultiStart
bounds from the problem structure. MultiStart generates k - 1 random start points,
and also uses the x0 start point from the problem structure.
stpoints = RandomStartPointSet;
A RandomStartPointSet object generates start points with the same dimension as the
x0 point in the problem structure; see list.
1 Place the starting points in a matrix. Each row of the matrix represents one starting
point. MultiStart runs all the rows of the matrix, subject to filtering with the
3-18
Set Start Points for MultiStart
tpoints = CustomStartPointSet(ptmatrix);
For example, create a set of 40 five-dimensional points, with each component of a point
equal to 10 plus an exponentially distributed variable with mean 25:
pts = -25*log(rand(40,5)) + 10;
tpoints = CustomStartPointSet(pts);
To get the original matrix of points from a CustomStartPointSet object, use list:
tpoints.StartPointsDimension
ans =
5
tpoints.NumStartPoints
ans =
40
3-19
3 Using GlobalSearch and MultiStart
See Also
Related Examples
• “Workflow for GlobalSearch and MultiStart” on page 3-3
3-20
Run the Solver
For example, suppose you want to find several local minima of the sixmin function
This function is also called the six-hump camel back function [3]. All the local minima lie
in the region –3 ≤ x,y ≤ 3.
3-21
3 Using GlobalSearch and MultiStart
The output of the run (which varies, based on the random seed):
xming,fming,flagg,outptg,manyminsg
xming =
0.0898 -0.7127
fming =
-1.0316
flagg =
1
outptg =
funcCount: 2131
localSolverTotal: 3
localSolverSuccess: 3
localSolverIncomplete: 0
localSolverNoSolution: 0
message: 'GlobalSearch stopped because it analyzed all the trial po..
manyminsg =
1x2 GlobalOptimSolution array with properties:
X
Fval
Exitflag
3-22
Run the Solver
Output
X0
The output of the run (which varies based on the random seed):
xminm,fminm,flagm,outptm,manyminsm
xminm =
0.0898 -0.7127
fminm =
-1.0316
flagm =
1
outptm =
funcCount: 2034
localSolverTotal: 50
localSolverSuccess: 50
localSolverIncomplete: 0
localSolverNoSolution: 0
message: 'MultiStart completed the runs from all start points.…'
manyminsm =
3-23
3 Using GlobalSearch and MultiStart
X
Fval
Exitflag
Output
X0
In this case, MultiStart located all six local minima, while GlobalSearch located two.
For pictures of the MultiStart solutions, see “Visualize the Basins of Attraction” on
page 3-36.
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-24
Single Solution
Single Solution
You obtain the single best solution found during the run by calling run with the syntax
[x,fval,eflag,output] = run(...);
• x is the location of the local minimum with smallest objective function value.
• fval is the objective function value evaluated at x.
• eflag is an exit flag for the global solver. Values:
Global Solver Exit Flags
2 At least one local minimum found. Some runs of the local solver
converged (had positive exit flag).
1 At least one local minimum found. All runs of the local solver converged
(had positive exit flag).
0 No local minimum found. Local solver called at least once, and at least
one local solver exceeded the MaxIterations or
MaxFunctionEvaluations tolerances.
-1 Solver stopped by output function or plot function.
-2 No feasible local minimum found.
-5 MaxTime limit exceeded.
-8 No solution found. All runs had local solver exit flag -1 or smaller.
-10 Failures encountered in user-provided functions.
• output is a structure with details about the multiple runs of the local solver. For
more information, see “Global Output Structures” on page 3-35.
The list of outputs is for the case eflag > 0. If eflag <= 0, then x is the following:
• If some local solutions are feasible, x represents the location of the lowest objective
function value. “Feasible” means the constraint violations are smaller than
problem.options.ConstraintTolerance.
• If no solutions are feasible, x is the solution with lowest infeasibility.
• If no solutions exist, x, fval, and output are empty entries ([]).
3-25
3 Using GlobalSearch and MultiStart
See Also
Related Examples
• “Run the Solver” on page 3-21
3-26
Multiple Solutions
Multiple Solutions
In this section...
“About Multiple Solutions” on page 3-27
“Change the Definition of Distinct Solutions” on page 3-30
[x,fval,eflag,output,manymins] = run(...);
• X — a local minimum
• Fval — the value of the objective function at X
• Exitflag — the exit flag for the local solver (described in the local solver function
reference page: fmincon exitflag, fminunc exitflag, lsqcurvefit exitflag ,
or lsqnonlin exitflag
• Output — an output structure for the local solver (described in the local solver
function reference page: fmincon output, fminunc output, lsqcurvefit
output , or lsqnonlin output
• X0 — a cell array of start points that led to the solution point X
• In the MATLAB Workspace Browser. Double-click the solution object, and then
double-click the resulting display in the Variables editor.
3-27
3 Using GlobalSearch and MultiStart
3-28
Multiple Solutions
fcnvals =
-1.0316 -0.2155 0
To get a cell array of all the start points that led to the lowest function value (the first
element of manymins), enter:
smallX0 = manymins(1).X0
• Plot some field values. For example, to see the range of resulting Fval, enter:
histogram([manymins.Fval],10)
This results in a histogram of the computed function values. (The figure shows a
histogram from a different example than the previous few figures.)
3-29
3 Using GlobalSearch and MultiStart
To deal with this situation, run the solver again with different tolerances. The
XTolerance and FunctionTolerance tolerances determine how the solvers group
their outputs into the GlobalOptimSolution vector. These tolerances are properties of
the GlobalSearch or MultiStart object.
For example, suppose you want to use the active-set algorithm in fmincon to solve
the problem in “Example of Run with MultiStart” on page 3-23. Further suppose that you
want to have tolerances of 0.01 for both XTolerance and FunctionTolerance. The
run method groups local solutions whose objective function values are within
FunctionTolerance of each other, and which are also less than XTolerance apart
from each other. To obtain the solution:
someminsm
someminsm =
1x5 GlobalOptimSolution
Properties:
3-30
See Also
X
Fval
Exitflag
Output
X0
In this case, MultiStart generated five distinct solutions. Here “distinct” means that
the solutions are more than 0.01 apart in either objective function value or location.
See Also
Related Examples
• “Run the Solver” on page 3-21
• “Visualize the Basins of Attraction” on page 3-36
3-31
3 Using GlobalSearch and MultiStart
Iterative Display
In this section...
“Types of Iterative Display” on page 3-32
“Examine Types of Iterative Display” on page 3-32
Both types appear at the command line, depending on global and local options.
Obtain local solver iterative display by setting the Display option in the
problem.options structure to 'iter' or 'iter-detailed' with optimoptions. For
more information, see “Iterative Display” (Optimization Toolbox).
Obtain global solver iterative display by setting the Display property in the
GlobalSearch or MultiStart object to 'iter'.
Global solvers set the default Display option of the local solver to 'off', unless the
problem structure has a value for this option. Global solvers do not override any setting
you make for local options.
Note Setting the local solver Display option to anything other than 'off' can produce
a great deal of output. The default Display option created by
optimoptions(@solver) is 'final'.
3-32
Iterative Display
All 3 local solver runs converged with a positive local solver exit flag.
Run the same example without GlobalSearch iterative display, but with fmincon
iterative display:
gs.Display = 'final';
problem.options.Display = 'iter';
[xming,fming,flagg,outptg,manyminsg] = run(gs,problem);
First-order Norm of
Iter F-count f(x) Feasibility optimality step
0 3 4.823333e+001 0.000e+000 1.088e+002
1 7 2.020476e+000 0.000e+000 2.176e+000 2.488e+000
2 10 6.525252e-001 0.000e+000 1.937e+000 1.886e+000
3 13 -8.776121e-001 0.000e+000 9.076e-001 8.539e-001
4 16 -9.121907e-001 0.000e+000 9.076e-001 1.655e-001
5 19 -1.009367e+000 0.000e+000 7.326e-001 8.558e-002
6 22 -1.030423e+000 0.000e+000 2.172e-001 6.670e-002
7 25 -1.031578e+000 0.000e+000 4.278e-002 1.444e-002
8 28 -1.031628e+000 0.000e+000 8.777e-003 2.306e-003
9 31 -1.031628e+000 0.000e+000 8.845e-005 2.750e-004
10 34 -1.031628e+000 0.000e+000 8.744e-007 1.354e-006
3-33
3 Using GlobalSearch and MultiStart
All 4 local solver runs converged with a positive local solver exit flag.
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-34
Global Output Structures
• A global output structure. This structure contains information about the overall run
from multiple starting points. Details follow.
• Local solver output structures. The vector of GlobalOptimSolution objects contains
one such structure in each element of the vector. For a description of this structure,
see “Output Structures” (Optimization Toolbox), or the function reference pages for
the local solvers: fmincon output, fminunc output, lsqcurvefit output , or
lsqnonlin output .
Field Meaning
funcCount Total number of calls to user-supplied functions (objective or
nonlinear constraint)
localSolverTotal Number of local solver runs started
localSolverSuccess Number of local solver runs that finished with a positive exit flag
localSolverIncomplete Number of local solver runs that finished with a 0 exit flag
localSolverNoSolution Number of local solver runs that finished with a negative exit flag
message GlobalSearch or MultiStart exit message
A positive exit flag from a local solver generally indicates a successful run. A negative
exit flag indicates a failure. A 0 exit flag indicates that the solver stopped by exceeding
the iteration or function evaluation limit. For more information, see “Exit Flags and Exit
Messages” (Optimization Toolbox) or “Tolerances and Stopping Criteria” (Optimization
Toolbox).
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-35
3 Using GlobalSearch and MultiStart
Plot the MultiStart start points from the example, “Example of Run with MultiStart”
on page 3-23, color-coded with the basin where they end.
% rng(14,'twister')
% Uncomment the previous line to get the same output
ms = MultiStart;
opts = optimoptions(@fmincon,'Algorithm','interior-point');
sixmin = @(x)(4*x(1)^2 - 2.1*x(1)^4 + x(1)^6/3 ...
+ x(1)*x(2) - 4*x(2)^2 + 4*x(2)^4);
problem = createOptimProblem('fmincon','x0',[-1,2],...
'objective',sixmin,'lb',[-3,-3],'ub',[3,3],...
'options',opts);
[xminm,fminm,flagm,outptm,manyminsm] = run(ms,problem,50);
possColors = 'kbgcrm';
hold on
for i = 1:size(manyminsm,2)
3-36
Visualize the Basins of Attraction
The figure shows the centers of the basins by colored * symbols. Start points with the
same color as the * symbol converge to the center of the * symbol.
Start points do not always converge to the closest basin. For example, the red points are
closer to the cyan basin center than to the red basin center. Also, many black and blue
start points are closer to the opposite basin centers.
The magenta and red basins are shallow, as you can see in the following contour plot.
3-37
3 Using GlobalSearch and MultiStart
See Also
Related Examples
• “Multiple Solutions” on page 3-27
3-38
Output Functions for GlobalSearch and MultiStart
There are two types of output functions, like the two types of output structures on page
3-35:
• Global output functions run after each local solver run. They also run when the global
solver starts and ends.
• Local output functions run after each iteration of a local solver. See “Output
Functions” (Optimization Toolbox).
• Write output functions using the syntax described in “OutputFcn” on page 10-4.
• Set the OutputFcn property of your GlobalSearch or MultiStart solver to the
function handle of your output function. You can use multiple output functions by
setting the OutputFcn property to a cell array of function handles.
To store local results using nested functions instead of persistent variables, see “Example
of a Nested Output Function” (MATLAB).
3-39
3 Using GlobalSearch and MultiStart
1 Write the output function using the syntax described in “OutputFcn” on page 10-4.
function stop = StopAfterFive(optimValues, state)
persistent foundLocal
stop = false;
switch state
case 'init'
foundLocal = []; % initialized as empty
case 'iter'
newf = optimValues.localsolution.Fval;
eflag = optimValues.localsolution.Exitflag;
% Now check if the exit flag is positive and
% the new value differs from all others by at least 1e-4
% If so, add the new value to the newf list
if eflag > 0 && all(abs(newf - foundLocal) > 1e-4)
foundLocal = [foundLocal;newf];
% Now check if the latest value added to foundLocal
% is less than 1/2
% Also check if there are 5 local minima in foundLocal
% If so, then stop
if foundLocal(end) < 0.5 || length(foundLocal) >= 5
stop = true;
end
end
end
2 Save StopAfterFive.m as a file in a folder on your MATLAB path.
3 Write the objective function and create an optimization problem structure as in
“Find Global or Multiple Local Minima” on page 3-78.
function f = sawtoothxy(x,y)
[t r] = cart2pol(x,y); % change to polar coordinates
h = cos(2*t - 1/2)/2 + cos(t) + 2;
g = (sin(r) - sin(2*r)/2 + sin(3*r)/3 - sin(4*r)/4 + 4) ...
.*r.^2./(r+1);
f = g.*h;
end
4 Save sawtoothxy.m as a file in a folder on your MATLAB path.
5 At the command line, create the problem structure:
problem = createOptimProblem('fmincon',...
'objective',@(x)sawtoothxy(x(1),x(2)),...
'x0',[100,-50],'options',...
optimoptions(@fmincon,'Algorithm','sqp'));
3-40
Output Functions for GlobalSearch and MultiStart
gs = GlobalSearch('OutputFcn',@StopAfterFive,'Display','iter');
7 (Optional) To get the same answer as this example, set the default random number
stream.
rng default
8 Run the problem.
[x,fval] = run(gs,problem)
1 out of 2 local solver runs converged with a positive local solver exit flag.
x =
1.0e-07 *
0.0414 0.1298
fval =
1.5467e-15
The run stopped early because GlobalSearch found a point with a function value less
than 0.5.
3-41
3 Using GlobalSearch and MultiStart
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-42
Plot Functions for GlobalSearch and MultiStart
Plot functions are specialized output functions (see “Output Functions for GlobalSearch
and MultiStart” on page 3-39). There are two predefined plot functions:
Plot function windows have Pause and Stop buttons. By default, all plots appear in one
window.
• Write plot functions using the syntax described in “OutputFcn” on page 10-4.
• Set the PlotFcn property of your GlobalSearch or MultiStart object to the
function handle of your plot function. You can use multiple plot functions by setting
the PlotFcn property to a cell array of function handles.
The built-in plot functions have characteristics that can surprise you.
• @gsplotbestf can have plots that are not strictly decreasing. This is because early
values can result from local solver runs with negative exit flags (such as infeasible
solutions). A subsequent local solution with positive exit flag is better even if its
3-43
3 Using GlobalSearch and MultiStart
function value is higher. Once a local solver returns a value with a positive exit flag,
the plot is monotone decreasing.
• @gsplotfunccount might not plot the total number of function evaluations. This is
because GlobalSearch can continue to perform function evaluations after it calls the
plot function for the last time. For more information, see “GlobalSearch Algorithm” on
page 3-50Properties for GlobalSearch.
The example problem is the same as in “Find Global or Multiple Local Minima” on page
3-78, with additional bounds.
The example uses persistent variables to store previous best values. The plot function
examines the best function value after each local solver run, available in the bestfval
field of the optimValues structure. If the value is not lower than the previous best, the
plot function adds 1 to the number of consecutive calls with no improvement and draws a
bar chart. If the value is lower than the previous best, the plot function starts a new bar
in the chart with value 1. Before plotting, the plot function takes a logarithm of the
number of consecutive calls. The logarithm helps keep the plot legible, since some values
can be much larger than others.
To store local results using nested functions instead of persistent variables, see “Example
of a Nested Output Function” (MATLAB).
3-44
Plot Functions for GlobalSearch and MultiStart
stop = false;
switch state
case 'init'
% Initialize variable to record best function value.
bestfv = [];
problem = createOptimProblem('fmincon',...
'objective',@(x)sawtoothxy(x(1),x(2)),...
'x0',[100,-50],'lb',[-3e3 -4e3],...
'ub',[4e3,3e3],'options',...
optimoptions(@fmincon,'Algorithm','sqp'));
3-45
3 Using GlobalSearch and MultiStart
ms = MultiStart('PlotFcn',{@NumberToNextBest,@gsplotbestf});
6 Run the global solver for 100 local solver runs:
[x,fv] = run(ms,problem,100);
7 The plot functions produce the following figure (your results can differ, since the
solution process is stochastic):
3-46
See Also
serially. Local output and plot functions do not create a display when running on
workers. You do not see any other effects of output and plot functions until the worker
passes its results to the client (the originator of the MultiStart parallel jobs).
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-47
3 Using GlobalSearch and MultiStart
3-48
How GlobalSearch and MultiStart Work
Stage 2:
Loop through remaining trial points,
run fmincon if point satisfies
basin, score, and constraint filters
3-49
3 Using GlobalSearch and MultiStart
The differences between these solver objects boil down to the following decision on which
to use:
GlobalSearch Algorithm
For a description of the algorithm, see Ugray et al. [1].
When you run a GlobalSearch object, the algorithm performs the following steps:
GlobalSearch runs fmincon from the start point you give in the problem structure. If
this run converges, GlobalSearch records the start point and end point for an initial
estimate on the radius of a basin of attraction. Furthermore, GlobalSearch records the
3-50
How GlobalSearch and MultiStart Work
final objective function value for use in the score function (see “Obtain Stage 1 Start
Point, Run” on page 3-51).
The score function is the sum of the objective function value at a point and a multiple of
the sum of the constraint violations. So a feasible point has score equal to its objective
function value. The multiple for constraint violations is initially 1000. GlobalSearch
updates the multiple during the run.
The localSolverThreshold is initially the smaller of the two objective function values
at the solution points. The solution points are the fmincon solutions starting from x0
and from the Stage 1 start point. If both of these solution points do not exist or are
infeasible, localSolverThreshold is initially the penalty function value of the Stage 1
start point.
The GlobalSearch heuristic assumption is that basins of attraction are spherical. The
initial estimate of basins of attraction for the solution point from x0 and the solution
point from Stage 1 are spheres centered at the solution points. The radius of each sphere
is the distance from the initial point to the solution point. These estimated basins can
overlap.
There are two sets of counters associated with the algorithm. Each counter is the number
of consecutive trial points that:
3-51
3 Using GlobalSearch and MultiStart
• Lie within a basin of attraction. There is one counter for each basin.
• Have score function greater than localSolverThreshold. For a definition of the
score, see “Run fmincon from x0” on page 3-50.
GlobalSearch repeatedly examines a remaining trial point from the list, and performs
the following steps. It continually monitors the time, and stops the search if elapsed time
exceeds MaxTime seconds.
Call the trial point p. Run fmincon from p if the following conditions hold:
• p is not in any existing basin. The criterion for every basin i is:
radius is an estimated radius that updates in Update Basin Radius and Threshold
on page 3-53 and React to Large Counter Values on page 3-54.
• score(p) < localSolverThreshold.
• (optional) p satisfies bound and/or inequality constraints. This test occurs if you set
the StartPointsToRun property of the GlobalSearch object to 'bounds' or
'bounds-ineqs'.
1 Reset Counters
If fmincon runs starting from p, it can yield a positive exit flag, which indicates
convergence. In that case, GlobalSearch updates the vector of
GlobalOptimSolution objects. Call the solution point xp and the objective function
value fp. There are two cases:
3-52
How GlobalSearch and MultiStart Work
• For every other solution point xq with objective function value fq,
or
and
There is one minor tweak that can happen to this update. If the exit flag for xq is
greater than 1, and the exit flag for xp is 1, then xp replaces xq. This
replacement can lead to some points in the same basin being more than a
distance of XTolerance from xp.
3 Update Basin Radius and Threshold
When the GlobalSearch Display property is 'iter', every point that fmincon
runs creates one line in the GlobalSearch iterative display.
3-53
3 Using GlobalSearch and MultiStart
1 Update Counters
Increment the counter for every basin containing p. Reset the counter of every other
basin to 0.
For each basin with counter equal to MaxWaitCycle, multiply the basin radius by
1 – BasinRadiusFactor. Reset the counter to 0. (Both MaxWaitCycle and
BasinRadiusFactor are settable properties of the GlobalSearch object.)
Every 200th trial point creates one line in the GlobalSearch iterative display.
Create GlobalOptimSolution
After reaching MaxTime seconds or running out of trial points, GlobalSearch creates a
vector of GlobalOptimSolution objects. GlobalSearch orders the vector by objective
function value, from lowest (best) to highest (worst). This concludes the algorithm.
MultiStart Algorithm
When you run a MultiStart object, the algorithm performs the following steps:
3-54
How GlobalSearch and MultiStart Work
Validate Inputs
MultiStart checks input arguments for validity. Checks include running the local
solver once on problem inputs. Even when run in parallel, MultiStart performs these
checks serially.
[x,fval] = run(ms,problem,k)
A RandomStartPointSet object does not have any points stored inside the object.
Instead, MultiStart calls list, which generates random points within the bounds
given by the problem structure. If an unbounded component exists, list uses an
artificial bound given by the ArtificialBound property of the RandomStartPointSet
object.
The default setting of StartPointsToRun is 'all'. In this case, MultiStart does not
discard infeasible start points.
MultiStart runs the local solver specified in problem.solver, starting at the points
that pass the StartPointsToRun filter. If MultiStart is running in parallel, it sends
3-55
3 Using GlobalSearch and MultiStart
start points to worker processors one at a time, and the worker processors run the local
solver.
The local solver checks whether MaxTime seconds have elapsed at each of its iterations.
If so, it exits that iteration without reporting a solution.
When the local solver stops, MultiStart stores the results and continues to the next
step.
Report to Iterative Display
When the MultiStart Display property is 'iter', every point that the local solver
runs creates one line in the MultiStart iterative display.
MultiStart stops when it runs out of start points. It also stops when it exceeds a total
run time of MaxTime seconds.
1 Sort the local solutions by objective function value (Fval) from lowest to highest. For
the lsqnonlin and lsqcurvefit local solvers, the objective function is the norm of
the residual.
2 Loop over the local solutions j beginning with the lowest (best) Fval.
3 Find all the solutions k satisfying both:
3-56
See Also
After examining all the local solutions, MultiStart gives a summary to the iterative
display. This summary includes the number of local solver runs that converged, the
number that failed to converge, and the number that had errors.
Bibliography
[1] Ugray, Zsolt, Leon Lasdon, John C. Plummer, Fred Glover, James Kelly, and Rafael
Martí. Scatter Search and Local NLP Solvers: A Multistart Framework for Global
Optimization. INFORMS Journal on Computing, Vol. 19, No. 3, 2007, pp. 328–
340.
[2] Glover, F. “A template for scatter search and path relinking.” Artificial Evolution (J.-
K. Hao, E.Lutton, E.Ronald, M.Schoenauer, D.Snyers, eds.). Lecture Notes in
Computer Science, 1363, Springer, Berlin/Heidelberg, 1998, pp. 13–54.
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-57
3 Using GlobalSearch and MultiStart
No Guarantees
How can you tell if you have located the global minimum of your objective function? The
short answer is that you cannot; you have no guarantee that the result of a Global
Optimization Toolbox solver is a global optimum.
However, you can use the strategies in this section for investigating solutions.
problem.solver = 'patternsearch';
Also, change the start point to the solution you just found, and clear the options:
problem.x0 = x;
problem.options = [];
For example, Check Nearby Points (Optimization Toolbox) shows the following:
options = optimoptions(@fmincon,'Algorithm','active-set');
ffun = @(x)(x(1)-(x(1)-x(2))^2);
problem = createOptimProblem('fmincon', ...
'objective',ffun,'x0',[1/2 1/3], ...
'lb',[0 -1],'ub',[1 1],'options',options);
[x,fval,exitflag] = fmincon(problem)
3-58
Can You Certify a Solution Is Global?
x =
1.0e-007 *
0 0.1614
fval =
-2.6059e-016
exitflag =
1
However, checking this purported solution with patternsearch shows that there is a
better solution. Start patternsearch from the reported solution x:
% set the candidate solution x as the start point
problem.x0 = x;
problem.solver = 'patternsearch';
problem.options = [];
[xp,fvalp,exitflagp] = patternsearch(problem)
xp =
1.0000 -1.0000
fvalp =
-3.0000
exitflagp =
Therefore, if you can bound the region where a global solution can exist, you can obtain
some degree of assurance that MultiStart locates the global solution.
3-59
3 Using GlobalSearch and MultiStart
Ê x2 ˆ
f = x6 + y6 + sin( x + y) ( x2 + y2 ) - cos Á ˜ ( 2 + x4 + x2 y2 + y4 ) .
Á 1 + y2 ˜
Ë ¯
The initial summands x6 + y6 force the function to become large and positive for large
values of |x| or |y|. The components of the global minimum of the function must be
within the bounds
since 106 is much larger than all the multiples of 104 that occur in the other summands of
the function.
You can identify smaller bounds for this problem; for example, the global minimum is
between –2 and 2. It is more important to identify reasonable bounds than it is to
identify the best bounds.
For example, see “Example: Searching for a Better Solution” on page 3-65.
To avoid a warning if your objective function does not compute a gradient, change the
local options to have Algorithm set to 'quasi-newton':
problem.options.Algorithm = 'quasi-newton';
• Add an artificial constraint, retaining fmincon as the local solver:
problem.lb = -Inf(size(x0));
3-60
See Also
To search a larger region than the default, see “Refine Start Points” on page 3-62.
See Also
Related Examples
• “Refine Start Points” on page 3-62
3-61
3 Using GlobalSearch and MultiStart
In this section...
“About Refining Start Points” on page 3-62
“Methods of Generating Start Points” on page 3-63
“Example: Searching for a Better Solution” on page 3-65
3-62
Refine Start Points
Uniform Grid
1 Generate multidimensional arrays with ndgrid. Give the lower bound, spacing, and
upper bound for each component.
W = [X(:),Y(:),Z(:)];
custpts = CustomStartPointSet(W);
Call run with the CustomStartPointSet object as the third input. For example,
3-63
3 Using GlobalSearch and MultiStart
Perturbed Grid
Integer start points can yield less robust solutions than slightly perturbed start points.
1 Generate a matrix of start points as in steps 1–2 of “Uniform Grid” on page 3-63.
2 Perturb the start points by adding a random normal matrix with 0 mean and
relatively small variance.
For the example in “Uniform Grid” on page 3-63, after making the W matrix, add a
perturbation:
[X,Y,Z] = ndgrid(-2:.5:0,0:.25:2,-10:5);
W = [X(:),Y(:),Z(:)];
W = W + 0.01*randn(size(W));
3 Put the matrix into a CustomStartPointSet object. For example:
custpts = CustomStartPointSet(W);
Call run with the CustomStartPointSet object as the third input. For example,
Some components of your problem can lack upper or lower bounds. For example:
• Although no explicit bounds exist, there are levels that the components cannot attain.
For example, if one component represents the weight of a single diamond, there is an
implicit upper bound of 1 kg (the Hope Diamond is under 10 g). In such a case, give
the implicit bound as an upper bound.
• There truly is no upper bound. For example, the size of a computer file in bytes has no
effective upper bound. The largest size can be in gigabytes or terabytes today, but in
10 years, who knows?
For truly unbounded components, you can use the following methods of sampling. To
generate approximately 1/n points in each region (exp(n),exp(n+1)), use the following
formula. If u is random and uniformly distributed from 0 through 1, then r = 2u – 1 is
uniformly distributed between –1 and 1. Take
y = sgn( r) ( exp (1 / r ) - e ) .
3-64
Refine Start Points
• x(1) > 0
• x(2) < 100
• x(3) unconstrained
The following is a variant of this algorithm. Generate a number between 0 and infinity
by the method for lower bounds. Use this number as the radius of a point. Generate the
other components of the point by taking random numbers for each component and
multiply by the radius. You can normalize the random numbers, before multiplying by
the radius, so their norm is 1. For a worked example of this method, see “MultiStart
Without Bounds, Widely Dispersed Start Points” on page 3-101.
3-65
3 Using GlobalSearch and MultiStart
problem = createOptimProblem('fminunc',...
'objective',@(x)sawtoothxy(x(1),x(2)),...
'x0',[100,-50],'options',...
optimoptions(@fminunc,'Algorithm','quasi-newton'));
ms = MultiStart;
Run MultiStart on the problem for 200 start points instead of 50:
rng(14,'twister') % for reproducibility
[x,fval,eflag,output,manymins] = run(ms,problem,200)
53 out of 200 local solver runs converged with a positive local solver exit flag.
x =
1.0e-06 *
-0.2284 -0.5567
fval =
2.1382e-12
eflag =
output =
funcCount: 32670
localSolverTotal: 200
localSolverSuccess: 53
localSolverIncomplete: 147
localSolverNoSolution: 0
message: 'MultiStart completed some of the runs from the start points
3-66
Refine Start Points
manymins =
1x53 GlobalOptimSolution
Properties:
X
Fval
Exitflag
Output
X0
This time MultiStart found the global minimum, and found 51 local minima.
3-67
3 Using GlobalSearch and MultiStart
Suppose you believe that the interesting local solutions have absolute values of all
components less than 100. The default value of the bound on start points is 1000. To use
a different value of the bound, generate a RandomStartPointSet with the
ArtificialBound property set to 100:
startpts = RandomStartPointSet('ArtificialBound',100,...
'NumStartPoints',50);
[x,fval,eflag,output,manymins] = run(ms,problem,startpts)
29 out of 50 local solver runs converged with a positive local solver exit flag.
x =
1.0e-08 *
0.9725 -0.6198
fval =
1.4955e-15
eflag =
output =
funcCount: 7431
localSolverTotal: 50
localSolverSuccess: 29
localSolverIncomplete: 21
localSolverNoSolution: 0
message: 'MultiStart completed some of the runs from the start points
manymins =
3-68
Refine Start Points
1x25 GlobalOptimSolution
Properties:
X
Fval
Exitflag
Output
X0
MultiStart found the global minimum, and found 22 distinct local solutions. To see the
range of local solutions, enter histogram([manymins.Fval],10).
Compared to the minima found in “Use More Start Points” on page 3-66, this run found
better (smaller) minima, and had a higher percentage of successful runs.
3-69
3 Using GlobalSearch and MultiStart
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-70
Change Options
Change Options
In this section...
“How to Determine Which Options to Change” on page 3-71
“Changing Local Solver Options” on page 3-72
“Changing Global Options” on page 3-73
• To start your local solver at points only satisfying inequality constraints, set the
StartPointsToRun property in the global solver object to 'bounds-ineqs'. This
setting can speed your solution, since local solvers do not have to attempt to find
points satisfying these constraints. However, the setting can result in many fewer
local solver runs, since the global solver can reject many start points. For an example,
see “Optimize Using Only Feasible Start Points” on page 3-86.
3-71
3 Using GlobalSearch and MultiStart
• To use the fmincon interior-point algorithm, set the local solver Algorithm
option to 'interior-point'. For an example showing how to do this, see “Examples
of Updating Problem Options” on page 3-72.
• For your local solver to have different bounds, set the bounds in the problem
structure. Examine different regions by setting bounds.
• To see every solution that has positive local exit flag, set the XTolerance property in
the global solver object to 0. For an example showing how to do this, see “Changing
Global Options” on page 3-73.
• Update the values using dot notation and optimoptions. The syntax is
problem.options = optimoptions(problem.options,'Parameter',value,...);
problem.options = optimoptions(@solvername,'Parameter',value,...);
• Use dot notation on one local option. The syntax is
problem.options.Parameter = newvalue;
• Recreate the entire problem structure. For details, see “Create Problem Structure” on
page 3-5.
problem.options.Algorithm = 'sqp';
3 Update the problem to use the gradient in the objective function, have a
FunctionTolerance value of 1e-8, and a XTolerance value of 1e-7:
3-72
Change Options
• Use dot notation. For example, suppose you have a default MultiStart object:
ms = MultiStart
ms =
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-06
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
To change ms to have its XTolerance value equal to 1e-3, update the XTolerance
field:
ms.XTolerance = 1e-3
ms =
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-06
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-03
• Reconstruct the object starting from the current settings. For example, to set the
FunctionTolerance field in ms to 1e-3, retaining the nondefault value for
XTolerance:
ms = MultiStart(ms,'FunctionTolerance',1e-3)
ms =
3-73
3 Using GlobalSearch and MultiStart
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-03
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-03
• Convert a GlobalSearch object to a MultiStart object, or vice-versa. For example,
with the ms object from the previous example, create a GlobalSearch object with the
same values of XTolerance and FunctionTolerance:
gs = GlobalSearch(ms)
gs =
NumTrialPoints: 1000
BasinRadiusFactor: 0.2000
DistanceThresholdFactor: 0.7500
MaxWaitCycle: 20
NumStageOnePoints: 200
PenaltyThresholdFactor: 0.2000
Display: 'final'
FunctionTolerance: 1.0000e-03
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-03
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-74
Reproduce Results
Reproduce Results
In this section...
“Identical Answers with Pseudorandom Numbers” on page 3-75
“Steps to Take in Reproducing Results” on page 3-75
“Example: Reproducing a GlobalSearch or MultiStart Result” on page 3-75
“Parallel Processing and Random Number Streams” on page 3-77
Both GlobalSearch and MultiStart use the default random number stream.
stream = rng;
2 Run your GlobalSearch or MultiStart problem.
3 Restore the state of the random number stream:
rng(stream)
4 If you run your problem again, you get the same result.
3-75
3 Using GlobalSearch and MultiStart
All 3 local solver runs converged with a positive local solver exit flag.
x =
1.0e-007 *
0.0414 0.1298
fval =
1.5467e-015
You might obtain a different result when running this problem, since the random
stream was in an unknown state at the beginning of the run.
5 Restore the state of the random number stream:
rng(stream)
3-76
See Also
x =
1.0e-007 *
0.0414 0.1298
fval =
1.5467e-015
To reproduce a parallel MultiStart run, use the procedure described in “Steps to Take
in Reproducing Results” on page 3-75. For a description of how to run MultiStart in
parallel, see “How to Use Parallel Processing” on page 9-12.
See Also
Related Examples
• “Global or Multiple Starting Point Search”
3-77
3 Using GlobalSearch and MultiStart
Function to Optimize
This example illustrates how GlobalSearch finds a global minimum efficiently, and
how MultiStart finds many more local minima.
The objective function for this example has many local minima and a unique global
minimum. In polar coordinates, the function is
f(r,t) = g(r)h(t),
where
2
Ê sin(2 r) sin(3 r) sin( 4r ) ˆ r
g( r) = Á sin(r) - + - + 4˜
Ë 2 3 4 ¯ r +1
Ê 1ˆ
cos Á 2t - ˜
h(t) = 2 + cos(t) + Ë 2¯ .
2
3-78
Find Global or Multiple Local Minima
The global minimum is at r = 0, with objective function 0. The function g(r) grows
approximately linearly in r, with a repeating sawtooth shape. The function h(t) has two
local minima, one of which is global.
3-79
3 Using GlobalSearch and MultiStart
3-80
Find Global or Multiple Local Minima
problem = createOptimProblem('fmincon',...
'objective',@(x)sawtoothxy(x(1),x(2)),...
'x0',[100,-50],'options',...
optimoptions(@fmincon,'Algorithm','sqp','Display','off'));
The start point is [100,-50] instead of [0,0], so GlobalSearch does not start at
the global solution.
3 Add an artificial bound, and validate the problem structure by running fmincon:
problem.lb = -Inf([2,1]);
[x,fval] = fmincon(problem)
x =
45.3533 -107.7035
fval =
547.2636
4 Create the GlobalSearch object, and set iterative display:
gs = GlobalSearch('Display','iter');
5 Run the solver:
rng(14,'twister') % for reproducibility
[x,fval] = run(gs,problem)
3 out of 4 local solver runs converged with a positive local solver exit flag.
x =
1.0e-07 *
0.0414 0.1298
3-81
3 Using GlobalSearch and MultiStart
fval =
1.5467e-15
The solver found three local minima, and it found the global minimum near [0,0].
function f = sawtoothxy(x,y)
[t r] = cart2pol(x,y); % change to polar coordinates
h = cos(2*t - 1/2)/2 + cos(t) + 2;
g = (sin(r) - sin(2*r)/2 + sin(3*r)/3 - sin(4*r)/4 + 4) ...
.*r.^2./(r+1);
f = g.*h;
end
2 Create the problem structure. Use the fminunc solver with the Algorithm option
set to 'quasi-newton'. The reasons for these choices are:
problem = createOptimProblem('fminunc',...
'objective',@(x)sawtoothxy(x(1),x(2)),...
'x0',[100,-50],'options',...
optimoptions(@fminunc,'Algorithm','quasi-newton','Display','off'));
3 Validate the problem structure by running it:
[x,fval] = fminunc(problem)
x =
1.7533 -111.9488
fval =
3-82
Find Global or Multiple Local Minima
577.6960
4 Create a default MultiStart object:
ms = MultiStart;
5 Run the solver for 50 iterations, recording the local minima:
% rng(1) % uncomment to obtain the same result
[x,fval,eflag,output,manymins] = run(ms,problem,50)
9 out of 50 local solver runs converged with a positive local solver exit flag.
x =
-142.4608 406.8030
fval =
1.2516e+03
eflag =
output =
funcCount: 8586
localSolverTotal: 50
localSolverSuccess: 9
localSolverIncomplete: 41
localSolverNoSolution: 0
message: 'MultiStart completed some of the runs from the start po
manymins =
3-83
3 Using GlobalSearch and MultiStart
X
Fval
Exitflag
Output
X0
The solver did not find the global minimum near [0,0]. It found 10 distinct local
minima.
6 Plot the function values at the local minima:
histogram([manymins.Fval],10)
3-84
Find Global or Multiple Local Minima
MultiStart started fminunc from start points with components uniformly distributed
between –1000 and 1000. fminunc often got stuck in one of the many local minima.
fminunc exceeded its iteration limit or function evaluation limit 40 times.
3-85
3 Using GlobalSearch and MultiStart
3-86
Optimize Using Only Feasible Start Points
Write a constraint function that is positive on the set where norm(x - [4 4]) > 4
function [c ceq] = myconstraint(x)
ceq = [];
cx = x(:) - [4;4]; % ensure x is a column vector
c = cx'*cx - 16; % negative where tiltcircle(x) is real
To complete the example, create a problem structure and run the solver:
opts = optimoptions(@fmincon,'Algorithm','interior-point');
problem = createOptimProblem('fmincon',...
3-87
3 Using GlobalSearch and MultiStart
'x0',[4 4],'objective',@tiltcircle,...
'nonlcon',@myconstraint,'lb',[-10 -10],...
'ub',[10 10],'options',opts);
rng(7,'twister'); % for reproducibility
[x,fval,exitflag,output,solutionset] = run(gs,problem)
All 5 local solver runs converged with a positive local solver exit flag.
x =
1.1716 1.1716
fval =
-5.6530
exitflag =
output =
funcCount: 3254
localSolverTotal: 5
localSolverSuccess: 5
localSolverIncomplete: 0
localSolverNoSolution: 0
message: 'GlobalSearch stopped because it analyzed all the trial po..
solutionset =
X
Fval
Exitflag
3-88
Optimize Using Only Feasible Start Points
Output
X0
The tiltcircle function has just one local minimum. Yet GlobalSearch (fmincon)
stops at several points. Does this mean fmincon makes an error?
The reason that fmincon stops at several boundary points is subtle. The tiltcircle
function has an infinite gradient on the boundary, as you can see from a one-dimensional
calculation:
d -x
16 - x2 = = ±• at x = 4.
dx 16 - x2
3-89
3 Using GlobalSearch and MultiStart
So there is a huge gradient normal to the boundary. This gradient overwhelms the small
additional tilt from the linear term. As far as fmincon can tell, boundary points are
stationary points for the constrained problem.
This behavior can arise whenever you have a function that has a square root.
3-90
MultiStart Using lsqcurvefit or lsqnonlin
Many fitting problems have multiple local solutions. MultiStart can help find the
global solution, meaning the best fit. While you can use lsqnonlin as the local solver,
this example uses lsqcurvefit simply because it has a convenient syntax.
The model is
where the input data is , and the parameters , , , and are the unknown
model coefficients.
Write an anonymous function that takes a data matrix xdata with N rows and two
columns, and returns a response vector with N rows. It also takes a coefficient matrix p,
corresponding to the coefficient vector .
fitfcn = @(p,xdata)p(1) + p(2)*xdata(:,1).*sin(p(3)*xdata(:,2)+p(4));
Set bounds for lsqcurvefit. There is no reason for to exceed in absolute value,
because the sine function takes values in its full range over any interval of width .
Assume that the coefficient must be smaller than 20 in absolute value, because
allowing a very high frequency can cause unstable responses or spurious convergence.
3-91
3 Using GlobalSearch and MultiStart
lb = [-Inf,-Inf,-20,-pi];
ub = [Inf,Inf,20,pi];
[xfitted,errorfitted] = lsqcurvefit(fitfcn,p0,xdata,ydata,lb,ub)
lsqcurvefit stopped because the final change in the sum of squares relative to
its initial value is less than the default value of the function tolerance.
xfitted =
errorfitted =
28.2524
lsqcurvefit found a local solution that is not particularly close to the model parameter
values (-3,1/4,1/2,1).
problem = createOptimProblem('lsqcurvefit','x0',p0,'objective',fitfcn,...
'lb',lb,'ub',ub,'xdata',xdata,'ydata',ydata);
3-92
MultiStart Using lsqcurvefit or lsqnonlin
Solve the fitting problem using MultiStart with 50 iterations. Plot the smallest error as
the number of MultiStart iterations.
ms = MultiStart('PlotFcns',@gsplotbestf);
[xmulti,errormulti] = run(ms,problem,50)
All 50 local solver runs converged with a positive local solver exit flag.
xmulti =
errormulti =
1.6464
3-93
3 Using GlobalSearch and MultiStart
3-94
Parallel MultiStart
Parallel MultiStart
In this section...
“Steps for Parallel MultiStart” on page 3-95
“Speedup with Parallel Computing” on page 3-97
x =
8.4420 -110.2602
fval =
435.2573
4 Create a MultiStart object, and set the object to use parallel processing and
iterative display:
ms = MultiStart('UseParallel',true,'Display','iter');
3-95
3 Using GlobalSearch and MultiStart
parpool
ans =
Connected: true
NumWorkers: 4
Cluster: local
AttachedFiles: {}
IdleTimeout: 30 minute(s) (30 minutes remaining)
SpmdEnabled: true
6 Run the problem on 50 start points:
[x,fval,eflag,output,manymins] = run(ms,problem,50);
Running the local solvers in parallel.
Notice that the run indexes look random. Parallel MultiStart runs its start points
in an unpredictable order.
Notice that MultiStart confirms parallel processing in the first line of output,
which states: “Running the local solvers in parallel.”
7 When finished, shut down the parallel environment:
3-96
Parallel MultiStart
delete(gcp)
Parallel pool using the 'local' profile is shutting down.
For an example of how to obtain better solutions to this problem, see “Example:
Searching for a Better Solution” on page 3-65. You can use parallel processing along with
the techniques described in that example.
Parallel computing can be slower than serial when you use only a few start points. As the
number of start points increases, parallel computing becomes increasingly more efficient
than serial.
There are many factors that affect speedup (or slowdown) with parallel processing. For
more information, see “Improving Performance with Parallel Computing” (Optimization
Toolbox).
3-97
3 Using GlobalSearch and MultiStart
In this section...
“Difficult-To-Locate Global Minimum” on page 3-98
“Default Settings Cannot Find the Global Minimum — Add Bounds” on page 3-99
“GlobalSearch with Bounds and More Start Points” on page 3-100
“MultiStart with Bounds and Many Start Points” on page 3-101
“MultiStart Without Bounds, Widely Dispersed Start Points” on page 3-101
“MultiStart with a Regular Grid of Start Points” on page 3-102
“MultiStart with Regular Grid and Promising Start Points” on page 3-103
The function –sech(x) is nearly 0 for all |x| > 5, and –sech(0) = –1. The example is a two-
dimensional version of the sech function, with one minimum at [1,1], the other at
[1e5,-1e5]:
f has a global minimum of –21 at (1e5,–1e5), and a local minimum of –11 at (1,1).
3-98
Isolated Global Minimum
The minimum at (1e5,–1e5) shows as a narrow spike. The minimum at (1,1) does not
show since it is too narrow.
The following sections show various methods of searching for the global minimum. Some
of the methods are not successful on this problem. Nevertheless, you might find each
method useful for different problems.
3-99
3 Using GlobalSearch and MultiStart
With additional bounds of –1e6 and 1e6 in problem, GlobalSearch usually does not
find the global minimum:
x1 = [1;1];x2 = [1e5;-1e5];
f = @(x)-10*sech(norm(x(:)-x1)) -20*sech((norm(x(:)-x2))*3e-4) -1;
opts = optimoptions(@fmincon,'Algorithm','active-set');
problem = createOptimProblem('fmincon','x0',[0,0],'objective',f,...
'lb',[-1e6;-1e6],'ub',[1e6;1e6],'options',opts);
gs = GlobalSearch;
rng(14,'twister') % for reproducibility
[xfinal,fval] = run(gs,problem)
xfinal =
1.0000 1.0000
fval =
-11.0000
gs.NumTrialPoints = 1e5;
gs.MaxTime = 300;
[xg,fvalg] = run(gs,problem)
xg =
1.0e+04 *
10.0000 -10.0000
3-100
Isolated Global Minimum
fvalg =
-21.0000
ms = MultiStart(gs);
[xm,fvalm] = run(ms,problem,1e5)
xm =
1.0000 1.0000
fvalm =
-11.0000
The first five lines of code generate 10,000 widely dispersed random start points using
the method described in “Widely Dispersed Points for Unconstrained Components” on
page 3-64. newprob is a problem structure using the fminunc local solver and no
bounds:
rng(0,'twister') % for reproducibility
u = rand(1e4,1);
u = 1./u;
3-101
3 Using GlobalSearch and MultiStart
u = exp(u) - exp(1);
s = rand(1e4,1)*2*pi;
stpts = [u.*cos(s),u.*sin(s)];
startpts = CustomStartPointSet(stpts);
opts = optimoptions(@fminunc,'Algorithm','quasi-newton');
newprob = createOptimProblem('fminunc','x0',[0;0],'objective',f,...
'options',opts);
[xcust,fcust] = run(ms,newprob,startpts)
xcust =
1.0e+05 *
1.0000
-1.0000
fcust =
-21.0000
xgrid =
3-102
Isolated Global Minimum
1.0e+004 *
10.0000
-10.0000
fgrid =
-21.0000
To perform this filtering most efficiently, write your objective function in a vectorized
fashion. For information, see “Write a Vectorized Function” on page 2-3 or “Vectorize the
Objective and Constraint Functions” on page 4-81. The following function handle
computes a vector of objectives based on an input matrix whose rows represent start
points:
x1 = [1;1];x2 = [1e5;-1e5];
g = @(x) -10*sech(sqrt((x(:,1)-x1(1)).^2 + (x(:,2)-x1(2)).^2)) ...
-20*sech(sqrt((x(:,1)-x2(1)).^2 + (x(:,2)-x2(2)).^2))-1;
Suppose you want to run the local solver only for points where the value is less than –2.
Start with a denser grid than in “MultiStart with a Regular Grid of Start Points” on page
3-102, then filter out all the points with high function value:
xx = -1e6:1e3:1e6;
[xxx,yyy] = meshgrid(xx,xx);
z = [xxx(:),yyy(:)];
idx = g(z) < -2; % index of promising start points
zz = z(idx,:);
smallstartset = CustomStartPointSet(zz);
opts = optimoptions(@fminunc,'Algorithm','quasi-newton','Display','off');
newprobg = createOptimProblem('fminunc','x0',[0,0],...
'objective',g,'options',opts);
% row vector x0 since g expects rows
[xfew,ffew] = run(ms,newprobg,smallstartset)
3-103
3 Using GlobalSearch and MultiStart
xfew =
100000 -100000
ffew =
-21
In this case, MultiStart found the global minimum. There are only two start points in
smallstartset, one of which is the global minimum.
3-104
4
Global Optimization Toolbox functions include three direct search algorithms called the
generalized pattern search (GPS) algorithm, the generating set search (GSS) algorithm,
and the mesh adaptive search (MADS) algorithm. All are pattern search algorithms that
compute a sequence of points that approach an optimal point. At each step, the algorithm
searches a set of points, called a mesh, around the current point—the point computed at
the previous step of the algorithm. The mesh is formed by adding the current point to a
scalar multiple of a set of vectors called a pattern. If the pattern search algorithm finds a
point in the mesh that improves the objective function at the current point, the new point
becomes the current point at the next step of the algorithm.
The GPS algorithm uses fixed direction vectors. The GSS algorithm is identical to the
GPS algorithm, except when there are linear constraints, and when the current point is
near a linear constraint boundary. The MADS algorithm uses a random selection of
vectors to define the mesh. For details, see “Patterns” on page 4-13.
4-2
Optimize Using Pattern Search
where
4-3
4 Using Direct Search
As an example, you can run the example described in “Optimize Using the GPS
Algorithm” on page 4-8 from the command line by entering
This returns
x =
-4.7124 -0.0000
fval =
-2.0000
[x,fval] = patternsearch(@objfun,x0,A,b,Aeq,beq,lb,ub,nonlcon)
where
• A is a matrix and b is vector that represent inequality constraints of the form A·x ≤ b.
• Aeq is a matrix and beq is a vector that represent equality constraints of the form
Aeq·x = beq.
• lb and ub are vectors representing bound constraints of the form lb ≤ x and x ≤ ub,
respectively.
• nonlcon is a function that returns the nonlinear equality and inequality vectors, c
and ceq, respectively. The function is minimized such that c(x) ≤ 0 and ceq(x) = 0.
You only need to pass in the constraints that are part of the problem. For example, if
there are no bound constraints or a nonlinear constraint function, use the syntax
[x,fval] = patternsearch(@objfun,x0,A,b,Aeq,beq)
4-4
Optimize Using Pattern Search
Use empty brackets [] for constraint arguments that are not needed for the problem. For
example, if there are no inequality constraints or a nonlinear constraint function, use the
syntax
[x,fval] = patternsearch(@objfun,x0,[],[],Aeq,beq,lb,ub)
[x,fval,exitflag,output] = patternsearch(@objfun,x0)
Besides x and fval, this returns the following additional output arguments:
For more information about these arguments, see the patternsearch reference page.
at the command line, or enter optimtool and then choose patternsearch from the
Solver menu.
4-5
4 Using Direct Search
Choose solver
Enter problem
and constraints
Run solver
View results
You can also start the tool from the MATLAB Apps tab.
4-6
Optimize Using Pattern Search
• Objective function — The objective function you want to minimize. Enter the
objective function in the form @objfun, where objfun.m is a file that computes the
objective function. The @ sign creates a function handle to objfun.
• Start point — The initial point at which the algorithm starts the optimization.
Then, click Start. The tool displays the results of the optimization in the Run solver
and view results pane.
In the Options pane, set the options for the pattern search. To view the options in a
category, click the + sign next to it.
“Finding the Minimum of the Function” on page 4-10 gives an example of using the
Optimization app.
4-7
4 Using Direct Search
Objective Function
This example uses the objective function, ps_example, which is included with Global
Optimization Toolbox software. View the code for the function by entering
type ps_example
4-8
Optimize Using the GPS Algorithm
4-9
4 Using Direct Search
1 Enter
optimtool
Leave the fields in the Constraints pane blank because the problem is
unconstrained.
4 Click Start to run the pattern search.
The Run solver and view results pane displays the results of the pattern search.
4-10
Optimize Using the GPS Algorithm
The reason the optimization terminated is that the mesh size became smaller than the
acceptable tolerance value for the mesh size, defined by the Mesh tolerance parameter
in the Stopping criteria pane. The minimum function value is approximately –2. The
Final point pane displays the point at which the minimum occurs.
Then click Start to run the pattern search. This displays the following plots.
4-11
4 Using Direct Search
The upper plot shows the objective function value of the best point at each iteration.
Typically, the objective function values improve rapidly at the early iterations and then
level off as they approach the optimal value.
The lower plot shows the mesh size at each iteration. The mesh size increases after each
successful iteration and decreases after each unsuccessful one, explained in “How
Pattern Search Polling Works” on page 4-16.
4-12
Pattern Search Terminology
In this section...
“Patterns” on page 4-13
“Meshes” on page 4-14
“Polling” on page 4-15
“Expanding and Contracting” on page 4-15
Patterns
A pattern is a set of vectors {vi} that the pattern search algorithm uses to determine
which points to search at each iteration. The set {vi} is defined by the number of
independent variables in the objective function, N, and the positive basis set. Two
commonly used positive basis sets in pattern search algorithms are the maximal basis,
with 2N vectors, and the minimal basis, with N+1 vectors.
With GPS, the collection of vectors that form the pattern are fixed-direction vectors. For
example, if there are three independent variables in the optimization problem, the
default for a 2N positive basis consists of the following pattern vectors:
v1 = [1 0 0 ] v2 = [ 0 1 0 ] v3 = [0 0 1]
v4 = [ - 1 0 0] v5 = [ 0 -1 0 ] v6 = [0 0 -1]
An N+1 positive basis consists of the following default pattern vectors.
v1 = [1 0 0 ] v2 = [0 1 0 ] v3 = [0 0 1]
v4 = [ - 1 - 1 - 1]
With GSS, the pattern is identical to the GPS pattern, except when there are linear
constraints and the current point is near a constraint boundary. For a description of the
way in which GSS forms a pattern with linear constraints, see Kolda, Lewis, and Torczon
[1]. The GSS algorithm is more efficient than the GPS algorithm when you have linear
constraints. For an example showing the efficiency gain, see “Compare the Efficiency of
Poll Options” on page 4-53.
With MADS, the collection of vectors that form the pattern are randomly selected by the
algorithm. Depending on the poll method choice, the number of vectors selected will be
2N or N+1. As in GPS, 2N vectors consist of N vectors and their N negatives, while N+1
vectors consist of N vectors and one that is the negative of the sum of the others.
4-13
4 Using Direct Search
References
[1] Kolda, Tamara G., Robert Michael Lewis, and Virginia Torczon. “A generating set
direct search augmented Lagrangian algorithm for optimization with a
combination of general and linear constraints.” Technical Report
SAND2006-5315, Sandia National Laboratories, August 2006.
Meshes
At each step, patternsearch searches a set of points, called a mesh, for a point that
improves the objective function. patternsearch forms the mesh by
1 Generating a set of vectors {di} by multiplying each pattern vector vi by a scalar Δm.
Δm is called the mesh size.
2
Adding the { di } to the current point—the point with the best objective function value
found at the previous step.
The algorithm multiplies the pattern vectors by 4 and adds them to the current point to
obtain the following mesh.
The pattern vector that produces a mesh point is called its direction.
4-14
Pattern Search Terminology
Polling
At each step, the algorithm polls the points in the current mesh by computing their
objective function values. When the Complete poll option has the (default) setting Off,
the algorithm stops polling the mesh points as soon as it finds a point whose objective
function value is less than that of the current point. If this occurs, the poll is called
successful and the point it finds becomes the current point at the next iteration.
The algorithm only computes the mesh points and their objective function values up to
the point at which it stops the poll. If the algorithm fails to find a point that improves the
objective function, the poll is called unsuccessful and the current point stays the same at
the next iteration.
When the Complete poll option has the setting On, the algorithm computes the
objective function values at all mesh points. The algorithm then compares the mesh point
with the smallest objective function value to the current point. If that mesh point has a
smaller value than the current point, the poll is successful.
4-15
4 Using Direct Search
Context
patternsearch finds a sequence of points, x0, x1, x2, ... , that approach an optimal
point. The value of the objective function either decreases or remains the same from each
point in the sequence to the next. This section explains how pattern search works for the
function described in “Optimize Using the GPS Algorithm” on page 4-8.
To simplify the explanation, this section describes how the generalized pattern search
(GPS) works using a maximal positive basis of 2N, with Scale set to Off in Mesh
options.
4-16
How Pattern Search Polling Works
This section does not show how the patternsearch algorithm works with bounds or
linear constraints. For bounds and linear constraints, patternsearch modifies poll
points to be feasible, meaning to satisfy all bounds and linear constraints.
Successful Polls
The pattern search begins at the initial point x0 that you provide. In this example,
x0 = [2.1 1.7].
Iteration 1
At the first iteration, the mesh size is 1 and the GPS algorithm adds the pattern vectors
to the initial point x0 = [2.1 1.7] to compute the following mesh points:
4-17
4 Using Direct Search
[1 0] + x0 = [3.1 1.7]
[0 1] + x0 = [2.1 2.7]
[-1 0] + x0 = [1.1 1.7]
[0 -1] + x0 = [2.1 0.7]
The algorithm computes the objective function at the mesh points in the order shown
above. The following figure shows the value of ps_example at the initial point and mesh
points.
Objective Function Values at Initial Point and Mesh Points
3
Initial point x0
Mesh points
5.6347
2.5
1.5
3.6347
0.5
1 1.5 2 2.5 3 3.5
The algorithm polls the mesh points by computing their objective function values until it
finds one whose value is smaller than 4.6347, the value at x0. In this case, the first such
point it finds is [1.1 1.7], at which the value of the objective function is 4.5146, so the
poll at iteration 1 is successful. The algorithm sets the next point in the sequence equal to
x1 = [1.1 1.7]
Note By default, the GPS pattern search algorithm stops the current iteration as soon as
it finds a mesh point whose fitness value is smaller than that of the current point.
4-18
How Pattern Search Polling Works
Consequently, the algorithm might not poll all the mesh points. You can make the
algorithm poll all the mesh points by setting Complete poll to On.
Iteration 2
After a successful poll, the algorithm multiplies the current mesh size by 2, the default
value of Expansion factor in the Mesh options pane. Because the initial mesh size is 1,
at the second iteration the mesh size is 2. The mesh at iteration 2 contains the following
points:
2*[1 0] + x1 = [3.1 1.7]
2*[0 1] + x1 = [1.1 3.7]
2*[-1 0] + x1 = [-0.9 1.7]
2*[0 -1] + x1 = [1.1 -0.3]
The following figure shows the point x1 and the mesh points, together with the
corresponding values of ps_example.
Objective Function Values at x1 and Mesh Points
4
x1
6.5416
Mesh points
3.5
2.5
2
3.25 4.5146 4.7282
1.5
0.5
0
3.1146
−0.5
−1 −0.5 0 0.5 1 1.5 2 2.5 3 3.5
The algorithm polls the mesh points until it finds one whose value is smaller than
4.5146, the value at x1. The first such point it finds is [-0.9 1.7], at which the value of
the objective function is 3.25, so the poll at iteration 2 is again successful. The algorithm
sets the second point in the sequence equal to
4-19
4 Using Direct Search
x2 = [-0.9 1.7]
Because the poll is successful, the algorithm multiplies the current mesh size by 2 to get
a mesh size of 4 at the third iteration.
An Unsuccessful Poll
By the fourth iteration, the current point is
x3 = [-4.9 1.7]
The following figure shows the mesh points and their objective function values.
Objective Function Values at x3 and Mesh Points
10 x3
7.7351
Mesh points
8
−2
−4
−6 4.3351
−8
−10 −5 0 5
At this iteration, none of the mesh points has a smaller objective function value than the
value at x3, so the poll is unsuccessful. In this case, the algorithm does not change the
current point at the next iteration. That is,
4-20
How Pattern Search Polling Works
x4 = x3;
At the next iteration, the algorithm multiplies the current mesh size by 0.5, the default
value of Contraction factor in the Mesh options pane, so that the mesh size at the next
iteration is 4. The algorithm then polls with a smaller mesh size.
A MADS poll uses newly generated pseudorandom mesh vectors at each iteration. The
vectors are randomly shuffled components from the columns of a random lower-
triangular matrix. The components of the matrix have integer sizes up to 1/ mesh size .
In the poll, the mesh vectors are multiplied by the mesh size, so the poll points can be up
to mesh size from the current point.
4-21
4 Using Direct Search
With this setting, the pattern search displays information about each iteration at the
command line. The first four iterations are
Iter f-count f(x) MeshSize Method
0 1 4.63474 1
1 4 4.51464 2 Successful Poll
2 7 3.25 4 Successful Poll
3 10 -0.264905 8 Successful Poll
4 14 -0.264905 4 Refine Mesh
The entry Successful Poll below Method indicates that the current iteration was
successful. For example, the poll at iteration 2 is successful. As a result, the objective
function value of the point computed at iteration 2, displayed below f(x), is less than
the value at iteration 1.
At iteration 4, the entry Refine Mesh tells you that the poll is unsuccessful. As a result,
the function value at iteration 4 remains unchanged from iteration 3.
By default, the pattern search doubles the mesh size after each successful poll and halves
it after each unsuccessful poll.
More Iterations
The pattern search performs 60 iterations before stopping. The following plot shows the
points in the sequence computed in the first 13 iterations of the pattern search.
4-22
How Pattern Search Polling Works
3 2 1 0
1.5
0.5
13
10
0
6
−0.5
−1
−6 −5 −4 −3 −2 −1 0 1 2 3
The numbers below the points indicate the first iteration at which the algorithm finds
the point. The plot only shows iteration numbers corresponding to successful polls,
because the best point doesn't change after an unsuccessful poll. For example, the best
point at iterations 4 and 5 is the same as at iteration 3.
Poll Method
At each iteration, the pattern search polls the points in the current mesh—that is, it
computes the objective function at the mesh points to see if there is one whose function
value is less than the function value at the current point. “How Pattern Search Polling
Works” on page 4-16 provides an example of polling. You can specify the pattern that
defines the mesh by the Poll method option. The default pattern, GPS Positive
basis 2N, consists of the following 2N directions, where N is the number of independent
variables for the objective function.
[1 0 0...0]
[0 1 0...0]
...
4-23
4 Using Direct Search
[0 0 0...1]
[–1 0 0...0]
[0 –1 0...0]
[0 0 0...–1].
For example, if the objective function has three independent variables, the GPS
Positive basis 2N, consists of the following six vectors.
[1 0 0]
[0 1 0]
[0 0 1]
[–1 0 0]
[0 –1 0]
[0 0 –1].
Alternatively, you can set Poll method to GPS Positive basis NP1, the pattern
consisting of the following N + 1 directions.
[1 0 0...0]
[0 1 0...0]
...
[0 0 0...1]
[–1 –1 –1...–1].
For example, if objective function has three independent variables, the GPS Positive
basis Np1, consists of the following four vectors.
[1 0 0]
[0 1 0]
[0 0 1]
[–1 –1 –1].
A pattern search will sometimes run faster using GPS Positive basis Np1 rather
than the GPS Positive basis 2N as the Poll method, because the algorithm
searches fewer points at each iteration. Although not being addressed in this example,
the same is true when using the MADS Positive basis Np1 over the MADS Positive
basis 2N, and similarly for GSS. For example, if you run a pattern search on the
example described in “Linearly Constrained Problem” on page 4-70, the algorithm
performs 1588 function evaluations with GPS Positive basis 2N, the default Poll
method, but only 877 function evaluations using GPS Positive basis Np1. For more
detail, see “Compare the Efficiency of Poll Options” on page 4-53.
4-24
How Pattern Search Polling Works
However, if the objective function has many local minima, using GPS Positive basis
2N as the Poll method might avoid finding a local minimum that is not the global
minimum, because the search explores more points around the current point at each
iteration.
Complete Poll
By default, if the pattern search finds a mesh point that improves the value of the
objective function, it stops the poll and sets that point as the current point for the next
iteration. When this occurs, some mesh points might not get polled. Some of these
unpolled points might have an objective function value that is even lower than the first
one the pattern search finds.
For problems in which there are several local minima, it is sometimes preferable to make
the pattern search poll all the mesh points at each iteration and choose the one with the
best objective function value. This enables the pattern search to explore more points at
each iteration and thereby potentially avoid a local minimum that is not the global
minimum. In the Optimization app you can make the pattern search poll the entire mesh
setting Complete poll to On in Poll options. At the command line, use optimoptions
to set the UseCompletePoll option to true.
4-25
4 Using Direct Search
4-26
How Pattern Search Polling Works
• After a successful poll, the change in the objective function in the previous two
iterations is less than Function tolerance and the mesh size is less than X
tolerance.
The MADS algorithm uses an additional parameter called the poll parameter, Δp, in the
mesh size stopping criterion:
ÏÔ N D m for positive basis N + 1 poll
Dp =Ì
ÔÓ D m for positive basis 2 N poll,
Δp ≤ Mesh tolerance.
For example, if all points in a poll evaluate to NaN, patternsearch considers the poll
unsuccessful, shrinks the mesh, and reevaluates. If even one point in a poll evaluates to a
smaller value than any seen yet, patternsearch considers the poll successful, and
expands the mesh.
4-27
4 Using Direct Search
Definition of Search
In patternsearch, a search is an algorithm that runs before a poll. The search
attempts to locate a better point than the current point. (Better means one with lower
objective function value.) If the search finds a better point, the better point becomes the
current point, and no polling is done at that iteration. If the search does not find a better
point, patternsearch performs a poll.
By default, patternsearch does not use search. To search, see “How to Use a Search
Method” on page 4-30.
The figure “patternsearch With a Search Method” on page 4-29 contains a flow chart of
direct search including using a search method.
4-28
Searching and Polling
Start
Stop
Yes
Update
current Done?
point
No
Yes Search
Search
enabled?
No
Yes No
Success? Poll
Yes No
Expand mesh Success? Refine mesh
4-29
4 Using Direct Search
Iteration limit applies to all built-in search methods except those that are poll methods.
If you select an iteration limit for the search method, the search is enabled until the
iteration limit is reached. Afterwards, patternsearch stops searching and only polls.
• At the command line, create options with a search method using optimoptions. For
example, to use Latin hypercube search:
opts = optimoptions('patternsearch','SearchFcn',@searchlhs);
For more information, including a list of all built-in search methods, consult the
patternsearch function reference page, and the “Search Options” on page 10-14
section of the options reference.
You can write your own search method. Use the syntax described in “Structure of the
Search Function” on page 10-16. To use your search method in a pattern search, give its
function handle as the Custom Function (SearchFcn) option.
Search Types
• Poll methods — You can use any poll method as a search algorithm. patternsearch
conducts one poll step as a search. For this type of search to be beneficial, your search
4-30
Searching and Polling
type should be different from your poll type. (patternsearch does not search if the
selected search method is the same as the poll type.) Therefore, use a MADS search
with a GSS or GPS poll, or use a GSS or GPS search with a MADS poll.
• fminsearch, also called Nelder-Mead — fminsearch is for unconstrained problems
only. fminsearch runs to its natural stopping criteria; it does not take just one step.
Therefore, use fminsearch for just one iteration. This is the default setting. To
change settings, see “Search Options” on page 10-14.
• ga — ga runs to its natural stopping criteria; it does not take just one step. Therefore,
use ga for just one iteration. This is the default setting. To change settings, see
“Search Options” on page 10-14.
• Latin hypercube search — Described in “Search Options” on page 10-14. By default,
searches 15n points, where n is the number of variables, and only searches during the
first iteration. To change settings, see “Search Options” on page 10-14.
• To speed an optimization (see “Search Methods for Increased Speed” on page 4-31)
• To obtain a better local solution, or to obtain a global solution on page 1-13 (see
“Search Methods for Better Solutions” on page 4-31)
Generally, you do not know beforehand whether a search method speeds an optimization
or not. So try a search method when:
Search does not always speed an optimization. For one example where it does, see
“Search and Poll” on page 4-34.
Since search methods run before poll methods, using search can be equivalent to
choosing a different starting point for your optimization. This comment holds for the
Nelder-Mead, ga, and Latin hypercube search methods, all of which, by default, run once
4-31
4 Using Direct Search
at the beginning of an optimization. ga and Latin hypercube searches are stochastic, and
can search through several basins of attraction on page 1-14.
4-32
Setting Solver Tolerances
• Mesh tolerance — When the current mesh size is less than the value of Mesh
tolerance, the algorithm halts.
• X tolerance — After a successful poll, if the distance from the previous best point to
the current best point is less than the value of X tolerance, the algorithm halts.
• Function tolerance — After a successful poll, if the difference between the function
value at the previous best point and function value at the current best point is less
than the value of Function tolerance, the algorithm halts.
• Nonlinear constraint tolerance — The algorithm treats a point to be feasible if
constraint violation is less than ConstraintTolerance.
• Bind tolerance — Bind tolerance applies to linearly constrained problems. It
specifies how close a point must get to the boundary of the feasible region before a
linear constraint is considered to be active. When a linear constraint is active, the
pattern search polls points in directions parallel to the linear constraint boundary as
well as the mesh points.
Usually, you should set Bind tolerance to be at least as large as the maximum of
Mesh tolerance, X tolerance, and Function tolerance.
4-33
4 Using Direct Search
The following example illustrates the use of a search method on the problem described in
“Linearly Constrained Problem” on page 4-70. To set up the example, enter the
following commands at the MATLAB prompt to define the initial point and constraints.
x0 = [2 1 0 9 1 0];
Aineq = [-8 7 3 -4 9 0];
bineq = 7;
Aeq = [7 1 8 3 3 3; 5 0 -5 1 -5 8; -2 -6 7 1 1 9; 1 -1 2 -2 3 -3];
beq = [84 62 65 1];
Then enter the settings shown in the following figures in the Optimization app.
4-34
Search and Poll
For comparison, click Start to run the example without a search method. This displays
the plots shown in the following figure.
4-35
4 Using Direct Search
To see the effect of using a search method, select MADS Positive Basis 2N in the
Search method field in Search options.
This sets the search method to be a pattern search using the pattern for MADS Positive
Basis 2N. Then click Start to run the pattern search. This displays the following plots.
4-36
Search and Poll
Note that using the search method reduces the total function evaluations—from 1206 to
1159—and reduces the number of iterations from 106 to 97.
2
(
f ( x) = 100 x2 - x12 ) 2
+ ( 1 - x1 ) .
4-37
4 Using Direct Search
opts = optimoptions('patternsearch','MaxFunctionEvaluations',5000,'MaxIterations',2
3 Run patternsearch starting from [-1.9 2]:
[x,feval,eflag,output] = patternsearch(dejong2fcn,...
[-1.9,2],[],[],[],[],[],[],[],opts);
feval
feval =
0.8560
The optimization did not complete, and the result is not very close to the optimal
value of 0.
4 Set the options to use fminsearch as the search method:
opts = optimoptions('patternsearch',opts,'SearchFcn',@searchneldermead);
5 Rerun the optimization, the results are much better:
[x2,feval2,eflag2,output2] = patternsearch(dejong2fcn,...
[-1.9,2],[],[],[],[],[],[],[],opts);
Optimization terminated: mesh size less than options.MeshTolerance.
feval2
feval2 =
4.0686e-010
4-38
Nonlinear Constraint Solver Algorithm
such that
ci ( x) £ 0, i = 1 … m
ceqi ( x) = 0, i = m + 1 … mt
A◊ x £ b
Aeq ◊ x = beq
lb £ x £ ub,
where c(x) represents the nonlinear inequality constraints, ceq(x) represents the equality
constraints, m is the number of nonlinear inequality constraints, and mt is the total
number of nonlinear constraints.
The ALPS algorithm attempts to solve a nonlinear optimization problem with nonlinear
constraints, linear constraints, and bounds. In this approach, bounds and linear
constraints are handled separately from nonlinear constraints. A subproblem is
formulated by combining the objective function and nonlinear constraint function using
the Lagrangian and the penalty parameters. A sequence of such optimization problems
are approximately minimized using a pattern search algorithm such that the linear
constraints and bounds are satisfied.
Each subproblem solution represents one iteration. The number of function evaluations
per iteration is therefore much higher when using nonlinear constraints than otherwise.
where
• The components λi of the vector λ are nonnegative and are known as Lagrange
multiplier estimates
4-39
4 Using Direct Search
The algorithm begins by using an initial value for the penalty parameter
(InitialPenalty).
Each subproblem solution represents one iteration. The number of function evaluations
per iteration is therefore much higher when using nonlinear constraints than otherwise.
References
[1] Kolda, Tamara G., Robert Michael Lewis, and Virginia Torczon. “A generating set
direct search augmented Lagrangian algorithm for optimization with a
combination of general and linear constraints.” Technical Report
SAND2006-5315, Sandia National Laboratories, August 2006.
[2] Conn, A. R., N. I. M. Gould, and Ph. L. Toint. “A Globally Convergent Augmented
Lagrangian Algorithm for Optimization with General Constraints and Simple
Bounds,” SIAM Journal on Numerical Analysis, Volume 28, Number 2, pages
545–572, 1991.
[3] Conn, A. R., N. I. M. Gould, and Ph. L. Toint. “A Globally Convergent Augmented
Lagrangian Barrier Algorithm for Optimization with General Inequality
Constraints and Simple Bounds,” Mathematics of Computation, Volume 66,
Number 217, pages 261–288, 1997.
4-40
Custom Plot Function
stop = false;
if(strcmp(flag,'init'))
set(gca,'Yscale','log'); %Set up the plot
hold on;
xlabel('Iteration');
ylabel('Log Change in Values');
title(['Change in Best Function Value']);
end
4-41
4 Using Direct Search
last_best = best;
else
%Change in objective function value
change = last_best - best;
plot(optimvalues.iteration, change, '.r');
end
4-42
Custom Plot Function
5 Since this is a linearly constrained problem, set the Poll method to GSS Positive
basis 2N.
Now, when you run the example, the pattern search tool displays the plots shown in the
following figure.
4-43
4 Using Direct Search
Note that because the scale of the y-axis in the lower custom plot is logarithmic, the plot
will only show changes that are greater than 0.
The most important statements of the custom plot function, psplotchange.m, are
summarized in the following table.
4-44
Custom Plot Function
Statement Description
persistent last_best Creates the persistent variable
last_best, the best objective function
value in the previous generation.
Persistent variables are preserved over
multiple calls to the plot function.
set(gca,'Yscale','log') Sets up the plot before the algorithm
starts.
best = min(optimvalues.fval) Sets best equal to the minimum objective
function value. The field
optimvalues.fval contains the objective
function value in the current iteration. The
variable best is the minimum objective
function value. For a complete description
of the fields of the structure optimvalues,
see “Structure of the Plot Functions” on
page 10-11.
change = last_best - best Sets the variable change to the best
objective function value at the previous
iteration minus the best objective function
value in the current iteration.
plot(optimvalues.iteration, Plots the variable change at the current
change, '.r') objective function value, for the current
iteration contained
inoptimvalues.iteration.
4-45
4 Using Direct Search
Set Options
In this section...
“Set Options Using optimoptions” on page 4-46
“Create Options and Problems Using the Optimization App” on page 4-48
Pass in empty brackets [] for any constraints that do not appear in the problem.
options = optimoptions(@patternsearch)
This returns options with the default values for its fields.
options =
patternsearch options:
Set properties:
No options set.
Default properties:
AccelerateMesh: 0
ConstraintTolerance: 1.0000e-06
Display: 'final'
FunctionTolerance: 1.0000e-06
InitialMeshSize: 1
MaxFunctionEvaluations: '2000*numberOfVariables'
MaxIterations: '100*numberOfVariables'
MaxTime: Inf
MeshContractionFactor: 0.5000
MeshExpansionFactor: 2
MeshTolerance: 1.0000e-06
4-46
Set Options
OutputFcn: []
PlotFcn: []
PollMethod: 'GPSPositiveBasis2N'
PollOrderAlgorithm: 'consecutive'
ScaleMesh: 1
SearchFcn: []
StepTolerance: 1.0000e-06
UseCompletePoll: 0
UseCompleteSearch: 0
UseParallel: 0
UseVectorized: 0
The patternsearch function uses these default values if you do not pass in options as
an input argument.
options.MeshExpansionFactor
ans =
2
To create options with a field value that is different from the default, use
optimoptions. For example, to change the mesh expansion factor to 3 instead of its
default value 2, enter
options = optimoptions('patternsearch','MeshExpansionFactor',3);
This creates options with all values set to defaults except for MeshExpansionFactor,
which is set to 3.
If you now call patternsearch with the argument options, the pattern search uses a
mesh expansion factor of 3.
If you subsequently decide to change another field in options, such as setting PlotFcn
to @psplotmeshsize, which plots the mesh size at each iteration, call optimoptions
with the syntax
options = optimoptions(options,'PlotFcn',@psplotmeshsize)
4-47
4 Using Direct Search
This preserves the current values of all fields of options except for PlotFcn, which is
changed to @plotmeshsize. Note that if you omit the options input argument,
optimoptions resets MeshExpansionFactor to its default value, which is 2.
You can also set both MeshExpansionFactor and PlotFcn with the single command
options = optimoptions('patternsearch','MeshExpansionFactor',3,'PlotFcn',@psplotmeshsiz
except for the default value of 'Display', which is 'final' when created by
optimoptions, but 'none' when created in the Optimization app.
You can also export an entire problem from the Optimization app and run it from the
command line. Enter
patternsearch(problem)
4-48
Polling Types
Polling Types
In this section...
“Using a Complete Poll in a Generalized Pattern Search” on page 4-49
“Compare the Efficiency of Poll Options” on page 4-53
4-49
4 Using Direct Search
−5
−10
−15
−20 10
5
−25
15 0
10
5 −5
0
−5
−10 −10
The global minimum of the function occurs at (0, 0), where its value is -25. However, the
function also has a local minimum at (0, 9), where its value is -16.
To create a file that computes the function, copy and paste the following code into a new
file in the MATLAB Editor.
function z = poll_example(x)
if x(1)^2 + x(2)^2 <= 25
z = x(1)^2 + x(2)^2 - 25;
elseif x(1)^2 + (x(2) - 9)^2 <= 16
z = x(1)^2 + (x(2) - 9)^2 - 16;
else z = 0;
end
To run a pattern search on the function, enter the following in the Optimization app:
4-50
Polling Types
Click Start to run the pattern search with Complete poll set to Off, its default value.
The Optimization app displays the results in the Run solver and view results pane, as
shown in the following figure.
The pattern search returns the local minimum at (0, 9). At the initial point, (0, 5), the
objective function value is 0. At the first iteration, the search polls the following mesh
points.
As soon as the search polls the mesh point (0, 6), at which the objective function value is
less than at the initial point, it stops polling the current mesh and sets the current point
at the next iteration to (0, 6). Consequently, the search moves toward the local minimum
at (0, 9) at the first iteration. You see this by looking at the first two lines of the
command line display.
Iter f-count f(x) MeshSize Method
0 1 0 1
1 3 -7 2 Successful Poll
4-51
4 Using Direct Search
Note that the pattern search performs only two evaluations of the objective function at
the first iteration, increasing the total function count from 1 to 3.
Next, set Complete poll to On and click Start. The Run solver and view results pane
displays the following results.
This time, the pattern search finds the global minimum at (0, 0). The difference between
this run and the previous one is that with Complete poll set to On, at the first iteration
the pattern search polls all four mesh points.
Because the last mesh point has the lowest objective function value, the pattern search
selects it as the current point at the next iteration. The first two lines of the command-
line display show this.
Iter f-count f(x) MeshSize Method
0 1 0 1
1 5 -9 2 Successful Poll
4-52
Polling Types
In this case, the objective function is evaluated four times at the first iteration. As a
result, the pattern search moves toward the global minimum at (0, 0).
The following figure compares the sequence of points returned when Complete poll is
set to Off with the sequence when Complete poll is On.
• GSS is more efficient than GPS or MADS for linearly constrained problems.
4-53
4 Using Direct Search
Note The efficiency of an algorithm depends on the problem. GSS is efficient for linearly
constrained problems. However, predicting the efficiency implications of the other poll
options is difficult, as is knowing which poll type works best with other constraints.
Problem setup
The problem is the same as in “Performing a Pattern Search on the Example” on page 4-
71. This linearly constrained problem uses the lincontest7 file that comes with the
toolbox:
4-54
Polling Types
4-55
4 Using Direct Search
4-56
Polling Types
7 Set Options > Poll > Poll method to GPS Positive basis Np1 and set
Complete poll to off.
8 Run the optimization.
9 Export the result and name it gpsnp1off.
10 Set Complete poll to on.
11 Run the optimization.
12 Export the result and name it gpsnp1on.
13 Continue in a like manner to create solution structures for the other poll methods
with Complete poll set on and off: gss2noff, gss2non, gssnp1off, gssnp1on,
mads2noff, mads2non, madsnp1off, and madsnp1on.
You have the results of 12 optimization runs. The following table shows the efficiency of
the runs, measured in total function counts and in iterations. Your MADS results could
differ, since MADS is a stochastic algorithm.
Algorithm Function Count Iterations
GPS2N, complete poll off 1462 136
GPS2N, complete poll on 1396 96
GPSNp1, complete poll off 864 118
GPSNp1, complete poll on 1007 104
GSS2N, complete poll off 758 84
GSS2N, complete poll on 889 74
GSSNp1, complete poll off 533 94
GSSNp1, complete poll on 491 70
MADS2N, complete poll off 922 162
MADS2N, complete poll on 2285 273
MADSNp1, complete poll off 1155 201
MADSNp1, complete poll on 1651 201
To obtain, say, the first row in the table, enter gps2noff.output.funccount and
gps2noff.output.iterations. You can also examine options in the Variables editor
by double-clicking the options in the Workspace Browser, and then double-clicking the
output structure.
4-57
4 Using Direct Search
4-58
Polling Types
• Setting Complete poll to on generally lowers the number of iterations for GPS and
GSS, but the change in number of function evaluations is unpredictable.
• Setting Complete poll to on does not necessarily change the number of iterations for
MADS, but substantially increases the number of function evaluations.
• The most efficient algorithm/options settings, with efficiency meaning lowest function
count:
1 GSS Positive basis Np1 with Complete poll set to on (function count 491)
2 GSS Positive basis Np1 with Complete poll set to off (function count 533)
3 GSS Positive basis 2N with Complete poll set to off (function count 758)
4 GSS Positive basis 2N with Complete poll set to on (function count 889)
4-59
4 Using Direct Search
You can view the expansion and contraction of the mesh size during the pattern search
by selecting Mesh size in the Plot functions pane. To also display the values of the
mesh size and objective function at the command line, set Level of display to
Iterative in the Display to command window options.
For example, set up the problem described in “Linearly Constrained Problem” on page 4-
70 as follows:
4-60
Set Mesh Options
4-61
4 Using Direct Search
To see the changes in mesh size more clearly, change the y-axis to logarithmic scaling as
follows:
4-62
Set Mesh Options
1 Select Axes Properties from the Edit menu in the plot window.
2 In the Properties Editor, select the Y Axis tab.
3 Set Scale to Log.
Updating these settings in the MATLAB Property Editor shows the plot in the following
figure.
The first 5 iterations result in successful polls, so the mesh sizes increase steadily during
this time. You can see that the first unsuccessful poll occurs at iteration 6 by looking at
the command-line display:
4-63
4 Using Direct Search
Note that at iteration 5, which is successful, the mesh size doubles for the next iteration.
But at iteration 6, which is unsuccessful, the mesh size is multiplied 0.5.
To see how Expansion factor and Contraction factor affect the pattern search, make
the following changes:
Then click Start. The Run solver and view results pane shows that the final point is
approximately the same as with the default settings of Expansion factor and
Contraction factor, but that the pattern search takes longer to reach that point.
4-64
Set Mesh Options
When you change the scaling of the y-axis to logarithmic, the mesh size plot appears as
shown in the following figure.
4-65
4 Using Direct Search
Note that the mesh size increases faster with Expansion factor set to 3.0, as compared
with the default value of 2.0, and decreases more slowly with Contraction factor set to
0.75, as compared with the default value of 0.5.
Mesh Accelerator
The mesh accelerator can make a pattern search converge faster to an optimal point by
reducing the number of iterations required to reach the mesh tolerance. When the mesh
size is below a certain value, the pattern search contracts the mesh size by a factor
smaller than the Contraction factor factor. Mesh accelerator applies only to the GPS
and GSS algorithms.
Note For best results, use the mesh accelerator for problems in which the objective
function is not too steep near the optimal point, or you might lose some accuracy. For
differentiable problems, this means that the absolute value of the derivative is not too
large near the solution.
4-66
Set Mesh Options
To use the mesh accelerator, set Accelerator to On in the Mesh options. Or, at the
command line, set the AccelerateMesh option to true.
For example, set up the problem described in “Linearly Constrained Problem” on page 4-
70 as follows:
4-67
4 Using Direct Search
The number of iterations required to reach the mesh tolerance is 78, as compared with 84
when Accelerator is set to Off.
You can see the effect of the mesh accelerator by setting Level of display to Iterative
in Display to command window. Run the example with Accelerator set to On, and
then run it again with Accelerator set to Off. The mesh sizes are the same until
iteration 70, but differ at iteration 71. The MATLAB Command Window displays the
following lines for iterations 70 and 71 with Accelerator set to Off.
Iter f-count f(x) MeshSize Method
70 618 1919.54 6.104e-05 Refine Mesh
71 630 1919.54 3.052e-05 Refine Mesh
Note that the mesh size is multiplied by 0.5, the default value of Contraction factor.
For comparison, the Command Window displays the following lines for the same
iteration numbers with Accelerator set to On.
Iter f-count f(x) MeshSize Method
70 618 1919.54 6.104e-05 Refine Mesh
71 630 1919.54 1.526e-05 Refine Mesh
4-68
Set Mesh Options
4-69
4 Using Direct Search
where
È 36 17 19 12 8 15 ˘
Í 17 33 18 11 7 14 ˙˙
Í
Í19 18 43 13 8 16 ˙
H =Í ˙,
Í 12 11 13 18 6 11 ˙
Í8 7 8 6 9 8˙
Í ˙
ÍÎ 15 14 16 11 8 29 ˙˚
f = [ 20 15 21 18 29 24 ],
where
4-70
Constrained Minimization Using patternsearch
A = [ -8 7 3 -4 9 0 ],
b = 7,
È7 1 8 3 3 3˘
Í 5 0 -5 1 -5 8 ˙
Aeq = Í ˙,
Í -2 -6 7 1 1 9 ˙
Í ˙
Î 1 - 1 2 - 2 3 -3 ˚
beq = [84 62 65 1].
optimtool('patternsearch')
to open the Optimization app, or enter optimtool and then choose patternsearch
from the Solver menu. Then type the following function in the Objective function
field:
@lincontest7
x0 = [2 1 0 9 1 0];
Aineq = [-8 7 3 -4 9 0];
bineq = 7;
Aeq = [7 1 8 3 3 3; 5 0 -5 1 -5 8; -2 -6 7 1 1 9; 1 -1 2 -2 3 -3];
beq = [84 62 65 1];
• Set A to Aineq.
• Set b to bineq.
4-71
4 Using Direct Search
Since this is a linearly constrained problem, set the Poll method to GSS Positive
basis 2N. For more information about the efficiency of the GSS search methods for
linearly constrained problems, see “Compare the Efficiency of Poll Options” on page 4-53.
Then click Start to run the pattern search. When the search is finished, the results are
displayed in Run solver and view results pane, as shown in the following figure.
4-72
Constrained Minimization Using patternsearch
( ) ( )
min f ( x) = 4 - 2 .1 x12 - x14 / 3 x12 + x1 x2 + - 4 + 4 x22 x22
x
Begin by creating the objective and constraint functions. First, create a file named
simple_objective.m as follows:
function y = simple_objective(x)
y = (4 - 2.1*x(1)^2 + x(1)^4/3)*x(1)^2 + x(1)*x(2) + (-4 + 4*x(2)^2)*x(2)^2;
The pattern search solver assumes the objective function will take one input x where x
has as many elements as number of variables in the problem. The objective function
4-73
4 Using Direct Search
computes the value of the function and returns that scalar value in its one return
argument y.
The pattern search solver assumes the constraint function will take one input x, where x
has as many elements as the number of variables in the problem. The constraint function
computes the values of all the inequality and equality constraints and returns two
vectors, c and ceq, respectively.
Next, to minimize the objective function using the patternsearch function, you need to
pass in a function handle to the objective function as well as specifying a start point as
the second argument. Lower and upper bounds are provided as LB and UB respectively.
In addition, you also need to pass a function handle to the nonlinear constraint function.
ObjectiveFunction = @simple_objective;
X0 = [0 0]; % Starting point
LB = [0 0]; % Lower bound
UB = [1 13]; % Upper bound
ConstraintFunction = @simple_constraint;
[x,fval] = patternsearch(ObjectiveFunction,X0,[],[],[],[],...
LB,UB,ConstraintFunction)
x =
0.8122 12.3122
fval =
9.1324e+004
Next, plot the results. Create options using optimoptions that selects two plot
functions. The first plot function psplotbestf plots the best objective function value at
every iteration. The second plot function psplotmaxconstr plots the maximum
constraint violation at every iteration.
4-74
Constrained Minimization Using patternsearch
Note You can also visualize the progress of the algorithm by displaying information to
the Command Window using the 'Display' option.
options = optimoptions('patternsearch','PlotFcn',{@psplotbestf,@psplotmaxconstr},'Display','iter');
[x,fval] = patternsearch(ObjectiveFunction,X0,[],[],[],[],LB,UB,ConstraintFunction,options)
Max
Iter Func-count f(x) Constraint MeshSize Method
0 1 0 10 0.8919
1 28 113580 0 0.001 Increase penalty
2 105 91324 1.782e-07 1e-05 Increase penalty
3 192 91324 1.188e-11 1e-07 Increase penalty
Optimization terminated: mesh size less than options.MeshTolerance
and constraint violation is less than options.ConstraintTolerance.
x =
0.8122 12.3122
fval =
9.1324e+04
4-75
4 Using Direct Search
Best Objective Function Value and Maximum Constraint Violation at Each Iteration
4-76
Use Cache
Use Cache
Typically, at any given iteration of a pattern search, some of the mesh points might
coincide with mesh points at previous iterations. By default, the pattern search
recomputes the objective function at these mesh points even though it has already
computed their values and found that they are not optimal. If computing the objective
function takes a long time—say, several minutes—this can make the pattern search run
significantly longer.
You can eliminate these redundant computations by using a cache, that is, by storing a
history of the points that the pattern search has already visited. To do so, set Cache to
On in Cache options. At each poll, the pattern search checks to see whether the current
mesh point is within a specified tolerance, Tolerance, of a point in the cache. If so, the
search does not compute the objective function for that point, but uses the cached
function value and moves on to the next point.
Note When Cache is set to On, the pattern search might fail to identify a point in the
current mesh that improves the objective function because it is within the specified
tolerance of a point in the cache. As a result, the pattern search might run for more
iterations with Cache set to On than with Cache set to Off. It is generally a good idea to
keep the value of Tolerance very small, especially for highly nonlinear objective
functions.
For example, set up the problem described in “Linearly Constrained Problem” on page 4-
70 as follows:
4-77
4 Using Direct Search
4-78
Use Cache
After the pattern search finishes, the plots appear as shown in the following figure.
4-79
4 Using Direct Search
Now, set Cache to On and run the example again. This time, the plots appear as shown
in the following figure.
4-80
Vectorize the Objective and Constraint Functions
If there are nonlinear constraints, the objective function and the nonlinear constraints all
need to be vectorized in order for the algorithm to compute in a vectorized manner.
Note Write your vectorized objective function or nonlinear constraint function to accept a
matrix with an arbitrary number of points. patternsearch sometimes evaluates a
single point even during a vectorized calculation.
• If the initial point x0 is a column vector of size m, the objective function takes each
column of the matrix as a point in the pattern and returns a row vector of size n.
4-81
4 Using Direct Search
• If the initial point x0 is a row vector of size n, the objective function takes each row of
the matrix as a point in the pattern and returns a column vector of size m.
• If the initial point x0 is a scalar, patternsearch assumes that x0 is a row vector.
Therefore, the input matrix has one column (n = 1, the input matrix is a vector), and
each entry of the matrix represents one row for the objective function to evaluate. The
output of the objective function in this case is a column vector of size m.
Pictorially, the matrix and calculation are represented by the following figure.
Results
...
If x0 is a Each column represents
column one search/poll point
If x0 is a row Results
4-82
Vectorize the Objective and Constraint Functions
If the initial vector x0 is a column vector, such as [0;0], a function for vectorized
evaluation is
function f = vectorizedc(x)
f = x(1,:).^4+x(2,:).^4-4*x(1,:).^2-2*x(2,:).^2 ...
+3*x(1,:)-.5*x(2,:);
If the initial vector x0 is a row vector, such as [0,0], a function for vectorized evaluation
is
function f = vectorizedr(x)
f = x(:,1).^4+x(:,2).^4-4*x(:,1).^2-2*x(:,2).^2 ...
+3*x(:,1)-.5*x(:,2);
Tip If you want to use the same objective (fitness) function for both pattern search and
genetic algorithm, write your function to have the points represented by row vectors, and
write x0 as a row vector. The genetic algorithm always takes individuals as the rows of a
matrix. This was a design decision—the genetic algorithm does not require a user-
supplied population, so needs to have a default format.
options=optimoptions('patternsearch','UseVectorized',true,'UseCompletePoll',true);
x0=[0;0];
[x,fval]=patternsearch(@vectorizedc,x0,...
[],[],[],[],[],[],[],options)
x =
-1.5737
1.0575
fval =
-10.0088
4-83
4 Using Direct Search
The same considerations hold for constraint functions as for objective functions: the
initial point x0 determines the type of points (row or column vectors) in the poll or
search. If the initial point is a row vector of size k, the matrix x passed to the constraint
function has k columns. Similarly, if the initial point is a column vector of size k, the
matrix of poll or search points has k rows. The figure “Structure of Vectorized Functions”
on page 4-82 may make this clear. If the initial point is a scalar, patternsearch
assumes that it is a row vector.
Your nonlinear constraint function returns two matrices, one for inequality constraints,
and one for equality constraints. Suppose there are nc nonlinear inequality constraints
and nceq nonlinear equality constraints. For row vector x0, the constraint matrices have
nc and nceq columns respectively, and the number of rows is the same as in the input
matrix. Similarly, for a column vector x0, the constraint matrices have nc and nceq rows
respectively, and the number of columns is the same as in the input matrix. In figure
“Structure of Vectorized Functions” on page 4-82, “Results” includes both nc and nceq.
c(:,1)=x(:,1).^2/9+x(:,2).^2/4-1;
c(:,2)=cosh(x(:,1))-x(:,2)-1;
ceq=[];
4-84
Vectorize the Objective and Constraint Functions
x0=[0,0];
options = optimoptions('patternsearch','UseVectorized',true,'UseCompletePoll',true);
[x,fval] = patternsearch(@vectorizedr,x0,...
[],[],[],[],[],[],@ellipsecosh,options)
x =
-1.3516 1.0612
fval =
-9.5394
4-85
4 Using Direct Search
It also shows how to avoid computing the objective and nonlinear constraint function
twice when the ODE solution returns both. The example compares patternsearch and
ga in terms of time to run the solver and the quality of the solutions.
The problem is to change the position and angle of a cannon to fire a projectile as far as
possible beyond a wall. The cannon has a muzzle velocity of 300 m/s. The wall is 20 m
high. If the cannon is too close to the wall, it has to fire at too steep an angle, and the
projectile does not travel far enough. If the cannon is too far from the wall, the projectile
does not travel far enough either.
Air resistance slows the projectile. The resisting force is proportional to the square of the
velocity, with proportionality constant 0.02. Gravity acts on the projectile, accelerating it
downward with constant 9.81 m/s2. Therefore, the equations of motion for the trajectory
x(t) are
d2 x(t)
= - 0.02 v(t) v( t) - (0, 9 .81),
dt 2
The initial position x0 and initial velocity xp0 are 2-D vectors. However, the initial
height x0(2) is 0, so the initial position depends only on the scalar x0(1). And the
initial velocity xp0 has magnitude 300 (the muzzle velocity), so depends only on the
initial angle, a scalar. For an initial angle th, xp0 = 300*(cos(th),sin(th)).
Therefore, the optimization problem depends only on two scalars, so it is a 2-D problem.
Use the horizontal distance and the angle as the decision variables.
ODE solvers require you to formulate your model as a first-order system. Augment the
trajectory vector (x1(t),x2(t)) with its time derivative (x'1(t),x'2(t)) to form a 4-D trajectory
vector. In terms of this augmented vector, the differential equation becomes
4-86
Optimize an ODE in Parallel
È x3 (t) ˘
Í ˙
d Í x4 (t) ˙
x(t) = Í .
dt - .02 ( x3 ( t), x4 ( t) ) x3 (t) ˙
Í ˙
Í -.02 ( x3 (t), x4 (t) ) x4 (t) - 9.81 ˙
Î ˚
Write the differential equation as a function file, and save it on your MATLAB path.
function f = cannonfodder(t,x)
Visualize the solution of the ODE starting 30 m from the wall at an angle of pi/3.
4-87
4 Using Direct Search
x0 = [-30;0;300*cos(pi/3);300*sin(pi/3)];
sol = ode45(@cannonfodder,[0,10],x0);
% Find the time when the projectile lands
zerofnd = fzero(@(r)deval(sol,r,2),[sol.x(2),sol.x(end)]);
t = linspace(0,zerofnd); % equal times for plot
xs = deval(sol,t,1); % interpolated x values
ys = deval(sol,t,2); % interpolated y values
plot(xs,ys)
hold on
plot([0,0],[0,20],'k') % Draw the wall
xlabel('Horizontal distance')
ylabel('Trajectory height')
4-88
Optimize an ODE in Parallel
legend('Trajectory','Wall','Location','NW')
ylim([0 120])
hold off
The problem is to find initial position x0(1) and initial angle x0(2) to maximize the
distance from the wall the projectile lands. Because this is a maximization problem,
minimize the negative of the distance (see “Maximizing vs. Minimizing” on page 2-6).
To use patternsearch to solve this problem, you must provide the objective, constraint,
initial guess, and options.
These two files are the objective and constraint functions. Copy them to a folder on your
MATLAB path.
function f = cannonobjective(x)
x0 = [x(1);0;300*cos(x(2));300*sin(x(2))];
sol = ode45(@cannonfodder,[0,15],x0);
ceq = [];
x0 = [x(1);0;300*cos(x(2));300*sin(x(2))];
sol = ode45(@cannonfodder,[0,15],x0);
4-89
4 Using Direct Search
Notice that the objective and constraint functions set their input variable x0 to a 4-D
initial point for the ODE solver. The ODE solver does not stop if the projectile hits the
wall. Instead, the constraint function simply becomes positive, indicating an infeasible
initial value.
The initial position x0(1) cannot be above 0, and it is futile to have it be below –200. (It
should be near –20 because, with no air resistance, the longest trajectory would start at –
20 at an angle pi/4.) Similarly, the initial angle x0(2) cannot be below 0, and cannot be
above pi/2. Set bounds slightly away from these initial values:
lb = [-200;0.05];
ub = [-1;pi/2-.05];
x0 = [-30,pi/3]; % initial guess
Set the UseCompletePoll option to true. This gives a higher-quality solution, and
enables direct comparison with parallel processing, because computing in parallel
requires this setting.
opts = optimoptions('patternsearch','UseCompletePoll',true);
xsolution =
-28.8123 0.6095
distance =
-125.9880
eflag =
1
outpt =
function: @cannonobjective
problemtype: 'nonlinearconstr'
pollmethod: 'gpspositivebasis2n'
maxconstraint: 0
4-90
Optimize an ODE in Parallel
searchmethod: []
iterations: 5
funccount: 269
meshsize: 8.9125e-07
rngstate: [1x1 struct]
message: 'Optimization terminated: mesh size less than options.MeshTolerance…
Starting the projectile about 29 m from the wall at an angle 0.6095 radian results in the
farthest distance, about 126 m. The reported distance is negative because the objective
function is the negative of the distance to the wall.
sol = ode45(@cannonfodder,[0,15],x0);
% Find the time when the projectile lands
zerofnd = fzero(@(r)deval(sol,r,2),[sol.x(2),sol.x(end)]);
t = linspace(0,zerofnd); % equal times for plot
xs = deval(sol,t,1); % interpolated x values
ys = deval(sol,t,2); % interpolated y values
plot(xs,ys)
hold on
plot([0,0],[0,20],'k') % Draw the wall
xlabel('Horizontal distance')
ylabel('Trajectory height')
legend('Trajectory','Wall','Location','NW')
ylim([0 70])
hold off
4-91
4 Using Direct Search
Both the objective and nonlinear constraint function call the ODE solver to calculate
their values. Use the technique in “Objective and Nonlinear Constraints in the Same
Function” (Optimization Toolbox) to avoid calling the solver twice. The runcannon file
implements this technique. Copy this file to a folder on your MATLAB path.
function [x,f,eflag,outpt] = runcannon(x0,opts)
4-92
Optimize an ODE in Parallel
lb = [-200;0.05];
ub = [-1;pi/2-.05];
% Call patternsearch
[x,f,eflag,outpt] = patternsearch(fun,x0,[],[],[],[],lb,ub,cfun,opts);
function y = objfun(x)
if ~isequal(x,xLast) % Check if computation is necessary
x0 = [x(1);0;300*cos(x(2));300*sin(x(2))];
sol = ode45(@cannonfodder,[0,15],x0);
xLast = x;
end
% Now compute objective function
% First find when the projectile hits the ground
zerofnd = fzero(@(r)deval(sol,r,2),[sol.x(2),sol.x(end)]);
% Then compute the x-position at that time
y = deval(sol,zerofnd,1);
y = -y; % take negative of distance
end
end
4-93
4 Using Direct Search
[xsolution,distance,eflag,outpt] = runcannon(x0,opts);
toc
The solver ran faster than before. If you examine the solution, you see that the output is
identical.
Try to save more time by computing in parallel. Begin by opening a parallel pool of
workers.
parpool
ans =
Connected: true
NumWorkers: 4
Cluster: local
AttachedFiles: {}
IdleTimeout: 30 minute(s) (30 minutes remaining)
SpmdEnabled: true
Set the options to use parallel computing, and rerun the solver.
opts = optimoptions('patternsearch',opts,'UseParallel',true);
x0 = [-30;pi/3];
tic
[xsolution,distance,eflag,outpt] = runcannon(x0,opts);
toc
In this case, parallel computing was slower. If you examine the solution, you see that the
output is identical.
You can also try to solve the problem using the genetic algorithm. However, the genetic
algorithm is usually slower and less reliable.
4-94
Optimize an ODE in Parallel
The runcannonga file calls the genetic algorithm and avoids double evaluation of the
ODE solver. It resembles runcannon, but calls ga instead of patternsearch, and also
checks whether the trajectory reaches the wall. Copy this file to a folder on your
MATLAB path.
lb = [-200;0.05];
ub = [-1;pi/2-.05];
% Call ga
[x,f,eflag,outpt] = ga(fun,2,[],[],[],[],lb,ub,cfun,opts);
function y = objfun(x)
if ~isequal(x,xLast) % Check if computation is necessary
x0 = [x(1);0;300*cos(x(2));300*sin(x(2))];
sol = ode45(@cannonfodder,[0,15],x0);
xLast = x;
end
% Now compute objective function
% First find when the projectile hits the ground
zerofnd = fzero(@(r)deval(sol,r,2),[sol.x(2),sol.x(end)]);
% Then compute the x-position at that time
y = deval(sol,zerofnd,1);
y = -y; % take negative of distance
end
4-95
4 Using Direct Search
end
opts = optimoptions('ga','UseParallel',true);
rng default % for reproducibility
tic
[xsolution,distance,eflag,outpt] = runcannonga(opts)
toc
xsolution =
-17.9172 0.8417
distance =
-116.6263
eflag =
1
outpt =
problemtype: 'nonlinearconstr'
rngstate: [1x1 struct]
generations: 5
funccount: 20212
message: [1x140 char]
maxconstraint: 0
4-96
See Also
The ga solution is not as good as the patternsearch solution: 117 m versus 126 m. ga
took much more time: about 120 s versus under 5 s.
See Also
Related Examples
• “Objective and Nonlinear Constraints in the Same Function” (Optimization Toolbox)
More About
• “Parallel Computing”
4-97
5
The genetic algorithm uses three main types of rules at each step to create the next
generation from the current population:
• Selection rules select the individuals, called parents, that contribute to the population
at the next generation.
• Crossover rules combine two parents to form children for the next generation.
• Mutation rules apply random changes to individual parents to form children.
See Also
More About
• “Genetic Algorithm Terminology” on page 5-15
5-2
See Also
5-3
5 Using the Genetic Algorithm
Optimize Using ga
In this section...
“Calling the Function ga at the Command Line” on page 5-4
“Use the Optimization App” on page 5-4
where
For an example, see “Finding the Minimum from the Command Line” on page 5-10.
5-4
Optimize Using ga
at the command line, or enter optimtool and then choose ga from the Solver menu.
You can also start the tool from the MATLAB Apps tab.
5-5
5 Using the Genetic Algorithm
To use the Optimization app, you must first enter the following information:
• Fitness function — The objective function you want to minimize. Enter the fitness
function in the form @fitnessfun, where fitnessfun.m is a file that computes the
fitness function. “Compute Objective Functions” on page 2-2 explains how write this
file. The @ sign creates a function handle to fitnessfun.
• Number of variables — The length of the input vector to the fitness function. For
the function my_fun described in “Compute Objective Functions” on page 2-2, you
would enter 2.
You can enter constraints or a nonlinear constraint function for the problem in the
Constraints pane. If the problem is unconstrained, leave these fields blank.
To run the genetic algorithm, click the Start button. The tool displays the results of the
optimization in the Run solver and view results pane.
You can change the options for the genetic algorithm in the Options pane. To view the
options in one of the categories listed in the pane, click the + sign next to it.
5-6
Minimize Rastrigin's Function
Rastrigin's Function
This section presents an example that shows how to find the minimum of Rastrigin's
function, a function that is often used to test the genetic algorithm.
5-7
5 Using the Genetic Algorithm
As the plot shows, Rastrigin's function has many local minima—the “valleys” in the plot.
However, the function has just one global minimum, which occurs at the point [0 0] in
the x-y plane, as indicated by the vertical line in the plot, where the value of the function
is 0. At any local minimum other than [0 0], the value of Rastrigin's function is greater
than 0. The farther the local minimum is from the origin, the larger the value of the
function is at that point.
Rastrigin's function is often used to test the genetic algorithm, because its many local
minima make it difficult for standard, gradient-based methods to find the global
minimum.
The following contour plot of Rastrigin's function shows the alternating maxima and
minima.
0.8
0.6
−0.2
−0.4
Local minima
−0.6
−0.8
−1
−1 −0.5 0 0.5 1
Global minimum at [0 0]
5-8
Minimize Rastrigin's Function
Note Because the genetic algorithm uses random number generators, the algorithm
returns slightly different results each time you run it.
The Fitness function and Number of variables fields should appear as shown
in the following figure.
3 Click the Start button in the Run solver and view results pane, as shown in the
following figure.
While the algorithm is running, the Current iteration field displays the number of
the current generation. You can temporarily pause the algorithm by clicking the
Pause button. When you do so, the button name changes to Resume. To resume the
algorithm from the point at which you paused it, click Resume.
5-9
5 Using the Genetic Algorithm
When the algorithm is finished, the Run solver and view results pane appears as
shown in the following figure. Your numerical results might differ from those in the
figure, since ga is stochastic.
• The final value of the fitness function when the algorithm terminated:
Objective function value: 5.550533778020394E-4
Note that the value shown is very close to the actual minimum value of
Rastrigin's function, which is 0. “Setting the Initial Range” on page 5-75,
“Setting the Amount of Mutation” on page 5-90, and “Set Maximum Number of
Generations” on page 5-110 describe some ways to get a result that is closer to
the actual minimum.
• The reason the algorithm terminated.
Optimization terminated: maximum number of generations exceeded.
• The final point, which in this example is [-0.002 -0.001].
This returns
5-10
Minimize Rastrigin's Function
Optimization terminated:
average change in the fitness value less than options.FunctionTolerance.
x =
-1.0421 -1.0018
fval =
2.4385
exitflag =
1
Note Because the genetic algorithm uses random number generators, the algorithm
returns slightly different results each time you run it.
Displaying Plots
The Optimization app Plot functions pane enables you to display various plots that
provide information about the genetic algorithm while it is running. This information can
help you change options to improve the performance of the algorithm. For example, to
plot the best and mean values of the fitness function at each generation, select the box
next to Best fitness, as shown in the following figure.
5-11
5 Using the Genetic Algorithm
When you click Start, the Optimization app displays a plot of the best and mean values
of the fitness function at each generation.
When the algorithm stops, the plot appears as shown in the following figure.
5-12
Minimize Rastrigin's Function
The points at the bottom of the plot denote the best fitness values, while the points above
them denote the averages of the fitness values in each generation. The plot also displays
the best and mean values in the current generation numerically at the top.
To get a better picture of how much the best fitness values are decreasing, you can
change the scaling of the y-axis in the plot to logarithmic scaling. To do so,
1 Select Axes Properties from the Edit menu in the plot window to open the
Property Editor attached to your figure window as shown below.
5-13
5 Using the Genetic Algorithm
Typically, the best fitness value improves rapidly in the early generations, when the
individuals are farther from the optimum. The best fitness value improves more
slowly in later generations, whose populations are closer to the optimal point.
Note When you display more than one plot, you can open a larger version of a plot in a
separate window. Right-click (Ctrl-click for macOS) on a blank area in a plot while ga is
running, or after it has stopped, and choose the sole menu item.
“Plot Options” on page 10-30 describes the types of plots you can create.
5-14
Genetic Algorithm Terminology
Fitness Functions
The fitness function is the function you want to optimize. For standard optimization
algorithms, this is known as the objective function. The toolbox software tries to find the
minimum of the fitness function.
Write the fitness function as a file or anonymous function, and pass it as a function
handle input argument to the main genetic algorithm function.
Individuals
An individual is any point to which you can apply the fitness function. The value of the
fitness function for an individual is its score. For example, if the fitness function is
2 2 2
f ( x1, x2 , x3 ) = ( 2 x1 + 1 ) + ( 3 x2 + 4 ) + ( x3 - 2 ) ,
the vector (2, -3, 1), whose length is the number of variables in the problem, is an
individual. The score of the individual (2, –3, 1) is f(2, –3, 1) = 51.
5-15
5 Using the Genetic Algorithm
a 100-by-3 matrix. The same individual can appear more than once in the population. For
example, the individual (2, -3, 1) can appear in more than one row of the array.
At each iteration, the genetic algorithm performs a series of computations on the current
population to produce a new population. Each successive population is called a new
generation.
Diversity
Diversity refers to the average distance between individuals in a population. A
population has high diversity if the average distance is large; otherwise it has low
diversity. In the following figure, the population on the left has high diversity, while the
population on the right has low diversity.
Diversity is essential to the genetic algorithm because it enables the algorithm to search
a larger region of the space.
5-16
See Also
See Also
More About
• “What Is the Genetic Algorithm?” on page 5-2
• “How the Genetic Algorithm Works” on page 5-18
• “Nonlinear Constraint Solver Algorithms” on page 5-51
5-17
5 Using the Genetic Algorithm
a Scores each member of the current population by computing its fitness value.
These values are called the raw fitness scores.
b Scales the raw fitness scores to convert them into a more usable range of values.
These scaled values are called expectation values.
c Selects members, called parents, based on their expectation.
d Some of the individuals in the current population that have lower fitness are
chosen as elite. These elite individuals are passed to the next population.
e Produces children from the parents. Children are produced either by making
random changes to a single parent—mutation—or by combining the vector
entries of a pair of parents—crossover.
f Replaces the current population with the children to form the next generation.
3 The algorithm stops when one of the stopping criteria is met. See “Stopping
Conditions for the Algorithm” on page 5-22.
5-18
How the Genetic Algorithm Works
Initial Population
The algorithm begins by creating a random initial population, as shown in the following
figure.
1
Initial population
0.8
0.6
0.4
0.2
−0.2
−0.4
−0.6
−0.8
−1
−1 −0.5 0 0.5 1
In this example, the initial population contains 20 individuals. Note that all the
individuals in the initial population lie in the upper-right quadrant of the picture, that is,
their coordinates lie between 0 and 1. For this example, the Initial range in the
Population options is [0;1].
If you know approximately where the minimal point for a function lies, you should set
Initial range so that the point lies near the middle of that range. For example, if you
believe that the minimal point for Rastrigin's function is near the point [0 0], you could
set Initial range to be [-1;1]. However, as this example shows, the genetic algorithm
can find the minimum even with a less than optimal choice for Initial range.
5-19
5 Using the Genetic Algorithm
vectors—to their children. The algorithm usually selects individuals that have better
fitness values as parents. You can specify the function that the algorithm uses to select
the parents in the Selection function field in the Selection options.
The genetic algorithm creates three types of children for the next generation:
• Elite children are the individuals in the current generation with the best fitness
values. These individuals automatically survive to the next generation.
• Crossover children are created by combining the vectors of a pair of parents.
• Mutation children are created by introducing random changes, or mutations, to a
single parent.
“Mutation and Crossover” on page 5-26 explains how to specify the number of children
of each type that the algorithm generates and the functions it uses to perform crossover
and mutation.
The following sections explain how the algorithm creates crossover and mutation
children.
Crossover Children
The algorithm creates crossover children by combining pairs of parents in the current
population. At each coordinate of the child vector, the default crossover function
randomly selects an entry, or gene, at the same coordinate from one of the two parents
5-20
How the Genetic Algorithm Works
and assigns it to the child. For problems with linear constraints, the default crossover
function creates the child as a random weighted average of the parents.
Mutation Children
The algorithm creates mutation children by randomly changing the genes of individual
parents. By default, for unconstrained problems the algorithm adds a random vector
from a Gaussian distribution to the parent. For bounded or linearly constrained
problems, the child remains feasible.
The following figure shows the children of the initial population, that is, the population
at the second generation, and indicates whether they are elite, crossover, or mutation
children.
0.8
0.6
0.4
0.2
−0.2
−0.4
−0.6
Elite children
−0.8 Crossover children
Mutation children
−1
−1 −0.5 0 0.5 1 1.5 2
5-21
5 Using the Genetic Algorithm
Iteration 60 Iteration 80
1 1
0.8 0.8
0.6 0.6
0.4 0.4
0.2 0.2
0 0
−0.2 −0.2
−0.4 −0.4
−0.6 −0.6
−0.8 −0.8
−1 −1
−1 −0.5 0 0.5 1 −1 −0.5 0 0.5 1
0.8 0.8
0.6 0.6
0.4 0.4
0.2 0.2
0 0
−0.2 −0.2
−0.4 −0.4
−0.6 −0.6
−0.8 −0.8
−1 −1
−1 −0.5 0 0.5 1 −1 −0.5 0 0.5 1
As the number of generations increases, the individuals in the population get closer
together and approach the minimum point [0 0].
• Generations — The algorithm stops when the number of generations reaches the
value of Generations.
• Time limit — The algorithm stops after running for an amount of time in seconds
equal to Time limit.
• Fitness limit — The algorithm stops when the value of the fitness function for the
best point in the current population is less than or equal to Fitness limit.
• Stall generations — The algorithm stops when the average relative change in the
fitness function value over Stall generations is less than Function tolerance.
5-22
How the Genetic Algorithm Works
• Stall time limit — The algorithm stops if there is no improvement in the objective
function during an interval of time in seconds equal to Stall time limit.
• Stall test — The stall condition is either average change or geometric
weighted. For geometric weighted, the weighting function is 1/2n, where n is the
number of generations prior to the current. Both stall conditions apply to the relative
change in the fitness function over Stall generations.
• Function tolerance — The algorithm runs until the average relative change in the
fitness function value over Stall generations is less than Function tolerance.
• Constraint tolerance — The Constraint tolerance is not used as stopping
criterion. It is used to determine the feasibility with respect to nonlinear constraints.
Also, max(sqrt(eps),ConstraintTolerance) determines feasibility with respect
to linear constraints.
The algorithm stops as soon as any one of these conditions is met. You can specify the
values of these criteria in the Stopping criteria pane in the Optimization app. The
default values are shown in the pane.
5-23
5 Using the Genetic Algorithm
When you run the genetic algorithm, the Run solver and view results panel displays
the criterion that caused the algorithm to stop.
The options Stall time limit and Time limit prevent the algorithm from running too
long. If the algorithm stops due to one of these conditions, you might improve your
results by increasing the values of Stall time limit and Time limit.
5-24
How the Genetic Algorithm Works
Selection
The selection function chooses parents for the next generation based on their scaled
values from the fitness scaling function. The scaled fitness values are called the
expectation values. An individual can be selected more than once as a parent, in which
case it contributes its genes to more than one child. The default selection option,
Stochastic uniform, lays out a line in which each parent corresponds to a section of
the line of length proportional to its scaled value. The algorithm moves along the line in
steps of equal size. At each step, the algorithm allocates a parent from the section it
lands on.
• In the first step, the function selects parents deterministically according to the
integer part of the scaled value for each individual. For example, if an individual's
scaled value is 2.3, the function selects that individual twice as a parent.
• In the second step, the selection function selects additional parents using the
fractional parts of the scaled values, as in stochastic uniform selection. The function
lays out a line in sections, whose lengths are proportional to the fractional part of the
scaled value of the individuals, and moves along the line in equal steps to select the
parents.
Note that if the fractional parts of the scaled values all equal 0, as can occur using
Top scaling, the selection is entirely deterministic.
For details and more selection options, see “Selection Options” on page 10-39.
Reproduction Options
Reproduction options control how the genetic algorithm creates the next generation. The
options are
• Elite count — The number of individuals with the best fitness values in the current
generation that are guaranteed to survive to the next generation. These individuals
are called elite children. The default value of Elite count is 2.
When Elite count is at least 1, the best fitness value can only decrease from one
generation to the next. This is what you want to happen, since the genetic algorithm
minimizes the fitness function. Setting Elite count to a high value causes the fittest
individuals to dominate the population, which can make the search less effective.
5-25
5 Using the Genetic Algorithm
• Crossover fraction — The fraction of individuals in the next generation, other than
elite children, that are created by crossover. “Setting the Crossover Fraction” on page
5-92 describes how the value of Crossover fraction affects the performance of the
genetic algorithm.
Both processes are essential to the genetic algorithm. Crossover enables the algorithm to
extract the best genes from different individuals and recombine them into potentially
superior children. Mutation adds to the diversity of a population and thereby increases
the likelihood that the algorithm will generate individuals with better fitness values.
See “Creating the Next Generation” on page 5-19 for an example of how the genetic
algorithm applies mutation and crossover.
You can specify how many of each type of children the algorithm creates as follows:
For example, if the Population size is 20, the Elite count is 2, and the Crossover
fraction is 0.8, the numbers of each type of children in the next generation are as
follows:
5-26
See Also
See Also
More About
• “Genetic Algorithm Terminology” on page 5-15
• “Nonlinear Constraint Solver Algorithms” on page 5-51
5-27
5 Using the Genetic Algorithm
IntCon is a vector of positive integers that contains the x components that are integer-
valued. For example, if you want to restrict x(2) and x(10) to be integers, set IntCon
to [2,10].
Note Restrictions exist on the types of problems that ga can solve with integer variables.
In particular, ga does not accept any equality constraints when there are integer
variables. For details, see “Characteristics of the Integer ga Solver” on page 5-30.
Tip ga solves integer problems best when you provide lower and upper bounds for every x
component.
This example shows how to find the minimum of Rastrigin's function restricted so the
first component of x is an integer. The components of x are further restricted to be in the
region .
lb = [5*pi,-20*pi];
ub = [20*pi,-4*pi];
5-28
Mixed Integer Optimization
Optimization terminated: average change in the penalty fitness value less than options.
and constraint violation is less than options.ConstraintTolerance.
x =
16.0000 -12.9325
fval =
424.1355
exitflag =
5-29
5 Using the Genetic Algorithm
• No linear equality constraints. You must have Aeq = [] and beq = []. For a
possible workaround, see “No Equality Constraints” on page 5-31.
• No nonlinear equality constraints. Any nonlinear constraint function must return []
for the nonlinear equality constraint. For a possible workaround, see “Example:
Integer Programming with a Nonlinear Equality Constraint” on page 5-32.
5-30
Mixed Integer Optimization
The listed restrictions are mainly natural, not arbitrary. For example:
• There are no hybrid functions that support integer constraints. So ga does not use
hybrid functions when there are integer constraints.
• To obtain integer variables, ga uses special creation, crossover, and mutation
functions.
No Equality Constraints
You cannot use equality constraints and integer constraints in the same problem. You
can try to work around this restriction by including two inequality constraints for each
linear equality constraint. For example, to try to include the constraint
3x1 – 2x2 = 5,
3x1 – 2x2 ≤ 5
3x1 – 2x2 ≥ 5.
To write these constraints in the form A x ≤ b, multiply the second inequality by -1:
You can try to include the equality constraint using A = [3,-2;-3,2] and b = [5;-5].
Be aware that this procedure can fail; ga has difficulty with simultaneous integer and
equality constraints.
5-31
5 Using the Genetic Algorithm
This example attempts to locate the minimum of the Ackley function in five dimensions
with these constraints:
The Ackley function, described briefly in “Resuming ga From the Final Population” on
page 5-60, is difficult to minimize. Adding integer and equality constraints increases the
difficulty.
To include the nonlinear equality constraint, give a small tolerance tol that allows the
norm of x to be within tol of 4. Without a tolerance, the nonlinear equality constraint is
never satisfied, and the solver does not realize when it has a feasible solution.
norm(x) - 4 ≤ 0
-(norm(x) - 4) ≤ 0.
2 Allow a small tolerance in the inequalities:
norm(x) - 4 - tol ≤ 0
-(norm(x) - 4) - tol ≤ 0.
3 Write a nonlinear inequality constraint function that implements these inequalities:
function [c, ceq] = eqCon(x)
ceq = [];
rad = 4;
tol = 1e-3;
confcnval = norm(x) - rad;
c = [confcnval - tol;-confcnval - tol];
4 Set options:
5-32
Mixed Integer Optimization
opts = optimoptions('ga','MaxStallGenerations',50,'FunctionTolerance',1e-10,...
'MaxGenerations',300,'PlotFcn',@gaplotbestfun);
5 Set lower and upper bounds to help the solver:
nVar = 5;
lb = -5*ones(1,nVar);
ub = 5*ones(1,nVar);
6 Solve the problem:
rng(1,'twister') % for reproducibility
[x,fval,exitflag] = ga(@ackleyfcn,nVar,[],[],[],[], ...
lb,ub,@eqCon,[1 3 5],opts);
5-33
5 Using the Genetic Algorithm
x,fval,exitflag,norm(x)
x =
fval =
5.1119
exitflag =
5-34
Mixed Integer Optimization
ans =
4.0001
The odd x components are integers, as specified. The norm of x is 4, to within the
given relative tolerance of 1e-3.
8 Despite the positive exit flag, the solution is not the global optimum. Run the
problem again and examine the solution:
opts = optimoptions('ga',opts,'Display','off');
[x2,fval2,exitflag2] = ga(@ackleyfcn,nVar,[],[],[],[], ...
lb,ub,@eqCon,[1 3 5],opts);
5-35
5 Using the Genetic Algorithm
x2,fval2,exitflag2,norm(x2)
x2 =
fval2 =
4.2359
5-36
Mixed Integer Optimization
exitflag2 =
ans =
3.9980
The second run gives a better solution (lower fitness function value). Again, the odd
x components are integers, and the norm of x2 is 4, to within the given relative
tolerance of 1e-3.
Be aware that this procedure can fail; ga has difficulty with simultaneous integer and
equality constraints.
Integer ga Algorithm
Integer programming with ga involves several modifications of the basic algorithm (see
“How the Genetic Algorithm Works” on page 5-18). For integer programming:
5-37
5 Using the Genetic Algorithm
References
[1] Deb, Kalyanmoy. An efficient constraint handling method for genetic algorithms.
Computer Methods in Applied Mechanics and Engineering, 186(2–4), pp. 311–
338, 2000.
[2] Deep, Kusum, Krishna Pratap Singh, M.L. Kansal, and C. Mohan. A real coded
genetic algorithm for solving integer and mixed integer optimization problems.
Applied Mathematics and Computation, 212(2), pp. 505–518, 2009.
See Also
Related Examples
• “Solving a Mixed Integer Engineering Design Problem Using the Genetic
Algorithm” on page 5-39
5-38
Solving a Mixed Integer Engineering Design Problem Using the Genetic Algorithm
The problem illustrated in this example involves the design of a stepped cantilever beam.
In particular, the beam must be able to carry a prescribed end load. We will solve a
problem to minimize the beam volume subject to various engineering design constraints.
In this example we will solve two bounded versions of the problem published in [1].
A stepped cantilever beam is supported at one end and a load is applied at the free end,
as shown in the figure below. The beam must be able to support the given load, , at a
fixed distance from the support. Designers of the beam can vary the width ( ) and
height ( ) of each section. We will assume that each section of the cantilever has the
same length, .
5-39
5 Using the Genetic Algorithm
The volume of the beam, , is the sum of the volume of the individual sections
Consider a single cantilever beam, with the centre of coordinates at the centre of its cross
section at the free end of the beam. The bending stress at a point in the beam is
given by the following equation
where is the bending moment at , is the distance from the end load and is the
area moment of inertia of the beam.
5-40
Solving a Mixed Integer Engineering Design Problem Using the Genetic Algorithm
Now, in the stepped cantilever beam shown in the figure, the maximum moment of each
section of the beam is , where is the maximum distance from the end load, , for
each section of the beam. Therefore, the maximum stress for the -th section of the beam,
, is given by
where the maximum stress occurs at the edge of the beam, . The area moment of
inertia of the -th section of the beam is given by
The bending stress in each part of the cantilever should not exceed the maximum
allowable stress, . Consequently, we can finally state the five bending stress
constraints (one for each step of the cantilever)
The end deflection of the cantilever can be calculated using Castigliano's second theorem,
which states that
where is the deflection of the beam, is the energy stored in the beam due to the
applied force, .
5-41
5 Using the Genetic Algorithm
Given that for a cantilever beam, we can write the above equation as
where is the area moment of inertia of the -th part of the cantilever. Evaluating the
integral gives the following expression for .
Now, the end deflection of the cantilever, , should be less than the maximum allowable
deflection, , which gives us the following constraint.
For each step of the cantilever, the aspect ratio must not exceed a maximum allowable
aspect ratio, . That is,
for
We are now able to state the problem to find the optimal parameters for the stepped
cantilever beam given the stated constraints.
Let , , , , , , , , and
Minimize:
5-42
Solving a Mixed Integer Engineering Design Problem Using the Genetic Algorithm
Subject to:
The first step of the beam can only be machined to the nearest centimetre. That is,
and must be integer. The remaining variables are continuous. The bounds on the
variables are given below:-
For the problem we will solve in this example, the end load that the beam must support
is .
5-43
5 Using the Genetic Algorithm
A note on the linear constraints: When linear constraints are specified to ga, you
normally specify them via the A, b, Aeq and beq inputs. In this case we have specified
them via the nonlinear constraint function. This is because later in this example, some of
the variables will become discrete. When there are discrete variables in the problem it is
far easier to specify linear constraints in the nonlinear constraint function. The
alternative is to modify the linear constraint matrices to work in the transformed
variable space, which is not trivial and maybe not possible. Also, in the mixed integer ga
solver, the linear constraints are not treated any differently to the nonlinear constraints
regardless of how they are specified.
Create vectors containing the lower bound (lb) and upper bound constraints (ub).
lb = [1 30 2.4 45 2.4 45 1 30 1 30];
ub = [5 65 3.1 60 3.1 60 5 65 5 65];
Note that there are a restricted set of ga options available when solving mixed integer
problems - see Global Optimization Toolbox User's Guide for more details.
5-44
Solving a Mixed Integer Engineering Design Problem Using the Genetic Algorithm
We can now call ga to solve the problem. In the problem statement and are integer
variables. We specify this by passing the index vector [1 2] to ga after the nonlinear
constraint input and before the options input. We also seed and set the random number
generator here for reproducibility.
rng(0, 'twister');
[xbest, fbest, exitflag] = ga(@cantileverVolume, 10, [], [], [], [], ...
lb, ub, @cantileverConstraints, [1 2], opts);
5-45
5 Using the Genetic Algorithm
The solution returned from ga is displayed below. Note that the section nearest the
support is constrained to have a width ( ) and height ( ) which is an integer value and
this constraint has been honored by GA.
display(xbest);
5-46
Solving a Mixed Integer Engineering Design Problem Using the Genetic Algorithm
xbest =
Columns 1 through 7
Columns 8 through 10
The engineers are now informed that the second and third steps of the cantilever can
only have widths and heights that are chosen from a standard set. In this section, we
show how to add this constraint to the optimization problem. Note that with the addition
of this constraint, this problem is identical to that solved in [1].
First, we state the extra constraints that will be added to the above optimization
• The width of the second and third steps of the beam must be chosen from the
following set:- [2.4, 2.6, 2.8, 3.1] cm
• The height of the second and third steps of the beam must be chosen from the
following set:- [45, 50, 55, 60] cm
So, first we transform the bounds on the discrete variables. Each set has 4 members and
we will map the discrete variables to an integer in the range [1, 4]. So, to map these
variables to be integer, we set the lower bound to 1 and the upper bound to 4 for each of
the variables.
5-47
5 Using the Genetic Algorithm
lb = [1 30 1 1 1 1 1 30 1 30];
ub = [5 65 4 4 4 4 5 65 5 65];
Transformed (integer) versions of , , and will now be passed to the fitness and
constraint functions when the ga solver is called. To evaluate these functions correctly,
, , and need to be transformed to a member of the given discrete set in these
functions. To see how this is done, examine the MATLAB files
cantileverVolumeWithDisc.m, cantileverConstraintsWithDisc.m and
cantileverMapVariables.m.
Now we can call ga to solve the problem with discrete variables. In this case are
integers. This means that we pass the index vector 1:6 to ga to define the integer
variables.
rng(0, 'twister');
[xbestDisc, fbestDisc, exitflagDisc] = ga(@cantileverVolumeWithDisc, ...
10, [], [], [], [], lb, ub, @cantileverConstraintsWithDisc, 1:6, opts);
5-48
Solving a Mixed Integer Engineering Design Problem Using the Genetic Algorithm
xbestDisc =
Columns 1 through 7
5-49
5 Using the Genetic Algorithm
Columns 8 through 10
As before, the solution returned from ga honors the constraint that and are
integers. We can also see that , are chosen from the set [2.4, 2.6, 2.8, 3.1] cm and ,
are chosen from the set [45, 50, 55, 60] cm.
Recall that we have added additional constraints on the variables x(3), x(4), x(5) and
x(6). As expected, when there are additional discrete constraints on these variables, the
optimal solution has a higher minimum volume. Note further that the solution reported
in [1] has a minimum volume of and that we find a solution which is
approximately the same as that reported in [1].
fprintf('\nCost function returned by ga = %g\n', fbestDisc);
Summary
This example illustrates how to use the genetic algorithm solver, ga, to solve a
constrained nonlinear optimization problem which has integer constraints. The example
also shows how to handle problems that have discrete variables in the problem
formulation.
References
[1] Survey of discrete variable optimization for structural design, P.B. Thanedar, G.N.
Vanderplaats, J. Struct. Eng., 121 (3), 301-306 (1995)
5-50
Nonlinear Constraint Solver Algorithms
such that
ci ( x) £ 0, i = 1 … m
ceqi ( x) = 0, i = m + 1 … mt
A◊ x £ b
Aeq ◊ x = beq
lb £ x £ ub,
where c(x) represents the nonlinear inequality constraints, ceq(x) represents the equality
constraints, m is the number of nonlinear inequality constraints, and mt is the total
number of nonlinear constraints.
5-51
5 Using the Genetic Algorithm
where
• The components λi of the vector λ are nonnegative and are known as Lagrange
multiplier estimates
• The elements si of the vector s are nonnegative shifts
• ρ is the positive penalty parameter.
The algorithm begins by using an initial value for the penalty parameter
(InitialPenalty).
Each subproblem solution represents one generation. The number of function evaluations
per generation is therefore much higher when using nonlinear constraints than
otherwise.
References
[1] Conn, A. R., N. I. M. Gould, and Ph. L. Toint. “A Globally Convergent Augmented
Lagrangian Algorithm for Optimization with General Constraints and Simple
Bounds,” SIAM Journal on Numerical Analysis, Volume 28, Number 2, pages
545–572, 1991.
[2] Conn, A. R., N. I. M. Gould, and Ph. L. Toint. “A Globally Convergent Augmented
Lagrangian Barrier Algorithm for Optimization with General Inequality
Constraints and Simple Bounds,” Mathematics of Computation, Volume 66,
Number 217, pages 261–288, 1997.
5-52
See Also
Penalty Algorithm
The penalty algorithm is similar to the “Integer ga Algorithm” on page 5-37. In its
evaluation of the fitness of an individual, ga computes a penalty value as follows:
References
[1] Deb, Kalyanmoy. An efficient constraint handling method for genetic algorithms.
Computer Methods in Applied Mechanics and Engineering, 186(2–4), pp. 311–
338, 2000.
See Also
More About
• “Genetic Algorithm Terminology” on page 5-15
• “How the Genetic Algorithm Works” on page 5-18
5-53
5 Using the Genetic Algorithm
5-54
Create Custom Plot Function
else
change = last_best - best; % Change in best score
last_best = best;
if change > 0 % Plot only when the fitness improves
plot(state.Generation,change,'xr');
end
end
Now, if you run the example described in “Minimize Rastrigin's Function” on page 5-7,
the tool displays plots similar to those shown in the following figure.
5-55
5 Using the Genetic Algorithm
The plot only shows changes that are greater than 0, which are improvements in best
fitness. The logarithmic scale enables you to see small changes in the best fitness
function that the upper plot does not reveal.
5-56
Create Custom Plot Function
The most important lines of the plot function are the following:
• persistent last_best
axx = gca;
axx.Yscale = 'log';
The field state.Score contains the scores of all individuals in the current
population. The variable best is the minimum score. For a complete description of
the fields of the structure state, see “Structure of the Plot Functions” on page 10-32.
• change = last_best - best
The variable change is the best score at the previous generation minus the best score
in the current generation.
• if change > 0
The code for gaplotchange contains many of the same elements as the code for
gaplotbestf, the function that creates the best fitness plot.
5-57
5 Using the Genetic Algorithm
See Also
Related Examples
• “Custom Output Function for Genetic Algorithm” on page 5-119
• “Plot Options” on page 10-30
5-58
Reproduce Results in Optimization App
Normally, you should leave Use random states from previous run unselected to get
the benefit of randomness in the genetic algorithm. Select the Use random states from
previous run check box if you want to analyze the results of that particular run or show
the exact results to others. After the algorithm has run, you can clear your results using
the Clear Status button in the Run solver settings.
Note If you select Include information needed to resume this run, then selecting
Use random states from previous run has no effect on the initial population created
when you import the problem and run the genetic algorithm on it. The latter option is
only intended to reproduce results from the beginning of a new run, not from a resumed
run.
5-59
5 Using the Genetic Algorithm
Resume ga
In this section...
“Resuming ga From the Final Population” on page 5-60
“Resuming ga From a Previous Run” on page 5-64
5-60
Resume ga
Suppose you want to experiment by running the genetic algorithm with other options
settings, and then later restart this run from its final population with its current options
settings. You can do this using the following steps:
1 Click Export to Workspace.
2 In the dialog box that appears,
5-61
5 Using the Genetic Algorithm
3 Click OK.
This exports the problem and options to a structure in the MATLAB workspace. You can
view the structure in the MATLAB Command Window by entering
ackley_uniform
ackley_uniform =
fitnessfcn: @ackleyfcn
nvars: 10
Aineq: []
bineq: []
Aeq: []
beq: []
lb: []
ub: []
nonlcon: []
intcon: []
rngstate: []
solver: 'ga'
options: [1x1 struct]
After running the genetic algorithm with different options settings or even a different
fitness function, you can restore the problem as follows:
1 Select Import Problem from the File menu. This opens the dialog box shown in the
following figure.
5-62
Resume ga
2 Select ackley_uniform.
3 Click Import.
This sets the Initial population and Initial scores fields in the Population panel to
the final population of the run before you exported the problem. All other options are
restored to their setting during that run. When you click Start, the genetic algorithm
resumes from the saved final population. The following figure shows the best fitness plots
from the original run and the restarted run.
5-63
5 Using the Genetic Algorithm
Note If, after running the genetic algorithm with the imported problem, you want to
restore the genetic algorithm's default behavior of generating a random initial
population, delete the population in the Initial population field.
The version of Ackley's function in the toolbox differs from the published version of
Ackley's function in Ackley [1]. The toolbox version has another exponential applied,
leading to flatter regions, so a more difficult optimization problem.
References
[1] Ackley, D. H. A connectionist machine for genetic hillclimbing. Kluwer Academic
Publishers, Boston, 1987.
The last output argument is the final population. To run ga using final_pop as the
initial population, enter
5-64
Resume ga
You can then use final_pop2, the final population from the second run, as the initial
population for a third run.
In Optimization app, you can choose to export a problem in a way that lets you resume
the run. Simply check the box Include information needed to resume this run when
exporting the problem.
This saves the final population, which becomes the initial population when imported.
If you want to run a problem that was saved with the final population, but would rather
not use the initial population, simply delete or otherwise change the initial population in
the Options > Population pane.
5-65
5 Using the Genetic Algorithm
• @fitnessfun — A function handle to the file that computes the fitness function.
“Compute Objective Functions” on page 2-2 explains how to write this file.
• nvars — The number of independent variables for the fitness function.
For a description of additional input and output arguments, see the reference page for
ga.
You can run the example described in “Minimize Rastrigin's Function” on page 5-7 from
the command line by entering
rng(1,'twister') % for reproducibility
[x,fval] = ga(@rastriginsfcn,2)
This returns
Optimization terminated:
average change in the fitness value less than options.FunctionTolerance.
x =
-1.0421 -1.0018
5-66
Options and Outputs
fval =
2.4385
[x,fval] = ga(@fitnessfun,nvars,[],[],[],[],[],[],[],options)
This syntax does not specify any linear equality, linear inequality, or nonlinear
constraints.
options = optimoptions(@ga);
This returns options with the default values for its fields. ga uses these default values
if you do not pass in options as an input argument.
ans =
To create options with a field value that is different from the default — for example to
set PopulationSize to 100 instead of its default value 50 — enter
options = optimoptions('ga','PopulationSize',100);
This creates options with all values set to their defaults except for PopulationSize,
which is set to 100.
5-67
5 Using the Genetic Algorithm
If you subsequently decide to change another field in options, such as setting PlotFcn
to @gaplotbestf, which plots the best fitness function value at each generation, call
optimoptions with the syntax
options = optimoptions(options,'PlotFcn',@plotbestf);
This preserves the current values of all fields of options except for PlotFcn, which is
changed to @plotbestf. Note that if you omit the input argument options,
optimoptions resets PopulationSize to its default value.
You can also set both PopulationSize and PlotFcn with the single command
options = optimoptions('ga','PopulationSize',100,'PlotFcn',@plotbestf);
Besides x and fval, this function returns the following additional output arguments:
See the ga reference page for more information about these arguments.
5-68
Use Exported Options and Problems
except that the exported 'Display' option defaults to 'off', and is 'final' in the
default at the command line.
If you export a problem, ga_problem, from the Optimization app, you can apply ga to it
using the syntax
[x,fval] = ga(ga_problem)
5-69
5 Using the Genetic Algorithm
Reproduce Results
Because the genetic algorithm is stochastic—that is, it makes random choices—you get
slightly different results each time you run the genetic algorithm. The algorithm uses the
default MATLAB pseudorandom number stream. For more information about random
number streams, see RandStream. Each time ga calls the stream, its state changes. So
that the next time ga calls the stream, it returns a different random number. This is why
the output of ga differs each time you run it.
If you need to reproduce your results exactly, you can call ga with an output argument
that contains the current state of the default stream, and then reset the state to this
value before running ga again. For example, to reproduce the output of ga applied to
Rastrigin's function, call ga with the syntax
x =
-1.0421 -1.0018
fval =
2.4385
exitflag =
1
The state of the stream is stored in output.rngstate. To reset the state, enter
stream = RandStream.getGlobalStream;
stream.State = output.rngstate.State;
If you now run ga a second time, you get the same results as before:
[x,fval,exitflag] = ga(@rastriginsfcn, 2)
Optimization terminated: average change in the fitness value less than options.Function
x =
-1.0421 -1.0018
5-70
Reproduce Results
fval =
2.4385
exitflag =
1
You can reproduce your run in the Optimization app by checking the box Use random
states from previous run in the Run solver and view results section.
Note If you do not need to reproduce your results, it is better not to set the state of the
stream, so that you get the benefit of the randomness in the genetic algorithm.
5-71
5 Using the Genetic Algorithm
You can plot the values of fval against the crossover fraction with the following
commands:
plot(0:.05:1, record);
xlabel('Crossover Fraction');
ylabel('fval')
5-72
Run ga from a File
The plot suggests that you get the best results by setting
options.CrossoverFraction to a value somewhere between 0.4 and 0.8.
You can get a smoother plot of fval as a function of the crossover fraction by running ga
20 times and averaging the values of fval for each crossover fraction. The following
figure shows the resulting plot.
5-73
5 Using the Genetic Algorithm
This plot also suggests the range of best choices for options.CrossoverFraction is
0.4 to 0.8.
5-74
Population Diversity
Population Diversity
In this section...
“Importance of Population Diversity” on page 5-75
“Setting the Initial Range” on page 5-75
“Custom Plot Function and Linear Constraints in ga” on page 5-79
“Setting the Population Size” on page 5-84
This section explains how to control diversity by setting the Initial range of the
population. “Setting the Amount of Mutation” on page 5-90 describes how the amount of
mutation affects diversity.
This section also explains how to set the population size on page 5-84.
Note The initial range restricts the range of the points in the initial population by
specifying the lower and upper bounds. Subsequent generations can contain points whose
entries do not lie in the initial range. Set upper and lower bounds for all generations in
the Bounds fields in the Constraints panel.
If you know approximately where the solution to a problem lies, specify the initial range
so that it contains your guess for the solution. However, the genetic algorithm can find
5-75
5 Using the Genetic Algorithm
the solution even if it does not lie in the initial range, if the population has enough
diversity.
The following example shows how the initial range affects the performance of the genetic
algorithm. The example uses Rastrigin's function, described in “Minimize Rastrigin's
Function” on page 5-7. The minimum value of the function is 0, which occurs at the
origin.
To run the example, open the ga solver in the Optimization app by entering
optimtool('ga') at the command line. Set the following:
Click Start in Run solver and view results. Although the results of genetic algorithm
computations are random, your results are similar to the following figure, with a best
fitness function value of approximately 2.
5-76
Population Diversity
The upper plot, which displays the best fitness at each generation, shows little progress
in lowering the fitness value. The lower plot shows the average distance between
individuals at each generation, which is a good measure of the diversity of a population.
For this setting of initial range, there is too little diversity for the algorithm to make
progress.
Next, try setting Initial range to [1;100] and running the algorithm. This time the
results are more variable. You might obtain a plot with a best fitness value of about 7, as
in the following plot. You might obtain different results.
5-77
5 Using the Genetic Algorithm
This time, the genetic algorithm makes progress, but because the average distance
between individuals is so large, the best individuals are far from the optimal solution.
Finally, set Initial range to [1;2] and run the genetic algorithm. Again, there is
variability in the result, but you might obtain a result similar to the following figure.
Run the optimization several times, and you eventually obtain a final point near [0;0],
with a fitness function value near 0.
5-78
Population Diversity
The diversity in this case is better suited to the problem, so ga usually returns a better
result than in the previous two cases.
5-79
5 Using the Genetic Algorithm
Fitness Function
The fitness function is lincontest6, included with your software. This is a quadratic
function of two variables:
switch flag
% Plot initialization
case 'init'
pop = state.Population(:,dimensionsToPlot);
plotHandle = plot(pop(:,1),pop(:,2),'*');
set(plotHandle,'Tag','gaplotshowpopulation2')
title('Population plot in two dimension','interp','none')
xlabelStr = sprintf('%s %s','Variable ', num2str(dimensionsToPlot(1)));
ylabelStr = sprintf('%s %s','Variable ', num2str(dimensionsToPlot(2)));
xlabel(xlabelStr,'interp','none');
ylabel(ylabelStr,'interp','none');
5-80
Population Diversity
hold on;
5-81
5 Using the Genetic Algorithm
set(plotHandle,'Xdata',pop(:,1),'Ydata',pop(:,2));
end
The custom plot function plots the lines representing the linear inequalities and bound
constraints, plots level curves of the fitness function, and plots the population as it
evolves. This plot function expects to have not only the usual inputs
(options,state,flag), but also a function handle to the fitness function,
@lincontest6 in this example. To generate level curves, the custom plot function needs
the fitness function.
Problem Constraints
options = optimoptions('ga','PlotFcns',...
{{@gaplotshowpopulation2,@lincontest6}});
The initial population, in the first plot, has many members on the linear constraint
boundaries. The population is reasonably well-dispersed.
rng default % for reproducibility
[x,fval] = ga(@lincontest6,2,A,b,[],[],lb,[],[],options);
Optimization terminated: average change in the fitness value less than options.Function
5-82
Population Diversity
5-83
5 Using the Genetic Algorithm
Note You should set Population size to be at least the value of Number of variables,
so that the individuals in each population span the space being searched.
5-84
Population Diversity
You can experiment with different settings for Population size that return good results
without taking a prohibitive amount of time to run.
5-85
5 Using the Genetic Algorithm
Fitness Scaling
In this section...
“Scaling the Fitness Scores” on page 5-86
“Comparing Rank and Top Scaling” on page 5-88
The range of the scaled values affects the performance of the genetic algorithm. If the
scaled values vary too widely, the individuals with the highest scaled values reproduce
too rapidly, taking over the population gene pool too quickly, and preventing the genetic
algorithm from searching other areas of the solution space. On the other hand, if the
scaled values vary only a little, all individuals have approximately the same chance of
reproduction and the search will progress very slowly.
The default fitness scaling option, Rank, scales the raw scores based on the rank of each
individual instead of its score. The rank of an individual is its position in the sorted
scores: the rank of the most fit individual is 1, the next most fit is 2, and so on. The rank
scaling function assigns scaled values so that
•
The scaled value of an individual with rank n is proportional to 1 / n .
• The sum of the scaled values over the entire population equals the number of parents
needed to create the next generation.
Rank fitness scaling removes the effect of the spread of the raw scores.
The following plot shows the raw scores of a typical population of 20 individuals, sorted
in increasing order.
5-86
Fitness Scaling
130
120
110
100
Score
90
80
70
60
50
0 5 10 15 20
Sorted individuals
The following plot shows the scaled values of the raw scores using rank scaling.
Scaled Values Using Rank Scaling
4.5
3.5
3
Scaled value
2.5
1.5
0.5
0 5 10 15 20
Sorted individuals
Because the algorithm minimizes the fitness function, lower raw scores have higher
scaled values. Also, because rank scaling assigns values that depend only on an
individual's rank, the scaled values shown would be the same for any population of size
20 and number of parents equal to 32.
5-87
5 Using the Genetic Algorithm
The following figure compares the scaled values of a population of size 20 with number of
parents equal to 32 using rank and top scaling.
Comparison of Rank and Top Scaling
8 Rank scaling
Top scaling
7
5
Scaled value
0 5 10 15 20
Sorted individuals
Because top scaling restricts parents to the fittest individuals, it creates less diverse
populations than rank scaling. The following plot compares the variances of distances
between individuals at each generation using rank and top scaling.
5-88
Fitness Scaling
0.6
0.5
Variance
0.4
0.3
0.2
0.1
0
0 10 20 30 40 50 60 70 80 90 100
Generation
5-89
5 Using the Genetic Algorithm
• Scale controls the standard deviation of the mutation at the first generation, which is
Scale multiplied by the range of the initial population, which you specify by the
Initial range option.
• Shrink controls the rate at which the average amount of mutation decreases. The
standard deviation decreases linearly so that its final value equals 1 – Shrink times
its initial value at the first generation. For example, if Shrink has the default value
of 1, then the amount of mutation decreases to 0 at the final step.
You can see the effect of mutation by selecting the plot options Distance and Range,
and then running the genetic algorithm on a problem such as the one described in
“Minimize Rastrigin's Function” on page 5-7. The following figure shows the plot after
setting the random number generator.
rng default % for reproducibility
options = optimoptions('ga','PlotFcn',{@gaplotdistance,@gaplotrange},...
'MaxStallGenerations',200); % to get a long run
[x,fval] = ga(@rastriginsfcn,2,[],[],[],[],[],[],[],options);
5-90
Vary Mutation and Crossover
The upper plot displays the average distance between points in each generation. As the
amount of mutation decreases, so does the average distance between individuals, which
is approximately 0 at the final generation. The lower plot displays a vertical line at each
generation, showing the range from the smallest to the largest fitness value, as well as
mean fitness value. As the amount of mutation decreases, so does the range. These plots
show that reducing the amount of mutation decreases the diversity of subsequent
generations.
For comparison, the following figure shows the plots for Distance and Range when you
set Shrink to 0.5.
options = optimoptions('ga',options,'MutationFcn',{@mutationgaussian,1,.5});
[x,fval] = ga(@rastriginsfcn,2,[],[],[],[],[],[],[],options);
5-91
5 Using the Genetic Algorithm
With Shrink set to 0.5, the average amount of mutation decreases by a factor of 1/2 by
the final generation. As a result, the average distance between individuals decreases less
than before.
5-92
Vary Mutation and Crossover
The example uses the fitness function whose value at a point is the sum of the absolute
values of the coordinates at the points. That is,
f ( x1, x2 ,..., xn ) = x1 + x2 + L + xn .
You can define this function as an anonymous function by setting Fitness function to
@(x) sum(abs(x))
Run the example with the default value of 0.8 for Crossover fraction, in the Options
> Reproduction pane. For reproducibility, switch to the command line and enter
rng(14,'twister')
Switch back to Optimization app, and click Run solver and view results > Start. This
returns the best fitness value of approximately 0.0799 and displays the following plots.
5-93
5 Using the Genetic Algorithm
To see how the genetic algorithm performs when there is no mutation, set Crossover
fraction to 1.0 and click Start. This returns the best fitness value of approximately .66
and displays the following plots.
5-94
Vary Mutation and Crossover
In this case, the algorithm selects genes from the individuals in the initial population
and recombines them. The algorithm cannot create any new genes because there is no
mutation. The algorithm generates the best individual that it can using these genes at
generation number 8, where the best fitness plot becomes level. After this, it creates new
copies of the best individual, which are then are selected for the next generation. By
generation number 17, all individuals in the population are the same, namely, the best
individual. When this occurs, the average distance between individuals is 0. Since the
algorithm cannot improve the best fitness value after generation 8, it stalls after 50 more
generations, because Stall generations is set to 50.
5-95
5 Using the Genetic Algorithm
To see how the genetic algorithm performs when there is no crossover, set Crossover
fraction to 0 and click Start. This returns the best fitness value of approximately 3 and
displays the following plots.
In this case, the random changes that the algorithm applies never improve the fitness
value of the best individual at the first generation. While it improves the individual
genes of other individuals, as you can see in the upper plot by the decrease in the mean
value of the fitness function, these improved genes are never combined with the genes of
the best individual because there is no crossover. As a result, the best fitness plot is level
and the algorithm stalls at generation number 50.
5-96
Vary Mutation and Crossover
at the MATLAB prompt. When the example is finished, the plots appear as in the
following figure.
5-97
5 Using the Genetic Algorithm
The lower plot shows the means and standard deviations of the best fitness values over
10 generations, for each of the values of the crossover fraction. The upper plot shows a
color-coded display of the best fitness values in each generation.
For this fitness function, setting Crossover fraction to 0.8 yields the best result.
However, for another fitness function, a different setting for Crossover fraction might
yield the best result.
5-98
Global vs. Local Minima Using ga
5-99
5 Using the Genetic Algorithm
The function has two local minima, one at x = 0, where the function value is –1, and the
other at x = 101, where the function value is –1 – 1/e. Since the latter value is smaller,
the global minimum occurs at x = 101.
5-100
Global vs. Local Minima Using ga
1 Copy and paste the following code into a new file in the MATLAB Editor.
function y = two_min(x)
if x <= 100
y = -exp(-(x/100).^2);
else
y = -exp(-1) + (x-100)*(x-102);
end
2 Save the file as two_min.m in a folder on the MATLAB path.
3 In the Optimization app,
The genetic algorithm returns a point very close to the local minimum at x = 0.
The following custom plot shows why the algorithm finds the local minimum rather than
the global minimum. The plot shows the range of individuals in each generation and the
population mean.
5-101
5 Using the Genetic Algorithm
5-102
Global vs. Local Minima Using ga
switch flag
case 'init'
set(gca,'xlim',[1,options.MaxGenerations+1]);
plotRange = errorbar(generation,Y,L,U);
set(plotRange,'Tag','gaplot1drange');
title('Range of Population, Mean','interp','none')
xlabel('Generation','interp','none')
case 'iter'
plotRange = findobj(get(gca,'Children'),'Tag','gaplot1drange');
newX = [get(plotRange,'Xdata') generation];
newY = [get(plotRange,'Ydata') Y];
newL = [get(plotRange,'Ldata') L];
newU = [get(plotRange,'Udata') U];
set(plotRange,'Xdata',newX,'Ydata',newY,'Ldata',newL,'Udata',newU);
end
Note that all individuals lie between –70 and 70. The population never explores points
near the global minimum at x = 101.
One way to make the genetic algorithm explore a wider range of points—that is, to
increase the diversity of the populations—is to increase the Initial range. The Initial
range does not have to include the point x = 101, but it must be large enough so that the
algorithm generates individuals near x = 101. Set Initial range to [-10;90] as shown
in the following figure.
5-103
5 Using the Genetic Algorithm
Then click Start. The genetic algorithm returns a point very close to 101.
This time, the custom plot shows a much wider range of individuals. There are
individuals near 101 from early on, and the population mean begins to converge to 101.
5-104
Global vs. Local Minima Using ga
5-105
5 Using the Genetic Algorithm
The example finds the minimum of Rosenbrock’s function, also known as De Jong’s
second function, which is defined by
2
(
f ( x1 , x2 ) = 100 x2 - x12 ) + (1 - x1 ) 2.
3000
2500
2000
1500
1000
500
0
3
2 2
1 1
0
0 −1
−1 −2
Minimum at (1,1)
5-106
Include a Hybrid Function
To see a worked example of a hybrid function, see “Hybrid Scheme in the Genetic
Algorithm”.
Then enter optimtool('ga') to open the Optimization app to the ga solver. Enter the
following settings:
Before adding a hybrid function, click Start to run the genetic algorithm by itself. The
genetic algorithm displays the following results in the Run solver and view results
pane:
The final point is somewhat close to the true minimum at (1, 1). You can improve this
result by setting Hybrid function to fminunc (in the Hybrid function options).
5-107
5 Using the Genetic Algorithm
fminunc uses the final point of the genetic algorithm as its initial point. It returns a
more accurate result, as shown in the Run solver and view results pane.
Specify nondefault options for the hybrid function by creating options at the command
line. Use optimset for fminsearch, or optimoptions for fmincon, patternsearch
or fminunc. For example:
hybridopts = optimoptions('fminunc','Display','iter','Algorithm','quasi-newton');
In the Optimization app enter the name of your options in the Options box under
Hybrid function:
5-108
Include a Hybrid Function
5-109
5 Using the Genetic Algorithm
Run the genetic algorithm for approximately 300 generations and click Stop. The
following figure shows the resulting best fitness plot after 300 generations.
5-110
Set Maximum Number of Generations
90
80
70
Fitness value
60
50
40
30
20
10
0
50 100 150 200 250 300
Generation
Note that the algorithm stalls at approximately generation number 170—that is, there is
no immediate improvement in the fitness function after generation 170. If you restore
Stall generations to its default value of 50, the algorithm could terminate at
approximately generation number 220. If the genetic algorithm stalls repeatedly with the
current setting for Generations, you can try increasing both the Generations and
Stall generations options to improve your results. However, changing other options
might be more effective.
5-111
5 Using the Genetic Algorithm
• Write the file that computes the function so that it accepts a matrix with arbitrarily
many rows, corresponding to the individuals in the population. For example, to
vectorize the function
f ( x1 , x2 ) = x12 - 2 x1 x2 + 6 x1 + x22 - 6 x2
The colon in the first entry of x indicates all the rows of x, so that x(:, 1) is a
vector. The .^ and .* operators perform elementwise operations on the vectors.
• At the command line, set the UseVectorized option to true using optimoptions.
• In the Optimization app, set User function evaluation > Evaluate fitness and
constraint functions to vectorized.
Note The fitness function, and any nonlinear constraint function, must accept an
arbitrary number of rows to use the Vectorize option. ga sometimes evaluates a single
row even during a vectorized calculation.
The following comparison, run at the command line, shows the improvement in speed
with vectorization.
options = optimoptions('ga','PopulationSize',2000);
tic;ga(@rastriginsfcn,20,[],[],[],[],[],[],[],options);toc
5-112
Vectorize the Fitness Function
options = optimoptions(options,'UseVectorized',true);
tic;
ga(@rastriginsfcn,20,[],[],[],[],[],[],[],options);
toc
Optimization terminated: maximum number of generations exceeded.
Elapsed time is 1.860655 seconds.
Vectorized Constraints
If there are nonlinear constraints, the objective function and the nonlinear constraints all
need to be vectorized in order for the algorithm to compute in a vectorized manner.
“Vectorize the Objective and Constraint Functions” on page 4-81 contains an example of
how to vectorize both for the solver patternsearch. The syntax is nearly identical for
ga. The only difference is that patternsearch can have its patterns appear as either
row or column vectors; the corresponding vectors for ga are the population vectors, which
are always rows.
5-113
5 Using the Genetic Algorithm
Begin by creating the fitness and constraint functions. First, create a file named
simple_fitness.m as follows:
function y = simple_fitness(x)
y = 100*(x(1)^2 - x(2))^2 + (1 - x(1))^2;
The genetic algorithm function, ga, assumes the fitness function will take one input x,
where x has as many elements as the number of variables in the problem. The fitness
function computes the value of the function and returns that scalar value in its one
return argument, y.
The ga function assumes the constraint function will take one input x, where x has as
many elements as the number of variables in the problem. The constraint function
computes the values of all the inequality and equality constraints and returns two
vectors, c and ceq, respectively.
To minimize the fitness function, you need to pass a function handle to the fitness
function as the first argument to the ga function, as well as specifying the number of
variables as the second argument. Lower and upper bounds are provided as LB and UB
5-114
Constrained Minimization Using ga
respectively. In addition, you also need to pass a function handle to the nonlinear
constraint function.
ObjectiveFunction = @simple_fitness;
nvars = 2; % Number of variables
LB = [0 0]; % Lower bound
UB = [1 13]; % Upper bound
ConstraintFunction = @simple_constraint;
rng(1,'twister') % for reproducibility
[x,fval] = ga(ObjectiveFunction,nvars,...
[],[],[],[],LB,UB,ConstraintFunction)
Optimization terminated: average change in the fitness value less than options.Function
and constraint violation is less than options.ConstraintTolerance.
x =
0.8123 12.3137
fval =
1.3581e+04
For problems without integer constraints, the genetic algorithm solver handles linear
constraints and bounds differently from nonlinear constraints. All the linear constraints
and bounds are satisfied throughout the optimization. However, ga may not satisfy all
the nonlinear constraints at every generation. If ga converges to a solution, the nonlinear
constraints will be satisfied at that solution.
If there are integer constraints, ga does not enforce the feasibility of linear constraints,
and instead adds any linear constraint violations to the penalty function. See “Integer ga
Algorithm” on page 5-37.
ga uses the mutation and crossover functions to produce new individuals at every
generation. ga satisfies linear and bound constraints by using mutation and crossover
functions that only generate feasible points. For example, in the previous call to ga, the
mutation function mutationguassian does not necessarily obey the bound constraints.
So when there are bound or linear constraints, the default ga mutation function is
mutationadaptfeasible. If you provide a custom mutation function, this custom
function must only generate points that are feasible with respect to the linear and bound
constraints. All the included crossover functions generate points that satisfy the linear
constraints and bounds except the crossoverheuristic function.
5-115
5 Using the Genetic Algorithm
To see the progress of the optimization, use the optimoptions function to create options
that select two plot functions. The first plot function is gaplotbestf, which plots the
best and mean score of the population at every generation. The second plot function is
gaplotmaxconstr, which plots the maximum constraint violation of nonlinear
constraints at every generation. You can also visualize the progress of the algorithm by
displaying information to the command window using the 'Display' option.
options = optimoptions('ga','PlotFcn',{@gaplotbestf,@gaplotmaxconstr},'Display','iter')
[x,fval] = ga(ObjectiveFunction,nvars,[],[],[],[],...
LB,UB,ConstraintFunction,options)
Best Max Stall
Generation Func-count f(x) Constraint Generations
1 2670 13603.6 0 0
2 5282 13578.2 5.718e-06 0
3 7994 14033.9 0 0
4 11794 13573.7 0.0009577 0
Optimization terminated: average change in the fitness value less than options.Function
and constraint violation is less than options.ConstraintTolerance.
x =
0.8122 12.3104
fval =
1.3574e+04
5-116
Constrained Minimization Using ga
You can provide a start point for the minimization to the ga function by specifying the
InitialPopulationMatrix option. The ga function will use the first individual from
InitialPopulationMatrix as a start point for a constrained minimization.
X0 = [0.5 0.5]; % Start point (row vector)
options = optimoptions('ga',options,'InitialPopulationMatrix',X0);
5-117
5 Using the Genetic Algorithm
x =
0.8122 12.3103
fval =
1.3573e+04
5-118
Custom Output Function for Genetic Algorithm
• Plot the range of the first two components of the population as a rectangle. The left
and lower sides of the rectangle are at the minima of x(1) and x(2) respectively, and
the right and upper sides are at the respective maxima.
• Halt the iterations when the best function value drops below 0.1 (the minimum value
of the objective function is 0).
• Record the entire population in a variable named gapopulationhistory in your
MATLAB® workspace every 10 generations.
• Modify the initial crossover fraction to the custom value 0.2, and then update it back
to the default 0.8 after 25 generations. The initial setting of 0.2 causes the first
several iterations to search primarily at random via mutation. The later setting of
0.8 causes the following iterations to search primarily via combinations of existing
population members.
Objective Function
The objective function is for four-dimensional x whose first two components are integer-
valued.
function f = gaintobj(x)
f = rastriginsfcn([x(1)-6 x(2)-13]);
f = f + rastriginsfcn([x(3)-3*pi x(4)-5*pi]);
Output Function
The custom output function sets up the plot during initialization, and maintains the plot
during iterations. The output function also pauses the iterations for 0.1s so you can see
the plot as it develops.
5-119
5 Using the Genetic Algorithm
h1 = figure;
ax = gca;
ax.XLim = [0 21];
ax.YLim = [0 21];
l1 = min(state.Population(:,1));
m1 = max(state.Population(:,1));
l2 = min(state.Population(:,2));
m2 = max(state.Population(:,2));
r = rectangle(ax,'Position',[l1 l2 m1-l1 m2-l2]);
history(:,:,1) = state.Population;
assignin('base','gapopulationhistory',history);
case 'iter'
% Update the history every 10 generations.
if rem(state.Generation,10) == 0
ss = size(history,3);
history(:,:,ss+1) = state.Population;
assignin('base','gapopulationhistory',history);
end
% Find the best objective function, and stop if it is low.
ibest = state.Best(end);
ibest = find(state.Score == ibest,1,'last');
bestx = state.Population(ibest,:);
bestf = gaintobj(bestx);
if bestf <= 0.1
state.StopFlag = 'y';
disp('Got below 0.1')
end
% Update the plot.
figure(h1)
l1 = min(state.Population(:,1));
m1 = max(state.Population(:,1));
l2 = min(state.Population(:,2));
m2 = max(state.Population(:,2));
r.Position = [l1 l2 m1-l1 m2-l2];
pause(0.1)
% Update the fraction of mutation and crossover after 25 generations.
if state.Generation == 25
options.CrossoverFraction = 0.8;
optchanged = true;
end
case 'done'
% Include the final population in the history.
ss = size(history,3);
history(:,:,ss+1) = state.Population;
5-120
Custom Output Function for Genetic Algorithm
assignin('base','gapopulationhistory',history);
end
Set options to call the custom output function, and to initially have little crossover.
options = optimoptions('ga','OutputFcn',@gaoutfun,'CrossoverFraction',0.2);
x =
fval =
0.0098
5-121
5 Using the Genetic Algorithm
40 4 7
5-122
See Also
There are seven records of the 40-by-4 population matrix (40 individuals, each a 4-
element row vector).
See Also
Related Examples
• “Create Custom Plot Function” on page 5-54
• “Output Function Options” on page 10-54
5-123
6
The inspiration for the algorithm is flocks of birds or insects swarming. Each particle is
attracted to some degree to the best location it has found so far, and also to the best
location any member of the swarm has found. After some steps, the population can
coalesce around one location, or can coalesce around a few locations, or can continue to
move.
See Also
Related Examples
• “Optimize Using Particle Swarm” on page 6-3
More About
• “Particle Swarm Optimization Algorithm” on page 6-11
6-2
Optimize Using Particle Swarm
The objective function in this example is De Jong’s fifth function, which is included with
Global Optimization Toolbox software.
dejong5fcn
Try to find the minimum of the function using the default particleswarm settings.
6-3
6 Particle Swarm Optimization
fun = @dejong5fcn;
nvars = 2;
rng default % For reproducibility
[x,fval,exitflag] = particleswarm(fun,nvars)
x =
-31.9521 -16.0176
fval =
5.9288
exitflag =
Is the solution x the global optimum? It is unclear at this point. Looking at the function
plot shows that the function has local minima for components in the range [-50,50]. So
restricting the range of the variables to [-50,50] helps the solver locate a global
minimum.
lb = [-50;-50];
ub = -lb;
[x,fval,exitflag] = particleswarm(fun,nvars,lb,ub)
x =
-16.0079 -31.9697
fval =
1.9920
6-4
Optimize Using Particle Swarm
exitflag =
This looks promising: the new solution has lower fval than the previous one. But is x
truly a global solution? Try minimizing again with more particles, to better search the
region.
options = optimoptions('particleswarm','SwarmSize',100);
[x,fval,exitflag] = particleswarm(fun,nvars,lb,ub,options)
x =
-31.9781 -31.9784
fval =
0.9980
exitflag =
This looks even more promising. But is this answer a global solution, and how accurate is
it? Rerun the solver with a hybrid function. particleswarm calls the hybrid function
after particleswarm finishes its iterations.
options.HybridFcn = @fmincon;
[x,fval,exitflag] = particleswarm(fun,nvars,lb,ub,options)
x =
-31.9783 -31.9784
6-5
6 Particle Swarm Optimization
fval =
0.9980
exitflag =
particleswarm found essentially the same solution as before. This gives you some
confidence that particleswarm reports a local minimum and that the final x is the
global solution.
See Also
More About
• “What Is Particle Swarm Optimization?” on page 6-2
• “Particle Swarm Optimization Algorithm” on page 6-11
6-6
Particle Swarm Output Function
An output function runs after each iteration of the solver. For syntax details, and for the
data available to an output function, see the particleswarm options reference pages.
This output function draws a plot with one line per dimension. Each line represents the
range of the particles in the swarm in that dimension. The plot is log-scaled to
accommodate wide ranges. If the swarm converges to a single point, then the range of
each dimension goes to zero. But if the swarm does not converge to a single point, then
the range stays away from zero in some dimensions.
Copy the following code into a file named pswplotranges.m on your MATLAB® path.
The code sets up nplot subplots, where nplot is the number of dimensions in the
problem.
6-7
6 Particle Swarm Optimization
tag = sprintf('psoplotrange_var_%g',i);
plotHandle = findobj(get(gca,'Children'),'Tag',tag); % Get the subplot
xdata = plotHandle.XData; % Get the X data from the plot
newX = [xdata optimValues.iteration]; % Add the new iteration
plotHandle.XData = newX; % Put the X data into the plot
ydata = plotHandle.YData; % Get the Y data from the plot
newY = [ydata irange]; % Add the new value
plotHandle.YData = newY; % Put the Y data into the plot
end
if toc(getappdata(gcf,'t0')) > 1/30 % If 1/30 s has passed
drawnow % Show the plot
setappdata(gcf,'t0',tic); % Reset the timer
end
case 'done'
% No cleanup necessary
end
Objective Function
function F = multirosenbrock(x)
% This function is a multidimensional generalization of Rosenbrock's
% function. It operates in a vectorized manner, assuming that x is a matrix
% whose rows are the individuals.
odds = 1:2:N-1;
evens = 2:2:N;
F = zeros(size(x));
F(:,odds) = 1-x(:,odds);
F(:,evens) = 10*(x(:,evens)-x(:,odds).^2);
F = sum(F.^2,2);
6-8
Particle Swarm Output Function
Set the multirosenbrock function as the objective function. User four variables. Set a
lower bound of -10 and an upper bound of 10 on each variable.
fun = @multirosenbrock;
nvar = 4; % A 4-D problem
lb = -10*ones(nvar,1); % Bounds to help the solver converge
ub = -lb;
Set the random number generator to get reproducible output. Then call the solver.
rng default % For reproducibility
[x,fval,eflag] = particleswarm(fun,nvar,lb,ub,options)
x =
fval =
3.4935e-04
eflag =
6-9
6 Particle Swarm Optimization
Results
The solver returned a point near the optimum [1,1,1,1]. But the span of the swarm
did not converge to zero.
See Also
More About
• “Output Function and Plot Function” on page 10-63
6-10
Particle Swarm Optimization Algorithm
Algorithm Outline
particleswarm is based on the algorithm described in Kennedy and Eberhart [1], using
modifications suggested in Mezura-Montes and Coello Coello [2] and in Pedersen [3].
The particle swarm algorithm begins by creating the initial particles, and assigning them
initial velocities.
It evaluates the objective function at each particle location, and determines the best
(lowest) function value and the best location.
It chooses new velocities, based on the current velocity, the particles’ individual best
locations, and the best locations of their neighbors.
It then iteratively updates the particle locations (the new location is the old one plus the
velocity, modified to keep particles within bounds), velocities, and neighbors.
Initialization
By default, particleswarm creates particles at random uniformly within bounds. If
there is an unbounded component, particleswarm creates particles with a random
uniform distribution from –1000 to 1000. If you have only one bound, particleswarm
shifts the creation to have the bound as an endpoint, and a creation interval 2000 wide.
Particle i has position x(i), which is a row vector with nvars elements. Control the
span of the initial swarm using the InitialSwarmSpan option.
6-11
6 Particle Swarm Optimization
particleswarm evaluates the objective function at all particles. It records the current
position p(i) of each particle i. In subsequent iterations, p(i) will be the location of the
best objective function that particle i has found. And b is the best over all particles: b =
min(fun(p(i))). d is the location such that b = fun(d).
Iteration Steps
The algorithm updates the swarm as follows. For particle i, which is at position x(i):
6-12
Particle Swarm Optimization Algorithm
• The difference between the current position and the best position in the current
neighborhood g-x
4 Update the position x = x + v.
5 Enforce the bounds. If any component of x is outside a bound, set it equal to that
bound.
6 Evaluate the objective function f = fun(x).
7 If f < fun(p), then set p = x. This step ensures p has the best position the
particle has seen.
8 If f < b, then set b = f and d = x. This step ensures b has the best objective
function in the swarm, and d has the best location.
9 If, in the previous step, the best function value was lowered, then set flag = true.
Otherwise, flag = false. The value of flag is used in the next step.
10 Update the neighborhood. If flag = true:
a Set c = max(0,c-1).
b Set N to minNeighborhoodSize.
c If c < 2, then set W = 2*W.
d If c > 5, then set W = W/2.
e Ensure that W is in the bounds of the InertiaRange option.
If flag = false:
a Set c = c+1.
b Set N = min(N + minNeighborhoodSize,SwarmSize).
Stopping Criteria
particleswarm iterates until it reaches a stopping criterion.
6-13
6 Particle Swarm Optimization
If particleswarm stops with exit flag 1, it optionally calls a hybrid function after it
exits.
References
[1] Kennedy, J., and R. Eberhart. "Particle Swarm Optimization." Proceedings of the
IEEE International Conference on Neural Networks. Perth, Australia, 1995, pp.
1942–1945.
6-14
7
7-2
Minimize Function with Many Local Minima
Description
This section presents an example that shows how to find a local minimum of a function
using simulated annealing.
De Jong's fifth function is a two-dimensional function with many (25) local minima:
dejong5fcn
7-3
7 Using Simulated Annealing
Many standard optimization algorithms get stuck in local minima. Because the
simulated annealing algorithm performs a wide random search, the chance of being
trapped in local minima is decreased.
Note Because simulated annealing uses random number generators, each time you run
this algorithm you can get different results. See “Reproduce Your Results” on page 7-12
for more information.
7-4
Minimize Function with Many Local Minima
This returns
Optimization terminated: change in best function value less than options.FunctionTolera
x =
-16.1292 -15.8214
fval =
6.9034
where
7-5
7 Using Simulated Annealing
Your results can differ from the pictured ones, because simulannealbnd uses a
random number stream.
7-6
Simulated Annealing Terminology
Objective Function
The objective function is the function you want to optimize. Global Optimization Toolbox
algorithms attempt to find the minimum of the objective function. Write the objective
function as a file or anonymous function, and pass it to the solver as a function handle.
For more information, see “Compute Objective Functions” on page 2-2 and “Create
Function Handle” (MATLAB).
Temperature
The temperature is a parameter in simulated annealing that affects two aspects of the
algorithm:
• The distance of a trial point from the current point (See “Outline of the Algorithm” on
page 7-9, Step 1 on page 7-9.)
• The probability of accepting a trial point with higher objective function value (See
“Outline of the Algorithm” on page 7-9, Step 2 on page 7-9.)
Temperature can be a vector with different values for each component of the current
point. Typically, the initial temperature is a scalar.
Temperature decreases gradually as the algorithm proceeds. You can specify the initial
temperature as a positive scalar or vector in the InitialTemperature option. You can
specify the temperature as a function of iteration number as a function handle in the
TemperatureFcn option. The temperature is a function of the “Annealing Parameter” on
page 7-8, which is a proxy for the iteration number. The slower the rate of temperature
decrease, the better the chances are of finding an optimal solution, but the longer the run
time. For a list of built-in temperature functions and the syntax of a custom temperature
function, see “Temperature Options” on page 10-69.
7-7
7 Using Simulated Annealing
Annealing Parameter
The annealing parameter is a proxy for the iteration number. The algorithm can raise
temperature by setting the annealing parameter to a lower value than the current
iteration. (See “Reannealing” on page 7-8.) You can specify the temperature schedule
as a function handle with the TemperatureFcn option.
Reannealing
Annealing is the technique of closely controlling the temperature when cooling a material
to ensure that it reaches an optimal state. Reannealing raises the temperature after the
algorithm accepts a certain number of new points, and starts the search again at the
higher temperature. Reannealing avoids the algorithm getting caught at local minima.
Specify the reannealing schedule with the ReannealInterval option.
7-8
How Simulated Annealing Works
After generating the trial point, the algorithm shifts it, if necessary, to stay within
bounds. The algorithm shifts each infeasible component of the trial point to a value
chosen uniformly at random between the violated bound and the (feasible) value at
the previous iteration.
2 The algorithm determines whether the new point is better or worse than the current
point. If the new point is better than the current point, it becomes the next point. If
the new point is worse than the current point, the algorithm can still make it the
next point. The algorithm accepts a worse point based on an acceptance function.
Choose the acceptance function with the AcceptanceFcn option. Choices:
7-9
7 Using Simulated Annealing
where
Δ = new objective – old objective.
T0 = initial temperature of component i
T = the current temperature.
Since both Δ and T are positive, the probability of acceptance is between 0 and
1/2. Smaller temperature leads to smaller acceptance probability. Also, larger Δ
leads to smaller acceptance probability.
• @myfun — Custom acceptance function, myfun. For custom acceptance function
syntax, see “Algorithm Settings” on page 10-70.
3 The algorithm systematically lowers the temperature, storing the best point found so
far. The TemperatureFcn option specifies the function the algorithm uses to update
the temperature. Let k denote the annealing parameter. (The annealing parameter
is the same as the iteration number until reannealing.) Options:
where
ki = annealing parameter for component i.
T0 = initial temperature of component i.
Ti = current temperature of component i.
si = gradient of objective in direction i times difference of bounds in direction i.
simulannealbnd safeguards the annealing parameter values against Inf and other
improper values.
7-10
How Simulated Annealing Works
5 The algorithm stops when the average change in the objective function is small
relative to FunctionTolerance, or when it reaches any other stopping criterion.
See “Stopping Conditions for the Algorithm” on page 7-11.
• FunctionTolerance — The algorithm runs until the average change in value of the
objective function in StallIterLim iterations is less than the value of
FunctionTolerance. The default value is 1e-6.
• MaxIterations — The algorithm stops when the number of iterations exceeds this
maximum number of iterations. You can specify the maximum number of iterations
as a positive integer or Inf. The default value is Inf.
• MaxFunctionEvaluations specifies the maximum number of evaluations of the
objective function. The algorithm stops if the number of function evaluations exceeds
the value of MaxFunctionEvaluations. The default value is
3000*numberofvariables.
• MaxTime specifies the maximum time in seconds the algorithm runs before stopping.
The default value is Inf.
• ObjectiveLimit — The algorithm stops when the best objective function value is
less than or equal to the value of ObjectiveLimit. The default value is -Inf.
Bibliography
[1] Ingber, L. Adaptive simulated annealing (ASA): Lessons learned. Invited paper to a
special issue of the Polish Journal Control and Cybernetics on “Simulated
Annealing Applied to Combinatorial Optimization.” 1995. Available from
http://www.ingber.com/asa96_lessons.ps.gz
7-11
7 Using Simulated Annealing
If you need to reproduce your results exactly, call simulannealbnd with the output
argument. The output structure contains the current random number generator state in
the output.rngstate field. Reset the state before running the function again.
x =
-16.1292 -15.8214
fval =
6.9034
If you now run simulannealbnd a second time, you get the same results.
You can reproduce your run in the Optimization app by checking the box Use random
states from previous run in the Run solver and view results section.
7-12
Reproduce Your Results
Note If you do not need to reproduce your results, it is better not to set the states of
RandStream, so that you get the benefit of the randomness in these algorithms.
7-13
7 Using Simulated Annealing
The above function is known as 'cam' as described in L.C.W. Dixon and G.P. Szego (eds.),
Towards Global Optimisation 2, North-Holland, Amsterdam, 1978.
We create a MATLAB file named simple_objective.m with the following code in it:
function y = simple_objective(x)
y = (4 - 2.1*x(1)^2 + x(1)^4/3)*x(1)^2 + x(1)*x(2) + ...
(-4 + 4*x(2)^2)*x(2)^2;
The Simulated Annealing solver assumes the objective function will take one input x
where x has as many elements as the number of variables in the problem. The objective
function computes the scalar value of the objective and returns it in its single return
argument y.
x =
-0.0896 0.7130
7-14
Minimization Using Simulated Annealing Algorithm
fval =
-1.0316
exitFlag =
output =
iterations: 2948
funccount: 2971
message: 'Optimization terminated: change in best function value less than opti
rngstate: [1x1 struct]
problemtype: 'unconstrained'
temperature: [2x1 double]
totaltime: 1.7748
The first two output arguments returned by SIMULANNEALBND are x, the best point
found, and fval, the function value at the best point. A third output argument,
exitFlag returns a flag corresponding to the reason SIMULANNEALBND stopped.
SIMULANNEALBND can also return a fourth argument, output, which contains
information about the performance of the solver.
SIMULANNEALBND can be used to solve problems with bound constraints. The lower
and upper bounds are passed to the solver as vectors. For each dimension i, the solver
ensures that lb(i) <= x(i) <= ub(i), where x is a point selected by the solver during
simulation. We impose the bounds on our problem by specifying a range -64 <= x(i) <= 64
for x(i).
lb = [-64 -64];
ub = [64 64];
Now, we can rerun the solver with lower and upper bounds as input arguments.
7-15
7 Using Simulated Annealing
[x,fval,exitFlag,output] = simulannealbnd(ObjectiveFunction,X0,lb,ub);
a, b, and c are parameters to the objective function that act as constants during the
optimization (they are not varied as part of the minimization). One can create a
MATLAB file called parameterized_objective.m containing the following code.
function y = parameterized_objective(x,a,b,c)
y = (a - b*x(1)^2 + x(1)^4/3)*x(1)^2 + x(1)*x(2) + ...
(-c + c*x(2)^2)*x(2)^2;
7-16
Minimization Using Simulated Annealing Algorithm
Again, we need to pass in a function handle to the objective function as well as a start
point as the second argument.
SIMULANNEALBND will call our objective function with just one argument x, but our
objective function has four arguments: x, a, b, c. We can use an anonymous function to
capture the values of the additional arguments, the constants a, b, and c. We create a
function handle 'ObjectiveFunction' to an anonymous function that takes one input x, but
calls 'parameterized_objective' with x, a, b and c. The variables a, b, and c have values
when the function handle 'ObjectiveFunction' is created, so these values are captured by
the anonymous function.
a = 4; b = 2.1; c = 4; % define constant values
ObjectiveFunction = @(x) parameterized_objective(x,a,b,c);
X0 = [0.5 0.5];
[x,fval] = simulannealbnd(ObjectiveFunction,X0)
x =
0.0898 -0.7127
fval =
-1.0316
7-17
8
Multiobjective Optimization
Multiobjective Theory
You might need to formulate problems with more than one objective, since a single
objective with several constraints may not adequately represent the problem being faced.
If so, there is a vector of objectives,
that must be traded off in some way. The relative importance of these objectives is not
generally known until the system's best capabilities are determined and tradeoffs
between the objectives fully understood. As the number of objectives increases, tradeoffs
are likely to become complex and less easily quantified. The designer must rely on his or
her intuition and ability to express preferences throughout the optimization cycle. Thus,
requirements for a multiobjective design strategy must enable a natural problem
formulation to be expressed, and be able to solve the problem and enter preferences into
a numerically tractable and realistic design problem.
Note that because F(x) is a vector, if any of the components of F(x) are competing, there is
no unique solution to this problem. Instead, the concept of noninferiority in Zadeh [4]
(also called Pareto optimality in Censor [1] and Da Cunha and Polak [2]) must be used to
characterize the objectives. A noninferior solution is one in which an improvement in one
objective requires a degradation of another. To define this concept more precisely,
consider a feasible region, Ω, in the parameter space. x is an element of the n-
{
W = x Œ Rn , }
8-2
What Is Multiobjective Optimization?
subject to
Gi ( x) = 0, i = 1,..., ke ,
Gi ( x) £ 0 , i = ke + 1,..., k,
l £ x £ u.
This allows definition of the corresponding feasible region for the objective function space
Λ:
{
L = y Œ R m : y = F( x), x Œ W . }
The performance vector F(x) maps parameter space into objective function space, as
represented in two dimensions in the figure “Figure 8-1, Mapping from Parameter Space
into Objective Function Space” on page 8-3.
Figure 8-1, Mapping from Parameter Space into Objective Function Space
8-3
8 Multiobjective Optimization
A and B are clearly noninferior solution points because an improvement in one objective,
F1, requires a degradation in the other objective, F2, i.e., F1B < F1A, F2B > F2A.
Since any point in Ω that is an inferior point represents a point in which improvement
can be attained in all the objectives, it is clear that such a point is of no value.
Multiobjective optimization is, therefore, concerned with the generation and selection of
noninferior solution points.
Noninferior solutions are also called Pareto optima. A general goal in multiobjective
optimization is constructing the Pareto optima. The algorithm used in gamultiobj is
described in Deb [3].
The fitness function for use in gamultiobj should return a vector of type double. The
population may be of type double, a bit string vector, or can be a custom-typed vector.
As in ga, if you use a custom population type, you must write your own creation,
mutation, and crossover functions that accept inputs of that population type, and specify
these functions in the following fields, respectively:
8-4
What Is Multiobjective Optimization?
You can set the initial population in a variety of ways. Suppose that you choose a
population of size m. (The default population size is 50 when the number of variables is
less than 6, and is 200 otherwise.) You can set the population:
8-5
8 Multiobjective Optimization
• gamultiobj uses only the 'penalty' algorithm for nonlinear constraints. See
“Nonlinear Constraint Solver Algorithms” on page 5-51.
• gamultiobj takes an option DistanceMeasureFcn, a function that assigns a
distance measure to each individual with respect to its neighbors.
• gamultiobj takes an option ParetoFraction, a number between 0 and 1 that
specifies the fraction of the population on the best Pareto frontier to be kept during
the optimization. If there is only one Pareto frontier, this option is ignored.
• gamultiobj uses only the Tournament selection function.
• gamultiobj uses elite individuals differently than ga. It sorts noninferior
individuals above inferior ones, so it uses elite individuals automatically.
• gamultiobj has only one hybrid function, fgoalattain.
• gamultiobj does not have a stall time limit.
• gamultiobj has different plot functions available.
• gamultiobj does not have a choice of scaling function.
8-6
Pareto Front for Two Objectives
Create this function file before proceeding, and store it as mymulti1.m on your MATLAB
path.
8-7
8 Multiobjective Optimization
8-8
Pareto Front for Two Objectives
3 Run the optimization by clicking Start under Run solver and view results.
8-9
8 Multiobjective Optimization
This plot shows the tradeoff between the two components of f. It is plotted in objective
function space; see the figure “Figure 8-2, Set of Noninferior Solutions” on page 8-4.
The results of the optimization appear in the following table containing both objective
function values and the value of the variables.
8-10
Pareto Front for Two Objectives
You can sort the table by clicking a heading. Click the heading again to sort it in the
reverse order. The following figures show the result of clicking the heading f1.
8-11
8 Multiobjective Optimization
Alternate Views
There are other ways of regarding the problem. The following figure contains a plot of the
level curves of the two objective functions, the Pareto frontier calculated by gamultiobj
(boxes), and the x-values of the true Pareto frontier (diamonds connected by a nearly-
straight line). The true Pareto frontier points are where the level curves of the objective
functions are parallel. They were calculated by finding where the gradients of the
objective functions are parallel. The figure is plotted in parameter space; see the figure
“Figure 8-1, Mapping from Parameter Space into Objective Function Space” on page 8-3.
8-12
Pareto Front for Two Objectives
gamultiobj found the ends of the line segment, meaning it found the full extent of the
Pareto frontier.
8-13
8 Multiobjective Optimization
Bibliography
[1] Censor, Y. “Pareto Optimality in Multiobjective Problems,” Appl. Math. Optimiz., Vol.
4, pp 41–59, 1977.
8-14
9
Parallel Processing
In this section...
“Parallel Processing Types in Global Optimization Toolbox” on page 9-2
“How Toolbox Functions Distribute Processes” on page 9-3
9-2
How Solvers Compute in Parallel
In addition, several solvers have hybrid functions that run after they finish. Some hybrid
functions can run in parallel. Also, most patternsearch search methods can run in
parallel. For more information, see “Parallel Search Functions or Hybrid Functions” on
page 9-15.
Solvers employ the Parallel Computing Toolbox parfor function to perform parallel
computations.
Note parfor does not work in parallel when called from within another parfor loop.
Suppose, for example, your objective function userfcn calls parfor, and you want to
call fmincon using MultiStart and parallel processing. Suppose also that the
9-3
9 Parallel Processing
conditions for parallel gradient evaluation of fmincon are satisfied, as given in “Parallel
Optimization Functionality” (Optimization Toolbox). The figure “When parfor Runs In
Parallel” on page 9-4 shows three cases:
1 The outermost loop is parallel MultiStart. Only that loop runs in parallel.
2 The outermost parfor loop is in fmincon. Only fmincon runs in parallel.
3 The outermost parfor loop is in userfcn. In this case, userfcn can use parfor in
parallel.
For patternsearch, if you select MADS as a poll or search method, parallel pattern
search does not have reproducible runs. If you select the genetic algorithm or Latin
hypercube as search methods, parallel pattern search does not have reproducible runs.
9-4
How Solvers Compute in Parallel
MultiStart is different. You can have reproducible runs from parallel MultiStart.
Runs are reproducible because MultiStart generates pseudorandom start points
locally, and then distributes the start points to parallel processors. Therefore, the
parallel processors do not use random numbers. For more details, see “Parallel
Processing and Random Number Streams” on page 3-77.
Limitations and Performance Considerations
For information on factors that affect the speed of parallel computations, and factors that
affect the results of parallel computations, see “Improving Performance with Parallel
Computing” (Optimization Toolbox). The same considerations apply to parallel
computing with Global Optimization Toolbox functions.
MultiStart
ms = MultiStart('UseParallel',true);
When these conditions hold, MultiStart distributes a problem and start points to
processes or processors one at a time. The algorithm halts when it reaches a stopping
condition or runs out of start points to distribute. If the MultiStart Display property
is 'iter', then MultiStart displays:
9-5
9 Parallel Processing
fmincon cannot estimate gradients in parallel when used with parallel MultiStart.
This lack of parallel gradient estimation is due to the limitation of parfor described in
“No Nested parfor Loops” on page 9-3.
fmincon can take longer to estimate gradients in parallel rather than in serial. In this
case, using MultiStart with parallel gradient estimation in fmincon amplifies the
slowdown. For example, suppose the ms MultiStart object has UseParallel set to
false. Suppose fmincon takes 1 s longer to solve problem with
problem.options.UseParallel set to true. Then run(ms,problem,200) takes 200
s longer than the same run with problem.options.UseParallel set to false
Note When executing serially, parfor loops run slower than for loops. Therefore, for
best performance, set your local solver UseParallel option to false when the
MultiStart UseParallel property is true.
Note Even when running in parallel, a solver occasionally calls the objective and
nonlinear constraint functions serially on the host machine. Therefore, ensure that your
functions have no assumptions about whether they are evaluated in serial and parallel.
GlobalSearch
GlobalSearch does not distribute a problem and start points to multiple processes or
processors. However, when GlobalSearch runs the fmincon local solver, fmincon can
estimate gradients by parallel finite differences. fmincon uses parallel computing when
you:
opts = optimoptions(@fmincon,'UseParallel',true,'Algorithm','sqp');
problem = createOptimProblem('fmincon','objective',@myobj,...
'x0',startpt,'options',opts);
9-6
How Solvers Compute in Parallel
For more details, see “Using Parallel Computing in Optimization Toolbox” (Optimization
Toolbox).
Pattern Search
• UseCompletePoll is true.
• UseVectorized is false (default).
• UseParallel is true.
When these conditions hold, the solver computes the objective function and constraint
values of the pattern search in parallel during a poll. Furthermore, patternsearch
overrides the setting of the Cache option, and uses the default 'off' setting.
Note Even when running in parallel, patternsearch occasionally calls the objective
and nonlinear constraint functions serially on the host machine. Therefore, ensure that
your functions have no assumptions about whether they are evaluated in serial or
parallel.
patternsearch can optionally call a search function at each iteration. The search is
parallel when you:
9-7
9 Parallel Processing
The limitations on patternsearch options, listed in “Pattern Search” on page 9-7, arise
partly from the limitations of parfor, and partly from the nature of parallel processing:
Genetic Algorithm
When these conditions hold, ga computes the objective function and nonlinear constraint
values of the individuals in a population in parallel.
Note Even when running in parallel, ga occasionally calls the fitness and nonlinear
constraint functions serially on the host machine. Therefore, ensure that your functions
have no assumptions about whether they are evaluated in serial or parallel.
9-8
How Solvers Compute in Parallel
The limitations on options, listed in “Genetic Algorithm” on page 9-8, arise partly from
limitations of parfor, and partly from the nature of parallel processing:
ga can have a hybrid function that runs after it finishes; see “Include a Hybrid Function”
on page 5-106. If you want the hybrid function to take advantage of parallel computation,
set its options separately so that UseParallel is true. If the hybrid function is
patternsearch, set UseCompletePoll to true so that patternsearch runs in
parallel.
If the hybrid function is fmincon, set the following options with optimoptions to have
parallel gradient estimation:
To find out how to write options for the hybrid function, see “Parallel Hybrid Functions”
on page 9-17.
Parallel computing with gamultiobj works almost the same as with ga. For detailed
information, see “Genetic Algorithm” on page 9-8.
The difference between parallel computing with gamultiobj and ga has to do with the
hybrid function. gamultiobj allows only one hybrid function, fgoalattain. This
function optionally runs after gamultiobj finishes its run. Each individual in the
calculated Pareto frontier, that is, the final population found by gamultiobj, becomes
the starting point for an optimization using fgoalattain. These optimizations run in
parallel. The number of processors performing these optimizations is the smaller of the
number of individuals and the size of your parpool.
9-9
9 Parallel Processing
Run gamultiobj with gaoptions, and fgoalattain runs in parallel. For more
information about setting the hybrid function, see “Hybrid Function Options” on page 10-
51.
When these conditions hold, particleswarm computes the objective function of the
particles in a population in parallel.
Note Even when running in parallel, particleswarm occasionally calls the objective
function serially on the host machine. Therefore, ensure that your objective function has
no assumptions about whether it is evaluated in serial or parallel.
The limitations on options, listed in “Particle Swarm” on page 9-10, arise partly from
limitations of parfor, and partly from the nature of parallel processing:
particleswarm can have a hybrid function that runs after it finishes; see “Include a
Hybrid Function” on page 5-106. If you want the hybrid function to take advantage of
9-10
How Solvers Compute in Parallel
parallel computation, set its options separately so that UseParallel is true. If the
hybrid function is patternsearch, set UseCompletePoll to true so that
patternsearch runs in parallel.
If the hybrid function is fmincon, set the GradObj option to 'off' or [] with
optimoptions to have parallel gradient estimation.
To find out how to write options for the hybrid function, see “Parallel Hybrid Functions”
on page 9-17.
Simulated Annealing
simulannealbnd does not run in parallel automatically. However, it can call hybrid
functions that take advantage of parallel computing. To find out how to write options for
the hybrid function, see “Parallel Hybrid Functions” on page 9-17.
9-11
9 Parallel Processing
Multicore Processors
If you have a multicore processor, you might see speedup using parallel processing. You
can establish a parallel pool of several workers with a Parallel Computing Toolbox
license. For a description of Parallel Computing Toolbox software, see “Getting Started
with Parallel Computing Toolbox” (Parallel Computing Toolbox).
Suppose you have a dual-core processor, and want to use parallel computing:
• Enter
parpool
at the command line. MATLAB starts a pool of workers using the multicore processor.
If you had previously set a nondefault cluster profile, you can enforce multicore (local)
computing:
parpool('local')
9-12
How to Use Parallel Processing
When you run an applicable solver with options, applicable solvers automatically use
parallel computing.
Processor Network
If you have multiple processors on a network, use Parallel Computing Toolbox functions
and MATLAB Distributed Computing Server™ software to establish parallel
computation. Here are the steps to take:
1 Make sure your system is configured properly for parallel computing. Check with
your systems administrator, or refer to the Parallel Computing Toolbox
documentation.
9-13
9 Parallel Processing
parpool(prof)
parpool('AttachedFiles',{'objfun.m','constrfun.m'});
pctRunOnAll('addpath network_file_path')
pctRunOnAll('which filename')
9-14
How to Use Parallel Processing
delete(gcp)
9-15
9 Parallel Processing
• UseCompleteSearch is true.
• The search method is not @searchneldermead or custom.
• If the search method is a patternsearch poll method or Latin hypercube search,
UseParallel is true. Set at the command line with optimoptions:
options = optimoptions('patternsearch','UseParallel',true,...
'UseCompleteSearch',true,'SearchFcn',@GPSPositiveBasis2N);
• If the search method is ga, the search method option has UseParallel set to true.
Set at the command line with optimoptions:
In the Optimization app, first create gaopt as above, and then use these settings:
9-16
How to Use Parallel Processing
For more information about search functions, see “Using a Search Method” on page 4-34.
ga, particleswarm, and simulannealbnd can have other solvers run after or
interspersed with their iterations. These other solvers are called hybrid functions. For
information on using a hybrid function with gamultiobj, see “Parallel Computing with
gamultiobj” on page 9-9. Both patternsearch and fmincon can be hybrid functions.
You can set options so that patternsearch runs in parallel, or fmincon estimates
gradients in parallel.
Set the options for the hybrid function as described in “Hybrid Function Options” on page
10-51 for ga, “Hybrid Function” on page 10-62 for particleswarm, or “Hybrid
Function Options” on page 10-72 for simulannealbnd. To summarize:
hybridopts = optimoptions('patternsearch','UseParallel',true,...
'UseCompletePoll',true);
2 Set the ga or simulannealbnd options to use patternsearch as a hybrid
function:
options = optimoptions('ga','UseParallel',true); % for ga
options = optimoptions('ga',options,...
'HybridFcn',{@patternsearch,hybridopts});
% or, for simulannealbnd:
options = optimoptions(@simulannealbnd,'HybridFcn',{@patternsearch,hybridopts});
For more information on parallel patternsearch, see “Pattern Search” on page 9-7.
• If your hybrid function is fmincon:
9-17
9 Parallel Processing
hybridopts = optimoptions(@fmincon,'UseParallel',true,...
'Algorithm','interior-point');
% You can use any Algorithm except trust-region-reflective
2 Set the ga or simulannealbnd options to use fmincon as a hybrid function:
options = optimoptions('ga','UseParallel',true);
options = optimoptions('ga',options,'HybridFcn',{@fmincon,hybridopts});
% or, for simulannealbnd:
options = optimoptions(@simulannealbnd,'HybridFcn',{@fmincon,hybridopts});
9-18
Minimizing an Expensive Optimization Problem Using Parallel Computing Toolbox™
For the purpose of this example, we solve a problem in four variables, where the objective
and constraint functions are made artificially expensive by pausing.
function f = expensive_objfun(x)
%EXPENSIVE_OBJFUN An expensive objective function used in optimparfor example.
9-19
9 Parallel Processing
We are interested in measuring the time taken by fmincon in serial so that we can
compare it to the parallel time.
startPoint = [-1 1 1 -1];
options = optimoptions('fmincon','Display','iter','Algorithm','interior-point');
startTime = tic;
xsol = fmincon(@expensive_objfun,startPoint,[],[],[],[],[],[],@expensive_confun,options
time_fmincon_sequential = toc(startTime);
fprintf('Serial FMINCON optimization takes %g seconds.\n',time_fmincon_sequential);
First-order Norm of
Iter F-count f(x) Feasibility optimality step
0 5 1.839397e+00 1.500e+00 3.211e+00
1 11 -9.760099e-01 3.708e+00 7.902e-01 2.362e+00
2 16 -1.480976e+00 0.000e+00 8.344e-01 1.069e+00
3 21 -2.601599e+00 0.000e+00 8.390e-01 1.218e+00
4 29 -2.823630e+00 0.000e+00 2.598e+00 1.118e+00
5 34 -3.905338e+00 0.000e+00 1.210e+00 7.302e-01
6 39 -6.212992e+00 3.934e-01 7.372e-01 2.405e+00
7 44 -5.948762e+00 0.000e+00 1.784e+00 1.905e+00
8 49 -6.940062e+00 1.233e-02 7.668e-01 7.553e-01
9 54 -6.973887e+00 0.000e+00 2.549e-01 3.920e-01
10 59 -7.142993e+00 0.000e+00 1.903e-01 4.735e-01
11 64 -7.155325e+00 0.000e+00 1.365e-01 2.626e-01
12 69 -7.179122e+00 0.000e+00 6.336e-02 9.115e-02
13 74 -7.180116e+00 0.000e+00 1.069e-03 4.670e-02
14 79 -7.180409e+00 0.000e+00 7.799e-04 2.815e-03
15 84 -7.180410e+00 0.000e+00 6.070e-06 3.122e-04
Since ga usually takes many more function evaluations than fmincon, we remove the
expensive constraint from this problem and perform unconstrained optimization instead.
9-20
Minimizing an Expensive Optimization Problem Using Parallel Computing Toolbox™
Pass empty matrices [] for constraints. In addition, we limit the maximum number of
generations to 15 for ga so that ga can terminate in a reasonable amount of time. We are
interested in measuring the time taken by ga so that we can compare it to the parallel ga
evaluation. Note that running ga requires Global Optimization Toolbox.
9-21
9 Parallel Processing
available and there is a parallel pool of workers. Similarly, ga, gamultiobj, and
patternsearch solvers in Global Optimization Toolbox evaluate functions in parallel.
To use the parfor feature, we use the parpool function to set up the parallel
environment. The computer on which this example is published has four cores, so
parpool starts four MATLAB® workers. If there is already a parallel pool when you run
this example, we use that pool; see the documentation for parpool for more information.
if max(size(gcp)) == 0 % parallel pool needed
parpool % create the parallel pool
end
Starting parallel pool (parpool) using the 'local' profile ... connected to 4 workers.
To minimize our expensive optimization problem using the parallel fmincon function,
we need to explicitly indicate that our objective and constraint functions can be evaluated
in parallel and that we want fmincon to use its parallel functionality wherever possible.
Currently, finite differencing can be done in parallel. We are interested in measuring the
time taken by fmincon so that we can compare it to the serial fmincon run.
options = optimoptions(options,'UseParallel',true);
startTime = tic;
xsol = fmincon(@expensive_objfun,startPoint,[],[],[],[],[],[],@expensive_confun,options
time_fmincon_parallel = toc(startTime);
fprintf('Parallel FMINCON optimization takes %g seconds.\n',time_fmincon_parallel);
First-order Norm of
Iter F-count f(x) Feasibility optimality step
0 5 1.839397e+00 1.500e+00 3.211e+00
1 11 -9.760099e-01 3.708e+00 7.902e-01 2.362e+00
2 16 -1.480976e+00 0.000e+00 8.344e-01 1.069e+00
3 21 -2.601599e+00 0.000e+00 8.390e-01 1.218e+00
4 29 -2.823630e+00 0.000e+00 2.598e+00 1.118e+00
5 34 -3.905338e+00 0.000e+00 1.210e+00 7.302e-01
6 39 -6.212992e+00 3.934e-01 7.372e-01 2.405e+00
7 44 -5.948762e+00 0.000e+00 1.784e+00 1.905e+00
8 49 -6.940062e+00 1.233e-02 7.668e-01 7.553e-01
9 54 -6.973887e+00 0.000e+00 2.549e-01 3.920e-01
10 59 -7.142993e+00 0.000e+00 1.903e-01 4.735e-01
11 64 -7.155325e+00 0.000e+00 1.365e-01 2.626e-01
12 69 -7.179122e+00 0.000e+00 6.336e-02 9.115e-02
13 74 -7.180116e+00 0.000e+00 1.069e-03 4.670e-02
14 79 -7.180409e+00 0.000e+00 7.799e-04 2.815e-03
9-22
Minimizing an Expensive Optimization Problem Using Parallel Computing Toolbox™
9-23
9 Parallel Processing
9-24
Minimizing an Expensive Optimization Problem Using Parallel Computing Toolbox™
Utilizing parallel function evaluation via parfor improved the efficiency of both
fmincon and ga. The improvement is typically better for expensive objective and
constraint functions.
9-25
10
Options Reference
In this section...
“How to Set Properties” on page 10-2
“Properties of Both Objects” on page 10-2
“GlobalSearch Properties” on page 10-6
“MultiStart Properties” on page 10-8
gs = GlobalSearch('Display','iter', ...
'StartPointsToRun','bounds-ineqs');
ms.Display = 'iter';
ms = MultiStart(ms,'Display','iter','MaxTime',100);
For more information on setting properties, see “Changing Global Options” on page 3-73.
The syntax for creating a new object from an existing object is:
10-2
GlobalSearch and MultiStart Properties (Options)
ms = MultiStart(gs);
or
gs = GlobalSearch(ms);
The new object contains the properties that apply of the old object. This section describes
those shared properties:
Display
• 'final' (default) — Summary results to command line after last solver run.
• 'off' — No output to command line.
• 'iter' — Summary results to command line after each local solver run.
FunctionTolerance
The FunctionTolerance property describes how close two objective function values
must be for solvers to consider them identical for creating the vector of local solutions.
Set FunctionTolerance to 0 to obtain the results of every local solver run. Set
FunctionTolerance to a larger value to have fewer results.
Solvers consider two solutions identical if they are within XTolerance distance of each
other and have objective function values within FunctionTolerance of each other. If
both conditions are not met, solvers report the solutions as distinct. The tolerances are
relative, not absolute. For details, see “When fmincon Runs” on page 3-52 for
GlobalSearch, and “Create GlobalOptimSolution Object” on page 3-56 for MultiStart.
MaxTime
The MaxTime property describes a tolerance on the number of seconds since the solver
began its run. Solvers halt when they see MaxTime seconds have passed since the
10-3
10 Options Reference
beginning of the run. Time means wall clock as opposed to processor cycles. The default
is Inf.
OutputFcn
The OutputFcn property directs the global solver to run one or more output functions
after each local solver run completes. The output functions also run when the global
solver starts and ends. Include a handle to an output function written in the appropriate
syntax, or include a cell array of such handles. The default is an empty entry ([]).
• stop is a Boolean. When true, the algorithm stops. When false, the algorithm
continues.
Note A local solver can have an output function. The global solver does not
necessarily stop when a local solver output function causes a local solver run to stop.
If you want the global solver to stop in this case, have the global solver output
function stop when optimValues.localsolution.exitflag=-1.
• optimValues is a structure, described in “optimValues Structure” on page 10-5.
• state is the current state of the global algorithm:
• 'init' — The global solver has not called the local solver. The fields in the
optimValues structure are empty, except for localrunindex, which is 0, and
funccount, which contains the number of objective and constraint function
evaluations.
• 'iter' — The global solver calls output functions after each local solver run.
• 'done' — The global solver finished calling local solvers. The fields in
optimValues generally have the same values as the ones from the final output
function call with state='iter'. However, the value of
optimValues.funccount for GlobalSearch can be larger than the value in the
last function call with 'iter', because the GlobalSearch algorithm might have
performed some function evaluations that were not part of a local solver. For more
information, see “GlobalSearch Algorithm” on page 3-50.
For an example using an output function, see “GlobalSearch Output Function” on page 3-
39.
10-4
GlobalSearch and MultiStart Properties (Options)
Note Output and plot functions do not run when MultiStart has the UseParallel
option set to true and there is an open parpool.
optimValues Structure
PlotFcn
The PlotFcn property directs the global solver to run one or more plot functions after
each local solver run completes. Include a handle to a plot function written in the
appropriate syntax, or include a cell array of such handles. The default is an empty entry
([]).
The syntax of a plot function is the same as that of an output function. For details, see
“OutputFcn” on page 10-4.
There are two predefined plot functions for the global solvers:
For an example using a plot function, see “MultiStart Plot Function” on page 3-44.
If you specify more than one plot function, all plots appear as subplots in the same
window. Right-click any subplot to obtain a larger version in a separate figure window.
Note Output and plot functions do not run when MultiStart has the UseParallel
option set to true and there is an open parpool.
10-5
10 Options Reference
StartPointsToRun
The StartPointsToRun property directs the solver to exclude certain start points from
being run:
XTolerance
The XTolerance property describes how close two points must be for solvers to consider
them identical for creating the vector of local solutions. Set XTolerance to 0 to obtain
the results of every local solver run. Set XTolerance to a larger value to have fewer
results. Solvers compute the distance between a pair of points with norm, the Euclidean
distance.
Solvers consider two solutions identical if they are within XTolerance distance of each
other and have objective function values within FunctionTolerance of each other. If
both conditions are not met, solvers report the solutions as distinct. The tolerances are
relative, not absolute. For details, see “When fmincon Runs” on page 3-52 for
GlobalSearch, and “Create GlobalOptimSolution Object” on page 3-56 for MultiStart.
GlobalSearch Properties
• “NumTrialPoints” on page 10-6
• “NumStageOnePoints” on page 10-7
• “MaxWaitCycle” on page 10-7
• “BasinRadiusFactor” on page 10-7
• “DistanceThresholdFactor” on page 10-7
• “PenaltyThresholdFactor” on page 10-7
NumTrialPoints
10-6
GlobalSearch and MultiStart Properties (Options)
Default: 1000
NumStageOnePoints
Number of start points in Stage 1. For details, see “Obtain Stage 1 Start Point, Run” on
page 3-51.
Default: 200
MaxWaitCycle
Default: 20
BasinRadiusFactor
A basin radius decreases after MaxWaitCycle consecutive start points are within the
basin. The basin radius decreases by a factor of 1–BasinRadiusFactor.
Default: 0.2
DistanceThresholdFactor
PenaltyThresholdFactor
Determines increase in penalty threshold. For details, see React to Large Counter Values
on page 3-54.
Default: 0.2
10-7
10 Options Reference
MultiStart Properties
UseParallel
The UseParallel property determines whether the solver distributes start points to
multiple processors:
For the solver to run in parallel you must set up a parallel environment with parpool.
For details, see “How to Use Parallel Processing” on page 9-12.
10-8
Pattern Search Options
• If you are using the Optimization app, you specify the options by selecting an option
from a drop-down list or by entering the value of the option in the text field.
• If you are calling patternsearch from the command line, you specify the options by
creating an options structure using the function optimoptions, as follows:
options = optimoptions('patternsearch','Param1',value1,'Param2',value2,...);
For example:
10-9
10 Options Reference
• Poll method refers to the label of the option in the Optimization app.
• PollMethod refers to the corresponding field of the options structure.
Plot Options
Plot options enable you to plot data from the pattern search while it is running. When
you select plot functions and run the pattern search, a plot window displays the plots on
separate axes. You can stop the algorithm at any time by clicking the Stop button on the
plot window.
You can select any of the following plots in the Plot functions pane.
• Best function value (@psplotbestf) plots the best objective function value.
• Function count (@psplotfuncount) plots the number of function evaluations.
• Mesh size (@psplotmeshsize) plots the mesh size.
• Best point (@psplotbestx) plots the current best point.
• Max constraint (@psplotmaxconstr) plots the maximum nonlinear constraint
violation.
• Custom enables you to use your own plot function. To specify the plot function using
the Optimization app,
“Structure of the Plot Functions” on page 10-11 describes the structure of a plot
function.
To display a plot when calling patternsearch from the command line, set the PlotFcn
option to be a function handle to the plot function. For example, to display the best
function value, set options as follows
10-10
Pattern Search Options
where @plotfun1, @plotfun2, and so on are function handles to the plot functions
(listed in parentheses in the preceding list).
If you specify more than one plot function, all plots appear as subplots in the same
window. Right-click any subplot to obtain a larger version in a separate figure window.
• x — Current point
• iteration — Iteration number
• fval — Objective function value
• meshsize — Current mesh size
• funccount — Number of function evaluations
• method — Method used in last iteration
• TolFun — Tolerance on function value in last iteration
• TolX — Tolerance on x value in last iteration
• nonlinineq — Nonlinear inequality constraints, displayed only when a nonlinear
constraint function is specified
• nonlineq — Nonlinear equality constraints, displayed only when a nonlinear
constraint function is specified
• flag — Current state in which the plot function is called. The possible values for
flag are
10-11
10 Options Reference
The output argument stop provides a way to stop the algorithm at the current iteration.
stop can have the following values:
Poll Options
Poll options control how the pattern search polls the mesh points at each iteration.
Poll method (PollMethod) specifies the pattern the algorithm uses to create the mesh.
There are two patterns for each of the classes of direct search algorithms: the generalized
pattern search (GPS) algorithm, the generating set search (GSS) algorithm, and the
mesh adaptive direct search (MADS) algorithm. These patterns are the Positive basis 2N
and the Positive basis N+1:
For example, if the optimization problem has three independent variables, the pattern
consists of the following six vectors.
10-12
Pattern Search Options
For example, if the objective function has three independent variables, the pattern
consists of the following four vectors.
Complete poll (UseCompletePoll) specifies whether all the points in the current
mesh must be polled at each iteration. Complete Poll can have the values On or Off.
• If you set Complete poll to On, the algorithm polls all the points in the mesh at each
iteration and chooses the point with the smallest objective function value as the
current point at the next iteration.
• If you set Complete poll to Off, the default value, the algorithm stops the poll as
soon as it finds a point whose objective function value is less than that of the current
point. The algorithm then sets that point as the current point at the next iteration.
10-13
10 Options Reference
• Consecutive — The algorithm polls the mesh points in consecutive order, that is, the
order of the pattern vectors as described in “Poll Method” on page 4-23.
Search Options
Search options specify an optional search that the algorithm can perform at each
iteration prior to the polling. If the search returns a point that improves the objective
function, the algorithm uses that point at the next iteration and omits the polling. Please
note, if you have selected the same Search method and Poll method, only the option
selected in the Poll method will be used, although both will be used when the options
selected are different.
Complete search (UseCompleteSearch) applies when you set Search method to GPS
Positive basis Np1, GPS Positive basis 2N, GSS Positive basis Np1, GSS
Positive basis 2N, MADS Positive basis Np1, MADS Positive basis 2N, or
Latin hypercube. Complete search can have the values on (true) or off (false).
For GPS Positive basis Np1, MADS Positive basis Np1, GPS Positive
basis 2N, or MADS Positive basis 2N, Complete search has the same meaning as
the poll option Complete poll.
Search method (SearchFcn) specifies the optional search step. The options are
10-14
Pattern Search Options
To change the default values of Iteration limit and Options at the command line,
use the syntax
options = optimoptions('patternsearch','SearchFcn',...
{@searchga,iterlim,optionsGA})
where iterlim is the value of Iteration limit and optionsGA is the genetic
algorithm options structure.
The way the search is performed depends on the setting for Complete search:
• If you set Complete search to On, the algorithm polls all the points that are
randomly generated at each iteration by the Latin hypercube search and chooses
the one with the smallest objective function value.
• If you set Complete search to Off (the default), the algorithm stops the poll as
soon as it finds one of the randomly generated points whose objective function
value is less than that of the current point, and chooses that point for the next
iteration.
To change the default values of Iteration limit and Design level at the command
line, use the syntax
10-15
10 Options Reference
where iterlim is the value of Iteration limit and level is the value of Design
level.
• Nelder-Mead (@searchneldermead) specifies a search using fminsearch, which
uses the Nelder-Mead algorithm. If you select Nelder-Mead, two other options
appear:
To change the default values of Iteration limit and Options at the command line,
use the syntax
options = optimoptions('patternsearch','SearchFcn',...
{@searchneldermead,iterlim,optionsNM})
where iterlim is the value of Iteration limit and optionsNM is the options
structure.
• Custom enables you to write your own search function. To specify the search function
using the Optimization app,
To see a template that you can use to write your own search function, enter
edit searchfcntemplate
10-16
Pattern Search Options
function [successSearch,xBest,fBest,funccount] =
searchfcntemplate(fun,x,A,b,Aeq,beq,lb,ub, ...
optimValues,options)
• x — Current point
• fval — Objective function value at x
• iteration — Current iteration number
• funccount — Counter for user function evaluation
• scale — Scale factor used to scale the design points
• problemtype — Flag passed to the search routines, indicating whether the
problem is 'unconstrained', 'boundconstraints', or
'linearconstraints'. This field is a subproblem type for nonlinear constrained
problems.
• meshsize — Current mesh size used in search step
• method — Method used in last iteration
• options — Pattern search options structure
10-17
10 Options Reference
Mesh Options
Mesh options control the mesh that the pattern search uses. The following options are
available.
Initial size (InitialMeshSize) specifies the size of the initial mesh, which is the
length of the shortest vector from the initial point to a mesh point. Initial size should be
a positive scalar. The default is 1.0.
Max size (MaxMeshSize) specifies a maximum size for the mesh. When the maximum
size is reached, the mesh size does not increase after a successful iteration. Max size
must be a positive scalar, and is only used when a GPS or GSS algorithm is selected as
the Poll or Search method. The default value is Inf. MADS uses a maximum size of 1.
Accelerator (AccelerateMesh) specifies whether, when the mesh size is small, the
Contraction factor is multiplied by 0.5 after each unsuccessful iteration. Accelerator
can have the values On or Off, the default. Accelerator applies to the GPS and GSS
algorithms.
Rotate (MeshRotate) is only applied when Poll method is set to GPS Positive
basis Np1 or GSS Positive basis Np1. It specifies whether the mesh vectors are
multiplied by –1 when the mesh size is less than 1/100 of the mesh tolerance (minimum
mesh size MeshTolerance) after an unsuccessful poll. In other words, after the first
unsuccessful poll with small mesh size, instead of polling in directions ei (unit positive
directions) and –Σei, the algorithm polls in directions –ei and Σei. Rotate can have the
values Off or On (the default). When the problem has equality constraints, Rotate is
disabled.
Note Changing the setting of Rotate has no effect on the poll when Poll method is set
to GPS Positive basis 2N, GSS Positive basis 2N, MADS Positive basis 2N,
or MADS Positive basis Np1.
Scale (ScaleMesh) specifies whether the algorithm scales the mesh points by carefully
multiplying the pattern vectors by constants proportional to the logarithms of the
absolute values of components of the current point (or, for unconstrained problems, of the
initial point). Scale can have the values Off or On (the default). When the problem has
equality constraints, Scale is disabled.
10-18
Pattern Search Options
See “Mesh Expansion and Contraction” on page 4-60 for more information.
Constraint Parameters
For information on the meaning of penalty parameters, see “Nonlinear Constraint Solver
Algorithm” on page 4-39.
Bind tolerance (TolBind) specifies the tolerance for the distance from the current
point to the boundary of the feasible region with respect to linear constraints. This
means Bind tolerance specifies when a linear constraint is active. Bind tolerance
is not a stopping criterion. Active linear constraints change the pattern of points
patternsearch uses for polling or searching. The default value of Bind tolerance is
1e-3.
Cache Options
The pattern search algorithm can keep a record of the points it has already polled, so
that it does not have to poll the same point more than once. If the objective function
requires a relatively long time to compute, the cache option can speed up the algorithm.
10-19
10 Options Reference
The memory allocated for recording the points is called the cache. This option should only
be used for deterministic objective functions, but not for stochastic ones.
Cache (Cache) specifies whether a cache is used. The options are On and Off, the
default. When you set Cache to On, the algorithm does not evaluate the objective
function at any mesh points that are within Tolerance of a point in the cache.
Tolerance (CacheTol) specifies how close a mesh point must be to a point in the cache
for the algorithm to omit polling it. Tolerance must be a positive scalar. The default
value is eps.
Size (CacheSize) specifies the size of the cache. Size must be a positive scalar. The
default value is 1e4.
Stopping Criteria
Stopping criteria determine what causes the pattern search algorithm to stop. Pattern
search uses the following criteria:
Mesh tolerance (MeshTolerance) specifies the minimum tolerance for mesh size. The
GPS and GSS algorithms stop if the mesh size becomes smaller than Mesh tolerance.
MADS 2N stops when the mesh size becomes smaller than MeshTolerance^2. MADS
Np1 stops when the mesh size becomes smaller than (MeshTolerance/nVar)^2, where
nVar is the number of elements of x0. The default value of MeshTolerance is 1e-6.
10-20
Pattern Search Options
Time limit (MaxTime) specifies the maximum time in seconds the pattern search
algorithm runs before stopping. This also includes any specified pause time for pattern
search algorithms.
10-21
10 Options Reference
To see a template that you can use to write your own output function, enter
edit psoutputfcntemplate
MATLAB passes the optimvalues, state, and flag data to your output function, and
the output function returns stop, options, and optchanged data.
• x — Current point
• iteration — Iteration number
• fval — Objective function value at x
• meshsize — Current mesh size
• funccount — Number of function evaluations
• method — Method used in last iteration, such as 'Update multipliers' or
'Increase penalty' for a nonlinearly constrained problem, or 'Successful
Poll', 'Refine Mesh', or 'Successful Search', possibly with a '\Rotate'
suffix, for a problem without nonlinear constraints
• TolFun — Absolute value of change in function value in last iteration
• TolX — Norm of change in x in last iteration
• nonlinineq — Nonlinear inequality constraint function values at x, displayed
only when a nonlinear constraint function is specified
• nonlineq — Nonlinear equality constraint function values at x, displayed only
when a nonlinear constraint function is specified
10-22
Pattern Search Options
• stop — Provides a way to stop the algorithm at the current iteration. stop can have
the following values.
10-23
10 Options Reference
Note You must set UseCompletePoll to true for patternsearch to use vectorized or
parallel polling. Similarly, set UseCompleteSearch to true for vectorized or parallel
searching.
10-24
Pattern Search Options
it loops through the mesh points. (At the command line, this assumes
'UseParallel' is at its default value of false.)
• When Evaluate objective and constraint functions ('UseVectorized') is
vectorized (true), patternsearch calls the objective function on all the points in
the mesh at once, i.e., in a single call to the objective function.
If there are nonlinear constraints, the objective function and the nonlinear
constraints all need to be vectorized in order for the algorithm to compute in a
vectorized manner.
For details and an example, see “Vectorize the Objective and Constraint Functions”
on page 4-81.
• When Evaluate objective and constraint functions (UseParallel) is in
parallel (true), patternsearch calls the objective function in parallel, using the
parallel environment you established (see “How to Use Parallel Processing” on page 9-
12). At the command line, set UseParallel to false to compute serially.
Note You cannot simultaneously use vectorized and parallel computations. If you set
'UseParallel' to true and 'UseVectorized' to true, patternsearch evaluates
your objective and constraint functions in a vectorized manner, not in parallel.
10-25
10 Options Reference
10-26
Pattern Search Options
10-27
10 Options Reference
10-28
Genetic Algorithm Options
• If you are using the Optimization app (optimtool), select an option from a drop-
down list or enter the value of the option in a text field.
• If you are calling ga or gamultiobj at the command line, create options using the
function optimoptions, as follows:
options = optimoptions('ga','Param1', value1, 'Param2', value2, ...);
% or
options = optimoptions('gamultiobj','Param1', value1, 'Param2', value2, ...);
10-29
10 Options Reference
See “Setting Options at the Command Line” on page 5-67 for examples.
For example:
Plot Options
Plot options let you plot data from the genetic algorithm while it is running. You can stop
the algorithm at any time by clicking the Stop button on the plot window.
You can select any of the following plot functions in the Plot functions pane for both ga
and gamultiobj:
10-30
Genetic Algorithm Options
• Best fitness (@gaplotbestf) plots the best score value and mean score versus
generation.
• Best individual (@gaplotbestindiv) plots the vector entries of the individual with
the best fitness function value in each generation.
• Expectation (@gaplotexpectation) plots the expected number of children versus
the raw scores at each generation.
• Range (@gaplotrange) plots the minimum, maximum, and mean score values in
each generation.
• Pareto front (@gaplotpareto) plots the Pareto front for the first two objective
functions.
• Average Pareto distance (@gaplotparetodistance) plots a bar chart of the
distance of each individual from its neighbors.
• Rank histogram (@gaplotrankhist) plots a histogram of the ranks of the
individuals. Individuals of rank 1 are on the Pareto frontier. Individuals of rank 2 are
lower than at least one rank 1 individual, but are not lower than any individuals from
other ranks, etc.
• Average Pareto spread (@gaplotspread) plots the average spread as a function of
iteration number.
To display a plot when calling ga or gamultiobj from the command line, set the
PlotFcn option to be a function handle to the plot function. For example, to display the
best fitness plot, set options as follows:
10-31
10 Options Reference
where @plotfun1, @plotfun2, and so on are function handles to the plot functions.
If you specify multiple plot functions, all plots appear as subplots in the same window.
Right-click any subplot to obtain a larger version in a separate figure window.
The output argument state is a state structure as well. Pass the input argument,
modified if you like; see “Changing the State Structure” on page 10-56. To stop the
iterations, set state.StopFlag to a nonempty character vector, such as 'y'.
The state structure for ga, which is an input argument to plot, mutation, and output
functions, contains the following fields:
10-32
Genetic Algorithm Options
gamultiobj
The state structure for gamultiobj, which is an input argument to plot, mutation, and
output functions, contains the following fields:
10-33
10 Options Reference
Population Options
Population options let you specify the parameters of the population that the genetic
algorithm uses.
Population type (PopulationType) specifies the type of input to the fitness function.
Types and their restrictions are:
Caution The individuals in a Bit string population are vectors of type double, not
strings or characters.
10-34
Genetic Algorithm Options
or Custom. You cannot use a Hybrid function, and ga ignores all constraints,
including bounds, linear constraints, and nonlinear constraints.
• Custom — For Crossover function and Mutation function, use Custom. For
Creation function, either use Custom, or provide an Initial population. You
cannot use a Hybrid function, and ga ignores all constraints, including bounds,
linear constraints, and nonlinear constraints.
Population size (PopulationSize) specifies how many individuals there are in each
generation. With a large population size, the genetic algorithm searches the solution
space more thoroughly, thereby reducing the chance that the algorithm returns a local
minimum that is not a global minimum. However, a large population size also causes the
algorithm to run more slowly.
If you set Population size to a vector, the genetic algorithm creates multiple
subpopulations, the number of which is the length of the vector. The size of each
subpopulation is the corresponding entry of the vector. See “Migration Options” on page
10-48.
Creation function (CreationFcn) specifies the function that creates the initial
population for ga. Do not specify a creation function with integer problems because ga
overrides any choice you make. Choose from:
10-35
10 Options Reference
For an example showing its behavior, see “Custom Plot Function and Linear
Constraints in ga” on page 5-79.
• Nonlinear Feasible population (@gacreationnonlinearfeasible) is the
default creation function for the 'penalty' nonlinear constraint algorithm. For
details, see “Constraint Parameters” on page 10-49.
• Custom lets you write your own creation function, which must generate data of the
type that you specify in Population type. To specify the creation function if you are
using the Optimization app,
The function returns Population, the initial population for the genetic algorithm.
Caution When you have bounds or linear constraints, ensure that your creation
function creates individuals that satisfy these constraints. Otherwise, your population
might not satisfy the constraints.
10-36
Genetic Algorithm Options
Initial scores (InitialScoreMatrix) specifies initial scores for the initial population.
The initial scores can also be partial. Do not specify initial scores with integer problems
because ga overrides any choice you make.
• Rank (@fitscalingrank) — The default fitness scaling function, Rank, scales the
raw scores based on the rank of each individual instead of its score. The rank of an
individual is its position in the sorted scores. An individual with rank r has scaled
score proportional to 1 / r . So the scaled score of the most fit individual is
proportional to 1, the scaled score of the next most fit is proportional to 1 / 2 , and so
on. Rank fitness scaling removes the effect of the spread of the raw scores. The square
10-37
10 Options Reference
root makes poorly ranked individuals more nearly equal in score, compared to rank
scoring. For more information, see “Fitness Scaling” on page 5-86.
• Proportional (@fitscalingprop) — Proportional scaling makes the scaled value
of an individual proportional to its raw fitness score.
• Top (@fitscalingtop) — Top scaling scales the top individuals equally. Selecting
Top displays an additional field, Quantity, which specifies the number of individuals
that are assigned positive scaled values. Quantity can be an integer from 1 through
the population size or a fraction from 0 through 1 specifying a fraction of the
population size. The default value is 0.4. Each of the individuals that produce
offspring is assigned an equal scaled value, while the rest are assigned the value 0.
The scaled values have the form [01/n 1/n 0 0 1/n 0 0 1/n ...].
To change the default value for Quantity at the command line, use the following
syntax:
options = optimoptions('ga','FitnessScalingFcn', {@fitscalingtop,quantity})
To change the default value of Max survival rate at the command line, use the
following syntax
options = optimoptions('ga','FitnessScalingFcn',
{@fitscalingshiftlinear, rate})
10-38
Genetic Algorithm Options
The function returns expectation, a column vector of scalars of the same length as
scores, giving the scaled values of each member of the population. The sum of the
entries of expectation must equal nParents.
Selection Options
Selection options specify how the genetic algorithm chooses parents for the next
generation. You can specify the function the algorithm uses in the Selection function
(SelectionFcn) field in the Selection options pane. Do not use with integer problems.
10-39
10 Options Reference
To change the default value of Tournament size at the command line, use the
syntax
options = optimoptions('ga','SelectionFcn',...
{@selectiontournament,size})
10-40
Genetic Algorithm Options
• expectation
• For ga, expectation is a column vector of the scaled fitness of each member
of the population. The scaling comes from the “Fitness Scaling Options” on page
10-37.
Tip You can ensure that you have a column vector by using expectation(:,
1). For example, edit selectionstochunif or any of the other built-in
selection functions.
• For gamultiobj, expectation is a matrix whose first column is the rank of
the individuals, and whose second column is the distance measure of the
individuals. See “Multiobjective Options” on page 10-50.
• nParents— Number of parents to select.
• options — Genetic algorithm options structure.
The function returns parents, a row vector of length nParents containing the
indices of the parents that you select.
Reproduction Options
Reproduction options specify how the genetic algorithm creates children for the next
generation.
Elite count (EliteCount) specifies the number of individuals that are guaranteed to
survive to the next generation. Set Elite count to be a positive integer less than or equal
to the population size. The default value is ceil(0.05*PopulationSize) for
continuous problems, and 0.05*(default PopulationSize) for mixed-integer
problems.
10-41
10 Options Reference
Mutation Options
Mutation options specify how the genetic algorithm makes small random changes in the
individuals in the population to create mutation children. Mutation provides genetic
diversity and enables the genetic algorithm to search a broader space. You can specify
the mutation function in the Mutation function (MutationFcn) field in the Mutation
options pane. Do not use with integer problems. You can choose from the following
functions:
• The Scale parameter determines the standard deviation at the first generation. If
you set Initial range to be a 2-by-1 vector v, the initial standard deviation is the
same at all coordinates of the parent vector, and is given by Scale*(v(2)-v(1)).
If you set Initial range to be a vector v with two rows and Number of variables
columns, the initial standard deviation at coordinate i of the parent vector is given
by Scale*(v(i,2) - v(i,1)).
• The Shrink parameter controls how the standard deviation shrinks as
generations go by. If you set Initial range to be a 2-by-1 vector, the standard
deviation at the kth generation, σk, is the same at all coordinates of the parent
vector, and is given by the recursive formula
Ê k ˆ
s k = s k-1 Á 1 - Shrink ˜.
Ë Generations ¯
If you set Initial range to be a vector with two rows and Number of variables
columns, the standard deviation at coordinate i of the parent vector at the kth
generation, σi,k, is given by the recursive formula
Ê k ˆ
s i,k = s i,k-1 Á1 - Shrink ˜.
Ë Generations ¯
10-42
Genetic Algorithm Options
If you set Shrink to 1, the algorithm shrinks the standard deviation in each
coordinate linearly until it reaches 0 at the last generation is reached. A negative
value of Shrink causes the standard deviation to grow.
The default value of both Scale and Shrink is 1. To change the default values at the
command line, use the syntax
options = optimoptions('ga','MutationFcn', ...
{@mutationgaussian, scale, shrink})
where scale and shrink are the values of Scale and Shrink, respectively.
Caution Do not use mutationgaussian when you have bounds or linear constraints.
Otherwise, your population will not necessarily satisfy the constraints.
• Uniform (mutationuniform) — Uniform mutation is a two-step process. First, the
algorithm selects a fraction of the vector entries of an individual for mutation, where
each entry has a probability Rate of being mutated. The default value of Rate is
0.01. In the second step, the algorithm replaces each selected entry by a random
number selected uniformly from the range for that entry.
To change the default value of Rate at the command line, use the syntax
options = optimoptions('ga','MutationFcn', {@mutationuniform, rate})
Caution Do not use mutationuniform when you have bounds or linear constraints.
Otherwise, your population will not necessarily satisfy the constraints.
• Adaptive Feasible (mutationadaptfeasible), the default mutation function
when there are constraints, randomly generates directions that are adaptive with
respect to the last successful or unsuccessful generation. The mutation chooses a
direction and step length that satisfies bounds and linear constraints.
• Custom enables you to write your own mutation function. To specify the mutation
function using the Optimization app,
10-43
10 Options Reference
Caution When you have bounds or linear constraints, ensure that your mutation
function creates individuals that satisfy these constraints. Otherwise, your population
will not necessarily satisfy the constraints.
Crossover Options
Crossover options specify how the genetic algorithm combines two individuals, or
parents, to form a crossover child for the next generation.
Crossover function (CrossoverFcn) specifies the function that performs the crossover.
Do not use with integer problems. You can choose from the following functions:
10-44
Genetic Algorithm Options
where the vector is a 1 from the first parent, and the genes where the vector is a 0
from the second parent, and combines the genes to form the child. For example, if p1
and p2 are the parents
p1 = [a b c d e f g h]
p2 = [1 2 3 4 5 6 7 8]
and the binary vector is [1 1 0 0 1 0 0 0], the function returns the following child:
child1 = [a b 3 4 e 6 7 8]
• Selects vector entries numbered less than or equal to n from the first parent.
• Selects vector entries numbered greater than n from the second parent.
• Concatenates these entries to form a child vector.
p1 = [a b c d e f g h]
p2 = [1 2 3 4 5 6 7 8]
and the crossover point is 3, the function returns the following child.
child = [a b c 4 5 6 7 8]
• Vector entries numbered less than or equal to m from the first parent
• Vector entries numbered from m+1 to n, inclusive, from the second parent
• Vector entries numbered greater than n from the first parent.
10-45
10 Options Reference
The algorithm then concatenates these genes to form a single gene. For example, if p1
and p2 are the parents
p1 = [a b c d e f g h]
p2 = [1 2 3 4 5 6 7 8]
and the crossover points are 3 and 6, the function returns the following child.
child = [a b c 4 5 6 g h]
If all the entries of Ratio lie in the range [0, 1], the children produced are within the
hypercube defined by placing the parents at opposite vertices. If Ratio is not in that
range, the children might lie outside the hypercube. If Ratio is a scalar, then all the
children lie on the line between the parents.
To change the default value of Ratio at the command line, use the syntax
options = optimoptions('ga','CrossoverFcn', ...
{@crossoverintermediate, ratio});
10-46
Genetic Algorithm Options
To change the default value of Ratio at the command line, use the syntax
options = optimoptions('ga','CrossoverFcn',...
{@crossoverheuristic,ratio});
options = optimoptions('ga','CrossoverFcn',@myfun);
10-47
10 Options Reference
Caution When you have bounds or linear constraints, ensure that your crossover
function creates individuals that satisfy these constraints. Otherwise, your population
will not necessarily satisfy the constraints.
Migration Options
Note Subpopulations refer to a form of parallel processing for the genetic algorithm. ga
currently does not support this form. In subpopulations, each worker hosts a number of
individuals. These individuals are a subpopulation. The worker evolves the
subpopulation independently of other workers, except when migration causes some
individuals to travel between workers.
Because ga does not currently support this form of parallel processing, there is no benefit
to setting PopulationSize to a vector, or to setting the MigrationDirection,
MigrationInterval, or MigrationFraction options.
You can control how migration occurs by the following three fields in the Migration
options pane:
• If you set Direction to Forward ('forward'), migration takes place toward the
last subpopulation. That is, the nth subpopulation migrates into the (n+1)th
subpopulation.
• If you set Direction to Both ('both'), the nth subpopulation migrates into both
the (n–1)th and the (n+1)th subpopulation.
10-48
Genetic Algorithm Options
Migration wraps at the ends of the subpopulations. That is, the last subpopulation
migrates into the first, and the first may migrate into the last.
• Interval (MigrationInterval) — Specifies how many generation pass between
migrations. For example, if you set Interval to 20, migration takes place every 20
generations.
• Fraction (MigrationFraction) — Specifies how many individuals move between
subpopulations. Fraction specifies the fraction of the smaller of the two
subpopulations that moves. For example, if individuals migrate from a subpopulation
of 50 individuals into a subpopulation of 100 individuals and you set Fraction to 0.1,
the number of individuals that migrate is 0.1*50=5.
Constraint Parameters
Constraint parameters refer to the nonlinear constraint solver. For details on the
algorithm, see “Nonlinear Constraint Solver Algorithms” on page 5-51.
Penalty Algorithm
10-49
10 Options Reference
Multiobjective Options
Multiobjective options define parameters characteristic of the multiobjective genetic
algorithm. You can specify the following parameters:
• ParetoFraction — Sets the fraction of individuals to keep on the first Pareto front
while the solver selects individuals from higher fronts. This option is a scalar between
0 and 1.
• DistanceMeasureFcn — Defines a handle to the function that computes distance
measure of individuals, computed in decision variable or design space (genotype) or in
function space (phenotype). For example, the default distance measure function is
@distancecrowding in function space, which is the same as
@distancecrowding,'phenotype'.
10-50
Genetic Algorithm Options
gamultiobj passes the population in pop, the computed scores for the population
in scores, and the options in options. Your distance function returns the
distance from each member of the population to a reference, such as the nearest
neighbor in some sense. For an example, edit the built-in file
distancecrowding.m.
ga Hybrid Function
A hybrid function is another minimization function that runs after the genetic algorithm
terminates. You can specify a hybrid function in Hybrid function (HybridFcn) options.
Do not use with integer problems. The choices are
• [] — No hybrid function.
• fminsearch (@fminsearch) — Uses the MATLAB function fminsearch to perform
unconstrained minimization.
• patternsearch (@patternsearch) — Uses a pattern search to perform constrained
or unconstrained minimization.
• fminunc (@fminunc) — Uses the Optimization Toolbox function fminunc to perform
unconstrained minimization.
• fmincon (@fmincon) — Uses the Optimization Toolbox function fmincon to perform
constrained minimization.
Note Ensure that your hybrid function accepts your problem constraints. Otherwise, ga
throws an error.
10-51
10 Options Reference
You can set separate options for the hybrid function. Use optimset for fminsearch, or
optimoptions for fmincon, patternsearch, or fminunc. For example:
hybridopts = optimoptions('fminunc','Display','iter','Algorithm','quasi-newton');
Include the hybrid options in the Genetic Algorithm options structure as follows:
options = optimoptions('ga',options,'HybridFcn',{@fminunc,hybridopts});
A hybrid function is another minimization function that runs after the multiobjective
genetic algorithm terminates. You can specify the hybrid function fgoalattain in
Hybrid function (HybridFcn) options.
1 Compute the maximum and minimum of each objective function at the solutions. For
objective j at solution k, let
Fmax ( j) = max Fk ( j )
k
Fmin ( j) = min Fk ( j).
k
4 For each solution k, perform the goal attainment problem with goal vector Fmin and
weight vector p(j,k).
10-52
Genetic Algorithm Options
For gamultiobj, if the weighted average relative change in the spread of the Pareto
solutions over Stall generations is less than Function tolerance, and the spread is
smaller than the average spread over the last Stall generations, then the algorithm
stops. The spread is a measure of the movement of the Pareto front.
• Stall time limit (MaxStallTime) — The algorithm stops if there is no improvement
in the best fitness value for an interval of time in seconds specified by Stall time
limit, as measured by tic and toc.
• Function tolerance (FunctionTolerance) — The algorithm stops if the average
relative change in the best fitness function value over Stall generations is less than
or equal to Function tolerance. (If the StallTest option is
'geometricWeighted', then the test is for a geometric weighted average relative
change.)
For gamultiobj, if the weighted average relative change in the spread of the Pareto
solutions over Stall generations is less than Function tolerance, and the spread is
smaller than the average spread over the last Stall generations, then the algorithm
stops. The spread is a measure of the movement of the Pareto front.
10-53
10 Options Reference
To see a template that you can use to write your own output functions, enter
edit gaoutputfcntemplate
For an example, see “Custom Output Function for Genetic Algorithm” on page 5-119.
10-54
Genetic Algorithm Options
[state,options,optchanged] = myfun(options,state,flag)
MATLAB passes the options, state, and flag data to your output function, and the
output function returns state, options, and optchanged data.
Note To stop the iterations, set state.StopFlag to a nonempty character vector, such
as 'y'.
• state — Structure containing information about the current generation. “The State
Structure” on page 10-32 describes the fields of state. To stop the iterations, set
state.StopFlag to a nonempty character vector, such as 'y'.
• options — Options structure modified by the output function. This argument is
optional.
• optchanged — Boolean flag indicating changes to options. To change options for
subsequent iterations, set optchanged to true.
10-55
10 Options Reference
Caution Changing the state structure carelessly can lead to inconsistent or erroneous
results. Usually, you can achieve the same or better state modifications by using
mutation or crossover functions, instead of changing the state structure in a plot function
or output function.
ga output functions can change the state structure (see “The State Structure” on page
10-32). Be careful when changing values in this structure, as you can pass inconsistent
data back to ga.
Tip If your output structure changes the Population field, then be sure to update the
Score field, and possibly the Best, NonlinIneq, or NonlinEq fields, so that they
contain consistent information.
To update the Score field after changing the Population field, first calculate the
fitness function values of the population, then calculate the fitness scaling for the
population. See “Fitness Scaling Options” on page 10-37.
10-56
Genetic Algorithm Options
When a nonlinear constraint function has been specified, Iterative and Diagnose do
not display the Mean f(x), but will additionally display:
If there are nonlinear constraints, the fitness function and the nonlinear constraints
all need to be vectorized in order for the algorithm to compute in a vectorized manner.
10-57
10 Options Reference
Note You cannot simultaneously use vectorized and parallel computations. If you set
'UseParallel' to true and 'UseVectorized' to true, ga evaluates your fitness and
constraint functions in a vectorized manner, not in parallel.
10-58
Particle Swarm Options
Each option in this section is listed by its field name in options. For example, Display
refers to the corresponding field of options.
Swarm Creation
By default, particleswarm calls the @pswcreationuniform swarm creation function.
This function works as follows.
10-59
10 Options Reference
After creation, particleswarm checks that all particles satisfy any bounds, and
truncates components if necessary. If the Display option is 'iter' and a particle
needed truncation, then particleswarm notifies you.
Set a custom creation function using optimoptions to set the CreationFcn option to
@customcreation, where customcreation is the name of your creation function file. A
custom creation function has this syntax.
swarm = customcreation(problem)
The creation function should return a matrix of size SwarmSize-by-nvars, where each
row represents the location of one particle. See problem for details of the problem
structure. In particular, you can obtain SwarmSize from
problem.options.SwarmSize, and nvars from problem.nvars.
edit pswcreationuniform
Display Settings
The Display option specifies how much information is displayed at the command line
while the algorithm is running.
iter displays:
10-60
Particle Swarm Options
The DisplayInterval option sets the number of iterations that are performed before
the iterative display updates. Give a positive integer.
Algorithm Settings
The details of the particleswarm algorithm appear in “Particle Swarm Optimization
Algorithm” on page 6-11. This section describes the tuning parameters.
The main step in the particle swarm algorithm is the generation of new velocities for the
swarm:
For u1 and u2 uniformly (0,1) distributed random vectors of length nvars, update the
velocity
• Larger absolute value of inertia W leads to the new velocity being more in the same
line as the old, and with a larger absolute magnitude. A large absolute value of W can
destabilize the swarm. The value of W stays within the range of the two-element
vector InertiaRange.
• Larger values of y1 = SelfAdjustmentWeight make the particle head more toward
the best place it has visited.
• Larger values of y2 = SocialAdjustmentWeight make the particle head more
toward the best place in the current neighborhood.
10-61
10 Options Reference
The MinNeighborsFraction option sets both the initial neighborhood size for each
particle, and the minimum neighborhood size; see “Particle Swarm Optimization
Algorithm” on page 6-11. Setting MinNeighborsFraction to 1 has all members of the
swarm use the global minimum point as their societal adjustment target.
See “Optimize Using Particle Swarm” on page 6-3 for an example that sets a few of these
tuning options.
Hybrid Function
A hybrid function is another minimization function that runs after the particle swarm
algorithm terminates. You can specify a hybrid function in the HybridFcn option. The
choices are
• [] — No hybrid function.
• fminsearch (@fminsearch) — Use the MATLAB function fminsearch to perform
unconstrained minimization.
• patternsearch (@patternsearch) — Use a pattern search to perform constrained
or unconstrained minimization.
• fminunc (@fminunc) — Use the Optimization Toolbox function fminunc to perform
unconstrained minimization.
• fmincon (@fmincon) — Use the Optimization Toolbox function fmincon to perform
constrained minimization.
Note Ensure that your hybrid function accepts your problem constraints. Otherwise,
particleswarm throws an error.
You can set separate options for the hybrid function. Use optimset for fminsearch, or
optimoptions for fmincon, patternsearch, or fminunc. For example:
hybridopts = optimoptions('fminunc','Display','iter','Algorithm','quasi-newton');
options = optimoptions(options,'HybridFcn',{@fminunc,hybridopts});
10-62
Particle Swarm Options
For an example that uses a hybrid function, see “Optimize Using Particle Swarm” on
page 6-3.
options = optimoptions(@particleswarm,'OutputFcn',@outfun)
where outfun is a function with syntax specified in “Structure of the Output Function or
Plot Function” on page 10-63. If you have several output functions, pass them in a cell
array:
options = optimoptions(@particleswarm,'OutputFcn',{@outfun1,@outfun2,@outfun3})
Similarly, plot functions are functions that particleswarm calls at each iteration. The
difference between an output function and a plot function is that a plot function has
built-in plotting enhancements, such as buttons that appear on the plot window to pause
or stop particleswarm. To specify a plot function,
options = optimoptions(@particleswarm,'PlotFcn',@plotfun)
where plotfun is a function with syntax specified in “Structure of the Output Function
or Plot Function” on page 10-63. If you have several plot functions, pass them in a cell
array:
options = optimoptions(@particleswarm,'PlotFcn',{@plotfun1,@plotfun2,@plotfun3})
The lone built-in plot function @pswplotbestf plots the best objective function value
against iterations.
For an example of a custom output function, see “Particle Swarm Output Function” on
page 6-7.
stop = myfun(optimValues,state)
10-63
10 Options Reference
If your function sets stop to true, iterations end. Set stop to false to have
particleswarm continue to calculate.
• 'init' — The solver has not begun to iterate. Your output function or plot
function can use this state to open files, or set up data structures or plots for
subsequent iterations.
• 'iter' — The solver is proceeding with its iterations. Typically, this is where
your output function or plot function performs its work.
• 'done' — The solver reached a stopping criterion. Your output function or plot
function can use this state to clean up, such as closing any files it opened.
optimValues Structure
10-64
Particle Swarm Options
Field Contents
swarmfvals Vector containing the objective function values of particles in
the swarm. For particle i, swarmfvals(i) =
fun(swarm(i,:)), where fun is the objective function.
Parallel particleswarm
If you have a Parallel Computing Toolbox license, you can distribute the evaluation of the
objective functions to the swarm among your processors or cores. Set the UseParallel
option to true.
Parallel computation is likely to be faster than serial when your objective function is
computationally expensive, or when you have many particles and processors. Otherwise,
communication overhead can cause parallel computation to be slower than serial
computation.
Vectorized particleswarm
If your objective function can evaluate all the particles at once, you can usually save time
by setting the UseVectorized option to true. Your objective function should accept an
M-by-N matrix, where each row represents one particle, and return an M-by-1 vector of
objective function values. This option works the same way as the patternsearch and
ga UseVectorized options. For patternsearch details, see “Vectorize the Objective
and Constraint Functions” on page 4-81.
10-65
10 Options Reference
Stopping Criteria
particleswarm stops iterating when any of the following occur.
Stopping Option Stopping Test Exit Flag
MaxStallIterations and Relative change in the best 1
FunctionTolerance objective function value g
over the last
MaxStallIterations
iterations is less than
FunctionTolerance.
MaxIterations Number of iterations 0
reaches MaxIterations.
OutputFcn or PlotFcn OutputFcn or PlotFcn can -1
halt the iterations.
ObjectiveLimit Best objective function -3
value g is less than or equal
to ObjectiveLimit.
MaxStallTime Best objective function -4
value g did not change in
the last MaxStallTime
seconds.
MaxTime Function run time exceeds -5
MaxTime seconds.
Also, if you set the FunValCheck option to 'on', and the swarm has particles with NaN,
Inf, or complex objective function values, particleswarm stops and issues an error.
10-66
Simulated Annealing Options
Each option in this section is listed by its field name in the options structure. For
example, InitialTemperature refers to the corresponding field of the options
structure.
Plot Options
Plot options enable you to plot data from the simulated annealing solver while it is
running.
PlotInterval specifies the number of iterations between consecutive calls to the plot
function.
To display a plot when calling simulannealbnd from the command line, set the
PlotFcn field of options to be a function handle to the plot function. You can specify
any of the following plots:
10-67
10 Options Reference
For example, to display the best objective plot, set options as follows
options = optimoptions(@simulannealbnd,'PlotFcn',@saplotbestf);
where @plotfun1, @plotfun2, and so on are function handles to the plot functions.
If you specify more than one plot function, all plots appear as subplots in the same
window. Right-click any subplot to obtain a larger version in a separate figure window.
• x — Current point
• fval — Objective function value at x
• bestx — Best point found so far
• bestfval — Objective function value at best point
• temperature — Current temperature
10-68
Simulated Annealing Options
The output argument stop provides a way to stop the algorithm at the current iteration.
stop can have the following values:
Temperature Options
Temperature options specify how the temperature will be lowered at each iteration over
the course of the algorithm.
10-69
10 Options Reference
temperature = options.InitialTemperature./optimValues.k;
Algorithm Settings
Algorithm settings define algorithmic specific parameters used in generating new points
at each iteration.
newx = myfun(optimValues,problem)
10-70
Simulated Annealing Options
For example, the current position is optimValues.x, and the current objective
function value is problem.objective(optimValues.x).
where Δ = new objective – old objective, and T is the current temperature. Since
both Δ and T are positive, the probability of acceptance is between 0 and 1/2.
Smaller temperature leads to smaller acceptance probability. Also, larger Δ leads
to smaller acceptance probability.
• @myfun — A custom acceptance function, myfun. The syntax is:
acceptpoint = myfun(optimValues,newx,newfval);
10-71
10 Options Reference
• [] — No hybrid function.
• @fminsearch — Uses the MATLAB function fminsearch to perform unconstrained
minimization.
• @patternsearch — Uses patternsearch to perform constrained or unconstrained
minimization.
• @fminunc — Uses the Optimization Toolbox function fminunc to perform
unconstrained minimization.
• @fmincon — Uses the Optimization Toolbox function fmincon to perform
constrained minimization.
Note Ensure that your hybrid function accepts your problem constraints. Otherwise,
simulannealbnd throws an error.
You can set separate options for the hybrid function. Use optimset for fminsearch, or
optimoptions for fmincon, patternsearch, or fminunc. For example:
hybridopts = optimoptions('fminunc','Display','iter','Algorithm','quasi-newton');
options = optimoptions(@simulannealbnd,options,'HybridFcn',{@fminunc,hybridopts});
10-72
Simulated Annealing Options
• FunctionTolerance — The algorithm runs until the average change in value of the
objective function in StallIterLim iterations is less than FunctionTolerance.
The default value is 1e-6.
• MaxIterations — The algorithm stops if the number of iterations exceeds this
maximum number of iterations. You can specify the maximum number of iterations
as a positive integer or Inf. Inf is the default.
• MaxFunctionEvaluations specifies the maximum number of evaluations of the
objective function. The algorithm stops if the number of function evaluations exceeds
the maximum number of function evaluations. The allowed maximum is
3000*numberofvariables.
• MaxTime specifies the maximum time in seconds the algorithm runs before stopping.
• ObjectiveLimit — The algorithm stops if the best objective function value is less
than or equal to the value of ObjectiveLimit.
• Specify Output function as @myfun, where myfun is the name of your function.
• To pass extra parameters in the output function, use “Anonymous Functions”
(Optimization Toolbox).
• For multiple output functions, enter a cell array of output function handles:
{@myfun1,@myfun2,...}.
• options = optimoptions(@simulannealbnd,'OutputFcn',@myfun);
• For multiple output functions, enter a cell array:
options = optimoptions(@simulannealbnd,'OutputFcn',{@myfun1,@myfun2,...});
To see a template that you can use to write your own output functions, enter
edit saoutputfcntemplate
10-73
10 Options Reference
• x — Current point
• fval — Objective function value at x
• bestx — Best point found so far
• bestfval — Objective function value at best point
• temperature — Current temperature, a vector the same length as x
• iteration — Current iteration
• funccount — Number of function evaluations
• t0 — Start time for algorithm
• k — Annealing parameter, a vector the same length as x
• flag — Current state in which the output function is called. The possible values for
flag are
• stop — Provides a way to stop the algorithm at the current iteration. stop can have
the following values:
10-74
Simulated Annealing Options
Display Options
Use the Display option to specify how much information is displayed at the command
line while the algorithm is running. The available options are
• off — No output is displayed. This is the default value for an options structure
created using optimoptions.
• iter — Information is displayed at each iteration.
• diagnose — Information is displayed at each iteration. In addition, the diagnostic
lists some problem information and the options that have been changed from the
defaults.
• final — The reason for stopping is displayed. This is the default.
10-75
10 Options Reference
Beginning in R2016a, the recommended way to set options is to use optimoptions. (You
already set particleswarm options using optimoptions.)
Note GlobalSearch and MultiStart use a different mechanism for setting properties.
See “GlobalSearch and MultiStart Properties (Options)” on page 10-2. Some of these
property names changed as solver option names changed.
Some option names changed in R2016a. See “Table of Option Names in Legacy Order” on
page 10-80.
optimoptions “hides” some options, meaning it does not display their values.
optimoptions displays only current names, not legacy names. For details, see “View
Options” (Optimization Toolbox).
10-76
Options Changes in R2016a
10-77
10 Options Reference
10-78
Options Changes in R2016a
10-79
10 Options Reference
10-80
Options Changes in R2016a
10-81
10 Options Reference
10-82
Options Changes in R2016a
10-83
10 Options Reference
10-84
11
createOptimProblem
Create optimization problem structure
Syntax
problem = createOptimProblem('solverName')
problem =
createOptimProblem('solverName','ParameterName',ParameterValue,...)
Description
problem = createOptimProblem('solverName') creates an empty optimization
problem structure for the solverName solver.
problem =
createOptimProblem('solverName','ParameterName',ParameterValue,...)
accepts one or more comma-separated parameter name/value pairs. Specify
ParameterName inside single quotes.
Input Arguments
solverName
Name of the solver. For a GlobalSearch problem, use 'fmincon'. For a MultiStart
problem, use 'fmincon', 'fminunc', 'lsqcurvefit' or 'lsqnonlin'.
Matrix for linear equality constraints. The constraints have the form:
Aeq x = beq
11-2
createOptimProblem
Aineq
Matrix for linear inequality constraints. The constraints have the form:
Aineq x ≤ bineq
beq
Vector for linear equality constraints. The constraints have the form:
Aeq x = beq
bineq
Vector for linear inequality constraints. The constraints have the form:
Aineq x ≤ bineq
lb
nonlcon
Function handle to the nonlinear constraint function. The constraint function must
accept a vector x and return two vectors: c, the nonlinear inequality constraints, and
ceq, the nonlinear equality constraints. If one of these constraint functions is empty,
nonlcon must return [] for that function.
If the GradConstr option is 'on', then in addition nonlcon must return two additional
outputs, gradc and gradceq. The gradc parameter is a matrix with one column for the
gradient of each constraint, as is gradceq.
objective
Function handle to the objective function. For all solvers except lsqnonlin and
lsqcurvefit, the objective function must accept a vector x and return a scalar. If the
GradObj option is 'on', then the objective function must return a second output, a
11-3
11 Functions — Alphabetical List
vector, representing the gradient of the objective. For lsqnonlin, the objective function
must accept a vector x and return a vector. lsqnonlin sums the squares of the objective
function values. For lsqcurvefit, the objective function must accept two inputs, x and
xdata, and return a vector.
options
ub
x0
A vector, a potential starting point for the optimization. Gives the dimensionality of the
problem.
xdata
ydata
Output Arguments
problem
11-4
createOptimProblem
Examples
Create a problem structure using Rosenbrock's function as objective (see “Include a
Hybrid Function” on page 5-106), the interior-point algorithm for fmincon, and
bounds with absolute value 2:
Alternatives
You can create a problem structure by exporting from the Optimization app
(optimtool), as described in “Exporting from the Optimization app” on page 3-8.
See Also
GlobalSearch | MultiStart | optimtool
Topics
“Create Problem Structure” on page 3-5
Introduced in R2010a
11-5
11 Functions — Alphabetical List
CustomStartPointSet
Custom start points
Description
A CustomStartPointSet is an object wrapper of a matrix whose rows represent start
points for MultiStart.
Creation
Syntax
tpoints = CustomStartPointSet(ptmatrix)
Description
tpoints = CustomStartPointSet(ptmatrix) generates a CustomStartPointSet
object from the ptmatrix matrix. Each row of ptmatrix represents one start point.
Input Arguments
ptmatrix — Start points
matrix
Start points, specified as a matrix. Each row of ptmatrix represents one start point.
Example: randn(40,3) creates 40 start points of 3 dimensions.
Data Types: double
11-6
CustomStartPointSet
Properties
NumStartPoints — Number of start points
positive integer
Object Functions
list List start points
Examples
Create CustomStartPointSet
11-7
11 Functions — Alphabetical List
[x,y,z] = meshgrid(1:4);
ptmatrix = [x(:),y(:),z(:)] + [10,20,30];
tpoints = CustomStartPointSet(ptmatrix);
Extract the original matrix from the tpoints object by using list.
tpts = list(tpoints);
isequal(ptmatrix,tpts)
ans =
logical
See Also
MultiStart | RandomStartPointSet | list
Topics
“CustomStartPointSet Object for Start Points” on page 3-18
“Workflow for GlobalSearch and MultiStart” on page 3-3
Introduced in R2010a
11-8
ga
ga
Find minimum of function using genetic algorithm
Syntax
x = ga(fitnessfcn,nvars)
x = ga(fitnessfcn,nvars,A,b)
x = ga(fitnessfcn,nvars,A,b,Aeq,beq)
x = ga(fitnessfcn,nvars,A,b,Aeq,beq,LB,UB)
x = ga(fitnessfcn,nvars,A,b,Aeq,beq,LB,UB,nonlcon)
x = ga(fitnessfcn,nvars,A,b,Aeq,beq,LB,UB,nonlcon,options)
x = ga(fitnessfcn,nvars,A,b,[],[],LB,UB,nonlcon,IntCon)
x = ga(fitnessfcn,nvars,A,b,[],[],LB,UB,nonlcon,IntCon,options)
x = ga(problem)
[x,fval] = ga(fitnessfcn,nvars,...)
[x,fval,exitflag] = ga(fitnessfcn,nvars,...)
[x,fval,exitflag,output] = ga(fitnessfcn,nvars,...)
[x,fval,exitflag,output,population] = ga(fitnessfcn,nvars,...)
[x,fval,exitflag,output,population,scores] = ga(fitnessfcn,
nvars,...)
Description
x = ga(fitnessfcn,nvars) finds a local unconstrained minimum, x, to the objective
function, fitnessfcn. nvars is the dimension (number of design variables) of
fitnessfcn.
Note “Passing Extra Parameters” (Optimization Toolbox) explains how to pass extra
parameters to the objective function and nonlinear constraint functions, if necessary.
11-9
11 Functions — Alphabetical List
x = ga(fitnessfcn,nvars,A,b,Aeq,beq,LB,UB,nonlcon,options) minimizes
with the default optimization parameters replaced by values in options. (Set
nonlcon=[] if no nonlinear constraints exist.) Create options using optimoptions
(recommended) or gaoptimset.
Note When there are integer constraints, ga does not accept linear or nonlinear equality
constraints, only inequality constraints.
x = ga(fitnessfcn,nvars,A,b,[],[],LB,UB,nonlcon,IntCon,options)
minimizes with integer constraints and with the default optimization parameters
replaced by values in the options structure.
11-10
ga
[x,fval,exitflag,output,population,scores] = ga(fitnessfcn,
nvars,...) returns scores the scores of the final population.
Input Arguments
fitnessfcn
Handle to the fitness function or name of the fitness function. The fitness function should
accept a row vector of length nvars and return a scalar value.
nvars
A*x ≤ b.
If the problem has m linear inequality constraints and nvars variables, then
11-11
11 Functions — Alphabetical List
Note ga does not enforce linear constraints to be satisfied when the PopulationType
option is 'bitString' or 'custom'.
A*x ≤ b.
If the problem has m linear inequality constraints and nvars variables, then
Aeq
Aeq*x = beq.
If the problem has m linear equality constraints and nvars variables, then
Note ga does not enforce linear constraints to be satisfied when the PopulationType
option is 'bitString' or 'custom'.
beq
Aeq*x = beq.
If the problem has m linear equality constraints and nvars variables, then
11-12
ga
LB
Vector of lower bounds. ga enforces that iterations stay above LB. Set LB(i) = –Inf if
x(i)is unbounded below.
Note ga does not enforce bounds to be satisfied when the PopulationType option is
'bitString' or 'custom'.
UB
Vector of upper bounds. ga enforces that iterations stay below UB. Set UB(i) = Inf if
x(i) is unbounded above.
nonlcon
ga attempts to achieve c ≤ 0 and ceq = 0. c and ceq are row vectors when there are
multiple constraints. Set unused outputs to [].
To learn how to use vectorized constraints, see “Vectorized Constraints” on page 2-8.
If IntCon is not empty, the second output of nonlcon (ceq) must be an empty entry
([]).
For information on how ga uses nonlcon, see “Nonlinear Constraint Solver Algorithms”
on page 5-51.
11-13
11 Functions — Alphabetical List
options
optimoptions hides the options listed in italics. See “Options that optimoptions Hides”
on page 10-76.
11-14
ga
11-15
11 Functions — Alphabetical List
11-16
ga
11-17
11 Functions — Alphabetical List
11-18
ga
11-19
11 Functions — Alphabetical List
11-20
ga
ga only: @gaplotbestf |
@gaplotbestindiv |
@gaplotexpectation |
@gaplotrange
gamultiobj only:
@gaplotpareto |
@gaplotparetodistance
| @gaplotrankhist |
@gaplotspread
PlotInterval Positive integer specifying the number of Positive integer | {1}
generations between consecutive calls to
the plot functions.
PopulationSize Size of the population. Positive integer | {50}
when numberOfVariables
<= 5, {200} otherwise |
{min(max(10*nvars,40),
100)} for mixed-integer
problems
11-21
11 Functions — Alphabetical List
11-22
ga
IntCon
Vector of positive integers taking values from 1 to nvars. Each value in IntCon
represents an x component that is integer-valued.
Note When IntCon is nonempty, Aeq and beq must be an empty entry ([]), and
nonlcon must return empty for ceq. For more information on integer programming, see
“Mixed Integer Optimization” on page 5-28.
problem
11-23
11 Functions — Alphabetical List
Output Arguments
x
fval
exitflag
11-24
ga
When there are integer constraints, ga uses the penalty fitness value instead of the
fitness value for stopping criteria.
output
Structure containing output from each generation and other information about algorithm
performance. The output structure contains the following fields:
• 'unconstrained'
• 'boundconstraints'
• 'linearconstraints'
• 'nonlinearconstr'
• 'integerconstraints'
• rngstate — State of the MATLAB random number generator, just before the
algorithm started. You can use the values in rngstate to reproduce the output of ga.
See “Reproduce Results” on page 5-70.
• generations — Number of generations computed.
• funccount — Number of evaluations of the fitness function.
• message — Reason the algorithm terminated.
• maxconstraint — Maximum constraint violation, if any.
population
scores
Column vector of the fitness values (scores for integerconstraints problems) of the
final population.
Examples
Given the following inequality constraints and lower bounds
11-25
11 Functions — Alphabetical List
È 1 1˘ È 2˘
Í- 1 2 ˙ È x1 ˘ £ Í 2 ˙ ,
Í ˙Íx ˙ Í ˙
ÍÎ 2 1 ˙˚ Î 2 ˚ ÍÎ 3 ˙˚
x1 ≥ 0, x2 ≥ 0,
use this code to find the minimum of the lincontest6 function, which is provided in
your software:
A = [1 1; -1 2; 2 1];
b = [2; 2; 3];
lb = zeros(2,1);
[x,fval,exitflag] = ga(@lincontest6,...
2,A,b,[],[],lb)
x =
0.6700 1.3310
fval =
-8.2218
exitflag =
1
x =
0.2000 2.0000 5.0000
11-26
ga
Definitions
Complementarity Measure
In the nonlinear constraint solver, the complementarity measure is the norm of the
vector whose elements are ciλi, where ci is the nonlinear inequality constraint violation,
and λi is the corresponding Lagrange multiplier.
Tips
• To write a function with additional parameters to the independent variables that can
be called by ga, see “Passing Extra Parameters” (Optimization Toolbox).
• For problems that use the population type Double Vector (the default), ga does not
accept functions whose inputs are of type complex. To solve problems involving
complex data, write your functions so that they accept real vectors, by separating the
real and imaginary parts.
Algorithms
For a description of the genetic algorithm, see “How the Genetic Algorithm Works” on
page 5-18.
For a description of the nonlinear constraint algorithm, see “Nonlinear Constraint Solver
Algorithms” on page 5-51.
Alternatives
For problems without integer constraints, consider using patternsearch instead of ga.
11-27
11 Functions — Alphabetical List
References
[1] Goldberg, David E., Genetic Algorithms in Search, Optimization & Machine Learning,
Addison-Wesley, 1989.
See Also
gamultiobj | gaoptimset | optimoptions | particleswarm | patternsearch
Topics
“Genetic Algorithm”
“Getting Started with Global Optimization Toolbox”
“Optimization Problem Setup”
11-28
gamultiobj
gamultiobj
Find Pareto front of multiple fitness functions using genetic algorithm
Syntax
x = gamultiobj(fitnessfcn,nvars)
x = gamultiobj(fitnessfcn,nvars,A,b)
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq)
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub)
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub,nonlcon)
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub,options)
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub,nonlcon,options)
x = gamultiobj(problem)
[x,fval] = gamultiobj( ___ )
[x,fval,exitflag] = gamultiobj( ___ )
[x,fval,exitflag,output] = gamultiobj( ___ )
[x,fval,exitflag,output,population,scores] = gamultiobj( ___ )
Description
x = gamultiobj(fitnessfcn,nvars) finds x on the “Pareto Front” on page 11-57 of
the objective functions defined in fitnessfcn. nvars is the dimension of the
optimization problem (number of decision variables). The solution x is local, which means
it might not be on the global Pareto front.
Note “Passing Extra Parameters” (Optimization Toolbox) explains how to pass extra
parameters to the objective function and nonlinear constraint functions, if necessary.
11-29
11 Functions — Alphabetical List
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub,nonlcon) finds a
Pareto set subject to the constraints defined in nonlcon. The function nonlcon accepts
x and returns vectors c and ceq, representing the nonlinear inequalities and equalities
respectively. gamultiobj minimizes fitnessfcn such that c(x) ≤ 0 and
ceq(x) = 0. (Set lb = [] and ub = [] if no bounds exist.) gamultiobj supports
nonlinear constraints only for the default PopulationType option ('doubleVector').
x = gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub,options) or x =
gamultiobj(fitnessfcn,nvars,A,b,Aeq,beq,lb,ub,nonlcon,options) finds a
Pareto set x with the default optimization parameters replaced by values in options.
Create options using optimoptions (recommended) or gaoptimset.
[x,fval] = gamultiobj( ___ ), for any input variables, returns a matrix fval, the
value of all the fitness functions defined in fitnessfcn for all the solutions in x. fval
has nf columns, where nf is the number of objectives, and has the same number of rows
as x.
11-30
gamultiobj
Examples
Compute the Pareto front for a simple multiobjective problem. There are two objectives
and two decision variables x.
fitnessfcn = @(x)[norm(x)^2,0.5*norm(x(:)-[2;-1])^2+2];
Optimization terminated: average change in the spread of Pareto solutions less than opt
x =
-0.0072 0.0003
0.0947 -0.0811
1.0217 -0.6946
1.1254 -0.0857
-0.0072 0.0003
0.4489 -0.2101
1.8039 -0.8394
0.5115 -0.6314
1.5164 -0.7277
1.7082 -0.7006
11-31
11 Functions — Alphabetical List
To see the effect of a linear constraint on this problem, see “Multiobjective Problem with
Linear Constraint” on page 11-32.
This example shows how to find the Pareto front for a multiobjective problem in the
presence of a linear constraint.
fitnessfcn = @(x)[norm(x)^2,0.5*norm(x(:)-[2;-1])^2+2];
11-32
gamultiobj
Optimization terminated: average change in the spread of Pareto solutions less than opt
x =
-0.0006 0.0004
1.7518 -1.2508
1.3254 -0.8582
0.6863 -0.8279
-0.0006 0.0004
1.0625 -0.8347
0.5472 -0.6664
0.2908 -0.1038
1.5429 -1.0615
1.3770 -0.9921
11-33
11 Functions — Alphabetical List
To see the effect of removing the linear constraint from this problem, see “Simple
Multiobjective Problem” on page 11-31.
Find the Pareto front for the two fitness functions sin(x) and cos(x) on the interval
.
fitnessfcn = @(x)[sin(x),cos(x)];
nvars = 1;
lb = 0;
11-34
gamultiobj
ub = 2*pi;
rng default % for reproducibility
x = gamultiobj(fitnessfcn,nvars,[],[],[],[],lb,ub)
x =
3.1416
4.7124
3.6842
4.3171
3.2260
3.1416
3.8049
3.7220
4.5960
3.4626
Plot the solution. gamultiobj finds points along the entire Pareto front.
plot(sin(x),cos(x),'r*')
xlabel('sin(x)')
ylabel('cos(x)')
title('Pareto Front')
legend('Pareto front')
11-35
11 Functions — Alphabetical List
Compute and plot the Pareto front for the two-objective Schaffer's second function. This
function has a disconected Pareto front.
11-36
gamultiobj
y = zeros(length(x),2);
11-37
11 Functions — Alphabetical List
The two objective functions compete for x in the ranges [1,3] and [4,5]. But, the
Pareto-optimal front consists of only two disconnected regions, corresponding to the x in
the ranges [1,2] and [4,5]. There are disconnected regions because the region [2,3]
is inferior to [4,5]. In that range, objective 1 has the same values, but objective 2 is
smaller.
11-38
gamultiobj
Call gamultiobj.
Optimization terminated: average change in the spread of Pareto solutions less than opt
11-39
11 Functions — Alphabetical List
rng default
Set the fitness functions to kur_multiobjective, a function that has three control
variables and returns two fitness function values.
fitnessfcn = @kur_multiobjective;
nvars = 3;
function y = kur_multiobjective(x)
%KUR_MULTIOBJECTIVE Objective function for a multiobjective problem.
% The Pareto-optimal set for this two-objective problem is nonconvex as
% well as disconnected. The function KUR_MULTIOBJECTIVE computes two
% objectives and returns a vector y of size 2-by-1.
%
% Reference: Kalyanmoy Deb, "Multi-Objective Optimization using
% Evolutionary Algorithms", John Wiley & Sons ISBN 047187339
This function also appears in the example “Multiobjective Genetic Algorithm Options”.
Find the Pareto front and all other outputs for this problem.
11-40
gamultiobj
[x,fval,exitflag,output,population,scores] = gamultiobj(fitnessfcn,nvars,...
[],[],[],[],lb,ub);
Optimization terminated: average change in the spread of Pareto solutions less than opt
sizex =
18 3
sizepopulation =
50 3
sizescores =
50 2
The returned Pareto front contains 18 points. There are 50 members of the final
population. Each population row has three dimensions, coresponding to the three
decision variables. Each scores row has two dimensions, corresponding to the two
fitness functions.
Input Arguments
fitnessfcn — Fitness functions to optimize
function handle | function name
11-41
11 Functions — Alphabetical List
fitnessfcn is a function that accepts a real row vector of doubles x of length nvars and
returns a real vector F(x) of objective function values. For details on writing
fitnessfcn, see “Compute Objective Functions” on page 2-2.
If you set the UseVectorized option to true, then fitnessfcn accepts a matrix of size
n-by-nvars, where the matrix represents n individuals. fitnessfcn returns a matrix of
size n-by-m, where m is the number of objective functions. See “Vectorize the Fitness
Function” on page 5-112.
Example: @(x)[sin(x),cos(x)]
Data Types: char | function_handle | string
A*x <= b,
where x is the column vector of nvars variables x(:), and b is a column vector with M
elements.
x1 + 2x2 ≤ 10
3x1 + 4x2 ≤ 20
5x1 + 6x2 ≤ 30.
11-42
gamultiobj
Example: To set the sum of the x-components to 1 or less, take A = ones(1,N) and b =
1.
Data Types: double
A*x <= b,
where x is the column vector of nvars variables x(:), and A is a matrix of size M-by-
nvars.
x1 + 2x2 ≤ 10
3x1 + 4x2 ≤ 20
5x1 + 6x2 ≤ 30.
Example: To set the sum of the x-components to 1 or less, take A = ones(1,N) and b =
1.
Data Types: double
Aeq*x = beq,
where x is the column vector of nvars variables x(:), and beq is a column vector with
Me elements.
11-43
11 Functions — Alphabetical List
For example, give constraints Aeq = [1,2,3;2,4,1] and beq = [10;20] to specify
these sums:
x1 + 2x2 + 3x3 = 10
2x1 + 4x2 + x3 = 20.
Example: To set the sum of the x-components to 1, take Aeq = ones(1,N) and beq =
1.
Data Types: double
Aeq*x = beq,
where x is the column vector of nvars variables x(:), and Aeq is a matrix of size Meq-
by-N.
For example, give constraints Aeq = [1,2,3;2,4,1] and beq = [10;20] to specify
these sums:
x1 + 2x2 + 3x3 = 10
2x1 + 4x2 + x3 = 20.
Example: To set the sum of the x-components to 1, take Aeq = ones(1,N) and beq =
1.
Data Types: double
lb — Lower bounds
real vector | real array
Lower bounds, specified as a real vector or real array. If numel(lb) = nvars, then lb
specifies that x(i) >= lb(i) for all i.
11-44
gamultiobj
If numel(lb) < nvars, then lb specifies that x(i) >= lb(i) for
1 <= i <= numel(lb).
ub — Upper bounds
real vector | real array
Upper bounds, specified as a real vector or real array. If numel(ub) = nvars, then ub
specifies that x(i) <= ub(i) for all i.
If numel(ub) < nvars, then ub specifies that x(i) <= ub(i) for
1 <= i <= numel(ub).
If you set the UseVectorized option to true, then nonlcon accepts a matrix of size n-
by-nvars, where the matrix represents n individuals. nonlcon returns a matrix of size
n-by-mc in the first argument, where mc is the number of nonlinear inequality
constraints. nonlcon returns a matrix of size n-by-mceq in the second argument, where
mceq is the number of nonlinear equality constraints. See “Vectorize the Fitness
Function” on page 5-112.
11-45
11 Functions — Alphabetical List
Internally, gamultiobj sets the scores of infeasible population members to Inf. This
setting ensures that infeasible population members are ranked worst for reproduction.
See “Selection” on page 5-25, which describes how the scaled fitness value enters into
creating the next generation, and “Fitness Scaling Options” on page 10-37, which
describes the scaling of fitness values.
Data Types: char | function_handle | string
optimoptions hides the options listed in italics. See “Options that optimoptions Hides”
on page 10-76.
11-46
gamultiobj
11-47
11 Functions — Alphabetical List
11-48
gamultiobj
11-49
11 Functions — Alphabetical List
11-50
gamultiobj
11-51
11 Functions — Alphabetical List
11-52
gamultiobj
ga only: @gaplotbestf |
@gaplotbestindiv |
@gaplotexpectation |
@gaplotrange
gamultiobj only:
@gaplotpareto |
@gaplotparetodistance
| @gaplotrankhist |
@gaplotspread
PlotInterval Positive integer specifying the number of Positive integer | {1}
generations between consecutive calls to
the plot functions.
PopulationSize Size of the population. Positive integer | {50}
when numberOfVariables
<= 5, {200} otherwise |
{min(max(10*nvars,40),
100)} for mixed-integer
problems
11-53
11 Functions — Alphabetical List
Example: optimoptions('gamultiobj','PlotFcn',@gaplotpareto)
11-54
gamultiobj
Output Arguments
x — Pareto points
m-by-nvar array
Pareto points, returned as an m-by-nvar array, where m is the number of points on the
Pareto front. Each row of x represents one point on the Pareto front.
Function values on Pareto front, returned as an m-by-nf array. m is the number of points
on the Pareto front, and nf is the number of fitness functions. Each row of fval
represents the function values at one Pareto point in x.
11-55
11 Functions — Alphabetical List
Information about the optimization process, returned as a structure with these fields.
output Field Meaning
problemtype Type of problem:
• 'unconstrained' — No constraints
• 'boundconstraints' — Only bound constraints
• 'linearconstraints' — Linear constraints, with or
without bound constraints
• 'nonlinearconstr' — Nonlinear constraints, with or
without other types of constraints
rngstate State of the MATLAB random number generator, just before the
algorithm started. You can use the values in rngstate to
reproduce the output of gamultiobj. See “Reproduce Results”
on page 5-70.
generations Total number of generations, excluding HybridFcn iterations.
funccount Total number of function evaluations.
11-56
gamultiobj
Scores of the final population, returned as an n-by-nf array. n is the number of members
of the population, and nf is the number of fitness functions.
When there are nonlinear constraints, gamultiobj sets the scores of infeasible
population members to Inf.
Definitions
Pareto Front
A Pareto front is a set of points in parameter space (the space of decision variables) that
have noninferior fitness function values.
11-57
11 Functions — Alphabetical List
In other words, for each point on the Pareto front, you can improve one fitness function
only by degrading another. For details, see “What Is Multiobjective Optimization?” on
page 8-2
As in “Local vs. Global Optima” (Optimization Toolbox), it is possible for a Pareto front to
be local, but not global. “Local” means that the Pareto points can be noninferior
compared to nearby points, but points farther away in parameter space could have lower
function values in every component.
Algorithms
gamultiobj uses a controlled, elitist genetic algorithm (a variant of NSGA-II [1]). An
elitist GA always favors individuals with better fitness value (rank). A controlled elitist
GA also favors individuals that can help increase the diversity of the population even if
they have a lower fitness value. It is important to maintain the diversity of population for
convergence to an optimal Pareto front. Diversity is maintained by controlling the elite
members of the population as the algorithm progresses. Two options, ParetoFraction
and DistanceFcn, control the elitism. ParetoFraction limits the number of
individuals on the Pareto front (elite members). The distance function, selected by
DistanceFcn, helps to maintain diversity on a front by favoring individuals that are
relatively far away on the front. The algorithm stops if the spread, a measure of the
movement of the Pareto front, is small.
References
[1] Deb, Kalyanmoy. Multi-Objective Optimization Using Evolutionary Algorithms.
Chichester, England: John Wiley & Sons, 2001.
11-58
gamultiobj
See Also
ga | optimoptions
Topics
“Pareto Front for Two Objectives” on page 8-7
“Performing a Multiobjective Optimization Using the Genetic Algorithm”
“Multiobjective Genetic Algorithm Options”
“What Is Multiobjective Optimization?” on page 8-2
“Options and Syntax: Differences from ga” on page 8-6
Introduced in R2007b
11-59
11 Functions — Alphabetical List
gaoptimget
Obtain values of genetic algorithm options structure
Syntax
val = gaoptimget(options, 'name')
val = gaoptimget(options, 'name', default)
Description
val = gaoptimget(options, 'name') returns the value of the parameter name from
the genetic algorithm options structure options. gaoptimget(options, 'name')
returns an empty matrix [] if the value of name is not specified in options. It is only
necessary to type enough leading characters of name to uniquely identify it. gaoptimget
ignores case in parameter names.
See Also
ga | gamultiobj | gaoptimset
Topics
“Genetic Algorithm Options” on page 10-29
11-60
gaoptimset
gaoptimset
Create genetic algorithm options structure
Syntax
gaoptimset
options = gaoptimset
options = gaoptimset(@ga)
options = gaoptimset(@gamultiobj)
options = gaoptimset('param1',value1,'param2',value2,...)
options = gaoptimset(oldopts,'param1',value1,...)
options = gaoptimset(oldopts,newopts)
Description
gaoptimset with no input or output arguments displays a complete list of parameters
with their valid values.
11-61
11 Functions — Alphabetical List
Options
The following table lists the options you can set with gaoptimset. See “Genetic
Algorithm Options” on page 10-29 for a complete description of these options and their
values. Values in {} denote the default value. {}* means the default when there are
linear constraints, and for MutationFcn also when there are bounds. You can also view
the optimization parameters and defaults by typing gaoptimset at the command line. I*
indicates that ga ignores or overwrites the option for mixed integer optimization
problems.
optimoptions hides the options listed in italics, but gaoptimset does not. See
“Options that optimoptions Hides” on page 10-76.
11-62
gaoptimset
11-63
11 Functions — Alphabetical List
11-64
gaoptimset
11-65
11 Functions — Alphabetical List
11-66
gaoptimset
11-67
11 Functions — Alphabetical List
11-68
gaoptimset
ga only: @gaplotbestf |
@gaplotbestindiv |
@gaplotexpectation |
@gaplotrange
gamultiobj only:
@gaplotpareto |
@gaplotparetodistance
| @gaplotrankhist |
@gaplotspread
PlotInterval Positive integer specifying the number of Positive integer | {1}
generations between consecutive calls to
the plot functions.
PopulationSize Size of the population. Positive integer | {50}
when numberOfVariables
<= 5, {200} otherwise |
{min(max(10*nvars,40),
100)} for mixed-integer
problems
11-69
11 Functions — Alphabetical List
See Also
ga | gamultiobj | gaoptimget
11-70
gaoptimset
Topics
“Genetic Algorithm Options” on page 10-29
11-71
11 Functions — Alphabetical List
GlobalOptimSolution
Optimization solution
Description
A GlobalOptimSolution object contains information on a local minimum, including
location, objective function value, and start point or points that lead to the minimum.
Creation
When you execute run and request the “solutions” on page 11-0 output,
GlobalSearch and MultiStart create GlobalOptimSolution objects as output.
Properties
Exitflag — Exit condition of local solver
integer
Exit condition of the local solver, returned as an integer. Generally, a positive Exitflag
corresponds to a local optimum, and a zero or negative Exitflag corresponds to an
unsuccessful search for a local minimum.
For the exact meaning of each Exitflag, see the exitflag description in the
appropriate local solver function reference page:
• fmincon exitflag
11-72
GlobalOptimSolution
• fminunc exitflag
• lsqcurvefit exitflag
• lsqnonlin exitflag
Objective function value, returned as a real scalar. For the lsqnonlin and
lsqcurvefit solvers, Fval is the sum of squares of the residual.
Data Types: double
Output structure returned by the local solver. For details, see the output description in
the appropriate local solver function reference page:
• fmincon output
• fminunc output
• lsqcurvefit output
• lsqnonlin output
X — Local solution
array
Start points that lead to current solution, returned as a cell array. Control the distance
between points considered as distinct by setting the FunctionTolerance and
XTolerance properties of the global solver.
11-73
11 Functions — Alphabetical List
Examples
Obtain Multiple Local Solutions
All 30 local solver runs converged with a positive local solver exit flag.
disp(allmins)
X
Fval
Exitflag
Output
X0
11-74
GlobalOptimSolution
To examine the initial points that lead to the various solutions, see “Visualize the Basins
of Attraction” on page 3-36.
See Also
GlobalSearch | MultiStart | run
Topics
“Visualize the Basins of Attraction” on page 3-36
11-75
11 Functions — Alphabetical List
Introduced in R2010a
11-76
GlobalSearch
GlobalSearch
Find global minimum
Description
A GlobalSearch object contains properties (options) that affect how run repeatedly
runs a local solver to generate a GlobalOptimSolution object. When run, the solver
attempts to locate a solution that has the lowest objective function value.
Creation
Syntax
gs = GlobalSearch
gs = GlobalSearch(Name,Value)
gs = GlobalSearch(oldGS,Name,Value)
gs = GlobalSearch(ms)
Description
gs = GlobalSearch creates gs, a GlobalSearch solver with its properties set to the
defaults.
11-77
11 Functions — Alphabetical List
Properties
BasinRadiusFactor — Basin radius decrease factor
0.2 (default) | scalar from 0 through 1
Basin radius decrease factor, specified as a scalar from 0 through 1. A basin radius
decreases after MaxWaitCycle consecutive start points are within the basin. The basin
radius decreases by a factor of 1 – BasinRadiusFactor.
Level of display to the Command Window, specified as one of the following character
vectors or strings:
Example: 'iter'
Data Types: char | string
11-78
GlobalSearch
MaxTime does not interrupt local solvers during a run, so the total time can exceed
MaxTime.
Example: 180 stops the solver the first time a local solver call finishes after 180 seconds.
Data Types: double
Example: 40
Data Types: double
11-79
11 Functions — Alphabetical List
Number of Stage 1 points, specified as a positive integer. For details, see “Obtain Stage 1
Start Point, Run” on page 3-51.
Example: 1000
Data Types: double
Report on solver progress or halt solver, specified as a function handle or cell array of
function handles. Output functions run after each local solver call. They also run when
the global solver starts and ends. Write output functions using the syntax described in
“OutputFcn” on page 10-4. See “GlobalSearch Output Function” on page 3-39.
Data Types: cell | function_handle
Increase in the penalty threshold, specified as a positive scalar. For details, see React to
Large Counter Values on page 3-54.
Example: 0.4
Data Types: double
Plot solver progress, specified as a function handle or cell array of function handles. Plot
functions run after each local solver call. They also run when the global solver starts and
ends. Write plot functions using the syntax described in “OutputFcn” on page 10-4.
11-80
GlobalSearch
11-81
11 Functions — Alphabetical List
Object Functions
run Run multiple-start solver
Examples
Create an optimization problem that has several local minima, and try to find the global
minimum using GlobalSearch. The objective is the six-hump camel back problem (see
“Run the Solver” on page 3-21).
All 8 local solver runs converged with a positive local solver exit flag.
x =
-0.0898 0.7127
You can request the objective function value at x when you call run by using the
following syntax:
[x,fval] = run(gs,problem)
However, if you neglected to request fval, you can still compute the objective function
value at x.
fval = sixmin(x)
fval = -1.0316
11-82
GlobalSearch
To search for the global minimum, run GlobalSearch using the fmincon 'sqp'
algorithm.
rng default % For reproducibility
opts = optimoptions(@fmincon,'Algorithm','sqp');
11-83
11 Functions — Alphabetical List
problem = createOptimProblem('fmincon','objective',...
fun,'x0',3,'lb',-5,'ub',5,'options',opts);
gs = GlobalSearch;
[x,f] = run(gs,problem)
All 23 local solver runs converged with a positive local solver exit flag.
x = -0.3080
f = -3.9032
ms =
MultiStart with properties:
UseParallel: 1
Display: 'final'
FunctionTolerance: 2.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
Create a GlobalSearch object that uses the available properties from ms.
gs = GlobalSearch(ms)
gs =
GlobalSearch with properties:
NumTrialPoints: 1000
BasinRadiusFactor: 0.2000
DistanceThresholdFactor: 0.7500
MaxWaitCycle: 20
11-84
GlobalSearch
NumStageOnePoints: 200
PenaltyThresholdFactor: 0.2000
Display: 'final'
FunctionTolerance: 2.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
gs has the same nondefault value of FunctionTolerance as ms. But gs does not use
the UseParallel property.
gs =
GlobalSearch with properties:
NumTrialPoints: 1000
BasinRadiusFactor: 0.2000
DistanceThresholdFactor: 0.7500
MaxWaitCycle: 20
NumStageOnePoints: 200
PenaltyThresholdFactor: 0.2000
Display: 'final'
FunctionTolerance: 1.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
11-85
11 Functions — Alphabetical List
gs =
GlobalSearch with properties:
NumTrialPoints: 1000
BasinRadiusFactor: 0.2000
DistanceThresholdFactor: 0.7500
MaxWaitCycle: 20
NumStageOnePoints: 200
PenaltyThresholdFactor: 0.2000
Display: 'final'
FunctionTolerance: 1.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'bounds'
XTolerance: 1.0000e-03
You can also update properties one at a time by using dot notation.
gs.MaxTime = 1800
gs =
GlobalSearch with properties:
NumTrialPoints: 1000
BasinRadiusFactor: 0.2000
DistanceThresholdFactor: 0.7500
MaxWaitCycle: 20
NumStageOnePoints: 200
PenaltyThresholdFactor: 0.2000
Display: 'final'
FunctionTolerance: 1.0000e-04
MaxTime: 1800
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'bounds'
XTolerance: 1.0000e-03
11-86
GlobalSearch
Algorithms
For a detailed description of the algorithm, see “GlobalSearch Algorithm” on page 3-50.
Ugray et al. [1] describe both the algorithm and the scatter-search method of generating
trial points.
References
[1] Ugray, Zsolt, Leon Lasdon, John Plummer, Fred Glover, James Kelly, and Rafael
Martí. Scatter Search and Local NLP Solvers: A Multistart Framework for Global
Optimization. INFORMS Journal on Computing, Vol. 19, No. 3, 2007, pp. 328–
340.
See Also
GlobalOptimSolution | MultiStart | run
Topics
“Example of Run with GlobalSearch” on page 3-22
“Workflow for GlobalSearch and MultiStart” on page 3-3
Introduced in R2010a
11-87
11 Functions — Alphabetical List
list
List start points
Syntax
points = list(tpoints)
points = list(rs,problem)
Description
points = list(tpoints) returns the points inside the tpoints
CustomStartPointSet object.
Examples
Create CustomStartPointSet
Extract the original matrix from the tpoints object by using list.
tpts = list(tpoints);
11-88
list
ans =
logical
Create RandomStartPointSet
rs = RandomStartPointSet('NumStartPoints',40);
Create a problem with 3-D variables, lower bounds of 0, and upper bounds of
[10,20,30].
problem = createOptimProblem('fmincon','x0',rand(3,1),'lb',zeros(3,1),'ub',[10,20,30]);
largest = 29.8840
smallest = min(min(points))
smallest = 0.1390
Input Arguments
tpoints — Start points
CustomStartPointSet object
Start points, specified as a CustomStartPointSet object. list extracts the points into
a matrix where each row is one start point.
11-89
11 Functions — Alphabetical List
Output Arguments
points — Start points
k-by-n matrix
Start points, returned as a k-by-n matrix. Each row of the matrix represents one start
point.
11-90
list
See Also
CustomStartPointSet | MultiStart | RandomStartPointSet
Topics
“Workflow for GlobalSearch and MultiStart” on page 3-3
Introduced in R2010a
11-91
11 Functions — Alphabetical List
MultiStart
Find multiple local minima
Description
A MultiStart object contains properties (options) that affect how run repeatedly runs a
local solver to generate a GlobalOptimSolution object. When run, the solver attempts
to find multiple local solutions to a problem by starting from various points.
Creation
Syntax
ms = MultiStart
ms = MultiStart(Name,Value)
ms = MultiStart(oldMS,Name,Value)
ms = MultiStart(gs)
Description
ms = MultiStart creates ms, a MultiStart solver with its properties set to the
defaults.
11-92
MultiStart
Properties
Display — Level of display to Command Window
'final' (default) | 'iter' | 'off'
Level of display to the Command Window, specified as one of the following character
arrays or strings:
Example: 'iter'
Data Types: char | string
Maximum time in seconds that MultiStart runs, specified as a positive scalar. MultiStart
and its local solvers halt when MaxTime seconds have passed since the beginning of the
run, as measured by tic and toc.
MaxTime does not interrupt local solvers during a run, so the total time can exceed
MaxTime.
11-93
11 Functions — Alphabetical List
Example: 180 stops the solver the first time a local solver call finishes after 180 seconds.
Data Types: double
Report on solver progress or halt solver, specified as a function handle or cell array of
function handles. Output functions run after each local solver call. They also run when
the global solver starts and ends. Write output functions using the syntax described in
“OutputFcn” on page 10-4. See “GlobalSearch Output Function” on page 3-39.
Data Types: cell | function_handle
Plot solver progress, specified as a function handle or cell array of function handles. Plot
functions run after each local solver call. They also run when the global solver starts and
ends. Write plot functions using the syntax described in “OutputFcn” on page 10-4.
11-94
MultiStart
Example: true
Data Types: logical
Object Functions
run Run multiple-start solver
Examples
Run MultiStart
11-95
11 Functions — Alphabetical List
To search for the global minimum, run MultiStart on 20 instances of the problem using
the fmincon 'sqp' algorithm.
11-96
MultiStart
All 20 local solver runs converged with a positive local solver exit flag.
x = -0.3080
f = -3.9032
ms = MultiStart
ms =
MultiStart with properties:
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-06
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
Create a MultiStart object with looser tolerances than default, so the solver returns
fewer solutions that are close to each other. Also, have MultiStart run only initial
points that are feasible with respect to bounds and inequality constraints.
ms = MultiStart('FunctionTolerance',2e-4,'XTolerance',5e-3,...
'StartPointsToRun','bounds-ineqs')
ms =
MultiStart with properties:
11-97
11 Functions — Alphabetical List
UseParallel: 0
Display: 'final'
FunctionTolerance: 2.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'bounds-ineqs'
XTolerance: 0.0050
gs = GlobalSearch('FunctionTolerance',2e-4,'NumTrialPoints',2000)
gs =
GlobalSearch with properties:
NumTrialPoints: 2000
BasinRadiusFactor: 0.2000
DistanceThresholdFactor: 0.7500
MaxWaitCycle: 20
NumStageOnePoints: 200
PenaltyThresholdFactor: 0.2000
Display: 'final'
FunctionTolerance: 2.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
Create a MultiStart object that uses the available properties from gs.
ms = MultiStart(gs)
ms =
MultiStart with properties:
UseParallel: 0
11-98
MultiStart
Display: 'final'
FunctionTolerance: 2.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
ms has the same nondefault value of FunctionTolerance as gs. But ms does not use
the NumTrialPoints property.
ms =
MultiStart with properties:
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-04
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'all'
XTolerance: 1.0000e-06
ms =
MultiStart with properties:
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-04
11-99
11 Functions — Alphabetical List
MaxTime: Inf
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'bounds'
XTolerance: 1.0000e-03
You can also update properties one at a time by using dot notation.
ms.MaxTime = 1800
ms =
MultiStart with properties:
UseParallel: 0
Display: 'final'
FunctionTolerance: 1.0000e-04
MaxTime: 1800
OutputFcn: []
PlotFcn: []
StartPointsToRun: 'bounds'
XTolerance: 1.0000e-03
Algorithms
For a detailed description of the algorithm, see “MultiStart Algorithm” on page 3-54.
See Also
CustomStartPointSet | GlobalOptimSolution | GlobalSearch |
RandomStartPointSet | run
Topics
“Global or Multiple Starting Point Search”
“Parallel Computing”
11-100
MultiStart
Introduced in R2010a
11-101
11 Functions — Alphabetical List
particleswarm
Particle swarm optimization
Syntax
x = particleswarm(fun,nvars)
x = particleswarm(fun,nvars,lb,ub)
x = particleswarm(fun,nvars,lb,ub,options)
x = particleswarm(problem)
[x,fval,exitflag,output] = particleswarm( ___ )
Description
x = particleswarm(fun,nvars) attempts to find a vector x that achieves a local
minimum of fun. nvars is the dimension (number of design variables) of fun.
Note “Passing Extra Parameters” (Optimization Toolbox) explains how to pass extra
parameters to the objective function, if necessary.
11-102
particleswarm
Examples
x =
629.4474 311.4814
This solution is far from the true minimum, as you see in a function plot.
fsurf(@(x,y)x.*exp(-(x.^2+y.^2)))
11-103
11 Functions — Alphabetical List
Usually, it is best to set bounds. See “Minimize a Simple Function with Bounds” on page
11-104.
11-104
particleswarm
lb = [-10,-15];
ub = [15,20];
x =
-0.7071 -0.0000
Use a larger population and a hybrid function to try to get a better solution.
fun = @(x)x(1)*exp(-norm(x)^2);
lb = [-10,-15];
ub = [15,20];
options = optimoptions('particleswarm','SwarmSize',100,'HybridFcn',@fmincon);
x =
11-105
11 Functions — Alphabetical List
-0.7071 -0.0000
Return the optional output arguments to examine the solution process in more detail.
Call particleswarm with all outputs to minimize the function and get information
about the solution process.
rng default % For reproducibility
nvars = 2;
[x,fval,exitflag,output] = particleswarm(fun,nvars,lb,ub,options)
x =
-0.7071 -0.0000
fval =
-0.4289
exitflag =
output =
11-106
particleswarm
Input Arguments
fun — Objective function
function handle | function name
lb — Lower bounds
[] (default) | real vector or array
Lower bounds, specified as a vector or array of doubles. lb represents the lower bounds
elementwise in lb ≤ x ≤ ub.
11-107
11 Functions — Alphabetical List
ub — Upper bounds
[] (default) | real vector or array
Upper bounds, specified as a vector or array of doubles. ub represents the upper bounds
element-wise in lb ≤ x ≤ ub.
Some options are absent from the optimoptions display. These options are listed in
italics. For details, see “View Options” (Optimization Toolbox).
CreationFcn Handle to the function that creates the initial swarm. Default is
@pswcreationuniform. See “Swarm Creation” on page 10-59.
Display Level of display returned to the command line.
11-108
particleswarm
FunValCheck Check whether objective function and constraints values are valid.
'on' displays an error when the objective function or constraints
return a value that is complex, Inf, or NaN. The default, 'off',
displays no error.
HybridFcn Handle to a function that continues the optimization after
particleswarm terminates. Possible values:
• @fmincon
• @fminsearch
• @fminunc
• @patternsearch
Can also be a cell array specifying the hybrid function and its
options structure, such as {@fmincon,fminconopts}. Default is
[]. See “Hybrid Function” on page 10-62.
InertiaRange Two-element real vector with same sign values in increasing order.
Gives the lower and upper bound of the adaptive inertia. To obtain a
constant (nonadaptive) inertia, set both elements of InertiaRange
to the same value. Default is [0.1,1.1]. See “Particle Swarm
Optimization Algorithm” on page 6-11.
InitialSwarmMat Initial population or partial population of particles. M-by-nvars
rix matrix, where each row represents one particle. If M < SwarmSize,
then particleswarm creates more particles so that the total
number is SwarmSize. If M > SwarmSize, then particleswarm
uses the first SwarmSize rows.
InitialSwarmSpa Initial range of particle positions that @pswcreationuniform
n creates. Can be a positive scalar or a vector with nvars elements,
where nvars is the number of variables. The range for any particle
component is -InitialSwarmSpan/2,InitialSwarmSpan/2,
shifted and scaled if necessary to match any bounds. Default is
2000.
11-109
11 Functions — Alphabetical List
MaxStallIterati Positive integer with default 20. Iterations end when the relative
ons change in best objective function value over the last
MaxStallIterations iterations is less than
options.FunctionTolerance.
MaxStallTime Maximum number of seconds without an improvement in the best
known objective function value. Positive scalar with default Inf.
MaxTime Maximum time in seconds that particleswarm runs. Default is
Inf.
MinNeighborsFra Minimum adaptive neighborhood size, a scalar from 0 to 1. Default
ction is 0.25. See “Particle Swarm Optimization Algorithm” on page 6-11.
ObjectiveLimit Minimum objective value, a stopping criterion. Scalar, with default
-Inf.
OutputFcn Function handle or cell array of function handles. Output functions
can read iterative data, and stop the solver. Default is []. See
“Output Function and Plot Function” on page 10-63.
PlotFcn Function handle or cell array of function handles. Plot functions can
read iterative data, plot each iteration, and stop the solver. Default
is []. Available built-in plot function: @pswplotbestf. See “Output
Function and Plot Function” on page 10-63.
SelfAdjustmentW Weighting of each particle’s best position when adjusting velocity.
eight Finite scalar with default 1.49. See “Particle Swarm Optimization
Algorithm” on page 6-11.
SocialAdjustmen Weighting of the neighborhood’s best position when adjusting
tWeight velocity. Finite scalar with default 1.49. See “Particle Swarm
Optimization Algorithm” on page 6-11.
SwarmSize Number of particles in the swarm, an integer greater than 1.
Default is min(100,10*nvars), where nvars is the number of
variables.
UseParallel Compute objective function in parallel when true. Default is false.
See “Parallel or Vectorized Function Evaluation” on page 10-65.
UseVectorized Compute objective function in vectorized fashion when true.
Default is false. See “Parallel or Vectorized Function Evaluation”
on page 10-65.
11-110
particleswarm
Output Arguments
x — Solution
real vector
Solution, returned as a real vector that minimizes the objective function subject to any
bound constraints.
11-111
11 Functions — Alphabetical List
Algorithms
For a description of the particle swarm optimization algorithm, see “Particle Swarm
Optimization Algorithm” on page 6-11.
See Also
ga | patternsearch
Topics
“Optimize Using Particle Swarm” on page 6-3
11-112
particleswarm
Introduced in R2014b
11-113
11 Functions — Alphabetical List
patternsearch
Find minimum of function using pattern search
Syntax
x = patternsearch(fun,x0)
x = patternsearch(fun,x0,A,b)
x = patternsearch(fun,x0,A,b,Aeq,beq)
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub)
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub,nonlcon)
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub,nonlcon,options)
x = patternsearch(problem)
[x,fval] = patternsearch( ___ )
[x,fval,exitflag,output] = patternsearch( ___ )
Description
x = patternsearch(fun,x0) finds a local minimum, x, to the function handle fun
that computes the values of the objective function. x0 is a real vector specifying an initial
point for the pattern search algorithm.
Note “Passing Extra Parameters” (Optimization Toolbox) explains how to pass extra
parameters to the objective function and nonlinear constraint functions, if necessary.
11-114
patternsearch
lb ≤ x ≤ ub. If no linear equalities exist, set Aeq = [] and beq = []. If x(i) has no
lower bound, set lb(i) = -Inf. If x(i) has no upper bound, set ub(i) = Inf.
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub,nonlcon,options) minimizes
fun with the optimization options specified in options. Use optimoptions
(recommended) or psoptimset to set these options. If there are no nonlinear inequality
or equality constraints, set nonlcon = [].
[x,fval] = patternsearch( ___ ), for any syntax, returns the value of the objective
function fun at the solution x.
Examples
Create the following two-variable objective function. On your MATLAB® path, save the
following code to a file named psobj.m.
function y = psobj(x)
11-115
11 Functions — Alphabetical List
fun = @psobj;
x0 = [0,0];
x = patternsearch(fun,x0)
x =
-0.7037 -0.1860
Create the following two-variable objective function. On your MATLAB® path, save the
following code to a file named psobj.m.
function y = psobj(x)
fun = @psobj;
x0 = [0.5,-0.5];
x = patternsearch(fun,x0,A,b)
11-116
patternsearch
x =
5.2824 -1.8758
Create the following two-variable objective function. On your MATLAB® path, save the
following code to a file named psobj.m.
function y = psobj(x)
fun = @psobj;
lb = [0,-Inf];
ub = [Inf,-3];
A = [];
b = [];
Aeq = [];
beq = [];
x0 = [1,-5];
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub)
x =
11-117
11 Functions — Alphabetical List
0.1880 -3.0000
Create the following two-variable objective function. On your MATLAB® path, save the
following code to a file named psobj.m.
function y = psobj(x)
fun = @psobj;
To do so, on your MATLAB path, save the following code to a file named
ellipsetilt.m.
x0 = [-2,-2];
A = [];
b = [];
Aeq = [];
beq = [];
lb = [];
ub = [];
11-118
patternsearch
nonlcon = @ellipsetilt;
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub,nonlcon)
x =
-1.5144 0.0874
Create the following two-variable objective function. On your MATLAB path, save the
following code to a file named psobj.m.
function y = psobj(x)
fun = @psobj;
Set options to give iterative display and to plot the objective function at each iteration.
options = optimoptions('patternsearch','Display','iter','PlotFcn',@psplotbestf);
Find the unconstrained minimum of the objective starting from the point [0,0].
x0 = [0,0];
A = [];
b = [];
Aeq = [];
beq = [];
lb = [];
ub = [];
nonlcon = [];
x = patternsearch(fun,x0,A,b,Aeq,beq,lb,ub,nonlcon,options)
11-119
11 Functions — Alphabetical List
(output trimmed)
11-120
patternsearch
x =
-0.7037 -0.1860
Find a minimum value of a function and report both the location and value of the
minimum.
Create the following two-variable objective function. On your MATLAB® path, save the
following code to a file named psobj.m.
function y = psobj(x)
fun = @psobj;
Find the unconstrained minimum of the objective, starting from the point [0,0]. Return
both the location of the minimum, x, and the value of fun(x).
x0 = [0,0];
[x,fval] = patternsearch(fun,x0)
x =
-0.7037 -0.1860
fval =
-7.0254
11-121
11 Functions — Alphabetical List
Create the following two-variable objective function. On your MATLAB® path, save the
following code to a file named psobj.m.
function y = psobj(x)
fun = @psobj;
Find the unconstrained minimum of the objective, starting from the point [0,0]. Return
the solution, x, the objective function value at the solution, fun(x), the exit flag, and the
output structure.
x0 = [0,0];
[x,fval,exitflag,output] = patternsearch(fun,x0)
x =
-0.7037 -0.1860
fval =
-7.0254
exitflag =
output =
11-122
patternsearch
function: @psobj
problemtype: 'unconstrained'
pollmethod: 'gpspositivebasis2n'
maxconstraint: []
searchmethod: []
iterations: 66
funccount: 229
meshsize: 9.5367e-07
rngstate: [1x1 struct]
message: 'Optimization terminated: mesh size less than options.MeshTolerance.
Input Arguments
fun — Function to be minimized
function handle | function name
11-123
11 Functions — Alphabetical List
function f = myfun(x)
f = ... % Compute function value at x
x = patternsearch(@(x)norm(x)^2,x0,A,b);
x0 — Initial point
real vector
Initial point, specified as a real vector. patternsearch uses the number of elements in
x0 to determine the number of variables that fun accepts.
Example: x0 = [1,2,3,4]
Data Types: double
A*x <= b,
where x is the column vector of N variables x(:), and b is a column vector with M
elements.
x1 + 2x2 ≤ 10
3x1 + 4x2 ≤ 20
5x1 + 6x2 ≤ 30,
11-124
patternsearch
A = [1,2;3,4;5,6];
b = [10;20;30];
Example: To specify that the control variables sum to 1 or less, give the constraints A =
ones(1,N) and b = 1
Data Types: double
A*x <= b,
where x is the column vector of N variables x(:), and A is a matrix of size M-by-N.
x1 + 2x2 ≤ 10
3x1 + 4x2 ≤ 20
5x1 + 6x2 ≤ 30,
Example: To specify that the control variables sum to 1 or less, give the constraints A =
ones(1,N) and b = 1
Data Types: double
Linear equality constraints, specified as a real matrix. Aeq is an Me-by-N matrix, where
Me is the number of equalities, and N is the number of variables (number of elements in
x0).
11-125
11 Functions — Alphabetical List
Aeq*x = beq,
where x is the column vector of N variables x(:), and beq is a column vector with Me
elements.
x1 + 2x2 + 3x3 ≤ 10
2x1 + 4x2 + x3 ≤ 20,
Example: To specify that the control variables sum to 1, give the constraints Aeq =
ones(1,N) and beq = 1
Data Types: double
Aeq*x = beq,
where x is the column vector of N variables x(:), and Aeq is a matrix of size Meq-by-N.
x1 + 2x2 + 3x3 ≤ 10
2x1 + 4x2 + x3 ≤ 20,
11-126
patternsearch
Example: To specify that the control variables sum to 1, give the constraints Aeq =
ones(1,N) and beq = 1
Data Types: double
lb — Lower bounds
real vector | real array
Lower bounds, specified as a real vector or real array. If the number of elements in x0 is
equal to that of lb, then lb specifies that
for all i.
for
ub — Upper bounds
real vector | real array
Upper bounds, specified as a real vector or real array. If the number of elements in x0 is
equal to that of ub, then ub specifies that
for all i.
for
11-127
11 Functions — Alphabetical List
c(x) <= 0
ceq(x) = 0
For example,
x = patternsearch(@myfun,x0,A,b,Aeq,beq,lb,ub,@mycon)
11-128
patternsearch
optimoptions hides the options listed in italics; see “Options that optimoptions Hides”
on page 10-76.
Option Description Values
AccelerateMesh Accelerate convergence near a minimum. true | {false}
patternsearch contracts by an extra factor of
1/2.
11-129
11 Functions — Alphabetical List
11-130
patternsearch
11-131
11 Functions — Alphabetical List
11-132
patternsearch
11-133
11 Functions — Alphabetical List
Create the structure problem by exporting a problem from the Optimization app, as
described in “Importing and Exporting Your Work” (Optimization Toolbox).
Output Arguments
x — Solution
real vector
Solution, returned as a real vector. The size of x is the same as the size of x0. When
exitflag is positive, x is typically a local solution to the problem.
11-134
patternsearch
In the nonlinear constraint solver, the complementarity measure is the norm of the
vector whose elements are ciλi, where ci is the nonlinear inequality constraint violation,
and λi is the corresponding Lagrange multiplier.
Information about the optimization process, returned as a structure with these fields:
• 'unconstrained'
• 'boundconstraints'
• 'linearconstraints'
• 'nonlinearconstr'
• pollmethod — Polling technique.
• searchmethod — Search technique used, if any.
• iterations — Total number of iterations.
11-135
11 Functions — Alphabetical List
Algorithms
By default, patternsearch looks for a minimum based on an adaptive mesh that, in the
absence of linear constraints, is aligned with the coordinate directions. See “What Is
Direct Search?” on page 4-2 and “How Pattern Search Polling Works” on page 4-16.
References
[1] Audet, Charles, and J. E. Dennis Jr. “Analysis of Generalized Pattern Searches.”
SIAM Journal on Optimization. Volume 13, Number 3, 2003, pp. 889–903.
[2] Conn, A. R., N. I. M. Gould, and Ph. L. Toint. “A Globally Convergent Augmented
Lagrangian Barrier Algorithm for Optimization with General Inequality
Constraints and Simple Bounds.” Mathematics of Computation. Volume 66,
Number 217, 1997, pp. 261–288.
[3] Abramson, Mark A. Pattern Search Filter Algorithms for Mixed Variable General
Constrained Optimization Problems. Ph.D. Thesis, Department of Computational
and Applied Mathematics, Rice University, August 2002.
[4] Abramson, Mark A., Charles Audet, J. E. Dennis, Jr., and Sebastien Le Digabel.
“ORTHOMADS: A deterministic MADS instance with orthogonal directions.”
SIAM Journal on Optimization. Volume 20, Number 2, 2009, pp. 948–966.
[5] Kolda, Tamara G., Robert Michael Lewis, and Virginia Torczon. “Optimization by
direct search: new perspectives on some classical and modern methods.” SIAM
Review. Volume 45, Issue 3, 2003, pp. 385–482.
11-136
patternsearch
[6] Kolda, Tamara G., Robert Michael Lewis, and Virginia Torczon. “A generating set
direct search augmented Lagrangian algorithm for optimization with a
combination of general and linear constraints.” Technical Report
SAND2006-5315, Sandia National Laboratories, August 2006.
[7] Lewis, Robert Michael, Anne Shepherd, and Virginia Torczon. “Implementing
generating set search methods for linearly constrained minimization.” SIAM
Journal on Scientific Computing. Volume 29, Issue 6, 2007, pp. 2507–2530.
See Also
ga | optimoptions | psoptimset
Topics
“Optimize Using the GPS Algorithm” on page 4-8
“Coding and Minimizing an Objective Function Using Pattern Search”
“Constrained Minimization Using Pattern Search”
“Pattern Search Options”
“Optimize an ODE in Parallel” on page 4-86
“Pattern Search Climbs Mount Washington”
“Optimization Workflow” on page 1-17
“What Is Direct Search?” on page 4-2
“Pattern Search Terminology” on page 4-13
“How Pattern Search Polling Works” on page 4-16
“Polling Types” on page 4-49
“Search and Poll” on page 4-34
11-137
11 Functions — Alphabetical List
psoptimget
Obtain values of pattern search options structure
Syntax
val = psoptimget(options,'name')
val = psoptimget(options,'name',default)
Description
val = psoptimget(options,'name') returns the value of the parameter name from
the pattern search options structure options. psoptimget(options,'name') returns
an empty matrix [] if the value of name is not specified in options. It is only necessary
to type enough leading characters of name to uniquely identify it. psoptimget ignores
case in parameter names.
Examples
opts = psoptimset('TolX',1e-4);
val = psoptimget(opts,'TolX')
See Also
patternsearch | psoptimset
Topics
“Pattern Search Options” on page 10-9
11-138
psoptimget
11-139
11 Functions — Alphabetical List
psoptimset
Create pattern search options structure
Syntax
psoptimset
options = psoptimset
options = psoptimset(@patternsearch)
options = psoptimset('param1',value1,'param2',value2,...)
options = psoptimset(oldopts,'param1',value1,...)
options = psoptimset(oldopts,newopts)
Description
psoptimset with no input or output arguments displays a complete list of parameters
with their valid values.
11-140
psoptimset
Options
The following table lists the options you can set with psoptimset. See “Pattern Search
Options” on page 10-9 for a complete description of the options and their values. Values
in {} denote the default value. You can also view the optimization parameters and
defaults by typing psoptimset at the command line.
optimoptions hides the options listed in italics, but psoptimset does not. See
“Options that optimoptions Hides” on page 10-76.
Option Description Values
AccelerateMesh Accelerate convergence near a minimum. true | {false}
patternsearch contracts by an extra factor of
1/2.
11-141
11 Functions — Alphabetical List
11-142
psoptimset
11-143
11 Functions — Alphabetical List
11-144
psoptimset
See Also
optimoptions | patternsearch | psoptimget
11-145
11 Functions — Alphabetical List
RandomStartPointSet
Random start points
Description
A RandomStartPointSet object describes how to generate a set of pseudorandom points
for use with MultiStart. A RandomStartPointSet object does not contain points. It
contains parameters for generating the points when MultiStart runs or when you use
the list function.
Creation
Syntax
rs = RandomStartPointSet
rs = RandomStartPointSet(Name,Value)
rs = RandomStartPointSet(oldrs,Name,Value)
Description
rs = RandomStartPointSet creates a default RandomStartPointSet object.
Properties
ArtificialBound — Absolute value of default bounds for unbounded components
1000 (default) | positive scalar
11-146
RandomStartPointSet
Absolute value of the default bounds for unbounded components, specified as a positive
scalar.
Example: 1e2
Data Types: double
Object Functions
list List start points
Examples
rs = RandomStartPointSet
rs =
RandomStartPointSet with properties:
NumStartPoints: 10
ArtificialBound: 1000
Create RandomStartPointSet
11-147
11 Functions — Alphabetical List
rs = RandomStartPointSet('NumStartPoints',40);
Create a problem with 3-D variables, lower bounds of 0, and upper bounds of
[10,20,30].
problem = createOptimProblem('fmincon','x0',rand(3,1),'lb',zeros(3,1),'ub',[10,20,30]);
points = list(rs,problem);
largest = max(max(points))
largest = 29.8840
smallest = min(min(points))
smallest = 0.1390
Update RandomStartPointSet
rs = RandomStartPointSet('NumStartPoints',50)
rs =
RandomStartPointSet with properties:
NumStartPoints: 50
ArtificialBound: 1000
rs = RandomStartPointSet(rs,'NumStartPoints',100,'ArtificialBound',1e4)
rs =
RandomStartPointSet with properties:
NumStartPoints: 100
11-148
RandomStartPointSet
ArtificialBound: 10000
rs =
RandomStartPointSet with properties:
NumStartPoints: 100
ArtificialBound: 500
See Also
CustomStartPointSet | MultiStart | list
Topics
“Workflow for GlobalSearch and MultiStart” on page 3-3
Introduced in R2010a
11-149
11 Functions — Alphabetical List
run
Run multiple-start solver
Syntax
x = run(gs,problem)
x = run(ms,problem,k)
x = run(ms,problem,startpts)
[x,fval] = run( ___ )
[x,fval,exitflag,output] = run( ___ )
[x,fval,exitflag,output,solutions] = run( ___ )
Description
x = run(gs,problem) runs GlobalSearch to find a solution or multiple local
solutions to problem.
[x,fval] = run( ___ ) returns the objective function value at x, the best point found,
using any of the arguments in the previous syntaxes. For the lsqcurvefit and
lsqnonlin local solvers, fval contains the squared norm of the residual.
11-150
run
Examples
Create an optimization problem that has several local minima, and try to find the global
minimum using GlobalSearch. The objective is the six-hump camel back problem (see
“Run the Solver” on page 3-21).
rng default % For reproducibility
gs = GlobalSearch;
sixmin = @(x)(4*x(1)^2 - 2.1*x(1)^4 + x(1)^6/3 ...
+ x(1)*x(2) - 4*x(2)^2 + 4*x(2)^4);
problem = createOptimProblem('fmincon','x0',[-1,2],...
'objective',sixmin,'lb',[-3,-3],'ub',[3,3]);
x = run(gs,problem)
All 8 local solver runs converged with a positive local solver exit flag.
x =
-0.0898 0.7127
You can request the objective function value at x when you call run by using the
following syntax:
[x,fval] = run(gs,problem)
However, if you neglected to request fval, you can still compute the objective function
value at x.
fval = sixmin(x)
fval = -1.0316
11-151
11 Functions — Alphabetical List
Use a default MultiStart object to solve the six-hump camel back problem (see “Run
the Solver” on page 3-21).
rng default % For reproducibility
ms = MultiStart;
sixmin = @(x)(4*x(1)^2 - 2.1*x(1)^4 + x(1)^6/3 ...
+ x(1)*x(2) - 4*x(2)^2 + 4*x(2)^4);
problem = createOptimProblem('fmincon','x0',[-1,2],...
'objective',sixmin,'lb',[-3,-3],'ub',[3,3]);
[x,fval,exitflag,outpt,solutions] = run(ms,problem,30);
All 30 local solver runs converged with a positive local solver exit flag.
Examine the best function value and the location where the best function value is
attained.
fprintf('The best function value is %f.\n',fval)
Create a set of initial 2-D points for MultiStart in the range [-3,3] for each
component.
v = -3:0.5:3;
[X,Y] = meshgrid(v);
ptmatrix = [X(:),Y(:)];
tpoints = CustomStartPointSet(ptmatrix);
Find the point that minimizes the six-hump camel back problem (see “Run the Solver” on
page 3-21) by starting MultiStart at the points in tpoints.
rng default % For reproducibility
ms = MultiStart;
11-152
run
All 169 local solver runs converged with a positive local solver exit flag.
x =
-0.0898 0.7127
Create an optimization problem that has several local minima, and try to find the global
minimum using GlobalSearch. The objective is the six-hump camel back problem (see
“Run the Solver” on page 3-21).
rng default % For reproducibility
gs = GlobalSearch;
sixmin = @(x)(4*x(1)^2 - 2.1*x(1)^4 + x(1)^6/3 ...
+ x(1)*x(2) - 4*x(2)^2 + 4*x(2)^4);
problem = createOptimProblem('fmincon','x0',[-1,2],...
'objective',sixmin,'lb',[-3,-3],'ub',[3,3]);
[x,fval,exitflag,output,solutions] = run(gs,problem);
All 8 local solver runs converged with a positive local solver exit flag.
To understand what GlobalSearch did to solve this problem, examine the output
structure and solutions object.
disp(output)
funcCount: 2261
localSolverTotal: 8
localSolverSuccess: 8
localSolverIncomplete: 0
11-153
11 Functions — Alphabetical List
localSolverNoSolution: 0
message: 'GlobalSearch stopped because it analyzed all the trial poin
disp(solutions)
X
Fval
Exitflag
Output
X0
arrayfun(@(x)x.Output.funcCount,solutions)
ans =
31 34 40 3
The eight local solver runs found four solutions. The funcCount output shows that
fmincon took no more than 40 function evaluations to reach each of the four solutions.
The output does not show how many function evaluations four of the fmincon runs took.
Most of the 2261 function evaluations seem to be for GlobalSearch to evaluate trial
points, not for fmincon to run starting from those points.
Input Arguments
gs — GlobalSearch solver
GlobalSearch object
11-154
run
ms — MultiStart solver
MultiStart object
Output Arguments
x — Best point found
real array
11-155
11 Functions — Alphabetical List
Best point found, returned as a real array. The best point is the one with lowest objective
function value.
11-156
run
Field Meaning
localSolverNoSolutio Number of local solver runs with negative exit flag.
n
localSolverSuccess Number of local solver runs with positive exit flag.
localSolverTotal Total number of local solver runs.
message Exit message.
See Also
GlobalOptimSolution | GlobalSearch | MultiStart
Topics
“Run the Solver” on page 3-21
“Workflow for GlobalSearch and MultiStart” on page 3-3
Introduced in R2010a
11-157
11 Functions — Alphabetical List
saoptimget
Values of simulated annealing options structure
Syntax
val = saoptimget(options, 'name')
val = saoptimget(options, 'name', default)
Description
val = saoptimget(options, 'name') returns the value of the parameter name from
the simulated annealing options structure options. saoptimget(options, 'name')
returns an empty matrix [] if the value of name is not specified in options. It is only
necessary to type enough leading characters of name to uniquely identify the parameter.
saoptimget ignores case in parameter names.
Examples
opts = saoptimset('TolFun',1e-4);
val = saoptimget(opts,'TolFun');
See Also
saoptimset | simulannealbnd
Topics
“Simulated Annealing Options” on page 10-67
11-158
saoptimget
Introduced in R2007a
11-159
11 Functions — Alphabetical List
saoptimset
Create simulated annealing options structure
Syntax
saoptimset
options = saoptimset
options = saoptimset('param1',value1,'param2',value2,...)
options = saoptimset(oldopts,'param1',value1,...)
options = saoptimset(oldopts,newopts)
options = saoptimset('simulannealbnd')
Description
saoptimset with no input or output arguments displays a complete list of parameters
with their valid values.
11-160
saoptimset
saoptimset('simulannealbnd')
ans =
AnnealingFcn: @annealingfast
TemperatureFcn: @temperatureexp
AcceptanceFcn: @acceptancesa
TolFun: 1.0000e-006
StallIterLimit: '500*numberofvariables'
MaxFunctionEvaluations: '3000*numberofvariables'
TimeLimit: Inf
MaxIterations: Inf
ObjectiveLimit: -Inf
Display: 'final'
DisplayInterval: 10
HybridFcn: []
HybridInterval: 'end'
PlotFcns: []
PlotInterval: 1
OutputFcns: []
InitialTemperature: 100
ReannealInterval: 100
DataType: 'double'
Options
The following table lists the options you can set with saoptimset. See “Simulated
Annealing Options” on page 10-67 for a complete description of these options and their
values. Values in {} denote the default value. You can also view the options parameters
by typing saoptimset at the command line.
optimoptions hides the options listed in italics, but saoptimset does not. See
“Options that optimoptions Hides” on page 10-76.
Option Description Values
AcceptanceFcn Handle to the function the Function handle |{@acceptancesa}
algorithm uses to determine if a
new point is accepted
AnnealingFcn Handle to the function the Function handle | @annealingboltz
algorithm uses to generate new | {@annealingfast}
points
11-161
11 Functions — Alphabetical List
11-162
saoptimset
11-163
11 Functions — Alphabetical List
See Also
optimoptions | saoptimget | simulannealbnd
Topics
“Simulated Annealing Options” on page 10-67
Introduced in R2007a
11-164
simulannealbnd
simulannealbnd
Find minimum of function using simulated annealing algorithm
Syntax
x = simulannealbnd(fun,x0)
x = simulannealbnd(fun,x0,lb,ub)
x = simulannealbnd(fun,x0,lb,ub,options)
x = simulannealbnd(problem)
[x,fval] = simulannealbnd( ___ )
[x,fval,exitflag,output] = simulannealbnd( ___ )
Description
x = simulannealbnd(fun,x0) finds a local minimum, x, to the function handle fun
that computes the values of the objective function. x0 is an initial point for the simulated
annealing algorithm, a real vector.
Note “Passing Extra Parameters” (Optimization Toolbox) explains how to pass extra
parameters to the objective function, if necessary.
11-165
11 Functions — Alphabetical List
[x,fval] = simulannealbnd( ___ ), for any syntax, returns the value of the objective
function fun at the solution x.
Examples
Minimize De Jong's fifth function, a two-dimensional function with many local minima.
11-166
simulannealbnd
Minimize De Jong's fifth function using simulannealbnd starting from the point [0,0].
fun = @dejong5fcn;
x0 = [0 0];
x = simulannealbnd(fun,x0)
x =
-32.0285 -0.1280
11-167
11 Functions — Alphabetical List
The simulannealbnd algorithm uses the MATLAB® random number stream, so you
might obtain a different result.
Start simulannealbnd starting at the point [0,0], and set lower bounds of -64 and
upper bounds of 64 on each component.
11-168
simulannealbnd
fun = @dejong5fcn;
x0 = [0 0];
lb = [-64 -64];
ub = [64 64];
x = simulannealbnd(fun,x0,lb,ub)
x =
-15.9790 -31.9593
The simulannealbnd algorithm uses the MATLAB® random number stream, so you
might obtain a different result.
Observe the progress of simulannealbnd by setting options to use some plot functions.
options = saoptimset('PlotFcns',{@saplotbestx,...
@saplotbestf,@saplotx,@saplotf});
Start simulannealbnd starting at the point [0,0], and set lower bounds of -64 and
upper bounds of 64 on each component.
x =
-15.9790 -31.9593
11-169
11 Functions — Alphabetical List
11-170
simulannealbnd
Start simulannealbnd starting at the point [0,0], and set lower bounds of -64 and
upper bounds of 64 on each component.
fun = @dejong5fcn;
x0 = [0,0];
lb = [-64,-64];
ub = [64,64];
[x,fval,exitflag,output] = simulannealbnd(fun,x0,lb,ub)
x =
-15.9790 -31.9593
11-171
11 Functions — Alphabetical List
fval =
1.9920
exitflag =
output =
iterations: 1762
funccount: 1779
message: 'Optimization terminated: change in best function value less than opti
rngstate: [1x1 struct]
problemtype: 'boundconstraints'
temperature: [2x1 double]
totaltime: 1.1936
The simulannealbnd algorithm uses the MATLAB® random number stream, so you
might obtain a different result.
Input Arguments
fun — Function to be minimized
function handle | function name
11-172
simulannealbnd
x = simulannealbnd(@myfun,x0)
x0 — Initial point
real vector
Initial point, specified as a real vector. simulannealbnd uses the number of elements in
x0 to determine the number of variables that fun accepts.
Example: x0 = [1,2,3,4]
Data Types: double
lb — Lower bounds
real vector | real array
Lower bounds, specified as a real vector or real array. If the number of elements in x0 is
equal to that of lb, then lb specifies that
ub — Upper bounds
real vector | real array
11-173
11 Functions — Alphabetical List
Upper bounds, specified as a real vector or real array. If the number of elements in x0 is
equal to that of ub, then ub specifies that
optimoptions hides the options listed in italics; see “Options that optimoptions Hides”
on page 10-76.
Option Description Values
AcceptanceFcn Handle to the function the Function handle |{@acceptancesa}
algorithm uses to determine if a
new point is accepted
AnnealingFcn Handle to the function the Function handle | @annealingboltz
algorithm uses to generate new | {@annealingfast}
points
DataType Type of decision variable 'custom' | {'double'}
Display Level of display 'off' | 'iter' | 'diagnose' |
{'final'}
DisplayInterval Interval for iterative display Positive integer | {10}
11-174
simulannealbnd
11-175
11 Functions — Alphabetical List
11-176
simulannealbnd
• x0 — Starting point
• lb — Lower bound for x
• ub — Upper bound for x
• solver — 'simulannealbnd'
• options — Options structure created with optimoptions or saoptimset
• rngstate — Optional field to reset the state of the random number generator
Create the structure problem by exporting a problem from the Optimization app, as
described in “Importing and Exporting Your Work” (Optimization Toolbox).
Output Arguments
x — Solution
real vector
Solution, returned as a real vector. The size of x is the same as the size of x0. Typically, x
is a local solution to the problem when exitflag is positive.
11-177
11 Functions — Alphabetical List
See Also
ga | optimoptions | patternsearch | saoptimset
Topics
“Minimization Using Simulated Annealing Algorithm”
“Simulated Annealing Options”
“Multiprocessor Scheduling using Simulated Annealing with a Custom Data Type”
“Optimization Workflow” on page 1-17
11-178
simulannealbnd
Introduced in R2007a
11-179