1 s2.0 S0022247X03005353 Main
1 s2.0 S0022247X03005353 Main
1 s2.0 S0022247X03005353 Main
Abstract
Dynamic programming identifies the value function of continuous time optimal control with a
solution to the Hamilton–Jacobi equation, appropriately defined. This relationship in turn leads to
sufficient conditions of global optimality, which have been widely used to confirm the optimality
of putative minimisers. In continuous time optimal control, the dynamic programming methodology
has been used for problems with state space a vector space. However there are many problems of
interest in which it is necessary to regard the state space as a manifold. This paper extends dynamic
programming to cover problems in which the state space is a general finite-dimensional C ∞ mani-
fold. It shows that, also in a manifold setting, we can characterise the value function of a free time
optimal control problem as a unique lower semicontinuous, lower bounded, generalised solution of
the Hamilton–Jacobi equation. The application of these results is illustrated by the investigation of
minimum time controllers for a rigid pendulum.
2003 Elsevier Inc. All rights reserved.
1. Introduction
In optimal control theory, dynamic programming links the function describing how the
minimum cost is affected by changes in initial data (“the value function”) and solutions
to the Hamilton–Jacobi equation. In contrast to techniques for solving optimal control
* Corresponding author.
E-mail address: r.vinter@ic.ac.uk (R.B. Vinter).
0022-247X/$ – see front matter 2003 Elsevier Inc. All rights reserved.
doi:10.1016/S0022-247X(03)00535-3
I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140 119
problems based on the maximum principle and related necessary conditions, dynamic pro-
gramming provides sufficient conditions of optimality. A frequent application of dynamic
programming is to confirm optimality of a putative minimiser. Here, necessary conditions
of optimality provide guesses regarding the nature of minimising controls for arbitrary ini-
tial data; this information is used to construct a suitable solution to the Hamilton–Jacobi
equation, to validate the “minimiser” under consideration, a procedure referred to as “con-
structing a field of extremals.” The technique has been used, for example, to confirm
optimality of extremals (i.e., arcs satisfying Euler–Lagrange type conditions) for many
classical variational problems, including the brachystochrone problem [12], for dynamic
optimisation problems arising in resource economics [4] and for variational problems in
nonlinear elasticity [5].
Research on the dynamic programming approach in optimal control has been very ex-
tensive. Much of this centres on the task of providing appropriate solution concepts, to
ensure that the value function can be characterised as the unique solution to the Hamilton–
Jacobi equation. This is far from straightforward—to take account of numerous optimal
control problems of interest where the value functions lack the requisit smoothness to be
regarded as classical solutions to the Hamilton–Jacobi equation (we examine one such
example in Section 8), we must allow “solutions” which are nondifferentiable or even dis-
continuous. The use of elementary notions of solutions to the Hamilton–Jacobi equation,
based on satisfaction of the equation at points of differentiability, fails to guarantee unique-
ness of solutions. Sophisticated analytical machinery is now available however, dealing
with these difficulties (viscosity solutions, generalised proximal normal solutions, etc). We
refer to [2,11] for background discussion.
However there remain important classes of optimal control problems not directly cov-
ered by current theory. A simple example (one which we shall examine in detail) is
minimum time control of a rigid pendulum, driven by a tangential force (the “control”),
subject to saturation, and confined to the vertical plane. The target set is the configuration
in which the pendulum is stationary and makes a zero angle with the downward vertical.
For this problem, the state space M is a two-dimensional differentiable manifold, the
tangent bundle of S 1 . It is not difficult to come up with plausible candidates for mini-
mum controls for this problem. However, standard techniques cannot be used directly to
verify optimality, because the state space is a finite-dimensional C ∞ manifold, not home-
omorphic to an open subset of a finite-dimensional space, and “continuous time” dynamic
programming has hitherto been developed only in a vector space setting.
In the free time problem described above, one of a number of “fixes” can be used to
provide a test of optimality based on dynamic programming for optimal control problems
on vector spaces. We can narrow the problem by introducing state constraints which con-
fine motion to one local coordinate chart. Alternatively we can regard M as the image of
all of R2 , under some C ∞ mapping which is onto, but not one-to-one; in this way, we
can reformulate the original problem as a vector space problem with target set comprising
multiple copies of the original target point. However this is a very simple problem and we
would expect a great deal of ingenuity would be required in more complicated situations,
to provide an alternative formulation to which vector space dynamic programming can be
applied.
120 I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140
When specialised to a vector space setting (M = Rn ), (3) provides the information that
the value function V satisfies the minimum time problem Hamilton–Jacobi equation and
accompanying boundary conditions on the target set C;
This reproduces the ‘state of the art’ characterisation of value functions for free time
problems given, for example, in [13] with one modification: we have an extra hypothesis,
not present in [13] excluding state trajectories with finite escape times. The target set is
an arbitrary closed subset of the state space and no controllability-type hypotheses are
imposed, whose purpose is to ensure continuity of the value function.
For a fixed time optimal control problem with state space a vector space, Frankowska
and Plaskacz [8] provide a Hamilton–Jacobi type characterisation of the value function, in
situations where the state is constrained to lie in a closed subset D (with, possibly, non-
empty interior). In the case when the manifold M can be embedded in R N , for some N ,
in such a manner that F has a locally Lipschitz extension to all of R N and a certain
‘outward-pointing’ condition is satisfied, we expect that use of the methods of [8] (based
on interpreting D as M) provide an alternative approach to dynamic programming on man-
ifolds, though one that does not emphasise the coordinate free character of the problem.
The idea of posing optimal control problems on manifolds and of developing intrin-
sic formulations of certain optimality conditions such as the maximum principle are by
no means new [7,10]. Properties of solutions of partial differential equations on mani-
folds, too, have been the subject of extensive investigations. The novel feature of this paper
is that it extends to a manifold setting optimality conditions of dynamic programming
I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140 121
type, and provides the appropriate machinery for interpreting ‘nonsmooth’ solutions of the
Hamilton–Jacobi equation.
The main results of this paper, and also a more detailed description of the solution to
rigid pendulum problem of the final section, were reported in [3].
Throughout the paper, B denotes the open unit ball in Euclidean space.
If we are to develop dynamic programming techniques for free time optimal control
problems on manifolds of any generality, we must allow for possibly nondifferentiable
value functions. It is necessary therefore to extend to a manifold setting vector space ma-
chinery for analysing the local properties of nonsmooth functions, in terms of gradient-like
constructs. Many of the analytic tools from nonsmooth analysis (including proximal subd-
ifferentials, strict subdifferentials, limiting subdifferentials, lower Dini directional deriva-
tives and viscosity-type subdifferentials) have analogues in a manifold setting. Of primary
interest here will be a generalisation of the proximal subdifferential since this alone is
required in our approach to dynamic programming on manifolds. Recall that M is an n-
dimensional C ∞ manifold.
The following proposition links the above definition of proximal subdifferential to the
familiar proximal subdifferential defined on finite-dimensional vector spaces.
Proof. (i) ⇒ (ii) Suppose σ ∈ ∂ P f (p). Then (2) is satisfied for some neighbourhood W
of p and some C 2 function m : W → R. Write x̄ = φ(p). Take any local coordinate chart
(φ, U ) and > 0 such that φ −1 (φ(p) + B) ⊂ U ∩ W and any C 2 function m such that
σ = dm (p). By (2) we have, for all x ∈ x̄ + B,
f ◦ φ −1 (x ) − f ◦ φ −1 (x̄) m ◦ φ −1 (x ) − m ◦ φ −1 (x̄).
Since σ = dm = dm and m ◦ φ −1 and m ◦ φ −1 are both C 2 functions, we have
∇(m ◦ φ −1 )(x̄) = ∇(m ◦ φ −1 )(x̄)
and, for some M > 0 sufficiently large,
m ◦ φ −1 (x ) − m ◦ φ −1 (x̄) ∇(m ◦ φ −1 )(x̄) · (x − x̄) − M|x − x̄|2
for all x ∈ x̄ + B.
We have confirmed (ii).
(ii) ⇒ (i) Assume that (ii) is true. Then f (p) < +∞ and σ = dm (p). (2) is satisfied
with W = φ −1 (φ(p) + B) and
2
m(p ) = f (p) + ηT φ(p ) − φ(p) − M φ(p ) − φ(p) .
We have confirmed (i). ✷
We state the main results of the paper. These relate to the free time optimal control
problem P (p) of Section 2. It is asserted that the value function
V (p) := inf P (p), p ∈ M,
is the unique solution (appropriately defined) to the Hamilton–Jacobi equation
1 − H (p, −dV (p)) = 0, p ∈ M \ C,
(4)
V (p) = 0, p ∈ C,
124 I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140
We are ready to state our characterisation of the value function for the free time optimal
control problem on a manifold, in terms of generalised solutions of the Hamilton–Jacobi
equation.
Theorem 4.1. Assume that the data for problem P (p) satisfy hypotheses (H1)–(H3). Then
the value function V is the unique lower semicontinuous function which is a proximal
solution to the Hamilton–Jacobi equation (4).
5. Existence of minimisers
Lemma 5.1. Fix a compact set G ⊂ M and T > 0. Assume (H1)–(H3). Then there exists
a compact set D such that for all T ∈ [0, T ] and F -trajectories q : [0, T ] → M such that
q(0) ∈ G, we have
q(t) ∈ D for all t ∈ [0, T ].
Proof. Take any arc q : [0, T ] → M as described in the lemma. We deduce from (H3) that
d
s q(t) K 1 + s q(t) a.e. t ∈ [0, T ]. (5)
dt
Now t → s(q(t)) and t → |s(q(t))| are locally Lipschitz continuous. So there exists a
subset Q ⊂ [0, T ], of full measure, on which both functions are differentiable. We claim
d
s q(t ) K 1 + s q(t ) (6)
dt
for all t ∈ Q. Indeed, (6) follows from (5) if s(q(t)) = 0, since then
d d
s q(t ) = ± s q(t ) .
dt dt
On the other hand, if s(q(t )) = 0, t → |s(q(t))| is minimised at t = t so
d
s q(t ) = 0 K 1 + s q(t ) .
dt
We see that (6) is satisfied in this case also.
By Gronwall’s lemma,
s q(t) eKt s q(0) + 1 L for all t ∈ [0, T ],
for some constant L (independent of q). We have used the fact that the continuous function
p → s(p) is bounded on the compact set G. The assertions of the lemma now follow from
the assumed properties of s(·). ✷
Lemma 5.2. Assume (H1)–(H3). Fix a compact set G ⊂ M and T̄ > 0. Then there exists
a finite collection of local coordinate charts {(φj , Uj )} and constants k̄ > 0, c̄ > 0 and
¯ > 0 with the following properties. Given any T ∈ [0, T̄ ] and F -trajectory q : [0, T ] → M
satisfying q(0) ∈ G, there exists a local coordinate chart (φ, U ) ∈ {(φj , Uj )} such that
φ −1 φ q(T ) + B ¯ ⊂U
and
F̂ (x ) ⊂ F̂ (x) + k̄|x − x|B and F̂ (x) ⊂ c̄B for all x ∈ φ(U ),
126 I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140
Proof. We choose an atlas {(φp , Up ): p ∈ M}, such that, for each p ∈ M, we have p ∈ Up ,
constants kp > 0 and cp > 0 governing the Lipschitz and boundedness of F̂ on Up (see
(H2) and (H3)) and a constant ¯ > 0 such that
φp−1 φp (p) + 2p B ⊂ Up
and
F̂p (x) ⊂ cp B for all x ∈ φp (p) + 2p B,
where
d
F̂p (x) = v: vi ∈ (φp )∗ F φp−1 (x) .
dxi
i
Proof. We can assume that p ∈ / C since, if p ∈ C, P (p) has the trivial solution (q ≡ p,
T ∗ = 0). We can also assume that inf P (p) < +∞ for, otherwise, by convention, the min-
imum cost is +∞, which is achieved by any arc (q, T ) with left endpoint p.
Let T ∗ = inf P (p) (< +∞). Since p ∈ / C, and in view of (H2), T ∗ > 0. By definition of
the infimum cost, there exists a sequence of F -trajectories qi : [0, Ti ] → M with qi (0) = p,
qi (Ti ) ∈ C, such that Ti ↓ T ∗ . Extend (without relabelling) each qi to [0, +∞), by constant
extrapolation to the right.
I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140 127
Let F = {(φi , Ui )} be the finite family of local coordinate charts of Lemma 5.2 corre-
sponding to the initial set G = {p} and T̄ , some upper bound on the Ti ’s. Let k̄ > 0, c̄ > 0
and ¯ > 0 be the constants of Lemma 5.2.
Choose δ > 0 and a positive integer N 2 such that
δ < ¯ /3c̄, δ T ∗, (N − 1)δ < T ∗ < Nδ.
By eliminating initial terms in the sequence {(qi , Ti )}, we can arrange that
Ti ∈ (N − 1)δ, Nδ for all i.
We now construct a sequence of arcs
q[j ] : (j − 1)δ, j δ → M, j = 1, . . . , N − 1,
and
q[N] : (N − 1)δ, (Nδ) ∧ T ∗ → M.
According to the properties of the family of local coordinate charts F , we can choose a
local coordinate chart (φ[1] , U[1] ) ∈ F such that
−1
φ[1] φ[1] (p) + ¯ B ⊂ U[1] . (7)
Write
d −1
F̂[1] (x) = v ∈ Rn : vj ∈ φ[1]∗ F φ[1] (x) .
dxj
j
for all i. Since T i ↓ T ∗ , since xi[2] ’s converge uniformly to x[2] , xi[2] ’s are uniformly Lip-
schitz continuous and the set on the right side of (8) is closed, we conclude that
x[2] ∈ φ[2] (C ∩ U[2] ).
It follows that the arc (q[2] , T ∗ ) achieves the minimum cost. The proposition is proved in
the case N = 2.
If, on the other hand, N > 2, we construct also arcs q[j ] : [(j − 1)δ, j δ] → M, and local
coordinate charts (φ[j ] , Uj ) ∈ F , for j = 3, 4, . . . , N , recursively, in a manner which pat-
terns the construction of q[2] from q[1] , etc., extracting suitable subsequences at each step.
A repeat of the preceding analysis establishes that the arc (q, T ∗ ), obtained by concatenat-
ing q[1] , . . . , q[N] , is a minimiser for P (p). ✷
each i, and Ti → T ∗ for some T ∗ . (9) will follow if we can construct an F -trajectory
q : [0, T ∗ ] → M such that q(0) = p and q(T ∗ ) ∈ C. The construction of q is along almost
identical lines to that of the minimising F -trajectories in the proof of Theorem 5.3, and
is not repeated here. (The only difference, which does not affect the analysis, is that the
condition “qi (0) = p” is replaced by “qi (0) → p.”) ✷
6. Monotonicity properties
Proof. We can assume that V (p) < +∞ since, otherwise, the assertions of the proposition
are trivial. We can also assume that U is the open set associated with some local coordinate
chart (φ, U ). Write x0 = φ(p) and choose > 0 such that x0 + B ⊂ φ(U ). Write
V̂ (x) = V φ −1 (x) ,
d
F̂ (x) = v: vi ∈ φ∗ F φ −1 (x)
dxi
i
Ṽ (x) := V̂ (x) if x ∈ x0 + B,
+∞ if x ∈ / x0 + B.
Well-known arguments (see, e.g., [6,11]) permit us to deduce from (12) that
min η · v + η1 · σ + η2 · α 0 for all (η, η1 , η2 ) ∈ ND
P
(x, τ, z) (13)
(v,σ,α)∈F̃(x,τ,z)
for all (x, τ, z) ∈ Rn × R × R. Here, ND P denotes the (vector space) proximal normal cone
to the set D (at the relevant basepoint). (Notice that, for |x − x0 | , F̃ (x, τ, z) contains
the point (0, 0, 0), so (13) is automatically satisfied.)
We now apply the proximal normal version of the weak invariance theorem (see,
e.g., [11]) in the standard vector space setting, noting that the relevant hypotheses, namely
F̃ is an upper semicontinuous, bounded, convex, nonempty multifunction and satisfies
the “inward pointing” condition (13) (see also [6]). We conclude that there exists an F -
trajectory (x(t), τ (t), z(t)), 0 t < ∞, with initial value (x0 , 0, 0), and taking values in D.
Choose < /c. Then |x(t) − x0 | < for 0 t . It follows that Ṽ (x(t)) = V̂ (x(t))
and τ̇ (t) = 1 for 0 t . For (x(t), τ (t), z(t)) ∈ D, z(0) = V (x0 ) we deduce
V̂ (x0 ) V̂ x(t) + t, 0 t .
Proposition 6.2 (Strong monotonicity). Suppose that, for some lower semicontinuous func-
tion W : M → R ∪ {+∞},
1 − H (p, −η) 0 for all η ∈ ∂ P W (p), p ∈ M.
Then, for any F -trajectory q : [0, T ] → M and t ∈ (0, T ],
W q(0) t + W q(t) . (14)
Proof. Take any t ∈ (0, T ]. We aim to show (14). It may be assumed that W (q(t)) < +∞
since, otherwise, the inequality is automatically satisfied. There exists a chart (φ, U ) and
> 0 such that A := φ −1 [φ(q(t)) + B] ⊂ U and
φ −1 φ q(s) + (/2)B ⊂ A for all s ∈ [t − , t].
Write x(s) = φ(q(s)) for s ∈ [t − , t]. Let F̂ be the local representation of F with respect
to (φ, U ) and define Ŵ : (−∞, +∞) × A → R ∪ {+∞} to be
Ŵ (s, x) := s + W φ −1 (x) .
I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140 131
(∂ P Ŵ here has its customary, vector space, meaning.) Bearing in mind that F̂ is Lipschitz
continuous, bounded and has values closed, convex sets, and recalling that Ŵ (t, x(t)) =
t + W (q(t)) < +∞, we can show, using a local analysis along the lines of the proof of [11,
Theorem 12.2.4], that
Ŵ s, x(s) Ŵ t, x(t) for all s ∈ [t − , t],
i.e.,
W x(s) + s W t, x(t) + t for all s ∈ [t − , t].
Define
smin = min σ ∈ [0, t]: W x(s) + s W t, x(t) + t for all s ∈ (σ, t] .
We must have smin = 0, for, otherwise, the preceding analysis tells us that the interval
(σ, t] could be extended to the left, while preserving the inequality, in contradiction of the
definition of smin . We have shown that W (x(s)) + s W (t, x(t)) + t for all s ∈ (0, T ]. It
is true also for s = 0, by lower semicontinuity. ✷
Take any η ∈ ∂ P V (p). Write ζ for the vector satisfying i ξi dxi = φ ∗−1 η. Then
2
V̂ (x) − V̂ x(0) ζ · x − x(0) − M x − x(0) for all x ∈ x(0) + B,
for some M > 0 and > 0. We have
1 + η, v = 1 + ζ · e = 1 − lim t −1 ζ · x(−t) − x(0)
t ↓0
−1
1 + lim inf t V q(−t) − V q(0) 0.
t ↓0
(22) is confirmed. Since (18) and (22) imply (15) and (16), we have confirmed that V
satisfies the stated conditions.
It remains to show that V is the unique such function.
Let W be any lower semicontinuous function, bounded from below, vanishing on C,
which satisfies the “equivalent” conditions
1 − H (p, −η) 0 for all η ∈ ∂ P W (p), p ∈ M \ C,
1 − H (p, −η) 0 for all η ∈ ∂ P W (p), p ∈ M.
Take any p ∈ M. We show
(i) For any F -trajectory q : [0, T ] → M such that q (0) = p and q (T ) ∈ C we have
W (p) T ;
(ii) There exists an F -trajectory q : [0, T ] → M such that q(0) = p, q(T ) ∈ C and
W (p) T .
In other words, (q̄, T ∗ ) is an F -trajectory. In view of the other properties listed above,
(q̄, T ∗ ) is a maximal element for the chain C.
Invoking Zorn’s lemma, we deduce existence of a maximal element (q, T ) in V. By
definition of elements in V, (q, T ) is an F -trajectory such that
8. An example
We study an example, one of the simplest where the state space is not homeomorphic
to an open subset of Rn , to illustrate the role of Theorem 4.1, regarding the confirmation
of minimisers in a manifold setting.
The system is the rigid pendulum in the vertical plane, to which an amplitude limited
torque (the control) is applied. The state space is a two-dimensional manifold, namely
the tangent bundle of S 1 . Fix a number α ∈ (0, π/2). The configuration of the system is
described by local coordinates, (x1 , x2 ), in each of two local coordinate charts.
The Upper Chart. In this chart, x2 is the angular velocity and x1 describes the angle
from the downward vertical, in the range +π/2 − α x1 +3π/2 + α.
The Lower Chart. In this chart, x2 is the angular velocity and x1 describes the angle
from the downward vertical, in the range −π/2 − α x1 +π/2 + α.
The charts are illustrated in Figs. 1 and 2. Assume zero gravity. Assume also that the
pendulum has unit moment of inertia and the controlling torque u is required to satisfy the
constraints −1 u +1. Then, with respect to local coordinates in either chart domain,
the dynamic constants are
d
(x1 , x2 ) ∈ (x2 , u): u ∈ [−1, +1] .
dt
A local analysis of extremals would indicate that the value function (in either local coordi-
nate chart) is described by the function V : D → R, where
D = (x1 , x2 ) ∈ R 2 : −π/2 − α x1 3π/2 + α
136 I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140
and
V (x1 , x2 ) := min Vk (x1 , x2 ). (29)
k∈Z
Proposition 8.1. The function V defined above is continuous. Furthermore, V is the value
function, expressed in local coordinates, w.r.t. either chart. (Notice that V unambiguously
defines a function on the manifold, since V is periodic, with period 2π .)
Proof. We shall apply Theorem 4.1. This is justified, since the system dynamics satisfy
the compactness, convexity, and local Lipschitzity and boundedness hypotheses (H1) and
(H2), and also the linear growth hypothesis (H3), with functional s : M → R chosen to be
s(x1 , x2 ) = |x2 |2 , w.r.t. local coordinates in either chart.
I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140 137
V (0, 0) = 0 since V0 (0, 0) = 0 and the Vk ’s are nonnegative valued. V0 is clearly con-
tinuous. Notice next that, for any bounded set E ⊂ R2 ,
lim inf V0 (x1 + 2πk, x2 ) = +∞. (30)
k→∞ (x1 ,x2 )∈E
It follows that for each (x1 , x2 ) ∈ E, we can limit attention to a (fixed) finite collection of
functions {Vk }k∈S , when evaluating V (x1 , x2 ) (see (29)). We deduce that V is continuous
since, locally, it is a pointwise infimum over a finite number of continuous functions.
Notice that V0 (0, 0) = 0 and V (2πk, 0) > 0 for all k. It follows that
V (0, 0) = V0 (0, 0).
Arguing as in step (B) above, we deduce from the second claimed assertion that
1 + η1 · x2 + min η2 · u 0,
|u|1
as required.
Reviewing properties (A)–(C), we see that they amount to the assertions of Proposi-
tion 8.1. It remains then to validate the claim. We attend, finally, to this task.
Validation of the Claim. Take any (x1 , x2 ) ∈ R2 and (η1 , η2 ) ∈ ∂ P V0 (x1 , x2 ). Then there
exists M > 0 and > 0 such that
2
V0 (y1 , y2 ) − V0 (x1 , x2 ) (η1 , η2 ) · (y1 − x1 , y2 − x2 ) − M (y1 − x1 , y2 − x2 )
(33)
for all (y1 , y2 ) ∈ (x1 , x2 ) + B.
(These relationships provide parametric descriptions of two curves, the union of whose
graphs comprise nonzero points in the complement of int A1 ∪ · · · ∪ int A4 in R2 .) We
consider only case (a). (Case (b) is treated analogously.) For all > 0 sufficiently small,
1
− (t ± ) , t ± ∈ A2 .
2
2
Inserting
1
(y1 , y2 ) = − (t ± )2 , t ±
2
into (33), dividing across by and passing to the limit as ↓ 0 gives
−1 = +η1 t − η2 . (35)
For > 0, (x1 , x2 ) + (0, −1) ∈ A3 . Inserting
(y1 , y2 ) = (x1 , x2 ) − (0, −1)
I. Chryssochoos, R.B. Vinter / J. Math. Anal. Appl. 287 (2003) 118–140 139
For > 0, (− 2 /2, +) ∈ A4 and (+ 2 /2, −) ∈ A4 . Inserting
1 2
(y1 , y2 ) = ∓ , ±
2
into (33), dividing across by and passing to the limit as ↓ 0, we deduce that η2 = 0.
Since x2 = 0, (36) follows. We have validated the claim. Proof of the proposition is com-
plete. ✷
Figure 3 shows contours of the value function for the rigid pendulum minimum time
problem, in local coordinates for the lower chart. For a fixed initial condition, the optimal
control is bang bang. Figure 4 shows switching surfaces in local coordinates for the lower
chart, separating the regions where the optimal control takes value +1 from the regions
where the optimal control takes value −1.
References