Math 25b - Multivariable Taylor Series: N R R 1 N
Math 25b - Multivariable Taylor Series: N R R 1 N
Math 25b - Multivariable Taylor Series: N R R 1 N
Which agrees with our multivariable Taylor formula with k = 1 in which case one of the
js is 1 and the others are 0 so the linear terms match.
Now for degree 2, we need the second derivatives at zero. The term in the Taylor series
in t at t = 1 is:
n
!0
1 1 1 X
f (ϕ(t))00 = (D1 f (ϕ(t))ϕ01 (t) + · · · + Dn f (ϕ(t)ϕ0n (t))0t=0 = Di f (ϕ(t))hi
2 2 2 i=1
t=0
This also matches our second-order term in our multivariable Taylor series ansatz. By
induction, we get all the higher terms in the same way as ϕ(n) (t) = 0 for n ≥ 2 and only
the first term in the chain rule is nonzero. Therefore if g(t) = f (ϕ(t)) we can look at the
remainder term Rk (t) = g(t) − g(0) − g 0 (0)t − 21 g 00 (0)t2 − · · · − k!
1 (k)
g (0)tk . By Taylor’s
theorem the remainder term is of the form
g (k+1) (ζ)tk+1
Rk (t) = for some ζ ∈ (0, t)
(k + 1)!
1
the neighborhood of ~a. Because d(~a + t~h, ~a) = ||t~h|| = |t|||~h|| ≤ ||~h|| so the polynomial is
bounded by ||~h||k+1 so:
|Rk (~h)|| ≤ M ||h||k+1
as each hi is bounded by |hi | ≤ ||~h|| and M ≥ 0 is a bound given by the sup of the
k + 1-order derivatives in the studied neighborhood.
This is known as the Hessian matrix. If f ∈ C 2 then the second order derivatives are
continuous and Di Dj f (~a) = Dj Di f (~a) and the matrix is symmetric. Then consider a
product of the form hT Hh where h is a column vector. Then this product is exactly
proportional to the second-order term in the Taylor series, which is:
h1 n n
1 T 1 . 1 X 1 X
h Hh = h1 . . . hn H .. = Hij hi hj = Di Dj f (~a)hi hj
2 2 2 i,j=1 2 i,j=1
hn
The sign of this quadratic form gives us the nature of the critical point. Remember that
we can diagonalize any symmetric matrix, i.e. find an orthonormal basis of eigenvectors.
Then the matrix of H will be diagonal eigenvalues λi as the elements, and so the above
becomes:
n
1 T 1X
h Hh = λi h2i
2 2 i=1
We see that if all λi > 0 then the point is a local minimum, and if all λi < 0 it’s a
maximum. If all λi = 0 then we need to look at higher order terms; otherwise, the point
is neither a maximum or a minimum.