Calculos Concentrate - Gordon PDF

Download as pdf or txt
Download as pdf or txt
You are on page 1of 182

Calculus Concentrate

Russell A. Gordon
Whitman College


c 2006
Preface

There are many calculus books currently available on the market. Most of these are thorough, large, and
expensive and, more or less, have the same table of contents. These books are useful to have around
as references for those few people who actually use calculus on a regular basis. However, they are not
necessarily beneficial to a student who is going through calculus for the first or second time. Students are
easily overwhelmed by the amount of information, they suffer backaches from hauling around a heavy book,
and sticker shock is inevitable at the beginning of the term. Calculus books have become so large because
they are chosen by committee and mathematicians rarely agree on the proper way to teach calculus. There
is no “right” way to teach calculus, but each person who teaches calculus has strong opinions concerning
how it should be taught. In order for a calculus book to be marketable, it has to appeal to a wide variety
of people by including their favorite topics and styles of presentation. Areas of disagreement include topics
to cover, depth of coverage of each topic, variety of applications to other disciplines, amount of rigor, use of
technology, and the number and type of examples and exercises. To satisfy many different people, the books
of necessity become very large. The electronic text that follows is much shorter because it represents one
person’s attempt to put the essential ideas of calculus into a short and concise format. It may not appeal to
a wide range of mathematicians, but it should provide most students with a good foundation in calculus.
A quick perusal of the text will reveal some of its key features. Almost every section is two pages long
and the pages are arranged so that (when printed) an entire section appears when the book is held open. To
keep the sections short, the number of examples and amount of discussion is kept to a minimum. The book
is not intended to be a set of lecture notes, but rather a framework upon which a lecture can be constructed.
Since the sections are short, the number of exercises is also limited. There are enough exercises to give most
students good experience with the concepts in each section. Further exercises can be found at the end of
each chapter. If more exercises are needed, there are many sources of problems available in other textbooks

iii
iv Preface

and at various sites on the Internet. The order and selection of topics is a bit different from standard books.
Rather than justify each of these differences, I will just say that I have thought carefully about each topic and
each sentence that appears in the book. I hope that a teacher who uses the book will be able to appreciate
some of the guiding principles that have shaped it.

A Note to the Student

The title of this book, Calculus Concentrate, carries a double meaning depending on the definition of the
word “concentrate”. One meaning is similar to its use on frozen juice containers. Instead of adding water
and stirring, you will need to add thought and contemplation. As you read the short sections, and it is
highly recommended that you do so, think carefully about each sentence and each mathematical equation.
Stop if necessary and do some calculations or spend some time absorbing the ideas. If you are still confused,
get some assistance. Reading the book is where the more common meaning of concentrate comes into play;
you will need to think hard while working through the book.
Many of the problems in elementary mathematics books can be solved by imitating the examples in
the text rather than understanding the concepts. While learning skills in this way is useful, an important
aspect of problem solving is knowing which skills to use and combining several skills in a multi-step problem.
Learning to solve non-routine problems, those without an example to imitate and requiring more than one
step, is one of the goals of this text. These sorts of problems require more time and effort and can be
frustrating, but the satisfaction of solving such a problem is much greater than simply imitating the solution
to a similar problem. Keep this in mind as you work the problems in this text. If you are not sure how
to start a problem, review the ideas in the section and think about what you do know about the concepts
mentioned in the problem.

Acknowledgments

Although the organization and wording of this text are my creations, there is very little original material
to be found here. I have drawn on my 30 years experience teaching calculus and sources too numerous to
mention. Conversations with colleagues have also impacted certain aspects of this work. David Guichard
provided technical support for the preparation of the manuscript. A grant from the Stanley Rall summer
research scholarship fund allowed Pat Cade to spend a summer reading the text and working through the
exercises. However, it is inevitable that there will be some difficulties and errors in the text. Most of these
will probably occur in the solutions section as sometimes the exercises were modified more quickly than the
solutions. Feel free to contact me at the mathematics department of Whitman College, Walla Walla, WA
99362 or at gordon@whitman.edu with questions and comments.
Contents

1
The Derivative 1
1.1 Lines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Numbers, Sets, and Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 The Limit Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.4 Continuous Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.5 Evaluating Limits Algebraically . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.6 Slope of a Curve . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.7 The Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1.8 Differentiation of Polynomials . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.9 Power Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.10 Chain Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
1.11 Product and Quotient Rules . . . . . . . . . . . . . . . . . . . . . . . . . . 24
1.12 Maximum and Minimum Outputs . . . . . . . . . . . . . . . . . . . . . . . . 26
1.13 Increasing and Decreasing Functions . . . . . . . . . . . . . . . . . . . . . . 28
1.14 The First Derivative Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
1.15 Optimization Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
1.16 Trigonometric Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
1.17 Derivatives of Trigonometric Functions . . . . . . . . . . . . . . . . . . . . . 40
1.18 Derivative Applications of Trigonometric Functions . . . . . . . . . . . . . . . 42
1.19 Properties of Exponents and Logarithms . . . . . . . . . . . . . . . . . . . . 44
1.20 Derivatives of Exponential and Logarithmic Functions . . . . . . . . . . . . . 46
1.21 Differential Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
1.22 Applications of Differential Equations . . . . . . . . . . . . . . . . . . . . . . 50
1.23 Definition of Limit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
1.24 Limits Involving ∞ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
1.25 L’Hôpital’s Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
1.26 The Mean Value Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

v
vi Contents

1.27 Applications of the Mean Value Theorem . . . . . . . . . . . . . . . . . . . . 60


1.28 Concavity and Inflection Points . . . . . . . . . . . . . . . . . . . . . . . . . 62
1.29 Velocity and Acceleration . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
1.30 Polynomial Approximation . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
1.31 Newton’s Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
1.32 Implicit Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
1.33 Related Rates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
1.34 Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
1.35 Supplementary Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

2
The Integral 79
2.1 Summation notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
2.2 Area under a curve . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
2.3 The integral of a continuous function . . . . . . . . . . . . . . . . . . . . . . 86
2.4 Evaluating the integral of a polynomial . . . . . . . . . . . . . . . . . . . . . 88
2.5 Further properties of the integral . . . . . . . . . . . . . . . . . . . . . . . . 90
2.6 The Fundamental Theorem of Calculus . . . . . . . . . . . . . . . . . . . . . 92
2.7 The Fundamental Theorem of Calculus, continued . . . . . . . . . . . . . . . 94
2.8 Finding antiderivatives: guess and check . . . . . . . . . . . . . . . . . . . . 96
2.9 Finding antiderivatives: integration by substitution . . . . . . . . . . . . . . . 98
2.10 Finding antiderivatives: integration by parts . . . . . . . . . . . . . . . . . . 100
2.11 Improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
2.12 Area between curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
2.13 Volume: method of cross-sections . . . . . . . . . . . . . . . . . . . . . . . . 106
2.14 Volume of solids of revolution: method of cylindrical shells . . . . . . . . . . . 108
2.15 Force exerted by a liquid . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
2.16 Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
2.17 Center of mass . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
2.18 Arc length . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
2.19 Integrals involving quadratic polynomials . . . . . . . . . . . . . . . . . . . . 118
2.20 Using a table of integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
2.21 Trigonometric substitution . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
2.22 Integrating rational functions . . . . . . . . . . . . . . . . . . . . . . . . . . 124
2.23 Numerical integration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
2.24 Supplementary Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Contents vii

3
Sequences and Series 131
3.1 Mathematical Induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
3.2 Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
3.3 Properties of Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
3.4 The Completeness Axiom . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
3.5 Infinite Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
3.6 The Integral Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
3.7 Comparison Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
3.8 Absolute Convergence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
3.9 Root and Ratio Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150
3.10 Power Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
3.11 Properties of Power Series . . . . . . . . . . . . . . . . . . . . . . . . . . . 154
3.12 Taylor Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156
3.13 Taylor’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
3.14 Supplementary Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . 160

A
Algebra/Geometry Review 163

B
Table of Integrals 165

C
Answers to Exercises 169
C.1 Chapter 1 Answers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
C.2 Chapter 2 Answers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
C.3 Chapter 3 Answers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
viii Contents

A Very Brief History of Calculus

Calculus has its roots in two geometric problems: determining the areas of regions with curved boundaries
and finding tangents to curves other than circles. Using geometric techniques, the ancient Greeks were able
to solve many such problems for the conic sections and some other easily defined curves. An increased
interest in these problems occurred during the Renaissance and it became evident that these problems had
applications to physical problems. In the middle of the seventeenth century, René Descartes (1596–1650)
and Pierre de Fermat (1601–1665) independently developed the x-y coordinate system. Although we take
this idea for granted now, it was actually quite a leap at the time. The Cartesian coordinate system (named
in honor of Descartes in spite of the fact that Fermat’s approach was closer to the one we use today) provides
a link between geometry and algebra. Geometric curves have algebraic equations and algebraic equations
generate geometric curves. This connection between geometry and algebra makes it possible to use algebraic
techniques to solve geometric problems; this area of mathematics is known as analytic geometry. Algebra
provided new proofs of well-known geometry facts and it also led to the discovery of new results in geometry.
More importantly, algebraic equations yielded a whole host of new curves to study. Previously, curves were
defined by geometry or by the trajectory of a moving point, but now almost every algebraic equation provided
another curve to study. The analysis of these curves using analytic geometry paved the way for the discovery
of calculus.
During the time period 1640–1670, a number of mathematicians came tantalizingly close to discovering
a connection between the area problem and the tangent problem. While working out various problems in
physics, primarily the theory of gravitation, Isaac Newton (1642–1727) studied infinite series and through
them saw how the area and tangent problems were connected. The date of this discovery is generally taken
to be 1665–1666 when Newton was home from college due to an outbreak of the plague. Gottfried Leibniz
(1646–1716) independently discovered this connection in the 1670’s and formulated much of the notation
for calculus that is in use today. He wrote the first published paper on calculus and it appeared in 1684.
(Unfortunately, Newton’s lack of formal publishing of his results led to a bitter dispute over priority of credit
for discovering calculus.) This recognition of the connection between areas and tangents in the last part of
the seventeenth century is taken to be the origin of calculus. The first calculus text, written by Guillaume
de L’Hôpital (1661–1704) appeared in 1696. During these years and into the next century, there was an
explosion of mathematical ideas and calculus led to the solutions of many different physical and geometric
problems. The fact that many problems in the physical sciences can be reduced to finding areas or tangents
has made calculus the cornerstone of the scientific revolution.
1
The Derivative

Consider the linear function y = 7x + 5. The graph of this function is a line with slope 7 and y-intercept
5. If the variable x increases by 1, then the variable y increases by 7. This statement is valid whether x
increases from 3 to 4 or from 150 to 151. It is easy to see why this is true; if x increases from a to a + 1,
then the change in y is

( ) ( ) ( ) ( )
value of y when x is a + 1 − value of y when x is a = 7(a + 1) + 5 − 7a + 5 = 7.

Thus the slope of the line gives a measure of the rate of change of y with respect to x. For the function
y = x2 , if x increases from 3 to 4, then y increases by 42 − 32 = 7, and if x increases from 150 to 151, then
y increases by 1512 − 1502 = 301. In this case, the rate of change of y depends on the original value of x; if
x increases from a to a + 1, then the change in y is

(a + 1)2 − a2 = 2a + 1.

In general, for functions that are not linear, the rate of change of y with respect to x depends on the original
value of x, that is, the rate of change of y with respect to x is variable. Differential calculus provides a
way to precisely measure the rate of change of y with respect to x for an arbitrary function. Since many
physical quantities are rates of change (velocity is the rate of change of distance with respect to time,
acceleration is the rate of change of velocity with respect to time, etc.), differential calculus has a variety of
useful applications. In this lengthy chapter, we will define the derivative, develop formulas for computing
derivatives, and explore some of the applications of the derivative.

1
2 Chapter 1 The Derivative

1.1 Lines

Two distinct points in the plane determine a line. Consider the line through the points P and Q with
coordinates (x1 , y1 ) and (x2 , y2 ), respectively. Assume that x1 ̸= x2 . Since the right triangles drawn on the
middle graph in the figure below are similar, the ratios h1 /b1 and h2 /b2 are equal.
y .
y .
y .
...... ...... ......
.. .. .. .. .. ..
Q.... Q.... Q....
. . .•.....
y2
.....•.
.. ......•.
...
y2
. .. . .
... .........
........ . . ...... ......... h2 .. . . .
.
.....
.. .
..... ...
. . y ...•.... ........
....... ......................b.....2..................
. . .. . ....... ......... ..
...
. .. . . .... ...... ...
.... .... ... .
.
.. . . ...

....... ............................ h1 .. . ...


. .. . .
. ...
P......... P......... b1 . .
P.............................................................................................................
.
.
. .

.....• y1 ..........
...... x
.....• ......
.......... x
.....• y1
..
..........
.... x
x1 x2 x1 x x2

This shows that the change in y (often denoted by ∆y) over the change in x (∆x) is the same for any two
pairs of points chosen on the line. The constant value, usually denoted by m, is referred to as the slope of
the line and can be determined by the formula

∆y y2 − y1
m= = .
∆x x2 − x1

For a point (x, y) (other than (x1 , y1 )) to be on this line (see the graph on the right), we must have

y − y1 y2 − y1
= or y − y1 = m(x − x1 ).
x − x1 x2 − x1

This form for the equation of a line is called the point-slope form of the line. The equation of a line is
often written in the slope-intercept form y = mx + b, where m is the slope and b is the y-intercept of the
line. Another useful formula for a line is Ax + By + C = 0, where A, B, and C are constants. This form
allows for the possibility of vertical lines; a vertical line occurs when B = 0 and A ̸= 0.
The slope of a line represents the rate of change of y with respect to x. If x changes by an amount ∆x
and the slope of the line is m, then y changes by an amount ∆y = m∆x:

( ) ( ) ( ) ( )
∆y = value of y at x + ∆x − value of y at x = m(x + ∆x) + b − mx + b = m∆x.

This is true no matter what the original value of x is; the change in y is always a constant multiple of the
change in x. For general curves, the change in y with respect to x varies from point to point and we will
later determine how to find this variable rate of change. For now, it is important to understand that a given
line has the same slope at each point.
Two distinct lines are said to be parallel if they have no point in common. It is easy to verify that two
distinct lines are parallel if and only if they have the same slope. Two distinct lines that are not parallel
have exactly one point of intersection. Two intersecting lines form a unique acute or right angle. If the
angle formed by the two intersecting lines is a right angle, the lines are said to be perpendicular. Two
non-vertical lines are perpendicular if and only if the product of their slopes is −1. You should be able to
prove this fact using the Pythagorean Theorem.
1.1 Lines 3

Problem: Find an equation for the line that goes through (3, 7) and is parallel to the line 2x + y − 5 = 0.

Solution: We first put the given line in slope-intercept form by solving the equation 2x + y − 5 = 0 for y
to obtain y = −2x + 5. The slope of the given line is thus −2. Since parallel lines have the same slopes, the
requested line has a slope of −2 also. Using the point-slope form for the equation of a line, the equation of
the line through (3, 7) and parallel to 2x + y − 5 = 0 is

y − 7 = −2(x − 3) or y = −2x + 13.

Problem: Find the points on the circle x2 + y 2 = 1 where the tangent line to the circle goes through (0, 3).

Solution: Let (a, b) be a point on the circle with the desired property. The solution to this problem hinges
on the following fact concerning tangent lines to circles: the tangent line is perpendicular to the radial line.
In the figure below, the line through (a, b) and the origin is perpendicular to the line through (a, b) and
(0, 3). This means that the product of the slopes of these two lines is −1. The resulting equation, along with
the equation a2 + b2 = 1, makes it possible to find a and b. The computations are shown by the figure.
y
.......
... ..
b b−3
· = −1 ⇒
a a−0
•...... (0, 3)
...
...
b − 3b = −a2 ⇒
2
...
3b = a2 + b2 = 1 ⇒
...
...
...
...
...
1
..................... (a, b)
...
...
b= ⇒
....
...
•..
... 3
..........
.... ..
. .
. . ... √
... . .
.....
.......... x 8
...... ..... a=± .
.......... 3

( √8 ) ( √ )
The two points on the circle for which the tangent line goes through (0, 3) are thus 3 , 13 and − 38 , 13 .

Exercises
1. Find an equation for the line that goes through the points (−1, 2) and (3, 8).
2. Let ℓ be the line 2x + 3y = 6 and let P be the point (3, −2). Find an equation for the line through P that is
parallel to ℓ and an equation for the line through P that is perpendicular to ℓ.
3. Find an equation for the perpendicular bisector of the segment joining the points (−1, 2) and (3, 8).
4. Find the slope, x-intercept, and y-intercept of the line 4x + 3y = 24, then find its distance from the origin.
5. Find the point of intersection of the lines with equations 2x + y − 5 = 0 and 3x − 4y − 2 = 0.
6. Let m be a negative number and let ℓ be the line through (2, 3) with slope m. The line ℓ cuts off a triangle in
the first quadrant. Find the area of this triangle as a function of m.
7. Consider the trapezoid with vertices at (0, 0), (3, 2), (6, 2), and (6, 0). Find a line that goes through the origin
and divides the trapezoid into two parts of equal area.
8. There are two points on the circle x2 + y 2 = 1 for which the tangent line passes through the point (7, 1). Find
the (x, y) coordinates of both these points.
4 Chapter 1 The Derivative

1.2 Numbers, Sets, and Functions

It is not possible to do mathematics without some reference to numbers. The natural numbers are the
numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, . . . , which are used for counting objects. The integers are the numbers
. . . , −5, −4, −3, −2, −1, 0, 1, 2, 3, 4, 5, . . . , which extend indefinitely in either direction. In this context, the
natural numbers are referred to as the positive integers. We will use the symbol Z to denote the set of
integers and the symbol Z+ to denote the set of positive integers. When numbers are used for measurement, it
is sometimes necessary to consider parts of a whole. This need for parts or fractions of an integer leads to the
concept of a rational number. A rational number is a number of the form p/q, where p and q are integers
with q ̸= 0. In other words, a rational number is a ratio of two integers. The standard symbol for the set of
rational numbers is Q, where the use of the letter Q follows from the fact that rational numbers are quotients.
The number 3/5 can be interpreted as follows: cut a string into 5 equal pieces and take 3 of the pieces. The
rational numbers include the integers since an integer n can also be represented as n/1. Since the rational
numbers are the familiar numbers that arise in everyday life, a great deal of elementary school mathematics
is devoted to a study of these numbers. However, not every number can be expressed as the ratio of two
integers; the square root of 2 is one example. Such numbers are referred to as irrational numbers. One
difference between the rational numbers and irrational numbers can be noted in their decimal expansions;
the decimal expansion of a rational number has a repeating pattern whereas the decimal expansion of an
irrational number does not. The rational and irrational numbers together form the set of real numbers,
which is denoted by the symbol R.
The reader should have some previous experience with sets. A set is loosely defined to be a collection
of objects. The objects in a set usually have some features in common, such as the set of integers or the set
of real numbers. The objects in a set are called elements or points in the set. If x belongs to a set S, then
we write x ∈ S and say “x is an element of S”, “x belongs to the set S”, or “x is a point in S”. A set A is a
subset of a set B, denoted A ⊆ B, if each element of A belongs to B. Two sets A and B are equal if and
only if A ⊆ B and B ⊆ A, that is, the sets A and B have the same elements. A set that has no elements
is called the empty set, denoted by the symbol ∅. There are various ways to represent sets of numbers.
For example, {1, 2, 3, . . . , 98, 99} and {n ∈ Z+ : n < 100} both represent the set of all positive integers less
than 100. The first representation is merely a listing of the elements of the set. The second representation
is read as “the set of all positive integers n such that n < 100”. In calculus, the most common type of set is
an interval. An interval (of real numbers) has one of the following nine forms:

(−∞, a) = {x ∈ R : x < a}
(a, b) = {x ∈ R : a < x < b};
(−∞, a] = {x ∈ R : x ≤ a};
[a, b) = {x ∈ R : a ≤ x < b};
(b, ∞) = {x ∈ R : x > b};
(a, b] = {x ∈ R : a < x ≤ b};
[b, ∞) = {x ∈ R : x ≥ b};
[a, b] = {x ∈ R : a ≤ x ≤ b};
(−∞, ∞) = R .

The intervals in the first column are bounded intervals and those in the second are unbounded intervals; an
unbounded interval contains numbers that are arbitrarily large. Note how the symbols ( and [ are used to
1.2 Numbers, Sets, and Functions 5

denote whether or not the endpoint is included in the interval; [a, b] is a closed interval since it contains both
its endpoints and (a, b) is an open interval since it contains neither of its endpoints.
The objects of study in calculus are functions. A function provides a relationship between two or more
quantities. The relationship can be exhibited with a graph, a table of values, a verbal description, or a
formula. For the most part, calculus deals with the situation in which the function is given by a formula.
Once again, it is assumed that the reader has some experience with functions. Let A and B be two nonempty
sets. A function f : A → B is a rule of correspondence that assigns to each element of A exactly one element
of B. If f assigns the element b of B to the element a of A, then we write f (a) = b. We say that b is the value
of f at a or that b is the output that corresponds to the input a. In this class, we will be concerned with
functions of the form f : I → R, where I is an interval of real numbers and R is the set of all real numbers.
The rule of correspondence will almost always be given by a formula. For example, we can define a function
f : R → R by f (x) = x2 . This function assigns the real number x2 to the real number x. It is important to
understand the subtle distinction between f and f (x); x represents the input, f is the function, and f (x) is
the output. It may help to think of f as a machine that accepts inputs x and turns them into outputs f (x).
However, the machine f is not the same as the output f (x).
The domain of a function f is the set {x ∈ R : f (x) is defined}, that is, the set of inputs for which
f (x) makes sense. The range of a function f is the set {y ∈ R : y = f (x) for some x}, that is, the set of all
possible outputs of the function. Finally, the graph of f is the set {(x, y) : y = f (x)}, the collection of all
ordered pairs that satisfy the equation. A graph of the equation y = f (x) provides an amazing link between
algebra and geometry; an algebraic equation takes on a geometric shape. You should be able to sketch the
graphs of simple functions such as y = x2 , y = 1/x, and y = sin x quickly and without a calculator as well
as modify them slightly by translation and scaling.
Let n be a positive integer. A polynomial P of degree n is a function of the form

P (x) = an xn + an−1 xn−1 + · · · + a1 x + a0 ,

where a0 , a1 , . . . , an are fixed real numbers and an ̸= 0. The numbers ai are known as the coefficients of P .
A nonzero constant function, a function that has the same value at every point, is referred to as a polynomial
of degree 0. For the record, a root of a polynomial is a real number r such that P (r) = 0. A rational
function is a ratio of two polynomials. The collection of all rational functions includes all polynomials
since the constant function 1 is a polynomial. The determination of the value of a rational function at a
particular real number involves only the operations of addition, subtraction, multiplication, and division. An
algebraic function is a function that involves these four operations along with the process of finding roots.

For example, the function g defined by g(x) = 3 4x2 + 5x − 1 is an algebraic function. The five operations
used to define algebraic functions are all familiar to students who have studied high school algebra, and, if
necessary, simple examples of these operations can be performed by students without a calculator.
In addition to the algebraic functions, there are many other functions considered in algebra and calculus.
The most familiar ones are trigonometric functions, inverse trigonometric functions, exponential
6 Chapter 1 The Derivative

functions, and logarithmic functions. These functions are known as transcendental functions because
the evaluation of these functions “transcends” the algebraic operations. There will be brief reviews of these
functions later in the text, but it is assumed that the reader has worked with them in the past.
We will also make use of two less familiar functions; the absolute value function and the greatest integer
function. Given a real number x, the absolute value of x, denoted by |x|, is defined by
{
x, if x ≥ 0;
|x| =
−x, if x < 0.

Note that |x| ≥ 0 for all x and that |x| = 0 if and only if x = 0. The number |x| can be interpreted as the
distance on the number line from the point x to the point 0. It follows that |a − b| represents the distance
between the numbers a and b. The greatest integer function, denoted by ⌊x⌋, represents the greatest
integer that is less than or equal to x. For example, ⌊8.62⌋ = 8, ⌊π⌋ = 3, and ⌊−4.1⌋ = −5. The graphs of
these two functions on the interval [−3, 3] are given below.
y y
...... ......
... .. ... ..
..... .. •
...
3 3
..... .. y = ⌊x⌋
..... 2 . •...........◦
..... .... 2
.....1 ...... y = |x| 1 •...........◦
.........
. ......
.......... x •...........◦ ......
.......... x
−3 −2 −1 1 2 3 4 −3 −2 −1 1 2 3 4
−1 .•..........◦
•...........◦
•...........◦ −3

Given two functions f and g, it is possible to combine them in several ways to obtain new functions:

addition : f + g is defined by (f + g)(x) = f (x) + g(x)


subtraction : f − g is defined by (f − g)(x) = f (x) − g(x)
multiplication : f g is defined by (f g)(x) = f (x)g(x)
division : f /g is defined by (f /g)(x) = f (x)/g(x), assuming that g(x) ̸= 0
composition : f ◦ g is defined by (f ◦ g)(x) = f (g(x))

Pay particular attention to the notation for these combinations of functions.

Problem: Find the domain of each of the functions f , g, and h, where

√ x √
9 − x2 , g(x) = √ 9 − x2 .
3
f (x) = , and h(x) =
9 − x2

Solution: Since the square root of a negative number is not a real number, the domain of the function f
consists of those values of x for which 9 − x2 ≥ 0. Thus, the domain of f is the closed interval [−3, 3]. The
domain of g is almost the same as the domain of f , but in this case we must also avoid division by 0. It
follows that the domain of g is the open interval (−3, 3). Finally, every real number has a cube root, so the
domain of h is R, the set of all real numbers.

Problem: Find the set of real numbers that satisfy the inequality 4 − |x − 1| ≥ 1.
1.2 Numbers, Sets, and Functions 7

Solution: The inequality can be written as |x − 1| ≤ 3. This inequality means that the distance from x to 1
is at most 3, or equivalently, −3 ≤ x − 1 ≤ 3. The solution set of the inequality is thus {x ∈ R : −2 ≤ x ≤ 4}.

Problem: For f (x) = x3 − 3x + 1, find f (x − 1) and f (x2 ).

Solution: To find the value of a composite function, simply replace each occurrence of x with the given
expression. Doing so yields

f (x − 1) = (x − 1)3 − 3(x − 1) + 1 = x3 − 3x2 + 3x − 1 − 3x + 3 + 1 = x3 − 3x2 + 3;

f (x2 ) = (x2 )3 − 3x2 + 1 = x6 − 3x2 + 1.

Exercises
1. Express the rational number 4/7 as a repeating decimal.
2. Find the rational number represented by the repeating decimal 0.19191919 . . ..
3. Let A = {0, 1}. For each set B, determine whether A ⊆ B, B ⊆ A, or A = B.

a) B = {x : x3 = x} b) B = {x : x = x} c) B = {x : x5 = −x}
4. Find the domain of each of the following functions.
2x √ x
a) f (x) = b) g(x) = 3 + 2x − x2 c) h(x) = √
x−5 4 − x2
5. Sketch each of the following graphs. You should be able to sketch these graphs without using a calculator.

a) y = x2 + 4 b) y = (x + 4)2 c) y = x − 2
1
d) y = |x + 3| e) y = |2x − 5| f) y =
x−2
6. Sketch the graph on the given interval.
a) y = ⌊x/2⌋ + 3, [0, 6] b) y = x − ⌊x⌋, [−3, 3] c) y = ⌊x2 − 1⌋, [0, 2]
7. Let f (x) = x2 − 3x, g(x) = 2x + 1, and h(x) = ⌊x⌋. Evaluate the following.
a) (f − g)(−1) b) (f g)(2) c) (f /g)(1)
d) (f ◦ g)(2) e) (g ◦ h)(π) f ) (h ◦ f )(1/2)
8. Given f (x) = x + 4x and g(x) = 2x − 3, find (f ◦ g)(x), (g ◦ f )(x), and (g ◦ g)(x).
2

9. Find functions
√ f and g so that h = f ◦ g.
a) h(x) = x4 + 4x2 + 1 b) h(x) = sin 4x c) h(x) = cos2 x
10. Find all of the roots of the polynomial x3 + 3x2 − 2x − 2. (Find one root by inspection, then use long division.)
11. Find the points of intersection of the graphs y = x2 + 4x and y = x + 10.
f (x + h) − f (x − 2h)
12. For the function f (x) = x2 , find and simplify the expression .
3h
13. Express the area of a regular hexagon as a function of its perimeter.
14. Write each of the sets as an interval.
a) {x ∈ R : |x − 6| ≤ 10} b) {x ∈ R : x2 + 4x − 5 < 0} c) {x ∈ R : 21 − |2x − 5| > 3}
15. Sketch a graph of the functions f and g defined by (one graph for each function)
{ 2 {
x + 1, if x ≤ 2; 1/x, if x < 0;
f (x) = and g(x) =
3 − 2x, if x > 2; 3x + 1, if x ≥ 0.
8 Chapter 1 The Derivative

1.3 The Limit Concept

The limit concept is the main idea which separates calculus from algebra. Limits are not easy—time and
effort are required to understand them fully—but this concept lies behind the major ideas in calculus. In
this section, we will work toward an intuitive understanding of limits.
The symbols lim f (x) = L are read as “the limit of f (x) as x approaches c is L”. Intuitively, this means
x→c
that the output values f (x) are close to L when the input values x are close to c but not equal to c. As
a simple example, the statement lim x2 = 9 means that if you take numbers near 3, such as 2.99 or 3.004,
x→3
and square them, you get numbers close to 9. This statement says much more than 32 = 9; it asserts that
numbers near 3 have squares near 9 and that the closer x is to 3, the closer x2 is to 9. This example may
make the limit concept seem trivial, but consider the following limits:

x−2 ln x 10x − 1
lim , lim , lim , and lim cos(π/x).
x→2 x2 − 4 x→1 x − 1 x→0 x x→0

In each case, the function is undefined at the point in question. However, all that matters is what happens
to the outputs when the inputs are close to c, not equal to c. In some cases, algebra can be used to find the
exact value of a limit:
x−2 x−2 1 1
lim = lim = lim = .
x→2 x2 − 4 x→2 (x − 2)(x + 2) x→2 x + 2 4
We will consider this method in more detail in Section 1.5. For other functions, a calculator can be used to
provide an estimate of a limit (assuming that the limit exists). If f (x) = (10x − 1)/x, then

f (0.01) = 2.3293, f (0.001) = 2.3052, f (0.0001) = 2.3029, f (0.00001) = 2.3026,

f (−0.01) = 2.2763, f (−0.001) = 2.2999, f (−0.0001) = 2.3023, f (−0.00001) = 2.3026,

so it appears that lim (10x − 1)/x is approximately 2.3026. It is also possible to determine limits from a
x→0
graph of a function; an example will be left for the exercises.
A function may not have a limit at a point. In this case, we say that lim f (x) does not exist. For
x→c
example, lim cos(π/x) does not exist. To see this, let f (x) = cos(π/x) and note that f (1/n) = (−1)n for
x→0
each positive integer n. As n increases, the numbers 1/n get closer to 0, but the output values are not
getting close to a specific real number. Hence, the limit does not exist. (This fact is also apparent from a
graph of the function.)
There are other variations on the limit concept. For example, the symbols lim+ f (x) = L are read as
x→c
“the limit of f (x) as x approaches c from the positive side is L”, and they mean that the output values f (x)
are close to L when the input values x are close to c but larger than c. This limit is called the right-hand limit
of f at c since the input values of f are to the right of c on the number line. The symbols lim− f (x) = L,
x→c
which denote the left-hand limit of f at c, have an analogous interpretation. The left-hand and right-hand
limits of f at c are known as one-sided limits since the values of x approach c from only one side. As simple
examples, lim ⌊x⌋ = 5 and lim ⌊x⌋ = 4. It should be clear that lim f (x) = L if and only if lim f (x) = L
x→5+ x→5− x→c x→c−
and lim+ f (x) = L. In particular, if the two one-sided limits are different, then the two-sided limit does not
x→c
exist.
1.3 The Limit Concept 9

Some simple and useful properties of limits are listed below. It is possible to prove these rigorously (see
Section 1.23 for the definition of a limit), but for now it is sufficient that the results should make intuitive
sense. Suppose that f and g have limits at c and that k is a constant. Then

1. lim kf (x) = k lim f (x); 2. lim (f (x) + g(x)) = lim f (x) + lim g(x);
x→c x→c x→c x→c x→c

3. lim (f (x) − g(x)) = lim f (x) − lim g(x); 4. lim (f (x)g(x)) = lim f (x) · lim g(x);
x→c x→c x→c x→c x→c x→c

f (x) lim f (x)


x→c
5. lim = , assuming that lim g(x) ̸= 0.
x→c g(x) lim g(x) x→c
x→c
Analogous properties hold for one-sided limits. Note the use of these properties as you read the next few
sections.

Exercises
1. Use algebra to evaluate each of the limits.
1 − x2 x2 − 4 x2 − 3x
a) lim b) lim c) lim
x→1 x−1 x→−2 x2 +x−2 x→3 2x2− 3x − 9
2. Use a calculator to estimate the limit to the nearest thousandth. Your solution should include a table of values;
be sure to check numbers on either side of c. Radian measure is assumed whenever trigonometric functions
appear.
1 − cos x sin x − tan x ( )1/x
a) lim b) lim c) lim 1 + x
x→0 x2 x→0 x3 x→0

ln x log10 x x
d) lim e) lim f ) lim
x→1 x−1 x→1 x−1 x→0 5x − 3x
ax − 1
3. Estimate lim to the nearest thousandth for a = 2, 3, 4, 5. Your solution should include a table of values.
x→0 x
4. Evaluate each of the following limits by thinking about the behavior of the functions and/or sketching a graph.
x ( ) ( )
a) lim b) lim x − ⌊x⌋ c) lim x − ⌊x⌋
x→0− |x| x→2− x→4+
( ) ( ) √
d) lim ⌊x⌋ + ⌊−x⌋ e) lim ⌊x/3⌋ − ⌊x⌋ f ) lim x sin(π/x)
x→5 x→−1− x→0+

5. Explain why each of the following limits does not exist.


x−2
a) lim ⌊x⌋ b) lim c) lim sin(2/x)
x→−3 x→2 |x − 2| x→0

6. Use the graph of the function f to find the quantities listed to the right of the graph.
y.
......
... ..
a) lim f (x)
x→−1− e) lim f (x)
x→−4+
.• • ..• .
....... 2
y = f (x) b) lim f (x)
..
... .
...
. x→−1+ f ) lim f (x)
.
... .
.
.... ◦.. 1 ..◦. ◦.................................
x→3

..
.........
. ...
. ..
.
c) lim f (x) g) f (1)
. ....... ..... x→1
....... .... ..........
..... x d) lim f (x) h) f (2.1)
−4 −3 −2 −1 1 2 3 4 x→2−

7. Suppose that lim f (x) = 5 and lim g(x) = 12. Find each of the following limits.
x→2 x→2
4f (x) + 1
a) lim (2f (x) − g(x) − 3x) b) lim 3f (x)g(x) c) lim
x→2 x→2 x→2 g(x)
10 Chapter 1 The Derivative

1.4 Continuous Functions

When asked for a definition of a continuous function, most students say that it is a function whose graph
has no breaks in it. Although it has intuitive merit, it should be clear that this visual representation of
continuity is not suitable as a mathematical definition. To drive home this point, remember that a graph,
even a computer-generated graph, is drawn by plotting some points and sketching a curve through those
points. A sketch created in this fashion assumes in advance that the function is continuous. To use the
visual idea to obtain a mathematical definition, note that a break in a graph occurs at some point on the
graph. At such a point, something goes wrong with the limit of the function.

DEFINITION 1.1 A function f is continuous at a number c if lim f (x) = f (c).


x→c

Continuous functions are predictable in the sense that inputs near c generate outputs near f (c). The
definition can be broken into three parts: (i) the function is defined at c, (ii) the function has a limit at c,
and (iii) the value of the limit and the value of the function are the same. It follows that a function can fail
to be continuous in one of three ways. These are illustrated by the following functions:
{ x
2x − 1 , if x ̸= 0;
f (x) = , g(x) = |x| and h(x) = ⌊x⌋ + ⌊−x⌋.
x 0, if x = 0;

Each of these functions is not continuous at 0; f is not defined at 0 (but it does have a limit at 0), g does
not have a limit at 0, and the limit of h at 0 (which is −1) does not equal h(0).
The next two theorems can be proved using the definition of continuity and the properties of limits. The
details will be omitted, but several of the exercises provide an indication of the nature of the proofs.

THEOREM 1.2 Let f and g be continuous functions and let k be a constant. Then the functions kf ,
f + g, f − g, f g, f /g (assuming g ̸= 0), and f ◦ g are continuous functions.

THEOREM 1.3 Polynomials, rational functions, algebraic functions, trigonometric functions, inverse
trigonometric functions, exponential functions, logarithmic functions and the absolute value function are
continuous at all points for which they are defined. The greatest integer function has a discontinuity at each
integer.

These two theorems indicate that most of the familiar functions are continuous at every point for which
they are defined. We conclude this section with a very useful property of continuous functions.

THEOREM 1.4 Intermediate Value Theorem If f : [a, b] → R is continuous on [a, b] and v is any
number between f (a) and f (b), then there is a number c in (a, b) such that f (c) = v.

The Intermediate Value Theorem has a simple geometric interpretation. The graph of y = v is a
horizontal line that has a y-intercept that lies between f (a) and f (b). As indicated in the figure, the graph
y = f (x) of a continuous function that starts at the point (a, f (a)) and ends at the point (b, f (b)) must cross
this horizontal line at some point c between a and b. Thinking of a continuous function as one whose graph
1.4 Continuous Functions 11

can be sketched without raising the pencil makes this result plausible, but a proof using the definition of
continuity involves ideas we have not discussed and will be omitted.
y
..
... .....
.
.......
... ..

... .....y = f (x)


f (b)
...
.
v ...................................................................................
..... .
f (a) .. .... ...
... ..
......
......
.......... x
a c b

As a simple illustration of this theorem, we will prove that the equation x3 = 4x + 5 has a solution. Let
f be defined by f (x) = x3 − 4x − 5. Since it is a polynomial, the function f is continuous on the interval
[2, 3]. Note that f (2) = −5 and f (3) = 10. By the Intermediate Value Theorem, there exists a number c in
(2, 3) such that f (c) = 0. Since 0 = f (c) = c3 − 4c − 5, the number c is a solution to the equation x3 = 4x + 5.

Exercises
1. List the discontinuities of the given function.
2x + 3 2 + sin x x2 + 4
a) f (x) = b) g(x) = c) h(x) =
x2 − 4x − 5 x3 − 6x 2x − 1
2. Sketch the graph of a single function f with all the following properties: (i) f is defined on [0, 5], (ii) f is
continuous except at 1 and 4, (iii) f has different one-sided limits at 1, and (iv) f has a limit at 4.
3. Find a value for the constant a for which the given function is continuous for all real numbers. Explain your
reasoning.
{ { 2 { 2
4 − x2
, if x ̸= 2; ax , if x ≤ 3; 2x , if x ≤ a;
a) f (x) = x−2 b) g(x) = c) h(x) =
a, if x = 2. 4 − ax, if x > 3. x + 3, if x > a.

4. Prove that the function f (x) = ⌊x⌋ + ⌊−x⌋ is not continuous at any integer.
{
4 − x2 , if x ≤ 2;
5. Consider the function f defined by f (x) = Show that f satisfies the conclusion of the
x + 2, if x > 2.
Intermediate Value Theorem on the interval [0, 5], but not on the interval [1, 5]. (Remember to check every value
of v between f (a) and f (b).)
6. Prove that the polynomial x3 − 3x2 − 100 has a root. Find an interval of length one in which the root lies.
7. Prove that there is a number that is exactly 4 less than its cube and another number that is exactly 5 more
than its cube.
8. Prove that the equation x4 = 3x has two positive solutions.
9. Suppose that f : [0, 2] → R is continuous on [0, 2] and that f (0) = f (2). Prove there is a point c ∈ [0, 1] such
that f (c + 1) = f (c). Hint: Consider the function g defined by g(x) = f (x + 1) − f (x) on the interval [0, 1].
10. Suppose that f and g are continuous functions. Use the definition of continuity and the properties of limits
listed at the end of Section 1.3 to prove that the functions f + g and f g are continuous.
11. Let f be defined by f (x) = C, where C is a constant, and let g be defined by g(x) = x. Explain (using the
definition) why f and g are continuous functions. Then use these functions and Theorem 1.2 to show that the
polynomial P defined by P (x) = 2x2 − 3x + 4 is continuous. The general result for polynomials is proved in a
similar way.
12. Explain how the continuity of polynomials shows that rational functions are continuous at every point for which
they are defined.
12 Chapter 1 The Derivative

1.5 Evaluating Limits Algebraically

The limit of a continuous function is easy to evaluate. If f is continuous at c, then lim f (x) = f (c); the
x→c
value of the limit is the value of the function. If the function is not continuous, then more work is involved in
finding the limit. In such cases, the limit may or may not exist. If the limit does exist, it cannot be found by
plugging a number into a function. Many of the limits that appear in calculus are of this latter type. Often
direct substitution leads to 0/0, a meaningless ratio. However, it is sometimes possible to use algebra to
simplify the function so that direct substitution becomes possible. The most common algebraic techniques
are factoring, expanding, finding a common denominator, and multiplying by the conjugate. Each of these
is illustrated in the examples below.
The first example illustrates how factoring can simplify a limit (this technique has already been used in
Section 1.3):
x2 − 4 (x − 2)(x + 2) x+2 4
lim = lim = lim = .
x→2 2x2 + 3x − 14 x→2 (x − 2)(2x + 7) x→2 2x + 7 11
Direct substitution of x = 2 in the original limit results in 0/0. After factoring, the function that re-
mains is continuous and the limit can be found by substitution. The second example shows how expanding
(multiplying out) a quantity can help find a limit.

(2 + x)3 − 8 12x + 6x2 + x3 12 + 6x + x2


lim = lim = lim = 2.
x→0 6x x→0 6x x→0 6

A complex fraction is one in which there are fractions inside of fractions. When this occurs, it is usually a
good idea to find a common denominator and simplify so that only one fraction appears. Remember that
dividing by a is the same as multiplying by 1/a.

1
−1 ( 1 −2x + 2 ) ( 1 −2(x − 1) ) −2
lim 2x − 1 = lim · = lim · = lim = −2.
x→1 x−1 x→1 x − 1 2x − 1 x→1 x−1 2x − 1 x→1 2x − 1

Finally, there are situations in which multiplying by a conjugate is useful. This is most often helpful when
a square root is involved. Since the product (a + b)(a − b) = a2 − b2 has no middle terms, multiplying a
√ √ ( √ )( √ )
quantity such as c − d by c + d eliminates the square root: c − d c + d = c2 − d. This technique
is illustrated in the next example.
( x−2 √ (√ )
x−2 6 − x + x) (x − 2) 6 − x + x
lim √ = lim √ · √ = lim
x→2 6 − x − x x→2 6−x−x 6−x+x x→2 6 − x − x2
(√ ) (√ )
(x − 2) 6 − x + x − 6−x+x 4
= lim = lim =− .
x→2 (2 − x)(3 + x) x→2 3+x 5

Another result that is sometimes useful for evaluating limits is the squeeze theorem. As with the other
properties of limits, we will accept it on the basis of strong intuitive reasoning.

THEOREM 1.5 Squeeze Theorem for Functions Suppose that g(x) ≤ f (x) ≤ h(x) for all x that
are near c but not equal to c. If lim g(x) = L = lim h(x), then lim f (x) = L.
x→c x→c x→c
1.5 Evaluating Limits Algebraically 13

To illustrate this theorem, consider lim x4 sin(π/x). Since the sine function has values between −1 and
x→0
1, it is easy to see that −x4 ≤ x4 sin(π/x) ≤ x4 for all nonzero values of x. Since both lim −x4 = 0 and
x→0
lim x4 = 0, it follows from the squeeze theorem that lim x4 sin(π/x) = 0.
x→0 x→0
As you might suspect, algebra alone is not sufficient to evaluate every limit. As an example, consider
sin x
lim . Substituting 0 for x gives 0/0, which is not helpful, and there is no apparent algebra which will
x→0 x
simplify the fraction. Using a calculator to make a table of values of the function sin x/x for x near 0, we
find that a reasonable conjecture is that the limit is 1. (It is important to realize that radian measure is
assumed here.) We will prove this fact in Section 1.17. Once this limit is known, other related limits can be
found using some algebra. First, note that all of the following limits are 1:

sin 2x 5t sin θ2
lim , lim , and lim .
x→0 2x t→0 sin 5t θ→0 θ 2

In each case, the argument of the sine function is the same as the other term in the fraction and this term
is going to 0. Using this idea and the algebraic properties of limits, we find that
( )
tan 4x sin 4x sin 4x 1
lim = lim = lim 4 · · =4·1·1=4
x→0 x x→0 x cos 4x x→0 4x cos 4x
sin2 x 1 − cos2 x
and lim = lim = lim (1 + cos x) = 2.
x→0 1 − cos x x→0 1 − cos x x→0

Exercises
1. Use algebra to evaluate each of the following limits.
x2 + 2x − 3 x3 − 4x x2 − 1
a) lim 2 b) lim 4 c) lim
x→−3 x − x − 12 x→2 x − 16 x→1 x3 − 1

x−4 x+4−2 (h + 4)2 − 16
d) lim √ e) lim f ) lim
x→4 x−2 x→0 x h→0 h
1 1

x2 − 4 (2h + 5)2 − 25 (3 + h)2 9
g) lim h) lim i) lim
x→2 1 1 h→0 3h h→0 h

x 2
x2 + ax − 7
2. Find a value of a so that lim exists and find the value of the limit.
x→2 x2 − 2x
3. Use the squeeze theorem to evaluate each of the following limits.
( )x
a) lim x2 cos(1/x) b) lim x arctan(4/x) c) lim 10 + 21/x
x→0 x→0 x→0+

4. Evaluate each of the following trigonometric limits.


tan x sin 7x sin 2x
a) lim b) lim c) lim
x→0 x x→0 x x→0 sin 3x
tan 3x 1 − cos x x2
d) lim e) lim f ) lim
x→0 sin 5x x→0 x x→0 1 − cos x
14 Chapter 1 The Derivative

1.6 Slope of a Curve

Consider the graph of a “smooth” continuous function (the graph has no sharp corners) and pick a point on
the graph. If you look at a small portion of the graph that contains the point, the graph resembles a straight
line. As the viewing area shrinks (and the graph is magnified), the effect becomes more dramatic. This sort
of behavior is easy to see on a computer graph that has a zoom feature. The line resembling the graph is
called the tangent line to the curve at the given point and its slope can be interpreted as the slope of the
curve at the point. To compute the slope of the curve y = f (x) at the point (c, f (c)), take any other point
(v, f (v)) on the graph and determine the slope of the secant line—the line connecting the two points on
the graph. The limit of the slopes of the secant lines as v → c is the slope of the tangent line at c (see the
figure).
y .
..... .
.......
.. y = f (x)
.. . f (v) − f (c)
. .....
............................ secant line
.
msec =
v−c
... ...
.... ....
f (v) ....
...
....
f (v) − f (c)
.. ...
... .............
...... tangent line
mtan = lim
.... ......................
..
. ... ........
v−c
f (c) ......................................................................
..
. ..
v→c
.. ....
.....
......
......
.....
............ x
c v

The slope of the secant line is given by the difference quotient (f (v) − f (c))/(v − c); it is the quotient of two
differences. Note that v = c gives the meaningless ratio 0/0. The slope of a curve is essentially the slope of
the line joining two adjacent points on the curve, but since there are no adjacent points it is necessary to
use a limit process as one point on the graph slides toward a fixed point on the graph. Thus, the geometric
problem of determining the slope of the tangent line reduces to the algebraic problem of evaluating the limit
f (v) − f (c)
lim . This discussion is summarized in the following definition.
v→c v−c

DEFINITION 1.6 The slope of the curve y = f (x) when x = c is given by

f (v) − f (c) f (c + h) − f (c)


lim or lim ,
v→c v−c h→0 h

provided that the limit exists. (The two limits give the same value.) This number, which is sometimes
dy
denoted by , is the slope of the line tangent to the curve at the point (c, f (c)). It also represents the
dx x=c
rate of change of y = f (x) with respect to x at the point c.

The notation dy/dx for the slope of a curve is due to Leibniz and arises in a natural way from the equation
dy ∆y
= lim , where ∆x and ∆y represent the change in x and change in y, respectively. Although the
dx ∆x→0 ∆x
symbols dy and dx do not really have any meaning—it is only their ratio that does—it is sometimes quite
useful to think of dy as a little bit of y and dx as a little bit of x. In fact, the “d” in the symbol dy stands
for difference; dy is the difference between two extremely close y values.
1.6 Slope of a Curve 15

As an example, let f be the function defined by f (x) = x2 . The point (1, 1) is on the graph of this
function. The tangent line and the computations required to find its slope are shown below.
y
..
.....
.....
...
... . .. y = x2
... . f (v) − f (1)
............
.. mtan = lim
... . v→1 v−1
... .... tangent line
... ......... y = 2x − 1
. . = lim
v2 − 1
..
.
... ... v→1 v − 1
... 1 .. ......
.
.... .. ..

....... .....................
.
= lim (v + 1) = 2.
v→1
..... .. ..
.....
............ x
...
..
1
..

As a second example, let g be the function defined by g(x) = 10/(x2 + 6). The slope of the graph y = g(x)
when x = 2 is
10
g(v) − g(2) 2+6
−1 4 − v2 2+v 2
lim = lim v = lim = lim − 2 =− .
v→2 v−2 v→2 v−2 v→2 (v − 2)(v + 6)
2 v→2 v +6 5

2
An equation for the tangent line to the graph at the point (2, 1) is thus y − 1 = − (x − 2). The number
5
−2/5 represents the rate of change of y with respect to x for this function when x = 2. In particular, one
would expect the y-values to be decreasing when the x-values are increasing near the point 2.
The fact that the slope of a graph can be interpreted as a rate of change opens the door to a number of
physical applications. For instance, if x represents time and y represents distance, then the rate of change of
y (distance) with respect to x (time) gives velocity. In other words, the slope of a distance graph is velocity.
Similarly, the slope of a velocity graph is acceleration.

Exercises
1. Find the slope of the curve y = f (x) at the given point.
1
a) f (x) = x2 , x = 3 b) f (x) = x3 , x = 1 c) f (x) = , x=5
x
x √
d) f (x) = x2 − x3 , x = 1 e) f (x) = , x=2 f ) f (x) = 4 x, x = 4
x+1
2. Find the x-intercept of the line tangent to y = x2 when x = c, where c is any positive number.
3. Find equations for the tangent and normal lines to the curve y = x4 when x = 1. (A normal line is perpendicular
to the tangent line.)
4. Use both lim and lim forms to find the slope of the graph y = x3 at a generic point c.
v→c h→0
5. Find the slope of the curve y = f (x) at a generic point c in its domain.
10
a) f (x) = x3 + 2x2 b) f (x) = 4/x c) f (x) = √
x
6. Find the values of x for which the tangent line to the curve y = x3 + 2x2 is horizontal.
7. There are two points on the curve y = 4/x for which the tangent line goes through (3, 1). Find these two points.

8. The velocity v in ft/sec of a particle at time t seconds is given by v(t) = 10/ t. What is the velocity of the
particle when its acceleration is −5 ft/sec ? What does the negative sign for acceleration indicate?
2
16 Chapter 1 The Derivative

1.7 The Derivative

Let f be a continuous function and consider the graph of y = f (x). Unless f is a linear function, it should
be clear that the slope of the graph varies from point to point on the graph. For any given value of x, the
graph has a certain slope. With a little thought, one can see that this defines a function; given an input
x, an output—namely, the slope of the curve at the point (x, f (x))—is generated. This function is usually
denoted by f ′ and called the derivative of f since it is derived from the function f . The derivative concept
is given a formal definition below. As the reader should be certain to understand, the two forms of the limit
given in the definition are equivalent.

DEFINITION 1.7 The derivative of a function f is another function f ′ defined by

f (v) − f (x) f (x + h) − f (x)


f ′ (x) = lim or f ′ (x) = lim
v→x v−x h→0 h

for each value of x in the domain of f for which the limit exists.

The number f ′ (c) represents the slope of the graph y = f (x) when x = c; it thus gives the rate of change
of y with respect to x at the point c. If f has a derivative at the point c, then f is said to be differentiable
at c. If f has a derivative at each point of an interval I, then f is said to be differentiable on I. As we will
see, most of the familiar functions are differentiable at each point in their domains.
To illustrate the definition, the derivative of the function f (x) = x3 + 2x2 is
( ) ( )
′ (x + h)3 + 2(x + h)2 − x3 + 2x2
f (x) = lim
h→0 h
( 3 ) ( )
x + 3x2 h + 3xh2 + h3 + 2x2 + 4xh + 2h2 − x3 + 2x2
= lim
h→0 h
( )
= lim 3x2 + 3xh + h2 + 4x + 2h = 3x2 + 4x.
h→0

As is evident in this example, the computations involved in finding a derivative using the definition are not
all that difficult but they can become a bit tedious.
It should be clear that a differentiable function is continuous; there can be no slope at a point on a graph
where there is a break in the graph.

THEOREM 1.8 If f is differentiable at c, then f is continuous at c.

Proof. Suppose that f is differentiable at c. Performing some algebra and using the limit definition of the
derivative, we find that
( f (x) − f (c) )
lim f (x) = lim (x − c) + f (c) = f ′ (c) · 0 + f (c) = f (c).
x→c x→c x−c

This shows that f is continuous at c.

The converse of this theorem is not true; there are continuous functions that are not differentiable. Any
function whose graph is connected but has a sharp corner is an example of this type of function; the absolute
1.7 The Derivative 17

value function at the origin is a simple example. The reason that corners create a problem lies in the fact
that limits are two-sided. Consequently, in order for a derivative to exist, the slope of the curve must be the
same on each side of the point. At a corner, the slopes are different on the left and the right. Such points
are easy to identify on a graph:
y
......
......
..
.....
.. .. The function f is not differentiable at the
..........
. .
.... y = f (x) points a, b, and c, but it is differentiable
.
.. .
... .... ......... ..... at every other point.
...
. ... . .... ..
.. ... ... ... ....
... ..
....... ..........
...... x
a b c

For reasons which should now be clear, the graph of a differentiable function is said to be a smooth graph.

Exercises
1. Use the definition of the derivative to find f ′ (x).
a) f (x) = x2 b) f (x) = x3 c) f (x) = x4
√ 5
d) f (x) = 3x2 − 4x + 5 e) f (x) = x f ) f (x) = 2x +
x
2. Find equations for the tangent and normal lines to the curve y = x2 when x = 3.
3. Find the x-intercept of the line tangent to the graph of y = x3 when x = 2.
4. Let f (x) = x3 − 4x2 + x. Find the values of x for which the slope of the graph of f is 4.
5. The graph of a function y = f (x) is given below. Sketch the graph of the function f ′ .
y.
..
...
......

.
.... .
.
... .... ..
... ..... ... y = f (x)
.. ... .
... ... ...
... .
.
... ... ... ................ x
... .
.. ... ...
.... ..
.....
6. Sketch the graph of a differentiable function f with the following properties: f ′ (−2) = 0, f (1) = 3, f ′ (1) = −5,
f (2) = 1, and f ′ (2) = 0.
7. Sketch the graph of a continuous function that is not differentiable at the x values of 1, 3, and 6 but is
differentiable at every other point.
8. Give a formula for a function that is continuous but not differentiable at x = 3/2.
9. Use the definition of the derivative to show algebraically that the given function is not differentiable at 0, that
is, show that lim f (h)/h does not exist. Describe what is happening geometrically. Even though the derivative
h→0
does not exist, can you find a tangent line for the graph at the origin?
a) f (x) = x1/3 b) f (x) = |x| c) f (x) = x2/3

10. For each real number x, let D(x) be the distance from x to the integer that is closest to x. Where does the
function D fail to have a derivative? What is the derivative of D when it exists? Begin by sketching a graph of
the function.
18 Chapter 1 The Derivative

1.8 Differentiation of Polynomials

The definition of the derivative can be used to find the derivative of most functions, but as the formula for
the function becomes more complicated so does the algebra required to simplify the quotient in the definition
of the derivative. Once the derivative of a function has been found, there is no reason to use the definition
again for that function; the result can be recorded for later use. In this way, a list of formulas for finding
derivatives has been compiled. Each formula is discovered using the definition of the derivative, but then
the formula rather than the definition is used to find the derivative. Formulas for derivatives come in two
different types:
1) formulas for the derivatives of specific functions;
2) formulas for the derivatives of combinations of functions.
In this section, we will consider a few simple formulas for computing derivatives.
One of the simplest types of functions is the power function, f (x) = xr , when r is a nonzero integer.
The exercises in the previous section generate the following conjecture:
If r is a positive integer and f (x) = xr , then f ′ (x) = rxr−1 .
This formula is referred to as the power rule and is an example of a derivative formula for a specific function.
A proof of this conjecture uses the definition of the derivative and a factoring formula:

v r − xr ( )
f ′ (x) = lim = lim v r−1 + v r−2 x + v r−3 x2 + · · · + vxr−2 + xr−1 = rxr−1 .
v→x v − x v→x

For example, if f (x) = x2006 , then f ′ (x) = 2006x2005 .


In Section 1.2, several ways of combining functions were discussed. It should be clear that the derivative
of an algebraic combination of two functions f and g should somehow be related to the derivatives of f and
g. The following theorem gives the relationship for some of these combinations.

THEOREM 1.9 Let F and G be functions and let k be a constant. If F and G are differentiable on an
interval I, then the functions kF , F + G, and F − G are differentiable on I and

a) constant multiple rule (kF )′ = kF ′ ,

b) sum rule (F + G)′ = F ′ + G′ ,

c) difference rule (F − G)′ = F ′ − G′ .

Proof. We will give a proof of the result for sums. Using the definition of the derivative, we obtain

(F + G)(v) − (F + G)(x)
(F + G)′ (x) = lim
v→x v−x
( ) ( )
F (v) + G(v) − F (x) + G(x)
= lim
v→x v−x
( F (v) − F (x) G(v) − G(x) )
= lim + = F ′ (x) + G′ (x).
v→x v−x v−x

The proofs of the other parts are similar.


1.8 Differentiation of Polynomials 19

In words, the sum rule says that the derivative of a sum is the sum of the derivatives. Combining these
simple rules with the power rule (and the obvious fact that the derivative of a constant function is zero), it
is possible to find the derivative of any polynomial without using the definition. For example,

( )′ ( )′ ( )′ ( )′ ( )′
2x4 − 3x3 + 6x − 7 = 2 x4 − 3 x3 + 6 x − 7 = 2 · 4x3 − 3 · 3x2 + 6 · 1 − 0 = 8x3 − 9x2 + 6.

In other words, if f (x) = 2x4 − 3x3 + 6x − 7, then f ′ (x) = 8x3 − 9x2 + 6; all you need to do is find the
derivative of each term and combine the results in the “obvious” way. It is immediately apparent that this
is much easier than using the definition of the derivative.
If y = f (x), then some common notations for the derivative are

dy d df
f ′ (x), , f (x), , Dx f (x).
dx dx dx

The first three notations are the only ones that will be used in this text. (Please note that, for the purposes
of this course, y ′ is not an acceptable notation for the derivative; see the last exercise in this section.) For
example, the power rule and sum rule can be written as

d r d ( ) d d
x = rxr−1 and F (x) + G(x) = F (x) + G(x),
dx dx dx dx
dy
respectively. The derivative of y = f (x) evaluated at the point c is denoted by f ′ (c) or . The latter
dx x=c
notation is a bit awkward but some way of distinguishing between the derivative and its values is needed.

Exercises
1. Use formulas in this section to find the derivative of the given function. Pay particular attention to your notation.
a) f (x) = 4x − 7 b) g(x) = 3x2 + 13x − 8 c) h(x) = −2x3 + 3x2 − 5x + 6
2u6 + 3u4 − u2
d) F (t) = 3t5 − 2t4 + t − 2 e) G(z) = 12 z 4 + 13 z 3 − z 2 f ) H(u) =
12
g) y = x100 + 100x h) s = 0.01t5 + 0.2t3 − 4t i) z = 2w30 − 1
33
w11

2. Find equations for the tangent and normal lines to the curve y = x6 − 4x5 + 2x4 + x + 3 when x = 1.
3. Find the x-intercept of the line tangent to the graph of y = x4 − 3x3 + 5x when x = 2.
4. Let f (x) = 3x5 − 10x3 − 45x + 7. Find the values of x for which the tangent line is horizontal.
√ of a particle at time t seconds is given by s = 0.02t − 0.4t + 60t. What is the velocity
5 3
5. The position s in meters
of the particle after 10 seconds?
t5 + 2t4 + 3t3
6. Suppose that the height h in inches of a beanstalk after t hours is h = . What is the rate of growth
360
of the beanstalk at the end of two days? Give your answer in feet per minute.
7. Suppose that f and g are differentiable functions and that f (2) = 9, f ′ (2) = 5, g(2) = 3, and g ′ (2) = −2. Find
h(2) and h′ (2) for the given function h.
a) h = 4f b) h = f + g c) h = f − g
d) h = 2f − 3g e) h = (f + 2g)/5 f ) h = f (2)g
8. Prove the constant multiple rule and the difference rule.
dy dy
9. For y = 4x2 t3 , find and . Note that y ′ would be ambiguous in this situation.
dx dt
20 Chapter 1 The Derivative

1.9 Power Rule

In the last section, we proved that the derivative of xr is rxr−1 for the case in which r is a positive integer.
It turns out that this derivative formula is valid for any nonzero rational number r, but a little more effort is
involved to prove this fact. We begin by proving the reciprocal rule: if f is a differentiable function, then
( 1 )′ f′ d 1 f ′ (x)
= − 2 or =− ,
f f dx f (x) (f (x))2
for all values of x for which f (x) ̸= 0. To prove this rule, we use the definition of the derivative, a little
algebra, and the fact that f is a continuous function.
1 1

d 1 f (v) f (x)
= lim
dx f (x) v→x v−x
( 1 f (x) − f (v) )
= lim ·
v→x v − x f (x)f (v)
( 1 f (v) − f (x) )
= lim − ·
v→x f (x)f (v) v−x
1 ′ f ′ (x)
=− · f (x) = −
(f (x))2 (f (x))2
Notice how algebra was used to manipulate the original fraction into one that contained the definition of the
derivative of f . Where is the continuity of f used in this proof?
d r
If r is a positive integer, then dx x = rxr−1 . Properties of exponents and the reciprocal rule then yield
d −r d 1 rxr−1 rxr−1
x = r
= − r 2 = − 2r = −rx−r−1 ,
dx dx x (x ) x
which shows that the power rule is valid for negative integers. The power rule for integers can then be used
to prove the result for arbitrary rational numbers.

d r
THEOREM 1.10 Power Rule If r is a nonzero rational number, then x = rxr−1 for all values
dx
of x for which both sides of the equation are defined.

Proof. Let r be a nonzero rational number. Then r = p/q, where p is a nonzero integer and q is a positive
integer. Define a function f by f (x) = xr = xp/q . We will use the definition of the derivative to find f ′ . Let
w = v 1/q and z = x1/q . Since the function x1/q is continuous, it follows that w → z as v → x. Hence,
v p/q − xp/q wp − z p ( wp − z p w − z )
f ′ (x) = lim = lim q = lim ·
v→x v−x w→z w − z q w→z w − z wq − z q
1 p p
= pz p−1 · q−1 = z p−q = x(p/q)−1 = rxr−1 .
qz q q
(The values of the limits follow from the fact that they have already been established for integers.) We have
thus shown that the derivative of xr is rxr−1 . This completes the proof.

When using the power rule, it is important to be careful with exponents:


d ( √ 6 ) d ( 1/2 ) 2 3 2x + 3
4 x− √ = 4x − 6x−1/2 = 2x−1/2 + 3x−3/2 = 1/2 + 3/2 = 3/2 ;
dx x dx x x x
d ( 6 5 ) d ( 5 ) 15 12 15 3(8x − 5)
4− 2 + 3 = 4 − 6x−2 + x−3 = 12x−3 − x−4 = 3 − 4 = .
dx x 2x dx 2 2 x 2x 2x4
1.9 Power Rule 21

Let’s look at a different type of example. Let a and b represent nonzero real numbers and consider the
b
function f defined by f (x) = ax + . The numbers a and b are referred to as parameters; for each pair of
x
numbers a and b, we obtain a different function f . The reader should sketch several graphs y = f (x) for
different choices of a and b. (A good place to start are the (a, b) pairs (1, 2), (−1, 2), (1, −2), and (−1, −2).)
We will prove that none of the tangent lines to any of the curves represented in this way goes through the
origin. Let w be any nonzero number (so w is in the domain of f ) and consider the tangent line to the curve
at (w, f (w)). The slope of the tangent line at this point is f ′ (w), and the slope of the line through this point
and the origin is f (w)/w. In order for the tangent line to go through the origin, these two slopes must be
b
equal. This leads to the equation wf ′ (w) = f (w). Since f ′ (x) = a − 2 , the number w must satisfy the
x
equation
( b ) b 2b
w a − 2 = aw + ⇒ = 0.
w w w
Since no value of w can satisfy this last equation, there is no point on the graph for which the tangent line
passes through the origin. By using the parameters a and b, we have shown in a single step that an infinite
collection of functions has the same property. This is one of the advantages of using parameters.

Exercises
1. Find and simplify the derivative of the given function.
√ √ 7 5 12
a) f (x) = 2 4 x − 6 3 x b) g(x) = c) h(x) = − 2
3x4 x x
2t − 5 z 3 − 4z + 6 √
d) F (t) = √ e) G(z) = f ) H(s) = s(3s − 2 s)
4 t z2
4 7 2
g) y = 3x2 − h) z = 2w4 − 3w3 + 6w2 − 5w i) s = − √
x t 5 t
4
2. Find an equation for the line tangent to the graph of f (x) = 3x + when x = 2.
x
3. Let c and k be positive numbers. The tangent line to the curve xy = k when x = c cuts off a triangle in the first
quadrant. Find the area of this triangle and note that its value is independent of c.

4. Find both points on the curve y = x for which the tangent line goes through the point (5, 3).

9
x−1
5. Evaluate lim by interpreting the limit as a derivative.
x→1 x − 1
6. Given that f is a differentiable function, use the definition of the derivative to obtain the formula for the
derivative of the function f 2 . (The computations are similar to those in the proof of the reciprocal rule.)
22 Chapter 1 The Derivative

1.10 Chain Rule

One of the most important formulas for differentiation is the formula for the derivative of a composite
function. This formula is known as the chain rule. It indicates how the derivative of the composite function
F ◦ G is related to the derivatives of the functions F and G. The formula itself is quite simple, but some
practice is required before its use becomes second nature. As a start, note that the symbols used in the
definition of the derivative simply represent quantities with certain properties. For a function F , both of the
following equations are equivalent variations on the definition of the derivative:
( )
′ F (θ) − F (z) )

( F (w) − F G(x)
F (z) = lim and F G(x) = lim .
θ→z θ−z w→G(x) w − G(x)
It is the last equation that is relevant for a proof of the chain rule. Suppose that G is a continuous function.
Then G(v) → G(x) as v → x. Using G(v) for w, we find that
( ) ( ) ( ) ( )

( ) F G(v) − F G(x) F G(v) − F G(x)
F G(x) = lim = lim .
G(v)→G(x) G(v) − G(x) v→x G(v) − G(x)
This fact will be used in the proof of the chain rule.

THEOREM 1.11 Chain Rule Let F and G be functions for which F ◦ G is defined. If F and G are
differentiable, then F ◦ G is differentiable and (F ◦ G)′ = (F ′ ◦ G) G′ . That is ,
d ( ) ( )
F G(x) = F ′ G(x) G′ (x)
dx
for each value of x in the domain of G.

Proof. Using the definition of the derivative, we obtain


( ) ( )
d ( ) F G(v) − F G(x)
F G(x) = lim
dx v→x v−x
( ( ) ( ) )
F G(v) − F G(x) G(v) − G(x)
= lim ·
v→x G(v) − G(x) v−x

( ) ′
= F G(x) G (x).
The last step uses the observation made before the theorem.

There is a potential problem with this proof; if G(v) = G(x) infinitely often as v → x, then there is an
unavoidable division by zero in the middle step of the proof. This can only happen at a point for which
G′ (x) = 0 and the graph of G wiggles wildly. As such functions are extremely unusual and rarely occur in
applications, we will not consider a general proof.
The chain rule states that the derivative of a composite function is the derivative of the “outside”
function, with the “inside” function left alone, multiplied by the derivative of the “inside” function. The
following examples illustrate the chain rule.
d ( 4 )5 ( )4 ( )
x − x + 1 = 5 x4 − x + 1 4x3 − 1 ;
dx
d √ 2 1( 2 ( ) 2(x + 1)
2x + 4x + 11 = 2x + 4x + 11)−1/2 4x + 4 = √ ;
dx 2 2x2 + 4x + 11
d 3 d ( 2 )−1 ( )−2 ( ) −3(2x + 5)
2
= 3 x + 5x = −3 x2 + 5x 2x + 5 = .
dx x + 5x dx (x2 + 5x)2
1.10 Chain Rule 23

The only specific derivative formula obtained thus far is for the function xr . For composite functions of this
type, the chain rule can be expressed as

d ( )r ( )r−1 ′
G(x) = r G(x) G (x).
dx

This result is sometimes referred to as the extended power rule. However, the chain rule is quite general
and will appear again when we find the derivatives of trigonometric and exponential functions.
Suppose that the variable y depends on a variable u and that the variable u depends on the variable x.
Then y varies with x and has the form of a composite function; given x, first find u, then use it to find y.
To find the rate of change of y with respect to x, we note that the equation

∆y ∆y ∆u dy dy du
= · becomes = · .
∆x ∆u ∆x dx du dx

The latter equation provides a nice symbolic representation for the chain rule. It has the further appeal in
that it looks like the du terms cancel and this makes the formula easy to remember. One of the drawbacks
of this representation is that the points at which the derivatives should be evaluated are not indicated.

Exercises
1. Find and simplify the derivative of the given function.

a) f (x) = (2x2 − 3x + 1)5 b) g(x) = (x3 + 6x)4 c) h(x) = x4 + 3x2 + 15

d) F (t) = (t3 − 3t2 + t)3 e) G(z) = 3 z 3 + 6z f ) H(s) = (3s2 + 15s − 8)4/3
4 6 5
g) y = h) z = √ i) s =
(3x2 − 4x + 6)3 w2 +6 t3 + 3t2
√ √
2. Find the derivative of the function s defined by s(x) = x2 + x2 + 2x + 3.
9
3. Find an equation for the line tangent to the graph of f (x) = √ when x = 2.
4x + 1

4. Find a point on the curve y = x2 + 4x + 8 for which the tangent line goes through the origin.
5. Suppose that f and g are differentiable functions with f (1) = 1, f ′ (1) = 3, g(1) = −2, and g ′ (1) = −1.
Determine h′ (1) for each function h.
a) h(x) = (g(x))3 b) h(x) = (f (x))3 − 3g(x2 ) c) h(x) = g(f (7 − 6x))

6. Let f (x) = 2x2 − 3x + 1 and let g = f ◦ f ◦ f , that is, g(x) = f (f (f (x))). Find g ′ (2). (Do not find g(x)!)
7. Explain why the reciprocal rule is a special case of the chain rule.
24 Chapter 1 The Derivative

1.11 Product and Quotient Rules

Although the derivative of a sum is the sum of the derivatives, it is NOT true that the derivative of a product
is the product of the derivatives. As an example, note that

d ( 3 4) d ( 7) d ( 3) d ( 4)
x ·x = x = 7x6 and x · x = 3x2 · 4x3 = 12x5
dx dx dx dx

give different values. To motivate the correct formula, consider a rectangle with length ℓ and width w.
Suppose that the length increases by a small amount dℓ and that the width increases by a small amount dw.
Subtracting the original area from the new area yields

..................................................................................................................
. .. . ..
dw .........................ℓ....dw
....................................dℓ dw .... dA = (ℓ + dℓ) (w + dw) − ℓw
...................................................... .. ... ..................................
...
.. ... .. ... ...
..
w .. .. w .. ... w dℓ .......... = ℓ dw + w dℓ + dℓ dw,
.............................................. .......................................................................................
ℓ ℓ dℓ
where dA is the increase in the area. If dℓ and dw are quite small, say 10−8 , their product will be very
small indeed and it follows that dA is essentially ℓ dw + w dℓ. The change in the product is the first times
the change of the second plus the second times the change in the first. The product rule results if the word
‘change’ is replaced by the word ‘derivative’.

THEOREM 1.12 Product and Quotient Rules Let F and G be functions. If F and G are
differentiable on an interval I, then F G and F/G are differentiable on I and
( F )′ GF ′ − F G′
(F G)′ = F G′ + GF ′ and = .
G G2

Of course, the quotient rule is only valid for those values of x for which G(x) ̸= 0.

Proof. Using the definition of the derivative and the motivational idea discussed above, we obtain

F (v)G(v) − F (x)G(x)
(F G)′ (x) = lim
v→x v−x
( ( ))( ( ))
F (x) + F (v) − F (x) G(x) + G(v) − G(x) − F (x)G(x)
= lim
v→x v−x
( )
G(v) − G(x) F (v) − F (x) ( ) G(v) − G(x)
= lim F (x) · + G(x) · + F (v) − F (x) ·
v→x v−x v−x v−x
= F (x)G′ (x) + G(x)F ′ (x) + 0 · G′ (x)

= F (x)G′ (x) + G(x)F ′ (x).

Note the use of the continuity of F in the evaluation of the third limit. The quotient rule follows from
F 1
the product rule and the reciprocal rule by writing the quotient as F · . The details will be left as an
G G
exercise.

In words, the product rule states that the derivative of a product is the first times the derivative of the
second plus the second times the derivative of the first. The quotient rule states that the derivative of a
1.11 Product and Quotient Rules 25

quotient is the denominator times the derivative of the numerator minus the numerator times the derivative
of the denominator all over the denominator squared.
The following examples illustrate the product and quotient rules. As indicated, it is good practice to
get into the habit of simplifying derivatives as much as possible.

If f (x) = (4x + 3)3 (6 − x)4 , then f ′ (x) = (4x + 3)3 · 4(6 − x)3 (−1) + (6 − x)4 · 3(4x + 3)2 4
( )
= 4(4x + 3)2 (6 − x)3 −(4x + 3) + 3(6 − x)

= 4(15 − 7x)(4x + 3)2 (6 − x)3 .

√ 1
If g(x) = x3 6x + 1, then g ′ (x) = x3 · (6x + 1)−1/2 · 6 + (6x + 1)1/2 · 3x2
2
( )
= 3x2 (6x + 1)−1/2 x + (6x + 1)
3x2 (7x + 1)
= √ .
6x + 1
2x + 3 dy (x2 + 4) · 2 − (2x + 3) · 2x 8 − 6x − 2x2 −2(x − 1)(x + 4)
If y = 2
, then = 2 2
= 2 2
= .
x +4 dx (x + 4) (x + 4) (x2 + 4)2
Note that simplification means to combine terms and/or factor. You do not want to expand things that are
already factored unless it is then possible to combine them with other terms. For instance, the denominator
in the derivative of a quotient should NOT be multiplied out.

Exercises
1. Find and simplify the derivative of the given function. Be sure to factor as much as possible.
√ x+3 x
a) f (x) = x x + 2 b) g(x) = c) h(x) = √
2x − 1 2x + 5
√ 6x + 1
d) u(x) = 4x 10 − x2 e) v(x) = (x + 1)3 (2x − 1)4 f ) w(x) =
x2 + 2x + 4
( )4
x2 t−1 √
g) y = √ h) s = i) z = 3
4 + x2 (2 − x)5
4 + x2 t+1

2. Determine all the values of x for which the tangent line to the graph of y = (x − 2)3 (3x + 1)4 is horizontal.
3. Find an equation for the line tangent to the graph y = x2 /(3x + 4) when x = 4.
4. The position s in meters of a particle at time t seconds is given by s = 10t/(2t + 1). When is the velocity of the
particle 0.1 meters per second?
5. Consider the function f defined by f (x) = x/(x + 1).
a) Find equations for all of the tangent lines of f that are parallel to the line x − 4y = 3.
b) Find all of the points on the graph of f for which the tangent line goes through the point (0, 4).
6. Suppose that f and g are differentiable functions with f (2) = 1, f ′ (2) = −3, g(2) = 4, and g ′ (2) = 5. Determine
h′ (2) for each function h.
a) h = f g b) h = f /g c) h = f 2 g 3

7. Finish the proof of the quotient rule. Use the suggestion mentioned at the end of the proof of Theorem 1.12.
8. Suppose that F , G, and H are differentiable functions on an interval I. Find a derivative formula for (F GH)′ .
Can you extend this to the product of four or more functions?
26 Chapter 1 The Derivative

1.12 Maximum and Minimum Outputs

Let f be a continuous function defined on a closed and bounded interval [a, b]. After sketching many possible
graphs for y = f (x), including some wiggly and jerky options, the following two observations seem to be
true.

i. The graph always has a high point and a low point.

ii. The high point and low point occur either at the endpoints of the interval, at points for which
f ′ = 0, or at points for which f ′ does not exist.

Both of these observations are indeed true. We begin with a mathematical definition of the maximum and
minimum outputs of a function.

DEFINITION 1.13 Let f be a function defined on an interval I. The function f has a maximum
output at d ∈ I if f (x) ≤ f (d) for all x ∈ I, and f has a minimum output at c ∈ I if f (x) ≥ f (c) for all
x ∈ I.

For the most part, this is a simple concept. Consider the function f defined by f (x) = 1 − x2 on the
interval [−1, 2]. A quick sketch of the graph (see below) reveals that the maximum output of f is 1 and
occurs when x = 0, while the minimum output of f is −3 and occurs when x = 2.
y .
y.
...... ......
.. .. .. ..
.

2. . 4. . ..
...
....1............... y = 1 − x ..
2
3 y = x2
.. ... ...
... ...
•. .....
........... x ..
1.... 2
2
−2 −1
... .. .
−1 ... 1 ◦..
...
−2
... .....
x
... 1 2
.............

...
−3 •.

However, there are some subtleties going on here as well. For instance, consider the function g defined by
g(x) = x2 on the interval (1, 2). This function has neither a maximum nor a minimum output on the given
interval since the endpoints of the interval are not included in the domain. It is true that 4 is larger than
all of the outputs of g, but 4 is not an output of g for x in the interval (1, 2).
The maximum and minimum outputs of a function are known as the extreme outputs (or extreme
values) of a function. As the second example illustrates, a function may not have extreme outputs on a given
interval. The following theorem gives conditions that guarantee that extreme outputs exist.

THEOREM 1.14 Extreme Value Theorem If f is continuous on a closed interval [a, b], then f has
both a maximum output and a minimum output on [a, b]. In other words, there exist numbers c, d ∈ [a, b]
such that f (c) ≤ f (x) ≤ f (d) for all x ∈ [a, b].
1.12 Maximum and Minimum Outputs 27

The proof of this theorem will be omitted as it involves some properties of real numbers that we have
not discussed. Our goal is to find the extreme outputs of a function once we know that they exist. This is
where the second observation made earlier comes into play.

THEOREM 1.15 Let f be a function defined on an open interval I. If f has an extreme output at a
point c ∈ I, then either f is not differentiable at c or f ′ (c) = 0.

Proof. Suppose that f has a minimum output when the input is c. This means that f (v) − f (c) ≥ 0 for
all v ∈ I. It follows that
{
f (v) − f (c) is less than or equal to 0 if v ∈ I and v < c;
v−c is greater than or equal to 0 if v ∈ I and v > c.

f (v) − f (c)
If f happens to be differentiable at c, then f ′ (c) = lim = 0.
v→c v−c
As a consequence of this theorem, it is a reasonably easy task to find the maximum and minimum
outputs of a continuous function f defined on a closed interval [a, b]. The first step is to find the critical
inputs of f ; these are the inputs x for which either f ′ (x) = 0 or f ′ (x) does not exist. The second step is to
evaluate f at the critical inputs that belong to [a, b] and at the endpoints of the interval. The largest of the
outputs is the maximum output and the smallest is the minimum output.

Problem: Find the maximum and minimum outputs of f (x) = 6x − x3 on the interval [1, 3].

Solution: Since f is a polynomial, it is continuous and differentiable for all real numbers. To find the

critical inputs of f , we first find its derivative: f ′ (x) = 6 − 3x2 . The function f ′ is zero when x = ± 2.

These are the critical inputs of f , but only 2 is in the given interval. Evaluating f at the endpoints and
(√ ) √
the appropriate critical input yields f (1) = 5, f 2 = 4 2 ≈ 5.657, f (3) = −9. The maximum output
√ √
of f on [1, 3] is thus 4 2, which occurs when x = 2, and the minimum output of f on [1, 3] is −9, which
occurs when x = 3.

Exercises
1. Find the maximum and minimum outputs of the function on the given interval.
a) f (x) = 16 + 4x − x2 , [−1, 4] b) f (x) = 2x3 − 3x2 − 12x + 40, [−2, 3]
3 x 1
c) f (x) = x − 2x + 8, [0, 4]
4 2
d) f (x) = 4x + , [0.4, 2] e) f (x) = + , [1, 4]
3x − 1 2 x
2. Sketch the graph of a function with the given properties.
a) A function f that is continuous on (2, 5) but has no maximum output.
b) A function g that is not continuous on [−1, 3] but has both a maximum and a minimum output.
c) A function h that is continuous on (1, ∞) and has both a maximum and a minimum output.
3. The sum of two nonnegative numbers is 10.
a) Find such numbers so that the product of one with the cube of the other is as large as possible.
b) Find such numbers so that the sum of one and the cube of the other is as small as possible.
4. Find the minimum distance from a point on the parabola y = x2 to the point (0, 2).
28 Chapter 1 The Derivative

1.13 Increasing and Decreasing Functions

The slope of a line gives the rate of change of y with respect to x. If the line has positive slope, then y
increases as x increases; the graph goes up as you move to the right. Similarly, if the line has negative slope,
then y decreases as x increases; the graph goes down as you move to the right.
y .
y.
..... .....
.. .. .. .. ..

...
...... .......
... .......
...
..... This line has
.......
....... This line has
...
... positive slope. ....... negative slope.
...
... .......
....... .....
... ....... ........... x
....
......
.......... x
...
. .....
....
.....
Putting the “going up” and “moving right” ideas into algebra yields the following definition.

DEFINITION 1.16 Let f be a function defined on an interval I.

a) The function f is increasing on I if u < v implies f (u) ≤ f (v) for all points u and v in I.

b) The function f is decreasing on I if u < v implies f (u) ≥ f (v) for all points u and v in I.

The function f , whose graph is sketched below, is increasing on the interval (−∞, a], decreasing on the
interval [a, b], and increasing on the interval [b, ∞).
y
......
... .. .
........... ..
.. .. ........ ...
.. .... .. y = f (x)
....
... ..... ....
.. ..........
... ..........
..... x
. a b

Determining where a function is increasing or decreasing from a formula is more difficult than reading its
graph. Of course, it is possible to first sketch the graph, then try to read off the values where the curve
changes direction. This method works well in some cases, but it does have some disadvantages. First of
all, you need to be certain you can see all of the interesting features of the graph on the same screen. If
not, you will miss some important information. Secondly, there may be situations in which exact values for
the turning points are needed, not just numerical approximations. Finally, if the formula for the function
involves parameters rather than numbers (x3 + ax rather than x3 + 4x), then the graphing method breaks
down. Calculus provides a way to avoid all of these potential difficulties.
Recall that the derivative of a function f gives the slope of the graph y = f (x). If f ′ is positive, then
the tangent lines of f have positive slope. Since lines with positive slopes are increasing, it seems reasonable
to expect that a function with a positive derivative is increasing. Similarly, we expect a function with a
negative derivative to be decreasing. This result is included in the following theorem.

THEOREM 1.17 Suppose that f is differentiable on an interval I.

a) The function f ′ is nonnegative (≥ 0) on I if and only if f is increasing on I.


1.13 Increasing and Decreasing Functions 29

b) The function f ′ is nonpositive (≤ 0) on I if and only if f is decreasing on I.

Although this theorem should have a strong intuitive appeal, it is not a trivial matter to prove it; the
details will be given in Section 1.26. The advantage of this property of derivatives is that it reduces the
problem of determining where a differentiable function f is increasing or decreasing to the problem of finding
the intervals on which its derivative f ′ is positive or negative. The function f ′ is negative when its graph is
below the x-axis and positive when its graph is above the x-axis. There are only two ways for the graph of a
function f ′ to switch sides of the x-axis at a point c: either the graph of f ′ crosses the x-axis (f ′ (c) = 0) or
the graph of f ′ jumps across the x-axis (f ′ has a discontinuity at c). In between such points, the function
f ′ must have the same sign. These ideas will be used in the following example.

Problem: Determine the intervals on which the function f defined by f (x) = x2 /(x + 2) is increasing and
those on which it is decreasing.
x(x + 4)
Solution: As the reader should verify, f ′ (x) = . The graph of f ′ can only cross the x-axis at the
(x + 2)2
points −4, −2, and 0. In between these points, the sign of f ′ stays the same. For example, between −2 and
0, the term x is negative, the term x + 4 is positive, and the term (x + 2)2 is positive. It follows that f ′ is
negative on the interval (−2, 0). Using similar reasoning, we obtain the following information:
interval (−∞, −4) (−4, −2) (−2, 0) (0, ∞)
sign of f ′ positive negative negative positive
property of f increasing decreasing decreasing increasing
Hence, the function f is increasing on the intervals (−∞, −4] and [0, ∞), and decreasing on the intervals
[−4, −2) and (−2, 0]. Pay careful attention to whether or not the endpoints are included.

Exercises
1. Determine the intervals on which f is increasing and those on which it is decreasing given f ′ (x) = x(x−1)2 (x−3)3 .
2. Determine the intervals on which f is increasing and those on which it is decreasing. Treat a as a positive
constant.

a) f (x) = x3 − 6x2 b) f (x) = 3x4 + 8x3 + 24x2 c) f (x) = x − 4 x
d) f (x) = 6x5 − 110x3 + 300x e) f (x) = 2x + 1/x f ) f (x) = x − 3x2/3

g) f (x) = 12x − 2x2 h) f (x) = (x2 − x + 4)/(x − 1) i) f (x) = 4x3 + 21x2 − 24x − 3
j) f (x) = x3 − a2 x k) f (x) = x + a2 /x l) f (x) = x2 + a3 /x

3. Suppose that f is increasing on I. Prove that −f is decreasing on I.


4. Suppose that f and g are increasing on I. Prove that f + g is increasing on I.
5. Give an example to show that the product of two increasing functions on R may not be an increasing function.
30 Chapter 1 The Derivative

1.14 The First Derivative Test

Consider the graph of the differentiable function f shown below. It is clear that f is increasing on the
intervals (−∞, a] and [b, c] and decreasing on the intervals [a, b] and [c, ∞). The maximum output of the
function occurs when x = a. y..
.......
....... . ..
.. .. .........
.. .... y = f (x)
....
... ..... ..
...............
.. f (b) .................... ...
... ...
.. ...
.....
.. .......... x

.. a b c
.
Although the graph changes direction when x = b and x = c, the function f does not have a maximum or
minimum output at these points. However, there is some sort of extreme behavior at the points b and c. For
instance, on the interval [0, c] the minimum output of the function f is f (b). Extreme outputs such as these
are known as relative extreme outputs. The function f has a relative minimum output at b and a relative
maximum output at c. These terms are defined as follows.

DEFINITION 1.18 Let f be defined on an interval I and let c be a point in I. Then

a) f has a relative maximum output at c if f (x) ≤ f (c) for all x in some open interval containing c.

b) f has a relative minimum output at c if f (x) ≥ f (c) for all x in some open interval containing c.

A brief study of Theorem 1.15 shows that if a continuous function f has a relative extreme output at a
point c, then c is a critical input of f . That is, either f ′ (c) does not exist or f ′ (c) = 0. However, a function
may not have a relative extreme output at a critical input. For example, the function f (x) = x3 has a
critical input when x = 0, but the function does not have a relative extreme output at 0. The following
theorem provides a way to test critical inputs to determine whether or not they correspond to relative
extreme outputs.

THEOREM 1.19 First Derivative Test Suppose that f is continuous on an open interval containing
the point c and that c is a critical input for f .

a) If f ′ is positive on an interval (a, c) for some a < c and negative on an interval (c, b) for some b > c,
then f has a relative maximum output at c.

b) If f ′ is negative on an interval (a, c) for some a < c and positive on an interval (c, b) for some b > c,
then f has a relative minimum output at c.

Proof. The proof of this theorem is quite easy. Given the conditions listed in part (a), the function f is
increasing on (a, c) and decreasing on (c, b). It follows that f (c) ≥ f (x) for all x ∈ (a, b). Hence, the function
f has a relative maximum output at c. The proof of part (b) is similar.

Problem: Determine the nature of all the critical inputs for f (x) = 3x4 − 20x3 − 18x2 + 180x + 245.
1.14 The First Derivative Test 31

Solution: As usual, the first step is to solve the equation f ′ (x) = 0. (Since f is a polynomial, the derivative
will exist at every point.) Since

f ′ (x) = 12x3 − 60x2 − 36x + 180 = 12(x3 − 5x2 − 3x + 15)


( )
= 12 x2 (x − 5) − 3(x − 5) = 12(x2 − 3)(x − 5),
√ √
the critical inputs of f are − 3, 3, and 5. Proceeding as in the last section, we look at the intervals on
which f ′ has constant sign.
√ √ √ √
interval (−∞, − 3) (− 3, 3) ( 3, 5) (5, ∞)
sign of f ′ negative positive negative positive
property of f decreasing increasing decreasing
increasing

By the first derivative test, the function f has a relative maximum output when x = 3 and relative

minimum outputs when x = − 3 and x = 5. Since f (x) becomes arbitrarily large as x gets large in either
the negative or positive direction, it is clear that f has no maximum output on its domain (−∞, ∞). Noting
√ √
that f (− 3) = 218 − 120 3 and f (5) = 70, with a little more thought, we find that the minimum output

of f on (−∞, ∞) is approximately 10.154 and occurs when x = − 3.

Problem: Find the minimum distance from a point on the curve y = 2/x2 to the origin.

Solution: For each x > 0 (by symmetry, it is sufficient to consider this case only), the square S(x) of
4
the distance from the point (x, 2/x2 ) to the origin is given by S(x) = x2 + 4 . As the reader may verify,
√ x
S ′ (x) = 0 when x = 2 and the first derivative test indicates that S has a minimum at this value. Since
√ √
S( 2) = 3, the minimum distance from a point on the curve y = 2/x2 to the origin is 3. For the record,

this minimum distance occurs at the points (± 2, 1). (Note that minimizing the square of the distance is
equivalent to minimizing the distance.)

Exercises
1. Determine the nature of all the critical inputs of the given function. Also check to see if the function has a
maximum or minimum output on its domain. Treat a as a positive constant.

a) f (x) = x3 − 6x2 b) f (x) = 3x4 + 8x3 + 24x2 c) f (x) = x − 4 x
d) f (x) = 6x5 − 110x3 + 300x e) f (x) = 2x + 6/x f ) f (x) = 3x2/3 − x
g) f (x) = x + a2 /x h) f (x) = x2 + a4 /x2 i) f (x) = x/(x2 + a2 )

2. The product of two positive numbers is 10. Find the minimum value for their sum.
3. Suppose that x and y are two positive numbers for which xy 2 = 10. Find the minimum value of x + y.

4. Find the minimum distance from a point on the curve y = 4/ x to the origin.
5. Find a cubic polynomial that has a relative minimum output when x = −2 and a relative maximum output
when x = 5.
6. Prove part (b) of Theorem 1.19.
32 Chapter 1 The Derivative

1.15 Optimization Problems

Many of the applications that involve mathematics begin with a verbal description. After developing an
understanding of the problem, the next step is to translate the problem into the language of mathematics. It
is then possible to use the tools of mathematics to solve the problem. Finally, the solution must be translated
back into the original language of the problem.
One category of problems of this type are optimization problems; the goal is to maximize or minimize
some quantity under certain restrictions. Since the derivative can be used to identify inputs that generate
possible extreme outputs, calculus can be used to solve a number of these problems. The following problem
and its solution illustrate the main ideas behind solving these types of problems.

Problem: Suppose that it is necessary to construct a rectangular box with a square base and a volume of
twenty cubic feet. The material for the sides costs four dollars per square foot, the material for the base
costs eight dollars per square foot, and the material for the top costs two dollars per square foot. Find the
minimum cost of constructing the box.

Solution: The finished product will look something like this:

.................................................................................
.. ... .. ..
.. ..
.. .... .. .
.. ..
.. ... ..
..
...
.. ... .
.. ...................................................................................
..
..

h ... .. ..
..
...
.. .. ..
.. ....
.. .. ..
.. ..
.. .
.. ..
.. ...
.. .. ..
.. ...
.. .. ..
.
...... ... ... ... ... ..... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ........ ....
..
..

... .. .
.. ...
... .. ..
.. ...
... .. ..
.. ..
x .. . .. .
..
... .. .. . .
.. .
..... .. .
.. . .
.........................................................................
x

Since the base of the box is expensive, we do not want the base to be too large. However, a small base forces
the sides to be rather large in order to maintain a volume of twenty cubic feet and the cost of construction
increases again. There must be some dimensions in between these extremes that will minimize the cost. In
general, a rectangular box has three dimensions, but in this case there are only two variables since the base
is a square. Let
x be the length and width of the box, in feet;

h be the height of the box, in feet;

V be the volume of the box, in cubic feet;

C be the cost of construction for the box, in dollars.


The area of the base of the box is x2 , the area of the top of the box is also x2 , and each of the four sides of
the box has an area of xh. It follows that V = x2 h and C = 2x2 + 8x2 + 4(4xh) = 10x2 + 16xh. We want
1.15 Optimization Problems 33

to minimize the cost C, but it depends on both x and h. However, given a value for x, it is necessary to
choose h so that V = 20. Hence, the height h is a function of the length x. It follows that C only depends
on a choice of x. Given a value for x, we find that

20 20 320
h= and C = 10x2 + 16x · = 10x2 + .
x2 x2 x

Note that x may assume any positive value, that inputs of x near 0 generate large outputs for C, and that
large inputs of x also generate large outputs for C. We want to find the value of x that will minimize C.
Hence, we must solve the following mathematical problem:

320
Find the minimum output of the function C(x) = 10x2 + on the interval (0, ∞).
x

To solve this problem, first find the values of x that satisfy the equation C ′ (x) = 0:

320 √ √
C ′ (x) = 20x − C ′ (x) = 0
3 3
and implies x= 16 = 2 2.
x2

The analysis of the problem assures us that this value of x will generate the minimum cost, but the first
derivative test can also be used. Since
√ √
C ′ (x) < 0 for 0 < x < 2 2 and C ′ (x) > 0 for 2 2 < x < ∞,
3 3


the function C has a minimum value when x = 2 3 2. To find the minimum cost of the box and the height
√ √
of the box, substitute this value of x into the equations for C and h to obtain C = 120 3 4 and h = 2.5 3 2.
(Some effort is required to find these exact values.) Thus, the minimum cost of the box is about $190.49
when the dimensions of the box are roughly 2.52 feet by 2.52 feet by 3.15 feet.

It should be pointed out that “real world” applications like this are always somewhat contrived. The
construction of the box will require nails or screws (which cost a little), the sides must overlap some so
that they can be fastened together, the box may need to fit in a certain corner of the basement, etc. In
addition, plywood and other building materials are typically sold in fixed sizes, and small variations in the
dimensions of the box will have little effect on the cost of the box. It is important to keep ideas such as
these in mind and not turn off your common sense. However, the problems in this section are to be solved
in the ideal mathematical world; a more careful analysis of the problem, which would be more complicated,
will be deferred for now.
There are no definitive methods for solving optimization problems; practice and experience are the best
teachers. The following list offers some suggestions that may be helpful.
1) Read the problem several times and make sure you understand what it says. (In other words, don’t just
glance at the problem and decide it is too hard or too confusing.) Draw a picture of the situation if that
is appropriate; actually drawing several pictures representing different possibilities is a good idea. Be
certain you understand which quantities vary and how they impact the problem.
2) Assign variables to unknown quantities and write down equations that link these variables. Finding
mathematical connections between the variables may be the most difficult part of the problem as these
34 Chapter 1 The Derivative

may not be immediately evident from the wording of the problem. Some common equations arise from
known geometry formulas for area and volume, similar triangles, and the Pythagorean Theorem.
3) Express the “optimizing function” in terms of one variable and find the appropriate domain for this
variable. The domain typically follows from the physical nature of the problem.
4) Solve the problem using calculus or perhaps some other mathematical technique such as completing the
square. Be certain to verify that you obtain a maximum or a minimum as desired; you can use the closed
interval method if appropriate or the first derivative test.
5) Be sure to answer the question as stated in the problem. This may involve translating your mathematics
back into the language of the problem.
It is important to not give up on a problem too soon. Have confidence that you can find a solution and
keep thinking about the problem until some useful insight occurs. Observing someone else solve one of these
problems may make them look easy; once you see the key idea, the problem almost solves itself. The most
crucial aspect of these problems is learning to find that key idea on your own. We conclude this section with
one more example.

Problem: The legs of a right triangle have lengths a and b. Find the area of the largest rectangle that can
be inscribed in this triangle assuming that one side of the rectangle is parallel to one of the legs.

Solution: Let x and y be the width and height, respectively, of a rectangle that satisfies the conditions of
the problem (see the figure).

.......... .......... ..........


... ........ ... ........ ... ........
.. ....... .. ..... .. .....
a ... .......
....... a ............................................................... a ...............................................................
... ...... ... x y ............... ... .......
y ....... ..........
... ... ... ...
......................................................... ............................................................. .........................................................
..

b b x b−x

We want to maximize the area A = xy of the rectangle. Using similar triangles, the relationship between x
and y becomes apparent (see the far right triangle in the figure):

y b−x a
= or y = (b − x).
a b b

The problem can now be stated as follows:

a
Find the maximum value of A(x) = (bx − x2 ) on the interval [0, b].
b

It is easy to verify that the maximum value of A is A(b/2) = ab/4. Hence, the maximum area of the inscribed
rectangle is ab/4. Note that the area of this largest rectangle is exactly half the area of the original triangle.
The interested reader can show that the same result occurs if one side of the rectangle is parallel to the
hypotenuse.

Exercises
1. A rectangular garden with area 400 m2 is to be surrounded on three sides by a wall costing $60/m and on the
fourth side by a fence costing $40/m. Find the dimensions for the garden that will minimize the cost.
1.15 Optimization Problems 35

2. Suppose you want to use 1000 feet of fencing to enclose a rectangular area that is further subdivided into three
..................................
rectangles by two fences running parallel to one of the sides of the bordering rectangle (like this ....... ...... ...... ...... ). Find
..............................
the largest total area that you can enclose.
3. A rectangular field with an area of 10,000 square feet is to be fenced off. In addition to going all around the
rectangular field, the fence also cuts off an isosceles triangle inside one corner of the field. This part of the fence
....................................
starts at the midpoint of one of the shorter sides of the rectangle (like this ....... ......
.. ). Find the dimensions of the
..............................
field that minimize the amount of fencing required.
4. The area of the print on a book page is 45 square inches. The margins are one inch on the sides and bottom
and one-half inch at the top. Find the dimensions of a page of this book if the only object is to use the minimal
amount of paper. Also, find the minimum area of a page.
5. An open top rectangular box with a square base is to be constructed having a volume of 864 cm3 . Find the
minimum surface area for the box.
6. You must construct a rectangular box with a square base. The material for the sides costs four dollars per
square foot, the material for the base costs nine dollars per square foot, and the material for the top costs five
dollars per square foot.
a) Find the volume of the largest box you can build for three hundred dollars.
b) Find the minimum cost of constructing a box with a volume of twenty-four cubic feet.
7. A can (with a top and a bottom) in the shape of a right circular cylinder is to be constructed with a volume of
300π cubic centimeters. Find the dimensions of the can that will minimize the surface area of the can. Also,
determine the ratio of height to radius for this optimal can.
8. Find the maximum possible area of a rectangle with base that lies on the x-axis and with two upper vertices on
the graph of the equation y = 4 − x2 .
9. Find the area of the largest rectangle that can be inscribed in an equilateral triangle of side length s. Assume
that one side of the rectangle is parallel to a side of the triangle.
10. Find the area of the largest isosceles triangle that can be inscribed in a circle of radius r. Find the lengths of
all three sides of this optimal triangle. What fraction of the circle is occupied by the optimal triangle?
11. Find the volume of the largest right circular cylinder that can be inscribed in a sphere of radius r. Find the ratio
of height to radius for this optimal cylinder. What fraction of the sphere is occupied by the optimal cylinder?
12. Find the volume of the largest right circular cone that can be inscribed in a sphere of radius r. Find the ratio
of height to radius for this optimal cone. What fraction of the sphere is occupied by the optimal cone?
13. Two vertical poles stand twenty feet apart. One is ten feet tall and the other is eight feet tall. Find the length
of the shortest wire that can reach from the top of one pole to a point on the ground between the poles and
then to the top of the other pole.
14. An island in a lake is located 600 yards opposite one end of a portion of the straight shoreline that is 3 miles
long. Suppose you can swim at a rate of 3.2 miles per hour and can run at a rate of 9.5 miles per hour. Find the
least amount of time required to get to the other end of the shoreline from the island. (Ignore transition time.)
15. Let a and b be positive constants. Find an equation for the line that passes through the point (a, b) and cuts
off the least area in the first quadrant. (The line, which must have negative slope, becomes the hypotenuse of a
right triangle with legs on the coordinate axes.) Find the area of this smallest triangle as well as the intercepts
of the line.
16. Let a and d be positive numbers. Suppose that two light sources are separated by a distance d and that one
source is a times as bright as the other. Find the point on the straight line between the light sources at which
there is the least amount of light. Use the assumption that the intensity of the light at a point is proportional
to the reciprocal of the square of the distance from the light source.
36 Chapter 1 The Derivative

1.16 Trigonometric Functions

We assume that the reader is familiar with angles and angle measurement, both in degrees and radians. As
a quick reminder, if x is a number between 0 and 2π, then the angle x radians is the angle cut off in a circle
of radius r by an arc of length xr (see the figure).
.................................
.. ...... . .... xr 360◦ = 2π radians
....
...
... ...
.
..
..
... π
.. 1◦ =
.

...
..
.
.. x ...
............................................... 180
radians
... r .. ◦
. 180
... ... = 1 radian
..... .. π
......... .........
............
If x > 2π, then the angle is determined by “taking laps” in a counterclockwise direction. If x < 0, then the
angle is determined by going in a clockwise direction.
The word “trigonometry” refers to the measurement of triangles. For acute angles, the trigonometric
functions can be defined using the sides of a right triangle as in the figure below.
.......... opposite hypotenuse
... ....... sin θ = csc θ =
... ........ hypotenuse opposite
... ....... hypotenuse adjacent hypotenuse
... ....... cos θ = sec θ =
opposite
.. ....... hypotenuse adjacent
... .......
... ...... opposite adjacent
........................................................................θ........................ tan θ =
adjacent
cot θ =
opposite
adjacent

However, in calculus, the trigonometric functions need to be defined for all real numbers. Given a real
number θ, interpret θ as the radian measure of an angle with vertex at the origin and initial side the positive
x-axis. The terminal side of this angle intersects the unit circle in a unique point. The x-coordinate of this
point is defined to be cos θ and the y-coordinate is defined to be sin θ. The other trigonometric functions
are then defined in terms of sin θ and cos θ.

y
.......
.......

1.2
......................•...............
(cos θ, sin θ)
...
.... .
...
. ...
... tan θ =
sin θ
sec θ =
1
... ...
..
... θ ... .............. cos θ cos θ
..........................................
.
... .... x
... 1 ..1.2 . cos θ 1
.... ... cot θ = csc θ =
...... sin θ
.......................... x2 + y2 = 1
sin θ

A number of relationships are clear from the definitions of the trigonometric functions. These include the
fact that the trigonometric functions repeat every 2π units as well as the following identities:
sin2 θ + cos2 θ = 1 sin(−θ) = − sin θ sin(π − θ) = sin θ sin(π + θ) = − sin θ
2
tan θ + 1 = sec θ 2
cos(−θ) = cos θ cos(π − θ) = − cos θ cos(π + θ) = − cos θ
1 + cot2 θ = csc2 θ tan(−θ) = − tan θ tan(π − θ) = − tan θ tan(π + θ) = tan θ

Another set of useful identities that follow from the symmetry of the circle are
( ) ( ) ( )
π π π
cos θ = sin −θ ; cot θ = tan −θ ; csc θ = sec −θ .
2 2 2
1.16 Trigonometric Functions 37

The prefix “co” in front of three of the trigonometric functions refers to the complement of an angle; for
instance, the cosine of x is the sine of the complement of x.
In calculus, the angle will most often be denoted by x, where it is assumed that x is in radians. The
graphs of the functions sin x, cos x, tan x, and cot x are given below; graphs for sec x and csc x will be left for
the reader. Since the graphs are periodic (that is, they repeat every 2π units) only a portion of each graph
is given.
y y
..... .....
....... .......

y = sin x y = cos x
1
...
............................. ....1..................
..... .
...........
.......
... ..... ..... .
..
...... .....
....
. ...........
.....
.
.... .....

...... π ........
x x
. π....... . ..... ...........
.. . ....... ..... 2π 2π
−1
..................... −1
........ ........
.........

y . .
y
..
......
... ..
..
.....
.. .. ... .. .
..
..
... .. .... .. .... ... .. .
.. .
. ..
..
... ...... ... ...... .. .
..
.
..3
...
3 ... ... .
. ... y = cot x ......
..
... .. .
..
.. .. .... y = tan x ... ...... .. .
.. .
... ...... ... ..
2 2
.. .
..
.. ......
..
..
.. ... ... ..

1 .
..
.... ...
.. .. 1 ...
..
.. ... ..
..

.. ......
.. .
... ...... ....
..
. ..... ..
..
.. ..
.. ..... ...
..... ..

.... ..
.. . .

....
.. ... ... ............. ...
.. .. x ..... π... ..... 2π...... ...............x
−π/2 .....
.. ..

..−1
..
..π/2
..
..
..
..
.. 3π/2 ..
.. .... .... .... ....
.. ..
..
..
..
.. ..
.. −1 ... .... ... ....
.. ..
..
..
.. .
.. . ..
.. ... .... ... ....
.. −2
..
.. .. .
.. .
..
..
−2 ... .... ... ....
..
..
.. .. .
.. .
..
.. ... .... ... ....
.. −3
.. .. .
.. .
..
.. ... .... ... ....
..
.. .. .
.. −3 ... .... .. ....
. ..
. .
.. .
..
.. . .

The exact values of the trigonometric functions can be determined easily for some angles. These values are
recorded in the following table and should be used when they appear in problems.
θ 0 π/6 π/4 π/3 π/2
√ √
sin θ 0 1/2 2/2 3/2 1
√ √
cos θ 1 3/2 2/2 1/2 0
√ √
tan θ 0 3/3 1 3 ∗

There are many other identities satisfied by the trigonometric functions. Some of these are listed below.
There is no real need to memorize all of these formulas, but it is important to know that such formulas exist
and to be able to use them when necessary.

1 − cos 2x
sin(x + y) = sin x cos y + sin y cos x sin 2x = 2 sin x cos x sin2 x =
2
cos(x + y) = cos x cos y − sin x sin y cos 2x = cos2 x − sin2 x 2
cos x =
1 + cos 2x
tan x + tan y 2 tan x 2
tan(x + y) =
1 − tan x tan y
tan 2x = 1 − cos 2x
1 − tan2 x 2
tan x =
1 + cos 2x
38 Chapter 1 The Derivative

For triangles that do not have a right angle, the following relationships between the sides and angles of
a triangle are sometimes useful.
.....
...α.....
. .
.. .....
. law of cosines law of sines
b .. ... c
... ... c2 = a2 + b2 − 2ab cos γ
sin α
=
sin β
=
sin γ
... a b c
.. ...
.. γ .
................................β.........
a
The proofs of these two properties of triangles are not difficult and will be requested in the exercises.
Finally, we give definitions for the inverse trigonometric functions. The number arcsin(1/2) rep-
resents the angle or arc (in radians) for which the value of the sine function is 1/2. Since there are many
angles for which this is true, we need to limit the range of potential answers in order to define a function.
One way to proceed is the following.

1. For each real number x ∈ [−1, 1], arcsin x is the unique real number taken from the interval
[−π/2, π/2] that satisfies sin(arcsin x) = x.
π
2. For each real number x ∈ [−1, 1], arccos x = − arcsin x.
( 2x )
3. For each real number x, arctan x = arcsin √ .
x2 + 1
π
4. For each real number x, arccot x = − arctan x.
2
5. For each real number x that satisfies |x| ≥ 1, arccsc x = arcsin(1/x).
π
6. For each real number x that satisfies |x| ≥ 1, arcsec x = − arccsc x.
2

Problem: Find the exact value of all the trigonometric functions when the angle is 2π/3 radians.

Solution: The symmetry of the unit circle makes it possible to reduce problems such as this to a problem
for which the angle lies in the first quadrant. Using identities and values listed in this section, we find that
( 2π ) ( 2π ) ( π ) √3 ( 2π ) ( 2π ) (π) 1
sin = sin π − = sin = and cos = − cos π − = − cos =− .
3 3 3 2 3 3 3 2
Once the sine and cosine are known, the other trigonometric functions are easily found:
( 2π ) √ ( 2π ) 1 ( 2π ) ( 2π ) 2
tan = − 3, cot = − √ , sec = −2, and csc =√ .
3 3 3 3 3 3
Problem: Find all of the values of x in the interval [−π, 2π] that satisfy 1 + 5 cos(3x) = 0.

Solution: If we let θ = 3x, then the given problem is equivalent to finding all of the solutions to cos θ = −0.2
that are in the interval [−3π, 6π]. (Be certain that you understand this first step.) There are several ways to
proceed; here is one possibility. Let z = arccos 0.2 and note that z ∈ (0, π/2). The desired values of θ that lie
in the interval [0, 2π] are then π − z and π + z. Since the trigonometric functions repeat every 2π units, we
can just add or subtract multiples of 2π to each of these values to obtain further solutions to cos θ = −0.2.
We limit ourselves to values of θ that belong to the interval [−3π, 6π]:

3x = θ = −π + z, π − z, π + z, 3π − z, 3π + z, 5π − z, 5π + z.
1.16 Trigonometric Functions 39

Since z ≈ 1.3694, the solutions to 1 + 5 cos(3x) = 0 that lie in the interval [−π, 2π] are

x ≈ −0.591, 0.591, 1.504, 2.685, 3.598, 4.780, 5.692.

Problem: Simplify the expression sec(arctan 2x).

Solution: Let θ = arctan(2x). Then tan θ = 2x. This statement can be represented with a right triangle.

......... 2x
... ........ √ tan θ = 2x =
1
.. ....... 1 + 4x2 √
2x ... .......
....... sec θ = 1 + 4x2
... ... ( )
..........
...................................θ.................. θ ∈ (−π/2, π/2) ⇒ sec θ > 0
1

Hence, the expression sec(arctan 2x) simplifies to 1 + 4x2 .

Exercises
1. Find the exact value of all the trigonometric functions when the angle is 7π/6 radians.
2. Find the exact value of all the trigonometric functions given that sin x = 2/3 and 0 < x < π/2.
3. Find the exact value of all the trigonometric functions given that tan x = −4 and π/2 < x < π.
4. The hypotenuse of a right triangle is 12 and one of its angles is 38◦ . Find the lengths of the other two sides.
5. Find all of the values of x in the interval [−2π, 4π] that satisfy cos x = 1/2.
6. Find all of the values of x in the interval [0, 4π] that satisfy sin x = 0.7.
7. Find all of the values of x in the interval [0, 2π] that satisfy 5 − 7 sin(2x) = 0.
8. Find all of the values of x in the interval [0, 3π] that satisfy 2 + cos(2x) = 5 sin x.
9. Find three solutions to the equation 1 + tan x = 0.
10. Prove the law of sines. Hint: Compute the area of the triangle three different ways, using each side once as the
base.
11. Prove the law of cosines. Hint: Referring to the figure in the text, drop a perpendicular from the top vertex to
the side a, then use the Pythagorean Theorem on the right triangle having c as its hypotenuse.
12. Suppose a chord in a circle of radius 8 has length 12. Find the length of the arc cut off by the chord.
13. The sides of a triangle are 6, 15, and 16. Find the angles (to the nearest tenth of a degree) of the triangle.
14. Without a calculator, sketch a graph of each function.
a) f (x) = 1 + 2 sin x b) g(x) = 5 cos(2x) c) h(x) = −2 tan(x/4)
15. Sketch the graphs of sec x and csc x.
16. List the ranges of the six inverse trigonometric functions.
17. Without a calculator,
√ find the exact value of each of the following.
a) arcsin(1/ √2 ) b) arcsin(−1/2)
√ c) arccos(−1/2)

d) arctan(1/ 3) e) arcsec(− 2) f ) arccsc(−2/ 3)
18. Simplify each of the following expressions. Indicate the values of x for which each is defined.
a) tan(arcsin x) b) sin(arctan x) c) cos(2 arcsin x)
40 Chapter 1 The Derivative

1.17 Derivatives of Trigonometric Functions

To find the derivative of the sine function, it is necessary to return to the definition of the derivative and
determine some way of computing the limit of the difference quotient. In this case, some properties of the
trigonometric functions and a few trigonometric identities provide the relevant information. The definition
of the derivative yields

d sin(x + θ) − sin x sin x cos θ + sin θ cos x − sin x


sin x = lim = lim
dx θ→0 θ θ→0 θ
( sin θ 1 − cos θ )
= lim cos x − sin x .
θ→0 θ θ

To determine the limits of the quotients sin θ/θ and (1 − cos θ)/θ as θ → 0, assume that θ is given in radians
and consider the portion of the unit circle that lies in the first quadrant:

Q
...
A (1, 0)
.. .......
B .................... P..... ..
1
B (0, 1) area of triangle OCP is sin θ cos θ
... ......... .. 2
... ....
... 1 ... . .... ....... ...
1
C (cos θ, 0) .. area of sector OAP is 2
θ
P (cos θ, sin θ) ... ... ... ..... . area of triangle OAQ is 1
tan θ
.. ... .. .... 2
... .... . .
Q (1, tan θ)
...........θ.........................................
O C A

From the figure, it is clear that the area of triangle OCP is less than the area of sector OAP which in turn
is less than the area of triangle OAQ. Determining these areas in terms of θ and rearranging gives

θ 1
sin θ cos θ < θ < tan θ ⇒ cos θ < < .
sin θ cos θ

Although the figure indicates that θ is positive, this equation is valid for any small nonzero value of θ because
cos(−θ) = cos θ and sin(−θ)/(−θ) = sin θ/θ. Since lim cos θ = 1, the squeeze law gives
θ→0

θ sin θ
lim = 1 or equivalently lim = 1.
θ→0 sin θ θ→0 θ

Using this limit and some algebra yields


( )
1 − cos θ 1 − cos2 θ sin θ sin θ
lim = lim = lim · = 0 · 1 = 0.
θ→0 θ θ→0 θ(1 + cos θ) θ→0 1 + cos θ θ

Given the values of these limits, the derivative of the function sin x is seen to be cos x. Graphing y = sin x,
then using the graph to sketch its derivative makes this result seem very plausible.
The derivatives of the other five trigonometric functions can then be found using trigonometric identities
and previous derivative formulas. For example, the chain rule and the quotient rule give

d d (π ) (π )
cos x = sin − x = cos − x (−1) = − sin x;
dx dx 2 2
d d sin x cos x · cos x − sin x · (− sin x) 1
tan x = = 2
= = sec2 x.
dx dx cos x cos x cos2 x
1.17 Derivatives of Trigonometric Functions 41

The derivatives of the six trigonometric functions are recorded in the following table.
d d d
sin x = cos x tan x = sec2 x sec x = sec x tan x
dx dx dx
d d d
cos x = − sin x cot x = − csc2 x csc x = − csc x cot x
dx dx dx
It is important to remember that all of the other derivative formulas still apply when differentiating functions
that involve trigonometric functions. For example,

d
tan 5x = sec2 5x · 5 = 5 sec2 5x;
dx
d 2
x sec x = x2 sec x tan x + 2x sec x = x sec x(x tan x + 2);
dx
d ( )
cos4 (x2 ) = 4 cos3 (x2 ) · − sin(x2 ) · (2x) = −8x sin(x2 ) cos3 (x2 );
dx
d cos x (4 + sin x)(− sin x) − cos x(cos x) −4 sin x − 1
= 2
= .
dx 4 + sin x (4 + sin x) (4 + sin x)2

For problems such as the third example, some students find it easier to rewrite trigonometric functions
without the shorthand notation for powers:

d d ( )4 ( )3 ( )
cos4 (x2 ) = cos(x2 ) = 4 cos(x2 ) · − sin(x2 ) · (2x) .
dx dx

If you have trouble with this type of derivative, you might find this way of writing the powers helpful. Also, as
in the fourth example, notice that trigonometric identities are sometimes used when simplifying derivatives
of trigonometric functions. In this case, the basic identity sin2 x + cos2 x = 1 was used.

Exercises
1. Find and simplify the derivative of the given function.
a) f (x) = sin x − 2 cos 3x b) g(x) = x2 sin 2x c) h(x) = cos3 2x
cos x
d) y = e) s = sin2 t cos t f ) w = z − 2 sec 4z
3 + sin x
cos x
g) u(x) = sin x − 1
sin3 x h) v(θ) = sec2 5θ i) w(x) =
3
2 + cot2 x
j) F (t) = t2 sin(1/t) k) G(t) = 5 tan 4t l) H(x) = 3 sin4 (2x2 )

2. Use the definition of the derivative to derive the derivative formula for cos x.
3. Use the quotient rule or the chain rule to derive the derivative formula for the given function.
a) cot x b) sec x c) csc x
4. Evaluate each of the following limits, where r is a nonzero real number.
sin 2θ tan rθ tan rθ
a) lim b) lim c) lim
θ→0 sin rθ θ→0 θ θ→0 sin 7θ
5. Find an equation for the line tangent to the curve y = 2 sin x − 3 cos 2x when x = π/6.
6. Find all values of x in [−π, 4π] for which the function f (x) = x − 2 sin x has a horizontal tangent.
7. Show graphically that there is a point on the graph of y = sec x for which the tangent line goes through the
origin. If the x-coordinate of such a point is a, what equation must a satisfy?

8. Find and simplify the derivative of f (x) = 1 + x2 + x4 cos2 5x.
42 Chapter 1 The Derivative

1.18 Derivative Applications of Trigonometric Functions

The derivatives of the inverse trigonometric functions will be useful in the chapter on integration. These
formulas follow fairly easily from the derivatives of the trigonometric functions and some trigonometric
identities. Assuming that arcsin x is differentiable (this fact does require proof, but we will not concern
ourselves with it), its derivative can be found using an identity and the chain rule:

d d
sin(arcsin x) = x ⇒ sin(arcsin x) = 1 ⇒ cos(arcsin x) · arcsin x = 1.
dx dx

It follows that
d 1 1
arcsin x = =√ .
dx cos(arcsin x) 1 − x2
The derivatives of the other inverse trigonometric functions can then be found using their definitions (see
Section 1.16) and, if necessary, the chain rule. For instance,

d d (π ) 1
arccos x = − arcsin x = − √ .
dx dx 2 1 − x2
Proceeding in this way, we obtain the following formulas.

d 1 d 1 d 1
arcsin x = √ arctan x = arcsec x = √
dx 1 − x2 dx 1 + x2 dx |x| x2 − 1
d 1 d 1 d 1
arccos x = − √ arccot x = − arccsc x = − √
dx 1 − x2 dx 1 + x2 dx |x| x2 − 1
As one example, which incorporates the new formulas and the chain rule,

d 1 x 2x
arcsin(x2 /2) = √ ( )2 · x = √(4 − x4 )/4 = √4 − x4 .
dx
1 − x2 /2

The following problems illustrate these new derivative formulas (6 trigonometric functions and 6 inverse
trigonometric functions). They also provide a review of the various applications of the derivative.

Problem: Find the maximum and minimum outputs of f (x) = x − sin 2x on the interval [0, π].

Solution: We first find the critical inputs of f that lie in the interval [0, π] by solving f ′ (x) = 0:

π 5π
f ′ (x) = 1 − 2 cos 2x; f ′ (x) = 0 ⇒ x = , .
6 6

Evaluating f at the critical inputs and the endpoints yields


√ √
π 3 5π 3
f (0) = 0, f (π/6) = − , f (5π/6) = + , f (π) = π.
6 2 6 2

Using a calculator to approximate these values, the maximum output of f is f (5π/6) ≈ 3.4840 and the
minimum output of f is f (π/6) ≈ −0.3424. .......
... .....
. ..
Problem: Find the value of x that will maximize θ. ... .......
3 .. .....
... .....
.. .....
....... .
1 ............
. . ... ... .... ................θ........
...........................x..................................
1.18 Derivative Applications of Trigonometric Functions 43

Solution: Using right triangles, we can express θ as a function of x:

θ(x) = arccot(x/4) − arccot x, 0 < x < ∞.

To find the value of x that maximizes θ, first find and simplify θ′ (x). The result is

4 1 12 − 3x2
θ′ (x) = − 2
+ 2
= .
16 + x 1+x (1 + x2 )(16 + x2 )

The only relevant critical input is 2. Since θ′ is positive on the interval (0, 2) and negative on the interval
(2, ∞), the function θ has a maximum output when x = 2. The corresponding maximum value for θ is
approximately 36.87◦ . (Do you see the advantage of using arccot x rather than arctan x in this problem?)

Exercises
1. Find and simplify the derivative of the given function.
a) f (x) = arcsin(x/2) b) g(t) = arctan(t/4) c) h(x) = arccos(3/x)
( )2 √
d) y = arcsin x e) s = (1 + t2 ) arctan t f ) r = θ arccos θ − 1 − θ2

2. Use the chain rule to verify the derivative formulas for arctan x and arccsc x.

3. Find the maximum and minimum outputs of the function f (x) = sin x + 3 cos x on the interval [0, π].
4. Find the maximum and minimum outputs of the function g(x) = x − 2 cos x on the interval [−π, π].
5. Find the maximum and minimum outputs of the function h(x) = sin2 x + cos x on the interval [0, π].
6. Determine the intervals in [0, 2π] on which the function f (x) = sin x + sin2 x is increasing and those on which it
is decreasing.
7. A line of length 60 is split into equal thirds. The right and left thirds are then each bent upward through the
same angle θ to form a (topless) trapezoid. Find the value of θ that will maximize the area of the trapezoid.
8. Consider each of the following figures with quantities as indicated.
........ .... .......
........... ...... ... .....
... ... ..... ... ... ... .....
... .. ..... ... .. ... .....
... ... θ ..... ... ... . .. .....
... ... .....
.....
... ... ... ....... b ...
...
.....
.....
12 ... ... 3 ... ... .. .
... ... .....
..... ... .... .. .... ... ...
..
.....
.....
... ... ... .. .
... ... .....
..... ... ... ....... .. 2
..
...
...................... .....
.....
............. ... . .............. ....θ.... . . . .. ... ..... .
................................................................................ .......................................................................... a ............
..........................................................................θ....................
x 2x x 3−x x
a) For the left figure, find the value of x ∈ (0, ∞) that will maximize θ.
b) For the middle figure, find the value of x ∈ [0, 3] that will maximize θ.
c) For the right figure, find the value of x ∈ (0, ∞) that will maximize θ. Treat a and b as constants.
44 Chapter 1 The Derivative

1.19 Properties of Exponents and Logarithms

Let a be a real number and let n be a positive integer. The symbol an represents the product of a with itself
n times; it is simply a shorthand notation. From this definition, it follows easily that

am ( )n
am an = am+n , = am−n , and am = amn
an

for positive integers m and n. Consistency with these rules yields the properties (give these some thought):

1 √ √
a1 = a, a0 = 1, a−n = , a1/n = n
a, and am/n = n
am .
an

In this way, the expression ax can be defined for any rational number x. However, in order to apply calculus
to an exponential function, the domain must be all real numbers. In other words, it is necessary to assign
a meaning to ax even when x is irrational. Since an irrational number can be approximated to any degree
of accuracy by a rational number, the expression ax can be approximated to any degree of accuracy for an

2
irrational exponent x. For example, a is the limit of the sequence a1.4 , a1.41 , a1.414 , . . .. It can be shown
that the function f defined by f (x) = ax , where a is any positive number, is continuous for all real numbers;
the general shape of its graph is given below.

y .
y.
...... ...... ...
......
. ... ......
ar as = ar+s a0 = 1
... ...
...
y = ax , a>1 .. ... y = ax , 0 < a < 1 ar
= ar−s a1 = a
... . .... as
.•. (1, a) ....
......
. .....
.....
( r )s
(0, 1)..•.... a = ars (ab)r = ar br
....... (0, 1) •............. (1, a)
... ......
.
...
. ... ..
x
•..........
..........
.... x
.......... .

The properties of exponents for arbitrary real numbers are listed to the right of the graphs.
When working with exponential functions, it is sometimes necessary to solve equations such as 2x = 21
or 10x = 280. This leads to the concept of logarithm. The symbol loga b represents the exponent to which
a must be raised in order to obtain the number b. In other words, the equation x = loga b is equivalent to
ax = b. The graph of y = loga x, where a > 1, is sketched below. Note that the domains of these functions
are (0, ∞).
y
..
......
.. ..
y = loga x, a > 1
........ loga 1 = 0
..
...
..............
. loga (rs) = loga r + loga s
......
.
...

........ loga a = 1
. loga (r/s) = loga r − loga s
.... (a, 1)
..•... .....
.......... x
loga ar = r, r ∈ R
.. (1, 0) loga rs = s loga r
.. aloga r = r, r > 0
...

Since a logarithm is an exponent, logarithms have properties that are related to the properties of exponents.
These are listed to the right of the graph.
1.19 Properties of Exponents and Logarithms 45

Although logarithms can be defined using any positive number a as the base, the most common choices
for bases are 10 and e. The choice of 10 should come as no surprise. The number e, which is a very
( )1/h
important mathematical constant, can be defined by e = lim 1 + h . To five decimal places, e ≈ 2.71828.
h→0
Logarithms to the base e are usually written as ln x rather than loge x, that is, ln x = loge x.

Problem: Given that loga 2 = r and loga 3 = s, express loga 72 in terms of r and s.

Solution: Using the properties of logarithms, we find that

loga 72 = loga (23 · 32 ) = loga 23 + loga 32 = 3 loga 2 + 2 loga 3 = 3r + 2s.

Problem: Find the exact value of x that satisfies the equation 5 + 2e4x = 89.

Solution: To solve this problem, first isolate the exponential term, then rewrite the equation in logarithmic
form:
1
5 + 2e4x = 89 ⇒ e4x = 42 ⇒ 4x = ln 42 ⇒ x = ln 42.
4
This represents the exact value of x that satisfies the equation.

Exercises

1. Without a calculator, find log10 0.01, log3 (1/81), log2 32, log4 8, ln e2 , ln(1/e), and log10 10.
2. Suppose that loga 2 = r, loga 3 = s, and loga 10 = t. Express loga 6, loga 15, loga (1/9), loga 4000, loga 24,
loga 30, loga (9/20), and loga 0.0003 in terms of r, s, and t.
3. Find the exact value of x that satisfies ln(x − 2) − ln 5 = 1.
4. Find the exact value of x that satisfies the equation, then use a calculator to approximate x to the nearest
thousandth.
a) 4x = 25 b) x = log6 1000 c) 150e0.2x = 1000
10e−0.1x
d) 100 − e0.04x = 20 e) =4 f ) ex + e−x = 6
1 + 0.3e−0.1x
5. The temperature of an object in degrees Celsius at time t minutes is given by T (t) = 82 − 40e−0.3t . What is the
initial temperature? What is the temperature when t = 3? When will the temperature be 80◦ Celsius?
6. Suppose that a function f is defined by f (x) = Cekx , where C and k are constants. Given that f (2) = 100 and
f (5) = 800, find C and k.
7. Jack and Jill plant beans. Jack’s beanstalk is t2 inches tall after t days while Jill’s beanstalk is 2t inches tall
after t days. How tall is each beanstalk after 3 weeks? Give your answers in suitable units.

8. Evaluate the expressions 10 x and log10 x for x = 100 and x = 101000 . Which of the two expressions is greater
for large values of x? Find an exact value of x for which the two expressions are equal.
9. Let a > 0. Prove that ax = ex ln a for all real numbers x.
ln x
10. Let a > 0. Prove that loga x = for all positive real numbers x.
ln a
( )1/h
11. Use the fact that e = lim 1 + h to evaluate each of the following limits. Treat a as a nonzero constant.
( )a/h
h→0
( )1/h ( )n
1
a) lim 1 + h b) lim 1 + ah c) lim 1 +
h→0 h→0 n→∞ n
46 Chapter 1 The Derivative

1.20 Derivatives of Exponential and Logarithmic Functions

The derivative formula for an exponential function is quite simple. To derive this formula, recall that the
number e is defined by e = lim (1 + h)1/h . This means that (1 + h)1/h ≈ e or (eh − 1)/h ≈ 1 when h is near
h→0
0. Since the approximations improve as h → 0, it follows that lim (eh − 1)/h = 1. This fact, along with the
h→0
definition of the derivative, can then be used to find the derivative of the exponential function ex :

d x ex+h − ex ex eh − ex eh − 1
e = lim = lim = ex lim = ex ,
dx h→0 h h→0 h h→0 h

a very simple derivative formula. By a property of exponents and the chain rule,

d x d x ln a
a = e = ex ln a ln a = (ln a) ax ,
dx dx

for any positive number a. Hence, the derivative of any exponential function is a constant multiple of
the original function. In other words, the derivative of ax is proportional to ax and ln a is the constant of
proportionality. The reason e is a special number is because the constant of proportionality for the derivative
of ex is 1.
The following examples illustrate this new derivative formula.

d
(3x2 + 5ex ) = 6x + 5ex ;
dx
d 4x
(e − 3e−2x ) = e4x · 4 − 3e−2x · (−2) = 4e4x + 6e−2x ;
dx
d 2 2 2 1 2
xex /4 = xex /4 · (x/2) + ex /4 = (x2 + 2)ex /4 ;
dx 2
d x √ √ 1
(2 + 3 x ) = (ln 2) 2x + (ln 3) 3 x · √ .
dx 2 x

d f (x)
To make it explicit, the chain rule yields e = f ′ (x)ef (x) .
dx
To find the derivative of the function ln x, we begin with the identity eln x = x, which is valid for all
x > 0. Assuming that ln x is differentiable, differentiating both sides of this identity yields

d d 1 1
eln x · ln x = 1 ⇒ ln x = ln x = ,
dx dx e x

another simple derivative formula. For example,

d 1
(x3 ln x) = x3 · + 3x2 ln x = x2 (1 + 3 ln x);
dx x
d 1 2x + 6
2
ln(x + 6x + 10) = 2 · (2x + 6) = 2 .
dx x + 6x + 10 x + 6x + 10
d −1 1
Note that ln(−x) = = for all x < 0. This fact, along with the chain rule, yields the formulas
dx −x x
d 1 d f ′ (x)
ln |x| = and ln f (x) = .
dx x dx f (x)

Problem: Determine the intervals on which the function f defined by f (x) = x2 e−x/4 is increasing and
those on which it is decreasing.
1.20 Derivatives of Exponential and Logarithmic Functions 47

Solution: The function f is differentiable for all real numbers, so the only critical inputs will be those for
which f ′ (x) = 0. Since
( 1) x
f ′ (x) = x2 · e−x/4 − + 2x · e−x/4 = (8 − x) e−x/4 ,
4 4

we see that f ′ (x) = 0 when x is 0 or 8. Checking the sign of f ′ on the intervals (−∞, 0), (0, 8), and (8, ∞),
it follows that f is decreasing on the intervals (−∞, 0] and [8, ∞), and increasing on the interval [0, 8].

Problem: Find the maximum output of the function g defined by g(x) = x−1/4 ln x.

Solution: The domain of the function g is (0, ∞). Note that g(x) < 0 for 0 < x < 1 and g(x) > 0 for x > 1.
Using the product rule to find g ′ , we obtain

1 1 −5/4 ( 1 ) 4 − ln x
g ′ (x) = x−1/4 · − x ln x = x−5/4 1 − ln x =
x 4 4 4x5/4

and conclude that g ′ is positive on (0, e4 ) and negative on (e4 , ∞). It follows that g has a maximum value
when x = e4 . Substituting this value into the formula for g reveals that the maximum output of g is 4/e.

Exercises
1. Find and simplify the derivative of the given function.
2
a) f (x) = ex + 2e−2x b) g(x) = 5x − 4e−x c) h(x) = (x + 1)e−x
d) F (x) = ln |2x + 1| e) G(x) = esin 2x f ) H(x) = 6 tan(ex/2 )
g) y = cos2 (e−x ) h) s = e−2t sin 3t i) w = (ln z)4
ln x
j) u(x) = ln |x2 + 4x + 2| k) v(x) = x ln x l) w(x) = 3
x
et √
m) y = x3 + 3x n) s = o) w = z 2 + (ln z)2
1 + e2t
2. The velocity v in meters per second of a particle at time t seconds is given by v = (t2 + 100t)e−t . What is the
acceleration of the particle after ten seconds?
3. Find the exact minimum output of the function f (x) = ex + e−2x .
4. Find the maximum output of the function g(x) = a2 xe−ax , where a is a positive constant.

5. Find the maximum and minimum outputs of (ln x)/ x on the interval [1, 25].
6. Find the intervals on which the function is increasing and those on which it is decreasing and determine the
nature of the critical inputs.
2 ln x (ln x)2
a) f (x) = x2 e−x b) g(x) = 2 c) h(x) =
x x
7. Find a point on the graph of y = 2ex/3 at which the tangent line passes through the origin.
d r
8. Prove that x = rxr−1 for any nonzero real number r. Hint: Note that xr = er ln x .
dx
9. Consider the rather unusual function f (x) = xx , defined for x > 0. Find the derivative of this function, then
find its minimum output. Hint: Write xx = ex ln x .
10. Use a property of logarithms to find a derivative formula for loga x, where a > 0.
48 Chapter 1 The Derivative

1.21 Differential Equations

In some cases, it is necessary to find a function given information about its derivative, usually in the form of
an equation. An equation that involves the derivative of a function is known as a differential equation. A
solution to a differential equation is a function that satisfies the equation. As an extremely simple example,
suppose that we need to find a function f with the property that f ′ (x) = 2x. It is clear that the function
f (x) = x2 satisfies this differential equation. Note that f (x) = x2 + 1 and f (x) = x2 − 8 also satisfy the
differential equation. In fact, any function of the form f (x) = x2 + C, where C is a constant, satisfies the
equation. This follows from the fact that the derivative of a constant is zero. In general, there are an infinite
number of solutions to a given differential equation. However, if further information about the function is
known, a unique solution can be determined. Suppose that f ′ (x) = 2x and f (1) = 4. The only function f
that satisfies both of these conditions is f (x) = x2 + 3.
One way to find a function given its derivative is to just think about differentiation in reverse. For
instance, for power functions differentiation reduces the exponent by one; the reverse process will add one
to the exponent. If f ′ (x) = x5 , then f (x) = 1
6 x6 + C, where C is any constant. Two further examples of
simple differential equations follow.
If f ′ (x) = 6x2 − 4x + 1 and f (1) = 0, then f (x) = 2x3 − 2x2 + x − 1.
If g ′ (x) = 4 cos x and g(π) = 2, then g(x) = 4 sin x + 2.
Although there are many techniques for solving differential equations of this type (some will be considered
in the next chapter), for now just think about differentiation in reverse. One of the advantages of differential
equations is the fact that you can always check your answer by taking the derivative of your proposed solution
and seeing if it satisfies the given differential equation.
Now consider the differential equation f ′ (x) = f (x). This differential equation is of a different type
than the ones just considered; the unknown function appears on both sides of the equation. A solution to
this equation is a function whose derivative is the same as the original function. Thinking over the list of
derivative formulas indicates that a solution is f (x) = ex . Where should the generic constant C go? A quick
check shows that f (x) = ex + C does not satisfy the equation f ′ (x) = f (x), but f (x) = Cex does. In general,
if both C and k are constants, the chain rule gives

d ( )
Cekx = Cekx · k = k Cekx .
dx

This shows that the derivative of Cekx is k times itself. This discussion essentially proves the following
theorem.

THEOREM 1.20 Let G be a function and let k be a constant. If G′ (x) = kG(x) for all x, then
G(x) = Cekx , where C is a constant that can be determined from further information. In fact, it is easy to
see that C = G(0).

Here are two examples that illustrate the type of differential equation considered in the theorem.
If f ′ (x) = 4f (x) and f (0) = 20, then f (x) = 20e4x .
1.21 Differential Equations 49

If s′ (t) = −s(t) and s(0) = 5, then s(t) = 5e−t .


It is important to remember that this type of differential equation is different than the earlier type. In the
first case, the derivative of the function is explicitly given, but in these last examples, it is given that the
derivative of the function is a multiple of the function.
Using a little bit of algebra, we can solve slightly more complicated differential equations. The following
two examples illustrate the main ideas.

Problem: Find a function f such that f ′ (x) = f (x) + 3 and f (0) = 10.

Solution: Since the functions f (x) and f (x) + 3 have the same derivative, we find that

d ( )
f ′ (x) = f (x) + 3 ⇔ f (x) + 3 = f (x) + 3
dx

The second equation is of the form given by the theorem with G(x) = f (x) + 3 and k = 1. It follows that
the function f satisfies f (x) + 3 = Cex . Using the fact that f (0) = 10 gives C = 13. Thus the solution to
the differential equation is f (x) = 13ex − 3.

Problem: Find a function A such that A′ (t) = 10 − 2A(t) and A(0) = 40.

Solution: The first step is to factor out the coefficient of A(t), then proceed as in the previous example.

( ) d( ) ( )
A′ (t) = −2 A(t) − 5 ⇒ A(t) − 5 = −2 A(t) − 5 ⇒ A(t) − 5 = Ce−2t .
dt

(In this case, G is the function A(t) − 5 and k = −2.) Using the fact that A(0) = 40 gives C = 35. Thus the
solution to the differential equation is A(t) = 35e−2t + 5.

Exercises
1. Solve each of the following simple differential equations.
2 √
a) f ′ (x) = 3x2 , f (1) = 2 b) g ′ (x) = , g(1) = 5 c) h′ (x) = x, h(4) = 6
x
d) F ′ (x) = 3x2 − 8x, F (0) = 1 e) G′ (x) = x4 + 2x3 − x + 1, G(1) = 3
t
f ) u′ (t) = 4e2t , u(0) = 12 g) v ′ (t) = 8 sin(2t), v(π) = 5 h ) w′ (t) = √ , w(0) = 2
t2 + 1
2. Solve each of the following differential equations.
a) f ′ (t) = −2f (t), f (0) = 8 b) g ′ (t) = 4g(t), g(0) = 5 c) h′ (t) = 5h(t), h(1) = 2
( )
d) F ′ (x) = 2 + F (x), F (0) = 10 e) G′ (x) = 4 − G(x), G(0) = 30 f ) H ′ (x) = 3 1 + H(x) , H(0) = 46

3. Solve each of the following differential equations.


1
a) A′ (t) = 8 + 2A(t), A(0) = 20 b) B ′ (t) = 15 − 5B(t), B(0) = 2 c) S ′ (t) = 80 − S(t), S(0) = 40
3
50 Chapter 1 The Derivative

1.22 Applications of Differential Equations

Since a derivative can be interpreted as a rate of change, it can be used in many quantitative situations
that involve change. A common situation that occurs in applications is one in which a quantity changes
at a rate proportional to its size. Radioactive decay, population growth, chemical reactions, and debts or
annuities involving interest can, under certain conditions, experience changes of this type. If A(t) represents
a function of time t, then the statement “the rate of change of A is proportional to A” can be translated
mathematically as A′ (t) = kA(t), where k is a constant. The solution to this differential equation (see the
last section) is A(t) = Cekt , where C = A(0) is a constant. Two applications of this idea are given below.

Problem: Suppose a certain radioactive sample decays from 800 grams to 500 grams in 2 years. When will
only 100 grams of the sample remain?

Solution: Let A(t) be the number of grams in the sample after t years. It is given that A(0) = 800 and
A(2) = 500. Since radioactive samples decay at a rate proportional to the amount present, there exists a
constant k such that A′ (t) = kA(t). (Note that k will be a negative number.) It follows that A(t) = 800ekt .
To find k, we use the fact that A(2) = 500:

1 ( )
500 = 800ek·2 ⇒ k = ln 5/8 .
2

Rather than obtain a decimal approximation for k (which can lead to roundoff errors), it is best to leave k
in exact form. Thus A(t) = 800ekt , where k has the value found above. To answer the question, solve the
equation A(t) = 100 for t. This yields
( ) ( )
ln 1/8 2 ln 1/8
100 = 800e kt
⇒ t= = ( ) ≈ 8.85.
k ln 5/8

Hence, there will be 100 grams of the sample about 8.85 years after the initial amount of 800 grams.

Problem: A 300 gallon tank contains 200 gallons of brine (salt dissolved in water) with a concentration of
1/4 pound of salt per gallon of water. A brine containing 1 pound of salt per gallon of water runs into the
tank at the rate of 4 gallons per minute, and the well-stirred mixture runs out of the tank at the same rate.
When will there be 100 pounds of salt in the tank?

Solution: Let A(t) be the number of pounds of salt in the tank after t minutes. Since the initial concentration
is 1/4 lb/gal, it is easy to see that A(0) = 200 · (1/4) = 50. However, more effort is required to find a
differential equation for A(t). The units of A′ (t) are pounds per minute, so we need to determine the number
of pounds of salt per minute entering the tank and subtract the number of pounds of salt per minute leaving
the tank. The inflow concentration is 1 pound of salt per gallon and the outflow concentration is A(t)/200
pounds of salt per gallon. (This is where the well-stirred assumption is used; it is assumed that the salt is
spread evenly throughout the tank.) Thus the rates at which salt enters and leaves the tank are

( ) ( ) ( ) ( A(t) ) A(t)
4 gal/min · 1 lb/gal = 4 lb/min and 4 gal/min · lb/gal = lb/min
200 50
1.22 Applications of Differential Equations 51

respectively. It follows that A′ (t) = 4 − A(t)/50. To find A(t), we need to solve the differential equation

1
A′ (t) = 4 − A(t), A(0) = 50.
50

The techniques in the previous section yield A(t) = 200−150e−t/50 . To answer the question, solve A(t) = 100
for t to obtain t = 50 ln 1.5. Therefore, there are 100 pounds of salt in the tank about 20.27 minutes after
the process is started.

Exercises
1. The radioactive isotope thorium-234 has a half-life (the time required for half of a sample to decay) of about
590 hours. If 50 mg are present initially, after how many hours will there be only 40 mg left?
2. A population of bacteria increases at a rate proportional to the current population. Initially there were 100
bacteria and one hour later there are 130 bacteria. How many bacteria will there be after 24 hours?
3. The rate at which sugar dissolves in water kept at a constant temperature is proportional to the amount that
remains to be dissolved. Suppose that 10 pounds of sugar is added to a vat of water and that 5 pounds dissolves
in the first five minutes. How long does it take for the next 4 pounds to dissolve?
4. A 300 gallon tank contains 200 gallons of brine with a concentration of 0.4 pounds of salt per gallon of water.
A brine containing 1.2 pounds of salt per gallon of water runs into the tank at the rate of 5 gallons per minute,
and the well-stirred mixture runs out of the tank at the same rate. When will there be 180 pounds of salt in
the tank?
5. A tank contains 300 gallons of fresh water. A brine containing 0.5 pounds of salt per gallon of water runs into
the tank at the rate of two gallons per minute, and the well-stirred mixture runs out at the same rate. What is
the concentration of salt in the tank at the end of ten minutes?
6. A 300 gallon tank contains 100 gallons of brine with a concentration of one pound of salt per gallon of water. A
brine containing 0.5 pounds of salt per gallon of water runs into the tank at the rate of four gallons per minute,
and the well-stirred mixture runs out of the tank at the same rate. When is the concentration of salt in the tank
0.7 pounds per gallon?
7. Five hundred gallons of pesticide is accidentally spilled into a lake with a volume of 8 ×107 gallons and uniformly
mixes with the lake water. A river flows into the lake bringing 10,000 gallons of fresh water per minute and
water leaves the lake at the same rate. Estimate how long will it take to reduce the pesticide in the lake to one
part per billion.
8. A cup of tea is made by pouring boiling water (212◦ F) into a cup. Three minutes later, the tea is brewed but
at 180◦ F is too hot to drink. Assuming that the air is a constant 68◦ F, how long will it take for the tea to
become a drinkable temperature of 160◦ F? (Use Newton’s Law of Cooling, which states that the rate of change
of temperature of an object is proportional to the difference in temperature of the object and its surroundings.)
9. Suppose that you have acquired a college debt of $12,000. The annual interest rate is 6% and you pay $200 each
month. How long does it take you to pay off your loan and how much do you pay altogether? Answer these
same questions assuming you only pay $100 each month. Hint: In order to obtain a differential equation, we
need to assume that both interest and payments are made continuously. Let A(t) be the amount you owe after
t years. Since you are paying at the rate of $2400 per year, the function A satisfies the differential equation
A′ (t) = 0.06A(t) − 2400.
10. Repeat Exercise 9 assuming that the interest rate is 9%.
11. Repeat Exercise 9 assuming that the original debt is $20,000.
12. Suppose that you have a credit card debt of $1100. The annual interest rate is 12% and you pay $15 each month.
How long does it take you to pay off your loan and how much do you pay altogether? See the hint for Exercise
9.
52 Chapter 1 The Derivative

1.23 Definition of Limit

An intuitive understanding of the symbols lim f (x) = L is sufficient for many situations, but there are
x→c
applications (both applied and theoretical) in which it is necessary to be more specific than the phrase “f (x)
is close to L when x is close to c”. During the eighteenth and early nineteenth centuries, mathematicians
struggled to come up with a suitable definition for the limit concept. The definition obtained at the end of
this period is still in use today. This abstract but important concept will be the topic for this section.
A function f has a limit of L at a point c if the output values of f are near L when the input values are
near c. In order to turn this sentence into a mathematical definition, it is necessary to explicitly quantify
the phrases “near L” and “near c”. The absolute value function can be used to determine when one number
is near another since |a − b| represents the distance between the points a and b. If |x − c| and |f (x) − L| are
small, then x is near c and f (x) is near L. In order for the limit of f at c to be L, the functional values f (x)
must be within a prescribed (small and arbitrary) distance of L for all input values of x close enough to c,
but not including c. The value of the function at c does not come into play when finding the limit at c; the
function may not even be defined at c. A function f has a limit L at c if given any positive number ϵ, it is
possible to find another positive number δ so that |f (x) − L| < ϵ for all x that satisfy 0 < |x − c| < δ. That
is, if you want the functional values of f to be within ϵ of L, you must choose the input values of f to be
within δ of c (see the figure).
y
..
.....
......
... y = f (x)
...
...
... To guarantee that |f (x) − L| < ϵ
...
. for all x that satisfy 0 < |x − c| < δ,
L + ϵ ... ... ... ... ... ... ... ... ... ... ... ............
.. ... choose δ = min{v − c, c − u}.
L ......
...
...............
..... ... ... ... ..
.
L−ϵ ... ... ... ... ... ... ... ... .
. . .
.. ... . ..
.. ............
..
..
..
..
.. ..........
.............
.
..... x
u c v

DEFINITION 1.21 Let f be defined on some open interval containing the point c, except possibly at
c. Then lim f (x) = L if for each ϵ > 0 there exists δ > 0 such that |f (x) − L| < ϵ for all x that satisfy
x→c
0 < |x − c| < δ.

This is not an easy definition and it takes time and effort to understand it well. For the time being,
it is sufficient to understand the quantities that appear in the definition and how they correspond to the
intuitive notion of a limit. For starters, the first problem below illustrates the relationship between ϵ and δ
using actual numbers. The second problem provides a brief glance into the nature of limit proofs.
1 1
Problem: Consider lim √ = . Given ϵ = 0.01, find (to 4 decimal places) a suitable value for δ.
x→4 x 2
Solution: According to the definition of limit, we must find δ > 0 so that
1 1

0 < |x − 4| < δ ⇒ √ − < 0.01.
x 2
1.23 Definition of Limit 53

We begin with the second inequality and perform algebra until the quantity x − 4 appears:
1 1
1 1
√ − < 0.01 ⇔ −0.01 < √ − 0.5 < 0.01 ⇔ 0.49 < √ < 0.51 ⇔
x 2 x x
1 √ 1 1 1 1 1
< x< ⇔ <x< ⇔ −4<x−4< − 4.
0.51 0.49 0.512 0.492 0.512 0.492
Using a calculator, we find that −0.1553248 < x − 4 < 0.1649312. We can thus choose δ to be any positive
number that is smaller than 0.1553248. Hence, a suitable choice for δ is 0.1553.

Problem: Prove that lim (4x − 5) = 3.


x→2

Solution: Given ϵ > 0, we must choose δ > 0 so that


0 < |x − 2| < δ ⇒ (4x − 5) − 3 < ϵ.


Noting that (4x − 5) − 3 = 4x − 8 = 4 x − 2 , it is clear that δ = ϵ/4 has the desired property. A formal
proof reads as follows. Let ϵ > 0 and choose δ = ϵ/4. For all x that satisfy 0 < |x − 2| < δ, we find that

|(4x − 5) − 3| = |4x − 8| = 4|x − 2| < 4δ = ϵ.

This shows that lim (4x − 5) = 3.


x→2

Exercises
1. For the given limit and value of ϵ, find (to 4 decimal places) a suitable choice for δ.

a) lim x2 = 4, ϵ = 0.05 b) lim x = 1, ϵ = 0.01 c) lim x3 = 64, ϵ = 0.2
x→2 x→1 x→4
1 1 1 1
d) lim = , ϵ = e) lim sin x = , ϵ = 0.1 f ) lim ex = e2 , ϵ = 0.04
x→3 x 3 25 x→π/6 2 x→2

2
2. For the limit lim x = 4, find (to 6 decimal places) a suitable δ for the given value of ϵ.
x→2
a) ϵ = 0.01 b) ϵ = 0.004 c) ϵ = 0.0003
3. For each limit, find (to 4 decimal places) a suitable δ for ϵ = 1.
a) lim x2 = 4 b) lim x2 = 400 c) lim x2 = 10, 000
x→2 x→20 x→100
2 2
4. A square must have an area of 400 cm give or take 5 cm .
a) What range of values for the side of the square will meet these specifications?
b) Formulate this problem in the form lim f (x) = L and identify f , c, L, ϵ, and δ.
x→c

5. A spherical ball bearing must have a volume of 40 cm3 give or take 1 cm3 .
a) What range of radius values will meet these specifications?
b) Formulate this problem in the form lim f (x) = L and identify f , c, L, ϵ, and δ.
x→c
6. Use the definition of limit to prove each of the following.
a) lim (10x + 3) = 13 b) lim (3 − 4x) = 7 c) lim x2 = 9
x→1 x→−1 x→3

7. Suppose that lim f (x) = L and that k is a nonzero constant. Prove that lim kf (x) = kL.
x→c x→c
54 Chapter 1 The Derivative

1.24 Limits Involving ∞

The symbol ∞ represents a quantity that is larger than every real number. It is worth emphasizing that
∞ is a concept, not a number. To say that x goes to ∞ or to write x → ∞ means that the variable x is
eventually larger than any real number, that is, the values of x are growing without bound. The expression
lim f (x) = L means that the output values f (x) are near L when the input values x are extremely large or,
x→∞
to say it another way, f (x) gets closer and closer to L as x gets bigger and bigger. This statement can be made
mathematically precise, but this intuitive version will be suitable for us. The expression lim f (x) = L is
x→−∞
defined similarly. The simplest example of this concept is lim 1/x = 0.
x→∞
To compute limits of the form lim f (x) when f is a fraction made up of algebraic functions, a good
x→∞
procedure is the following: find the highest power of x that appears in the denominator and divide both
numerator and denominator by that power of x. This is illustrated in the following examples.

3 1
2x3 + 3x − 1 2x3 + 3x − 1 1/x3 2+ 2 − 3 2
lim = lim · = lim x x = ;
x→∞ 3x3 − 4x2 + 6x x→∞ 3x3 − 4x2 + 6x 1/x3 x→∞ 4 6 3
3− + 2
x x
8
(√ ) 7x + 8 7+
x 7
lim x2 + 7x + 8 − x = lim √ = lim √ = .
x→∞ x→∞ 2
x + 7x + 8 + x x→∞ 7 8 2
1+ + 2 +1
x x

Note the use of the conjugate in the second example.


The expression lim f (x) = ∞ means that the output values f (x) become arbitrarily large when the
x→c
input values x are near c, that is, f (x) gets big as x approaches c. A limit of this type does not exist since
there is no number which the output values get close to. However, the symbols give some indication as to
why the limit does not exist and are a useful mathematical shorthand. Some examples include (one-sided
limits of this type are defined in the expected way)

1 3 1 2x − 1
lim = ∞, lim = ∞, lim− = −∞, lim+ 2 = ∞, and lim+ e1/x = ∞.
x→0 x2 x→2 |x − 2| x→0 x x→3 x − 9 x→0

As indicated in the first four of these examples, if the numerator approaches a nonzero number while the
denominator approaches 0, then an infinite limit occurs.
Limits involving ∞ can be used to define the concept of an asymptote for the graph of a function.

DEFINITION 1.22 Let f be a function.

a) The line x = c is a vertical asymptote for the graph y = f (x) if either lim− |f (x)| = ∞ or
x→c
lim |f (x)| = ∞. (The graph goes off the scale vertically at c.)
x→c+

b) The line y = d is a horizontal asymptote for the graph y = f (x) if either lim f (x) = d or
x→−∞
lim f (x) = d. (The graph resembles the horizontal line y = d when x is large.)
x→∞

Horizontal asymptotes are fairly easy to spot; just check the limits lim f (x) and lim f (x). Vertical
x→−∞ x→∞
asymptotes require a little more effort. A rule of thumb is that vertical asymptotes occur when a division by
1.24 Limits Involving ∞ 55

zero appears in the formula for the function. However, it is still necessary to verify that the function has an
infinite limit at such points. Furthermore, a function can have an infinite limit without a division by zero;
the function ln |x| has a vertical asymptote at 0. It is thus necessary to think in terms of the definition and
not only in terms of division by zero. For example, the function f defined by

x2 − 3x − 10
f (x) =
x2 − 4

has a vertical asymptote at x = 2, but there is no vertical asymptote at x = −2. As the reader should verify,
the function f has a limit as x → −2.

Exercises
1. Evaluate each of the following limits.
2x + 5 x − 2x3 x2
a) lim b) lim c) lim
x→∞ 6x − 1 + 6x − 4
x→∞ 5x3 x→∞ (4 − x)(3x + 11)
(√ ) ( √ ) 2
d) lim x2 − 3x + 6 − x e) lim 2x − 4x2 + 9 f ) lim e−x /3
x→∞ x→∞ x→∞
( ) √
1 − 2x x2 + 8
g) lim 2+e x
h) lim √ i) lim
x→−∞ x→−∞ x2 + 4x + 5 x→∞ 3x + 2

2. Evaluate each of the following limits.


1 2x + 3 x2
a) lim b) lim c) lim
x→2 (x − 2)4 x→1 |x2 − x| x→2− x2 −4
2x + 1 2x2 + 3x − 5 x+1
d) lim √ e) lim f ) lim
x→3+ x−3 x→−1+ x2 + 3x + 2 x→2+ x2 + x − 6
2
( )
g) lim cot x h) lim e2/x i) lim 5 − ln |x − 2|
x→π − x→0 x→2

3. Write a precise mathematical definition (see the previous section) for lim f (x) = ∞ and lim f (x) = L.
x→c x→∞
4. Sketch the graph of a function with the given properties.
a) lim f (x) = ∞, lim f (x) = 1, and lim f (x) = 1
x→2 x→∞ x→−∞

b) lim g(x) = ∞, lim g(x) = −∞, and lim g(x) = 0


x→3− x→3+ x→∞

c) lim h(x) = −∞, lim h(x) = 4, and lim h(x) = 1


x→−1 x→∞ x→−∞

5. Find all of the asymptotes, both vertical and horizontal, for the given function.
4x − 1 2x2 + 3 ex
a) f (x) = b) g(x) = c) h(x) =
2x + 3 x2 − 5x + 6 ex −2
6. Find all of the vertical asymptotes for the given function. Be careful.
x2 − 4 sin x
a) f (x) = b) g(x) = c) h(x) = ln |2x − 1|
x3 − x2 − 2x x2 − 2x
7. Explain in words what the expression lim f (x) = ∞ means. Then evaluate the limits.
x→∞
( ) ( √ ) x3
a) lim x − 40x
3 2
b) lim 10 x − x c) lim 2
x→∞ x→∞ x→∞ x + 6x + 10
56 Chapter 1 The Derivative

1.25 ^ pital's Rule


L'Ho

The derivative concept is defined in terms of limits; all derivative formulas have their roots in properties of
limits. However, once derivative formulas have been established, it is possible to use derivatives to evaluate
limits. For example,

x1000 − 1 ( x1000 − 1 1 ) d 1000 1 1000


lim = lim · = x · = = 500.
x→1 x − 1
2 x→1 x−1 x+1 dx x=1 2 2

The use of derivatives to evaluate limits yields the following result known as L’Hôpital’s Rule. The “suitable
conditions” mentioned in the hypotheses involve continuity and differentiability properties that will always
be satisfied by the functions we consider.

THEOREM 1.23 L’Hôpital’s Rule Under suitable conditions on the functions f and g, if either the
f (x) f ′ (x)
limits lim f (x) and lim g(x) are both 0 or both infinite, then lim = lim ′ , assuming that the latter
g(x) g (x)
limit exists. (The limits can be of any type: x → a, x → a , x → −∞, etc.)
+

Proof. Because it covers so many situations, this is not an easy result to prove. We offer here a proof
that is valid under the assumption that f ′ and g ′ are continuous functions with lim f (x) = 0 = lim g(x)
x→a x→a
and g ′ (a) ̸= 0.

f (x) f (x) − f (a)


lim = lim f, g are continuous at a
x→a g(x) x→a g(x) − g(a)
( f (x) − f (a) x−a )
= lim · multiply by 1
x→a x−a g(x) − g(a)

f (a)
= ′ definition of derivative
g (a)
f ′ (x)
= lim ′ f ′ , g ′ are continuous
x→a g (x)

Although not a general proof, this proof does give a good idea why L’Hôpital’s Rule works.

We illustrate this rule with three examples. It is important to check that the limit is of the form 0/0 or
∞/∞ before proceeding. If after one step, the limit is still of the form 0/0 or ∞/∞, then the rule is just
applied again.
ex − 1 ex 1
lim = lim = ;
x→0 sin 2x x→0 2 cos 2x 2
tan x − x sec2 x − 1 1 ( tan x )2 1
lim 3
= lim 2
= lim = ;
x→0 x x→0 3x x→0 3 x 3
x2 4x 8
lim = lim x/2 = lim x/2 = 0.
x→∞ ex/2 x→∞ e x→∞ e

For the middle limit, we used the fact that lim sin x/x = 1; see Section 1.17.
x→0
Limits that result in the forms 0/0 or ∞/∞ are known as indeterminate forms. This term is used
because the value of the limit is not determined by the form; the form gives no indication whatsoever as to
the value of the limit. L’Hôpital’s Rule can be used to determine the value of a limit of an indeterminate
form of the type 0/0 or ∞/∞. Other indeterminate forms include ∞ − ∞, 0 · ∞, 1∞ , ∞0 , and 00 . L’Hôpital’s
1.25 L’Hôpital’s Rule 57

Rule cannot be used on these forms unless some algebra is first performed to convert them to a 0/0 or ∞/∞
form. For example,
1 1
(
1 1 ) ln x − x + 1 −1 − 2 1
lim − = lim = lim x = lim x =− .
x→1 x − 1 ln x x→1 (x − 1) ln x x→1 1 x→1 1 1 2
1 − + ln x +
x x2 x
(The first limit has the form ∞ − ∞, while the second and third limits have the form 0/0.) For indeterminate
forms that involve exponents, it is first necessary to find the limit of the natural logarithm of the function.
( )1/x2
To illustrate, suppose we are asked to find lim cos x . This limit is of the form 1∞ . We first use the
x→0
properties of logarithms to evaluate
(( ( )
)1/x2 ) ln cos x − tan x − sec2 x 1
lim ln cos x = lim = lim = lim =− .
x→0 x→0 x2 x→0 2x x→0 2 2
( )1/x2
The solution to the original limit is thus lim cos x = e−1/2 . (We were asked to find a limit L and, after
x→0
some effort, we found that ln L = −1/2. It follows that L = e−1/2 .)
The symbols 0/0, ∞ − ∞, etc. are meaningless since they represent indeterminate forms. Consequently,
they should not be used in equations. Do not write equations such as

f (x) 0 ( )
lim = , lim f (x) − g(x) = ∞ − ∞, lim f (x)g(x) = 1∞ .
x→c g(x) 0 x→c x→c

You should simply make a note of the type of indeterminate form that appears, then continue with the
problem.

Exercises
1. Evaluate each of the following limits.
x2 + 4x − 5 x5 − 32 x47 + 1
a) lim b) lim c) lim
x→1 x3 + x2 + x − 3 x→2 x3 + 2x2 − 6x − 4 x→−1 x5 + x4 − 2x − 2
tan x e −x−1
x
cos x − cos(3x)
d) lim e) lim f ) lim
x→0 ex − 1 x→0 x2 x→0 x2
x + arctan(5x) ln x x3
g) lim h) lim √ i) lim
x→0 2x + arctan x x→∞ 4 x x→∞ ex

2. Evaluate each of the following limits.


√ ( )
a) lim (x2 + 3x) csc x b) lim x ln x c) lim x − xe−2/x
x→0 x→0+ x→∞
( )1/x ( )1/x ( )x
x
d) lim 1 + sin 2x e) lim e2x + x2 f ) lim
x→0 x→∞ x→∞ x−2

3. Let r be a positive constant. Evaluate each of the following limits.


tan(r2 x) ( )1/x
a) lim b) lim xr ln x c) lim 1 + rx
x→0 sin(rx) x→0+ x→0

r
( )x
ln x x r
d) lim e) lim f ) lim 1 +
x→∞ xr x→∞ ex x→∞ x
√ √
2a3 x − x4 − a a2 x
3

4. Let a be a positive constant. Evaluate lim √ , which is one of L’Hôpital’s original examples.
a − ax3
x→a 4
58 Chapter 1 The Derivative

1.26 The Mean Value Theorem

The Mean Value Theorem is an extremely important theorem in differential calculus. It is a simple result
with far reaching consequences. In fact, it is used to prove most of the useful properties of the derivative.

THEOREM 1.24 Mean Value Theorem If f is continuous on [a, b] and differentiable on (a, b), then
f (b) − f (a) ′
there exists a point c ∈ (a, b) such that f (c) = .
b−a
The graph below provides a geometric interpretation for the Mean Value Theorem. The difference
quotient (f (b) − f (a))/(b − a) represents the slope of the secant line connecting the points (a, f (a)) and
(b, f (b)) on the graph of f , and the number f ′ (c) represents the slope of the tangent line to the graph of f at
(c, f (c)). The conclusion of the Mean Value Theorem states that there exists a point in the interval where
the tangent line is parallel to the secant line.
y
........
... ..
y = f (x)
..
...
f (b) − f (a)
...
.. secant line with slope
f (b) ...
. . b−a
..... .
...
......
..... .
.......
......
. . ....
......
.....
......
..
. . . .....
......
tangent line with slope f ′ (c)
.
......
.
....

...
................

....
..... ......
.....
.. ...
...
......
. . . ...
..
...........................................................
.
..
...... ....
f (a) . .
.....
......
......
.....
........... x
a c b
The slope of the secant line can be interpreted as the average rate of change of the function f on the interval
[a, b]. It is from this perspective that the adjective “mean” appears in the Mean Value Theorem; there exists
a point c in the interval at which the instantaneous rate of change of f at c equals the average rate of change
of f on the interval [a, b].
The Mean Value Theorem is a consequence of the following special case of the theorem in which the
secant line is assumed to be horizontal. The details of the proof are requested in Exercise 10.

THEOREM 1.25 Rolle’s Theorem If f is continuous on [a, b], differentiable on (a, b), and f (a) =
f (b), then there exists a point c ∈ (a, b) such that f ′ (c) = 0.

Proof. By the Extreme Value Theorem, the function f has a maximum output and a minimum output
on the interval [a, b]. Unless f (x) = f (a) for all x ∈ [a, b] (in which case the conclusion is obvious), at least
one of these extreme outputs occurs at a point c ∈ (a, b). Since f is differentiable on (a, b), it follows from
Theorem 1.15 that f ′ (c) = 0.

To illustrate the conclusion of the Mean Value Theorem, consider the function f (x) = x3 − 2x2 on the
interval [1, 4]. Since this function is a polynomial, it is continuous and differentiable on [1, 4]. (Always make
certain your function satisfies the continuity and differentiability conditions on the given interval.) Therefore,
the Mean Value Theorem guarantees a point c ∈ (1, 4) for which
f (4) − f (1) 32 − (−1)
f ′ (c) = ⇔ 3c2 − 4c = = 11.
4−1 4−1
1.26 The Mean Value Theorem 59

( √ ) ( √ )
The quadratic formula yields c = 2 ± 37 /3. Since c ∈ (1, 4), we find that c = 2 + 37 /3.
As a second example, consider the function g(x) = 1/x2 on the generic interval [a, b] with a > 0. The
point c guaranteed by the Mean Value Theorem satisfies

1 1 √
2 − 2 2 a+b 2 2
b 2 a 3 2a b
− 3 = ⇒ 3 = 2 2 ⇒ c= .
c b−a c a b a+b

The formula for c represents a different way to obtain a mean value for two positive numbers a and b. (Two

familiar ways are the arithmetic mean (a + b)/2 and the geometric mean ab.)

Exercises
1. For the given function and interval, find the point c guaranteed by the Mean Value Theorem.
√ √
a) f (x) = x, [1, 9] b) g(x) = x 8 − x, [0, 8] c) h(x) = 3x2 − 5x + 1, [1, 4]
x
d) F (x) = , [1, 4] e) G(x) = arctan x, [0, 1] f ) H(x) = x3 − x2 + 4x, [0, 2]
x+2
2. Consider the function f (x) = |x − 3| on the interval [1, 5]. Which of the hypotheses of Rolle’s Theorem are
satisfied? Is there a point c that satisfies the conclusion of Rolle’s Theorem? Explain your answers.
3. Consider the function f (x) = x2/3 on the interval [−1, 27]. Which of the hypotheses of the Mean Value Theorem
are satisfied? Is there a point c that satisfies the conclusion of this theorem? Explain your answers.
4. Let a and b be constants with a < b. Find the point that is guaranteed by the Mean Value Theorem for the
function f that is defined by f (x) = (x − a)2 (b − x)4 on the interval [a, b].
5. Let a and b be constants with a < b, and let r and s be positive constants. Find the point that is guaranteed
by the Mean Value Theorem for the function f that is defined by f (x) = (x − a)r (b − x)s on the interval [a, b].
6. Find the point c guaranteed by the Mean Value Theorem for the function f (x) = x2 on an arbitrary interval
[a, b]. Simplify the value of c and note something interesting about it.
7. Find the point c guaranteed by the Mean Value Theorem for the function f (x) = 1/x on an arbitrary interval
[a, b], where a > 0. Simplify the value of c and note something interesting about it.
8. Let f be a differentiable function defined on [a, b], let A, B, and C be constants with A ̸= 0, and consider the
function g defined by g(x) = Af (x) + Bx + C. Show that a point c guaranteed by the Mean Value Theorem for
the function g on [a, b] is also a point guaranteed by this theorem for f .
9. Let f be a differentiable function. Suppose that the equation f (x) = 0 has n distinct solutions, where n > 1 is
a positive integer. Prove that the equation f ′ (x) = 0 has at least n − 1 solutions.
10. Prove the Mean Value Theorem. Start by defining a function g by
f (b) − f (a)
g(x) = f (x) − (x − a) − f (a),
b−a
and showing that g satisfies the hypotheses of Rolle’s Theorem. Then show that the point c guaranteed by
Rolle’s Theorem for g is the point c required for f . Also, explain what the function g represents in geometric
terms.
60 Chapter 1 The Derivative

1.27 Applications of the Mean Value Theorem

The Mean Value Theorem is used to prove many of the useful properties of the derivative. One such property
and its simple corollary, both of which we have already been assuming to be true, is the following result.

THEOREM 1.26 Monotonicity Theorem Let f be continuous on [a, b] and differentiable on (a, b).

a) If f ′ ≥ 0 on (a, b), then f is increasing on [a, b].

b) If f ′ ≤ 0 on (a, b), then f is decreasing on [a, b].

c) If f ′ = 0 on (a, b), then f is constant on [a, b].

Proof. We will prove part (a); the proofs of parts (b) and (c) are similar. Suppose that a ≤ u < v ≤ b.
Applying the Mean Value Theorem to the function f on the interval [u, v], there exists a point c ∈ (u, v)
such that f (v) − f (u) = f ′ (c)(v − u). Since f ′ (c) ≥ 0 and v − u > 0, it follows that f (v) ≥ f (u). Hence, the
function f is increasing on [a, b].

COROLLARY 1.27 Let f and g be two differentiable functions defined on [a, b]. If f ′ = g ′ on [a, b], then
there exists a constant k such that f (x) = g(x) + k for all x ∈ [a, b].

As an example of the theorem, let f be the function defined by f (x) = 2x + sin x. Since f ′ (x) =
2 + cos x > 0 for all x ∈ R, the function f is increasing on every interval [a, b] and thus increasing on R.
As for the corollary, suppose we need to find a function g such that g ′ (x) = 3x2 for all x. It is clear that
both g(x) = x3 and g(x) = x3 + 4 will work. Is there another, possibly very complicated and/or unusual
function, whose derivative is also 3x2 ? The answer is no; Corollary 1.27 guarantees that every function
whose derivative is 3x2 for all values of x is of the form x3 + k for some constant k.
Theorem 1.26 appears to be an almost trivial result, especially if one is encouraged to visualize a graph
as resembling its tangent lines. However, it is sometimes the case that seemingly obvious results are not
all that easy to prove; this is just one example. By the way, it is possible for a function to have a positive
derivative at a point without being an increasing function in an open interval containing that point. The
function f defined by {
x/2 + x2 sin(1/x), if x ̸= 0;
f (x) =
0, if x = 0;
satisfies f ′ (0) = 1/2, but f is not increasing on any open interval that contains 0. The reader should try to
obtain a graph of this function using a calculator.
The Monotonicity Theorem can be used to prove inequalities. Suppose a differentiable function h satisfies
h(a) = 0 and h′ (x) > 0 for all x > a. Since h is increasing for x > a, it follows that h(x) > h(a) = 0 for
all x > a. As an example, suppose we would like to prove that ex > 1 + x for all x > 0. Define a function
h by h(x) = ex − (x + 1). Then h(0) = 0 and h′ (x) = ex − 1 > 0 for all x > 0. It follows that h(x) > 0 or
ex > x + 1 for all x > 0. The following theorem provides a more theoretical example.
1.27 Applications of the Mean Value Theorem 61

THEOREM 1.28 Let f be a differentiable function defined on an interval I. If f ′ is increasing on I,


then the graph of f lies above its tangent lines on I.

Proof. Let c ∈ I. The function Tc that represents the tangent line to y = f (x) when x = c is defined by
Tc (x) = f ′ (c)(x − c) + f (c). We must show that f (x) ≥ Tc (x) for all x ∈ I (see the figure).
y
.......
... ..
.. y = f (x)
...
.
. . ................ y = Tc (x)
...
.........
... ...
...
......
.. ....
..
.
.....................................................
...
.

. .....
.....
......
.
..
......
.. .....
......
..... ........... x
......
......
c

Let x ∈ I and assume that x > c; the case in which x < c is similar. By the Mean Value Theorem, there
exists zx ∈ (c, x) such that f (x) − f (c) = f ′ (zx )(x − c). Since f ′ is increasing on I and zx > c, we find that
f ′ (zx ) ≥ f ′ (c). Using the fact that x − c > 0,

f (x) = f ′ (zx )(x − c) + f (c) ≥ f ′ (c)(x − c) + f (c) = Tc (x),

as desired. This completes the proof.

Exercises
1. Prove that the given function is increasing on every interval [a, b] and thus increasing on R.
a) f (x) = 2x3 + 10x − 18 b) g(x) = x5 − 5x3 + 30x c) h(x) = 8x − 3 cos 2x
2. Find the most general function whose derivative is given.
x
a) f ′ (x) = x2 + x + 1 b) g ′ (x) = 2 c) h′ (x) = 8 sin 2x
x +4
3. Prove part (b) of Theorem 1.26.
4. Prove part (c) of Theorem 1.26, then use it to prove Corollary 1.27.
5. Prove that each of the following inequalities is valid for x > 0.
√ x 1 2
a) sin x < x b) 1 + x < 1 + c) ex > 1 + x + x
2 2
6. Write a version of Theorem 1.28 that is valid when f ′ is decreasing on I.
7. Use Theorem 1.28, or the corresponding version discussed in the previous exercise, to prove each of the following
inequalities is valid for all x > 0.
a) 3x4/3 ≥ 4x − 1 b) 5x6/5 ≥ 6x − 1 c) ln x ≤ x − 1

8. During a 15 minute interval, a car started with a velocity of 35 mi/hr and ended with a velocity of 75 mi/hr.
Show that at some time during this interval, the acceleration of the car was 160 mi/hr2 .
9. Suppose that f (0) = 0 and that f ′ is positive on (−2, 5). Prove that f is negative on (−2, 0) and positive on
(0, 5).
10. Prove that x4 + 6x + c has at most two real roots no matter what value the constant c has.
62 Chapter 1 The Derivative

1.28 Concavity and Inflection Points

The derivative of a function f is another function f ′ . It is thus possible to find the derivative (f ′ )′ of f ′ . This
function is usually abbreviated f ′′ and referred to as the second derivative of f . Since f ′′ is also a function,
its derivative (f ′′ )′ = f ′′′ can be computed. This process can be repeated indefinitely; the functions that are
obtained in this way are known as the higher derivatives of f . Since the prime notation becomes awkward
when there are many primes, the symbol f (n) is used to denote the n’th derivative of f . If y = f (x), then
dn y
the n’th derivative of y with respect to x is also denoted by .
dxn
Since f (n) is the derivative of f (n−1) , it gives the slope of the graph y = f (n−1) (x), but the more relevant
question is whether or not f (n) gives any useful information about the function f . A physical interpretation
of higher derivatives that applies to the motion of an object will be considered in the next section. For a
geometric interpretation, consider the two curves sketched below.
y y
...... ......
... ..
.. ... ..
.....
.
. ...
. .
...
......... f and g are increasing;
y = f (x) .. y = g(x).... f ′ and g ′ are positive;
.. ..
.
.........
. ...... f ′ is increasing;
.. .
............ .... g ′ is decreasing.
..........
.... ..........
..... x ..
... x

The function f is increasing at an increasing rate, while the function g is increasing at a decreasing rate. To
distinguish between functions such as these, we make the following definition.

DEFINITION 1.29 Let f be a differentiable function defined on an open interval I. The function f is
concave up on I if f ′ is increasing on I and concave down on I if f ′ is decreasing on I.

Since a positive derivative indicates an increasing function and a negative derivative indicates a decreasing
function, the following theorem holds.

THEOREM 1.30 Suppose that f is twice differentiable on an open interval I.

a) If f ′′ is positive on I, then f is concave up on I.

b) If f ′′ is negative on I, then f is concave down on I.

Hence, the second derivative of a function f provides information about the concavity of f . There
is a geometric interpretation of the third derivative, but it will not be discussed here as it is much more
complicated. The other higher derivatives do not have a geometric interpretation for f , but they do reveal
some useful information about a function. This will become clear when power series are discussed later in
the book.
While sketching a graph with varying concavity, one can “feel” a turning point when the concavity
changes. These points, known as inflection points, are not as dramatic as relative extreme points but they
are important points on a graph. The point (c, f (c)) is an inflection point for the function f sketched below.
By definition, an inflection point is a point on the graph of a continuous function at which the concavity
1.28 Concavity and Inflection Points 63

changes.
y ..
......
.. ..
.
....... ..
....... .......... ...
.. .... .. The graph is concave down on (−∞, c]
.
... f (c) .•...... ... y = f (x) and concave up on [c, ∞).
....
.. ...... ......
... ........
...
..
..........
..... x
c

Possible inflection points for the graph of y = f (x) occur at those values of x in the domain of f for which
either f ′′ does not exist or f ′′ = 0. However, a function may not have an inflection point at a point with
this property. For example, the function f (x) = x4 satisfies f ′′ (0) = 0, but the function does not have an
inflection point at (0, 0). It is necessary to check that the concavity actually changes at the point in question,
that is, make certain that the second derivative changes sign at the point.

Problem: Determine the intervals on which the function f defined by f (x) = xe−x is concave up and those
on which it is concave down. Find the (x, y) coordinates of any inflection points.

Solution: Using the product rule to find f ′ and f ′′ yields f ′ (x) = (1 − x)e−x and f ′′ (x) = (x − 2)e−x . It is
clear that f ′′ is defined for all x and that f ′′ (x) = 0 only when x = 2. On the interval (−∞, 2), we see that
f ′′ is negative, while f ′′ is positive on the interval (2, ∞). Hence, the function f is concave up on [2, ∞) and
concave down on (−∞, 2]. The point (2, 2/e2 ) is an inflection point.

Exercises
1. Find and simplify the second derivative of the given function. Be careful computing the second derivative.

a) f (x) = (x2 + x + 1)10 b) g(x) = x2 + 4 c) h(x) = sin3 x

2. Find a general formula for f (n) (x). Hint: Find four or more derivatives, then look for a pattern.
a) f (x) = e2x b) f (x) = xex c) f (x) = 1/x

3. Find f (102) for the function f (x) = cos x. Do not take 102 derivatives!
4. For the given function, find the intervals on which the function is concave up and those on which it is concave
down. Find the (x, y) coordinates of any inflection points.
1
a) f (x) = x2 − x3 b) f (x) = x2 − c) f (x) = x4 − 6x3 + 12x2 + 4x
x
8 ln x 2
d) f (x) = e) f (x) = f ) f (x) = e−x /4
x2 + 1 x

5. On what percentage (of the total length) of the interval [0, π] is the function f (x) = x2 + sin 2x concave down?
6. Let f be a twice differentiable function and suppose that the equation f (x) = 0 has 3 distinct solutions. Prove
that the equation f ′′ (x) = 0 has at least 1 solution. What can you say if f (x) = 0 has 7 distinct solutions?
7. Find a polynomial P of degree 3 so that P has an inflection point at (2, 4) and a y-intercept of 12.
8. Discuss the relationship between concavity and Theorem 1.28.
64 Chapter 1 The Derivative

1.29 Velocity and Acceleration

Suppose that a particle is moving along a straight line and that its distance s from a fixed point O on the
line at time t is given by the function s(t). Then its velocity v, acceleration a, and jerk j are given by
ds
v= , rate of change of position with respect to time (velocity);
dt
dv d2 s
a= = 2, rate of change of velocity with respect to time (acceleration);
dt dt
da d3 s
j= = 3, rate of change of acceleration with respect to time (jerk).
dt dt
Since there are two different directions to travel from the point O, these quantities may be either positive
or negative; positive for one direction and negative for the other. On occasion, only the speed of a particle
is important—this quantity is defined to be |v|.
Given a position function s, it is easy to find the functions v, a, and j simply by taking derivatives.
In a typical application, however, information about the acceleration or velocity is given and the position
function must be determined. This means that the derivative or second derivative of a function is known
and it is necessary to find the function. The process of finding a function from its derivative is called
antidifferentiation; an introduction to this process was presented in Section 1.21. Suppose that the
acceleration function a of an object is known. Then an antiderivative gives the velocity function v and
another antiderivative gives the position function s. The constants that arise in the antidifferentiation
process can usually be found from further information about the motion of the object. To illustrate, suppose
that the position function s(t) must be determined from

a(t) = 12t, v(0) = 20, and s(1) = 8.

Since an antiderivative of 12t is 6t2 + C, we find that v(t) = 6t2 + 20. (We choose the constant so that
v(0) = 20.) To find the position function s, we take another antiderivative. An antiderivative of 6t2 + 20
is 2t3 + 20t + C. The fact that s(1) = 8 reveals that C = −14. Hence, the position function s is given by
s(t) = 2t3 + 20t − 14.
For an applied example, suppose that a ball is thrown straight upward from a height of 6 feet with
an initial velocity of 80 feet per second. Near the surface of the earth, the acceleration due to gravity is
essentially constant and this constant is denoted by g. (The value of g is approximately 9.8 m/sec2 or 32
ft/sec2 .) Adopting the convention that up is positive, down is negative, and ground level is 0, the acceleration
function of the ball is a(t) = −32. (This value of g is chosen because of the units given in the problem.)
Hence (ignoring any effect of air resistance),

v(t) = −32t + c1 , v(0) = 80 ⇒ c1 = 80;

s(t) = −16t2 + 80t + c2 , s(0) = 6 ⇒ c2 = 6.

Thus, the height of the ball (in feet) at any time t (seconds) is given by s(t) = −16t2 + 80t + 6. This function
can then be used to determine (a) the maximum height of the ball, (b) the length of time the ball is in the
air, and (c) the speed with which the ball hits the ground.
1.29 Velocity and Acceleration 65

To answer (a), note that the maximum height of the ball occurs at time t1 , where v(t1 ) = 0. (The ball
has an instantaneous velocity of 0 when it is at its peak.) Solving v(t) = 0 gives t1 = 2.5. The maximum
height of the ball is thus s(2.5) = 106 feet. The ball is on the ground when s(t) = 0. Using the quadratic
formula, √ √
40 ±
1696 10 ± 106
s(t) = 0 ⇒ 8t − 40t − 3 = 0 ⇒ t =
2
= .
16 4
( √ )
Since t > 0, we find that the ball hits the ground after 14 10 + 106 ≈ 5.074 seconds. This is the length of
time the ball is in the air. Finally, the solution to (c) is simply
(1( √ )) √

v 10 + 106 = 8 106 ≈ 82.365.
4

Therefore, the impact speed of the ball is approximately 82.365 ft/sec.

Exercises
1. Let v(t) and a(t) represent the velocity and acceleration functions of some particle and suppose that a(t1 ) < 0
and v(t1 ) < 0 for some time t1 . Is the speed of the particle increasing or decreasing at t1 ?

2. Let s(t) = t2 + 11 denote the position function of a particle. Find the velocity and acceleration of the particle
when t = 5.
3. Find the position function s(t) from the given information.
a) a(t) = 50, v(0) = 10, s(0) = 20
b) a(t) = 2t + 1, v(0) = −7, s(0) = 4

c) a(t) = 3/ t + 4, v(0) = 2, s(5) = 14
d) j(t) = 48t, a(0) = 0, v(0) = 10, s(0) = 0
4. A toy rocket is shot straight upward from the ground with an initial velocity of 185 feet per second. How high
does it go and how long is it in the air?
5. A ball thrown upward from the ground reaches a maximum height of 225 feet. What was its initial velocity in
miles per hour?
6. Suppose that on earth you can throw a ball straight up to a height of 180 feet. If you went to a planet where
the acceleration due to gravity is 10 ft/sec2 , how high could you throw the ball?
7. The Sears Tower in Chicago has a height of 1450 feet. How long does it take for an object dropped from the
top of this building to hit the ground? How fast is it going when it hits the ground?
8. A certain car is able to brake with a deceleration of 4.92 m/sec2 . How long does such a car take to come to a
stop if it is initially traveling at 24.6 m/sec? What is the distance traveled during the braking process?
9. The head of a rattlesnake can accelerate 50 m/sec2 in striking a victim. If a car could do as well, how long
would it take for it to reach a speed of 60 mi/hr from rest? (1 meter ≈ 3.28 feet)
10. Raindrops fall to earth from a cloud 5000 ft above the earth’s surface. If they were not slowed by air resistance,
how fast would the drops be moving when they struck the ground?
11. A ball thrown straight up from a height of 2 m takes 2.25 sec to reach a height of 36.8 m. What is its speed at
this height? What was its initial speed? How much higher will the ball go?
12. A hot air balloon is ascending at the rate of 12 m/sec at a height 80 m above the ground when a brick is dropped.
How long does it take for the brick to reach the ground? With what speed does it hit the ground?
13. From what height would a car have to be dropped in order for it to hit the ground at 60 mi/hr?
t2 12
14. Suppose that the position of a particle is given by s(t) = − 2 + 15, where t is measured in seconds and s(t)
2 t
is measured in meters. Find the minimum velocity of this particle for 2 ≤ t ≤ 10.
15. Suppose that a particle is moving in a straight line with a velocity of 120 ft/sec. At time t = 0, the particle
begins to decelerate at the rate of 6 t ft/sec2 . How far does this particle travel before coming to a stop?
66 Chapter 1 The Derivative

1.30 Polynomial Approximation


Consider the functions f (x) = x2 + 4x − 3, g(x) = x, and h(x) = sin x. Suppose that it is necessary to
evaluate, without a calculator, each of these functions for x = 5.2. Finding f (5.2) is relatively easy, but
finding good approximations for g(5.2) and h(5.2) requires much more effort and a deeper understanding of
these functions. Since calculators are readily available, this may seem like a silly exercise, but consider the
following three observations:

1. It is important to have a historical perspective when it comes to knowledge. For many years,

calculations such as 5.2 and sin 5.2 had to be made without the assistance of calculators and the
methods developed for doing so can provide useful insight.

2. Calculators make evaluations of functions very easy, but it is important to have some understanding
of how calculators arrive at their answers. Since calculators rely on addition and multiplication,
some formulas involving these operations are performed by a calculator when it is requested to
evaluate trigonometric or logarithmic functions. Calculus has provided some of the tools required
for determining how to best approximate these functions in this way.

3. There are instances in applications when it is necessary to replace a function with a simpler func-
tion. For example, in some physics problems, it is necessary to replace sin x with x (recall that
lim sin x/x = 1) in order to make it possible to solve the problem.
x→0

Since polynomials can be evaluated using only additions and multiplications, it is helpful to find polynomial
approximations for functions that are not polynomials.
A moment’s reflection makes it clear that different polynomials are needed to approximate a function
at different points on its graph. Given a function f and a point a, the problem is to find a polynomial of a
certain degree that approximates the function f for values of x near the point a. The linear function ℓ, the
quadratic function q, and the cubic function c that approximate f for values of x near a are

ℓ(x) = f (a) + f ′ (a)(x − a);


1 ′′
q(x) = f (a) + f ′ (a)(x − a) + f (a)(x − a)2 ;
2
1 1
c(x) = f (a) + f ′ (a)(x − a) + f ′′ (a)(x − a)2 + f ′′′ (a)(x − a)3 .
2 6

All three of these functions go through the point (a, f (a)). Note that the linear function ℓ is simply the
tangent line to the graph at the point (a, f (a)); it has the same slope at a as the function f . The quadratic
function q has both the same slope and concavity at a as the function f , that is, q ′ (a) = f ′ (a) and q ′′ (a) =
f ′′ (a). Finally, the cubic function c satisfies c′ (a) = f ′ (a), c′′ (a) = f ′′ (a), and c′′′ (a) = f ′′′ (a).
One nice feature of these polynomial approximations is that each one builds on the previous one. In
general, the higher the degree of the polynomial, the better the polynomial approximates the function f . A
visual way to see this is to have a calculator plot the graphs of the function and its polynomial approximations.
In the figure below, the linear approximation to sin x and the cubic approximation to sin x are shown (for
x near the point 0). The reader should verify that the given polynomials for sin x are determined by the
1.30 Polynomial Approximation 67

formulas presented above.


y y
.
....... .. .
.......
.. ..
.. ...y. = x .. ..

..
.. ..... ...
..
......................... y = sin x ... ...........
.
..... ..... ... ..
........ ....................y = sin x
....
. ... ......... .. .
... .. ...
..... −2 −1 .... .
. ...... x ..... .....−2 −1 ......... 1 2 ..... ............... x
....... ..
. 1 2 ............ . ...
.......................
. .......................... ...y = x − x3 /6
.. ..
.. ....
...
.. ....
....
Note that as the degree of the polynomial increases, the polynomial graphs become closer to the graph of
the function and the portions of the graph that are close expands.

As a numerical example, consider the function f (x) = x at the point a = 9. We then have

f (x) = x1/2 ; f (9) = 3;


1
1 1 ℓ(x) = 3 + (x − 9);
f ′ (x) = x−1/2 ; f ′ (9) = ; 6
2 6 1 1
1 1 q(x) = 3 + (x − 9) − (x − 9)2 ;
f (x) = − x−3/2 ;
′′
f ′′ (9) = − ; 6 216
4 108 1 1 1
3 −5/2 1 c(x) = 3 + (x − 9) − (x − 9)2 + (x − 9)3 .
′′′
f (x) = x ; f ′′′ (9) = ; 6 216 3888
8 648

To get a sense for how these functions approximate x, we find that


ℓ(8.5) ≈ 2.916667, q(8.5) ≈ 2.915509, c(8.5) ≈ 2.915477, 8.5 ≈ 2.915476.

Once again, it is clear that the approximations improve as the degree of the polynomial increases.

Exercises
1. Find the linear, quadratic, and cubic approximations to the function f at the point a.
√ √
a) f (x) = 1/x, a = 1 b) f (x) = x, a = 4 c) f (x) = 3 x, a = 8
d) f (x) = ex , a = 0 e) f (x) = ln x, a = 1 f ) f (x) = arctan x, a = 0
√ √
2. Use the linear, quadratic, and cubic approximations to ex at 0 to approximate 10 e and e. Then use a calculator
to determine the accuracy of each approximation.

3. Use a quadratic approximation to estimate 220. (Choose a to be a perfect square near 220.)

4. Use a linear approximation to estimate 3 220. (Choose a to be a perfect cube near 220.)
5. Determine the formula for the fourth degree polynomial approximation to a function f at a point a.

6. Suppose that f is a function with the property that f ′ (x) = x2 + 4 and f (0) = 3. Estimate f (1/2) with both
a linear and a cubic approximation. (Make no attempt to find f (x).)
68 Chapter 1 The Derivative

1.31 Newton's Method

Determining solutions to equations has been a practical problem for centuries. Ideally, an exact solution
is sought, but when this is not possible, a method for generating an approximate solution to any degree of
accuracy is desirable. It is only in recent years that the availability of calculators has made this problem
quite a bit easier. Consider the following three equations:

(1) 2x2 = 5x − 3; (2) x5 − 4x2 + 2x − 7 = 0; (3) 3 sin x = x.

It is easy to use the quadratic formula to find exact solutions to equation (1). An approximate solution to
(2) can be obtained by trial and error; 1 is too small and 2 is too big, then 1.7 is too small and 1.8 is too
big, etc. The same idea can be tried on (3), but it is much more difficult to evaluate sin x without some
assistance. Of course, in this day and age these equations can be solved using a calculator. The purpose of
this section is to discuss Newton’s method for approximating solutions to equations so that you have some
idea about what goes on behind the scenes when you use a calculator to solve an equation.
Newton’s method provides a way of approximating solutions to an equation of the form h(x) = 0. The
key idea behind this method is that a tangent line to a curve is a good local approximation of the curve.
By looking at a graph of y = h(x) or making some educated guesses, it is possible to find an approximate
solution x0 to the equation. The line tangent to the curve y = h(x) at the point (x0 , h(x0 )) resembles the
graph of y = h(x) for x near x0 . Hence, the x-intercept of this tangent line should be a better approximation
to a solution of h(x) = 0 (see the figure).
y
..
...... ..
.......
.•.....
h(x0 )
.......
.

. x1 = x0 −
y = h(x) ....
.. h′ (x0 )
........
.
.

............ h(x1 )
........ x2 = x1 −
............ h′ (x1 )
........ ..
................... .
..
... ..
...........•. .........
... ....
h(xn )
.... ... xn+1 = xn −
.
. ............. ......... h′ (xn )
.. ... ..
..
...
....... x..2 x..1
. x0
................... x general iteration formula
..........
.........
The x-intercept of the tangent line to the curve at the point (x1 , h(x1 )) should be an even better approxi-
mation to a solution of h(x) = 0. This process can be continued to obtain approximations to any desired
degree of accuracy. To find a formula for the x-intercepts of the tangent lines that appear in this process,
let c be any real number and write down an equation for the line tangent to y = h(x) at (c, h(c)), then find
its x-intercept:
h(c)
y − h(c) = h′ (c)(x − c); y =0 ⇒ x=c− .
h′ (c)
Using this formula, we can obtain x1 from x0 , then x2 from x1 , and so on (see the equations and the iteration
formula to the right of the figure).
1.31 Newton’s Method 69

We will illustrate Newton’s method by approximating the positive root of the polynomial P (x) = x4 −
3x − 8. Since P is continuous and since P (1) = −10 < 0 < 2 = P (2), the Intermediate Value Theorem
guarantees a solution to P (x) = 0 in the interval (1, 2). Since P (2) is closer to 0 than P (1), a good choice
for x0 is 2. The iteration formula and the first few terms it generates are given below.

x1 = 1.93103448276
P (xn )
xn+1 = xn − ′ x2 = 1.92671132259
P (xn )
x4 − 3xn − 8 x3 = 1.92669501894
xn+1 = xn − n 3
4xn − 3
x4 = 1.92669501871

Hence, the number 1.926695 is a good approximation of the desired root. Note that x2 is accurate to four
decimal places and that not much computational effort is required to determine x2 . As in this example,
Newton’s method converges quite quickly in most cases. Typically, when xn and xn+1 agree to d decimal
places, the approximation xn+1 agrees with the actual root of the function to at least d decimal places.

Exercises
1. Use Newton’s method to find a solution to the equation that is accurate to six decimal places.
a) x3 + 4x − 8 = 0 b) e−x = x c) 2 sin x = x
d) x5 = 700 − 4x e) tan x = x, x > π f ) ln x = x − 4, x > 4

2. By applying Newton’s method to the(equation)x2 − a = 0, show that the square root of a can be calculated
1 a
using the iteration formula xn+1 = xn + , a formula used by the ancient Babylonians.
2 xn
a) Give an explanation of this formula that does not involve calculus. Hint: It looks like an average.
b) Use this method to approximate the square roots of 2, 3, 5, and 6 in the following way. Let x0 be 3/2 for
both 2 and 3 and let x0 be 2 for both 5 and 6, then for each case, compute x1 and x2 as rational numbers
(a ratio of integers) without using a calculator (to be in the proper spirit of the endeavor).
c) Use your calculator to compare your answers with the actual square roots. How close are the values?
3. Use Newton’s method to find an iteration process for cube roots that is similar to the one for square roots. Use
your formula to approximate the cube roots of 2 and 3 in the following way. Let x0 = 1 in both cases and
compute x1 and x2 as rational numbers. Compare your answers with the actual cube roots.
4. Show that Newton’s method gives the formula xn+1 = 2xn − ax2n as an iteration formula to find the reciprocal
of a. Does this formula work very well for approximating 1/7? Use x0 = 0.1.
5. Here is an example of a practical problem that leads to an equation that cannot be solved exactly. A chord of
length 4 cuts off an arc of length 5 in a circle. Find the radius of the circle and the central angle determined by
the chord (the angle whose vertex is at the center of the circle and whose sides extend to the ends of the chord).
You may use a calculator for this problem and not bother with Newton’s method.
6. Provide a sketch for a situation in which Newton’s method could fail to find a root. Hint: Look at the denominator
of the iteration formula.
70 Chapter 1 The Derivative

1.32 Implicit Differentiation

Some curves in the plane cannot be represented in the form y = f (x); just imagine a fancy loop de loop
scribble. Such a curve does not represent a function (think about the vertical line test), but the curve
certainly appears to have a tangent line at most points. All the derivative formulas developed so far have
been for curves expressed in the form y = f (x), so a new technique is required for curves that are not or
cannot be expressed in this form.
The most familiar example of a curve that is not the graph of a function is a circle of radius r generated
by the equation x2 + y 2 = r2 . This equation does not define a function since each value of x between −r

and r generates two values of y. This particular equation can be solved for y to obtain y = ± r2 − x2 . The
√ √
equation y = r2 − x2 represents the top half of the circle while the equation y = − r2 − x2 represents the
bottom half of the circle (see the figure).
y y y
.....
.......
.....
.......
√ .....
.......

2 2
....... x + y = r
. ..
2
........ y = r 2 − x2 y = − r2 − x2
.
......... ............. ......... .............
.. ... ... ...... ...
... ... ... ... −r r
... ... ............. . .. ............ ... . ............... x
.. x ... x
... ... −r r ... ..
.
.... . .... .
......
..........................
.. ....... ..
......
..................

Hence, the equation x2 + y 2 = r2 gives rise to two functions and these functions are said to be defined
implicitly by the equation x2 + y 2 = r2 . A function is defined implicitly if some values of x determine one
or more values of y but a formula for y in terms of x is not given explicitly in the form y = f (x).
The technique of implicit differentiation makes it possible to find the derivative of a function that is
defined implicitly. The basic idea is to assume that y is a differentiable function of x, even if it is not possible
to find this function, and differentiate the entire equation with respect to x. For example, differentiating
x2 + y 2 = r2 with respect to x, assuming y is a function of x and r is a constant, gives

dy dy x
2x + 2y =0 or =− .
dx dx y

Note the use of the chain rule when differentiating the term y 2 . The beauty of this method is that it gives
the right answer no matter which function we use for y:
√ √
y = r2 − x2 ; y = − r2 − x2 ;
dy −x x dy x x
=√ =− ; =√ =− .
dx r2 − x2 y dx r 2 − x2 y

In fact, the answer obtained by implicit differentiation is precisely the result given by geometry; if (x, y) is
on the circle, then the radial line has slope y/x and the tangent line has slope −x/y. Implicit differentiation
can also be used to find higher derivatives. In this case,

dy x
2 ( ) y · 1 − x · y+x·
d y d x dx = − y y 2 + x2 r2
2
= − =− 2 2
=− 3
= − 3.
dx dx y y y y y
1.32 Implicit Differentiation 71

Note the insertion of the formula for dy/dx and the use of the original equation x2 + y 2 = r2 .
An interesting curve known as the folium of Descartes is defined by the equation x3 + y 3 = 3axy, where
a is a positive constant. A graph of this curve when a = 4/3 is given below; the line y = −x − a is an
asymptote for the folium of Descartes.
y
....
.......
x3 + y 3 = 4xy
...........
.............
.....
2
.. . ...... ....
....
....
...
...
...
...
...
.. .
............
.....
..... ... . .. . . ...
..... ...
..... ....
..... .....
.. . . . . . .....................
.
.....
..... .
...
......
.......... x
−3 .....
..... 2
.....
.....
..... ...
..... ..
.....
..... .
..... .
..... ..
..... .
..... ..
..... ..
..... .
.......
.......
.......
.......
.......
.....
....
−3 ...
....
...
...
....
..
.....
.....
. y = −x − 4/3

To find dy/dx for this curve, assume that y is a function of x and differentiate the equation x3 + y 3 = 4xy
with respect to x to obtain

dy ( dy ) dy 3x2 − 4y
3x2 + 3y 2 =4 x +y ⇒ = .
dx dx dx 4x − 3y 2

This formula can be used to find the slope of the curve at any point on the curve, but both the x and y
coordinates of the point in question must be known. In particular, at the point (2, 2), the slope is −1.

Exercises
1. The equation 4x2 − y 2 = 1 defines two functions of y implicitly. Solve for these two functions and find dy/dx
for each of them. Then find dy/dx implicitly and compare the results.
2. Use implicit differentiation to find dy/dx.
a) x2 + y 2 = x + 4xy b) x3 + y 3 = 2x + y c) x4 + 3x2 y + y 3 = 10
y x
d) (x2 + y 2 )2 = 9(x2 − y 2 ) e) x3 + xy + y 2 = ey f) + =8
x y+1
3. Find an equation for the line tangent to the curve defined by x4 + xy + y 3 = 11 at the point (1, 2).
4. Consider the folium of Descartes x3 + y 3 = 9xy/2.
a) Find the slope of this curve at the points (1, 2), (2, 1), and (9/4, 9/4).
b) In addition to (2, 1), find two more points on this curve for which the x coordinate is 2.
5. Find an equation for the line normal to the hyperbola y 2 − x2 = 7 at the point (3, 4).
6. For the curve defined by x3 + 4xy 2 + 6y = 8, find an equation for each of the tangent lines to the curve at the
points where the curve intersects the coordinate axes.
7. Find an equation for a tangent line to the curve x2 − y 2 = 5 that passes through the point (1, 1).
8. Find the points (other than the four points where the curve meets the x and y axes) on the curve 2x4 + y 4 = 96
for which the normal line goes through the origin.
d2 y 36
9. For the curve x2 + xy + y 2 = 6, show that =− .
dx2 (x + 2y)3
10. Find the ordered pair (s, t) that satisfies the equation x2 − xy + y 2 = 1 and has the largest possible value for t.
72 Chapter 1 The Derivative

1.33 Related Rates

In a related rates problem, related quantities are changing over time. If the rates at which some quantities
are changing are known, it is usually possible to determine the rates at which the related quantities are
changing. Since rates can be determined using the derivative, calculus plays a role in solving such problems.
For example, suppose that at a given instant the sides of a square are 15 inches long and that all of the
sides are increasing at the rate of 4 inches per minute. To find the rate at which the area of the square is
increasing at this instant, we begin by noting that A = s2 , where s represents the length of the side of the
square in inches and A represents the area of the square in square inches. Since both A and s are implicitly
defined functions of time t (measured in minutes), we can differentiate this equation implicitly with respect
to t to obtain
dA ds dA
= 2s and thus = 2 · 15 · 4 = 120.
dt dt dt s=15
(Note carefully the difference in notation between a general derivative and a derivative evaluated at a certain
instant.) It follows that the area of the square is increasing at a rate of 120 in2 /min. (Pay attention to units
while working these problems.) A more detailed and involved example is given in the following problem.

Problem: An inverted cone with a diameter of 4 meters and a height of 6 meters is being filled with water
at a rate of 1 cubic meter per minute. At what rate is the depth of water in the cone changing when the
water is 4 meters deep?

Solution: We are given the rate at which volume changes and are asked to determine the rate at which
depth changes. It is thus necessary to find an equation that relates these two quantities. Let V be the
volume (in cubic meters) of water in the cone, let d be the depth (in meters) of water in the cone, and let
t be the elapsed time (in minutes). Then dV /dt = 1 and we must find dd/dt when d = 4. Using similar
triangles and the formula for the volume of a cone (see the figure), we find that V = πd3 /27. Both V and d
are functions of t, but they are not explicitly given. Differentiating the equation implicitly with respect to
t, then substituting the specific values of the variables yields the computations to the right of the figure.
...................................
........... ......
.. ..... ...
... ..............................
..............................2
...... ...
. .. π 2
................. ...... ...............
...
... . . . V = x d
... ............................... .. 3
... ...
... .. π 3
... ...
... .. V =
27
d
d x ... ....
... ..
= ... ...
...
. . dV π dd
6 2 ... .... . = d2
d ... 6 .......... ... dt 9 dt
... ... ........ ... ..
dd
x=
3 ........... ............x........................ π
..
.................................................
..... 1 = 42
... ...... .. 9 dt d=4
...d....... ..
... ...... .. dd 9
=
............ dt d=4 16π
....
Thus, the depth of water is increasing at a rate of 9/(16π) ≈ 0.179 meters per minute when the depth is 4
meters. It is important to note that specific values of the variables are not substituted until derivatives are
taken.
1.33 Related Rates 73

Some general guidelines that are useful for solving related rates problems are given below.

1. Read the problem carefully, draw a picture if it is helpful, and understand what is being asked.

2. Determine the quantities that vary with time and assign variables to them. Be careful not to
mistake constants given in the problem with values that depend on time. Determine the rates that
are known and those that are to be found.

3. Find a relationship between the quantities whose rates are known and those whose rates are to be
determined. This step may require some extra effort.

4. Use implicit differentiation to differentiate the equation relating the quantities with respect to time.
This will give a relationship involving the rates of change of the relevant quantities.

5. Substitute the given values of the variables and the known rates into the equation from (4) and
solve for the unknown rate. Be certain to obtain the correct units for the rate.

Exercises
1. The radius of a circle is increasing at the rate of 3 cm/sec. At what rate is the area of the circle increasing when
the radius is 20 cm?
2. The sides of an equilateral triangle are increasing at the rate of 1 cm/sec. At what rate is the area of the triangle
increasing when the sides are 4 cm?
3. Gas is pumped into a spherical balloon at the rate of 1 ft3 /min. How fast is the diameter of the balloon increasing
when the balloon contains 36π ft3 of gas?
4. One airplane flew west over an airport at 300 km/hr. Ten minutes later another airplane flew south over the
airport at 230 km/hr. Assuming that the airplanes were at the same altitude, determine the rate at which they
were separating 20 minutes after the second airplane flew over.
5. A balloon leaves the ground 400 meters from a person standing at ground level. When the balloon is at a height
of 200 meters, it is rising at the rate of 20 m/min.
a) How fast is the person’s angle of observation increasing at this instant?
b) How fast is the distance between the person and the balloon increasing at this instant?
6. A upright cone (vertex up) with a radius of 4 m and height of 10 m is being filled with water at a rate of 2.5
m3 /min. How fast is the water level rising when the water is 6 m deep?
7. The minute hand on a clock is 4 inches long and the hour hand is 2 inches long. At what rate is the distance
between the tips of the hands changing at 3: 00? What is the rate at 3: 40?
8. Suppose the surface area of a sphere is decreasing at a rate of 30 cm2 /min when its radius is 2 m. At what rate
is the volume of the sphere decreasing at this instant?
9. A lighthouse is 2 km away from a straight shoreline and its light makes 3 revolutions per minute. How fast is
the beam of light moving along the shoreline when the light is 0.5 km from the nearest point on the shoreline?
10. Sand is leaking out of an elevated bin and falling into a conical pile at the rate of two cubic feet per minute. The
radius of the base of the cone is always three times the height of the cone. At what rate is the circumference of
the base increasing when the height of the pile is four feet?
74 Chapter 1 The Derivative

1.34 Curvature

Consider the graph of a smooth function f . In addition to having a different slope at each point on the
graph, the graph also has a different curvature (or amount of bend) at each point. It is both useful and
interesting to find a numerical value for the curvature of a curve. For instance, the acceleration of a car
moving along a road with constant speed is related to how much the road bends.
It should be clear that the curvature of a circle is the same at each point on the circle, and that the
curvature of a circle decreases as the radius increases. Since a circle of radius r turns through an angle of
2π radians over a distance of 2πr units, the curvature is defined to be 1/r. The curvature of a general curve
will vary from point to point along the curve. Recall that the slope of a curve at a point is defined to be
the slope of the line that best approximates the curve at that point. Similarly, the curvature of a curve at
a point is defined to be the curvature of the circle that best approximates the curve at that point (see the
figure).
y .
y .
....... .......
.. .. ............................. ..
................
. .......
.. ......
...... .....
..... .....
.... ...
... ...
... ...
... ...
.. y = f (x) .. y = f (x)
.
... ... The radius of the circle that
... . ...
.... ... .
...
...
... . .... ... • . ...
best approximates the curve
... .................... ... ..
..
... ..
...
... ... . .
....
... ....... ........ ... ...
... ... ..
. ...
...
.
varies from point to point.
.......... • ....... ....
...............•..
... ...
... ..
......................................... .....
..... ..
.....
...
.
• ...........
..... x
......
.......
............
.....
........... x
..........................
........
.....

There are several ways to find the approximating circle; one method is presented in the next paragraph and
another is outlined in the exercises.
Consider the graph y = f (x) of a twice differentiable function f and suppose that (a, f (a)) is a point
on this graph. The circle that best approximates f at (a, f (a)) goes through the point (a, f (a)) and has the
same slope and concavity at a as the function f . Let the equation of the circle be (x−h)2 +(y −k)2 = r2 . The
goal is to find values for h, k, and r so that the circle has the desired properties. Using implicit differentiation
on the equation (x − h)2 + (y − k)2 = r2 , we obtain

dy x−h d2 y r2
=− and =− .
dx y−k dx 2 (y − k)3

In order for the circle to best approximate f at a, the three unknowns h, k, and r must satisfy the equations:

( )2 a−h r2
(a − h)2 + f (a) − k = r2 , f ′ (a) = − , f ′′ (a) = − ( )3 .
f (a) − k f (a) − k

The first equation indicates that the point (a, f (a)) is on the circle, the second guarantees that the circle
and the curve have the same slope at a, and the third gives the circle and the curve the same concavity at
a. These three equations in the three unknowns h, k, and r look rather intimidating to solve. However, if
we let A = h − a, B = k − f (a), c = f ′ (a), and d = f ′′ (a), the equations become

A2 + B 2 = r2 , A = −cB, r2 = dB 3 ,
1.34 Curvature 75

which look much more manageable. We must solve for A, B, and r in terms of c and d. Substituting the
second and third equations into the first equation yields B = (c2 + 1)/d. It then follows fairly easily that
( )3
f ′ (a) ( ) 1 + (f ′ (a))2 1 + (f ′ (a))2
h = a − ′′ 1 + (f ′ (a))2 , k = f (a) + , and r =2
.
f (a) f ′′ (a) (f ′′ (a))2

If f ′′ (a) = 0 (which corresponds to a possible inflection point), then the circle has infinite radius, which
means that the curve is essentially a straight line at the point (a, f (a)).
The radius of the approximating circle is called the radius of curvature of the function and is usually
denoted by ρ. The curvature of a function, which is the reciprocal of the radius of curvature, is usually
denoted by κ. Thus, the radius of curvature ρ and curvature κ of a twice differentiable function f at (x, f (x))
are given by
( )3/2
1 + (f ′ (x))2 f ′′ (x)
ρ(x) = and κ(x) = ( )3/2 .
|f ′′ (x)| 1 + (f ′ (x))2
Note that the numerator for κ(x) is f ′′ (x) rather than |f ′′ (x)|; this makes it possible to distinguish between
curves that are concave up (positive curvature) and those that are concave down (negative curvature). The
center of the circle that best approximates the curve y = f (x) at a point, which is called the center of
curvature, can be found using the formulas for h and k derived in the previous paragraph.

Exercises
1. For each curve, find the curvature at the indicated points.
a) y = x2 , x = 0, 1 b) y = x3 , x = 0, 1 c) y = x4 , x = 0, 1
2
d) y = x4 − 2x2 , x = 1, 2 e) y = e−x , x = 0, 1 f ) y = ln | cos x|, x = 0, π/4

2. For the curve y = x4 − 2x2 , sketch the curve and its curvature function on the same graph.
3. For the curve y = x3 − 3x, find the center of curvature when x = 2.
4. For the curve y = x2 , find the center of curvature when x = a.

5. For y = arccos(1 − x) − 2x − x2 , find a general formula for κ(x). (After some algebra, it is quite simple.)
6. Find the maximum curvature of the curve y = ex .
7. Let P be a polynomial of degree n ≥ 2 and let κ be the curvature of P . Prove that lim κ(x) = 0 and
x→∞
lim κ(x) = 0.
x→−∞

8. Consider the function f (x) = −c ln(1 − x2 ), where c is a positive constant. Sketch the graph of the curvature of
this function for c = 0.5 and c = 0.8. You should find that the graphs look rather different on the interval (−1, 1).
Find the value of c for which the shape of the curvature graph changes. (The exact value is not complicated,
but if you cannot find it, at least find an approximation.)
9. Here is a different approach to finding the center of curvature of a twice differentiable function f at a point a.
For each b ̸= a, let (ub , vb ) be the point of intersection of the normal lines through (a, f (a)) and (b, f (b)). Use
properties of circles to explain why it is reasonable to expect that lim (ub , vb ) is the center of curvature (h, k) of
b→a
f at a. Show that
F (b) − F (a) x 1
ub = , where F (x) = + f (x) and G(x) = ,
G(b) − G(a) f ′ (x) f ′ (x)
and thus h = lim ub = F ′ (a)/G′ (a). Then find the value of k and show that these values are the same as those
b→a
found by the method discussed in this section.
76 Chapter 1 The Derivative

1.35 Supplementary Exercises

Remark. The exercises in this section follow the order of the text.
1. Find an equation for the line through the point (2, 4) that is parallel to the line through the points (−1, 10) and
(15, 2).
2. Find an equation for the line tangent to the circle x2 + y 2 = 34 at the point (3, 5).
3. A certain concession stand sold 850 hot dogs on a day when the price was set at $2 per hot dog and sold 1100
hot dogs on a day when the price was set at $1.50 per hot dog. Assuming a linear relationship between number
of hot dogs sold and sales price, how many hot dogs would be sold at $1.65 per hot dog?
4. Consider the trapezoid with vertices at (0, 0), (1, 2), (4, 2), and (6, 0). Find a vertical line that divides the
trapezoid into two parts of equal area. With a little more work, find a horizontal line with the same property.
5. Find the distance from the point (1, 2) to the line y = 3x + 4.
6. Find a point on the line 5x − 7y = 22 that has integer coordinates.
7. Let A, B, and C be non-collinear points. Let ℓA be the perpendicular bisector of AB and let ℓC be the
perpendicular bisector of BC. Explain why the point of intersection of ℓA and ℓC is the center of the circle that
goes through A, B, and C. Use this fact to find an equation for the circle that goes through the points (0, 0),
(1, 4), and (3, 2).
8. The graph of a function y = f (x) is given. Find the quantities listed to the right of the graph.
y.
.
.....
......
....... 4 .•.
.......
.......
....... ... a) lim f (x)
x→−1− e) lim f (x)
....... 3 • ... y = f (x) x→2+
....... ...
b) lim f (x)
f ) lim f (x)
....... x→−1+
...•. 2 ..
x→3
c) lim f (x)
... x→1 g) f (1)
. ◦......................................
◦.. 1
... . .◦. d) lim f (x)
x→2−
h) f (2)
.... ..
.................. ............
.... x
−4 −3 −2 2 −1
3 1 4
 2
x , if x < 0;
9. Sketch a graph of the function f (x) = x + 1, if 0 ≤ x ≤ 4; then find lim+ f (x), lim f (x), and lim+ f (x).
 x→0 x→1 x→4
1/x, if x > 4;
10. List the discontinuities of the given function.
x 2 x2 + 4
a) f (x) = b) g(x) = c) h(x) =
x2 −4 1 + cos x x−1
11. Sketch the graph of a single function f with all the following properties: (i) f is defined on R, (ii) f is continuous
except at 3 and 8, (iii) f has different one-sided limits at 3, and (iv) lim f (x) does not exist.
x→8+
12. Suppose that f : [a, b] → [a, b] is continuous on [a, b]. Prove there is a point c ∈ [a, b] such that f (c) = c.
13. Use algebra to evaluate each of the following limits.
x−2 x2 − 1 x4 − 1
a) lim b) lim 2 c) lim
x→2 2x2 − 3x − 2 x→−1 x − 3x − 4 x→1 x3 − 1
1 1 1
√ − −1
x−9 x 2 (1 + h)2
d) lim √ e) lim f ) lim
x→9 x−3 x→4 x − 4 h→0 h
x2 − a2
14. Find a positive constant a so that lim exists and find the value of the limit.
x→1 x2 + 2x − 3

f (v) − 7
15. Suppose that f is a continuous function with the property that lim = 3. Find an equation for the line
v→2 v−2
tangent to the graph of y = f (x) when x = 2.
16. Find equations for the tangent and normal lines to the curve y = x4 when x = 1.
17. Find two points on the curve y = x3 for which the tangent line goes through (4, 0).
18. Suppose that the height h in inches of a beanstalk after t days is h = t3 /3. When is the beanstalk growing at a
rate of 2 feet per hour?
1.35 Supplementary Exercises 77

19. Find and simplify the derivative of the given function.


√ √ 9 √ 6
a) f (x) = 8 4 x − 15 5 x b) g(x) = c) h(x) = x−
5x2 x
20. Find equations for the tangent and normal lines to the curve y = 4/x when x = 2.

21. Find a point on the curve y = 4 x for which the tangent line goes through (−15, 0).

22. Find a point on the curve y = 3 x for which the tangent line goes through (0, 4).
23. Find and simplify the derivative of the given function.
1
a) f (x) = (3x − 1)5 b) g(x) = √ c) h(x) = (x2 + x + 1)8
5x2 + 4

24. Find both values of c for which the tangent line to f (x) = (x2 + x + 4)4 at (c, f (c)) goes through the origin.
25. Find and simplify the derivative of the given function. Be sure to factor as much as possible.
√ 2x − 1
a) f (x) = x x2 + 2 b) g(x) = c) h(x) = x2 (4x − 15)5
3x + 4

26. Determine all the values of x for which the tangent line to the graph of y = (2x + 7)3 (3x − 13)5 is horizontal.
27. Find an equation for the line tangent to the graph y = 6x/(x2 + 2) when x = 1.
28. Find the maximum and minimum outputs of the function on the given interval. Assume that 0 < a < b.

a) f (x) = x2 − 4x + 10, [1, 5] b) g(x) = x3 − 3x + 4, [0, 5] c) h(x) = x2 + 4/x2 , [0.8, 4]
d) F (x) = x3 − a2 x, [0, 2a] e) G(x) = ax/(a2 + x2 ), [a/2, 3a] f ) H(x) = x/a + b/x, [a, b]
29. Find the volume of the largest cone with surface area 36π. Assume that the surface area of the cone includes
the base as well as the sides.
30. Find the volume of the largest right circular cylinder that can be inscribed in a right circular cone of radius r
and height h. Find the fraction of the volume of the cone that is taken up by the optimal cylinder. Also, find
the ratio of height to radius for the optimal cylinder.
31. A wire with length L will be cut into two pieces and the pieces bent into different shapes.
a) Suppose that one piece is used to construct a square and the other piece is used to construct a circle. Where
should the cut be made in order to minimize the sum of the two areas?
b) Suppose that one piece is used to construct a square and the other piece is used to construct an equilateral
triangle. Where should the cut be made in order to minimize the sum of the two areas?
c) Suppose that one piece is used to construct a circle and the other piece is used to construct an equilateral
triangle. Where should the cut be made in order to minimize the sum of the two areas?
d) Suppose that one piece is used to construct a square and the other piece is used to construct a hexagon.
Where should the cut be made in order to minimize the sum of the two areas?
32. A company has plants that are located (in an appropriate coordinate system) at the points (0, 1), (0, −1), and
(3, 0). The company plans to build a distribution center on the x-axis. Where should the center be located in
order to minimize the sum of the distances from the three plants?
33. Find a general formula for the distance from a point (x0 , y0 ) to the line Ax + By + C = 0.
34. Let a, b, and d be positive numbers. Suppose that two vertical poles of heights a and b are separated by a
distance d (all units are the same). Find the length of the shortest wire that can run from the top of one pole to
a point on the ground between the poles and up to the top of the other pole. Do you notice anything interesting
about the two triangles formed by the optimal wire, the two poles, and the ground?
35. Let a and b be positive constants and define a function f by f (x) = b − ax2 . For each value of x between 0 and
the positive x-intercept of f , the tangent line to the graph cuts off a triangle in the first quadrant. Find the
intercepts of the tangent line that cuts off the triangle of least area and the area of this triangle.
36. A pole leans away from the sun at an angle of 11◦ from the vertical. It casts a shadow of 25 feet when the angle
of elevation of the sun with the horizontal is 68◦ . Find the length of the pole.
37. Find, to the nearest tenth of a degree, the acute angle formed by the lines 2x + y − 5 = 0 and 3x − 4y − 2 = 0.
38. Use the Intermediate Value Theorem to prove that the equation 4 sin x = x has a positive solution.
39. Find the maximum value for the sum of the sines of the three angles of an isosceles triangle.
ah − 1
40. Find the value of lim . Hint: Interpret the limit as a derivative.
h→0 h
41. Evaluate each of the following limits.
78 Chapter 1 The Derivative

x21 − 1 2x2 − 3x − 2 (x + 1)2


a) lim b) lim c) lim
x→1 x4 − 1 x→2 x3 + x2 − 6x x→−1 1 + cos(πx)
x2 9x4 + 3x x6
d) lim e) lim f ) lim
x→∞ 3x2 − 4x + 1 x→∞ (x2 − 1)(2x2 + 7) x→∞ ex

42. Find the (x, y) coordinates of all inflection points. Treat a as a positive constant.
2
/a2
a) f (x) = a2 /(a2 + x2 ) b) g(x) = e−x c) h(x) = (x3 − a3 )/x
43. Suppose a person has a vertical leap of 42 inches. What initial velocity (in miles per hour) is required for such
a leap? How much hang time does the person have?
44. A person throws a ball straight upward along side of a tree. The ball is released from a height of 6 feet, just
reaches the top of the tree and then is caught by the person at a height of 4 feet. If the ball remains in the air
for 4.6 seconds, how tall is the tree?
45. Suppose that a particle is moving in a straight line with a velocity of 160 ft/sec. At time t = 0, the object begins
to decelerate at the rate of 4 t ft/sec2 . How far does this particle travel before coming to a stop?
46. For the curve y = x5 /5 − x3 /3, find the curvature when x = 1 and x = 2.
47. Find the maximum curvature of the curve y = x4 /4.
48. Show that the maximum curvature of y = x3 − 3x does not occur at its relative extreme points.
49. Give an example of a function that has zero curvature at a point on its graph that is not an inflection point.
50. The curvature of a curve is related to the normal component of acceleration; think of getting pushed toward the
door of a car as you round a corner quickly. Railroad tracks must be designed in such a way that the curvature
does not change abruptly. Suppose that the x-axis for x ≤ 0 and the line y = x for x ≥ 1 represent two sections
of railroad track that must be joined with a curve of the form y = Ax5 + Bx4 + Cx3 + Dx2 + Ex + F . Find
constants A, B, C, D, E, F so that the curve joins the two pieces of track and has the same tangent line and same
curvature at the junction points.
51. Let b be a constant and let κ be the curvature of the function f (x) = x3 + bx. Show that κ has exactly two
relative extreme outputs. Find the values of x where these extreme outputs occur.
2
The Integral

The derivative concept is motivated by the geometric problem of finding the slope of the tangent line to a
general curve. As a quick review, to compute the slope of the curve y = f (x) at the point (c, f (c)), take any
other point (v, f (v)) on the graph and determine the slope of the secant line. The limit of the slopes of the
secant lines as v → c is the slope of the tangent line at c (see the figure).
y
..
......
.....
.. y = f (x)
.. .
..
......................... secant line f (v) − f (c)
...
...
..................
......
Slope of secant line is
v−c
;
f (v) ..•
.
.....
.
.
......
..
f (v) − f (c)
.
........ .......... tangent line
...................................
.............
........ ...
...
.. . slope of tangent line is lim .
...
.... v−c
f (c) ....................................................• ...
..
....
. .
.. ... .......
. ..... .. v→c
... .. .. ..
.....
......
......
.....
......
............
...... x
c v

The slope of the secant line is given by the difference quotient (f (v) − f (c))/(v − c). The slope of the tangent
line is found by letting the point (v, f (v)) slide along the curve to the point (c, f (c)). In other words, the
numbers v get closer to c. Hence, the geometric problem of determining the slope of a curve at (c, f (c))
f (v) − f (c)
reduces to the algebraic problem of evaluating lim . Since a general curve has a different slope
v→c v−c
at every point, this process gives rise to a function; the new function gives the slope of the original function
at any particular point. Thus, the derivative of a function f is another function f ′ defined by

f (v) − f (x)
f ′ (x) = lim
v→x v−x

for each value of x in the domain of f for which the limit exists. The number f ′ (c) represents the slope of
the graph y = f (x) at the point (c, f (c)). It also represents the rate of change of y with respect to x when

79
80 Chapter 2 The Integral

x is near c; this interpretation of f ′ opens the door to a number of physical applications. Differential calculus
is concerned with finding simple ways to evaluate derivatives (rather than using the definition), using the
derivative to study the graph and behavior of a function, and exploring various applications of the derivative
to physical problems.
The integral concept is motivated by the geometric problem of finding the area of a region with curved
boundaries. The most familiar region with curved boundaries is the circle and attempts to find its area can
be traced back several thousand years. To phrase the problem in a modern setting, let f be a continuous
nonnegative function defined on an interval [a, b] and let R be the region under the curve y = f (x) and above
the x-axis on the interval [a, b] (see the figure).
y
.....
........
y = f (x)

................ ...... Determine the area of


f (a) ........ ... ... ....
... . .. the region R bounded by
.. .......
...
f (b)
...
... ... .
...
... ... .. ... the curve y = f (x) and
...
... ... .. ...
...
...
...
...
..
...... ...
...
...
three straight lines.
... ...
... ...
... ...
.. ..
... R ...
... ...
.... .... .............
..... x
a b

The problem is to find the exact value of the area of the region R. The solution to the area problem leads
to the definition of the integral and, as with the derivative, the limit concept appears in the definition
of the integral. In general, solving the area problem or, equivalently, evaluating integrals turns out to be
more difficult than the problem of finding the slope of a curve. In the late seventeenth century, an amazing
connection between the tangent problem and the area problem was discovered, thus making the area problem
easier to solve. The discovery of this connection, which is known as the Fundamental Theorem of Calculus,
is considered to be the origin of calculus. By taking advantage of this result, many great achievements in
both mathematics and the physical sciences were accomplished. In spite of its geometric motivation as area,
the integral has a wide variety of applications. These include computing geometric quantities such as volume
and arc length, and physical quantities such as work and force. We will consider several of these applications
in this chapter after first defining the integral and exploring its connection to the derivative.
2.1 Summation notation 81

2.1 Summation notation

Suppose you are asked to represent the sum of the squares of the first one hundred positive integers. Since
this sum is rather long, you would probably abbreviate it as 12 + 22 + 32 + · · · + 992 + 1002 . Since long sums
of this type are common in mathematics, it is helpful to have a shorthand notation for them. The upper
∑ 2
100
case Greek letter sigma Σ is used for this purpose. We write i for the sum of the squares of the first one
i=1
hundred positive integers. Further (and more abstract) examples of this summation notation include


n ∑
p+1
ai = a1 + a2 + a3 + · · · + an and f (k) = f (0) + f (1) + f (2) + · · · + f (p + 1).
i=1 k=0

The letter i or k as used here is called the index of summation and the sum begins at the integer below the
symbol Σ and ends at the integer above the symbol Σ, running through all of the integer values in between.

n ∑
n−1 ∑
n+4
The generic sum 12 + 22 + · · · + n2 can be represented by i2 or (k + 1)2 or even (i − 4)2 ; there is
i=1 k=0 i=5
more than one way to represent a sum in summation notation.
As a simple example of summation notation, consider the following sum:


4
(2i3 − 3i) = (2 · 13 − 3 · 1) + (2 · 23 − 3 · 2) + (2 · 33 − 3 · 3) + (2 · 43 − 3 · 4)
i=1
= −1 + 10 + 45 + 116 = 170
( ) ( )
= 2 13 + 23 + 33 + 43 − 3 1 + 2 + 3 + 4

4 ∑
4
=2 i3 − 3 i.
i=1 i=1

Of course, summation notation is not all that helpful for the sum of just a few terms, but this example
illustrates the main idea. As indicated by the rewriting of this sum in a different way, the commutative,
associative, and distributive properties of addition and multiplication can be applied to summation notation.
These facts yield the following properties of sums; the letter C represents a quantity that does not depend
on the index i.

n ∑
n ∑
n ∑
n
1. C = nC 3. (ai + bi ) = ai + bi
i=1 i=1 i=1 i=1
∑n ∑
n ∑n ∑n ∑n
2. Cai = C ai 4. (ai − bi ) = ai − bi
i=1 i=1 i=1 i=1 i=1

The proofs of these properties are not hard; simply write out both sides of the equations and note that they
are equal. In addition to these properties of sums, we also need formulas for the sums of various powers of
the first n positive integers. The formula for the sum of the first n positive integers can be derived rather
easily by writing the sum forward and backward;

S =1+ 2 + 3 + · · · + (n − 1) + n

S = n + (n − 1) + (n − 2) + · · · + 2 +1

and then adding the two expressions. Thus 2S = n(n + 1) and the formula for the sum is

n(n + 1) ∑
n
n(n + 1)
1 + 2 + 3 + ··· + n = or i= .
2 i=1
2
82 Chapter 2 The Integral

The technique used to find this sum is a useful one to remember. To find formulas for the sums of higher
powers of positive integers, we begin by collecting some data.

n ∑
n ∑
n ∑
n
n i i2 i3 i4
i=1 i=1 i=1 i=1

1 1 1=1· 3
3 1 = 12 1=1· 5
5

2 3 5=3· 5
3 9 = 32 17 = 5 · 17
5

3 6 14 = 6 · 7
3 36 = 62 98 = 14 · 35
5

4 10 30 = 10 · 9
3 100 = 102 354 = 30 · 59
5

5 15 55 = 15 · 11
3 225 = 152 979 = 55 · 89
5

6 21 91 = 21 · 13
3 441 = 212 2275 = 91 · 125
5

7 28 140 = 28 · 15
3 784 = 282 4676 = 140 · 167
5

n ∑
n
We already have a formula for i. Comparing these sums with those for i3 reveals a simple (and
i=1 i=1
somewhat intriguing) relationship between them; the latter are the squares of the former. With a little more

n ∑
n
effort, a pattern appears in the i2 column; the sum i is multiplied by the fraction (2n + 1)/3 to obtain
i=1 i=1

n
2
i . The last column is the most complicated one, but even there a pattern develops when the sums of the
i=1
fourth powers are compared to the sums of the squares. Notice that the numerator of the multiplier goes up
by the numbers 12, 18, 24, . . .. The numerators can then be written as
( ) ( ) ( ) ( )
6 1 − 1, 6 1 + 2 − 1, 6 1 + 2 + 3 − 1, 6 1 + 2 + 3 + 4 − 1,

and so on. Since we have a formula for the sums in parentheses, these numbers are given by 6n(n + 1)/2 − 1
and the desired multiplier is thus (3n2 + 3n − 1)/5. Putting all of this information together gives us the
following table:

n
n(n + 1) n2 n
i= = +
i=1
2 2 2

n
n(n + 1)(2n + 1) n3 n2 n
i2 = = + +
i=1
6 3 2 6

n
n2 (n + 1)2 n4 n3 n2
i3 = = + +
i=1
4 4 2 4

n
n(n + 1)(2n + 1)(3n2 + 3n − 1) n5 n4 n3 n
i4 = = + + −
i=1
30 5 2 3 30
It is important to note that we have not proved these formulas; they are merely conjectures. After discussing
mathematical induction in the next chapter, it will be possible to give rigorous proofs of these conjectures.
For now, we will accept these as correct formulas. Studying the expanded form of each sum of the powers
of the first n positive integers, a pattern begins to emerge. By focusing on the highest two powers of n, one
is led to the conjecture that

n
nr+1 nr
ir = + + smaller positive integer powers of n,
i=1
r+1 2
2.1 Summation notation 83

for every positive integer r. This conjecture is indeed true and it can be proved using mathematical induction.
As an example of these specific formulas and the general properties of sums, consider the following:

∑75
( 3 ) ∑75 ∑75 ( 75 · 76 )2 ( 75 · 76 )
2i − 5i = 2 i3 − 5 i=2 −5 = 16, 230, 750.
i=1 i=1 i=1
2 2

For a second example, note how the “long” sum can be expressed in a simpler form:


n+1 ∑
n+1 ∑
n+1
2 + 5 + 8 + · · · + (3n + 2) = (3i − 1) = 3 i− 1
i=1 i=1 i=1
3 1
= (n + 1)(n + 2) − (n + 1) = (n + 1)(3n + 4).
2 2
Finally, it is important to note that these sum formulas are only valid when the starting value of the
index is 1. When this is not the case, we can do some subtraction:


34 ∑
34 ∑
8
34 · 35 · 69 8 · 9 · 17
k2 = k2 − k2 = −
6 6
k=9 k=1 k=1
= 17(35 · 23 − 4 · 3) = 17(700 + 105 − 12) = 17(800 − 7) = 13600 − 119 = 13481.

(The extended calculations are intended to reveal that the final answer can be obtained with minimal
calculation and no electronic device.) In the following sections, we will see more interesting applications of
these formulas.

Exercises
1. Write each of the following sums in summation notation.
a) 2 + 7 + 12 + · · · + 997 b) 1
2
+ 2
3
+ 3
4
+ ··· + 29
30
c) 1 + 3 + 9 + · · · + 2187
2. Write the sum 2 + 6 + 10 + · · · + (4n + 6) in summation notation in two different ways: one starting with i = 1
and another ending with i = n.
3. Use formulas in this section to find each of the following sums. Simplify your answers.

12 ∑15 ∑
25
a) 2i3 b) (3k2 − 4k + 2) c) 3
i=1 k=1 j=8


60 ∑
n ∑
m
d) i2 e) (4i − 2) f) k(k + 1)
i=22 i=1 k=1

4. Find simple formulas for each of the following sums. Hint: Write the sum in expanded form (without simplifying
the terms) and notice lots of cancellation. Sums of this type are known as telescoping sums.
∑n ( ) 2n (
∑ ) ∑n ( )
1 1
a) (i + 1)6 − i6 b) − c) aj − aj+2
i=1
k k+1 j=1
k=1

5. Find a formula, one that does not involve summation notation, for the indicated sum.
a) 4 + 6 + 8 + · · · + (2n + 2) b) 1 + 4 + 7 + · · · + (3n + 4) c) 12 + 42 + 72 + · · · + (3n − 2)2
1 3 + 23 + 3 3 + · · · + n 3
6. Evaluate the limit: lim .
n→∞ 3n4
r −1
n
7. Prove that r + r2 + r3 + · · · + rn = r · , where r is any real number other than 1 and n is a positive
r−1
integer. Hint: Let z = r + r2 + r3 + · · · + rn and write out a similar expression for rz. Find and simplify rz − z,
then solve for z.
84 Chapter 2 The Integral

2.2 Area under a curve

Let f be a continuous nonnegative function defined on an interval [a, b]. Consider the problem of finding the
area A under the curve y = f (x) and above the x-axis on the interval [a, b]. The basic idea for solving this
problem is the following: use rectangles to approximate the area, then let the number of rectangles increase
in such a way that the approximation to the area improves; the limiting value of the rectangular areas should
give the area under the curve. The simplest way to carry out this process is to divide the interval [a, b] into
n equal subintervals, where n is an arbitrary positive integer, use the right endpoint of each subinterval to
determine the height of a rectangle that approximates the curve (see the figure), find the sum of the areas
of the n rectangles, then take the limit as n increases indefinitely.
y
......
.......
y = f (x) b−a
.... is the width of each subinterval;
........ ......... .....
.......................................
.....
...
... .... .... ....
n
f (a) ....
... . ... ... 1
....
... ... ... .... ...
...
...
.................... a1 = a + (b − a) is the right endpoint of the first subinterval;
n
. ......
... ... ...
f (b)
...
... . ... ... ...
.. .......... .........
... . ...
...
...
. ..
.... ... ... ...
... ... . ...
...
...
...
...
...
2
a2 = a + (b − a) is the right endpoint of the second subinterval;
...
... ... .. ....... ...... ...
...
...
...
...
... n
...
... ......... ....... ......
...
...
...
...
...
...
...
... ... ... ... ... 3
a3 = a + (b − a) is the right endpoint of the third subinterval;
... ...
... ... ... ... ... ...
... ... ... ... ...
... ... ... ... ...
... ... n
...
...
...
...
...
...
...
...
...
...
...
...
..
...
...
...
...
...
...
...
... ... ...
...
...
...
.
... ... ... ... ... ... n
...
...
...
...
...
...
...
...
...
... ...
... an = a + (b − a) = b is the right endpoint of the nth subinterval.
...
...
...
...
...
...
...
...
...
... ...
...
n
...
... .... .... .... .. .... ............
...... x
a a1 a2 a3 · · · an−1 b

Carrying out this process shows that the area A under the curve y = f (x) and above the x-axis on the
interval [a, b] is given by

∑n ( i )b − a ∑n
b−a
A = lim f a + (b − a) = lim f (ai ) ,
n→∞
i=1
n n n→∞
i=1
n

where ai = a + i(b − a)/n represents the ith hash mark on the interval [a, b]. The geometry problem of
finding the area under a curve is reduced to the algebra problem of finding the limiting value of a sum.
As a simple example, let f (x) = x2 and consider the interval [0, b], where b is a positive number. The
area A under this curve on [0, b] is

∑n ( )2 ( b3 n(n + 1)(2n + 1) ) b3
b3 ∑ 2
n
ib b
A = lim = lim 3 i = lim · = ;
n→∞ n n n→∞ n i=1 n→∞ n3 6 3
i=1

the formula for the sum of the squares of the first n positive integers makes the limit easy to evaluate. More
generally, if r is a positive integer and f (x) = xr , then the area Ar under the graph of y = f (x) and above
the x-axis on the interval [0, b] is given by (see the general formula presented in the previous section)
n ( )r
∑ br+1 ∑ r br+1 ( nr+1 )
n
ib b br+1
Ar = lim = lim r+1 i = lim r+1 + smaller integer powers of n = .
n→∞
i=1
n n n→∞ n i=1
n→∞ n r+1 r+1

Hence, a formula for the area under any curve of the form y = xr , where r is a positive integer, follows

n
pretty easily from the formula for the sum ir .
i=1
2.2 Area under a curve 85

To illustrate how this general result can be used to find areas under specific curves, let A[a,b] represent
the area under the curve y = x6 and above the x-axis on the interval [a, b]. The formula derived above
indicates that A[0,b] = b7 /7 for any positive number b. Using symmetry and simple properties of areas, we
find that
37 2187
A[0,3] = = ;
7 7
27 17 127
A[1,2] = A[0,2] − A[0,1] = − = ;
7 7 7
27 37 2315
A[−2,3] = A[−2,0] + A[0,3] = A[0,2] + A[0,3] =+ = .
7 7 7
Notice that once a formula for area has been obtained, it is no longer necessary to use the definition of area
as a limit of a sum in order to find the area. This is particularly important to realize for curves that define

elementary areas. For example, to find the area under the curve y = 1 − x2 on the interval [−1, 1], we
simply need to recognize that the area under consideration is half the area of a circle of radius 1. It follows
that the area of this region is π/2; there is no need to evaluate the limit of a sum.

Exercises
1. Find the area under the curve y = x2 and above the x-axis on the given interval. You should use the b3 /3 result
derived in this section along with some simple reasoning.
a) [0, 12] b) [3, 7] c) [−3, 4]
2. Find the area of the region bounded by the graph of y = x , the x-axis, and the tangent line to y = x2 at the
2

point (a, a2 ), where a is a positive constant. Hint: Sketch a careful graph.


3. Use the definition of area under a curve as a limit of a sum to derive formulas for the area under the curves
y = x and y = x3 on the interval [0, b], where b is a positive constant. Imitate the example in the text.
4. Use the fact that the area under the curve y = xr and above the x-axis on the interval [0, b] is br+1 /(r + 1) to
find the area under the curve on the given interval.
a) y = x5 , [0, 2] b) y = x3 , [1, 5] c) y = x4 , [−2, 4]
5. Use facts from geometry to find the area under the curve on the given interval.
√ √
a) y = 3 − |x − 1|, [−1, 2] b) y = 9 − x2 , [0, 3] c) y = 9 − x2 , [0, 2]
6. Use the definition of area under a curve as a limit of a sum to find the area under the curve y = 2x on the
interval [0, 1]. You will need to use the sum formula given in Exercise 7 of Section 2.1.

7. Find the area under the curve y = x and above the x-axis on the interval [0, 4] and the area under the curve

y = 3 x and above the x-axis on the interval [0, 8]. (Draw a graph and think creatively.)
r
8. Let r be an even positive integer.
( Show that)the area under the curve y = x and above the x-axis on the interval
[a, b] is given by the formula b r+1
−ar+1
/(r + 1). Hint: The results in this section show that this formula
is valid for intervals of the form [0, b], where b is any positive number. Use this fact, along with properties of
area and the symmetry of the curve y = xr (this is where the assumption that r is even is used), to derive the
formula for a generic interval [a, b]. You must consider the cases 0 ≤ a < b, a < 0 < b, and a < b ≤ 0 separately.
9. Let f be a continuous nonnegative function defined on [a, b] and let k be a constant. Prove that the area under
the curve y = kf (x) and above the x-axis on the interval [a, b] is k times the area under the curve y = f (x) and
above the x-axis on the interval [a, b]. (This follows easily from the definition of area as a limit of a sum.)
86 Chapter 2 The Integral

2.3 The integral of a continuous function

The limit formula for the area under a curve that was derived in the last section forms the basis for the
definition of the integral of a continuous function. Be certain you understand the significance of each term
in the definition.

∫b
DEFINITION 2.1 The integral of a continuous function f on an interval [a, b], denoted by a
f (x) dx,
is defined by
∫ b ∑n ( ) b−a
i
f (x) dx = lim f a + (b − a) · .
a n→∞
i=1
n n

It can be shown that the limit that appears in the definition exists for every continuous function, but a proof
of this fact is beyond the level of this text. We often write the definition of the integral more simply as
∫ ∑
n
b
b−a
f (x) dx = lim f (ai ) · ,
a n→∞
i=1
n

where it is assumed that ai = a + i(b − a)/n for i = 0, 1, . . . , n. Note that the function f considered in
the definition is an arbitrary continuous function; the condition that f be nonnegative (as it was in the last
section) is not part of the definition. This impacts the area interpretation of the integral, but the limit and
the sum still make sense from a mathematical point of view. Hence, although the integral can be used to
represent certain areas, a general integral does not determine an area. Various interpretations of the integral
will become apparent when we discuss applications of the integral.
To get a feeling for this definition, we will express a particular integral as a limit of a sum. If f is defined

by f (x) = 10 + x2 and the interval of interest is [1, 3], then
∫ √ (
√ ∑ 2i )2 2
3 n
10 + x2 dx = lim 10 + 1 + · .
1 n→∞
i=1
n n

As you can well imagine, directly evaluating a limit as complicated as this is almost a hopeless task. We will
soon find a simple way to evaluate integrals that avoids the need for limits and sums.
The letters and symbols used in the notation for the integral are explained below.

The symbol is a variation on the letter S, the first letter of the Latin word summa for sum.

The numbers a and b are the lower and upper limits of integration, respectively.

The function f is sometimes referred to as the integrand.

The letter x is known as a dummy variable as it merely holds a place.

The quantity dx represents the small difference between two very close values of x (see Section 1.6).
∫b
Hence, the symbol a f (x) dx can be interpreted as the sum of products of the form f (x) dx, where the sum
is taken over all the x values from x = a to x = b. If f happens to be a nonnegative continuous function,
2.3 The integral of a continuous function 87

then the product f (x) dx can be interpreted as the area of a very skinny rectangle (see the figure).
y
.
.....
........
y = f (x)
... ....
....... .......... .. .....
The thin rectangle above position x
f (a) ........ ........ ..... .
.... .... ...
. ... .... has height f (x) and width dx;
f (b)
...
...
...
... ... . ......
...
...
...
...
...
...
... ... ....
...
its area is thus f (x) dx.

...
...
...
...
... .. ... b
...
...
...
...
..
..... ...
...
...
Summing all these areas gives f (x) dx,
... ... ... a
... ... ...
...
..
...
..
...
.. which represents the area under the curve.
... ... ...
... ... ...
... ... ... ............
..... x
a x b
∫b
Therefore, when f is nonnegative, the integral a
f (x) dx represents the area under the curve y = f (x) and
above the x-axis on the interval [a, b]. In applications involving the integral, the product f (x) dx often has
other physical interpretations. For example, if x represents time in seconds and f (x) represents the velocity
of a particle in meters per second, then f (x) dx is the displacement (measured in meters) of the particle. In
∫b
this case, the integral a f (x) dx gives the total displacement of the particle during the time interval [a, b].

Exercises
1. Use the definition of integral to express the given integral as a limit of a sum.
∫ 3 ∫ π ∫ 5
1
a) (4x3 + x) dx b) sin x dx c) dx
1 0 2 x

2. Use the definition of integral to express the given limit as an integral.


∑n ( )3 ∑ n ( ( )2 ) ∑n
3i 3 2i 2i 2 1
a) lim 4+ b) lim 5 + c) lim
n→∞
i=1
n n n→∞
i=1
n n n n→∞
i=1
n + i
∫4
3. Use the definition of the integral to evaluate 1
(3x − x2 ) dx.
∫b
4. Use the definition of the integral to evaluate 0
ex dx, where b is a positive constant. (See Exercise 7 in Section
2.1.)
∫b
5. Suppose that f is continuous and negative on [a, b]. Give an area interpretation for a
f (x) dx.
6. Suppose that f is continuous on [a, b] and assumes both positive and negative values.
∫ ∫b
b
a) Give area interpretations for both a f (x) dx and a |f (x)| dx.
∫ ∫b
b
b) Give displacement interpretations for both a f (x) dx and a |f (x)| dx, assuming x is time and f (x) is
velocity.
∫ ∫b
b
c) Which of the numbers a f (x) dx and a |f (x)| dx is larger?
∫ b
br+1 − ar+1
7. The purpose of this exercise is to provide a proof of the fact that xr dx = , where r is a positive
a r+1
integer and [a, b] is any interval.
a) Explain why Exercise 8 from the previous section establishes this result for even positive integers.
b) Adapt the proof for even powers to odd powers. The only real difference occurs when a or b are negative for
then the integral (or a portion of the integral) must be interpreted as negative area. Remember to consider
the cases 0 ≤ a < b, a < 0 < b, and a < b ≤ 0 separately.
∫ b ∑ (
n−1
i
)
b−a
8. Let f be continuous on [a, b]. Prove that f (x) dx = lim f a + (b − a) . Hint: Subtract the sum
a n→∞
i=0
n n
given here from the sum in the definition, then take the limit.
88 Chapter 2 The Integral

2.4 Evaluating the integral of a polynomial

As mentioned in the last section, finding the value of an integral from the definition can be an intimidating
task. One way to simplify this task is to establish general properties that are satisfied by integrals. In this
section, we will only consider a few basic properties of integrals, but they will be sufficient to allow us to
find the integral of any polynomial with very little effort.
Let f and g be continuous functions defined on an interval [a, b] and let k be an arbitrary constant. The
following general properties of the integral are simple consequences of the definition of the integral and the
corresponding properties of sums listed in Section 2.1.
∫ b ∫ b ∫ ∫
( ) b b
1. k dx = k(b − a) 3. f (x) + g(x) dx = f (x) dx + g(x) dx
a a a a
∫ ∫ ∫ b( ∫ ∫
b b ) b b
2. kf (x) dx = k f (x) dx 4. f (x) − g(x) dx = f (x) dx − g(x) dx
a a a a a
The proofs of these properties involve more writing than thinking. For example, a proof of property (3)
reads as follows.
∫ b( ∑
) n
( ) b−a
f (x) + g(x) dx = lim f (ai ) + g(ai ) · (definition of integral)
a n→∞
i=1
n
(∑
n
b−a ∑
n
b − a)
= lim f (ai ) · + g(ai ) · (property of sums)
n→∞
i=1
n i=1
n

n
b−a ∑n
b−a
= lim f (ai ) · + lim g(ai ) · (property of limits)
n→∞
i=1
n n→∞
i=1
n
∫ b ∫ b
= f (x) dx + g(x) dx (definition of integral)
a a
Proofs of the other properties will be left for the reader.
At this point in our development of the integral, we have three ways to evaluate integrals. The first way
is to use the definition, but for most functions this is a rather complicated thing to do. The second way,
which only works for a few simple situations, is to interpret the integral as an area and find the area some
other way, most likely with a formula from geometry. The third way is to use formulas and properties that
have been derived from the definition of the integral. The most relevant formula found thus far is
∫ b
br+1 − ar+1
xr dx = ,
a r+1
where r is a positive integer and a and b are real numbers with a < b. A proof of this fact was outlined
in Exercise 7 of the previous section. Since this is an important result, we will include some of the details.
Suppose that r is an odd positive integer and let A[c,d] denote the signed area of the region bounded by the
curve y = xr and the x-axis on the interval [a, b] (positive if above the x-axis and negative if below). If
a < 0 < b, then
∫ b
xr dx = A[a,b] = A[a,0] + A[0,b] (area interpretation of integral, “property” of area)
a
(−a)r+1 br+1
= −A[0,−a] + A[0,b] = − + (symmetry of curve y = xr , Section 2.2 result)
r+1 r+1
br+1 − ar+1
= (r + 1 is an even integer)
r+1
2.4 Evaluating the integral of a polynomial 89

Think carefully about each of the steps required in this proof. The most important step is the use of the
formulas for the sums of the powers of the first n positive integers that were obtained earlier in this chapter.
This result, combined with properties (1) through (4), makes it possible to evaluate the integral of any
polynomial. For example,
∫ 3 ∫ 3 ∫ 3 ∫ 3 ∫ 3
(2x4 + 3x2 − 5x + 4) dx = 2 x4 dx + 3 x2 dx − 5 x dx + 4 dx
1 1 1 1 1
35 − 15 33 − 13 32 − 12
=2· +3· −5· + 4 · (3 − 1)
5 3 2
= 96.8 + 26 − 20 + 8 = 110.8.

Using the definition to write out this integral as the limit of a sum gives some appreciation for how much
effort is avoided by these simple formulas.

Exercises
1. Write out a proof of property (4) of integrals using the definition of the integral.
2. Use properties and formulas from this section to evaluate each of the following integrals.
∫ 4 ∫ 6 ∫ 2
a) 7x3 dx b) (8t + 5) dt c) (2x3 − 6x + 3) dx
2 1 −1
∫ 3 ∫ 1 ∫ 1
d) (2x2 + 4x + 7) dx e) (4x3 − 6x2 − 2x + 1) dx f) (t7 + 2t3 − 6t − 8) dt
1 −2 0

3. Use properties of the integral and the area interpretation of an integral to evaluate each of the following integrals.
∫ 2 ∫ r( √ ) ∫ 1( √ √ )
a) (3 + 2|x − 1|) dx b) t + 2 r2 − t2 dt c) 4 1 − x2 − x − x2 dx
−1 0 0

4. Evaluate each of the following integrals. Use any method that has been considered thus far.
∫ 2( 2 ) ∫ 1( √ ) ∫ 4( )
x x
a) − + 4 dx b) x − 2 1 − x2 dx c) 2|x − 3| − x2 dx
1 2 5 0 1

5. Suppose that v(t) = 5 − 2t gives the velocity in meters per second of a particle at time t seconds. Find the
distance traveled by the particle over each of the following time intervals.
a) 0 ≤ t ≤ 1 b) 0 ≤ t ≤ 2 c) 0 ≤ t ≤ 4
∫ 2 ∫ 2
6. Use formulas to evaluate tx2 dx and tx2 dt. Note the importance of the differential term.
1 1
90 Chapter 2 The Integral

2.5 Further properties of the integral

In the definition of the integral, it is assumed that a < b. It is convenient to adopt the conventions that
∫a ∫a ∫b
a
f (x) dx = 0 and b f (x) dx = − a f (x) dx. Furthermore, there is nothing magical about the value of the
function at the right endpoint of each subinterval; any point in the subinterval will do. This observation
leads to the following slight generalization of the definition of the integral. This form of the definition will
be useful when we consider various applications of the integral.

DEFINITION 2.2 The integral of a continuous function f on an interval [a, b] is defined by


∫ b ∑
n
f (x) dx = lim f (ti )(ai − ai−1 ),
a n→∞
i=1

where ai−1 ≤ ti ≤ ai for i = 1, 2, . . . , n. (Recall that ai = a + i(b − a)/n.)

Let f and g be continuous functions defined on an interval [a, b]. The following further properties of
integrals are valid. (The numbers are a continuation of those in the previous section.)
∫ b
5. If m ≤ f (x) ≤ M for all x ∈ [a, b], then m(b − a) ≤ f (x) dx ≤ M (b − a).
a
∫ b ∫ b
6. If f (x) ≤ g(x) for all x ∈ [a, b], then f (x) dx ≤ g(x) dx.
a a
∫ s ∫ t ∫ s
7. If r, s, t ∈ [a, b], then f (x) dx = f (x) dx + f (x) dx.
r r t
Assuming that r < t < s, property (7) is virtually obvious from the area interpretation of the integral.
However, it is rather difficult to prove this fact using the definition of the integral, and the proof will
therefore be omitted. It is important to realize that property (7) is valid for all choices of r, s, and t; the
order does not matter. The following figure gives a geometric representation of this result; note the change
of sign when the limits of integration are interchanged.
y
....... y = f (x)
.......

................... ....
.. .....
By interpreting the integrals as area,
.......... ....
... .
.
. . ∫ ∫ ∫
...
...
... ..
... .
... .. ... ...... s t t
....
.... ... . ... f (x) dx = f (x) dx − f (x) dx
... ...
..
... .. ....
... ... .. r r s
...
...
...
... .. . ...
... ∫ ∫
...
...
...
...
.
....... ...
...
t s
.... .... .... ....... = f (x) dx + f (x) dx.
. . . ............ x r t
r s t

The inequality properties (5) and (6) follow easily from the definition of the integral. Values for m and
M are typically chosen to be the minimum and maximum outputs, respectively, of the function f on the
interval [a, b]. These properties can be used to find rough estimates for the value of an integral. For example,
∫3√
suppose we want to find lower and upper bounds for the value of the integral 0 x3 − 3x + 6 dx. Noting
√ √ √
that the minimum and maximum outputs of the function x3 − 3x + 6 on [0, 3] are 4 and 24 (you need
to use some simple ideas from differential calculus to see this), respectively, we find that
∫ 3 √ √ √
6=2·3≤ x3 − 3x + 6 dx ≤ 24 · 3 = 6 6.
0
2.5 Further properties of the integral 91

√ √
To illustrate property (6), since the inequality x6 − 2 < x6 is valid on the interval [2, 5], we obtain
∫ √ ∫
5 5
54 − 24 609
x − 2 dx ≤
6 x3 dx = = .
2 2 4 4

Note that it is easy to find the value of the integral of the simpler function.
It is also helpful to understand properties (5) and (6) from the perspective of area. For example, property
(5) asserts that the area under a curve is trapped between the area of two rectangles (see the figure).
y
.
.....
.....
y = f (x)
area of small rectangle ≤ area under curve ≤ area of large rectangle
.... .
.......
M ...... ........
...............................................................................................................
.. ..

...
... ...
...
... ... ... ....
..................................................................................................................
...
.. ∫ b
... . .. ...
m(b − a) ≤ f (x) dx ≤ M (b − a)
.. .
..
...
... ... .
...
...
a
...
... ... .. ...
...
...
... ... .. ...
...

m
...
.......
...
.................................................................................................................. .
...
..
.. .. ................. x
a b

From the figure, it makes sense that there is a rectangle with width b − a whose area is exactly the area under
the curve. This fact is made explicit in the following theorem; the proof (which will be left as an exercise)
requires two properties of continuous functions, namely the Extreme Value Theorem and the Intermediate
Value Theorem. This result will be used in the next section to prove the Fundamental Theorem of Calculus.

THEOREM 2.3 Mean Value Theorem for Integrals If f is continuous on [a, b], then there exists
∫b
a point c ∈ [a, b] such that f (c)(b − a) = a f (x) dx.

Exercises
1. Use property (5) to find upper and lower bounds for each integral.
∫ 5 ∫ 4 ∫ 6 √
1 1
a) dx b) √ dx c)
3
x2 − 6x + 15 dx
2 2x − 1 1 3+2 x 2

2. Use property (6) to determine the largest integral in each pair. Justify your answers.
∫ 6√ ∫ 6 ∫ π ∫ π ∫ 2 ∫ 2 2
a) x4 + 5 dx, x2 dx b) sin3 x dx, sin5 x dx c) ex dx, ex dx
2 2 0 0 1 1
∫ 4 √
3. Use property (6) to prove that x4 − 1 dx ≤ 21. Do not evaluate the given integral.
1
∫3 ∫3 ∫3 ∫3
4. Given that 1 f (x) dx = 6, 2
f (x) dx = 2, 1 g(x) dx = 10, and 2 g(x) dx = −3, evaluate
∫ 3( ) ∫ 2( ) ∫ 2( )
a) 2f (x) − 3g(x) dx b) 4f (x) + g(x) dx c) 2g(x) − x2 dx
1 3 1
∫ 3
5. Use property (7) to evaluate |2x − x2 | dx.
0
6. Give a proof of the Mean Value Theorem for integrals. Illustrate this theorem geometrically.
7. For f (x) = x2 on [1, 4], find the point c guaranteed by the Mean Value Theorem for integrals.
∫ x
8. Let f (x) = (t3 − 2t2 + 5) dt for all x > 1. Use formulas to find f (x), then compute f ′ (x).
1
92 Chapter 2 The Integral

2.6 The Fundamental Theorem of Calculus

As a reminder, the area problem is the following: find the area under the curve y = f (t) and above the t-axis
on the interval [a, b], where f is any continuous and nonnegative function defined on [a, b]. The solution

n
to this problem leads to the expression lim f (ai ) b−a
n . The quantity represented by this complicated n→∞ i=1
expression is a number; the area of the region in question. When Newton considered the area problem, he
made the insightful leap to treat area as a function. Let A(x) be the area under the curve y = f (t) and
above the t-axis on the interval [a, x]; the number A(x) clearly depends on x. Note that A(a) = 0 and the
problem is to find A(b).
y y
..... .....
........ ........
y = f (t) y = f (t)
....................... ...... ................ ............ The shaded area
f (x) .................. .... .... .... .... .... .... ................. ... ....
. ......... ... ........................... .... is A(v) − A(x);
..... .. .. .. .. .. .. .. .. ..... .
.... . . . . . . . . .... .. .. .....
f (x) ... ............ ..
............................. ..
...
...
..... .. .. .. .. .. .. .. .. ..... .. . ... ... the rectangle has
.... . . . . . . . . .... ... . ... ... .
... .. ...............
...
...
... A(x) ....
...
.. ... ... ...
...
...
...
...
... ... .. ................... area f (x)(v − x).
... ... ... . . ... ...
.... .. ..................
.... . . . . . . . . ....
..... . . . . . . . . .....
.. . . . . . . . . ..
... . . . . . . . . ...
..... ...
....
.
.. .....
...
...
... .. ............. ....
. ............ ..
. . t . . . ........... t
a x b a xv b

Since the area A is now a function, it is possible to consider its rate of change. To Newton, it was clear that
the rate of change of the area function was the height of the curve, that is, A′ (x) = f (x). He simply noted
that A(v) − A(x) ≈ f (x)(v − x) when v > x is very close to x (see the figure: the area under the curve is
A(v) − A(x)
close to the area of the rectangle). It follows that A′ (x) = lim = f (x). A formal statement and
v→x v−x
proof of this result is given at the end of the section.

To get a sense of this result, for each x ∈ [0, 1], let A(x) denote the area under the curve y = 1 − t2
and above the t-axis on the interval [0, x]. Using some simple facts from trigonometry and geometry, we find
that y
.
......
.........

y= 1 − t2 A(x) = area under curve on [0, x]
1 ............... ( √ )
......•....... x, 1 − x2 = area of sector + area of triangle
..... .. .. √
..... .. .
.

θ = arcsin x θ................ ....... ... = 21 arcsin x + 12 x 1 − x2 .


.
.. .. . .. ..................
..... ....
.... . t
x


The reader should verify that A′ (x) = 1 − x2 , as expected by the previous discussion.
In modern terms, Newton’s observation is that the derivative of a function defined as an integral is
simply the integrand. This fact indicates that the processes of integration and differentiation are very much
related. Consequently, this result is known as the Fundamental Theorem of Calculus.

THEOREM 2.4 Fundamental Theorem of Calculus If f is continuous on an interval [a, b] and a


∫x
function F is defined by F (x) = a f (t) dt for all x in [a, b], then F ′ (x) = f (x) for all x in [a, b].

Proof. Let x ∈ [a, b]. If v ∈ [a, b] and v ̸= x, then the Mean Value Theorem for integrals states that there
∫v
is a point cv between v and x such that f (cv )(v − x) = x f (t) dt. Note that lim cv = x. We then have
v→x
2.6 The Fundamental Theorem of Calculus 93

F (v) − F (x)
F ′ (x) = lim (definition of derivative)
v→x v−x
∫ ∫ x
1 ( v )
= lim f (t) dt − f (t) dt (definition of F )
v→x v − x
∫ av a
1
= lim f (t) dt (property (7) of integrals)
v→x v − x x

= lim f (cv ) (MVT for integrals)


v→x

= f (x). (f is continuous)

This completes the proof.

For a couple of examples to illustrate this theorem, consider the following:


∫ x ∫ 2x2
d d
e−u du = e−(2x · 4x = 4xe−2x .
2 2 2 4
3 3 /2 ) /2
sin(t ) dt = sin(x ) and
dx 1 dx 0

Note the use of the Chain Rule in the second example. We can also answer questions such as the following:
find a function f so that f (1) = 5 and f ′ (x) = x cot x. By the Fundamental Theorem of Calculus, the
function ∫ x
f (x) = 5 + t cot t dt
1

has the desired properties.

Exercises
1. Find the derivative of each of the following functions.
∫ x√ ∫ x ∫ t
4 1
a) f (x) = t2 + 9 dt b) g(x) = dt c) h(t) = √ dx
3 2−t
3
0 −2 x4 + 9
∫ 2 ∫ u2 √ ∫ ex
3
d) F (x) = (1 − e−t ) dt e) G(u) = s s3 + 2 ds f ) H(x) = sin t2 dt
x 0 −1
∫ 4 ∫ 2x ∫ ex √
2
g) u(x) = et dt h) v(x) = cos t3 dt i) w(x) = t3 + 4 dt
−x2 x x2

∫ 1 ∫ 2x
sin t
2. Determine F ′′ (π/4) given that F (x) = f (t) dt and f (x) = dt.
t
∫ x x 1
1
3. Evaluate lim 4 sin(2t3 ) dt.
x→0 x 0
∫ x
5
4. Determine the interval on which the function f defined by f (x) = dt is concave up.
0 2t2 − 6t + 19
5. Find an integral expression for a function f such that f (2) = 0 and f ′ (x) = sin(x4 ).
∫ x
6. Suppose that F (x) = |t| dt for all x ≥ −1. Find F (0), F ′ (0), F (1), F ′ (1), F (3), and F ′ (3).
−1
∫ x
7. Consider the function G defined by G(x) = |t| dt for all x ≥ −2. Use the area interpretation of the integral
−2
to find a formula for the function G that does not involve an integral, then find G′ (x).
∫ b
8. Explain how Newton’s observation shows that cos t dt = sin b for any b > 0.
0
94 Chapter 2 The Integral

2.7 The Fundamental Theorem of Calculus, continued

Leibniz discovered the connection between integration and differentiation by studying the properties of sums
of differences. Consider any sequence of numbers such as 2, 5, 7, 11, 14, 20. Summing all of the differences
between the numbers gives the difference between the first and last numbers:

(5 − 2) + (7 − 5) + (11 − 7) + (14 − 11) + (20 − 14) = 20 − 2.

Every sequence of numbers has this property. When a function is applied to a sequence and the corresponding
differences are summed, the result is the last value of the function minus the first value of the function. For
example, applying x2 to the above sequence and summing the differences yields

(52 − 22 ) + (72 − 52 ) + (112 − 72 ) + (142 − 112 ) + (202 − 142 ) = 202 − 22 .

The term dx in the integral notation represents the difference between two successive x values. The equality
∫b
a
dx = b − a, which follows easily from the definition of the integral, simply indicates that the sum of all
the differences of the x values gives the length of the interval, the last value of x minus the first value of
x. For Leibniz, a quantity such as dz represented a little difference of z values and it was clear to him that

dz = zlast − zfirst ; the sum of all the little differences gave the total difference. This is a valid result even
when z represents a function.

It was also evident to Leibniz that y dx gave the area under a curve. The quantity y dx represents the
area of an extremely thin rectangle of height y and width dx and the sum of all these areas is the area under
the curve (see the figure). If a function z can be found so that dz = y dx (which is equivalent to dz/dx = y),
then the area problem has been solved. The area problem thus simplifies to the following inverse tangent
problem: given a function y of x, find a function z of x such that dz/dx = y.
y
.
...... The area is the sum of the y dx areas;
......
y = f (x) ∫ b
............. ......
......... ............. .. ....
.
A= y dx
... ... .
... ..
. .. ∫
a
...
...
...
... .
... ..
.. .......
.
b
...
... y ...... ... .. ... = dz, assuming dz = y dx
... ... ... .. ...
... a

...
...
...
...
...
... ..... ...
...
... ............ = z(b) − z(a).
... x
a dx b
It follows that integrals can be evaluated by finding an antiderivative of the integrand. A function F is an
antiderivative of a function f on an interval I if F ′ (x) = f (x) for all x ∈ I. This connection between
integration and differentiation is another part of the Fundamental Theorem of Calculus.

THEOREM 2.5 ∫ Fundamental Theorem of Calculus If f is a continuous function defined on an


b
interval [a, b], then f (x) dx = F (b) − F (a), where F is any antiderivative of f .
a
∫x
Proof. Let G be the function defined by G(x) = a
f (t) dt for all x ∈ [a, b]. By Theorem 2.4, we know
that G′ (x) = f (x) for all x ∈ [a, b]. Since F and G have the same derivative, there exists a constant C such
that G(x) = F (x) + C for all x ∈ [a, b] (see Corollary 1.27 in Section 1.27). It follows that
∫ b
( ) ( )
f (x) dx = G(b) − G(a) = F (b) + C − F (a) + C = F (b) − F (a).
a
2.7 The Fundamental Theorem of Calculus, continued 95

Writing down the definitions of the derivative and the integral side by side gives some indication of the
surprising nature of this result. Furthermore, after spending some time using the definition of the integral to
evaluate integrals, one can certainly appreciate how much this theorem simplifies the evaluation of integrals.
b
Here are four examples illustrating this part of the Fundamental Theorem of Calculus; the symbol F (x) a is
a common abbreviation for F (b) − F (a).
∫ 2( ( x4 ) 2 ( ) (1 )
) 1
2x3 − 6x + 1 dx = − 3x2 + x = 8 − 12 + 2 − −3+1 =− ;
1 2 1 2 2
∫ π/3 ( √ ) ( ) √
( ) π/3 3 3
(cos x − 2 sin x) dx = sin x + 2 cos x = +1 − 0+2 = − 1;
0 0 2 2
∫ 3
12 6 3 2 16
3
dx = − 2 = − − (−6) = ;
1 x x 1 3 3
∫ 5
1 5
dx = ln 1 + x = ln 6 − ln 2 = ln 3.
1 1+x 1

Antiderivatives of many simple functions can be found by thinking about differentiation in reverse.
The Fundamental Theorem of Calculus provides the most common method for evaluating integrals. It
states that an integral can be evaluated by finding an antiderivative of the integrand and plugging in the
endpoints. However, it is important to remember that integration is not antidifferentiation. An integral
represents the limit of a special type of sum. In many cases, integrals can be evaluated by finding an
antiderivative. However, there are other ways to evaluate an integral; for instance, an integral sometimes
represents a simple geometric area. In addition, the recognition of an integral as a limit of a sum will be
necessary for applications of the integral.

Exercises
1. Evaluate the following integrals.
∫ 3 ∫ π/2 ∫ π/4
a) (3x2 − 2x + 1) dx b) sin x dx c) sec x tan x dx
1 0 0
∫ ∫ ∫
3 2
1 2 √
d) (z 2 + z) dz e) dt f) (x3 − x) dx
−2 1 t3 1
∫ 2 ∫ 2 ( ) ∫ 2
1
g) dx h) 4 + 5|x| dx i) ex dx
1 x −3 0
∫ 5 ∫ 1/2 ∫ 2 √
5 1
j) dx k) √ dx l) 4 − x2 dx
2 2x − 1 0 1 − x2 0

2
2. Find the area of the region under the curve y = 2/(1 + x ) and above the x-axis on the interval [−1, 1].
3. Find the area of the region under the curve y = 1/x and above the x-axis on the interval [1, b], where b > 1.
∫ x ∫ x
d
4. Find conditions on f and F so that the statements f (t) dt = f (x) and F ′ (t) dt = F (x) − F (a) are
dx a a
valid. What do these statements say about the operations of integration and differentiation?
96 Chapter 2 The Integral

2.8 Finding antiderivatives: guess and check

The Fundamental Theorem of Calculus provides an excellent method for evaluating many integrals, but in
order to use this theorem, it is first necessary to find an antiderivative for the integrand. Recall that an
antiderivative of a function f is a function F with the property that F ′ (x) = f (x) for all x in the domain
of f . Antiderivatives are not unique since adding a constant to a function does not change its derivative.

The standard notation to represent all of the antiderivatives of a function f is f (x) dx and it is referred
∫b
to as the indefinite integral of f ; the symbol a f (x) dx is then called the definite integral of f . To

illustrate this symbol, note that 12x3 dx = 3x4 + C. The arbitrary constant C is called the constant of
integration; every antiderivative of 12x3 is of the form 3x4 plus some constant.
Finding antiderivatives is much more challenging than finding derivatives; it involves techniques and
skills rather than the simple use of formulas. As a start to finding antiderivatives, we simply note that every
derivative formula becomes an antiderivative formula when read backward. (See the list of basic formulas in
the table of integrals in Appendix B.) Even with these formulas, there is still a fair amount of work involved
in finding antiderivatives. Sometimes basic algebra is enough:
∫ ∫
(x2 + 3)2 dx = (x4 + 6x2 + 9) dx = 15 x5 + 2x3 + 9x + C;
∫ ∫ (
x−1 2 )
dx = 1− dx = x − 2 ln |x + 1| + C.
x+1 x+1
But more often than not, the chain rule comes into play. As a reminder, the chain rule is a derivative formula
( )′
for composite functions. It states that f (g(x)) = f ′ (g(x))g ′ (x) and its reversal requires some practice. As
a start, note that

d 4 4x3 + 2x 2x3 + x 1
ln x + x2 + 1 = 4 implies dx = ln x4 + x2 + 1 + C;
dx x + x2 + 1 x4 + x2 + 1 2

d 1
sin3 2x = 3 sin2 2x · 2 cos 2x implies sin2 2x cos 2x dx = sin3 2x + C.
dx 6
Look at these examples carefully and notice how the chain rule reels terms in when antidifferentiating.
One method for reversing the chain rule is guess and check: make an educated guess as to what the
antiderivative should look like, check the guess by taking its derivative, and modify the guess as necessary.
This method emphasizes the reverse nature of antidifferentiation and is also a good review of differentiation.
It also reinforces the idea that∫antiderivatives can always be checked.
x3
As an example, consider √ dx. The major part of the function under consideration involves
4 + x4
(4 + x4 )−1/2 . Since differentiation reduces powers by one, antidifferentiation increases powers by one. Thus
a reasonable guess for an antiderivative is (4 + x4 )1/2 . Since

d 1 2x3 x3 1√
(4 + x4 )1/2 = (4 + x4 )−1/2 4x3 = √ , we have √ dx = 4 + x4 + C.
dx 2 4 + x4 4 + x4 2
It should be pointed out that this method only works when the guess is off by a constant; the reader should
carefully consider why this is the case (see Exercise 5 below). Here is a second example.
∫ guess : e−2x ;
2

−2x2 1
xe−2x dx = − e−2x + C.
2 2
xe dx, hence
check : d −2x 2
= −4xe−2x ;
2
4
dx e
2.8 Finding antiderivatives: guess and check 97

It is often a good idea to check (by differentiation) the final answer one more time. For a third example,
consider the following.

∫ guess : tan5 (x/3);


4 2
tan (x/3) sec (x/3) dx, 5
check : d
dx tan5 (x/3) = 5 tan4 (x/3) sec2 (x/3)(1/3) = tan4 (x/3) sec2 (x/3).
3
It follows that ∫
3
tan4 (x/3) sec2 (x/3) dx = tan5 (x/3) + C.
5
As a final comment, functions that look similar can have very different antiderivatives. For example,
∫ √ ∫ ∫ √
x 1 1
√ dx = − 1 − x2 + C; √ dx = arcsin x + C; √ dx = ln x + x2 − 1 + C.
1 − x2 1 − x2 x2 − 1

Each of these results can be checked by differentiating the function to the right of the equal sign. It is thus
very important to exercise caution when finding antiderivatives.

Exercises
1. Evaluate each of the following indefinite integrals.
∫ √ ∫ ∫
2
a) x2 x3 + 1 dx b) √ dt c) sin(4x) cos3 (4x) dx
3t − 7
∫ ∫ ∫ ( )3
sec2 t √ 1 1
d) 5
dt e) ( x − 1)2 dx f) 1+ dt
tan t t t2
∫ ∫ ∫
sin t 1 2
g) dt h) dt i) dx
(2 + 3 cos t)2 t2 + 2t + 1 5x + 1
∫ ∫ ∫
5 6x + 6 ln x
j) dx k) dx l) dx
(4x − 3)2 x2 + 2x − 3 x
∫ ∫ ∫
1 + et − e3t 6
m) (1 − e2t ) dt n) dt o) dx
e2t 1 + 9x2
2. Evaluate each of the following definite integrals.
∫ 4 ∫ 3 ∫ 2
√ 1 10
a) x(x + 1) dx b) dx c) dt
1 0 4x + 3 1 (1 − 2t)3
∫ 1 ∫ a ∫ π
x sin x
d) t sin(πt2 ) dt e) √ dx f) dx
0 0
2
a +x 2
0 1 + 3 cos2 x
∫ 1 √
3. Evaluate (x + 2) 1 − x2 dx by first writing the integral as the sum of two integrals. Think carefully!
0
∫ ∫ ∫ ∫
x2 x2 x2 x3
4. Evaluate √ dx, dx, dx, and dx.
4 + x3 4 + x3 (4 + x3 )2 4 + x2

5. Consider 6x2 sin(x2 ) dx. After making the guess cos(x2 ) for the antiderivative and checking it, Sally decided
that the answer was −3x cos(x2 ) + C. Explain how Sally arrived at this conclusion and why it is incorrect.
98 Chapter 2 The Integral

2.9 Finding antiderivatives: integration by substitution

When finding antiderivatives, it may be helpful to make a change of variables. An appropriate substitution
can make it clear what form the antiderivative should have. Consider the following example:
∫ ( ) ∫ ∫
9x2 let u = 4 + x3 , 9x2 3 3
dx, ; dx = du = − + C.
(4 + x3 )2 2
then du = 3x dx (4 + x3 )2 u 2 u

The substitution u = 4+x3 transformed the integral from one in the variable x to a simpler one in the variable
u. After finding the antiderivative in terms of the variable u, back substitution provides the antiderivative
of the original function; ∫
9x2 3
dx = − + C.
(4 + x3 )2 4 + x3
Of course, this problem could have been solved using the guess and check method, but it illustrates the
essential features of integration by substitution.
The key idea in finding antiderivatives using substitution is to completely transform the integral from
one variable to another variable in such a way that the new integral is easier to evaluate than the old integral.
There are no solid guidelines for a substitution—it is an acquired skill and thus requires a fair amount of
practice. For some problems, there is more than one suitable choice for a substitution. If the original
integrand (usually in the variable x) involves a composite function, then letting the new variable (usually u
plays this role) be the inside function is often a good place to start. It is also important to keep track of the
dx and du terms. Here is another example to illustrate this method.
∫ ( ) ∫ ∫
x2 let u = x3 , x2 1/3 1
6
dx, ; 6
dx = 2
du = arctan u + C.
x +1 then du = 3x2 dx x +1 1+u 3

x2 1
It follows that 6
dx = arctan(x3 ) + C.
x +1 3
Integration by substitution can also be used to evaluate definite integrals. In this case, the limits of
integration must also be transformed into appropriate values for the new variable. For example,
∫ 1 ( )
√ let u = 4 − 3x, u = 4 when x = 0
54x 4 − 3x dx,
2
;
0 then du = −3 dx; u = 1 when x = 1
∫ 1 ∫ 1 ( ∫ 4
√ 4 − u )2 √ ( 1 ) √
54x 4 − 3x dx =
2
54 u − du = 2 (4 − u)2 u du.
0 4 3 3 1
(Notice how the negative sign was used to change the order of the limits of integration.) To evaluate the
new integral, expand the integrand:
∫ 4 ∫ 4
2468
2 (16 − 8u + u2 )u1/2 du = 2 (16u1/2 − 8u3/2 + u5/2 ) du = .
1 1 105
The last few steps (find an antiderivative and plug in the endpoints) are routine and have been omitted.
On occasion, a substitution of some sort is made with no real direction in mind other than a hope that
the new integral will look more familiar. Consider the following example.
 √ 
∫ let u = t,
1  .
√ dt,
t+ t then t = u2 and dt = 2u du
2.9 Finding antiderivatives: integration by substitution 99

Notice how we solved for t before taking differentials; this is sometimes an easier way to determine the
relationship between the derivatives of the variables. We then have
∫ ∫ ∫ √
1 2u 1
√ dt = 2
du = 2 du = 2 ln |1 + u| + C = 2 ln |1 + t| + C.
t+ t u +u u+1

The bottom line here is to try some sort of substitution and see what happens. If it appears the substitution
did not make the integral any easier, try some other substitution.

Exercises
1. Evaluate each of the following indefinite integrals.
∫ ∫ ∫ √
x cos t
a) x3 (x4 + 6)4 dx b) √ dx c) √ dt
4x2 + 1 t
∫ ∫ ∫
x+2 x √
d) dx e) √ dx f) (t − 3) 2t + 1 dt
(x2 + 4x − 1)3 x+2
∫ ∫ ∫
6(x + 1) t x3
g) dx h) dt i) √ dx
(x + 3)(x − 1) (t2 + 8)2 a2 − x2
∫ ∫ ∫
et/2 x t
j) √ dt k) dx l) dt
1 − et 1 + x4 (1 − 2t)2

2. Evaluate each of the following definite integrals.


∫ π ∫ e2 ∫ 1
sin t ln x x
a) 2
dt b) dx c) dx
0 (3 + 2 cos t) 1 x 0 (1 + 2x2 )3
∫ √2 √ ∫ 3 ∫
√ 3
9x2
d) x 4 − x4 dx e) t t + 3 dt f) √ dx
0 0 2/3
3
3x − 1

x √
3. Evaluate √ dx two ways; first with u = x + 1, then with u = x + 1. Are the solutions the same?
x+1
∫ 4
x−1
4. Evaluate dx two ways; first using long division, then with a substitution.
1 x+2
x2 y2
5. Find the area of the ellipse defined by the equation 2 + 2 = 1, where a and b are positive constants. Hint:
a b
Express the area as an integral, but do not use the Fundamental Theorem of Calculus to evaluate it.
6. Let f and g be functions defined on appropriate intervals and assume that both f and g ′ are continuous functions.
Prove that ∫ ∫
b g(b)
f (g(x))g ′ (x) dx = f (u) du.
a g(a)

This fact offers a justification for the method of integration by substitution as applied to definite integrals. Hint:
Let F be an antiderivative of f and use the Fundamental Theorem of Calculus to evaluate both integrals.
100 Chapter 2 The Integral

2.10 Finding antiderivatives: integration by parts

In an attempt to find an antiderivative of x cos x, suppose that we try the function F (x) = x sin x. It follows
that F ′ (x) = x cos x + sin x; the product rule generates two terms, one desirable and the other not. To
eliminate the unwanted term sin x, it is necessary to add a term to F (x) whose derivative is − sin x. Since
cos x is such a function, the modified guess becomes F (x) = x sin x + cos x and

F ′ (x) = x cos x + sin x − sin x = x cos x,

as desired. The problem of finding the antiderivative of a function obtained as one part of the derivative of a
product of two functions is a common one and the technique of integration by parts formalizes the guessing
that was involved in this example.
Integration by parts is antidifferentiation’s answer to the product rule. Suppose that u and v are
differentiable functions of x. Then the product rule states that

( )′ ( )′
u(x)v(x) = u(x)v ′ (x) + v(x)u′ (x) or u(x)v ′ (x) = u(x)v(x) − v(x)u′ (x).

Rewriting the last equation in terms of antiderivatives yields


∫ ∫ ∫ ∫
u(x)v (x) dx = u(x)v(x) − v(x)u′ (x) dx, or simply

u dv = uv − v du;

the differential form is the most common way to express integration by parts. This formula shows how to
find the antiderivative of a function that can be identified as the product of one function with the derivative
of another. It may appear to be a circular formula since both sides of the equation involve an indefinite
integral, but if the integral on the right side of the equation is easier than the one on the left side, then
progress has been made.
As an illustration of integration by parts, we will redo the opening example.
∫ let u=x and dv = cos x dx;
x cos x dx,
then
du = dx and v = sin x;
∫ ∫
x cos x dx = x sin x − sin x dx = x sin x + cos x + C.

Note that when finding v from dv, the constant of integration is assumed to be 0. Two further examples
should clarify the essential ideas.
∫ let u=x and dv = e2x dx;
2x
xe dx,
then du = dx and v = e2x /2;
∫ ∫
1 2x 1 2x 1 1 1
xe2x dx = xe − e dx = xe2x − e2x + C = (2x − 1) e2x + C.
2 2 2 4 4
In this example, we are seeking an antiderivative for xe2x and beginning with the guess xe2x /2. Since xe2x /2
is a product of two functions, its derivative will have two terms. One of these terms is exactly the function
we desire (namely, xe2x ). Since the other term (e2x /2) is unwanted, it needs to be removed by subtracting
2.10 Finding antiderivatives: integration by parts 101

an appropriate function whose derivative is precisely this quantity, which in this case is e2x /4. Checking the
answer by differentiating indicates how this happens:

d ( 1 2x 1 2x ) 1 1
xe − e + C = xe2x + e2x − e2x = xe2x .
dx 2 4 2 2

For the second example, consider the definite integral

∫ 1 let u = arctan x and dv = dx;


arctan x dx,
0 then du = dx/(1 + x2 ) and v = x;
∫ 1 ∫
1

1
x π 1 1 π 1
arctan x dx = x arctan x − 2
dx = − ln 1 + x2 = − ln 2.
0 0 0 1+x 4 2 0 4 2
Note how the limits of integration are handled in the case of a definite integral.
Integration by parts is a viable option when the integrand consists of the product of two functions and one
of the functions is not a constant multiple of the derivative of the other function (integration by substitution
works in this case). It can even be used when the integrand consists of a single function as in the last
example. There are no specific guidelines for splitting the integrand into u and dv, but dv should be easy to
∫ ∫
antidifferentiate and the new integral v du should be simpler than the original one u dv. It is sometimes
useful and/or necessary to first make a substitution before using integration by parts. Furthermore, it is
sometimes necessary to perform integration by parts more than once to evaluate an integral.

Exercises
1. Evaluate each of the following indefinite integrals. Treat a as a nonzero constant.
∫ ∫ ∫
a) xe−x dx b) x sin 4x dx c) arcsin x dx
∫ ∫ ∫
d) ln x dx e) (ln x)2 dx f) x sin(ax) dx
∫ ∫ ∫

g) xeax dx h) x 2x + 1 dx i) x sec2 x dx
∫ ∫ ∫
j) x2 e−x dx k) x2 cos(ax) dx l) x arctan x dx

2. Evaluate each of the following definite integrals.


∫ π/2 ∫ 1 ∫ e
a) x sin 2x dx b) x cos(πx/2) dx c) x ln x dx
0 0 1
∫ 1/2 ∫ 4 √
∫ 1
x/2 x
d) 4xe dx e) e dx f) x3 sin(πx2 ) dx
0 1 0

3. Find the area of the region under the graph of y = ln x and above the x-axis on the interval [1, e2 ].

8x
4. Evaluate √ dx in two ways; one using a substitution and one using integration by parts.
4x + 3
5. What happens if a constant is added when determining v from dv in the formula for integration by parts?
102 Chapter 2 The Integral

2.11 Improper integrals

The definition of the integral requires a continuous function f on a closed and bounded interval [a, b]. A
definite integral is said to be improper if either the function f is unbounded on [a, b] or the interval of
∫1 ∫∞
integration is unbounded. For example, the integrals 0 x−2 dx and 2 e−x dx are both improper; the
first because 1/x2 is unbounded on (0, 1] and the second because the interval of integration is unbounded.
Integrals of the latter type are easy to spot since ∞ and/or −∞ appears as a limit of integration, but
improper integrals involving an unbounded function are more difficult to recognize since it is necessary to
consider the behavior of the function on the interval of integration. Although improper integrals do not fit
the “proper” definition of an integral, the limit concept makes it easy to assign them a meaning.
Suppose that f is continuous for all x ≥ a for some number a. Then
∫ ∞ ∫ b
f (x) dx = lim f (x) dx,
a b→∞ a

provided that the limit exists. When the limit exists, the integral is said to converge or be convergent;
∫b
if not, the integral is said to diverge or be divergent. An integral of the form −∞ f (x) dx, where f is
continuous for all x ≤ b, is defined analogously.
Two examples should be sufficient to illustrate this concept.
∫ ∞ ∫ b
1 1
dx = lim dx = lim ln b = ∞;
1 x b→∞ 1 x b→∞
∫ 2 ∫ 2 ( 2 )

ex/2 dx = lim ex/2 dx = lim 2ex/2 = lim (2e − 2ea/2 ) = 2e.
−∞ a→−∞ a a→−∞ a a→−∞

The first improper integral is divergent while the second is convergent. The first integral implies that the
area under the curve y = 1/x on the interval [1, ∞) is infinite. However, the reader should verify that the
area under the curve y = 1/x2 on the interval [1, ∞) is 1. This pair of results concerning the area of two
regions that have an infinite length is rather surprising since the two graphs look very similar.
On occasion, it is useful to know whether or not an integral over an infinite interval is convergent even if
its value cannot be determined exactly. The following intuitively clear result is helpful in this context. The
proof involves some advanced ideas and will not be presented here.

THEOREM 2.6 Suppose that f and g are continuous functions and that 0 ≤ f (x) ≤ g(x) for all x ≥ a.
∫∞ ∫∞
If the integral a g(x) dx converges, then the integral a f (x) dx also converges.
∫∞
To illustrate the use of this theorem, consider the improper integral 0
e−x cos4 x dx. It would be rather
difficult to find an antiderivative for the function e−x cos4 x and thus find the exact value of this integral.
∫∞
However, since e−x cos4 x ≤ e−x for all x ≥ 0 and since 0 e−x dx converges (the reader should verify this),
∫∞
the integral 0 e−x cos4 x dx converges by Theorem 2.6.
We now turn to the case in which f is unbounded on [a, b]. Suppose that a function f is continuous on
an interval [a, b) and that f becomes infinite at b. Then
∫ b ∫ c
f (x) dx = lim− f (x) dx,
a c→b a
2.11 Improper integrals 103

provided the limit exists. A similar definition is made if a function f is continuous on (a, b] and becomes
infinite at a. For example, the following improper integral is convergent since the appropriate limit exists:
∫ ∫
1
1 1
1 √ 1 √
√ dx = lim √ dx = lim 2 x = lim (2 − 2 a) = 2.
0 x a→0+ a x a→0 + a a→0 +


This integral is improper since the graph of y = 1/ x has a vertical asymptote at x = 0. As mentioned
earlier, you need to look carefully at the integrand to spot this type of improper integral.

Exercises
1. Determine whether or not the integral converges. If it converges, find its value.
∫ ∞ ∫ ∞ ∫ −3
1
a) e−x/4 dx b) 12xe−3x dx c) √
3
dx
0 0 −∞ 5x + 7
∫ ∞ ∫ ∞ ∫ ∞ 2
6 8 x + 6x + 3
d) 3
dx e) dx f ) dx
2 x 1 2x + 15 1 x4
∫ 1 ∫ 3 ∫ 7.5
4 8 10
g) √ dx h) dx i) √ dx
2 (3 − x) 15 − 2x
3
x 2
0 1

2. Use Theorem 2.6 to show that the integral converges.


∫ ∞ ∫ ∞ ∫ ∞
100 5 + 2 sin x
a) √ dx b) e−2x sin2 x dx c) dx
4 4 + x3 0 1 x2

3. Find all values of r for which the integral converges.


∫ ∞ ∫ ∞ ∫ 1
1 1
a) dx b) erx dx c) dx
1 xr 0 0 xr
∫∞
4. Suppose that f is continuous for all real numbers. Give a definition for the improper integral −∞
f (x) dx.
5. Suppose that f is continuous on [a, b] except at the point c ∈ (a, b) and that f becomes infinite at c. Give a
∫b
definition for the improper integral a f (x) dx.
6. Apply the definitions from the previous two exercises to the following integrals. Assume that a is a positive
constant.
∫ ∞ ∫ 1 ∫ 1
a3 1 1
a) dx b) √ dx c) dx
2
−∞ a + x
2
0
3
4x − 1 −2 x
2
∫ ∞
7. Given the hypotheses of Theorem 2.6, what conclusion can be drawn if f (x) dx diverges? Use this result to
∫ ∞ √
a

show that the improper integral x−3 x4 + 10 dx diverges.


1
104 Chapter 2 The Integral

2.12 Area between curves

Consider the region outlined in bold in the figure below and suppose that ℓ is a continuous function such
that ℓ(x) gives the vertical distance across the region for each value of x in the interval [a, b]. Divide the
interval [a, b] into n subintervals of equal length and use the right endpoint of each subinterval to determine
a rectangle that approximates the region on that subinterval.
...............................
.. .........................................
.. ...... ....................................................................
.. ...... ...... ...... ...... .............................................................................
ℓ(a) .. ...... ...... ...... ...... ...
...
..
.. ...... ...... ...... ...... ...
...
.. ℓ(b)
.. ...... ...... ...... ...... ℓ(a4 ) ..................... For each x between a and b,
.. ...... ...... ...... ...... .
..
................................................................................................................................ ... ℓ(x) is the distance across the figure.
....... ..
.....
..... ...
.....
...... .....
.......... b−a
ai = a + i ·
|...................|..................|...................|...................|.............................................................................................|..................|. n
···
a = a0 a1 a2 a3 a4 an−1 an = b

Let A be the area of this region. The sum of the areas of the n rectangles gives an approximation to A:


n ∑
n
b−a
A≈ ℓ(ai )(ai − ai−1 ) = ℓ(ai ) .
i=1 i=1
n

It is intuitively clear that the approximation to A becomes more accurate as n increases. Since ℓ is a
continuous function on [a, b], the definition of the integral yields


n ∫
b−a b
A = lim ℓ(ai ) = ℓ(x) dx.
n→∞
i=1
n a

Thus, area is the integral of cross-sectional length, a generalization of the formula for the area of a rectangle
(length × width), where the length is constant across the width of the figure.
A useful way to remember the integral version of the area formula is given in the following figure:
...........
.. ............ ℓ(x) is the distance across the figure at x;
.. .............
.. .................................................
.. ............................. ......... ℓ(x) . dx is the small width of a rectangle;
....... ........ ..
.
dx ...... . dA = ℓ(x) dx is the area of a thin rectangle;
.... ...
....
.... ... the total area is the sum of these little areas;
∫ ∫ b
............
A = dA = ℓ(x) dx.
|..........................................................................|..........................................................................|. a
a x b

The key step is the determination of a formula for the distance across the figure. In our illustrations, we
have assumed that this distance is vertical, but it may be better to look at the figure horizontally (as in the
example in the next paragraph) or in some other direction.
As an example, we will find the area of the region bounded by the curves x = 2y 2 and x − 2y = 4. The
first curve is a parabola opening to the right and the second curve is a straight line. Substituting x = 2y 2
into the equation x − 2y = 4 yields y 2 − y − 2 = 0, and it follows that the curves intersect at the points
2.12 Area between curves 105

(2, −1) and (8, 2). A glance at the graph (see the figure) indicates that the region between the curves is
easier to describe horizontally than vertically. For each value of y between −1 and 2, the distance ℓ(y) across
the figure is ℓ(y) = (2y + 4) − 2y 2 , the x-value of the curve on the right minus the x-value of the curve on
the left. The calculation of the area A of the region between the curves is given to the right of the graph.
y .
......
... ..
.
............... ∫ 2
2 x = 2y 2 ....................................... .......
.... ............ A= (2y + 4 − 2y 2 ) dy
. . . . . . . . . . . . . . ..
...
.................................................................................. −1
y
.....................................................ℓ(y) ... ( 2 ) 2
.......
....
. ..
. . . .. . . ........ = y 2 + 4y − y 3
........ 3 −1
..........
..... x
........ . . (
2
............. ........ x = 2y + 4 . . . .. 8 16 ) ( 2)
.... = 4+8− − 1−4+
−1
..... ......... ........................................... 3 3
....... .............................
.................. = 9.
−3

It is also possible to determine the area of this region using vertical cross-sections, but more effort is involved.
The curves need to be in the form y = f (x), and two integrals are needed because the lower boundary changes
when x = 2. The reader should verify that
∫ √ ∫ 8 (√
2
x x x − 4)
A= 2 dx + − dx.
0 2 2 2 2

When finding the area between curves, it is a good idea to first decide whether vertical or horizontal cross-
sections provide the best approach; one method is often easier than the other.

Exercises
1. Find the area of the region bounded by the given curves. Treat a as a positive constant.
a) y = 2x − x2 , y = 2 − x b) x = y 2 , 2y = 3 − x c) xy = 4, x + y = 5
2 4 3
d) y = x , y = ax e) y = x , y = a x f ) xy = a2 , x + y = a2 + 1, (a > 1)
2. Find the area of the region bounded by the curves y = x and y = 6 − x in two different ways. 2

a) Find the vertical distance across the region and integrate with respect to x.
b) Find the horizontal distance across the region and integrate with respect to y.
3. Find the total area of the region bounded by the curves y = 4x and y = x3 .
4. Find the area of the region bounded by the curves y = e2x , y = ex/2 , and y = 4.
5. Let R be the region under the curve y = 9/x2 and above the x-axis on the interval [1, 3].
a) Find a vertical line that divides the region R into two pieces of equal area.
b) Find a horizontal line that divides the region R into two pieces of equal area.
√ √ √
6. Let a > 0. Find the area of the region bounded by the curve x + y = a and the coordinate axes.
7. Let R be the region between the curves y = 3/x2 and y = 12/x3 on the interval [1, ∞). Find the area of R.
Hint: Sketch the graphs carefully to determine the region R.
8. Let R be the region in the first quadrant that is bounded by the curves y = 1/x, y = x, and y = 3x. Find a
value for a so that the line y = ax divides the region R into two pieces of equal area.
106 Chapter 2 The Integral

2.13 Volume: method of cross-sections

Let f be a continuous nonnegative function defined on an interval [a, b] and let R be the region under the
curve y = f (x) and above the x-axis on the interval [a, b] (see the left graph in the figure). Consider the
three-dimensional solid S that is generated when the region R is revolved around the x-axis (a rather warped
hourglass that is lying on its side in this case). A solid formed in this manner is called a solid of revolution.
To determine the volume of S, we approximate the region R with rectangles, then approximate the volume
of S by summing the volumes of the cylinders generated by the rectangles when they are revolved around
the x-axis. As we have done before, divide the interval [a, b] into n subintervals of equal length and use the
right endpoint of each subinterval to determine the height of a rectangle that approximates the curve (see
the right graph in the figure).
y y
.. ..
.... ....
...... ......
y = f (x) y = f (x)
. ... . .......
.................... ... ..... ....................
...........................................
.. ....
............ . ....... ... ...
...
f (a) .... . . f (a) ...
... .... ...

.. ...... . ..
.... ... ...
...
.. ............................
... .....................
..
... ... .
...
...
... ... ... ...
f (b) ...
... ... . ... f (b) ...
. ...
...
...
...
...
...
.. . .... ...
...
...
... ... .... ... ... ... ...
...
... .. .. ...
...
...
... ... ... ......... ......... ...
...
...
...
...
...
...
... .
..........
. ...
...
...
... .... ... ....... ..... ...
...
...
...
...
...
...
...
...
...
...
...
..... ...
..
...
.......
...
...
...
...
.. . . .. ... ... . ... ... ... ..
... ... ... ... ... ... ... ...
..
... R ..
...
...
...
...
...
...
...
...
...
... ...
... ...
...
... ... ... ... ... ... ... ...
.... .... ... ... ... ... ... ...
. . .................. x . . . .
.. . ................. x
a b a0 a1 a2 a3 · · · an−1 an

When the rectangles are revolved around the x-axis, flat cylinders or disks (think of a quarter or dime
standing on its edge) are generated. The radius of the disk is the height of the curve and the small height of
the cylinder is the length of the subinterval on the x-axis, namely, (b − a)/n. The volume of each cylinder is
thus π(f (ai ))2 (b − a)/n. As n increases, the cylinders formed by the rotated rectangles better approximate
the solid S. By the definition of the integral, the volume V of the solid S is given by
∑ ∫ b
n
( )2 b − a ( )2
V = lim π f (ai ) = π f (x) dx.
n→∞
i=1
n a

As with area, there is an easy way to remember this volume formula. When a vertical line at position
x is revolved around the x-axis, a thin disk is generated. Summing the volumes of all these disks gives the
total volume of the solid.
y .
...... f (x) is the radius of the disk;
....... y = f (x)
dx is the height of the disk;
.............dx ....
.. .....
f (a) ......... ......... .
. . π(f (x))2 is the area of the base of the disk;
.... ..
f (b)
....
... ... ..
... . ... ....... dV = π(f (x))2 dx is the volume of the disk;
... ..
...
... . ...
...
...
... .
... ... ... ...
... (rate of change of volume is cross-sectional area);
...
...
....
...
...
...
. .
..... ...
...
...
.. . . .. . .. total volume is the sum of all the little volumes;
...
..
....
... f (x) ....
... ∫ ∫ b
... .. ..
...
..
...
..
...
.. ................
. x V = dV = π(f (x))2 dx.
a x b a

The cross-sections of a solid of revolution are circles. More generally, suppose a solid S extends from
x = a to x = b and that for each value of x between a and b, the area of the cross-section perpendicular to
2.13 Volume: method of cross-sections 107

the x-axis has area A(x). For a solid of revolution, we would have A(x) = π(f (x))2 since the cross-sections
are circles. However, the cross-sections could be squares or triangles or some other shape. Then, assuming
∫b
that the area function A is continuous on [a, b], the volume V of S is given by V = a A(x) dx. In other
words, the volume of a solid is the integral of its cross-sectional area. The derivation of this formula is similar
to the derivation for the special case in which the solid is a solid of revolution.
As an example, consider the region that is bounded by the curves xy = 4 and x + y = 5 (see the figure).
When this region is revolved around the x-axis, the cross-sections of the resulting solid are washers. (A
more technical term for this cross-section is annulus, the region between two concentric circles.) The area
of the washer at position x is the area of the large circle (which has radius 5 − x) minus the area of the small
circle (which has radius 4/x). Hence, for each value of x between 1 and 4, the area of the cross-section is
π(5 − x)2 − π(4/x)2 . The volume V of this solid is computed to the right of the figure.
y
.......
... ............................................
... ..
4 ........
... ...... .. ............................................................................................................... ∫ 4( )

... ..... x + y = 5 ....................... .......... 5−x............................. V = π(5 − x)2 − π(4/x)2 dx


3 ... ...... ................... ... ... ... ......
. .
(
1
) 4
..................................... 16
...
.... ....
..... ......... .... ..... ...... ....
4/x . ... ... .
... ... .... .. 1
= π − (5 − x)3 +
2 ....... .... .... ........... ... . .. 3 x 1
...................
.
................. ..................... .. . .
........... ........................................................... = 9π.
1 xy = 4 ...................................................................
......................
.....
.......... x
1 x 4 washer cross-section
Now suppose that the region bounded by the curves represents the base of a solid that is sitting on the page
and projecting out toward you. Suppose further that each cross section of this solid taken perpendicular to
the x-axis is a square. Since the side of each square is 5 − x − (4/x), the volume of the solid is
∫ 4( ∫ 4(
4 )2 40 16 ) ( x3 16 ) 4
5−x− dx = x2 − 10x + 33 − + 2 dx = − 5x2 + 33x − 40 ln x − = 57 − 80 ln 2.
1 x 1 x x 3 x 1

Exercises
1. Let R be the region under the given graph and above the x-axis on the specified interval. Find the volume of
the solid that is generated when R is revolved around the x-axis.
a) y = x2 , [0, 3] b) y = ex , [0, 2] c) y = 2/x, [1, 4]
2. Find the volume of the solid that is generated when the region bounded by the curves y = x2 and y = 2x is
revolved around the (a) x-axis, (b) y-axis.
3. Find the volume of the solid that is generated when the region bounded by the curves y = x and y = 3x − x2 is
revolved around the x-axis.
4. Derive the formula for the volume of a cone of radius r and height h. Hint: Revolve an appropriate region.
5. Derive the formula for the volume of a sphere of radius r. Hint: Revolve an appropriate region.
6. A sphere of radius r is cut in two pieces by a plane perpendicular to a diameter of the sphere. The smaller piece
has height h (when lying on its flat side). Find its volume.
7. Let R be the region bounded by the curves y = x2 and y = 4. Find the volume of the solid that is generated
when R is revolved around (a) the line x = 3, (b) the line y = 4.
8. Suppose that the base of a solid is the part of the parabola y = 80 − 0.1x2 that lies above the x-axis and that
each cross section perpendicular to the y-axis is an equilateral triangle. Find the volume of this solid.
9. Suppose that the base of a solid is the region bounded by the curves y = x2 and y = 5x and that each cross
section perpendicular to the x-axis is a square. Find the volume of this solid.
10. Derive the formula for the volume of a square pyramid with base a × a and height h.
108 Chapter 2 The Integral

2.14 Volume of solids of revolution: method of cylindrical shells

Another way to find the volume of a solid of revolution is called the method of cylindrical shells or simply
the shell method. Once again, let f be a continuous nonnegative function defined on an interval [a, b],
where a > 0, and let R be the region under the curve y = f (x) and above the x-axis on the interval [a, b].
Consider the three-dimensional solid S that is generated when the region R is revolved around the y-axis.
To determine the volume of S using the shell method, we approximate the region R with rectangles and sum
the volumes of the cylindrical shells generated by the rectangles when they are revolved around the y-axis.
Letting the number of rectangles increase indefinitely will give the volume V of S. To accomplish this,
divide the interval [a, b] into n subintervals of equal length and use the midpoint ci = (ai−1 + ai )/2 (where,
as usual, ai = a + i (b − a)/n) of each subinterval to determine the height of a rectangle that approximates
the curve. (The use of the midpoint rather than the right endpoint will become clear in a moment.) When
these rectangles are revolved around the y-axis, cylindrical shells (visualize a hollow tin can) are generated.
y y
..... .....
........ ........
y = f (x) y = f (x)

...................... .......... ...................... ....................................


......................

............ .......
.. ...
... .. . .........................
... .. .......................
f (a) ... .. ..... f (a) ...
...
.... ...
.
.
..
.
....
. .......... .............
.
...
...
...
...
...
... . .
. ... .
... ... ..... ...
...
...
...
... .. .......... .........
... .
f (b) ... f (b) ... ... ...
..
.. ...
...
... ... ...
...
... ... . ...
...
...
...
...
...
...
. .. ...
...
... . ... ...
... ... ...
.... ... .......... .........
...
...
... ..
. . ...
...
... ...
...
...
...
...
...
...
....
........... ...
....
...
...
... ........ ...
...
... ....
...
...
...
...
.. . ... ... ..
... . ... ... ... ..
... ... ... ... ... ... ... ...
.. .. ... ... ... ... ... ...
...
... R ...
...
... ...
...
...
...
...
..
... ...
... ...
...
... ... c ...
... 1 ... 2 ... 3 .... c c ...
...
cn ...
.... .... ..................... .. ... ... ... ... .....................
x | | | . | x
a b a0 a1 a2 a3 ··· an−1 an

Let Vi be the volume of the cylindrical shell that is generated when the rectangle above the interval [ai−1 , ai ]
is revolved around the y-axis. The volume of a cylindrical shell is the area of the annulus that forms its base
times the height of the cylinder. Since the inner radius of the annulus is ai−1 and the outer radius is ai ,

( )
Vi = πa2i − πa2i−1 f (ci ) = π (ai + ai−1 )(ai − ai−1 )f (ci ) = 2πci (ai − ai−1 )f (ci ).

Using the more general definition of the integral (see Section 2.5), it follows that


n ∑
n ∫ b
V = lim Vi = lim 2πci f (ci )(ai − ai−1 ) = 2πxf (x) dx.
n→∞ n→∞ a
i=1 i=1

(You should now see the reason for the choice of the midpoint ci .)
The shell method volume formula is easy to remember using the fact that an area sweeps out a volume
as it moves. The f (x) by dx “rectangle” (see the following figure) moves in a circle of radius x as it revolves
around the y-axis and therefore sweeps out a volume of 2πxf (x) dx, the product of distance traveled (2πx)
and area (f (x) dx). The sum of all these tiny volumes gives the total volume. Rather than memorize
a specific formula for the shell method, you should remember this essential idea, an idea that is especially
2.14 Volume of solids of revolution: method of cylindrical shells 109

helpful when the region is revolved around lines other than the coordinate axes (as in the following example).
y
......
...... f (x) is the height of the rectangle;
y = f (x)
..............dx ....
.. .....
dx is the width of the rectangle;
f (a) .... .......
... .
.
. . 2πx is the distance the rectangle travels around the y-axis;
f (b) ..
..
... ..
... .
... ..
.. ....
. dV = 2πxf (x) dx is the volume swept out by the rectangle;
... .
... ..
. .
..
.. ...
...
........ ... total volume V is the sum of all the little volumes;
.. ...
...
.. ∫ ∫ b
.. ... f (x)
...
.. V = dV = 2πxf (x) dx.
. ...
.
................... x a
a x b

To illustrate the shell method, consider again (see the last section) the region bounded by the curves
xy = 4 and x + y = 5, and suppose that this region is revolved around the line x = −2. A thin vertical
rectangle at position x moves in a circle of radius x + 2 when it is revolved around the line x = −2. The
height of the rectangle is the distance between the two curves; (5 − x) − (4/x). The volume V of the resulting
solid is thus ∫ 4 ( 4)
V = 2π(x + 2) 5 − x − dx = (39 − 32 ln 2)π.
1 x
(Multiply out the integrand to evaluate the integral.) Note that this volume is different than the volume 9π
obtained when the same region is revolved around the x-axis.

Exercises
1. Let R be the region under the given graph and above the x-axis on the specified interval. Find the volume of
the solid that is generated when R is revolved around the y-axis.
a) y = cos x, [0, π/2] b) y = 10/(1 + x2 ), [0, 3] c) y = ex , [0, 2]
2. Let R be the region in the first quadrant that is bounded by the curves y = 7x and y = 16x − x3 . Find the
volume of the solid that is generated when R is revolved around the y-axis.
3. Let R be the region bounded by the curves y = x2 and y = 2 − x2 . Find the volume of the solid that is generated
when R is revolved around the line x = 3.
4. Use the shell method to derive the formula for the volume of a cone of radius r and height h.
5. Use the shell method to derive the formula for the volume of a sphere of radius r.
6. A cylindrical hole of radius r is bored through the center of a sphere with radius R > r. The remaining solid
resembles a bead since it has a flat top and bottom with a hole through the middle.
a) Find the volume of the bead.
b) Find a value of r (in terms of R) that leaves a solid with exactly half of the original volume of the sphere.
c) Represent the volume of the bead as a function of the height h of the bead.
7. Suppose that 0 < r < a and consider the circle defined by the equation (x − a)2 + y 2 = r2 . Let D be the region
inside this circle and let R be the right half of D. Find the volumes of the solids that are generated when the
regions D and R are revolved around the y-axis.
8. Let R be the region under the graph of y = 2x2 and above the x-axis on the interval [0, 3]. Set up, but do not
evaluate, an integral that represents the volume of the solid that is generated when R is revolved around (a) the
line x = 3, (b) the line y = 20, (c) the line x = −1, and (d) the line y = −4.
9. Let Ra be the region under the curve y = x2 and above the x-axis on the interval [0, a], where a is a positive
constant. Find a value for a so that the volume of the solid that is generated when Ra is revolved around the
y-axis is one-half the volume of the solid that is generated when R2 is revolved around the y-axis. Answer the
analogous question for the case in which the regions are revolved around the line x = 2.
110 Chapter 2 The Integral

2.15 Force exerted by a liquid

Anyone who has had the pleasant experience of diving into a lake or swimming pool on a hot summer day
has probably noticed some pain in their ears when they were underwater. This pain results from the pressure
of the water against the eardrum and this pressure increases with depth. It has been experimentally verified
that the pressure exerted by a liquid at a given point in the liquid is the same in all directions and has
magnitude wd, where w is the weight density of the liquid and d is the depth of the given point. For our
purposes, the units for w will be pounds per cubic foot, the units for d will be feet, and the units for pressure
will be pounds per square foot. The problem considered in this section is to find the force (in pounds)
exerted by a liquid on one side of a submerged plate.
This problem has a simple solution if the plate is horizontal—just multiply the pressure exerted by the
liquid at the depth of the plate by the area of the plate. The problem is more difficult if the plate is vertical
for then different parts of the plate occupy different depths and therefore experience different pressures. To
make the discussion more precise, suppose that a plate is submerged vertically in a liquid with weight density
w. Since points on the plate at the same depth experience the same pressure, the horizontal distance across
the plate at a given depth is the most important quantity to know. Let ℓ(y) represent the distance across
the plate at vertical position y and assume that ℓ is continuous on [a, b] (see the figure).
y
......
......
surface of liquid
c ....................................................................................................................

........ At vertical position y,


b
...... ........
..... ...
...
ℓ(y) dy is the area,
... ...
... c − y is the depth,
... ..
... ℓ(y) ..... w(c − y) is the pressure,
y . .
dy.................................................................
..... ... dF = w(c − y)ℓ(y) dy is the force.
...... ..
........
........... ..... The total force F on the plate is the sum of all the forces;
∫ ∫ b
a ......
F = dF = w(c − y)ℓ(y) dy.
a
......
........... x

The data to the right of the figure provides a quick derivation of the formula
∫ b
F = w(c − y)ℓ(y) dy
a

for the force F exerted by the liquid on one side of the submerged plate. The details of this derivation using
the definition of the integral will be left as an exercise.
As a specific example, suppose that a window on the side of a swimming pool is in the shape of an
inverted parabola and has the dimensions and depth recorded on the figure below. To find the force on the
window, we can place the origin in any convenient position. We will assume that the origin is the midpoint
of the base of the window. The parabola then has the form y = 3 − kx2 , where k is some constant. Since
y = 0 when x = ±3, we find that k = 1/3. Hence, an equation for the parabola is y = (9 − x2 )/3. We need
to solve this equation for y in order to find the horizontal distance across the figure. Solving for y yields
2.15 Force exerted by a liquid 111

√ √
y = ± 9 − 3y so the horizontal distance across the window is 2 9 − 3y for each value of y between 0 and 3.
The force F of water on this window is computed to the right of the figure. (Take some time to understand
each of the quantities that appears in the integrand and to determine the substitution that was used in the
evaluation of the integral.)

..............................................................................................................................................................................................
water level
2′
∫ 3 √
................ F = w(5 − y) 2 9 − 3y dy
... ......... ...... ............... 0
... ..... ∫
....
..
.. .... 2w 9
.. .. ′ ..
..
.
. .... = (6 + u) u1/2 du
.. 3 ... 9 0
.. . .
.
..
.. ...
.. ..
..
... = 45.6w.
.. .
..................................................................................................................
..


6

The weight density of fresh water is approximately 62.4 pounds per cubic foot so the water exerts a force of
about 2845 pounds on the window.

Exercises
1. Suppose that the parabolic plate in the example is placed horizontally on the bottom of a swimming pool that
is 12 feet deep. Find the force exerted by water on this plate. Use w = 62.4 lb/ft3 .
2. Find the force exerted by a liquid with weight density w on one side of each vertically submerged plate. The
units on the figures are feet and the top of each plate is four feet beneath the surface of the liquid. For part (b),
make certain that you correctly identify the height of the triangle.
a) b) c)
......................................6........................................... ... ............2......................
.. ... ..... ...... ... .. ..
.. ........... ...
. ...
... ...
.. ..
.. ...
.. ... .. ... .. ...
...... . .. ...
..

5 ... . . .. 6.. .. ... ..


..
...
... ... 5 .....
.. ...... ... ... ...
...
.. ...........
.
.. ... .. ...
.... ..
..
...
...
..
.. ........ ....................................................................
..
..
......................................................................
..
....
6
right triangle equilateral triangle trapezoid

d) e) f) 6
...............................................................................
... .. .
.........
........ ............ ... ..
..
...
.....
..
... ... .. .
.......................... ... ..
..
....... ...... ............ ....... ... ... ... ... ... ... ... ... ... ... ... ... ..........
..
. ... 5 .... .
.
..
. .. ... ... ..
.. 3 ....
..
... ... 4 .. ... .... ...
..
.. .
... . ...
...... . . .... ... ...
................................................................................... .........................
..
.....................

semicircle circle parabola

3. Let R be the region bounded by the curves y = 4x and y = x2 . Suppose that R is a vertical plate with the
water level at the line y = 20. Assuming all units are feet, find the force exerted by water on one side of R. Use
w = 62.4 lb/ft3 .
4. Provide the details of the derivation of the formula for F that is given in the text. You will need to use words
as well as equations; see the derivations in the last three sections for the correct style.
112 Chapter 2 The Integral

2.16 Work

In physics, work is defined as the product of force and displacement. If a force of 8 lbs is applied over a
displacement of 4 ft, then the work done by the force is 32 ft-lbs. Since force and displacement are actually
vector quantities (they have both magnitude and direction), this sort of computation is only valid when the
force and displacement have the same direction. We will make this assumption throughout this section. In
this case, the work W done by a constant force F operating over a distance d is defined by W = F d. If the
force is variable, then this formula for work is no longer valid. As you should expect by now, the definite
integral is needed to handle this situation.
Suppose that F is a continuous function defined on an interval [a, b] and that F (x) represents the force
applied at the point x. We want to find the work done by this variable force F over the interval [a, b]. As
usual, divide the interval [a, b] into n subintervals of equal length and let ai = a+i(b−a)/n for i = 0, 1, . . . , n.
On the interval [ai−1 , ai ], the force is almost constant and has a value very close to F (ai ). Hence, the work W
∑n
done by the force F over the interval [a, b] is approximated by W ≈ F (ai )(ai − ai−1 ). The approximation
i=1
to W improves as n gets larger, so we conclude that


n ∫ b
W = lim F (ai )(ai − ai−1 ) = F (x) dx.
n→∞ a
i=1

In other words, the work W done by the variable force F is dW , where dW = F (x) dx is the little bit of
work done by the force F (x) over the short distance dx. The following problems illustrate the use of this
formula.

Problem: A force of 8 pounds is required to hold a spring 6 inches beyond its natural length. How much
work is required to stretch the spring 8 inches beyond its natural length?

Solution: The force F (x) required to hold a spring (either compressed or stretched) x units from its
equilibrium position (x is positive if stretched, negative if compressed) has been experimentally verified to
be F (x) = kx, where k is a constant that depends on the size and composition of the spring. This fact is
known as Hooke’s Law. In order to compute work in ft-lbs, we will use ft and lbs as our units. Since a force
of 8 lbs is required to hold the spring 0.5 ft beyond its natural length, Hooke’s Law becomes 8 = 0.5k. It
follows that k = 16 lb/ft. The work W required to stretch this spring 8 inches beyond its natural length is
thus ∫ 2/3
32
W = 16x dx = ft-lb.
0 9
The reader should verify that it takes 8/9 ft-lb of work to stretch the spring the first four inches and 24/9
ft-lb of work to stretch the spring the next four inches.

Problem: An inverted cone with a diameter of four feet and a height of six feet is full of water. Find the
minimum amount of work required to pump the water to a level three feet above the top of the cone.

Solution: The work here is done against gravity. Each layer of water is raised a different distance and thus

requires a different amount of work. It is probably best to think in terms of the formula W = dW , where
2.16 Work 113

dW represents the little bit of work required to raise a thin layer of water. Let x be the distance in feet from
the bottom of the cone and let r be the radius in feet of the cone at position x (see the figure). The layer
of water at position x has volume πr2 dx ft3 and this layer must be raised a distance of 9 − x ft. Using the
fact that the weight density of water is 62.4 lbs/ft3 , it follows that the work dW required to lift this layer of
water is dW = 62.4πr2 dx(9 − x). The calculations to find the total work required are shown to the right of
the figure.

.......................................
.. ............ ......
.. ...
.... 2
................................................................
.. ∫
...... ...
...
. .. .
............... ..
.
...... W = dW
... ............................................... ...
...

... ... . ∫
..
...
... ...
.
... ....
= 62.4πr2 dx(9 − x)
x r ...
...
...
...
...
= ... ....
. . ∫
...
( x )2
6 2
x ... 6 .......... ... =
6
62.4π (9 − x) dx
r= ... ... .
3 ... ........ ............... ........ ... 0 3
......... .............r.................... ∫ 6
................................................... 62.4π
... ....... .. = (9x2 − x3 ) dx
...x...... .. 9 0
... ....... ..
........... ≈ 7057
........
Thus, the minimum amount of work required for this process is approximately 7057 ft-lbs.

Exercises
1. A force of 4 lbs is required to hold a spring 4 inches beyond its natural length. How much work is required to
stretch the spring 10 inches beyond its natural length?
2. A force of 8 lbs is required to hold a spring 4 inches beyond its natural length. How much work is required to
compress the spring 2 inches from its natural length?
3. Suppose that 40 ft-lbs of work are required to stretch a spring from its natural length to 6 inches beyond its
natural length. How far from its natural length will 100 ft-lbs of work stretch it?
4. A hemispherical bowl with a radius of five feet is full of water. Find the minimum amount of work required to
pump the water to a level two feet above the top of the bowl.
5. A tank in the shape of an inverted cone with a radius of 6 feet and a height of 10 feet contains water that is 9
feet deep. Find the minimum amount of work required to pump the water up and over the top of the tank.
6. The cross-section of a trough filled with water is an equilateral triangle (vertex down) with side length 4 feet
and the trough is 8 feet long. Find the minimum amount of work required to pump the water up and over the
top of the trough.
7. A chain weighing 1 lb/ft is used to raise a 100 lb object 40 ft out of a well. Find the work required to do this.
8. Recall that Newton’s law of gravitation states that the force due to gravity is inversely proportional to the
square of the distance from the center of the earth, that is, F = k/d2 . The constant k can be determined from
the weight of the object on the surface of the earth. For the following problems, use 3960 miles for the radius of
the earth and mi-lbs as the units of work.
a) Find the work required to launch a 500 lb satellite from the earth’s surface to an orbit of 800 miles.
b) Find the work required to “lift” a 200 lb object from the earth’s surface to a height of 100 miles.
c) Find the work required to move an 800 lb object from 100 miles to 600 miles above the earth.
114 Chapter 2 The Integral

2.17 Center of mass

For the record, the discussion in this section will be quite informal. Also, we assume that the reader has
some intuitive sense concerning center of mass. For example, the center of mass of a circle is its geometric
center; a circle will be balanced on the tip of a pencil placed at its center. The center of mass of a collection
of particles makes it possible to treat the collection as a single point. For instance, a point on the surface of a
punted football travels in a very wobbly path, but (ignoring air resistance) the center of mass of the football
travels in a parabola. The center of mass of an object makes a number of calculations involving Newton’s
second law (F = ma) much easier; it also shows up regularly in other applications. In many engineering and
physics textbooks, the formulas for the coordinates of the center of mass of a collection of particles are given
by ∫ ∫ ∫
x dm y dm z dm
x= ∫ , y= ∫ , z= ∫ .
dm dm dm

The symbol dm in the integral y dm represents the little bit of mass that occupies the space where the
y-coordinate has the value y. (Some experience on a playground toy known as a teeter-totter should make
it clear why the product y dm of distance and mass, often called a moment, is relevant.) These formulas
are quite general. They include the cases in which (i) the collection of particles occupies one, two, or three
dimensions, (ii) the density varies from point to point, and (iii) the number of particles is finite or infinite.
Because of all this variability, it takes some practice to learn how to apply them in various situations.

As an example, let R be the region bounded by the curves y = x and y = 2 x and assume that the
density ρ of this region is constant. (The units of ρ in this case would be mass/area.) A sketch of the
region is given below as well as the calculations required to find (x, y). The little bit of mass at position x
( √ )
is determined by the vertical distance between the curves at position x. Thus dm = ρ dA = ρ 2 x − x dx.
The little bit of mass at position y is determined similarly using the horizontal distance between the curves
at position y.
y
......

...
.......
.... 4 √
4 x = y 2 /4
√ ......
.
......... .
. xρ(2 x − x) dx
.
y = 2 x ................ x = ∫0 = 1.6
.. .. 4 √
3
.... .............................. ρ(2 x − x) dx
y ..... .. ..
...
...
...

............... ....... x = y
0

.
... ∫ ( )
2 ... .... .. 4
y2
.... ...........
y=x
. yρ y − dy
.
.... ..... y = ∫0 4 (
4
) =2
1 .. .... y2
ρ y−
.. .... dy
......... 0 4
............
..... x
1 x 2 3 4

As in the example, for most of the problems we will consider, the density ρ of the object will be
constant. Its units will be mass/length, mass/area, or mass/volume depending on the situation. Referring
to the example, note how a constant ρ cancels in the formulas for the coordinates of the center of mass. When
the density is constant and an object has symmetry, the center of mass always lies on any line of symmetry
of the object. For example, the center of mass of a rectangle with constant density is at its geometric center.
2.17 Center of mass 115

We now consider a rather general three dimensional problem. Let f be a continuous nonnegative function
defined on [a, b], let R be the region under the curve y = f (x), and let S be the solid that is generated when
R is revolved around the x-axis. We want to find the center of mass of S. By symmetry, both y and z are
zero. To find x, note that the little bit of mass at position x depends on the size of the circular cross-section
at x. It follows that dm = ρπ(f (x))2 dx and thus
y .
.......
.. ..
y = f (x)
....... ..... ∫ ∫
.......... ............
b b
f (a)
.. . ... .... xρπ(f (x))2 dx x(f (x))2 dx
f (b)
..
... ... .. . ...
...
...
... .
... ..
. .. .........
.
a
x= ∫ b
= ∫ a
b
.
...
...
...
... ...... ..... 2 2
...
....
...
.... ..
... ρπ(f (x)) dx (f (x)) dx
... ... ...
.. .. .. a a
... ... ... ...........
.... x
a x b

Exercises
Unless noted otherwise, assume that all “objects” have constant density.
1. Find the center of mass of the following collection of masses: a mass of 3 at (3, 2, 1), a mass of 2 at (2, −1, 2), a
mass of 4 at (5, −2, 4), and a mass of 1 at (4, 6, 0).
2. A two kilogram mass is located at (−1, 1) and a five kilogram mass is located at (2, 4). Where should a one
kilogram mass be located so that the collection has the origin as its center of mass?
3. Find the center of mass of the region under the curve and above the x-axis on the specified interval.
a) y = x2 , [0, 3] b) y = 4/x, [1, 4] c) y = 1 − x4 , [0, 1]
4. Find the center of mass of the region bounded by y = x2 and y = ax, where a is a positive constant.
5. Find the center of mass of a quarter circle of radius r.
6. Find the center of mass of a solid cone with height h and radius r.
7. Find the center of mass of a solid hemisphere of radius r.
8. A stump with constant density and the shape of a chopped off cone has a top radius of one foot, a bottom radius
of two feet, and a height of two feet. How far above the ground is its center of mass?
9. Suppose that a thin straight rod with a length of 1 meter has variable density. The density x meters from its
left end is ρ(x) = 10(x2 + x + 1) grams per meter. Find the center of mass of this rod.
10. Let f be a continuous nonnegative function defined on an interval [a, b] and let R be the region under the graph
∫ ∫
ρ b( )2
of y = f (x) and above the x-axis on the interval [a, b]. Show that y dm = f (x) dx.
2 a
11. Use the previous exercise to find the center of mass of the region under the graph of y = x3 on the interval [0, 2].
12. Prove the Theorem of Pappus: the volume of a solid of revolution is the product of the area of the revolved
region and the distance traveled by its center of mass during the revolution.
13. Consider the region bounded by the curves y = 4x and y = x2 . Find the volume of the solid that is generated
when this region is revolved around the line y = 4x. Hint: Use the Theorem of Pappus.
14. Prove that the force exerted by a liquid on one side of a submerged vertical plate is equal to the force exerted by
the liquid on the same plate if the plate is placed horizontally at the depth of the center of mass of the vertical
plate.
15. Find the x-coordinate of the center of mass of the solid that is generated when the region under the curve and
above the x-axis on the specified interval is revolved around the x-axis.
a) y = x2 , [0, 4] b) y = ex , [0, 2] c) y = 9/x2 , [1, 3]
116 Chapter 2 The Integral

2.18 Arc length

Let f be a continuous function defined on an interval [a, b] and consider the following problem: find the length
s of the curve y = f (x) from the point (a, f (a)) to the point (b, f (b)). As we have done in our previous
work, let n be a positive integer and define ai = a + i(b − a)/n for i = 0, 1, 2, . . . , n. An approximation to s
can be obtained by finding the length of the polygonal path (a sequence of line segments) joining the points
(ai , f (ai )) for i = 0, 1, 2, . . . , n (see the figure).
y
.....
........

..................................•.....................................................•............. y = f (x)
..
f (b) .•
..
....
. ..

......
........
.................... ....
.
............•
...
. .
.
....
.
..... .
.
..
...
...
.. ... .
...
..................
...
...
...
...
...
...
... .....
...
...
....... .....
.
.
.
. .
.... .....
...
............... ....... . ....
..
........
............ ................................................•..
..
.
.....
f (a) •.... ....•........

............
..... x
a0 a1 a2 a3 ··· an−2 an−1 an

Since each portion of the polygonal path is a straight line, its length can be found using the distance formula.
The sum of all these lengths gives an approximation to s:


n √
s≈ (ai − ai−1 )2 + (f (ai ) − f (ai−1 ))2 .
i=1

The quantity in the sum does not fit the pattern in the definition of the integral because the term ai − ai−1
does not appear as a multiplier. However, a little algebra and a little theory yield

√ ( f (a ) − f (a ) )2 √
i i−1
(ai − ai−1 )2 + (f (ai ) − f (ai−1 ))2 = (ai − ai−1 ) 1+ = 1 + (f ′ (ti ))2 (ai − ai−1 ),
ai − ai−1

where ti is some point between ai−1 and ai . The existence of the point ti is guaranteed by the Mean Value
Theorem under the added assumption that f is differentiable on [a, b]. Since the approximation to s appears
to improve as n increases, it follows that


n √ ∫ b √
s = lim 1 + (f ′ (ti ))2 (ai − ai−1 ) = 1 + (f ′ (x))2 dx.
n→∞ a
i=1

The last step requires that f ′ be continuous on [a, b]. We have thus obtained the following result.

THEOREM 2.7 arc length


If a function f has a continuous derivative on an interval [a, b], then the
∫ b√
length of the curve y = f (x) from (a, f (a)) to (b, f (b)) is given by 1 + (f ′ (x))2 dx.
a

There is a quick and intuitive way to remember the arc length formula using differentials. By focusing
on a tiny portion of the curve y = f (x), we can consider the so-called differential triangle (see the figure);
the base is dx, the height is dy, and the hypotenuse, which represents a very short, almost straight, portion
2.18 Arc length 117

of the curve, is ds. Thus, a little change dx in x generates a little change dy in y and these two determine
ds, the little change in the arc length.

y ........................................
. ......... .......
..... ....... ......
...... .........
. .....
..... (ds)2 = (dx)2 + (dy)2
f (b) .•. ...
..... ...

... ..
.
. ...

... ....
... ...
....
. ...
..
.........
..
.... .. ...
.
. ... ....
.
... ...
... s = ds
................ ... .
... ...
. .. ...
.
....
ds.... .. dy
.. ∫ √
... ...
...
. ..
.
...
.. ....
... ..
..
.
= (dx)2 + (dy)2
....y = f (x)
...
...
.. . ...
.
. ..............................
...
...
. ...
...
...

........ ( )2
...
..... ..
...
. ∫
•........
..... ..
...... dx .....
f (a) .......
......... .......
...... dy
......................................... = 1+ dx
.....
........... x dx
a b differential triangle

Although the integral representing the arc length of a curve is easy to write down, most such integrals
are very difficult to evaluate. The reason for this lies in the fact that the square root makes it rather difficult
to find an antiderivative. Except for a few contrived functions, most arc length integrals require advanced
techniques of integration to evaluate in exact terms. To illustrate a contrived example, we will find the
3x4 1
length of the curve y = + on the interval [1, 2]. We first note that
4 24x2
( dy )2 ( 1 )2 ( 3 )2 1 ( 1 )2 ( 3 1 )2
1+ = 1 + 3x3 − = 1 + 3x − + = 3x + .
dx 12x3 2 12x3 12x3

It should be clear how unique this example is! It follows that the length of the curve is
∫ 2(
1 ) ( 3x4 1 ) 2 361
3x3 + dx = − = .
1 12x3 4 24x2 1 32

Exercises

1. Let f (x) = r2 − x2 , where r > 0. Use the arc length formula to find the length of the portion of the curve
y = f (x) on the interval [0, r]. (Note the appearance of an improper integral.) Why does the answer look
familiar?
2. Find the length of the curve y = x3/2 on the interval [0, b], where b > 0. Use your formula to find a rational
number b for which the length of this curve is also a rational number. Can you find an integer b for which the
length of this curve is also an integer?
3. Find the length of the curve on the given interval.
x3/2 x2
a) y = − 2x1/2 , [1, 4] b) y = − ln x, [1, 4] c) y = (ex + e−x )/2, [0, ln 10]
6 8
2x3 1 x5 1
d) y = + , [1, 2] e) y = + 3 , [1, 2] f ) y = ln | cos x|, [0, π/3]
3 8x 10 6x

4. Find the length of the entire curve given by the equation x2/3 + y 2/3 = 1. (Note that this equation is similar to
the equation of the unit circle; the only difference is the exponents on the variables x and y. The graph of this
equation has a diamond shape.)
5. Find the exact length of the curve y = x5/4 on the interval [0, 16].
6. Find the center of mass of a wire bent in the shape of a quarter circle of radius r. Assume the wire has constant
density.
118 Chapter 2 The Integral

2.19 Integrals involving quadratic polynomials

In order to find the length of a portion of the parabola y = x2 , we need to find an antiderivative of the

function 1 + 4x2 . None of the techniques discussed thus far apply to this function. In the next few sections,
we will consider more advanced techniques for finding antiderivatives and thus greatly expand our list of
functions that can be antidifferentiated. Before beginning this section, the reader is encouraged to review the
method of completing the square for quadratic polynomials and the technique for performing long division
for the quotient of two polynomials. A brief summary of these can be found in Appendix A.
Integrals that contain a quadratic polynomial as part of the integrand occur often in applications. Using
some algebraic techniques, an antiderivative can sometimes be found fairly easily. For example,
∫ ∫ (
x+1 x 1 ) 1 1
dx = + dx = ln |x2 + 9| + arctan(x/3) + C;
x2 + 9 2 2
x +9 x +9 2 3

splitting the integral into two fractions leads to integrals that can be evaluated using basic formulas. Similarly,
algebra in the form of long division yields
∫ ∫ (
3x2 + 8x + 2 18 ) 3
dx = 3x − 4 + dx = x2 − 4x + 18 ln |x + 4| + C.
x+4 x+4 2

Once again, some algebra reduces the problem to one that is much easier. Another useful algebraic tool that
applies to quadratic polynomials is completing the square. For example,
∫ ∫
x+4 x+4
dx = dx complete the square
x2 + 12x + 40 (x + 6)2 + 4

u−2
= du let u = x + 6
u2 + 4
∫ (
u 2 )
= − du split up
u2 + 4 u2 + 4
1
= ln |u2 + 4| − arctan(u/2) + C basic formulas
2
1 (x + 6)
= ln |x2 + 12x + 40| − arctan +C return to x
2 2

All three of these examples involve more algebra than calculus. As is evident in the last example, it is
important to pay attention to details. Note that the final answer may be quite a bit more complicated than
the original function. With some care, it is possible to avoid a substitution for these problems:
∫ ∫
4x − 3 4(x + 1) − 7
√ dx = √ dx
24 − 2x − x2 25 − (x + 1)2
∫ ∫
4(x + 1) 7
= √ dx − √ dx
25 − (x + 1)2 25 − (x + 1)2
√ (x + 1)
= −4 25 − (x + 1)2 − 7 arcsin + C.
5
If you find the algebra in this problem to be troublesome or confusing, you should use the substitution
method (let u = x + 1 in this case).
2.19 Integrals involving quadratic polynomials 119

Since these integrals can sometimes cause difficulties, we give another, slightly more complicated example.
Read each step carefully. When doing these problems, just take your time and write out the terms slowly
and clearly.
∫ ∫
x x
√ dx = √ ( )2 dx complete the square
x − x2 1
− x − 12
4

2x
= √ dx multiply by 2
2
1 − (2x − 1)2

1 u+1
= √ du let u = 2x − 1
2 1 − u2

1 ( u 1 )
= √ +√ du split up
2 1 − u2 1 − u2
1( √ )
= − 1 − u2 + arcsin u + C basic formulas
2
1( √ )
= − 1 − (2x − 1)2 + arcsin(2x − 1) + C return to x
2
√ 1
= − x − x2 + arcsin(2x − 1) + C simplify
2

Once again, note that there is quite a bit of algebra here and only a little bit of calculus.

Exercises
1. Complete the square as a first step to evaluate each of the following integrals.
∫ ∫ ∫
3x − 2 2x + 1 4
a) dx b) √ dx c) √ dx
8 + 4x + x2 8 + 2x − x2 6x − x2
∫ ∫ ∫
4x + 3 5x + 7 6x − 1
d) dx e) dx f ) √ dx
x2 + 3x + 5 x2 + 4x + 10 4x − x2
2. Evaluate each of the following integrals.
∫ ∫ ∫
x x+3 x2
a) dx b) dx c) dx
3x + 1 x2 + 10 x2 + 4
∫ ∫ ∫
2x2 + 3x + 1 x3 2x
d) dx e) dx f) dx
x2 + 1 x2 + 2x + 2 x−4

12 2 2 12
3. Use the fact that 2 = − to evaluate dx.
x − 6x x−6 x x2 − 6x
120 Chapter 2 The Integral

2.20 Using a table of integrals

Finding an antiderivative can be a lot of work and sometimes requires a certain amount of cleverness. In
order to avoid repeating routine computations or remembering clever tricks, various mathematicians over
the years have compiled lists of antiderivatives. Whenever an antiderivative is needed, one can consult
the list rather than take the time to rediscover the result; the list is a labor-saving device. A list of
antiderivatives is typically called a table of integrals (although table of indefinite integrals would be a
more appropriate title) and extensive tables of integrals can be found in reference books. The formulas in
a table of integrals are generally arranged according to the form of the integrand; formulas for exponential
functions, logarithmic functions, trigonometric functions, inverse trigonometric functions, etc. Tables also
include many antiderivatives for functions involving the quadratic forms a2 + u2 , a2 − u2 , and u2 − a2 as
these types occur frequently in applications. A short table of integrals suitable for this text can be found in
Appendix B. Take a few minutes to look over this table and become familiar with the types of integrals it
includes. In the next section, we will discuss one method that indicates how many of these integral formulas
were obtained.
Even with a table of integrals, finding antiderivatives is not a simple process. It is often necessary to
make a substitution before the table can be used and some care must be made to determine the appropriate
formula to use. The following three examples illustrate some of these ideas.
∫ ∫
4 4 1 √
√ dx = √ √ du let u = 3x
3x − 10
2 3 u − 10
2

4 √
= √ ln u + u2 − 10 + C integral formula 46
3
4 √ √
= √ ln 3x + 3x2 − 10 + C return to x
3

∫ √ ∫
1 √
x 4 − x dx =
4 4 − u2 du let u = x2
2
1(u √ 4 )
= 4 − u2 + arcsin(u/2) + C integral formula 13
2 2 2
1 2√
= x 4 − x + arcsin(x2 /2) + C
4 return to x
4

∫ ∫ √
√ 1 + u2
1 + ex dx = 2 du let u = ex/2
u
√ 1 + √1 + u2

= 2 1 + u − 2 ln
2 +C integral formula 34
u
√ √
= 2 1 + ex − 2 ln 1 + 1 + ex + x + C return to x

(You should pay careful attention to the last step in the third example; it involves properties of logarithms.)
The general idea is to make a substitution that transforms the integral into one which appears in the table.
As usual, it is important to pay attention to detail when evaluating more complicated integrals. In this
regard, be very careful with the differential terms that appear in your substitution.
2.20 Using a table of integrals 121

Integral formulas 62–65, 73, and 74 are known as reduction formulas because the formulas indicate
how the integral of a function involving an exponent is related to a similar integral with the exponent
reduced, that is, made smaller. By using a reduction formula a sufficient number of times, integrals of this
type can be evaluated. As indicated in the next example, it is important to be aware of basic trigonometric
identities when using the reduction formulas for trigonometric functions. The appropriate identity here is
tan2 x + 1 = sec2 x and the reduction formula for secant is integral formula 65.
∫ ∫
sec x tan x dx = sec3 x (sec2 x − 1) dx
3 2

∫ ∫
= sec5 x dx − sec3 x dx
∫ ∫
1 3
3
= sec x tan x + sec x dx − sec3 x dx
3
4 4

1 1
= sec3 x tan x − sec3 x dx
4 4

1 1(1 1 )
= sec x tan x −
3
sec x tan x + sec x dx
4 4 2 2
1 1 1
= sec3 x tan x − sec x tan x − ln | sec x + tan x| + C.
4 8 8
Note that the reduction formula was not used on both integrals in the third step. Since the integrals in the
following step can be combined, this ends up saving some work.

Exercises
1. Evaluate each of the following integrals using the table of integrals in Appendix B. Indicate by number the
formula that was used.
∫ √ ∫ ∫
1 6x
a) x2 4 + x2 dx b) dx c) dx
(25 + 4x2 )3/2 x4 − 9
∫ ∫ √ 2 ∫ √
3 4x − 9 5 − x2
d) √ dx e) 2
dx f ) dx
2
x + 4x + 8 x x
∫ √ 2x ∫ ∫
e −1 6 cos x sin 2x
g) dx h) √ dx i) √ dx
ex 4 − e2x 9 − cos2 x

2. Find the area (give the exact value) of the region bounded by the hyperbola x2 − y 2 = 9 and the line x = 6.
3. Find the perimeter (give the exact value) of the region bounded by the parabola y = x2 and the line y = 4x.
4. Derive the reduction formula for sine. Hint: Use integration by parts.
5. Derive the reduction formula for secant. Hint: Use integration by parts.
6. Use reduction formulas to evaluate the integrals.
∫ ∫ ∫
a) sec5 x dx b) cos4 x dx c) sec5 x tan2 x dx
∫ ∫ ∫
2
d) sin4 x cos2 x dx e) x7 e−x dx f) (ln x)3 dx

∫ √
3b − x
3b
7. Evaluate dx, where b is a positive constant. This integral appears when finding the area of the
x
0 x+b √
loop in the folium of Descartes. Hint: Try u = x + b.
122 Chapter 2 The Integral

2.21 Trigonometric substitution

The integration technique known as trigonometric substitution can be used to evaluate integrals involving
the quadratic forms a2 + u2 , a2 − u2 , and u2 − a2 . In fact, this technique of integration is how many of
the entries under these headings in a table of integrals are obtained; one of the goals of this section is to
help explain how some integration formulas were discovered. The idea is to take advantage of the basic
trigonometric identities

1 + tan2 x = sec2 x, 1 − sin2 x = cos2 x, and sec2 x − 1 = tan2 x.

Note that these identities show how a sum of two terms can be transformed into a perfect square. By making
an appropriate trigonometric substitution and using one of these identities, the integral is converted into
an integral involving trigonometric functions. Assuming this new integral can be evaluated, the original
problem has been solved. To illustrate this technique, consider the following example.

let x = 2 sin θ,
∫ √
4 − x2 dx; then dx = 2 cos θ dθ and
√ √
4 − x2 = 4 − 4 sin2 θ = 2 cos θ.

We used a trigonometric identity to change the quantity in the square root into a perfect square. Using the
reduction formula for cosine (integral formula 63), it follows that
∫ √ ∫
4 − x2 dx = 4 cos2 θ dθ

= 2 sin θ cos θ + 2θ + C
x √
= · 4 − x2 + 2 arcsin(x/2) + C.
2
It is important to remember to convert your answer back to the original variable. In this case, the relation-
ships between sin θ and x, and cos θ and x have already been noted in the above substitution:

x √
x = 2 sin θ ⇒ sin θ = and θ = arcsin(x/2); 2 cos θ = 4 − x2 .
2

In other problems (see the example at the end of the section), it may be necessary to use a right triangle
and/or inverse trigonometric functions in order to find the appropriate relationships.
The basic idea behind trigonometric substitution is to use a trigonometric identity to reduce the

quadratic form to a single trigonometric function. In the previous example, the quadratic form 4 − x2 was
reduced to 2 cos θ after the substitution x = 2 sin θ was made. The appropriate substitution depends on the
form of the quadratic:
{ {
a2 − u2 u = a sin θ
if the integral contains u2 + a2 , then make the substitution u = a tan θ .
u2 − a2 u = a sec θ
In each case, the substitution and a trigonometric identity turn the quadratic form into a single term. After
the substitution, the new integral is a product of trigonometric functions that can often be evaluated using
reduction formulas or some other technique.
2.21 Trigonometric substitution 123

As mentioned earlier, most of the entries in a table of integrals that involve quadratic forms can be
discovered using the technique of trigonometric substitution. As an example, consider the following integral,
where a is assumed to be a positive constant.

let u = a tan θ,

1
du; then du = a sec2 θ dθ and
(a + u2 )3/2
2

(a2 + u2 )3/2 = (a2 + a2 tan2 θ)3/2 = a3 sec3 θ


∫ ∫ ∫
1 a sec2 θ 1 1
du = 3 3
dθ = 2 cos θ dθ = sin θ + C.
(a + u2 )3/2
2 a sec θ a a2
To express sin θ as a function of u, use the fact that tan θ = u/a and form a right triangle containing the
angle θ:
............. u
.. ........... √a2 + u2 tan θ =
a
;
u .. ......... u
.. ........ sin θ = √ .
..................................θ.......................... a2 + u2
a

It follows that ∫
1 1 u
du = 2 sin θ + C = √ + C.
(a2 + u2 )3/2 a a2 a2 + u2
We have thus “discovered” integral formula 41 in the table of integrals.
One final comment is worth making. When performing trigonometric substitution, be very careful to
include the differential terms dx or du. As seen in the above examples, the terms dx and du are non-trivial
functions of θ and their omission introduces a large error into the early stages of an integration problem.

Exercises
1. Use trigonometric substitution to evaluate each of the following integrals.
∫ √ ∫ √ 2 ∫
16 − x2 x + 16 1
a) dx b) dx c) dx
x x4 (x2 + 1)2
∫ ∫ ∫
8 x2 x2
d) dx e) dx f ) dx
(x2 + 1)3 (1 − x2 )3/2 (1 − x2 )5/2

2. Use trigonometric substitution to evaluate each of the following integrals, where a is a positive constant. Use
the table of integrals in Appendix B to check your answers.
∫ √ ∫ √ ∫
1
a) a2 + u2 du b) a2 − u2 du c) √ du
u2 − a2
∫ √ 2 ∫ ∫
u − a2 1 u3
d) du e) √ du f) √ du
u u2 a2 − u2 u2 − a2
124 Chapter 2 The Integral

2.22 Integrating rational functions

In this section, we will indicate how to find an antiderivative of a rational function (a function that can be
expressed as the ratio of two polynomials). To motivate the main idea, note that

2 1 5 2x(x + 2) x+2 5x2 7x2 + 3x − 2


− 2+ = 2 − 2 + 2 = .
x x x+2 x (x + 2) x (x + 2) x (x + 2) x3 + 2x2

It follows that
∫ ∫ (
7x2 + 3x − 2 2 1 5 ) 1
dx = − 2+ dx = 2 ln |x| + + 5 ln |x + 2| + C.
x3 + 2x2 x x x+2 x

Thus, a complicated looking rational function can be antidifferentiated by first writing it as a sum of simple
fractions. In this example, we started with the simple fractions and added them together to obtain a rational
function. It is a more difficult problem to begin with the rational function and break it into simpler fractions.
Writing a rational function as a sum of simpler fractions is known as partial fraction decomposition.
As an example of this method for evaluating integrals, consider the following integration problem:

5x3 + x2 + x + 2
dx.
x4 + x2

To find the partial fraction decomposition of the integrand, write

5x3 + x2 + x + 2 5x3 + x2 + x + 2 A B Cx + D
= = + 2+ 2 ,
x4 + x2 x2 (x2 + 1) x x x +1

for constants A, B, C, and D. Determining the general form of the partial fraction decomposition will be
discussed in a moment; for now just accept this form as a reasonable sum to give the correct denominator.
Clearing the denominators, we find that

5x3 + x2 + x + 2 = Ax(x2 + 1) + B(x2 + 1) + (Cx + D)x2

= (A + C)x3 + (B + D)x2 + Ax + B.

The only way that these two polynomials can be equal for all values of x is for the coefficients to be identical.
Thus, A + C = 5, B + D = 1, A = 1, and B = 2. It follows that

5x3 + x2 + x + 2 1 2 4x − 1
= + 2+ 2 ,
x4 + x2 x x x +1

(you can always check these answers by adding the fractions) and
∫ ∫ (
5x3 + x2 + x + 2 1 2 4x 1 )
dx = + 2+ 2 − 2 dx
x4 + x2 x x x +1 x +1
2
= ln |x| − + 2 ln |x2 + 1| − arctan x + C.
x
Note that this integral problem involves more algebra than it does calculus.
As is evident in this example, the difficult part of integrating a rational function is determining its
partial fraction decomposition. The following steps will lead to the partial fraction decomposition of a
rational function that is in proper form, that is, the degree of the polynomial in the numerator is less than
2.22 Integrating rational functions 125

the degree of the polynomial in the denominator. (If this is not the case, perform long division and work
with the remainder.)

1) Factor the denominator of the rational function into a product of linear terms and irreducible
quadratic terms. (An irreducible quadratic is a second degree polynomial with no real roots.)
The Fundamental Theorem of Algebra guarantees that such a factorization is always possible.

2) For each linear factor of the form (ax + b)n , put

A1 A2 An
+ + ··· +
ax + b (ax + b)2 (ax + b)n

into the partial fraction decomposition.

3) For each irreducible quadratic factor of the form (ax2 + bx + c)n , put

A1 x + B1 A2 x + B2 An x + Bn
+ + ··· +
ax2 + bx + c (ax2 + bx + c)2 (ax2 + bx + c)n

into the partial fraction decomposition.

For example, the partial fraction decomposition of the rational function

x5 + 3x2 − 6
(x + 2)(2x − 3)3 (x2 + x + 1)2

is given by
A1 A2 A3 A4 A5 x + A6 A7 x + A8
+ + + + 2 + .
x + 2 2x − 3 (2x − 3)2 (2x − 3)3 x + x + 1 (x2 + x + 1)2
A proof of the existence and uniqueness of partial fraction decompositions (which really is an algebra problem
and has nothing to do with calculus) is beyond the scope of this book, but the form of the decomposition will
make more sense after you have worked several problems. The constants that appear in the partial fraction
decomposition can be determined as in the example.

Exercises
1. Evaluate each of the following integrals.
∫ ∫ ∫
1 2x + 3 x + 10
a) dx b) dx c) dx
x2 + 5x x2 − 3x − 4 2x2 + 5x − 3
∫ ∫ ∫
4 x2 + 2x + 3 6x2 + 7x + 20
d) dx e) dx f) dx
x − 3x + 2x
3 2 x3 − 2x2 + x x3 + 6x2 + 10x
∫ ∫ ∫
x+4 1 2x3 + 8x2 + 2x + 23
g) dx h) dx i) dx
3
x +x x4 − 1 x4 + 5x2 + 4
∫ ∫ ∫
12 x3 2x + 5
j) dx k) dx l) dx
x3 − 8 2x2 + 3x − 2 (x2 + 2x + 2)3
126 Chapter 2 The Integral

2.23 Numerical integration

The Fundamental Theorem of Calculus provides a way to evaluate definite integrals without resorting to
the definition of the integral. However, if an antiderivative of the integrand cannot be found (which occurs
regularly in applications), then it is necessary to consider the definition:
∫ ∑ b−a∑
b n n
f (x) dx = lim f (ti ) (ai − ai−1 ) ≈ f (ti ),
a n→∞
i=1
n i=1

where ti is any point in the interval [ai−1 , ai ] (see Section 2.5). For large values of n, the sum should be
a reasonable approximation for the value of the integral. Some more efficient ways of using this idea to
approximate the value of an integral will be discussed in this section.
Suppose that f is a continuous function defined on an interval [a, b]. Let n be a positive integer, let
xi = a + i (b − a)/n and yi = f (xi ) for i = 0, 1, 2, . . . , n, and let ci = (xi−1 + xi )/2 for i = 1, 2, . . . , n. Then
∫b
each of the following represent an approximation to a f (x) dx; the approximations generally become more
accurate as n increases.

b−a( )
Rn = y1 + y2 + · · · + yn−1 + yn right endpoint rule
n
b−a( )
Ln = y0 + y1 + · · · + yn−2 + yn−1 left endpoint rule
n
b−a( )
Mn = f (c1 ) + f (c2 ) + · · · + f (cn ) midpoint rule
n
b−a( )
Tn = y0 + 2y1 + 2y2 + · · · + 2yn−1 + yn trapezoid rule
2n
b−a( )
Sn = y0 + 4y1 + 2y2 + 4y3 + 2y4 + · · · + 2yn−2 + 4yn−1 + yn Simpson’s rule (n must be even)
3n

The first three rules follow immediately from the definition by choosing some routine values for ti . It is easy
to verify that Tn = 12 (Ln + Rn ), the average of the left and right endpoint estimates. A sketch of a graph
of a continuous nonnegative function shows that this average is usually a much better approximation to the
area under the curve; it also reveals that Tn corresponds to approximating the area under the curve using
trapezoids. With a little care (see the exercises), it can be shown that S2n = 31 Tn + 32 Mn , a weighted average
of Tn and Mn . Geometrically, Simpson’s rule is obtained by using parabolas to approximate the function
rather than lines. It follows that Simpson’s rule is usually the most accurate of the five rules. The formula
for Simpson’s rule hinges on the following remarkable property of quadratic functions: if Q(x) = px2 +qx+r,
then ∫ b
Q(a) + 4Q(c) + Q(b)
Q(x) dx = (b − a),
a 6
where c is the midpoint of the interval [a, b]. In other words, the integral of a quadratic function is determined
by its values at the endpoints and at the midpoint of the interval. A proof of this fact, as well as how it is
used to generate Simpson’s rule, will be left to the exercises.
2.23 Numerical integration 127
∫ 2 √
To get a sense for these formulas, consider the integral 1 + x3 dx and take n to be 6. Then
0

i 0 1 2 3 4 5 6
xi 0 1/3 2/3 1 4/3 5/3 2
yi 1 1.01835 1.13855 1.41421 1.83586 2.37268 3

The formulas for the trapezoid rule and Simpson’s rule yield

2 ( )
T6 = 1 · 1 + 2 · 1.01835 + 2 · 1.13855 + 2 · 1.41421 + 2 · 1.83586 + 2 · 2.37268 + 1 · 3 ≈ 3.2600;
2·6
2 ( )
S6 = 1 · 1 + 4 · 1.01835 + 2 · 1.13855 + 4 · 1.41421 + 2 · 1.83586 + 4 · 2.37268 + 1 · 3 ≈ 3.2411.
3·6
The actual value of this integral is known to be 3.2413, accurate to 4 decimal places. This shows that
Simpson’s rule gives good accuracy without too much computation. If a much better approximation is
required, a calculator or computer can be programmed to perform the routine calculations using a large
value for n. Since Simpson’s rule requires almost identical computations as the trapezoid rule and because
for a given value of n it is usually much more accurate, it is the clear choice when an approximation to
an integral is required. (Actually, there are other numerical procedures that are used to approximate the
value of an integral. Questions concerning the accuracy of numerical estimations of many different types,
including the theory behind the analysis of numerical algorithms, are addressed in a branch of mathematics
called numerical analysis. Since most numerical algorithms require the use of a computer, numerical
analysis must consider the methods of computation (such as evaluation techniques and round-off errors) as
well as the mathematical theory of approximation.)

Exercises
1. For the given integral and value of n, find Tn and Sn accurate to four decimal places.
∫ 2 ∫ 1 ∫ 1
2 dx
a) e−x dx, n = 8 b) sin x2 dx, n = 4 c) √ ,n=6
1 0 0.4 1 + x3
∫ 1
sin x
2. Use Simpson’s rule with n = 8 to approximate dx. What value should the integrand have at 0?
0 x
3. Use Simpson’s rule with n = 6 to approximate the length of the sine curve on the interval [0, π].
4. Consider the following table of values for a function f :
x 1 2 3 4 5
f (x) 2 3 5 7 2
Assume that f is a nonnegative continuous function defined on [1, 5] and let R be the region under the curve
y = f (x) and above the x-axis on the interval [1, 5]. Use Simpson’s rule to approximate (a) the area of R, (b)
the volume generated when R is revolved around the x-axis, and (c) the center of mass of R.
5. Suppose that the following table represents the velocity of a particle moving in a straight line.
t (sec) 0 1 2 3 4 5 6
v (m/sec) 0 5 12 15 14 6 0
Use Simpson’s rule to approximate the distance traveled by the particle.
6. Prove that S2n = 31 Tn + 23 Mn for each positive integer n.
7. Prove the property of quadratic functions stated in this section.
8. Use the property of quadratic functions stated in this section to derive Simpson’s rule.
128 Chapter 2 The Integral

2.24 Supplementary Exercises

1. Find each of the following sums. Simplify your answers.


∑n ∑ n ∑
n
a) (2i − 1) b) (2i − 1)2 c) (2i − 1)3
i=1 i=1 i=1

4n 3
(n + 1) + (n + 2) + (n + 3) + · · · + 2n
2. Evaluate the limits: lim and lim .
n→∞ 1 2 + 2 2 + 32 + · · · + n 2 n→∞ n2
3. Here is an unlikely application to bowling alleys.
a) Suppose that bowling pins are lined up in the customary triangular pattern except that there are 100 rows
instead of the usual 4 rows. How many bowling pins are required?
b) Suppose that bowling balls are stacked up in a rather dangerous square pyramid formation that is 60 balls
high. How many bowling balls are in the stack?
∫2
4. Use the definition of integral to evaluate 1 (2x2 − x + 3) dx.
∑n
2(n + 2i)2
5. Evaluate lim .
n→∞
i=1
n3
∫ 10
6. Evaluate 4 |4t − 24| dt.
7. Use a property of integrals to determine the largest integral in each pair. Justify your answers.
∫ 3√ ∫ 3 ∫ 2π ∫ 2π ∫ 1 ∫ 1
2
a) x6 − 1 dx, x3 dx b) sin3 x dx, sin5 x dx c) ex dx, ex dx
2 2 π π 0 0

8. Find the point c guaranteed by the Mean Value Theorem for integrals.
a) f (x) = x3 , [1, 2] b) f (x) = x, [a, b] c) f (x) = x2 , [a, b]
9. Find the derivative of each of the following functions.
∫ x ∫ 4x ∫ 2t2
4t 2
a) f (x) = √ dt b) g(x) = cos(t2 ) dt c) h(t) = e−x /2
dx
0 t4 + 2 3 −2
∫ x( )
1 2
10. Evaluate lim 3 e2t − 1 dt.
x→0 x 0
∫ x √
11. Determine the interval on which the function f defined by f (x) = 4t2 − 2t + 7 dt is concave down.
0

2
12. Find an integral expression for a function f such that f (1) = 0 and f ′ (x) = ex /4 .

13. Evaluate ex sin x dx in the following way. Use integration by parts with u = ex and note that the new integral
is not really any easier than the original one. Then use integration by parts on the new integral, again with
u = ex . It may seem that you are going in∫circles, but if you write out the equation you have obtained, you will
see that it can be solved for the function ex sin x dx.

14. Repeat the steps in the previous exercise to evaluate ex cos x dx.
15. Find the area of the region bounded by the given curves. The x-coordinates of the points of intersection are
integers.

a) y = 12/x, y = 3 + 12x − 3x2 b) y = x4 , y = 2/(1 + x2 ) c) y = x3 , y = 32 x

16. Find the area of the region bounded by the curves y = sec x and y = 2 on the interval (−π/2, π/2).
17. Let R be the region under the curve y = 4xe−x/2 and above the x-axis on [0, ∞). Find the area of R.
18. Let R be the region under the curve y = 4/x and above the x-axis on the interval [1, 4]. Find (a) a vertical line
and (b) a horizontal line that divides the region R into two pieces of equal area.
19. Consider the region that lies under the graph of y = 2x − x2 and above the x-axis. Find a line of the form
y = mx that divides this region into two parts of equal area.
20. Find the center of mass of the following collection of masses: a mass of 3 is located at (3, 2), a mass of 2 is
located at (2, −1), a mass of 4 is located at (5, −2), and a mass of 1 is located at (−2, 0).
21. Let R be the region under the curve y = 4/x3 and above the x-axis on the interval [1, ∞). Find the center of
mass of R. Is the center of mass of R in the region R?
2.24 Supplementary Exercises 129

22. Let T be the triangular region with vertices (0, 0), (2, 0), and (0, 2). Suppose that the density of this region (in
units of mass per area) is ρ(x) = 1 + 5x at a distance x units from the y-axis. Find the center of mass of T .
23. Let R be the region under the graph of y = ex and above the x-axis on the interval [0, 2]. Find the center of
mass of the solid that is generated when R is revolved around the x-axis.
24. Let R be the region under the graph of y = 4 − x2 and above the x-axis on the interval [0, 2] and let S be the
solid that is generated when R is revolved around the y-axis. Find the center of mass of S.
25. Let f be a continuous nonnegative function defined on an interval [a, b] and let R be the region under the graph
of y = f (x) and above the x-axis on the interval [a, b]. Show that
∫ ∫ b( )2
1 b 1
x= xf (x) dx and y = f (x) dx,
A a 2A a
where A is the area of the region R. Assume that the region has constant density ρ.
26. Use the previous exercise to find the center of mass of the region under the graph of y = sin x on the interval
[0, π].
27. Derive the reduction formula for tangent.
28. Use reduction formulas to evaluate the integrals.
∫ ∫ ∫
a) sin5 x dx b) x3 ex dx c) tan6 x dx
∫ ∫ √
∫ e
d) x2 e−x dx e) x2 e x dx f) (ln x)4 dx
1
∫ 6
29. Use Simpson’s rule with n = 2 to approximate 4x3 dx, then compare the approximation with the exact value
2
of the integral. You should notice something interesting.
∫ b
a3 + 4c3 + b3
30. Show that x3 dx = (b − a), where c is the midpoint of the interval [a, b]. What does this result
a 6
imply about Simpson’s rule and cubic polynomials?
∫ 1√
31. For the integral 1 + x2 dx, find M2 , M4 , T2 , T4 , S2 , and S4 and compare all of these values to the actual
0
value of the integral. Search the errors carefully and look for any patterns that you can find.
32. Suppose that the circle given by the equation x2 + y 2 = 6y − 2x is revolved around the line x − y = 6. Find the
volume of the “donut” that is generated.
33. Suppose that the water is four feet deep in a hemispherical tank (flat side up) that has a radius of five feet. How
many gallons of water are in the tank? One gallon is approximately 231 cubic inches.
34. Let R be the region under the curve y = 6e−x/3 and above the x-axis on the interval [0, ∞). Find the volume
of the solid that is generated when R is revolved around the x-axis.
35. Evaluate each of the following integrals. You should not need a table of integrals.
∫ ∫ ∫
1 x x3
a) 2
dx b) 2
dx c) dx
9+x 9+x 9 + x2
∫ ∫ ∫
√ 2
d) x2 x + 2 dx e) xex dx f) x2 ex dx
∫ 1( )2 ∫ 4 √ ∫ 4 √

g) x + 3 dx h) x 16 − x2 dx i) (3x + 5) 16 − x2 dx
∫0 ∫0 √ ∫0
x+2 1
j) dx k) 2x 3 3 4
x + 6 dx l) √ dx
(x2 + 4x + 9)3 4 − 9x2
∫ ∫ ∫ √
sin x
m) x csc2 x dx n) sin 2x cos3 2x dx o) √ dx
x
∫ ∞ 2 ∫ ∞ ∫ 2
x +2 −x/5 1
p) 5
dx q) xe dx r) 2
dx
1 x 0 −1 x
∫ ∫ ∫
ex ex e2x
s) x
dx t) 2x
dx u) x
dx
e +1 e +1 e +1
36. Let R be the region under the graph of y = x2 and above the x-axis on the interval [0, 2]. Find the exact value
of each of the following.
a) the area of R
b) the volume of the solid generated when R is revolved around the x-axis
130 Chapter 2 The Integral

c) the volume of the solid generated when R is revolved around the y-axis
d) the volume of the solid generated when R is revolved around the line x = 5
e) the volume of the solid generated when R is revolved around the line x = −3
f ) the volume of the solid generated when R is revolved around the line y = 4
g) the x-coordinate of the center of mass of R
h) the y-coordinate of the center of mass of R
i) the x-coordinate of the center of mass of the solid generated when R is revolved around the x-axis
j) the perimeter of R
k) the force on one side of R if R is a submerged vertical plate with the fluid (weight density w) level at y = 8
37. Let R be the region under the graph of y = sin x and above the x-axis on the interval [0, π]. Repeat the previous
exercise for this region R. You will need to approximate the value for part (j).
38. Suppose that the figure below represents a two-dimensional region with a parabola forming its curved boundary.
8
.......................................................................................
... .
..
.
... ...
.
... .
.
.... 4
.
....
.
. ....
.... .
.
..
...
.....
....... ...... ...........
.........................
a) Find the area of this region.
b) Find the center of mass of this region. Specify its location carefully.
c) Find the force exerted by water on one side of this region, assuming it is a vertically submerged plate (assume
the units are feet) with the top of the plate six feet below the surface of the water. Use w = 62.4 lb/ft3 .
39. Find the volume of the solid that is generated when the region on one side of a chord in a circle of radius r is
revolved around the chord. Assume that the height of the chord is a, where 0 < a < r.
3
Sequences and Series

For any real number x ̸= 1 and any positive integer n, elementary algebra yields

1 − xn+1
1 + x + x2 + · · · + xn = .
1−x

(To check this, you can either do long division on the fraction or multiply both sides by 1 − x.) If |x| < 1,
then |x|n+1 gets closer to 0 as n increases, and it is tempting to write

1
(1) 1 + x + x2 + x3 + · · · = .
1−x

This equation asserts that it is possible to add an infinite number of numbers and find the exact value of
the sum. For example, when x = 0.1, it asserts that 1 + 0.1 + (0.1)2 + (0.1)3 + · · · = 10/9, a believable result
since the decimal expansion of 1/9 is 0.1111 . . .. Assuming that (1) is valid for |x| < 1, integration yields

1 1 1
x + x2 + x3 + x4 + · · · = − ln |1 − x|,
2 3 4

which expresses the natural logarithm function as an infinite degree polynomial. If we substitute −1 for x
(even though it is out of the specified range), we obtain

1 1 1 1 1
ln 2 = 1 − + − + − + ···.
2 3 4 5 6

This is certainly a new way to look at logarithms. Replacing x with −x2 in (1) and integrating gives an
expression for arctan x as well as an interesting formula for π:
1
1 − x2 + x4 − x6 + · · · = ;
1 + x2
1 1 1
x − x3 + x5 − x7 + · · · = arctan x;
3 5 7
1 1 1 π
1 − + − + ··· = .
3 5 7 4
131
132 Chapter 3 Sequences and Series

(What value of x was used to get the formula for π/4?) Are these equations valid? We know that the integral
of a finite sum of functions is the sum of the individual integrals (see property (3) of integrals in Section
2.4). Is the integral of a sum the sum of the integrals even when there are an infinite number of terms? Is
it really possible to represent functions such as ln |1 − x| and arctan x as infinite degree polynomials? Can
all functions be represented in this way? Are there new functions that we have never seen before that can
be represented in this way? These sorts of questions provide the motivation for the concepts considered in
this chapter.
To obtain a better sense of these new kinds of equations, let’s think about decimal expansions for a
moment. The number 1/3 is equal to the infinite decimal 0.33333 . . .. In other words,

1 1 1 1 1
=3· + 3 · 2 + 3 · 3 + 3 · 4 + ···.
3 10 10 10 10

The irrational number 2 also has a decimal expansion, but there is no repeating pattern to its digits:

√ 1 1 1 1
2 = 1.414213562 . . . = 1 + 4 · + 1 · 2 + 4 · 3 + 2 · 4 + ···.
10 10 10 10

However, even if we do not know all of the digits, we do know that every real number x can be written as

1 1 1 1
x = n + d1 · + d2 · 2 + d3 · 3 + d4 · 4 + · · · ,
10 10 10 10

where n is an integer and the digits di are integers between 0 and 9 inclusively. The calculations on the
previous page seem to indicate that a function f can be written as

f (x) = a0 + a1 x + a2 x2 + a3 x3 + a4 x4 + · · ·

for values of x in an appropriate interval, where the coefficients ai are real numbers. Hence, in some sense,
functions are determined by a sequence of numbers in much the same way that numbers are determined by
a sequence of digits. Recognizing that many functions (however, as we will see, not all functions) can be
represented in this way is extremely important in both pure and applied mathematics.
This chapter is more abstract than the first two chapters of this text. Many students find infinite series
to be the most challenging topic in calculus. You will need to focus more on ideas and concepts rather than
on calculations. Consequently, the way you study mathematics may need to adapt as you work through this
chapter. These sentences are not intended to scare you, but rather to alert you to the fact that the nature
of the course is about to change. The reward for this extra effort is an increased ability to reason abstractly
and an opportunity to catch a glimpse of some fascinating aspects of mathematics. We begin our study of
infinite series with a proof technique known as mathematical induction.
3.1 Mathematical Induction 133

3.1 Mathematical Induction

Statements of the form, “for each positive integer n, something is true”, occur in all branches of mathematics.
Two simple examples are

1. For each positive integer n, the number n2 − n + 41 is a prime number.


n(n + 1)(2n + 1)
2. For each positive integer n, 12 + 22 + 32 + · · · + n2 = .
6
To prove that statements such as these are false, it is only necessary to find one positive integer n for which
the statement is false. For instance, statement (1) is false; you should be able to find a positive integer n
for which n2 − n + 41 is not a prime number. However, it is not possible to prove such statements are true
by showing that they are true for several values of n; the formulas or statements must somehow be verified
for every positive integer n. Since it is not possible to actually prove individually an infinite number of
statements, some other method of proof is needed. The Principle of Mathematical Induction is a useful tool
for proving some statements of this type. This important property is stated below. (Recall that the symbol
Z+ represents the set of positive integers and that the symbols a ∈ S mean that a is a member of the set S.)

Principle of Mathematical Induction: If S is a set of positive integers that contains 1 and satisfies the
condition “if k ∈ S, then k + 1 ∈ S”, then S = Z+ .

The Principle of Mathematical Induction can be compared to a chain reaction. If we know that each
event will set off the next (the condition in quotes) and if the first event occurs (1 ∈ S), then the entire
chain reaction will occur. Perhaps you have seen one of those amazing domino exhibits where thousands of
dominoes fall over in interesting patterns. The dominoes must be set up in such a way that each one knocks
over the next (“if k ∈ S, then k + 1 ∈ S”), and someone must begin the process by pushing over the first
domino (S contains 1).
Given a statement of the form “for each positive integer n, something is true”, let S be the set of all
positive integers n for which the statement is true. In order to prove that the statement is true for all positive
integers, we must show that S = Z+ . By the Principle of Mathematical Induction, it is sufficient to prove
that S contains 1 and satisfies the condition “if k ∈ S, then k + 1 ∈ S”. In almost every situation of this
type, the statement is easy to prove for n = 1. However, a proof that k + 1 ∈ S under the assumption that
k ∈ S requires more effort. For many of the induction arguments in calculus, the proof of this implication
involves some form of algebraic manipulation. Two examples of such proofs are given below. The first proof
is written rather formally, while the second is written in a more informal style. The second form is more
common, but you may use whichever style you prefer.

n(n + 1)(2n + 1)
THEOREM 3.1 For each positive integer n, 12 + 22 + 32 + · · · + n2 = .
6
Proof. We will use the Principle of Mathematical Induction. Let S be the set of all positive integers n
such that
n(n + 1)(2n + 1)
12 + 22 + 32 + · · · + n2 = .
6
134 Chapter 3 Sequences and Series

Since 1 = (1 · 2 · 3)/6, it follows that 1 ∈ S. Suppose that k ∈ S for some positive integer k. This means that

k(k + 1)(2k + 1)
12 + 22 + 32 + · · · + k 2 = .
6

We then have
k(k + 1)(2k + 1)
12 + 22 + 32 + · · · + k 2 + (k + 1)2 = + (k + 1)2
6
k + 1( 2 )
= 2k + k + 6k + 6
6
(k + 1)(k + 2)(2k + 3)
= ,
6
which indicates that k + 1 ∈ S. We have thus shown that “if k ∈ S, then k + 1 ∈ S”. By the Principle of
Mathematical Induction, S = Z+ . Hence,

n(n + 1)(2n + 1)
12 + 22 + 32 + · · · + n2 =
6

for all positive integers n.

THEOREM 3.2 For each positive integer n, the integer 9n − 8n − 1 is divisible by 64.

Proof. We will use the Principle of Mathematical Induction. Since 0 is divisible by 64, the statement is
valid when n = 1. Suppose that 9k − 8k − 1 is divisible by 64 for some positive integer k. We want to show
that 9k+1 − 8(k + 1) − 1 is divisible by 64. Since 9k − 8k − 1 is divisible by 64, there exists an integer j such
that 64j = 9k − 8k − 1. We then have

9k+1 − 8(k + 1) − 1 = 9 · 9k − 8k − 9 = 9(9k − 1) − 8k = 9(64j + 8k) − 8k = 9(64j) + 64k = 64(9j + k).

Since 9j + k is an integer, it follows that 9k+1 − 8(k + 1) − 1 is divisible by 64. The result now follows by
the Principle of Mathematical Induction.

For the record, the Principle of Mathematical Induction can also be stated as follows.

Principle of Strong Mathematical Induction: If S is a set of positive integers that contains 1 and
satisfies the condition “if 1, 2, . . . , k ∈ S, then k + 1 ∈ S”, then S = Z+ .

This stronger form of induction (the statement is assumed to be true for all of the positive integers up to k,
not just for k) is needed in some cases. An example of a proof that requires this stronger form of induction
follows.

THEOREM 3.3 Suppose that a1 = 6, a2 = 18, and an+1 = an + 6an−1 for each positive integer n > 1.
Then an = 2 · 3n for each positive integer n.

Proof. It is easy to see that the statement is true for n = 1 and n = 2. (We need to check both of these
cases, since these numbers do not fit the general pattern for the generation of terms.) Suppose that ai = 2 · 3i
for all the integers i = 1, 2, . . . , k for some positive integer k ≥ 2. We must show that ak+1 = 2 · 3k+1 . Using
3.1 Mathematical Induction 135

the assumption that all the terms up to k satisfy the pattern (all we really need to know is that the pattern
is valid for the terms k and k − 1), we find that

ak+1 = ak + 6ak−1 = 2 · 3k + 6(2 · 3k−1 ) = 2 · 3k + 4 · 3k = 6 · 3k = 2 · 3k+1 ,

as desired. By the Principle of Strong Mathematical Induction, the formula an = 2·3n is valid for all positive
integers n.

To prove a statement using the Principle of Mathematical Induction, it is necessary to first verify that
the statement is true when n = 1 (or some other starting value), then to prove the induction hypothesis “if
k ∈ S, then k + 1 ∈ S” (or the modified form when using strong induction). In practice, it is usually a good
idea to verify the statement for the first few positive integers, say n = 1, 2, 3, 4. This often gives you a better
understanding of the problem and perhaps some insight into why the “formula” is true, and it increases your
confidence that the statement is valid.

Exercises
1. Prove that 1 + 3 + 5 + · · · + (2n − 1) = n2 for each positive integer n.

n2 (n + 1)2
2. Prove that 13 + 23 + 33 + · · · + n3 = for each positive integer n.
4
1 1 1 1 n
3. Prove that + + + ··· + = for each positive integer n.
1·2 2·3 3·4 n · (n + 1) n+1

4. Prove that for each positive integer n, the integer 32n+1 + 2n+2 is divisible by 7.

5. Let a1 = 1 and an+1 = 3 − (1/an ) for each integer n ≥ 1. Prove that 1 ≤ an ≤ 3 for each positive integer n.

6. Let b1 = 1, b2 = 2, and bn = 3bn−1 − 2bn−2 for each positive integer n > 2. Prove that bn = 2n−1 for each
positive integer n. (You will need the stronger form of induction here.)

7. This exercise refers to the Fibonacci numbers 1, 1, 2, 3, 5, 8, 13, 21, 34, . . .. These numbers are defined by f1 = 1,
f2 = 1, and fn+1 = fn + fn−1 for each n ≥ 2.
a) Prove that f1 + f2 + · · · + fn = fn+2 − 1 for each positive integer n.
b) Prove that f12 + f22 + · · · + fn2 = fn fn+1 for each positive integer n.
c) Prove that f1 + f3 + f5 + · · · + f2n−1 = f2n for each positive integer n.
d) Prove that f1 f2 + f2 f3 + f3 f4 + · · · + f2n−1 f2n = f2n
2
for each positive integer n.
e) Prove that fn+1 fn−1 = fn2 + (−1)n for each positive integer n > 1.
( √ ) ( √ )
f ) Let α = 1 + 5 /2 and β = 1 − 5 /2; the numbers α and β are the solutions to the equation x2 = x + 1.
( ) √
Prove that fn = αn − β n / 5 for each positive integer n.
8. The following set of results provides a different way to think of factorials.
a) Use mathematical induction and L’Hôpital’s Rule to prove that lim xn e−x = 0 for all positive integers n.
x→∞
∫∞
b) Use part (a) and mathematical induction to prove that 0 xn e−x dx = n! for all positive integers n.
c) Use the result from part (b) to explain why 0! is defined to be 1.
136 Chapter 3 Sequences and Series

3.2 Sequences

A sequence of real numbers is an ordered list of real numbers that continues indefinitely; one real number
for each positive integer. Sequences will be denoted by symbols such as {an }, where it is assumed that n
runs through the positive integers in increasing order: {an } = a1 , a2 , a3 , a4 , a5 , a6 , . . .. The real numbers an
are called the terms of the sequence and the letter n is called the index of the sequence. A sequence can
often be represented as a function of n:
{1} 1 1 1 1 1 { n } 1 2 3 4 5 6
= 1, , , , , , . . . = , , , , , ,...
n 2 3 4 5 6 2n + 1 3 5 7 9 11 13
{ } { 2}
(−1)n = −1, 1, −1, 1, −1, 1, . . . n = 1, 4, 9, 16, 25, 36, . . .

Sometimes a sequence is defined by writing out enough terms to generate a pattern; it is assumed that the
pattern continues indefinitely. For example, 1, 2, 1, 3, 1, 4, 1, 5, 1, 6, . . . represents a sequence.
It is useful to have some descriptive terminology to describe various types of sequences.

DEFINITION 3.4 Let {an } be a sequence of real numbers.

a) {an } is bounded above if there is a number M such that an ≤ M for all n.

b) {an } is bounded below if there is a number m such that an ≥ m for all n.

c) {an } is bounded if there is a number M such that |an | ≤ M for all n.

d) {an } is increasing if an+1 ≥ an for all n.

e) {an } is decreasing if an+1 ≤ an for all n.

f ) {an } is monotone if it is either increasing or decreasing.

g) {an } is convergent if there is a number a such that lim an = a. This means that for each ϵ > 0
n→∞
there exists a positive integer N such that |an − a| < ϵ for all n ≥ N .

For example, the sequence {1/n} is bounded below by 0, the sequence {(−1)n } is bounded by 1, the
sequence {n2 } is increasing, and the sequence {n/(2n + 1)} converges to 1
2. Informally, a sequence {an }
converges to a if the terms get closer to a as you go farther out in the sequence. Part (g) of the definition
makes this somewhat vague idea precise by carefully quantifying the words ‘closer’ and ‘farther’. For the
most part, it is possible for a sequence to have one of the properties listed in the definition without having
the others. (Some examples will be requested in the exercises.) However, one connection between these
properties is given by the following theorem.

THEOREM 3.5 A convergent sequence is bounded.

Proof. Let {an } be a sequence that converges to a. By the definition of convergent sequence, there is
a positive integer N such that |an − a| < 1 for all n ≥ N . This inequality is equivalent to the statement
|an | < 1 + |a| for all n ≥ N . Letting M = max{|a1 |, |a2 |, . . . , |aN |, 1 + |a|}, it then follows that |an | ≤ M for
all n. Hence, the sequence {an } is bounded.
3.2 Sequences 137

For simple sequences defined as a function of n, it is often easy to find the limit. For example, the
{ }
sequence n/(6n − 5) converges to 1/6 since the constant 5 becomes insignificant when n is large:

n ( n 1/n ) 1 1
lim = lim · = lim = .
n→∞ 6n − 5 n→∞ 6n − 5 1/n n→∞ 6 − (5/n) 6

(If you can recognize these limits without including the extra steps, then you are encouraged to do so as long
as you can explain your reasoning if necessary.) Sometimes it is helpful to convert a sequence into a function
then use the properties of functions to analyze the sequence. To obtain a function from the sequence {an },
choose a function f defined for all x ≥ 1 such that f (n) = an for each positive integer n. A plot of the
sequence {an } can be represented as a collection of dots; one dot for each positive integer n. The graph of
the corresponding function f is a curve that goes through all of the dots. If f is increasing on the interval
[1, ∞), then {an } is an increasing sequence; if lim f (x) = L, then the sequence {an } converges to L. It
x→∞
is then possible to use calculus to analyze the function f and thus obtain information about the sequence
{an }. For the sequence considered above, the function would be f (x) = x/(6x − 5). Since

(6x − 5) · 1 − x · 6 5
f ′ (x) = =− ,
(6x − 5)2 (6x − 5)2
{ }
the function f is decreasing on [1, ∞). This means that the sequence n/(6n − 5) is decreasing.

Exercises
1. Give examples of sequences that have the given properties: (a) monotone but not bounded; (b) bounded but not
monotone; (c) monotone but not convergent; (d) convergent but not monotone; (e) bounded but not convergent;
(f) both increasing and decreasing; (g) neither bounded above nor bounded below.
2. Find the first five terms of the given sequence.
{ } { } {∑
n }
2 + (−1)n 1 · 3 · 5 · · · · · (2n − 1) k+1
a) b) c)
3n 2n k!
k=1

3. Represent the given sequence using a formula.


1 2 3 4
a) , , , , 5 , 6 ,...
2 4 8 16 32 64
b) 2, 0, 2, 0, 2, 0, . . . c) 1 1 1
, , , 1 , 1 , 1 ,...
2 6 12 20 30 42

4. Find the limit of the sequence.


{ } { } { }
2n + 1 n2 + 2n + 2 k
a) b) c) √
7n − 3 2 − 3n2 4k2 + 3k + 1
{√ { }
√ } {√ } ln n
d) n+5− n e) k2 + k − k f)
n
{ } { (√ )} {( )n }
n
2−1
k
g) k sin(π/k) h) k i)
n+1

5. A sequence {xn } of positive numbers is increasing if xn+1 /xn ≥ 1 for all n and decreasing if xn+1 /xn ≤ 1 for all
n. Use this fact to determine if the given sequence is increasing or decreasing.
{ } { n} { }
n 2 (2n)!
a) b) c)
3n + 1 n! (n!)2
{ }
1 1 1
6. Prove that the sequence 1 + √ + √ + · · · + √ is unbounded. (You may use the following fact: if {an }
2 3 n
is an unbounded sequence of positive numbers and bn ≥ an for all n, then {bn } is unbounded.)
138 Chapter 3 Sequences and Series

3.3 Properties of Sequences

Given two sequences {an } and {bn }, it is possible to perform algebraic operations on their terms and form
new sequences. If the two sequences converge, then the new sequences converge as well and have the expected
limits. Since the proofs of the results recorded in the following two theorems involve the abstract definition
of a convergent sequence, they will be omitted. However, the results should be intuitively clear.

THEOREM 3.6 If {an } converges to a, {bn } converges to b, and c is a constant, then

a) the sequence {c an } converges to c a;

b) the sequence {an + bn } converges to a + b;

c) the sequence {an − bn } converges to a − b;

d) the sequence {an bn } converges to ab;

e) the sequence {an /bn } converges to a/b, assuming that b ̸= 0 and bn ̸= 0 for all n.

THEOREM 3.7 Squeeze Theorem If an ≤ xn ≤ bn for all n and the sequences {an } and {bn } both
converge to L, then the sequence {xn } also converges to L.

The next theorem lists the limits of some specific sequences that occur frequently. Because of their
importance, these sequences and their limits should be intuitively understood and committed to memory.

THEOREM 3.8 Limits of Specific Sequences

1. If p > 0, then the sequence {1/np } converges to 0.

2. If |r| < 1, then the sequence {rn } converges to 0.

3. If |r| < 1 and p > 0, then the sequence {np rn } converges to 0.

4. If |r| > 1, then the sequence {rn } is not bounded.



5. If a > 0, then the sequence { n a} converges to 1.

6. The sequence { n n} converges to 1.

7. If a is a real number, then the sequence {(1 + (a/n))n } converges to ea .

Proof. Parts (1), (2), (4), and (5) are easy to visualize. Part (3) follows from the fact that exponential
functions grow much more quickly than power functions. The details of these proofs will be omitted.
To prove part (6), we will use the Binomial Theorem in order to indicate how these sequence results can

be proved without using calculus. Let tn = n n − 1 for each n. It is sufficient to prove that {tn } converges
to 0. By the Binomial Theorem (see Appendix A),

n(n − 1) 2 n(n − 1) 2
n = (1 + tn )n = 1 + ntn + tn + · · · + tnn > tn
2 2

for all n ≥ 3. Rearranging this inequality yields 0 < tn < 2/(n − 1) for all n ≥ 3. Hence, the sequence
{tn } converges to 0 by the Squeeze Theorem.
3.3 Properties of Sequences 139

( )x
To prove part (7), let f (x) = 1 + (a/x) . Using L’Hôpital’s Rule, we find that

( ) −a/x2
( ) ln 1 + (a/x) 1 + (a/x) a
lim ln f (x) = lim = lim = lim = a.
x→∞ x→∞ 1/x x→∞ −1/x 2 x→∞ 1 + (a/x)

It follows that lim f (x) = ea . Therefore, the sequence {(1 + (a/n))n } converges to ea .
x→∞
{√ }
To illustrate the use of these theorems, consider the sequence n 3/n . Then
{√n
}
3 converges to 1 by part (5) of Theorem 3.8;
{√ }
n
n converges to 1 by part (6) of Theorem 3.8;

{n√ } { n 3} 1
3/n = √ converges to = 1 by part (e) of Theorem 3.6.
n
n {√ } 1
We have thus shown that n 3/n converges to 1 by using previous results. We typically do not give this
{ 8n + n3 }
much detail when finding limits of sequences. For a second example, consider the sequence .
23n+2 + n
Performing some algebra and using part (3) of Theorem 3.8, we find that

8n + n3 8n + n3 1/8n 1 + n3 (1/8)n 1+0 1


lim = lim 3n 2 · = lim = = .
n→∞ 23n+2 + n n→∞ 2 · 2 + n 1/8 n n→∞ 4 + n(1/8)n 4+0 4
{ 8n + n3 } 1
It follows that the limit of the sequence is .
23n+2 + n 4

Exercises
1. Find, with proof (see the first of the above examples), the limit of the sequence.
{√ } { } { 5 }
√ k n √

n k n
a) 4n b) 3 8 + k c) 7 2
2 3n

2. Find the limit of the sequence. Justify your answers.


{( )n } {( )n } {( )k }
4 1 k
a) 1+ b) 1− c)
n 2n k+3
{ k
} { n } {√ }
2 4 + n2 n
d) e) f )) 2n2 + 4n + 3
3 k + 6k 22n+1 + n3
{ √ } {∑ n } { }
1 2 n + n4
g) ( k − 1)k √
k
h) i)
n2 + k 5n − n
k=1

3. Use the squeeze theorem to prove that the sequences {2n /n!} and {n!/nn } converge to 0.
{ }
4. Suppose that 0 < a < b. Find, with proof, the limit of the sequence (an + bn )1/n .
5. Prove that {|an |} converges to 0 if and only if {an } converges to 0. Give an example of a sequence {bn } such
that {|bn |} converges but {bn } does not converge.
6. Suppose that {an } converges to 0 and that {bn } is bounded. Prove that {an bn } converges to 0.
{ }
1 · 3 · 5 · · · · · (2n − 1)
7. Prove that the sequence is unbounded.
2n
140 Chapter 3 Sequences and Series

3.4 The Completeness Axiom

One of the disadvantages of the definition of a convergent sequence is that it is first necessary to find the limit
of the sequence before proving the sequence converges. Fortunately, it is possible to show that a sequence
converges without finding its limit. For the purposes of this book, we will treat the following statement as
an axiom (a statement accepted without proof). It is called the Completeness Axiom since it indicates that
there are no holes in the real number line.

Completeness Axiom: Every bounded monotone sequence of real numbers converges.

This statement is easy to accept from the perspective of the number line. If an infinite sequence of dots is
moving to the right (an increasing sequence) and if the dots cannot go beyond a certain point (a bounded
sequence), then the dots must be piling up at some point (the sequence converges).

x1 x2 x3 x4 x5 x6 ··· L (limit) M (upper bound)

The Completeness Axiom indicates that it is possible to prove that a sequence converges without finding its
limit; simply show that the sequence is bounded and monotone.
∑n
Consider the sequence {an } defined by an = 1/k 2 for each positive integer n. Since the expression
k=1
an+1 − an simplifies to 1/(n + 1)2 , it follows that {an } is an increasing sequence. To prove that {an } is
bounded, we need to find an upper bound for {an }. Using some estimation, for each n ≥ 2, we find that
n (
∑ ∑ ∑ ∑ 1)
n n n
1 1 1 1
an = = 1 + < 1 + = 1 + −
k2 k2 k(k − 1) k−1 k
k=1 k=2 k=2 k=2
( 1 ) ( 1 1 ) ( 1 1 ) ( 1 1) 1
=1+ 1− + − + − + ··· + − = 2 − < 2.
2 2 3 3 4 n−1 n n

It follows that {an } is bounded by 2. Since {an } is bounded and increasing, it must converge by the
Completeness Axiom. Note that we have not found the limit of this sequence; we just know it is some
number between 1 and 2.
A sequence {xn } may not be presented in the form xn = f (n) for some function f . As an example of
another way to define a sequence, consider the sequence {bn } defined by b1 = 1 and bn+1 = 1/(1 + bn ) for
all n ≥ 1. Using the formula for bn , it is easy to write out the first few terms of this sequence;

1 2 3 5 8
{bn } = 1, , , , , , · · · .
2 3 5 8 13

Sequences defined in this way, where successive terms depend on previous terms, are known as recursively
defined sequences. Since an explicit formula is not given, it can be difficult to find terms further down
the line; for instance, to find b1000 , we would need to successively list all of the terms b1 through b1000 (or
program a computer to do the work for us). However, in this specific case, you might notice the appearance
of the Fibonacci numbers and thus be able to prove that bn = fn /fn+1 for all n. Since there is a formula for
fn (see the exercises in Section 3.1), we could then find b1000 more quickly.
3.4 The Completeness Axiom 141


Consider the sequence {cn } defined recursively by c1 = 1 and cn+1 = 1 + cn for n ≥ 1. The first three
√ √ √
terms of this sequence are 1, 2, and 1 + 2 or, to the nearest thousandth, 1.000, 1.414, and 1.554. We
will prove that {cn } is increasing and bounded. In particular, we will use the Principle of Mathematical
Induction to prove that cn < cn+1 for all n and that cn < 2 for all n. To prove that cn < cn+1 for all n, we
begin by noting that c1 < c2 . Suppose that ck < ck+1 for some positive integer k. We then have

√ √
ck < ck+1 ⇒ 1 + ck < 1 + ck+1 ⇒ 1 + ck < 1 + ck+1 ⇒ ck+1 < ck+2 ,

which is the desired inequality for n = k + 1. By the PMI, we find that cn < cn+1 for all n, that is, the
sequence {cn } is increasing. We now move on to a proof that cn < 2 for all n. Once again, it is clear that
c1 < 2. Suppose that ck < 2 for some positive integer k. Then

√ √ √
ck+1 = 1 + ck < 1 + 2 = 3 < 2,

and it follows from the PMI that cn < 2 for all n. Since {cn } is bounded and increasing, the Completeness
Axiom guarantees that it is a convergent sequence.
For recursively defined sequences, it is often possible to find the limit of the sequence once it is known
to exist. We will illustrate the basic idea with the sequence {cn } defined in the previous paragraph. Let L
√ √
be the limit of {cn }. By properties of sequences, the sequence { 1 + cn } converges to 1 + L. Also, the
sequence {cn+1 } converges to L since it is just the sequence {cn } without the first term. Hence, the equality
√ √
cn+1 = 1 + cn yields L = 1 + L or L2 − L − 1 = 0 when we take the limit as n goes to infinity. Since L
( √ )
must be positive, the sequence {cn } converges to L = 1 + 5 /2.

Exercises

n
1. Prove that the sequence {An } defined by An = 4k/(k3 + 1) for each positive integer n converges. (Compare
k=1
this sequence with the sequence {an } considered in the text.)
1 1 1 1
2. For each positive integer n, let xn = + + + ··· + . Prove that the sequence {xn } converges.
n+1 n+2 n+3 2n

n
3. Prove that the sequence {sn } does not converge, where sn = k−3/4 .
k=1

4. Consider the sequence {bn } defined in the third paragraph of this section. Suppose that it has been shown that
{bn } converges. Find the limit of the sequence.
rn 5
5. Let r1 = 8 and rn+1 = + for each n ≥ 1. Assuming that {rn } converges, find its limit.
2 rn
6. Let d1 = 2 and dn+1 = 3 + (dn /8) for each n ≥ 1. Prove that {dn } converges, then find its limit.
7. Define a sequence {an } by a1 = 1 and an+1 = 3 − (1/an ) for n ≥ 1. Prove that 1 ≤ an ≤ 3 for all n, then prove
that {an } is an increasing sequence. Conclude that {an } converges and find its limit.

8. Let z1 = 3 and let zn+1 = 3 + zn for n ≥ 1. Prove that the sequence {zn } converges, then find its limit.
142 Chapter 3 Sequences and Series

3.5 Infinite Series

Given a sequence {ak }, an infinite series (or simply, a series) is an expression of the form



ak = a1 + a2 + a3 + a4 + a5 + a6 + · · ·,
k=1

which represents the sum of all of the terms of the sequence {ak }. As it is not possible to actually perform
an infinite number of additions, some other approach is required to make sense of this expression. For each
∑n
positive integer n, let sn = ak . The sequence {sn } is known as the sequence of partial sums of the
k=1
series. A series converges if and only if its corresponding sequence of partial sums converges. The sum of


a convergent series is the limit of its sequence of partial sums: ak = lim sn . If the sequence of partial
k=1 n→∞
sums does not converge, then the series is said to diverge. Consider the following four examples.
i. If {ak } = {1}, then {sn } = {n}. Note that {ak } converges to 1 and that {sn } does not converge.
ii. If {ak } = {2−k }, then {sn } = {1 − 2−n }. Note that {ak } converges to 0 and that {sn } converges to
1. (Refer to Exercise 2.1.7 to verify the formula for sn .)
√ {∑ n √ }
iii. If {ak } = {1/ k}, then {sn } = 1/ k . Note that {ak } converges to 0 but that {sn } does not
k=1
converge. (Refer to Exercise 3.2.6 to see that the sequence {sn } is unbounded.)
{ 1 1 } {∑n (
1 1 )} 1 1
iv. If {ak } = − , then {sn } = − = − . In this case, we see
k+1 k+2 k+1 k+2 2 n+2
k=1
that {ak } converges to 0 and that {sn } converges to 1/2.
Unlike examples (i), (ii), and (iv), the sequence of partial sums cannot usually be expressed in a simple form.
Thus, an important but challenging problem is to determine when a particular infinite series converges. As
a start, it should be intuitively clear that the terms of a convergent series must converge to 0.



THEOREM 3.9 Let ak be an infinite series.
k=1


(a) If ak converges, then lim ak = 0.
k=1 k→∞


(b) (Divergence Test) If lim ak does not exist or lim ak ̸= 0, then ak diverges.
k→∞ k→∞ k=1



Proof. Suppose that the series ak converges. Then its corresponding sequence {sn } of partial sums
k=1
converges to some number S, and we find that lim (sn − sn−1 ) = S − S = 0. For n ≥ 2,
n→∞

sn − sn−1 = (a1 + a2 + · · · + an ) − (a1 + a2 + · · · + an−1 ) = an .

It follows that lim an = 0, which proves (a). Part (b) is simply the contrapositive of part (a).
n→∞

The usual algebraic properties of scalar multiplication and addition are valid for infinite series.


∞ ∑

THEOREM 3.10 Let ak and bk be two convergent series. Then
k=1 k=1

∞ ∑
∞ ∑

a) the series cak converges and cak = c ak , where c is any real number;
k=1 k=1 k=1
∑∞ ∑
∞ ∑
∞ ∑

b) the series (ak + bk ) converges and (ak + bk ) = ak + bk .
k=1 k=1 k=1 k=1
3.5 Infinite Series 143

Proof. A proof of part (b) is summarized in the following equations.


∞ ∑
n (∑
n ∑
n ) ∑
n ∑
n ∑
∞ ∑

(ak + bk ) = lim (ak + bk ) = lim ak + bk = lim ak + lim bk = ak + bk .
k=1 n→∞ k=1 n→∞ k=1 k=1 n→∞ k=1 n→∞ k=1 k=1 k=1

Be certain that you are able to justify each step in this equation.

An important class of infinite series are the geometric series, which are series of the form


ark = a + ar + ar2 + ar3 + ar4 + ar5 + · · · ,
k=0

where a and r are constants. Each term of the series is r times the previous term. To determine whether or
not this series converges, we need to look at its sequence {gn } of partial sums. Some algebra yields
( )
( ) ( ) a 1 − rn+1
gn − rgn = a + ar + · · · + ar n
− ar + ar + · · · + ar
2 n+1
= a − ar n+1
⇒ gn = .
1−r

If |r| < 1, the sequence {rn+1 } converges to 0 and the sequence {gn } converges to a/(1 − r). In other words
(pay particular attention to this), the sum of a geometric series is simply the first term of the series divided
by 1 − r, where r is the common ratio.



THEOREM 3.11 Geometric Series Let a be a nonzero real number. The geometric series ark
k=0
converges to a/(1 − r) if |r| < 1 and diverges if |r| ≥ 1.

Exercises
1. Find an explicit formula or pattern for the sequence of partial sums for the given series.

∞ ∑
∞ ∑∞ ( )
1 1 1 1
a) b) c) −
5 5k k+3 k+4
k=1 k=1 k=1

2. Prove that each of the following series diverges.




k2 ∑

6 ∑∞
1
a) 2
b) √
k
c) 3/4
5k + 60 3 + 10 k
k=1 k=1 k=1



3. Find the sum of each of the following series: for (c), assume that |b| < 1; for (i), use k 2 2−k = 6.
k=1

a) 1
3
− 2
9
+ 4
27
− 8
81
+ ··· b) 0.4 + 0.04 + 0.004 + · · · c) b2 − b4 + b6 − b8 + · · ·


1 ∑

(−1)k 2k ∑

6k
d) e) f)
4k 5k−1 102k−1
k=1 k=1 k=2



2 k + 3k ∑

2 · 3k − (−2)k+1 ∑

3k2 + 5
g) h) i)
4k 6k−1 2k
k=1 k=1 k=2



4. Prove that the sequence {ak } converges if and only if the series (ak − ak+1 ) converges.
k=1

∞ 2n + 5
5. Let ak be an infinite series and suppose that sn = for all n ≥ 1, where {sn } is its corresponding
k=1 3n − 4
sequence of partial sums. Find a1 , a2 , a10 , and the sum of the series.
144 Chapter 3 Sequences and Series

3.6 The Integral Test



If all of the terms of a series ak are positive, then its corresponding sequence {sn } of partial sums is
k=1
increasing. To prove that {sn } converges in this case, it is sufficient to prove that {sn } is bounded (see
the Completeness Axiom). One way to do this involves relating the partial sums to the area under a curve.
Suppose there is a function f such that f (k) = ak for all positive integers k and that f is continuous and
decreasing on the interval [1, ∞). Using the properties of the integral or the interpretation of the integral as
area, we obtain the inequalities listed to the right of the following graph. Do not let the inequalities frighten
you. The first inequality is just comparing the areas of rectangles to the area under the curve; some are
overestimates and some are underestimates. The second inequality merely adds the inequalities listed in the
first line with some care needed to match up the subscripts. The third inequality uses a property of integrals
to combine the individual integrals and summarizes what has been found. This last inequality shows that
∫∞
the sequence {sn } is bounded if and only if the improper integral 1 f (x) dx converges; a fact known as the
Integral Test. It is stated a little more generally in the next theorem but do note that typically K = 1 .
y
.
...... ∫ k+1 ∫ k
... ..
... ak+1 ≤ f (x) dx ≤ ak ≤ f (x) dx;
.... y = f (x) k k−1
ak−1 ...... n ∫
∑ ∑
n n ∫

........ k+1 k
.................................................... f (x) dx ≤ ak ≤ a1 + f (x) dx;
................................
... ..............
ak
................................... ....................
... k k−1
ak+1 ...
.
...
...
.
....
.. ...
...
. ....... ∫ n
k=1
∫ n+1
k=1
∫ n
k=2

... ... ...


...
...
..
...
...
... f (x) dx ≤ f (x) dx ≤ sn ≤ a1 + f (x) dx.
... ... ...
... ... ... 1 1 1
... .. ... .....
........... x
k−1 k k+1



THEOREM 3.12 Integral Test Let ak be a series with positive terms and let f be a function
k=1
that satisfies f (k) = ak for all k. Suppose that f is continuous and decreasing for all x ≥ K for some integer
∑∞ ∫∞
K. Then the series ak converges if and only if the improper integral K f (x) dx converges.
k=1

The following two examples illustrate how easy the Integral Test is to apply.
∑∞
1
1) The series 3/2
converges since the function 1/x3/2 is continuous and decreasing for x ≥ 1 and
k
∫ ∞ k=1
b
1 −1/2
dx = lim −2x = 2. By looking back at the equations used to prove the Integral
1 x3/2 b→∞ 1
Test, the reader should convince themselves that the sum of this series is between 2 and 3.

∑ 1
2) The series 1/3
diverges since the function 1/x1/3 is continuous and decreasing for x ≥ 1 and
k
∫ ∞ k=1
1 3 2/3 b
dx = lim x = ∞.
1 x1/3 b→∞ 2 1

In these examples, the function related to the series is clearly continuous and decreasing; for more complicated
series, it may be necessary to do some work to verify that the function is decreasing. Of course, the Integral
Test is only useful when the terms of the series correspond to a function that can be easily integrated.
3.6 The Integral Test 145

In addition to the geometric series, another important collection of infinite series are series of the form
∑∞
1
; such series are known as p-series. As the two examples indicate (p = 3/2 and p = 1/3), the Integral
kp
k=1
Test provides an easy way of determining when a p-series converges.



THEOREM 3.13 A p-series 1/k p converges if p > 1 and diverges if p ≤ 1.
k=1

Proof. The series clearly diverges if p ≤ 0 since the terms do not go to 0 in this case. When p is positive,
the sequence {1/k p } is a decreasing sequence that converges to 0 and the function 1/xp is continuous and
decreasing on [1, ∞). For p ̸= 1, we find that
∫ ∞
1 1 1 b 1 ( 1 )
dx = lim · = lim − 1 .
1 xp b→∞ 1 − p xp−1 1 b→∞ 1 − p bp−1

If p > 1, then the limit exists and has a value of 1/(p − 1); the Integral Test then indicates that the series
converges. If 0 < p < 1, then the limit does not exist (it has a value of ∞) so the series diverges for these
values of p. We leave the special case p = 1 as an exercise (see Exercise 1b).

In contrast to the geometric series, there is no simple method for finding the sum of a p-series. A number
of interesting techniques, often involving power series, Fourier series, or complex variables, have determined
the sum of a p-series for some values of p. For example, it has been shown that

∑ ∞

1 π2 1 π4
2
= and 4
= .
k 6 k 90
k=1 k=1

It may come as a surprise that the number π appears in these sums. One method for verifying the first sum
can be found in the supplementary exercises of this chapter.

Exercises
1. Use the Integral Test to determine whether or not the series converges.
∑∞
1 ∑

1 ∑

k ∑

1
a) b) c) d)
k 4/3 k ek k2 + 1
k=1 k=1 k=1 k=1



1 ∑

1
2. Use the Integral Test (with K = 2) to show that diverges and converges.
k ln k k(ln k)2
k=2 k=2

n
1
3. Use the ideas behind the Integral Test to estimate an integer n so that > 20.
k
k=1
4. Find all values of p > 0 for which the series converges.
∑∞
1 ∑

p ∑

1 ∑

1
a) p2
b) c) 4p−p2
d)
k kp k k(ln k)p
k=1 k=1 k=1 k=2

n ∫
1 1 1 dx
5. For each positive integer n, let zn = 1 + + + ··· + − . Use the ideas in this section to prove that
2 3 n 1 x
{zn } is a decreasing sequence of positive terms and thus convergent.
146 Chapter 3 Sequences and Series

3.7 Comparison Tests



As we saw in the last section, if all of the terms of a series ak are positive, then the corresponding sequence
k=1
{sn } of partial sums is increasing. It follows that the series converges if and only if {sn } is bounded. The
two comparison tests stated in this section provide one way to determine when this occurs.

THEOREM 3.14 Comparison Test Suppose that 0 ≤ ak ≤ bk for all k ≥ K, where K ∈ Z+ .



∞ ∑

a) If bk converges, then ak converges.
k=1 k=1
∑∞ ∑

b) If ak diverges, then bk diverges.
k=1 k=1

Proof. To make the proof easier to follow, we assume that K = 1. Let {sn } and {tn } be the sequences of

∞ ∑
∞ ∑

partial sums for the series ak and bk , respectively. Both of these sequences are increasing. If bk
k=1 k=1 k=1
converges, then {tn } is bounded. Since 0 ≤ sn ≤ tn for all n, the sequence {sn } is also bounded. It follows


that the series ak converges. This proves (a); part (b) is the contrapositive of (a).
k=1

In less concise but more user friendly language, the Comparison Test states that for positive term series,
less than convergent is convergent and greater than divergent is divergent. In order to use the Comparison
Test, we need a large collection of series that are known to converge or diverge: the geometric series and the
p-series are the most common choices. Consider the following two examples.

∑ ∑ 1 ∞
k2 k2 k2 1
1. 5 4
converges since 3
converges and 5 4
≤ 5 = 3 for all k ≥ 1.
k + 3k + 2 k k + 3k + 2 k k
k=1 k=1
∞ √ ∞ √ √
∑ k ∑ 1/3 k k 1/3
2. diverges since √ diverges and ≥ = √ for all k ≥ 5.
2k + 5 k 2k + 5 3k k
k=1 k=1

Note that care (even perhaps extreme care) must be taken with the inequalities that occur in the Comparison
Test. If you think the series converges, then you must show that its terms are smaller than the terms of a
known convergent series. Similarly, if you think the series diverges, then you must show that its terms are
larger than the terms of a known divergent series.
The following variation on the Comparison Test avoids the use of inequalities; this is a real advantage
in many cases. Read the statement of the theorem carefully since the inequality involving α is not the same
in each part of the test.


∞ ∑

THEOREM 3.15 Limit Comparison Test Let ak and bk be two series for which ak and bk
k=1 k=1
are positive for all k ≥ K and suppose that α = lim (ak /bk ) “exists”, either as a number or ∞.
k→∞

∞ ∑∞
a) If bk converges and 0 ≤ α < ∞, then ak converges.
k=1 k=1
∑∞ ∑

b) If bk diverges and 0 < α ≤ ∞, then ak diverges.
k=1 k=1



Proof. Suppose that the series bk converges and that 0 ≤ α < ∞. Since the sequence {ak /bk }
k=1
converges, it is bounded. Consequently, there exists a positive number M such that |ak /bk | ≤ M for all k.
3.8 Absolute Convergence 147



Since 0 ≤ ak ≤ M bk for all k ≥ K, the series ak converges by the Comparison Test. The proof of part
k=1
(b) is similar.


To illustrate the Limit Comparison Test, consider the series k 2 /(k 3 + 5k + 2). For large values of k,
k=1
the numbers 5k and 2 are small in comparison to k 3 , so the terms of the series are similar to k 2 /k 3 = 1/k


when k is large. The series 1/k diverges and
k=1

( k2 1) k3
lim 3
÷ = lim 3 = 1.
k→∞ k + 5k + 2 k k→∞ k + 5k + 2



Hence, the series k 2 /(k 3 + 5k + 2) diverges by the Limit Comparison Test.
k=1

Exercises
1. Use the Comparison Test to determine whether or not the series converges.
∑∞
1 ∑∞
2 ∑∞
5
a) b) √ c) √
3k2 + 2k + 6 k + 6 k k 3 + 8k
k=1 k=1 k=1
2. Use the Limit Comparison Test to determine whether or not the series converges.


k2 ∑∞
2k + 5 ∑

10
a) b) c) √
5k4 + 2k3 − 3 k2 − 5k + 8 3 2
k + 4k
k=1 k=1 k=1
3. Determine whether or not the series converges.


k−4 ∑∞
k4 + 5k2 ∑

k2 + 3
a) b) c)
k 2 − 3k + 7 3k6 − 7k5 + 6k − 1 12k2 + 8k + 1
k=1 k=1 k=2
∑∞ k
3 ∑

3k + k ∑

1
d) e) f)
2 k + 5k 2k + k4 ln k
k=1 k=1 k=2
4. Let {dk } be the sequence of all positive integers (listed in increasing order) that do not have the digit 0 in their
∑∞
decimal representation. (For instance, 125 is in the sequence but 105 is not.) Prove that the series 1/dk
k=1
converges and that the sum of the series is less than 90.

3.8 Absolute Convergence

The tests for convergence considered thus far require series with positive terms, or at least with only a
finite number of negative terms. For series with an infinite number of both positive and negative terms, the
following theorem is sometimes useful.


∞ ∑

THEOREM 3.16 If the series |ak | converges, then the series ak also converges.
k=1 k=1

Proof. The result follows from the inequality 0 ≤ ak + |ak | ≤ 2|ak | and the Comparison Test. There are
several missing details here; the reader should attempt to fill them in.

∑ cos 2k
To illustrate this theorem, consider the series . Due to the oscillating nature of the cosine
k3
k=1
function, this series has an infinite number of positive and negative terms. However, we easily note that
cos 2k ∑∞ ∑∞
1 1 cos 2k
3 ≤ 3 for all k. Since the series is a convergent p-series, the series converges by
k k k3 k3
k=1 k=1

∑ cos 2k
the Comparison Test. By the previous theorem, we know that the series converges.
k3
k=1
148 Chapter 3 Sequences and Series

As a result of this theorem, we are led to the following definition.



DEFINITION 3.17 Let ak be a given series.
k=1

∞ ∑

a) The series ak converges absolutely if |ak | converges.
k=1 k=1
∑∞ ∑
∞ ∑

b) The series ak converges conditionally if ak converges and |ak | diverges.
k=1 k=1 k=1

Given a series with positive and negative terms, it is a good idea to first determine whether or not the
series converges absolutely. There are more tests available for series with positive terms, and if the series
converges absolutely, it certainly converges. Note that two tests are necessary to conclude that a series
∑∞ ∑

converges conditionally; one to prove |ak | diverges and another to prove ak converges.
k=1 k=1
As we will discover when studying power series, many series that appear in applications have terms that
alternate signs. The following test is often useful for such series.



THEOREM 3.18 Alternating Series Test Consider the series (−1)k+1 ak , where ak > 0 for all
k=1


k ≥ K. If ak+1 ≤ ak for all k ≥ K and lim ak = 0, then the series (−1)k+1 ak converges.
k→∞ k=1

Proof. We will assume that K = 1. Let {sn } be the sequence of partial sums of the alternating series.
The hypotheses of the theorem indicate that the terms of this sequence are distributed on the number line
as follows:

0 s2 s4 s6 s5 s3 s1

Thus, the sequence {s2n } is bounded and increasing, and {s2n−1 } is bounded and decreasing. By the
Completeness Axiom, each of these sequences converges. Since s2n = s2n−1 − a2n and {a2n } converges to 0,
the sequences {s2n } and {s2n−1 } have the same limit. Therefore, the sequence {sn } converges.

∑ (−1)k+1
The Alternating Series Test is usually very easy to apply. For example, the series converges
k
k=1
by the Alternating Series Test since {1/k} decreases to 0. This series thus provides a simple example of a
conditionally convergent series.

∑ (−1)k+1 k
Let’s consider a more involved example: determine if the series is absolutely convergent,
4k 2 − 1
k=1
∑∞
k
conditionally convergent, or divergent. The first step is to consider the series . Since the series
4k 2 − 1
k=1
∑∞
1 ( k 1) 1 ∑∞
k
is a divergent p-series and lim ÷ = , we see that diverges by the Limit
k k→∞ 4k 2 − 1 k 4 4k 2 − 1
k=1 k=1
Comparison Test. This shows that our given series is not absolutely convergent. However, the sequence
{k/(4k 2 − 1)} converges to 0 and is decreasing. (To verify the latter, note that

d ( x ) 4x2 − 1 − 8x2 4x2 + 1


= =− <0
dx 4x − 1
2 (4x − 1)
2 2 (4x2 − 1)2
3.8 Absolute Convergence 149

∑ (−1)k+1 k
for all x ≥ 1.) It follows that the series converges by the Alternating Series Test. Putting
4k 2 − 1
k=1
these two facts together shows that our original series is conditionally convergent. As mentioned previously,
note that two tests are necessary to conclude that a series is conditionally convergent.
As a final comment, it is important to realize that a sequence of positive numbers can converge to 0 but
not be a decreasing sequence. As one example, consider the sequence {ak } given by

1 1 1 1 1 1 1 1 1 1
1, 1, , , , , , , , , , , . . . ,
2 4 3 9 4 16 5 25 6 36

where the pattern for the terms should be clear. It is easy to see that {ak } converges to 0 and that the


sequence {ak } is not monotone. For the series (−1)k+1 ak , note that the partial sum corresponding to 2n
k=1
satisfies

2n ∑
n
1 ∑ 1
n
s2n = (−1)k+1 ak = − .
k k2
k=1 k=1 k=1

Since the first sum goes to ∞ (a divergent p-series) and the second sum converges to a number (a convergent


p-series), we see that the series (−1)k+1 ak does not converge. Hence, when using the Alternating Series
k=1
Test, it is important to verify that the appropriate sequence is decreasing.

Exercises
1. Show that the following series are absolutely convergent.


cos k ∑∞
2k sin k ∑∞
(−1)k k2 ∑∞
(−1)k arctan k
a) 2
b) k
c) d)
k 3 k4 + 10 k3/2
k=1 k=1 k=1 k=1

2. Determine whether or not the given series converges.




(−1)k+1 ∑∞
(−1)k+1 ∑

(−1)k+1 k ∑

ln k
a) b) √ c) d) (−1)k+1
3k + 7 4k − 1 4k − 1 k
k=1 k=1 k=1 k=1



(−1)k+1 k
3. Prove that the series is conditionally convergent.
2k2 + 3
k=1


4. Find all values of p > 0 for which the series (−1)k+1 /kp is conditionally convergent.
k=1

∞ ∑

5. Let ak be an absolutely convergent series. Prove that the series a2k converges.
k=1 k=1

∞ ∑

6. Give an example of a series for which ak converges but a2k diverges.
k=1 k=1

∞ ∑

7. Explain why (−1)k ak converges if and only if (−1)k+1 ak converges.
k=1 k=1
8. Determine if the series is absolutely convergent, conditionally convergent, or divergent.
∑∞
(−1)k+1 (k + 1) ∑∞
(−1)k k ∑∞
(−1)k k ∑∞
sin3 k
a) b) √ c) √ d)
4k 3 + 7k2 − 1 5k2 + 4 k3 − 4k 2 + 20 k2 + 3
k=1 k=1 k=1 k=1

∑∞
(−1)k+1 ∑∞
(−1)k+1
9. Consider the two series k
and . Does the Alternating Series Test apply to these series?
k=1
k 2+(−1)
k=1
k3+(−1)k
Do these series converge or diverge?
150 Chapter 3 Sequences and Series

3.9 Root and Ratio Tests

Two of the more useful tests for absolute convergence are the Ratio Test and the Root Test.



THEOREM 3.19 Ratio Test Let ak be a series and suppose that ℓ = lim ak+1 /ak exists. Then
k=1 k→∞
the series converges absolutely if ℓ < 1 and diverges if ℓ > 1.

Proof. Suppose ℓ < 1. Let r be a number that satisfies ℓ < r < 1. By the definition of a convergent

sequence, there exists a positive integer p such that ak+1 /ak < r for all k ≥ p. It follows that
( |a | ) ( |a | )
p p
|ap+1 | < r |ap | = r p+1
, |a p+2 | < r |a p+1 | < rp+2 ,
rp rp
( |a | ) ( |a | )
p p
|ap+3 | < r |ap+2 | < r p+3
, |a p+4 | < r |a p+3 | < rp+4 ,
rp rp
( ) ∞ (
∑ )
and, in general, |ak | < |ap |/rp rk for all k > p. Since the series |ap |/rp rk is a convergent geometric
k=1


series, the series ak converges absolutely by the Comparison Test.
k=1
Now suppose that ℓ > 1. By the definition of a convergent sequence, there exists a positive integer q

such that ak+1 /ak > 1 for all k ≥ q. It follows (as above) that |ak | > |aq | > 0 for all k > q, which indicates


that the sequence {ak } does not converge to 0. By the Divergence Test, the series ak diverges.
k=1


∞ √
THEOREM 3.20 Root Test Let ak be a series and suppose that ℓ = lim k
|ak | exists. Then
k=1 k→∞
the series converges absolutely if ℓ < 1 and diverges if ℓ > 1.

Proof. The proof of this theorem will be left as an exercise.

The Ratio Test is a convenient test to use when factorials are involved since a lot of cancellation occurs


in the ratio of the terms. As an example, consider the series (−10)k /k!. Since
k=1
(−10)k+1 (−10)k 10k+1 k!
10
ℓ = lim ÷ = lim = lim = 0,
k→∞ (k + 1)! k! k→∞ 10k (k + 1)! k→∞ k + 1

the series converges absolutely by the Ratio Test. When using the Root Test, it is helpful to remember that
√ ∑

lim k k = 1. For example, the series 2k /k 6 diverges by the Root Test since
k→∞ k=1

2k 2
= lim √
k
ℓ = lim 6 k
= 2.
k→∞ k k→∞ ( k)6

For a slightly more complicated example involving these tests, consider the series

∑ 1 · 5 · 9 · · · · · (4k − 3) 1 1·5 1·5·9 1 · 5 · 9 · 13
= + + + + ···.
(−3)k k! (−3)1 1! (−3)2 2! (−3)3 3! (−3)4 4!
k=1

Due to the product, it is probably best to use the Ratio Test. Inverting and multiplying, we find that
1 · 5 · 9 · · · · · (4k + 1) (−3)k k!
4k + 1 4
ℓ = lim · = lim = .
k→∞ (−3)k+1 (k + 1)! 1 · 5 · 9 · · · · · (4k − 3) k→∞ 3(k + 1) 3
(Pay careful attention to the cancellation that occurs in this example.) Since ℓ > 1, the series diverges by
the Ratio Test.
3.9 Root and Ratio Tests 151

In essence, the Root and Ratio Tests compare the given series with a geometric series. (Recall that a
geometric series converges when |r| < 1.) These tests give divergence only when the terms of the series get
infinitely large. It follows that the Root Test and the Ratio Test will be inconclusive for a number of series,
that is, series for which the value of the limit ℓ is 1. Inconclusive means that we have learned nothing at
all; the series might converge absolutely or converge conditionally or not converge at all. It is not difficult
to show that any series whose terms can be expressed as rational or algebraic functions of k will give ℓ = 1.
For example, both of the series

∑ ∞

4k 2 + 3k − 2 4k + 3
(−1)k+1 · and √
k 3 − k 2 + 2k + 4 k5+ k3 + 8
k=1 k=1

give a value of ℓ = 1 in either the Ratio Test or the Root Test. When this occurs, some other test must
be used to determine whether or not the series converges. It is important to recognize this in advance so
that you do not spend time reaching a result that is inconclusive. In such cases, it is probably best to

∞ ∑

try one of the comparison tests on the series |ak |. If this series diverges, the series ak may still
k=1 k=1
converge conditionally. The only test that we have considered that is useful at this stage of the process is
the Alternating Series Test.

Exercises

∞ ∑

1. Show that ℓ = 1 when either the Ratio Test or the Root Test are applied to the series 1/k and 1/k2 .
k=1 k=1
2. Determine whether or not the following series converge.

∞ ∑

8k ∑

5k (k!)2
a) k5 5−k b) c)
k! (2k)!
k=1 k=1 k=1
∞ (
∑ )k ∑
∞ ∑

k k! (−2)k k!
d) e) f)
2k + 15 1 · 3 · 5 · · · · · (2k − 1) 4 · 7 · 10 · · · · · (3k + 1)
k=1 k=1 k=1

3. Determine if the series converges absolutely, converges conditionally, or diverges.


∑∞
(−1)k ∑

(−6)k+1 ∑

(−4)k+1
a) √ b) c)
k+4 k! k3k
k=1 k=1 k=1



(−1)k ∑

(−1)k+1 k ∑

k3
d) e) f) (−1)k+1 k
k2 − 3k − 7 k2 + 5 5
k=1 k=1 k=1

4. Prove the Root Test. (The proof is similar to that of the Ratio Test, but less complicated.)

∞ k
5. Show that the Ratio Test is inconclusive for the series 2−k+1+(−1) , but that the Root Test shows that the
k=1
series converges. Hint: Write out the first six terms of the series.
6. Find all values of x for which the series converges absolutely. Hint: Express ℓ as a function of x.


k 2 xk ∑

(−1)k+1 2k k ∑

(−1)k+1 k
a) b) x c) (x − 2)k
2k k! 5k
k=1 k=1 k=1
152 Chapter 3 Sequences and Series

3.10 Power Series

Let {ck }∞
k=0 be a sequence of real numbers and let a be a real number. A power series centered at a is an

expression of the form




ck (x − a)k = c0 + c1 (x − a) + c2 (x − a)2 + c3 (x − a)3 + c4 (x − a)4 + · · · .
k=0

The constants ck are known as the coefficients of the power series and the number a is called the center
of the power series. As we will see, a power series looks and behaves like an infinite degree polynomial.
A power series represents a function of x; for each fixed value of x, a power series becomes an infinite
series of real numbers and the sum of the series is the value of the function at x. The Root Test or the Ratio
Test can be used to determine the domain of a function defined as a power series. As an example, consider
the following power series centered at 3:

∑ 1 1 1 1
(x − 3)k = 1 + (x − 3) + (x − 3)2 + (x − 3)3 + · · · .
(2k + 1) 4k 3·4 5 · 42 7 · 43
k=0

To determine the values of x for which this power series converges, we can apply the Root Test (in this case,
we could just as easily use the Ratio Test);
1/k |x − 3| |x − 3|
1 k
lim k
(x − 3) = lim √
k
= .
k→∞ (2k + 1) 4 k→∞ 4 2k + 1 4

The power series thus converges absolutely when |x − 3| < 4 and diverges when |x − 3| > 4. The convergence
of the power series at the points x = −1 and x = 7 has not been determined since the Root Test does not
give any information at these points.

∑ (−1)k
For x = −1, the series is , which is convergent (use the Alternating Series Test);
2k + 1
k=0

∑ 1
for x = 7, the series is , which is divergent (use the Comparison Test).
2k + 1
k=0
Therefore, the power series converges for all x in the interval [−1, 7). To phrase this conclusion in another
way, the domain of the function defined by this power series is the interval [−1, 7).
In general, the set of values of x for which a power series converges is an interval centered at the center
of the power series. The previous example illustrates this property quite well and provides a clear suggestion
as to why it is true. A general proof of the following theorem is more complicated and will be omitted.



THEOREM 3.21 Given a power series ck (x − a)k , one of the following occurs:
k=0
a) for some ρ > 0, the series converges absolutely if |x − a| < ρ and diverges if |x − a| > ρ;

b) the series converges absolutely for all real numbers;

c) the series converges only for x = a.

The number ρ is called the radius of convergence of the power series; this number is assumed to be
∞ in part (b) and to be 0 in part (c). The radius of convergence of a power series can usually be found using
3.10 Power Series 153

the Root Test or the Ratio Test. The set of all x values for which the power series converges is known as the
interval of convergence of the power series. To determine the interval of convergence, it is necessary to
check the endpoints separately (see the example prior to the theorem). Since these are the points for which
the Root Test and the Ratio Test provide no information, some other test is required. It may seem like a lot
of work to determine whether or not two additional points are in the domain of the function, but the series
associated with these points are often the most interesting ones. As for the power series considered earlier
in this section, its radius of convergence is 4 and its interval of convergence is [−1, 7).

Exercises
1. Find the radius of convergence of the given power series.


1 ∑

2k ∑

(k + 2)3k k
a) (x − 2)k b) (x + 1)k c) x
3k2 k! 5k+1
k=1 k=0 k=0

2. Find the interval of convergence of the given power series.




3 ∑

(−1)k+1 k ∑

2k
a) xk b) (x − 4)k c) (x + 2)k
k 2 5k 3k k+3
k=1 k=1 k=0



3. Suppose ck (x − 3)k converges when x = 8 and diverges when x = 9.
k=0
a) Determine, with justification or an explanation that there is not √
sufficient information, whether or not the
power series converges at the following values of x: x = 4.4, x = 110, x = 8.5, x = 0, and x = −2.
b) Determine the smallest possible interval of convergence for this power series (think about endpoints).
c) Determine the largest possible interval of convergence for this power series (think about endpoints).
4. The interval of convergence of a power series is [−2, 9). Find its center and radius of convergence.
5. Find the interval of convergence of the given power series.


1 k ∑

1 k ∑

(−1)k k ∑

a) x b) x c) x d) xk
k2 k k
k=1 k=1 k=1 k=1

6. Give an example of a power series with the specified interval of convergence.


a) [−3, 3] b) [2, 8) c) (−4, 12) d) (−3, 1]
7. Find (in more familiar terms) the function represented by the given power series. In other words, find the sum
of the series for all values of x in its interval of convergence. Hint: Use the geometric series.


1 k ∑
∞ ∑

1
a) x b) (−2)k (x + 3)k c) (x − 1)k
2k 3k
k=0 k=0 k=1
154 Chapter 3 Sequences and Series

3.11 Properties of Power Series

Since power series look very much like polynomials, it is tempting to treat them like polynomials when it
comes to the calculus operations of differentiation and integration. This important property of power series
is recorded in the next theorem; the technical proof will be omitted.

THEOREM 3.22 Power series can be differentiated and integrated term by term, just like polynomials.
The resulting power series have the same radius of convergence as the original series.

In symbols, this theorem states that for the function f defined by




f (x) = c0 + c1 (x − a) + c2 (x − a)2 + c3 (x − a)3 + c4 (x − a)4 + · · · = ck (x − a)k ,
k=0


f ′ (x) = c1 + 2c2 (x − a) + 3c3 (x − a)2 + 4c4 (x − a)3 + · · · = kck (x − a)k−1 , and
k=1
∫ x ∑ ck ∞
c1 c2 c3
f (t) dt = c0 (x − a) + (x − a)2 + (x − a)3 + (x − a)4 + · · · = (x − a)k+1 .
a 2 3 4 k+1
k=0

The theorem also asserts that all three of these power series have the same radius of convergence.
One application of this property of power series is finding formulas for the sums of certain series. The
geometric series has one of the simplest formulas:

∑ ∞
1
= 1 + x + x2 + x3 + x4 + · · · = xk , valid for |x| < 1.
1−x
k=0

Integrating both sides of this equation yields the formula




1 1 1 1 k
− ln 1 − x = x + x2 + x3 + x4 + · · · = x , valid for |x| < 1.
2 3 4 k
k=1

Differentiating both sides of the geometric series formula, then multiplying by x yields

∑ ∞
x
= x + 2x2 + 3x3 + 4x4 + · · · = kxk , valid for |x| < 1.
(1 − x)2
k=1

These examples indicate that many familiar functions can be represented as power series; see the next section
for more on this topic. To illustrate how the power series just obtained can be used to find sums of series,
note that

∑ 3(−1)k+1

∑ 1 ( 1 )k 1


k 1 1/4 1
= −3 − = 3 ln 1 + = 3 ln(3/2) and = ·( )2 = .
k2k k 2 2 4k+1 4 1 − (1/4) 9
k=1 k=1 k=1

As another example involving geometric series, consider carefully the following set of equalities:

x ∑( 2x )k ∑ 2k xk+1
∞ ∞
∑ 2k−1 ∞
x x 1
= · = = = xk .
3 − 2x 3 1 − (2x/3) 3 3 3 k+1 3k
k=0 k=0 k=1

These equations are valid for all values of x for which |x| < 3/2.
3.11 Properties of Power Series 155



Suppose that ck xk has radius of convergence ρ > 0. The function f defined by this power series can
k=0
be differentiated any number of times to obtain

f (x) = c0 + c1 x + c2 x2 + c3 x3 + c4 x4 + c5 x5 + c6 x6 + · · · ; f (0) = c0 ;

f ′ (x) = c1 + 2c2 x + 3c3 x2 + 4c4 x3 + 5c5 x4 + 6c6 x5 + · · · ; f ′ (0) = c1 ;

f ′′ (x) = 2c2 + 6c3 x + 12c4 x2 + 20c5 x3 + 30c6 x4 + · · · ; f ′′ (0) = 2c2 ;

f (3) (x) = 6c3 + 24c4 x + 60c5 x2 + 120c6 x3 + · · · ; f (3) (0) = 6c3 ;

f (4) (x) = 24c4 + 120c5 x + 360c6 x2 + · · · ; f (4) (0) = 24c4 ;

and so on indefinitely. A few moments of reflection reveals a pattern in the value of the derivatives evaluated
at the center. These observations lead to the following result.



THEOREM 3.23 Let f (x) = ck (x − a)k , where the power series has radius of convergence ρ > 0.
k=0
Then f is infinitely differentiable on (a − ρ, a + ρ) and f (k) (a) = k! ck for each k ≥ 0.

This theorem shows how the derivatives of a function f defined by a power series are related to the
coefficients of the power series: the coefficient of the term (x − a)k is f (k) (a)/k!. For example,

∑ k+1 k+1 (k + 1)!
if f (x) = (x − 4)k , then f (k) (4) = k! · = .
2k 2k 2k
k=0

We will see more interesting uses of this formula in the next section.

Exercises
1. Use the geometric series to find a power series expression (centered at 0) for the given function. Determine the
interval of convergence for each power series.
1 x 1
a) f (x) = b) f (x) = c) f (x) =
1 − 4x 1 + 2x 5−x
x 1
d) f (x) = e) f (x) = arctan x f ) f (x) =
1 + x2 (1 − x)3


2. By differentiating a power series found in this section, find a formula for the sum of the series k2 xk .
k=1


3. By making a change of indices, rewrite each series in the form ck xk . (The power on x is k).
k=K


k k−1 ∑

5k ∑

(−1)k (k − 1) k−2
a) x b) xk+1 c) x
2k (2k + 1)! 4k+1
k=1 k=1 k=2

4. Use power series found in this section to find the sum of the given series. Be careful on part (c).


5 ∑

k3k ∑

(−1)k+1
a) b) c)
k2k 4k 4k (k − 1)
k=1 k=1 k=2


5. Find coefficients ck so that the power series ck xk is unchanged when it is differentiated.
k=0
156 Chapter 3 Sequences and Series

3.12 Taylor Series



Suppose that the power series ck (x − a)k has radius of convergence ρ > 0. We saw in the last section
k=0
that the function f defined by this power series is infinitely differentiable on the interval (a − ρ, a + ρ) and
that f (k) (a) = k! ck for each integer k ≥ 0. Hence, the derivatives of f at the center of the power series
are related to the coefficients of the power series. Turning things around, we can start with an infinitely
∑∞
f (k) (a)
differentiable function f , find its derivatives, then write the resulting power series (x − a)k . This
k!
k=0
series is called the Taylor series of f centered at a or, in the frequently occurring case in which a = 0, the
Maclaurin series of f . (For the record, it is possible that a function and its corresponding Taylor series
are not equal. However, we will not consider such functions.)
The most direct way to express a function f as a Taylor series centered at a is to find a formula for
(k)
f (a). If the derivatives of a function have a particularly simple pattern, it is not too hard to do this. For
example, since the functions ex , sin x, and cos x have derivatives that form simple patterns, it is relatively
easy to determine their Maclaurin series; the details will be left to the reader. Note the similarities in the
Maclaurin series for these functions.

∑ 1 k 1 1
ex = x = 1 + x + x2 + x3 + · · ·
k! 2 6
k=0
∑∞
(−1)k 1 1 5 1
sin x = x2k+1 = x − x3 + x − x7 + · · ·
(2k + 1)! 6 120 5040
k=0
∑∞
(−1)k 2k 1 1 4 1 6
cos x = x = 1 − x2 + x − x + ···
(2k)! 2 24 720
k=0

The radius of convergence of each of these series is ∞, that is, these series converge for all values of x.

As a more complicated example, we will find the Taylor series centered at 8 for the function f (x) = 3
x.
The first step is to write out the first few derivatives and try to spot a pattern.

f (x) = x1/3
1
f ′ (x) = x−2/3
3
1 2 for k ≥ 2,
f ′′ (x) = − · x−5/3 2 · 5 · 8 · · · · · (3k − 4) −(3k−1)/3
3 3 f (k) (x) = (−1)k+1 · x
1 2 5 −8/3 3k
f (x) = · · x
(3)
3 3 3 2 · 5 · 8 · · · · · (3k − 4)
f (k) (8) = (−1)k+1 ·
1 2 5 8 3k 23k−1
f (x) = − · · · x−11/3
(4)
3 3 3 3
1 2 5 8 11 −14/3
f (x) = · · · ·
(5)
x
3 3 3 3 3
Notice that when searching for a pattern, it is best not to multiply out terms. Even so, it takes some practice
to learn how to identify the patterns that appear. Since in this case, the first two terms of the Taylor series
do not fit the pattern, they are pulled out separately. We thus obtain


√ 1 2 · 5 · 8 · · · · · (3k − 4)
3
x=2+ (x − 8) + (−1)k+1 · (x − 8)k .
12 3k 23k−1 k!
k=2
3.12 Taylor Series 157

The reader should verify that the radius of convergence of this power series is 8.
It is often possible to find the Maclaurin series for a function by using the known Maclaurin series of
another function and thus avoid finding a pattern for the derivatives. For example, composition of functions
and term by term operations yield all of the following results:

∑ ∞

1 k 3k
ex = x ; e3x = xk ;
k! k!
k=0 k=0
∑∞ k ∞
∑ ∑∞
3 k+2 3k−2 3k k
x2 e3x = x = xk ; e3x − 1 = x ;
k! (k − 2)! k!
k=0 k=2 k=1
∞ ∞ ∫ ∞
e3x − 1 ∑ 3k k−1 ∑ 3k+1 x
e3t − 1 ∑ 3k
= x = xk ; dt = xk .
x k! (k + 1)! 0 t k · k!
k=1 k=0 k=1

We have thus found power series representations centered at 0 for all these functions. However, a cautious
person would ask whether or not these are actually the Maclaurin series of the functions. This is a valid
question since the Maclaurin series formula was not used to obtain these series. But a function f can have
only one power series representation centered at a (the coefficients must be of the form f (k) (a)/k!), so any
method that generates a power series centered at a for a function f actually finds the Taylor series for f
centered at a.

Exercises
1. Use known series to find the Maclaurin series for the given function.
a) f (x) = e2x b) f (x) = sin 5x c) f (x) = cos x2
( )
d) f (x) = x2 cos(4x) e) f (x) = xe−x f ) f (x) = sin x − x /x3
∫x ( )
g) f (x) = 0 sin t/t dt h) f (x) = ex + e−x /2 i) f (x) = cos2 x

2. Use known Maclaurin series to determine in more familiar terms the function represented by the power series.


(−1)k k ∑

(−1)k 2k ∑

(−2)k 2k+1
a) x b) x c) x
k! k! (2k)!
k=0 k=0 k=0

3. Find the indicated Taylor series and determine its radius of convergence.
√ √
a) f (x) = x, a = 9 b) f (x) = 3 x, a = 1 c) f (x) = 1/(3 − x), a = 5

4. Find the Maclaurin series for the function f (x) = 1/ 1 − x.
5. For the given function, find f (100) (0). Hint: First find the Maclaurin series of the function.
( )
a) f (x) = xe−2x b) f (x) = x sin x3 c) f (x) = 1 − cos x2 /x4

6. Explain why the function x17/3 does not have a Maclaurin series.

7. Let i = −1 and assume that i behaves just like any other constant. Use known Maclaurin series to prove that
eix = cos x + i sin x. Use this equation to prove that eiπ + 1 = 0 and also to find ii .
158 Chapter 3 Sequences and Series

3.13 Taylor's Theorem

Suppose that a function f can be expressed as a Taylor series centered at a. It follows that the partial sums of
∑n
f (k) (a)
the Taylor series can be used to approximate the function. That is, f (x) ≈ (x − a)k = Pn (x), with
k!
k=0
the approximation to f improving as n increases. The polynomial Pn is called the nth Taylor polynomial
of f at a. Since polynomials can be evaluated using only the operations of addition and multiplication
(operations that can be performed by hand or easily programmed into a computer), a Taylor polynomial
provides a good way for approximating the values of a function. However, if the exact value of a function
is not known, it is difficult to determine when an approximation is accurate. The following theorem, a
generalization of the Mean Value Theorem, provides one measure of the accuracy of a Taylor polynomial
approximation.

THEOREM 3.24 Taylor’s Formula Suppose that f is defined on some open interval I that contains
the point a and let n be a positive integer. If f (n+1) exists on I, then for each x ∈ I there exists a point z
between x and a such that


n
f (k) (a) f (n+1) (z)
f (x) = (x − a)k + (x − a)n+1 .
k! (n + 1)!
k=0

Proof. Fix x ∈ I with x ̸= a and define a function F on I by


n
f (k) (t)
F (t) = (x − t)k + B(x − t)n+1
k!
k=0

for each t ∈ I, where the constant B is chosen so that F (a) = f (x). Note that F (t) is like the Taylor series
of f centered at t and evaluated at x. By the hypotheses, the function F is continuous and differentiable on
the interval with endpoints x and a, and F (x) = f (x) = F (a). By the Mean Value Theorem, there exists
a point z between x and a such that F ′ (z) = 0. Differentiating F using the product rule, then simplifying
yields
f (n+1) (t)
F ′ (t) = (x − t)n − (n + 1)B(x − t)n .
n!
f (n+1) (z)
(The details will be left as an exercise.) Since F ′ (z) = 0 and z ̸= x, we find that B = . Hence,
(n + 1)!


n
f (k) (a) f (n+1) (z)
f (x) = F (a) = (x − a)k + (x − a)n+1 .
k! (n + 1)!
k=0

Taylor’s Formula provides a way to estimate the magnitude of the quantity |f (x) − Pn (x)|:


n f (n+1) (z) M
f (k) (a) n+1 |x − a|
n+1

f (x) − (x − a)k = (x − a)n+1 ≤ ,
k! (n + 1)! (n + 1)!
k=0

where Mn+1 is the maximum value of |f (n+1) (x)| on the interval with endpoints x and a. This quantity
represents the distance between the functional value f (x) and its approximation Pn (x). In practice, the
3.13 Taylor’s Theorem 159

actual value of f (x) is not known. The error term lets us know roughly how close Pn (x), which is easily
evaluated, is to f (x). As an example, suppose we need to find a Taylor polynomial centered at 0 that
approximates ex with an accuracy of 10−4 on the interval [−2, 2]. Since all of the derivatives of ex are ex ,
it is easy to verify that Mn+1 = e2 < 7.5 on the interval [−2, 2] for all n. To obtain the desired degree of
7.5 · 2n+1
accuracy, we must choose a positive integer n such that < 10−4 . Some trial and error reveals that
(n + 1)!
n = 11 has the desired property. In other words,
( 1 11 )
x 1 1
e − 1 + x + x2 + x3 + · · · + x < 10−4
2 6 11!

for all x ∈ [−2, 2]. To illustrate this numerically, we can compute P11 (2) ≈ 7.3890460 and compare it to the
known value of e2 ≈ 7.3890561. You can see that the approximation is actually much better than 10−4 .
Another way to get a sense for how the Taylor polynomials approximate a function is to graph the function
and its Taylor polynomials on the same set of axes. The figure below indicates how the approximation to ex
improves as the degree of the Taylor polynomial (centered at 0) increases.
y .. y ..
.
.......
... .. ... .
.......
... ..
y = ex .....
..
. ......
40 y = e x
.. 40 .
...
... .....
.
30 .. 30 .... y = P6 (x)
... ......
. ..
20
.
... y = P (x) 20
......
.. . 2 ..
10 .
...... ................. 10 .
.......
. .. ...
..................... .........
.................................................... ......
.......... x .
.......
...
...
...
...
...
...
...
...
...
...
. .
...
.. ......
.......... x
−2 −1 1 2 3 4 −2 −1 1 2 3 4

In order to evaluate quantities such as e2 and sin 2, calculators use some sort of approximation. Taylor poly-
nomials are just one way to obtain approximations; other techniques are discussed in the field of numerical
analysis.

Exercises
1. Supply the missing details in the proof of Taylor’s Formula.
2. Use the theory in this section to answer the following questions.
a) Determine the accuracy of the approximation x − x3 /6 for sin x on the interval [−1, 1].
b) Find a Taylor polynomial that approximates sin x to 10−5 on the interval [−3, 3].
c) Find a Taylor polynomial that approximates ex to 10−3 on the interval [−2, 2].
d) Determine the accuracy of the approximation 1 − x2 /2 + x4 /24 for cos x on the interval [0, 1.6].
√ √ √
3. Find√the fifth degree Taylor polynomial for x centered at 4. Use the polynomial to approximate 3, 4.2,
and 5. How accurate are your estimates?
√ √ √
4. Find√the fourth degree Taylor polynomial for 3 x centered at 8. Use the polynomial to approximate 3 6, 3 7,
and 3 7.6. How accurate are your estimates?

1 + x

5. Find the seventh degree Taylor polynomial centered at 0 for the function ln , then use this polynomial to
1 − x
approximate ln 2, ln 3, and ln 5.
160 Chapter 3 Sequences and Series

3.14 Supplementary Exercises

Remark. The following exercises involve ideas concerning sequences.


1. Find the limit of the sequence.
{ } { } {√ }
5n + 1 2n2 + 7
a) b) c) 4k2 + k − 2k
8n − 3 5n − 8n2
{√ } { } { (√ )}
k3 + k2 − k 7−1
3 k
d) e) k tan(π/4k) f) k
{ √ } {( )n } {( )k }
9k k 4 1 2k
g) h) 1+ i)
32k−1 + 5k 4n 2k + 5
{ }
n3
2. Find the limit of the sequence .
1 + 9 + 25 + · · · + (2n − 1)2
3. Find a sequence {xk } such that {xk } converges to 0 and {cos(π/xk )} converges to −1/2.
4. Let b1 = 4 and bn+1 = 1 + (bn /7) for each n ≥ 1. Prove that {bn } converges, then find its limit.
1 1 1
5. For each positive integer n, let xn = + + ··· + . Prove that the sequence {xn } converges.
n+1 n+2 5n
6. Define a sequence {cn } by c1 = 2 and cn = 1/(3 − cn−1 ) for n ≥ 2. Prove that {cn } converges and find its limit.
7. Let t be a positive real number and let s > 1. Prove that the sequence {nt /sn } converges to 0.
{ }
8. Let z be any real number and consider the sequence ⌊nz⌋/n , where ⌊x⌋ represents the greatest integer less
than or equal to x. Prove that this sequence converges and find its limit.
∫ n
1 1 1 dx
9. For each positive integer n, let an = 1 + √ + √ + · · · + √ − √ . Use the area interpretation of the
2 3 n 1 x
integral to show that the sequence {an } is decreasing and bounded.
Remark. The following exercises involve ideas concerning series.

∞ ∑

10. Let ak be an absolutely convergent series and suppose that {bk } is a bounded sequence. Prove that a k bk
k=1 k=1
is an absolutely convergent series.
∑∞ ∑

11. Find a value of r for which rk = 7/8 and a value of r for which rk = 7/8.
k=0 k=1


k k
12. Find the values of r for which the series r (1 + r ) converges and find the sum of the series for these values
k=0
of r. What is the minimum possible sum?
∑∞ ∑

13. Suppose that ak is a convergent series of positive numbers. Prove that sin ak converges.
k=1 k=1
14. Determine if the series converges absolutely, converges conditionally, or diverges.


4k + 7 ∑

(−1)k+1 ∑

sin(k2 )
a) b) c)
k2 + 2k + 2 10k − 3 k3
k=1 k=1 k=1


∞ k 3 ∑
∞ k+1 ∞ (
∑ )k2
(−1) k (−1) k
d) e) √ f)
4k5 + k 2 + 3 k
100 k+1
k=1 k=1 k=1



17k (k!)3 ∑

(−6)k+1 ∑

1 · 3 · 5 · · · · · (2k − 1)
g) h) i)
(3k)! (2k)! 1 · 4 · 7 · · · · · (3k − 2)
k=1 k=1 k=1


∞ ∑
∞ ∑

15. It is known that 1/k2 = π 2 /6. Use this fact to find the sums of 1/(2k − 1)2 and (−1)k+1 /k2 .
k=1 k=1 k=1
16. Arnie, Beth, and Charles are playing a game with a fair coin. Each person takes a turn (in the order listed)
flipping the coin. The first person to get a tails wins. Find the probability of winning for each person.
3.14 Supplementary Exercises 161

17. Danika, Kaden, and Yasmeen are playing a game with a fair six-sided die. Each person takes a turn rolling the
die. Danika wins if she rolls a 1, Kaden wins if she rolls a 4 or 6, and Yasmeen wins if she rolls a 2, 3, or 5. Find
the probability of winning for each person in each of the following cases.
a) The turns are taken in the order Danika, Kaden, and Yasmeen.
b) The turns are taken in the order Kaden, Danika, and Yasmeen.
c) The turns are taken in the order Yasmeen, Danika, and Kaden.


18. Let {ak } be a decreasing sequence that converges to 0 and consider the series ak .
k=1

∞ ∑
n
a) Let S be the sum of the series (−1)k+1 ak . Prove that (−1)k+1 ak − S < an+1 .
k=1 k=1
b) Suppose that the function f for which f (k) = ak for all k is continuous and decreasing on [1, ∞). Assume

∞ ∑
n ∫∞
that the series ak converges and let T be its sum. Prove that T − ak < n f (x) dx.
k=1 k=1

19. Use the previous exercise to find an integer n so that the nth partial sum of the series is within 10−6 of its sum.


(−1)k+1 ∑

(−1)k+1 ∑∞
(−1)k+1 ∑

(−1)k+1
a) b) c) √ d)
k2 k5 k ln(k + 1)
k=1 k=1 k=1 k=1



1 ∑

1 ∑

1 ∑

k
e) f) g) h)
k2 k5 k8 ek
k=1 k=1 k=1 k=1

Remark. The following exercises involve ideas concerning power series.


20. (Binomial Series) The Binomial Theorem gives a formula for the expansion of (1 + x)n when n is a positive
integer. This result can be extended to any real number r in the following way. In keeping with the notation
for binomial coefficients, let ( )
r r(r − 1)(r − 2) · · · (r − k + 1)
=
k k!

for all real numbers r and all positive integers k.


( )


r k
r
a) Show that the Maclaurin series for (1 + x) is given by 1 + x with radius of convergence 1.
k
k=1
( )


r k
b) Let f (x) = 1 + x for each x ∈ (−1, 1) and show that (1 + x)f ′ (x) = rf (x).
k
k=1
c) Solve the differential equation in part (b) to find the function f .

d) Write the Maclaurin series for 1/ 1 − x without the shorthand notation for binomial coefficients.
e) Use the result in (d) to find the Maclaurin series for arcsin x.
21. Express the value of each of the following integrals as infinite series.
∫ 1 ∫ 0.5 ∫ 1
2 sin x
a) e−x dx b) dx c) cos x2 dx
0 0 x 0


22. Suppose that the power series ck (x − 5)k converges when x = 1 and diverges when x = 12. Find the smallest
k=0
and largest possible values for the radius of convergence of this power series.


23. Suppose that the radius of convergence of ck xk is a finite number ρ. Find the radius of convergence of
k=1


3k
ck x .
k=1
24. Find Maclaurin series for each of the following functions. Hint: Begin with a geometric series.
∫ x
x x t
a) f (x) = b) g(x) = c) h(x) = dt
2−x (5 − x)2 0 1+t
6


∞ ∑

25. Find formulas for the sums of the power series k3 xk and k 4 xk .
k=1 k=1
26. Evaluate each limit by finding the first few terms of the Maclaurin series for the numerator and the denominator.
162 Chapter 3 Sequences and Series

3 2
ex − 1 1 − cos x + sin x2 ex − 1 − x2
a) lim b) lim c) lim
x→0 x − sin x x→0 e2x2 − 1 x→0 x(sin x − x)
27. Use results from this chapter to find the following sums.

∞ ∑
∞ ∑

a) k/5k b) k2 /4k c) k3 2k /3k
k=1 k=1 k=1

28. Use known Maclaurin series to determine in more familiar terms the function represented by the power series.


22k k ∑

(−1)k 32k 2k+1 ∑

(−5)k+1 2k+1
a) x b) x c) x
k! (2k + 1)! (2k)!
k=0 k=0 k=0

29. Use known Maclaurin series to determine the sum of the given series. Be careful!


(−1)k+1 52k ∑

(−1)k+1 ∑

(−1)k+1
a) b) c)
(2k + 1)! 3k (2k)! 2k (k − 1)!
k=0 k=1 k=2

(40)
30. For the given function, find f (0). Hint: Use a Maclaurin series.
a) f (x) = (1 + 2x)−1 b) f (x) = x(3 − x)−1 c) f (x) = (1 + 3x2 )−1
31. Power series can be multiplied and divided just like polynomials, and the results give valid power series repre-
sentations for the corresponding product and quotient. For example, using the Maclaurin series for ex and sin x,
we can find the first few terms of the Maclaurin series for the product ex sin x:
( )( )
ex sin x = 1 + x + 12 x2 + 16 x3 + 24 x + ···
1 4
x − 16 x3 + 120
1
x5 − 5040
1
x7 + · · ·
= x + x2 + 12 x3 + 16 x4 + x + ···
1 5
24
− 16 x3 − 1 4
6
x − 12 x + · · ·
1 5

1
120
x5 + · · ·
= x + x2 + 13 x3 − 1
30
x 5
+ ···.
Some care is required to guarantee that all of the terms of degree n have been obtained. Use this idea to find
the first five nonzero terms of the Maclaurin series for the given function.
a) ex cos x b) e−x sin x c) tan x


32. The purpose of this exercise is to find the sum of the series 1/n2 .
n=1
a) Use a reduction formula and mathematical induction to prove that
∫ π/2
4n (n!)2
sin2n+1 x dx =
0 (2n + 1)!
for each positive integer n.
b) Use the result in part (a) and a trigonometric substitution to show that
∫ 1 2n+1
x 4n (n!)2
√ dx =
0 1−x 2 (2n + 1)!
for each positive integer n.
c) Use the Maclaurin series for arcsin x (see part (e) of Exercise 20) and term by term integration to show that
∫ 1 ∑∞
π2 arcsin x 1
= √ dx = .
8 0 1 − x2 n=1
(2n − 1)2



d) Prove that 1/n2 = π 2 /6. Hint: Consider odd and even terms separately.
n=1
A
Algebra/Geometry Review


−b ± b2 − 4ac
quadratic formula: If ax2 + bx + c = 0, then x = .
2a
factoring formulas:
a2 − b2 = (a − b)(a + b)

a3 − b3 = (a − b)(a2 + ab + b2 )

a3 + b3 = (a + b)(a2 − ab + b2 )

a4 − b4 = (a − b)(a3 + a2 b + ab2 + b3 )

an − bn = (a − b)(an−1 + an−2 b + an−3 b2 + · · · + abn−2 + bn−1 )


factorial notation: If n is a positive integer, then n! = n · (n − 1) · · · · · 2 · 1. Also 0! = 1.

Binomial Theorem: a formula for powers of (a + b)n ; for (a − b)n , use (a + (−b))n
(a + b)0 = 1 1 Pascal’s Triangle
(a + b)1 = a + b 1 1
2 2 2
(a + b) = a + 2ab + b 1 2 1
(a + b)3 = a3 + 3a2 b + 3ab2 + b3 1 3 3 1
4 4 3 2 2 3 4
(a + b) = a + 4a b + 6a b + 4ab + b 1 4 6 4 1
(a + b)5 = a5 + 5a4 b + 10a3 b2 + 10a2 b3 + 5ab4 + b5 1 5 10 10 5 1
∑n ( ) ( )
n n−k k n n!
(a + b)n = a b , where =
k k k!(n − k)!
k=0
n(n − 1) n−2 2
= an + nan−1 b + a b + · · · + nabn−1 + bn
2

163
164 Appendix A Algebra/Geometry Review

( a )2 a 2
completing the square: x2 + ax = x + −
2 4
x3 − 4x2 + 2x − 3 7
long division of polynomials: = x2 − 2x − 2 − since
x−2 x−2
x2 − 2x − 2
x−2 x3 − 4x2 + 2x − 3
x3 − 2x2
−2x2 + 2x − 3
−2x2 + 4x
−2x − 3
−2x + 4
−7

multiplying by the conjugate: use the fact that (a + b)(a − b) = a2 − b2 has no middle term
√ √ √
x2 + 4x − x x2 + 4x − x x2 + 4x + x (x2 + 4x) − x2 4x
= ·√ = (√ )= (√ )
x+1 x+1 2
x + 4x + x 2
(x + 1) x + 4x + x (x + 1) x2 + 4x + x
adding fractions: first find a common denominator
x 5 x(x − 1) 5(2x + 3) x(x − 1) + 5(2x + 3) x2 + 9x + 15
+ = + = =
2x + 3 x − 1 (x − 1)(2x + 3) (x − 1)(2x + 3) (x − 1)(2x + 3) (x − 1)(2x + 3)
(a + a b + b )
1 2 1 2
midpoint of line segment from (a1 , b1 ) to (a2 , b2 ) is ,
2 2

distance between points (a1 , b1 ) and (a2 , b2 ) is (a2 − a1 )2 + (b2 − b1 )2

equation of circle with radius r and center (h, k) is (x − h)2 + (y − k)2 = r2

Two Dimensional Figures

triangle: base b, height h, area = 12 bh



3 2
equilateral triangle: side s, area = 4 s

similar triangles: Two triangles are similar if they have equal angles. Corresponding sides are in proportion.
trapezoid: bases B and b, height h, area = 12 (B + b)h
circle: radius r, circumference = 2πr, area = πr2
circular sector: radius r, angle θ (radians): arc length = θr, area = 12 θr2

Three Dimensional Figures

sphere: radius r, surface area= 4πr2 , volume = 34 πr3


right circular cylinder: radius r, height h, lateral surface area = 2πrh, volume = πr2 h

right circular cone: radius r, height h, lateral surface area = πr r2 + h2 , volume = 13 πr2 h

trigonometric identities: See Section 1.16.


properties of exponents and logarithms: See Section 1.19.
B
Table of Integrals

basic formulas

1
1. ur du = ur+1 , r ̸= −1
r+1

du
2. = ln |u|
u

3. eu du = eu

4. sin u du = − cos u

5. cos u du = sin u

6. sec2 u du = tan u

7. csc2 u du = − cot u

8. sec u tan u du = sec u

9. csc u cot u du = − csc u

du
10. √ = arcsin(u/a), where a > 0
a2 − u2

du 1
11. = arctan(u/a), where a > 0
a2 + u2 a

du 1
12. √ = arcsec(u/a), assuming that u is a positive function and a > 0
u u2 − a2 a

165
166 Appendix B Table of Integrals

forms involving a2 − u2 , where a > 0


∫ √ ∫ √
u√ 2 a2 1
13. a − u du =
2 2 a −u + 2 arcsin(u/a) 14. u a2 − u2 du = − (a2 − u2 )3/2
2 2 3
∫ √ ( ) √ 4
u a
15. u2 a2 − u2 du = 2u2 − a2 a2 − u2 + arcsin(u/a)
8 8
∫ ∫ √
1 u
16. √ du = arcsin(u/a) 17. √ du = − a2 − u2
a2 − u2 a2 − u2

u 2
u√ 2 a2
18. √ du = − a − u2 + arcsin(u/a)
a −u
2 2 2 2
∫ √ 2 √ √ ∫ √ 2 √
a − u2 a + a2 − u2 a − u2 a2 − u2
19. du = a2 − u2 − a ln
20. 2
du = − − arcsin(u/a)
u u u u
∫ √ ∫ √
1 1 a + a2 − u2 1 a2 − u2
21. √ du = − ln 22. √ du = −
u a −u
2 2 a u u a −u
2 2 2 a2 u
∫ √
u 3a4
23. (a2 − u2 )3/2 du = − (2u2 − 5a2 ) a2 − u2 + arcsin(u/a)
8 8
∫ ∫
1 1 1 u + a
24. u(a2 − u2 )3/2 du = − (a2 − u2 )5/2 25. du = ln
5 a2 − u2 2a u − a
∫ ∫
1 u u 1
26. du = √ 27. du = √
(a2 − u2 )3/2 a2 a2 − u2 (a2 − u2 )3/2 a2 − u2

forms involving a2 + u2 , where a > 0


∫ √ ∫ √
u√ 2 a2 √ 1
28. a2 + u2 du = a + u2 + ln u + a2 + u2 29. u a2 + u2 du = (a2 + u2 )3/2
2 2 3
∫ √ ( ) √ 4 √
u a
30. u2 a2 + u2 du = 2u2 + a2 a2 + u2 − ln u + a2 + u2
8 8
∫ √ ∫ √
1 u
31. √ du = ln u + a2 + u2 32. √ du = a2 + u2
a2 + u2 a2 + u2

u 2
u√ 2 a 2

33. √ du = a + u2 − ln u + a2 + u2
2
a +u 2 2 2
∫ √ 2 √ √ ∫ √ 2 √ √
a + u2 a + a2 + u2 a + u2 a2 + u2
34. du = a2 + u2 − a ln
35. 2
du = − + ln u + a2 + u2
u u u u
∫ √ ∫ √
1 1 a + a2 + u2 1 a2 + u2
36. √ du = − ln 37. √ du = −
2
u a +u 2 a u 2
u a +u2 2 a2 u
∫ √ √
u 3a4
38. (a2 + u2 )3/2 du = (2u2 + 5a2 ) a2 + u2 + ln u + a2 + u2
8 8
∫ ∫
1 1 1
39. u(a2 + u2 )3/2 du = (a2 + u2 )5/2 40. du = arctan(u/a)
5 a2 + u2 a
∫ ∫
1 u u 1
41. du = √ 42. du = − √
(a2 + u2 )3/2 a2 a2 + u2 (a2 + u2 )3/2 a2 + u2
Appendix B Table of Integrals 167

forms involving u2 − a2 , where a > 0


∫ √
u√ 2 a2 √
43. u2 − a2 du = u − a2 − ln u + u2 − a2
2 2
∫ √
1
44. u u2 − a2 du = (u2 − a2 )3/2
3

√ u( 2 )√ a4 √
45. u2 u2 − a2 du = 2u − a2 u2 − a2 − ln u + u2 − a2
8 8
∫ √
1
46. √ du = ln u + u2 − a2
u2 − a2
∫ √
u
47. √ du = u2 − a2
u2 − a2

u2 u√ 2 a2 √
48. √ du = u − a2 + ln u + u2 − a2
u2 − a2 2 2
∫ √ 2 √
u − a2
49. du = u2 − a2 − a arcsec(u/a), assuming that u is a positive function
u
∫ √ 2 √ √
u − a2 u2 − a2
50. 2
du = − + ln u + u2 − a2
u u

1
51. √ du = a arcsec(u/a), assuming that u is a positive function
u u2 − a2
∫ √
1 u2 − a2
52. √ du =
u2 u2 − a2 a2 u
∫ √ √
u 3a4
53. (u2 − a2 )3/2 du = (2u2 − 5a2 ) u2 − a2 + ln u + u2 − a2
8 8

1
54. u(u2 − a2 )3/2 du = (u2 − a2 )5/2
5

1 1 u − a
55. du = ln
u2 − a2 2a u + a

1 u
56. du = − √
(u2 − a2 )3/2 a2 u2 − a2

u 1
57. du = − √
(u2 − a2 )3/2 u2 − a2
168 Appendix B Table of Integrals

forms involving trigonometric, exponential, and logarithmic functions



58. tan u du = ln | sec u|

59. cot u du = ln | sin u|

60. sec u du = ln | sec u + tan u|

61. csc u du = ln | csc u − cot u|
∫ ∫
1 n−1
62. sinn u du = − sinn−1 u cos u + sinn−2 u du
n n
∫ ∫
1 n−1
63. cosn u du = cosn−1 u sin u + cosn−2 u du
n n
∫ ∫
1
64. tann u du = tann−1 u − tann−2 u du
n−1
∫ ∫
1 n−2
65. secn u du = secn−2 u tan u + secn−2 u du
n−1 n−1

sin((a − b)u) sin((a + b)u)
66. sin au sin bu du = −
2(a − b) 2(a + b)

cos((a − b)u) cos((a + b)u)
67. sin au cos bu du = − −
2(a − b) 2(a + b)

sin((a − b)u) sin((a + b)u)
68. cos au cos bu du = +
2(a − b) 2(a + b)
∫ √
69. arcsin u du = u arcsin u + 1 − u2

1
70. arctan u du = u arctan u − ln 1 + u2
2

eau ( )
71. eau sin bu du = a sin bu − b cos bu
+ba2
2
∫ au ( )
e
72. eau cos bu du = 2 b sin bu + a cos bu
a + b2
∫ ∫
1 n au n
73. u e du = u e −
n au
un−1 eau du
a a
∫ ∫
74. (ln u)n du = u(ln u)n − n (ln u)n−1 du
C
Answers to Exercises

C.1 Chapter 1 Answers

Section 1.1: 1. 2y − 3x = 7 2. 2x + 3y = 0, 3x − 2y = 13 3. 2x + 3y = 17 4. m = −4/3, x = 6, y = 8,


d = 4.8 5. (2, 1) 6. A(m) = −(3 − 2m)2 /(2m) 7. y = x/4 8. (0, 1), (7/25, −24/25)
Section 1.2: 1. 0.571428 571428 . . . 2. 19/99 3. A ⊆ B, A = B, B ⊆ A 4. {x ∈ R : x ̸= 5}, [−1, 3], (−2, 2)

7. 5, −10, −2/3, 10, 7, −2 8. 4x2 − 4x − 3, 2x2 + 8x − 3, 4x√− 9 9. √ f (x) = x, g(x) = x4 + 4x2 + 1;
f (x) = sin x, g(x) = 4x; f (x) = x2 , g(x) = cos x 10. 1, −2 + 2, −2 − 2 11. (−5, 5), (2, 12) 12. 2x − h 13.

3 P 2 /24 14. [−4, 16], (−5, 1), (−6.5, 11.5)
Section 1.3: 1. −2, 4/3, 1/3 2. 0.500, −0.500, 2.718, 1.000, 0.434, 1.958 3. 0.693, 1.099, 1.386, 1.609 4. −1,
1, 0, −1, 1, 0 6. 2, 1, 1, 2, 0, 1, 2, 1 7. −8, 180, 7/4
√ √
Section 1.4: 1. 5, −1; 0, − 6, 6; 0 3. −4, 1/3, −1 or 3/2 6. [5, 6]
Section 1.5: 1. 4/7, 1/4, 2/3, 4, 1/4, 8, −16, 20/3, −2/27 2. 3/2, 11/4 3. 0, 0, 2 4. 1, 7, 2/3, 3/5, 0, 2
Section 1.6: 1. 6, 3, −1/25, −1, 1/9, 1 2. c/2 3. y − 1 = 4(x − 1), y − 1 = (1 − x)/4 4. 3c2 5. 3c2 + 4c,
−4/c2 , −5/c3/2 6. 0, −4/3 7. (2, 2), (6, 2/3) 8. 10 ft/sec

Section 1.7: 1. 2x, 3x2 , 4x3 , 6x − 4, 1/2 x, 2 − 5/x2 2. y − 9 = 6(x − 3), y − 9 = (3 − x)/6 3. 4/3 4. −1/3,
3 8. f (x) = |2x − 3| 10. {n/2 : n ∈ Z}, 1 or −1
Section 1.8: 1. f ′ (x) = 4, g ′ (x) = 6x + 13, h′ (x) = −6x2 + 6x − 5, F ′ (t) = 15t4 − 8t3 + 1, G′ (z) = 2z 3 + z 2 − 2z,
H ′ (u) = u5 + u3 − 61 u, dx
dy
= 100x99 + 100, ds = 0.05t4 + 0.6t2 − 4, dw
dz
= 60w29 − 31 w10 2. y − 3 = 5(1 − x),
√ √ dt
y − 3 = (x − 1)/5 3. 0 4. 3,− 3 5. 58 m/sec 6. 105.89 ft/min 7. 36,20; 12, 3; 6, 7; 9, 16; 3, 1/5; 27,
−18
1 2 28 5 24 2t + 5 z 3 + 4z − 12
Section 1.9: 1. f ′ (x) = √ − √ , g ′ (x) = − 5 , h′ (x) = − 2 + 3 , F ′ (t) = √ , G′ (z) = ,
4 3 3
x2 3x x x 8 t3 z3
√ ( √ ) dy2 x 4 dz ds 7 1
H ′ (s) = 3 s 2 s − 1 , = 6x + 2 , = 8w3 − 9w2 + 12w − 5, = − 2 + 3/2 2. y = 2x + 4 3. 2k
dx x dw dt t 5t
4. (1, 1), (25, 5) 5. 1/9
2x3 + 3x
Section 1.10: 1. f ′ (x) = 5(4x − 3)(2x2 − 3x + 1)4 , g ′ (x) = 12(x2 + 2)(x3 + 6x)3 , h′ (x) = √ ,
x4+ 3x2 + 15
z2 + 2
F ′ (t) = 3(3t2 − 6t + 1)(t3 − 3t2 + t)2 , G′ (z) = , H ′ (s) = 4(2s + 5)(3s2 + 15s − 8)1/3 ,
(z 3
+ 6z)2/3
dy 24(3x − 2) dz 6w ds 15t(t + 2)
=− , =− 2 , =− 3
dx (3x2 − 4x +√6)4 dw (w + 6)3/2 dt (t + 3t2 )2
2x x2 + 2x + 3 + x + 1 √
2. s′ (x) = √ √ √ 3. 2x + 3y = 13 4. (−4, 2 2) 5. −12, 15, 18 6. 1665
2 x2 + 2x + 3 x2 + x2 + 2x + 3

169
170 Appendix C Answers to Exercises

3x + 4 −7 x+5 8(5 − x2 )
Section 1.11: 1. f ′ (x) = √ , g ′ (x) = , h′ (x) = , u′ (x) = √ ,
2 x+2 (2x − 1) 2 (2x + 5) 3/2
10 − x2
−2(3x + x − 11) dy
2 2
x(x + 8) ds 8(t − 1)3
v ′ (x) = (14x + 5)(x + 1)2 (2x − 1)3 , w′ (x) = , = , = ,
(x2 + 2x + 4)2 dx (4 + x2 )3/2 dt (t + 1)5
dz x(4 − 7x)(x − 2) 4
= ( )2/3 2. 2, −1/3, 1 3. y = 1 + 5(x − 4)/16 4. 4.5 s 5. x − 4y = −9, x − 4y = −1;
dx
3 4 + x2 (2 − x)5
(−2, 2), (−2/3, −2) 6. −7, −17/16, −144

Section 1.12: 1. M √ = 20,
√ m = 11;
√ M = 47, m = 20; M = 232, m = 7; M = 83/5, m = 16/3; M = 9/4, m = 2
3. 2.5, 7.5; 10 − 1/ 3, 1/ 3 4. 7/2
Section 1.13: 1. I on (−∞, 0], [3, ∞); D on √ [0, 3] 2a. I on√ (−∞, 0], [4, ∞);√D on [0, 4] 2b.
√ I on [0, ∞); D on (−∞, √0]
2c. √I on [4, ∞); D on [0,
√ 4] 2d. I on (−∞,
√ 10], [−1, 1], [ 10, ∞); D on [− 10, −1], [1, 10] 2e. I on (−∞, −1/ 2],
[1/ 2, ∞); D on [−1/ 2, 0), (0, 1/ 2] 2f. I on (−∞, 0], [8, ∞); D on [0, 8] 2g. I on [0, 3]; D on √[3, 6] 2h. √ I on
(−∞, −1], √ [3, ∞);√D on [−1, 1), (1, 3] 2i. I on (−∞, −4], [1/2, ∞); D on [−4, 1/2]√2j. I on (−∞, −a/ 3], [a/ √ 3, ∞);
D on [−a/ 3, a/ 3] 2k. I on (−∞, −a], [a, ∞); D on [−a, 0), (0, a] 2l. I on [a/ 2, ∞); D on (−∞, 0), (0, a/ 3 2]
3

Section 1.14:
√ 1a. rel max at 0, √ rel min at 4 1b. rel min
√ at 0, abs min√ is 0 1c. rel min at 4, abs min is −4 1d. rel
max at − 10, 1, rel min at −1, 10 1e. rel max at − 3, rel min at 3 1f. rel max at 8, rel min at 0 1g. rel max
at −a, rel min√at a 1h. rel √ min at −a,√a, abs min is 2a2 1i. rel min at −a, rel max at a, abs min is −1/2a, abs max
is 1/2a 2. 2 10 3. 1.5 20 4. 2 3 5. −2x3 + 9x2 + 60x
3


Section 1.15: 1. 4 30 m of fence √
2 3
2. 31250 ft2 √ ft × 116.34
3. 85.95 √ ft 4. 9.746 √ in × 7.309 √ in 5. 432
cm 6. 33.41
√ √ft , $240.64 7. r = 3
150, √
h = 2r 8. 32 3/9 √ 9. 3s2 /8 10. 3 3r2 /4, all 3r, 0.4135
11. 4πr3 /3 3, 2, 0.57735 12. 32πr3 /81, 2, 0.2963 13. 2 181 14. 24 min 58 s 15. bx + ay = 2ab 16.

d/(1 + 3 a)
√ √ √
Section 1.16:√ 1. sin(7π/6) = −1/2, cos(7π/6) = − 3/2 2. sin x = 2/3, cos x = 5/3 3. sin x = 4/ 17,
cos x = −1/ 17 4. 7.39, 9.46 5. −5π/3, −π/3, π/3, 5π/3, 7π/3, 11π/3 6. 0.7754, 2.3662, 7.0586, 8.6494

7. 0.3978, 1.1730, 3.5394, 4.3146 8. π/6, 5π/6, 13π/6, 17π/6 √ 9. −π/4,
√ 3π/4, 7π/4 12. 13.57 13. 22.0 ,
69.6◦ , 88.4◦ 17. π/4, −π/6, 2π/3, π/6, 3π/4, −π/3 18. x/ 1 − x2 , x/ 1 + x2 , 1 − 2x2
Section 1.17: 1a. f ′ (x) = cos x + 6 sin 3x 1b. g ′ (x) = 2x(sin 2x + x cos 2x) 1c. h′ (x) = −6 sin 2x cos2 2x
dy −(1 + 3 sin x) ds dw
1d. = 1e. = sin t(2 cos2 t − sin2 t) 1f. = 1 − 8 sec 4z tan 4z 1g. u′ (x) = cos3 x
dx (3 + sin x)2 dt dz
−2 sin x + cot2 x(2 csc x − sin x)
1h. v ′ (θ) = 10 tan 5θ sec2 5θ 1i. w′ (x) = 1j. F ′ (t) = 2t sin(1/t) − cos(1/t)
(2 + cot2 x)2 √
1k. G′ (t) = 20 sec2 4t 1l. H ′ (x) = 48x cos(2x2 ) sin3 (2x2 ) 4. 2/r, r, r/7 5. y + 1/2 = 4 3(x − π/6) 6. −π/3,
x(1 + 2x2 cos2 5x − 5x3 sin 5x cos 5x)
π/3, 5π/3, 7π/3, 11π/3 7. a tan a = 1 8. f ′ (x) = √
1 + x2 + x4 cos2 5x

√ ′ ′

Section 1.18:
√ 1a. f (x) = 1/ 4 − x2 1b. g (t) = 4/(16 + t2 ) 1c. h √(x) = 3/ x − 9x √1d. dy/dx =
4 2

2 arcsin x/ 1 − x2 1e. ds/dt = 1 + 2t arctan t 1f. dr/dθ = arccos θ 3. 2, − 3 4. π − 2, −(π + 6 3)/6 √ √


√ 5/4, −1 6. I on [0, π/2], [7π/6, 3π/2], [11π/6, 2π]; D on [π/2, 7π/6], [3π/2, 11π/6] 7. π/3 8. 4 3, 9 − 60,
5.
a(a + b)
Section 1.19: 1. −2, −4, 5, 3/2, 2, −1, 1/2 2. r + s, s + t − r, −2s, 2r + 3t, 3r + s, s + t, 2s − r − t, s − 4t
3. 5e + 2 4. 2.322, 3.855, 9.486, 109.551, 7.885, ±1.317 5. 42◦ , 65.7◦ , 9.986 min 6. 25, ln 2 7. 36.75 ft, 33.099

miles 8. 10 x, 1010 11. ea , ea , e
2
Section 1.20: 1a. f ′ (x) = ex −4e−2x 1b. g ′ (x) = 5+8xe−x 1c. h′ (x) = −xe−x 1d. F ′ (x) = 2/(2x+1) 1e. G′ (x) =
2 cos 2x esin 2x 1f. H ′ (x) = 3ex/2 sec2 (ex/2 ) 1g. dy/dx = 2e−x sin(e−x ) cos(e−x ) 1h. ds/dt = e−2t (3 cos 3t − 2 sin 3t)
1i. dw/dz = 4(ln z)3 /z 1j. u′ (x) = (2x + 4)/(x2 + 4x + 2) 1k. v ′ (x) = 1 + ln x
ds et (1 − e2t ) dw z 2 + ln z
1l. w′ (x) = x−4 (1 − 3 ln x) 1m. dy/dx = 3x2 + (ln 3)3x 1n. = 2t 2
1o. = √
dt (1 + e ) dz z z 2 + (ln z)2

2. −0.0445 m/sec 2
3. 3/ 4 4. a/e 5. 0, 2/e 6a. I on (−∞, −1], [0, 1]; D on [−1, 0], [1, ∞); abs max at ±1,
3
√ √ √
abs min at 0 6b. I on (0, e]; D on [ e, ∞); abs max at e 6c. I on [1, e2 ]; D on (0, 1), [e2 , ∞); abs min at 1, rel
2 √
max at e 7. (3, 2e) 9. 1/ e e 10. 1/(ln a)x
Section 1.21: 1a. f (x) = x3 + 1 1b. g(x) = 2 ln x + 5 1c. h(x) = 2(x3/2 + 1)/3 1d. F (x) = x√3
− 4x2 + 1
1e. G(x) = 5 x + 2 x − 2 x + x + 5 1f. u(t) = 2e + 10 1g. v(t) = 9 − 4 cos(2t) 1h. w(t) = t2 + 1 + 1
1 5 1 4 1 2 9 2t

2a. f (t) = 8e−2t 2b. g(t) = 5e4t 2c. h(t) = 2e5(t−1) 2d. F (x) = 12ex − 2 2e. G(x) = 4 + 26e−x
2f. H(x) = 47e3x − 1 3a. A(t) = 24e2t − 4 3b. B(t) = 3 − e−5t 3c. S(t) = 240 − 200e−t/3 4. sin 3x, cos 3x
Section 1.22: 1. 189.84 hr 2. 54280 3. 11.6 min 4. 39.23 min 5. 0.0322 lb/gal 6. 22.91 min 7. 48.56
days 8. 5.35 min 9. 5.94 yr, $14,267; 15.27 yr, $18,326 10. 6.64 yr, $15,942; 25.58 yr, $30,701
11. 11.55 yr, $27,726; never
C.2 Chapter 2 Answers 171

Section 1.23: 1. 0.0124, 0.0199, 0.0041, 0.3214, 0.1120, 0.0053 2. 0.002498, 0.000999, 0.000074 3. 0.2360,
0.0249, 0.0049 4. 19.875 < s < 20.124 5. 2.104 < r < 2.139
Section 1.24: 1. 1/3, −2/5, −1/3, −3/2, 0, 0, 2, 2, 1/3 2. ∞, ∞, −∞, ∞, −∞, ∞, −∞, ∞, ∞ 5. x = −3/2,
y = 2; x = 2, x = 3, y = 2; x = ln 2, y = 0, y = 1 6. x = 0, x = −1; x = 2; x = 1/2 7. ∞, −∞, ∞
Section 1.25: 1. 1, 40/7, −47, 1, 1/2, 4, 2, 0, 0 2. 3, 0, 2, e2 , e2 , e2 3. r, 0, er , 0, 0, er 4. 16a/9
√ √ √
Section 1.26: 1. 4, 16/3, 5/2, 3 2√− 2, (4 − π)/π, (1 + 7)/3 4. (2a + b)/3 5. (sa + rb)/(r + s)
6. (a + b)/2, arithmetic mean 7. ab, geometric mean
Section 1.27: 2. f (x) = 13 x3 + 12 x2 + x + C, g(x) = 1
2
ln(x2 + 4) + C, h(x) = −4 cos 2x + C
Section 1.28: 1. f ′′ (x) = 10(38x2 +38x+11)(x2 +x+1)8 , g ′′ (x) = 4(x2 +4)−3/2 , h′′ (x) = 3 sin x(2 cos2 x−sin2 x)
2. 2n e2x , (x + n)ex , (−1)n n!x−n−1 3. − cos x 4a. CU on (−∞, 1/3], CD on [1/3, ∞), IP at (1/3, 2/27)
4b. CU on (−∞, 0), √ [1, ∞),√CD on (0, 1], IP at√
(1, 0) √4c. CU on (−∞, √ 1], [2, ∞),
√ CD on [1, 2], IP at (1, 11), (2, 24)
4d. CU on (−∞, − 3/3], [ 3/3, ∞), CD on [− 3/3, √ 3/3],
√ IP at (− 3/3,√6),√ ( 3/3, 6) 4e.√CU on [e3/2√ , ∞), CD on
√ √
(0, e3/2 ], IP at (e3/2 , 3e−3/2 /2) 4f. CU on (−∞, − 2], [ 2, ∞), CD on [− 2, 2], IP at (− 2, 1/ e), ( 2, 1/ e)
5. 33 3 % 7. x − 6x + 4x + 12
1 3 2

Section 1.29: 1. increasing 2. 5/6, 11/216 3. 25t2 + 10t + 20, 13 t3 + 21 t2 − 7t + 4, 4(t + 4)3/2 − 10t − 44,
2t4 + 10t 4. 534.8 ft, 11.6 s 5. 81.8 mph 6. 576 ft 7. 9.5 s, 207.7 mph 8. 5 s, 61.5 m 9. 0.54 √ s 10.
385.7 mph 11. 4.4 m/s, 26.5 m/s, 1.0 m 12. 5.4 s, 41.4 m/s 13. 121 ft 14. 3.884 m/sec 15. 160 10 ft
Section 1.30: 1a. c(x) = 1 − (x − 1) + (x − 1)2 − (x − 1)3 1b. c(x) = 2 + 14 (x − 4) − 64 1
(x − 4)2 + 512
1
(x − 4)3 1c.
1
c(x) = 2+ 12 1
(x−8)− 288 (x−8)2 + 20736
5
(x−8)3 1d. c(x) = 1+x+ 12 x2 + 61 x3 1e. c(x) = (x−1)− 12 (x−1)2 + 13 (x−1)3
√ √
1f. c(x) = x − 31 x3 2. 10 e ≈ 1.1, 1.105, 1.105167, e ≈ 32 , 13 , 79
8 48
899
3. 14 1080 1
4. 6 27 1
6. 4, 4 96
Section 1.31: 1. 1.364656, 0.567143, 1.895494, 3.691203, 4.493409, 5.749031 2b. x2 = 577/408, 97/56, 161/72,
49/20 3. x2 = 91/72, 331/225 5. 2.21, 2.26
Section 1.32: 2a. (2x − 4y − 1)/(4x − 2y), 2b. (3x2 − 2)/(1 − 3y 2 ), 2c.−(4x3 + 6xy)/(3x2 + 3y 2 ),
2d. x(9 − 2x2 − 2y 2 )/y(9 + 2x2 + 2y 2 ), 2e. (3x2 + y)/(ey √ − x − 2y), 2f. (y +√1)(y 2 + y − x2 )/x((y + 1)2 − x2 )
3. 13(y − 2) = 6(1 − x) 4a. 4/5, 5/4, −1 4b. (2, (−1 − 33)/2), (2, (−1 + 33)/2) 5.(4x + 3y = 24
√ √ √ )
6. y = −2x + 4, y = 4(9 − 8x)/27 7. 3x + 2y = 5 8. (±2, ±2 2) (four points) 10. 1/ 3, 2/ 3
√ √
Section 1.33: 1. 120π cm2 /sec 2. 2 3 cm2 /sec 3. 1/18π ft/min 4. 371.806 km/hr 5. 1/25 rad/min, 4 5
m/min 6. 0.311 m/min 7. −0.1717 in/min, 0.1069 in/min 8. −3000 cm3 /min 9. 40.055 km/min 10. 1
in/min
√ √ √
Section
√ 1.34: 1a. 2, 2 5/25 1b. 0, 3 10/50 1c. 0, 12 17/289 √ 8, 44/5773/2 1e. −2, 2e2 /(e2 + 4)3/2 1f. −1,
1d. √
− 2/2 3. (−119/2, 53/6) 4. (−4a , 3a + 1/2) 5. 1/ 8x 6. 2 3/9
3 2
√ √
Section 1.35: 1. x + 2y = 10 2. 3x + 5y = 34 3. 1025 4. x = 11/4, y = 4 − 10 5. 10/2 6. (3, −1)
7. (x − 0.9) + (y − 1.9) = 4.42 8. 2, 1, 1, 4, 1, 1, 3, 4 9. 1, 2, 1/4 10. {−2, 2}, {(2n − 1)π : n ∈ Z}, {1}
2 2

13. 51 , 25 , 43 , 6, − 16
1
, −2 14. a = 1, 12 15. y = 3x + 1 16. 4x − y = 3, x + 4y = 5 17. (0, 0), (6, 216) 18. 24
√ √ √ √

19. f (x) = 2/ x − 3/ x4 , g ′ (x) = −18/5x3 , h′ (x) = 1/2 x + 6/x2 20. x + y = 4, y = x 21. (5, 4 5)
4 3 5
(√ )3
22. (216, 6) 23. f ′ (x) = 15(3x − 1)4 , g ′ (x) = −5x/ 5x2 + 4 , h′ (x) = 8(2x + 1)(x2 + x + 1)7 24. −1, 4/7

25. f ′ (x) = 2(x2 + 1)/ x2 + 2, g ′ (x) = 11/(3x + 4)2 , h′ (x) = 2x(14x − 15)(4x − 15)4 26. −7/2, 13/3, −9/16 27.
√ √ √
3y − 2x = 4 28. M = 15, m = 6; M = 114, m = 2; M = 65 , m = 4; M = 6a3 , m = − 2 9 3 a3 ; M = 12 , m = 10 3
;
√ √ 2
4 √
M = (a + b)/a, √ m = 2 b/a 29. 18 2 π √ 30. 4π r h/27, 4/9, h/2r √31. 4L/(4 + π) for square, 4L/(4
√ + 3 3) for
square,
√ πL/(3 3 + π) for circle, √ 4L/(4 + 2 3) for square 32. x = 1/ 3 33. |Ax0 + 2
√ By0 + C|/ A + B 21 1
2

34. (a + b)2 + d2 35. x = 2 b/3a, y = 4b/3 36. 42.56 ft 37. 79.7 39. 3 3/2 40. ln a 41. 4 , 2 ,
√ √ √ √
2/π 2 , 13 , 29 , 0 42. (±a/ 3, 3/4); (±a/ 2, 1/ e); (a, 0) 43. 10.2 mph, 0.94 s 44. 89.6 ft 45. 1280 5/3 ft
√( )
√ √
46. 2, 28/1453/2 47. 7 6 28/9 50. y = 3x5 − 8x4 + 6x3 51. ± 9b2 + 5 − 2b /15

C.2 Chapter 2 Answers


200 ∑
29 ∑
7 ∑
n+2 ∑
n
Section 2.1: 1. (5i − 3), i
i+1
, 3i 2. (4i − 2), (4i + 6) 3. 12168, 3270, 54, 70499, 2n2 ,
i=1 i=1 i=0 i=1 i=−1
1
3
m(m + 1)(m + 2) 4. (n + 1)6 − 1, 2n/(2n + 1), a1 + a2 − an+1 − an+2 5. n(n + 3), 1
2
(n + 2)(3n + 5),
1
2
n(6n 2
− 3n − 1) 1
6. 12
1 3

Section 2.2: 1. 576, 316
3
, 91
3
2. 12
a 4. 32
3
, 156, 1056
5
5. 13 9
2
, 4 π, 5+ 9
2
arcsin 2
3
6. 1/ ln 2 7. 16
3
, 12
172 Appendix C Answers to Exercises

( ( )3 ( )) ( )
∑n ∑n ∑n ∫7 ∫2
Section 2.3: 1. lim 4 1+ 2i
n
+ 1+ 2i
n
2
, lim
n n→∞
sin iπ
n
π
, lim
n n→∞
3
2n+3i
2. 4
x3 dx, 0
(5x2 +x) dx,
n→∞ i=1
∫ 2 −1 i=1 i=1

1
x dx 3. 23 4. eb − 1
Section 2.4: 2. 420, 165, 15 142
2
−27, − 83
, 3 , 8
3. 14, 12 (1 + π)r2 , 78 π 4. 73 , 1 (1 − π), −16 5. 4, 6, 17
15 2 2
6. 73 t, 32 x2
√√ √
Section 2.5: 1. 31 , 1; 37 , 35 ; 4 6, 4 3 15 2. 1st , 1st , 2nd 4. −18, −5, 71
3
3
5. 83 7. 7 8. f ′ (x) = x3 − 2x2 + 5
√ √ 3 √
Section 2.6: 1. f ′ (x) = x2 + 9, g ′ (x) = 4/(2 − x), h′ (t) = 1/ 3 t4 + 9, F ′ (x) = e−x − 1, G′ (u) = 2u3 u6 + 2,
4 √ √
H ′ (x) = ex sin e2x∫, u′ (x) = 2xex , v ′ (x) = 2 cos 8x3 − cos x3 , w′ (x) = ex e3x + 4 − 2x x6 + 4 2. − π4 3. 12
3 x 4 1 1
4. (−∞, 2 ) 5. 2 sin t dt 6. 2 , 0, 1, 1, 5, 3 7. G(x) = 2 + 2 x|x|
√ √
Section 2.7: 1. 20, 1, 2 − 1, 85 6
, 38 , 53
12
− 34 2, ln 2, 105 2
, e2 − 1, 52 ln 3, 16 π, π 2. π 3. ln b
(√ )
Section 2.8: 1. You should check these yourself. 2. 256 15
, 41 ln 5, − 20
9
, π1 , 2 − 1 a, 32π √
3
3. 13 + π2
√ √ √
Section 2.9: 1a. 20 1
(x4 + 6)5 , 1b. 14 4x2 + 1, 1c. 2 sin t, 1d. − 14 (x2 + 4x − 1)−2 , 1e. 23 (x − 4) x + ( 2, )

1
1f. 15 (3t − 16)(2t + 1)3/2 , 1g. 3 ln x2 + 2x − 3 , 1h. − 2(t21+8) , 1i. − 13 (x2 + 2a2 ) a2 − x2 , 1j. 2 arcsin et/2 ,
( ) (√
√ ) √
1k. 12 arctan x2 , 1l. 14 1−2x1
+ ln 2x − 1 2. 25 , 2, 19 , π2 , 12
5
6 + 3 , 1861 40
3. 23 (x − 2) x + 1 4. 3 − ln 8
5. π ab

Section 2.10: 1. Check by differentiation. 2. π
4
, (2π − 4)/π 2 , + 1), 16 − 12 4 e, 2e2 , (2π)−1
1
4
(e2 3. e2 + 1
√ ( )
Section 2.11: 1. 4, 4
3
, D, 3
4
, D, 5, 6, D, 10 13 3. r > 1, r < 0, r < 1 6. πa2 , 83 32/3 − 1 , D
Section 2.12: 1. 1 32 15
, , 2
6 √3
− 8 ln 2, 16 a3 , 3 5 1
a , 2 (a4 − 1) − 2a2 ln a 2. 125
3. 8 4. 12 ln 2 − 4.5
10 √ 6
5. x = 23 , y = 12 − 6 3 6. 16 a2 7. 15 4
8. 3

Section 2.13: 1. 243
5
π, π
2
(e4 −1), 3π 2a. 64
15
π 2b. 8
3
π 3. 56
15
π 6. 1
3
πh2 (3r −h) 7. 64π, 512
15
π 8. 32000 3
1 2
9. 625
6
10. 3
a h

Section 2.14: 1. π(π − 2), 10π ln 10, 2π(e2 + 1) 2. 324 π 3. 16π 6. 43 π(R2 − r2 )3/2 , 1 − 4−1/3 R, 61 πh3
√ 5
7. (2πa)(πr2 ), 43 πr3 + aπ 2 r2 9. 4 8, ≈ 1.2285
( ( )
√ )
Section 2.15: 1. 8985.6 lbs 2. 85w, 54 + 36 3 w, 415
3
w, 63
2
π − 18 w, 24πw, 120w 3. 9052.16 lbs

Section 2.16: 1. 25/6 ft-lbs 2. 1/3 ft-lbs 3. 3 10 in 4. 63303.1 ft-lbs 5. 55734.8 ft-lbs 6. 3993.6 ft-lbs
7. 4800 ft-lbs 8. 332773.1 mi-lbs; 19507.4 mi-lbs; 338812.5 mi-lbs
Section 2.17: 1. (3.7, 0.2, 2.3) 2. (−8, −22) 3. ( 94 , 27 10
, 9 ) 4. ( 21 a, 52 a2 )
5 4
), (3/ ln 4, 3/ ln 16), ( 12
4 4 1 3
5. ( 3π r, 3π r) 6. 4 h 7. 8 r 8. 14 ft 9. 22 11 13
11. ( 5 , 7 ) 13. ≈ 26 15. 3 , (3e4 + 1)/(2e4 − 2), 18
8 16 10
13

Section 2.18: 1. 21 πr 2. 27 1
((4 + 9b)3/2 − 8) 3. 19 6
, 158
99
+ ln 4, 20 , 227
48
, 779
240
, ln |2 + 3| 4. 6

2048
5. 9375 + 62176 29
6. ( π2 r, π2 r)
9375
( ) √ ( ) ( )

Section 2.19: 1a. 32 ln 8 + 4x + x2 − 4 arctan x+2 1b. −2 8 + 2x − x2 + 3 arcsin x−1 1c. 4 arcsin x−3
( ) 2
√ ( ) 3 3
2
1d. 2 ln x2 + 3x + 5 − √611 arctan 2x+3 √ 1e. 5
ln x + 4x + 10 − 6
arctan x+2

√ ( ) 11 2 2 6

1f. −6 4x − x2 + 11 arcsin x−2 2


( ) ( )
2a. 13 x − 19 ln |3x + 1| 2b. 12 ln |x2 + 10| + √310 arctan √x10 2c. x − 2 arctan x2
2x + 23 ln |x2 + 1| −
2d. ( ) arctan x 2e. 1 2
2
x − 2x + ln |x2 + 2x + 2| + 2 arctan(x + 1) 2f. 2x + 8 ln |x − 4|
3. 2 ln |x − 6| − ln |x|
√ √
1 x 1 x2 − 3
Section 2.20: 1a. x(x2 + 2) 4 + x2 − 2 ln x + 4 + x2 1b. √ 1c. ln 2
4 √ 25 25 + 4x2 2 x +√3 √
√ √ √ √ 5 + 5 − x2
4x − 9
2

1d. 3 ln x + 2 + x2 + 4x + 8 1e. − + 2 ln 2x + 4x2 − 9 1f. 5 − x2 − 5 ln
x ( x )
√ √ x √
e 1i. cos x 9 − cos2 x − 9 arcsin cos x
1g. −e−x e2x − 1 + ln ex + e2x − 1 1h. 3 ln √
( √ ( 2 + 4 −) e 2x 3
√ )) √ √ 1 (√ 1( )
2. 9 2 3 − ln 2 + 3 3. 4 17 + 2 65 + ln 65 + 8 6a. 2 sec x tan x + 3 sec x tan x + 3 ln | sec x + tan x|
3
4 8
1( ) 1( )
6b. 3
2 cos x sin x + 3 cos x sin x + 3x 6c. 8 sec x tan x − 2 sec3 x tan x − 3 sec x tan x − 3 ln | sec x + tan x|
5
8 48
C.2 Chapter 2 Answers 173

1( ) 1 2
6d. 8 sin5 x cos x − 2 sin3 x cos x − 3 cos x sin x + 3x 6e. − (x6 + 3x4 + 6x2 + 6)e−x
(
48 ) √ 2
6f. x (ln x)3 − 3(ln x)2 + 6 ln x − 6 7. 3 3 b2 /2
√ ( )
4 16 − x2 √ (x2 + 16)3/2 1 x
Section 2.21: 1a. 4 ln − + 16 − x 2 1b. − 1c. + arctan x
x x 48x3 2 x2 + 1
3
2x 3x x x
1d. + 2 + 3 arctan x 1e. √ − arcsin x 1f.
(x2 + 1)2 x +1 1 − x2 3(1 − x2 )3/2
1 1 11 1 3
Section 2.22: 1a. ln |x| − ln |x + 5| 1b. ln |x − 4| − ln |x + 1| 1c. ln |2x − 1| − ln |x + 3|
5 5 5 5 2
6
1d. 2 ln |x| + 2 ln |x − 2| − 4 ln |x − 1| 1e. 3 ln |x| − 2 ln |x − 1| −
x−1
1f. 2 ln |x|
+ 2 ln
|x 2
+ 6x + 10| − 17 arctan(x + 3) 1g. 4 ln |x| − 2 ln |x2 + 1| + arctan x
1 1 − x 1 3
1h. ln − arctan x 1i. ln |x2 + 4| + 5 arctan x + arctan(x/2)
4 1 + x 2 2
1 √ ( √ ) 1 3 8 1
1j. ln |x − 2| − ln |x + 2x + 4| − 3 arctan (x + 1)/ 3
2
1k. x2 − x + ln |x + 2| + ln |2x − 1|
2 4 4 5 40
3x + 1 9(x + 1) 9
1l. + + arctan(x + 1)
4(x2 + 2x + 2)2 8(x2 + 2x + 2) 8
290
Section 2.23: 1. 0.1361, 0.1353; 0.3160, 0.3099; 0.5125, 0.5127 2. 0.9461 3. 3.8194 4. 18, 3
π, ( 89
27
, 145
54
)
5. 52 m
Section 2.24: 1. n2 , 13 n(4n2 − 1), n2 (2n2 − 1) 2. 12, 32 3. 5050, 73810 4. 37/6
√ √
5. 26/3 6. 40 7. 2nd , 2nd , 1st 8. 3 3.75, (a + b)/2, (a2 + ab + b2 )/3
√ 4 ∫x 2
9. 4x/ x4 + 2, 4 cos(16x2 ), 4te−2t 10. 23 11. (−∞, 14 ) 12. f (x) = 1 et /4 dt
√ ( √ ) √
15. 36 − 24 ln 2, π − 25 , 320
3
16. π/ 2 − ln 3 + 2 2 17. 16 18. x = 2, y = 34 ln 2 19. 2 − 3
4
20. (3.1, −0.4)( 21. (2, 4/5) 22. (12/13,
) 7/13) 23. x = (3e + 1)/2(e − 1) 24. y = 4/3 26. (π/2, π/8)
4 4

28a. − 15
1
cos x 3 sin4 x + 4 sin2 x + 8 28b. (x3 − 3x2 + 6x − 6)ex 28c. 15 tan5 x − 13 tan3 x + tan x − x
( ) √ √
28d. −(x2 + 2x + 2)e−x 28e. 2 x5/2 − 5x2 + 20x3/2 − 60x + 120x1/2 − 120 e x 28f. 9e − 24 32. 100 2 π 2
27 64
33. 1378.7 gal 34. 54π 35g. 35h. 35i. 64 + 20π 35p. 1 35q. 25( 35r. D,) check the indefinite integrals
2 3
√ √
by differentiation 36. 8 32
3
, 5 π, 8π, 56
3
π, 24π, 224
15
π, 32 , 65 , 53 , 6 + 17 − 41 ln 17 − 4 , 27215
w
37. 2, 12 π 2 , 2π 2 , 2π(10 − π), 2π(6 + π), 2 π(32 − π), 12 π, 18 π,
1
(2
1
π, 6.962, 1
(64 −
) π)w 38. 64
, (0, 2.4) assuming vertex
√ 4 3
at (0, 0), 10117.12 lbs 39. 2
3
π(2r + a ) r − a + aπr 2 arcsin(a/r) − π for 0 < a < r
2 2 2 2 2
174 Appendix C Answers to Exercises

C.3 Chapter 3 Answers


{ } { }
Section 3.2: 2. 31 , 12 , 19 , 14 , 15
1
, . . .; 12 , 34 , 15 , 105 , 945 , . . .; 2, 72 , 25 , 35 177
, 40 , . . . 3. n/2n , 1 − (−1)n ,
{ } 8 16 32 6 8
−1
1/n(n + 1) 4. 7 , − 3 , 2 , 0, 2 , 0, π, ln 2, e
2 1 1 1
5. I, D, I

Section 3.3: 1. 1, 3, −7 2. e4 , e−1/2 , e−3 , 0, 12 , 1, 0, 1, 0 4. b


(√ ) √ ( √ ) ( √ )
Section 3.4: 4. 5 − 1 /2 5. 10 6. 24/7 7. 3 + 5 /2 8. 1 + 13 /2
Section 3.5: 1. n/5, (5n − 1)/(4 · 5n ), n/(4n + 16) 3. 1 4
, ,
5 9
b2 /(1 + b2 ), 1
3
, − 10
7
, 9
235
, 4, 9, 19
5. −7, 23
2
, − 26
1
, 23
√ √
Section 3.6: 1. C, D, C, C 3. n = 485, 165, 195 4. p > 1, p > 1, 2 − 3<p<2+ 3, p > 1
Section 3.7: 1. C, D, C 2. C, D, D 3. D, C, D, C, D, D
Section 3.8: 2. C, C, D, C 4. 0 < p ≤ 1 8. AC, D, CC, AC
Section 3.9: 2. C, C, D, C, C, C 6. −2 < x < 2, −∞ < x < ∞, −3 < x < 7
3. CC, AC, D, AC, CC, AC

Section 3.10: 1. 1, ∞, 2. [−5, 5], (1, 7), [−2.5, −1.5)
5
3
3. C at 0, 4.4, D at 110, not enough info for


xk


(x−5)k ∑

(x−4)k
8.5, −2; (−2, 8], [−3, 9) 4. 3.5, 5.5 5. [−1, 1], [−1, 1), (−1, 1], (−1, 1) 6. k 2 3k
, k3k
, 8k
,
k=1 k=1 k=1


(−1)k+1
k2k
(x + 1)k 7. 2/(2 − x), 1/(2x + 7), (x − 1)/(4 − x)
k=1

∞ ∑
∞ ∑
∞ ∑

Section 3.11: 1a. 4k xk , (− 41 , 14 ) 1b. (−2)k−1 xk , (− 12 , 12 ) 1c. 5−k−1 xk , (−5, 5) 1d. (−1)k x2k+1 ,
k=0 k=1 k=0 k=0

∞ ∑
∞ ∑

(−1, 1) 1e. (−1)k x2k+1 /(2k +1), [−1, 1] 1f. 1
2
(k +2)(k +1)xk , (−1, 1) 2. (x+x2 )/(1−x)3 3. k+1
2k+1
xk ,
k=0 k=0 k=0


5k−1


(−1)k (k+1)
(2k−1)!
xk , 4k+3
xk 4. 5 ln 2, 12, 1
4
ln(0.8) 5. ck = 1/k!
k=2 k=0


2k


(−1)k 52k+1 ∑

(−1)k ∑

(−16)k−1 ∑

(−1)k+1
Section 3.12: 1a. k!
xk 1b. (2k+1)!
x2k+1 1c. (2k)!
x4k 1d. (2k−2)!
x2k 1e. xk 1f.
(k−1)!



k=0


k=0


k=0


k=1 k=1
(√ )
(−1)k+1 (−1)k (−1)k 22k−1 2
(2k+3)!
x2k 1g. (2k+1)(2k+1)!
x2k+1 1h. 1
(2k)!
x2k 1i. 1+ (2k)!
x2k 2. e−x , e−x , x cos 2 x
k=0 k=0 k=1 k=1


1·3·5·····(2k−3) ∑

2·5·8·····(3k−4)
3a. 3+ 61 (x−9)+ (−1)k+1 2k 32k−1 k!
(x−9)k , ρ = 9 3b. 1+ 31 (x−1)+ (−1)k+1 3k k!
(x−1)k , ρ = 1
k=2 k=2


(−1)k+1 ∑

3c. 2k+1
(x − 5)k , ρ = 2 4. 1 + 1·3·5·····(2k−1)
2k k!
xk 5. −100 · 299 , 100!/33!, −100!/52! 7. ii = e−π/2
k=0 k=1

Section 3.13: 2. 1201


, P14 = P13 , P10 , 0.0233 3. 2+ 14 (x−4)− 64 1
(x−4)2 + 512
1
(x−4)3 − 16384
5
(x−4)4 + 131072
7
(x−4)5
1
4. 2 + 12 (x − 8) − 288
1
(x − 8)2 + 20736 5
(x − 8)3 − 248832
5
(x − 8)4 5. 2x + 23 x3 + 25 x5 + 72 x7
( √ )
Section 3.14: 1. 85 , − 14 , 14 , 13 , π4 , ln 7, 3, e1/4 , e−5/2 2. 34 3. xk = 3/(6k + 2) 4. 76 6. 3 − 5 /2 8. z

11. − 71 , 157
12. (−1, 1), (2 + r)/(1 − r2 ), 1 + 23 14. D, CC, AC, AC, D, AC, AC, AC, AC 15. π 2 /8, π 2 /12
16. 74 , 27 , 17 3
17. 13 5
, 13 5
, 13 6
; 13 2
, 13 5
, 13 9
; 13 3
, 26 5
, 26 19. 1000, 15, 1012 , exp(106 ) ≈ 10434294 , 106 , 23, 6, 17 20d.
∑ (2k)! k
∞ ∑

(2k)! ∑ (−1)k
∞ ∑

(−1)k ∑

(−1)k
4k (k!)2
x 20e. (2k+1)4k (k!)2
x2k+1 21. , 1
(2k+1) k! 2 4k (2k+1)(2k+1)!
, (4k+1)(2k)!
22. 4, 7 23.
k=0 k=0 k=0 k=0 k=0
√ ∑
∞ ∑
∞ ∑

(−1)k 6k+2
3 ρ 24. 1 k
2k
x , 5k+1
k
x , k
6k+2
x 25. (x + 4x + x)/(1 − x) , (x + 11x3 + 11x2 + x)/(1 − x)5
3 2 4 4
k=1 k=1 k=0 (√ ) ( √ ) ( )
26. 6, 3
4
, −3 27. 165
, 20
27
, 222 28. e4x , 1
3
sin 3x, −5x cos 5 x 29. − 15 sin 5, 1 − cos 1/ 3 , 12 e−1/2 − 1
30. 40! · 240 , 40!/3 , 40! · 3
40 20
31a. 1 + x − 13 x3 − 16 x4 − 30
1 5 1
x + 630 x7 + · · ·
31b. x − x2 + 31 x3 − 301 5
x + 1 6
90
x − 630
1
x7 + · · · 31c. x + 13 x3 + 152 5 17 7
x + 315 x + 62
2835
x9 + ···

You might also like