Ec2065 - SG (2022)
Ec2065 - SG (2022)
Macroeconomics
K. Sheedy
EC2065
2022
Macroeconomics
K. Sheedy
EC2065
2022
Undergraduate study in
Economics, Management,
Finance and the Social Sciences
This subject guide is for a 200 course offered as part of the University of London
undergraduate study in Economics, Management, Finance and the Social Sciences.
This is equivalent to Level 5 within the Framework for Higher Education Qualifications
in England, Wales and Northern Ireland (FHEQ).
For more information, see: london.ac.uk
This guide was prepared for the University of London by:
Dr Kevin Sheedy, Assistant Professor of Economics, Department of Economics, The
London School of Economics and Political Science.
This is one of a series of subject guides published by the University. We regret that
due to pressure of work the authors are unable to enter into any correspondence
relating to, or arising from, the guide. If you have any comments on this subject
guide, favourable or unfavourable, please use the form at the back of this guide.
University of London
Publications Office
Stewart House
32 Russell Square
London WC1B 5DN
United Kingdom
london.ac.uk
Contents
Chapter 1: The supply side of the economy ........................................................................... 8
1.1 Production functions and factors of production........................................................... 8
1.1.1 Factors of production............................................................................................ 8
1.1.2 The production function ....................................................................................... 9
1.1.3 Neoclassical production functions....................................................................... 10
1.1.4 The Cobb-Douglas production function ............................................................... 12
1.1.5 The per worker production function ................................................................... 12
1.2 Factor markets and the distribution of income .......................................................... 13
1.2.1 The Cobb-Douglas production function ............................................................... 15
Box 1.1: Understanding inequality in wages ................................................................ 15
1.3 Population growth according to Malthus ................................................................... 17
1.3.1 Demographics ..................................................................................................... 17
1.3.2 An agricultural economy ..................................................................................... 18
1.3.4 Dynamics of a Malthusian economy.................................................................... 19
1.3.5 What does (or does not) help? ............................................................................ 20
Box 1.2: A 14th-century pandemic ............................................................................... 21
1.4 Hours of work and the supply of labour ..................................................................... 22
1.5 The effects of wages on labour supply ....................................................................... 26
1.5.1 Effects on those already participating in the labour market ................................ 26
1.5.2 Effect on the labour-market participation decision ............................................. 27
1.5.3 The labour supply curve ...................................................................................... 28
1.5.4 Do higher tax rates raise more revenue? ............................................................ 30
1.6 Equilibrium and efficiency ......................................................................................... 31
1.6.1 A static macroeconomic model ........................................................................... 31
1.6.2 Equilibrium in labour and goods markets ............................................................ 31
1.6.3 Economic efficiency ............................................................................................ 32
Box 1.4: Should wages or rents be taxed to pay for public expenditure? ..................... 34
1.6.4 Taxing wages but not rents ................................................................................. 35
1.6.5 Taxing rents but not wages ................................................................................. 35
1.7 Capital accumulation ................................................................................................. 36
i
EC2065 Macroeconomics
ii
EC2065 Macroeconomics
iii
EC2065 Macroeconomics
iv
EC2065 Macroeconomics
v
EC2065 Macroeconomics
vi
EC2065 Macroeconomics
Box 7.4: Forecasting from the shape of the yield curve.............................................. 249
7.8 Risk and portfolio choice ......................................................................................... 250
Box 7.5: The typical shape of the yield curve ............................................................. 253
7.9 The functions of banks............................................................................................. 254
7.9.1 The Diamond-Dybvig model .............................................................................. 255
7.9.2 An economy with no financial intermediaries ................................................... 256
7.10 Banking as maturity transformation ...................................................................... 256
7.10.1 Bank deposits ................................................................................................. 256
7.10.2 Competition between banks ........................................................................... 257
7.10.3 The equilibrium deposit contract offered by banks ......................................... 258
7.11 Bank runs .............................................................................................................. 260
7.11.1 Strategic demands for withdrawals and bank failures ..................................... 260
7.11.2 Multiple equilibria and the possibility of bank runs ......................................... 261
7.11.3 The 2007 Northern Rock bank run .................................................................. 261
7.11.4 The ‘shadow’ banking system and the 2008 financial crisis ............................. 261
7.12 Deposit insurance and bank regulation .................................................................. 261
7.12.1 Deposit insurance ........................................................................................... 262
7.12.2 The central bank as ‘lender of last resort’ ....................................................... 262
7.12.3 Bank capital requirements .............................................................................. 262
7.12.4 Reserve requirements ..................................................................................... 263
Box 7.6: The 100 per cent reserve requirements ....................................................... 263
Box 7.7: Central-bank digital currency ....................................................................... 265
Chapter 8: Business cycles................................................................................................. 266
8.1 Nominal rigidity ....................................................................................................... 266
8.2 The new Keynesian model ....................................................................................... 268
8.3 The real effects of monetary policy.......................................................................... 271
Box 8.1: The Volcker disinflation................................................................................ 272
8.4 Business cycles due to demand shocks .................................................................... 273
Box 8.2: Can the new Keynesian model match the business-cycle stylised facts?....... 274
Box 8.3: Labour hoarding ........................................................................................... 280
8.5 The natural rate of interest...................................................................................... 281
8.5.1 Imperfect competition and the output supply curve ......................................... 281
8.5.2 Market clearing in the absence of nominal rigidities ......................................... 282
vii
EC2065 Macroeconomics
8.5.3 The long run and the short run ......................................................................... 283
8.6 Optimal stabilisation policy...................................................................................... 283
Box 8.4: Modelling monetary policy using Taylor rules and LM curves ....................... 285
8.6.1 Money supply targets, the LM curve and the IS-LM model................................ 286
8.6.2 The Taylor rule .................................................................................................. 287
8.7 Real business cycle theory ....................................................................................... 288
8.8 Business cycles due to supply shocks ....................................................................... 289
8.8.1 Supply shocks ................................................................................................... 289
8.8.2 The predictions of the RBC model ..................................................................... 290
8.8.3 Stabilisation policy? .......................................................................................... 291
Box 8.5: Sources of supply shocks in the RBC model .................................................. 291
8.9 Coordination failure model ...................................................................................... 293
8.9.1 Labour productivity spillover across firms ......................................................... 293
8.9.2 Implications for the output supply curve .......................................................... 295
8.10 Multiple equilibria and business cycles .................................................................. 296
Box 8.6: The strength of strategic complementarities................................................ 299
Chapter 9: Inflation, expectations and macroeconomic policy........................................... 300
9.1 Inflation and the Phillips curve ................................................................................ 300
9.1.1 Firms’ incentives to adjust prices ...................................................................... 300
9.1.2 Price changes and economic activity ................................................................. 301
9.1.3 Expectations ..................................................................................................... 302
9.1.4 The Phillips curve .............................................................................................. 302
9.1.5 Inflation and unemployment ............................................................................ 303
9.2 Expectations and aggregate demand ....................................................................... 303
9.2.1 Inflation expectations and real interest rates .................................................... 303
9.2.2 Expectations of the economy’s future GDP ....................................................... 304
9.3 Aggregate demand with market imperfections ........................................................ 304
9.3.1 Consumption and aggregate demand ............................................................... 305
9.3.2 Consumption and aggregate demand with credit-market imperfections........... 306
Box 9.1: Multiplier and crowding-out effects of fiscal policy ...................................... 307
Box 9.2: Asset prices and the financial accelerator .................................................... 308
Box 9.3: The 2008 financial crisis ............................................................................... 309
9.4 Inflation, aggregate demand and monetary policy................................................... 310
viii
EC2065 Macroeconomics
ix
EC2065 Macroeconomics
x
EC2065 Macroeconomics | General information
General information
Module level: 5
Credit: 30
Notional study time: 300 hours
Summary
This course will cover the fundamental principles of macroeconomics at an
intermediate level. Topics include the supply side of the economy and economic
growth, the demand side of the economy, consumption, investment, fiscal policy, the
labour market and unemployment, financial markets, banking, money and monetary
policy, business cycles, inflation, and international macroeconomics.
Conditions
Prerequisite: If taken as part of a BSc degree, the following course(s) must be
passed before this course may be attempted:
EC1002 Introduction to economics and
Either MT105A Mathematics 1 (half course) or MT1174 Calculus or MT1186
Mathematical methods).
Learning outcomes
At the end of the course and having completed the essential reading and activities,
you should be able to think about and give answers to key macroeconomic
questions, for example:
• What are the forces that drive long-term prosperity?
• Is a growth slowdown in emerging economies inevitable?
• Why are real interest rates so low?
• What causes bubbles in financial markets?
• Does the government have a budget constraint?
• How does the labour market respond to structural change and shifting
employment patterns?
• What is the role of banks and why are they inherently fragile?
• Is it a good idea for central banks to set up new digital currencies?
1
EC2065 Macroeconomics | General information
Essential reading
The textbook for the course is:
Williamson, S.D. Macroeconomics. (London: Pearson, 2018) 6th edition
[ISBN 9780134472119].
Detailed reading references in this subject guide refer to the editions of the set
textbooks listed above. New editions of one or more of these textbooks may have
been published by the time you study this course. You can use a more recent edition
of any of the books; use the detailed chapter and section headings and the index to
identify relevant readings. Also check the virtual learning environment (VLE)
regularly for updated guidance on readings.
The VLE
The VLE, which complements this subject guide, has been designed to enhance
your learning experience, providing additional support and a sense of community. It
forms an important part of your study experience with the University of London and
you should access it regularly.
The VLE provides a range of resources for EMFSS courses:
• Course materials: Subject guides and other course materials available for
download. In some courses, the content of the subject guide is transferred
2
EC2065 Macroeconomics | General information
into the VLE and additional resources and activities are integrated with the
text.
• Readings: Direct links, wherever possible, to essential readings in the Online
Library, including journal articles and ebooks.
• Video content: Including introductions to courses and topics within courses,
interviews, lessons and debates.
• Screencasts: Videos of PowerPoint presentations, animated podcasts and
on-screen worked examples.
• External material: Links out to carefully selected third-party resources.
• Self-test activities: Multiple-choice, numerical and algebraic quizzes to
check your understanding.
• Collaborative activities: Work with fellow students to build a body of
knowledge.
• Discussion forums: A space where you can share your thoughts and
questions with fellow students. Many forums will be supported by a ‘course
moderator’, a subject expert employed by LSE to facilitate the discussion and
clarify difficult topics.
• Past examination papers: We provide up to three years of past
examinations alongside Examiners’ commentaries that provide guidance on
how to approach the questions.
• Study skills: Expert advice on getting started with your studies, preparing for
examinations and developing your digital literacy skills.
Note: Students registered for Laws courses also receive access to the dedicated
Laws VLE.
Some of these resources are available for certain courses only, but we are
expanding our provision all the time and you should check the VLE regularly for
updates.
3
EC2065 Macroeconomics | General information
Syllabus
Topic 1: The supply side of the economy
Contents: Factors of production (labour, land, and capital), the production function
and the supply side of the economy, the distribution of income between factors
(wages, rents, and interest), population growth, labour-market participation and
labour supply, taxation, competitive equilibrium and efficiency, the Solow model.
Questions to address
• Why was economic growth so different before and after the Industrial
Revolution?
• Why have hours worked declined in advanced economies even though wages
are so much higher than in the past?
• Why has wage inequality increased in recent decades?
• Should land-value taxes be used?
• Is a growth slowdown in emerging economies inevitable?
Topic 2: Economic growth in the long run
Contents: Evidence on economic growth and the income distribution across
countries, convergence, saving rates and the Golden rule, technological progress,
international investment flows, institutions and misallocation, endogenous growth
theory, learning by doing, human capital, research and development, diffusion of
knowledge between countries.
Questions to address
• What are the forces that drive long-run prosperity?
• Are we saving enough for the future?
• Why is the gap between rich and poor countries so large?
• How strong should intellectual property rights be?
• What are the implications of climate change for the economy?
Topic 3: The demand side of the economy
Contents: Evidence on macroeconomic fluctuations using detrended data,
consumption, the relationship between consumption and income, durable and non-
durable goods and services, interest rates and saving, bond yields, determinants of
real interest rates, investment, the stock market, a dynamic macroeconomic model.
Questions to address
• Why are purchases of capital goods by firms very volatile while purchases of
services by households are more stable?
• Why are real interest rates so low?
4
EC2065 Macroeconomics | General information
5
EC2065 Macroeconomics | General information
monetary policy on inflation and economic activity, controlling inflation using Taylor
rules, the lower bound on nominal interest rates.
Questions to address
• Why has the link between money supply growth and inflation been unstable?
• What advantages do governments derive from being able to create money?
• What are hyperinflations and why are they so damaging?
• How should central banks conduct monetary policy?
• Is it better to have a monetary system without physical cash?
• Can and should nominal interest rates ever be negative?
• How do cryptocurrencies differ from existing forms of money?
6
EC2065 Macroeconomics | General information
7
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Using the supply-side model, we will look at what explains the level of GDP and how the
economy’s total income is distributed. This model serves as the foundation for our study of
economic growth in Chapter 2, where we will explore why there is growth in GDP per
person over time and why the level of GDP per person differs so much across countries.
Essential reading
• Williamson, Chapters 4, 5 and 7.
8
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
simplicity, we denote the fixed quantity by 𝐿𝐿. Natural resources can also be included as part
of land, broadly interpreted.
Labour as a factor of production refers to work done by people to produce goods and
services. The supply of labour to the economy depends on how many workers there are and
the number of hours worked by each person. To begin with, we will treat labour as
homogeneous and denote the quantity by 𝑁𝑁 that, depending on the context, could be
measured as a number of people or a number of hours worked. More broadly, we could also
consider the skills, training and experience of the workforce as part of what determines the
supply of labour, although these are often counted as a separate factor of production
known as ‘human capital’, considered in Chapter 2.
9
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
10
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
• Inada conditions:
lim 𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = ∞ , lim 𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 0
𝐾𝐾→0 𝐾𝐾→∞
These are for capital. The same assumptions hold in terms of 𝐹𝐹𝐿𝐿 and 𝐹𝐹𝑁𝑁 for land and
labour.
A production function that satisfies all three assumptions is called a ‘neoclassical production
function’. Figure 1.1 plots the relationship between output 𝑌𝑌 and capital 𝐾𝐾 for a neoclassical
production function 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝐿𝐿, 𝑁𝑁). This is not a plot of the entire production function
because land and labour are held constant at 𝐿𝐿0 and 𝑁𝑁0 but similar diagrams can be used to
show the relationship between 𝑌𝑌 and 𝐿𝐿, and 𝑌𝑌 and 𝑁𝑁, holding the other two factors fixed.
However, the constant returns to scale assumption cannot be illustrated in the diagram
because that would require changing all the factor inputs at the same time.
Figure 1.1: A neoclassical production function
The production function is upward sloping because its gradient is the marginal product of
capital, which is positive. The gradient declines as 𝐾𝐾 increases because the marginal product
of capital is diminishing. The Inada conditions imply the production function is extremely
steep for 𝐾𝐾 close to zero and flattens out as 𝐾𝐾 becomes very large. The production function
must also pass through the origin because some capital is essential for production given the
neoclassical assumptions.
11
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
12
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
the marginal product of capital is 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝜕𝜕𝜕𝜕⁄𝜕𝜕𝜕𝜕 = 𝑧𝑧𝑓𝑓 ′ (𝐾𝐾⁄𝑁𝑁) = 𝑧𝑧𝑧𝑧′(𝑘𝑘). The neoclassical
assumptions state that 𝑀𝑀𝑃𝑃𝐾𝐾 is positive but diminishing in capital 𝐾𝐾, which must also hold for
capital per worker 𝑘𝑘. This implies 𝑓𝑓′(𝑘𝑘) is an increasing and concave function of 𝑘𝑘.
For each of the factor markets, there is a factor price. In the labour market, this is a wage 𝑤𝑤
per hour of labour, or per person working for a given amount of time. For land, there is a
rent 𝑥𝑥 and for capital there is a rental price 𝑅𝑅. We do not consider markets for outright
purchases or sales of factors of production at this stage. In the factor markets, households
supply factors of production that they own, which are hired or rented by firms. Households
supply their own labour, together with land and capital goods. In this analysis, firms do not
own factors of production themselves. As in the earlier model of production, all supplies of
factors of production are taken as given, so the supply curves are price inelastic.
On the demand side, firms hire factors of production 𝐾𝐾, 𝐿𝐿 and 𝑁𝑁 to produce output 𝑌𝑌,
taking prices and factor prices as given. The production function is 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝐿𝐿, 𝑁𝑁), which
has the same properties assumed earlier. Firms aim to maximise profits 𝜋𝜋:
𝜋𝜋 = 𝑌𝑌 − 𝑅𝑅𝑅𝑅 − 𝑥𝑥𝑥𝑥 − 𝑤𝑤𝑤𝑤
These profits are measured in real terms, making the price of a unit of output equal to 1.
The revenue of the firm is simply the quantity 𝑌𝑌 of output it produces and sells. Its costs are
its spending on hiring factors of production, which are given by the factor prices multiplied
by the quantities hired of each factor.
The first-order conditions for profit maximisation are 𝜕𝜕𝜕𝜕⁄𝜕𝜕𝜕𝜕 = 0, 𝜕𝜕𝜕𝜕⁄𝜕𝜕𝜕𝜕 = 0 and
𝜕𝜕𝜕𝜕⁄𝜕𝜕𝜕𝜕 = 0, which are equivalent to the following:
𝑧𝑧𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑅𝑅 , 𝑧𝑧𝐹𝐹𝐿𝐿 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑥𝑥 , 𝑧𝑧𝐹𝐹𝑁𝑁 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑤𝑤
The terms 𝑧𝑧𝑧𝑧𝐾𝐾 , 𝑧𝑧𝑧𝑧𝐿𝐿 and 𝑧𝑧𝑧𝑧𝑁𝑁 denote the marginal products of capital, land and labour
respectively, where 𝐹𝐹𝐾𝐾 , 𝐹𝐹𝐿𝐿 , and 𝐹𝐹𝑁𝑁 are the partial derivatives of the function 𝐹𝐹(𝐾𝐾, 𝐿𝐿, 𝑁𝑁) with
respect to 𝐾𝐾, 𝐿𝐿, and 𝑁𝑁. These marginal products are diminishing in the quantity hired of
each factor, holding the quantities of the other factors constant. Hence, in each factor
market diagram with the factor price on the vertical axis and the quantity hired on the
13
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
horizontal axis, the demand curves of firms are the downward-sloping marginal product
curves. The supply curves are vertical, representing given price-inelastic supplies of each
factor.
In competitive markets, the factor prices are determined by market clearing. The
equilibrium rental prices of capital and land 𝑅𝑅∗ and 𝑥𝑥 ∗ and the equilibrium wage 𝑤𝑤 ∗ are
found where the factor demand curves intersect the supply curves. This is illustrated in the
rental market for capital in Figure 1.2. The diagram shows how the amount of real capital
income 𝑅𝑅 ∗ per unit of capital owned is determined. The total amount of capital income is
then 𝑅𝑅 ∗ 𝐾𝐾, where 𝐾𝐾 is the supply of capital. Note that this is gross capital income – there is
no allowance made here for depreciation of capital, consistent with how GDP is a measure
of gross income. Similar diagrams can be used to see how rents of land and wages are
determined.
Figure 1.2: Factor market equilibrium
While firms are maximising profits when they choose their factor demands, perfect
competition and a constant-returns-to-scale production function imply that profits 𝜋𝜋 are
zero in equilibrium. It is a general mathematical property that if 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) has constant
returns to scale then:
𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑧𝑧𝑧𝑧𝐹𝐹𝐾𝐾 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) + 𝑧𝑧𝑧𝑧𝐹𝐹𝐿𝐿 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) + 𝑧𝑧𝑧𝑧𝐹𝐹𝑁𝑁 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁)
Intuitively, constant returns to scale implies that a 1% increase in each of the factors of
production 𝐾𝐾, 𝐿𝐿, and 𝑁𝑁 adds 1 per cent to existing output 𝑌𝑌, that is, this raises output by
0.01 × 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝐿𝐿, 𝑁𝑁). Adding 1 per cent to capital 𝐾𝐾 raises output by 0.01𝐾𝐾 × 𝑧𝑧𝑧𝑧𝐾𝐾 , where 𝑧𝑧𝑧𝑧𝐾𝐾
is the marginal product of capital. Adding 1% to labour 𝐿𝐿 would raise output by
0.01𝑁𝑁 × 𝑧𝑧𝑧𝑧𝑁𝑁 , and so on for all factors. Summing over all factors then confirms the equation
above.
Since profit-maximisation by firms in perfectly competitive markets equalises marginal
products and factor prices for each factor, it follows that
𝑌𝑌 = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝐿𝐿, 𝑁𝑁) = 𝑅𝑅𝑅𝑅 + 𝑥𝑥𝑥𝑥 + 𝑤𝑤𝑤𝑤
14
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
This means that payments to factors of production will use up all a firm’s revenues, so
profits 𝜋𝜋 are zero. Strictly speaking, this shows that economic profits are zero. Where firms
own factors of production themselves rather than renting them, which is often the case with
land and capital, some of the payments to factors described above effectively go the owner
of the firm. In that case, accounting profits would not be zero. However, these would not be
true economic profits but would instead represent the implicit rental of the factors of
production owned by the firm.
One important dimension of the rise in wage inequality is the increase in the relative
wages of highly skilled workers compared to those with more basic skills. While a
university or college education is not the only measure of having skills, the ‘college-
wage premium’ in the USA and elsewhere has received much attention. The size of the
wage premium from attending university is crucial to the debate on the returns to
higher education. A large premium means the returns might be high even if the cost of
education has risen.
In the USA, prior to 1980, an average college-educated worker earned less than 60 per
cent extra compared to an average worker without a college education. By the 2010s,
this college-wage premium had risen to close to 100 per cent. At first glance, this is
puzzling because there has also been a substantial increase in the fraction of college-
educated workers during that period, which rose from 20 per cent to 50 per cent. With
diminishing returns to individual factors of production, an increase in supply should
push down the factor payment, all else being equal. These observations suggest
something else must have changed after 1980.
15
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
One hypothesis we will explore is a shift in the relative demand for workers with
different levels of skill. We separate the supply of labour into highly skilled workers 𝐻𝐻
and unskilled workers 𝑁𝑁. The supply of 𝐻𝐻 is related to the concept of ‘human capital’
studied in Section 2.8. We consider the following example of a production function 𝑌𝑌 =
𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁, 𝐻𝐻):
𝑌𝑌 = 𝑧𝑧(𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 + 𝐵𝐵𝐾𝐾 𝛼𝛼 𝐻𝐻1−𝛼𝛼 )
As with a Cobb-Douglas production function, the parameter 𝛼𝛼, a number between 0 and
1, indicates the importance of physical capital in producing goods and services. The
variable 𝑧𝑧 is total factor productivity and a change in 𝑧𝑧 affects the marginal products of
all factors of production. What is new in the production function above is 𝐵𝐵, an
exogenous variable that represents what is known as ‘skill-biased technology’. A change
in 𝐵𝐵 affects the marginal product of skilled labour 𝐻𝐻 but not the marginal product of
unskilled labour 𝑁𝑁.
The production function above resembles two Cobb-Douglas production functions that
are added together. However, it is not possible to use a standard Cobb-Douglas
production function such as 𝑌𝑌 = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝐻𝐻𝛽𝛽 𝑁𝑁 1−𝛼𝛼−𝛽𝛽 in this exercise. If 𝐻𝐻 were multiplied
by a coefficient 𝐵𝐵 then this would be algebraically equivalent to a change in total factor
productivity 𝑧𝑧. It is not possible to build in skill-biased technological change with a basic
Cobb-Douglas production function.
We now apply our analysis of the distribution of income. Competitive markets imply
wages 𝑤𝑤𝐻𝐻 and 𝑤𝑤𝑁𝑁 for skilled and unskilled labour that are equal to their marginal
products:
𝜕𝜕𝜕𝜕 𝜕𝜕𝜕𝜕
𝑤𝑤𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑁𝑁 = = (1 − 𝛼𝛼 )𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 −𝛼𝛼 , 𝑤𝑤𝐻𝐻 = 𝑀𝑀𝑃𝑃𝐻𝐻 = = (1 − 𝛼𝛼 )𝑧𝑧𝑧𝑧𝐾𝐾 𝛼𝛼 𝐻𝐻 −𝛼𝛼
𝜕𝜕𝜕𝜕 𝜕𝜕𝜕𝜕
The implications for the relative wage 𝑤𝑤𝐻𝐻 /𝑤𝑤𝑁𝑁 can be deduced from these equations:
𝑤𝑤𝐻𝐻 (1 − 𝛼𝛼 )𝑧𝑧𝑧𝑧𝐾𝐾 𝛼𝛼 𝐻𝐻 −𝛼𝛼 𝐻𝐻 −𝛼𝛼
= = 𝐵𝐵 � �
𝑤𝑤𝑁𝑁 (1 − 𝛼𝛼 )𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 −𝛼𝛼 𝑁𝑁
The relative wage declines with the relative supply of high-skilled labour 𝐻𝐻/𝑁𝑁 but it
increases with skill-biased technological change, that is, higher 𝐵𝐵.
This logic suggests one explanation for the rising skill premium (the relative wage
𝑤𝑤𝐻𝐻 /𝑤𝑤𝑁𝑁 increasing) alongside an increase in the relative supply of skilled labour 𝐻𝐻/𝑁𝑁 is
skill-biased technological change. Skill-biased technological change is improvements in
technology that disproportionately boost the productivity of skilled workers compared
to unskilled workers. For example, advances in computing, telecommunications, data
science and e-commerce may increase demand for highly skilled workers but not
unskilled workers. These changes can be represented by an increase in 𝐵𝐵 rather than an
increase in total factor productivity 𝑧𝑧. Earlier technological progress that may have been
more uniform in its effects is represented by higher TFP 𝑧𝑧. As we have seen, higher 𝐵𝐵
can raise 𝑤𝑤𝐻𝐻 /𝑤𝑤𝑁𝑁 even though there is an increase in the relative supply 𝐻𝐻/𝑁𝑁 of skilled
workers. Changes in TFP do not affect relative wages 𝑤𝑤𝐻𝐻 /𝑤𝑤𝑁𝑁 .
16
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Skill-biased technological progress is not the only explanation for the rising skill
premium. Globalisation owing to lower barriers to international trade is another
explanation. Even if labour is not mobile internationally, free trade in goods tends to
equalise the skill premium across countries through equalising the relative prices of
goods that are produced more or less intensively using skilled labour. The skill premium
is then determined by the relative supply of skilled labour at world level, where there
are relatively fewer skilled workers than within advanced economies.
1.3.1 Demographics
Malthus in his Essay on the principle of population argued that per capita income and
consumption affect population growth. Lower consumption per person leads to worse
nutrition and health, hence higher death rates and infant mortality, and lower or negative
population growth rate. Furthermore, lower income per person induces families to have
fewer children they would struggle to support, hence lower birth rates and a lower
population growth rate. Higher income and consumption have the opposite effects and
raise the population growth rate. These effects are larger when people are close to
subsistence.
In the model, assume that all individuals are workers. The current population and number of
workers is denoted by 𝑁𝑁. The future population is denoted 𝑁𝑁′, where the notation ′ refers
to a value of a variable in the next time period (the future). The population growth rate
between the current and future time periods is (𝑁𝑁 ′ − 𝑁𝑁)/𝑁𝑁. If 𝐶𝐶 is aggregate consumption,
𝑐𝑐 = 𝐶𝐶/𝑁𝑁 measures average consumption per person, which is a measure of average living
standards. A mathematical representation of the demographics assumed by Malthus is:
𝑁𝑁 ′
= 𝑔𝑔(𝑐𝑐 )
𝑁𝑁
17
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The demographic function 𝑔𝑔(𝑐𝑐) is increasing in 𝑐𝑐, an example of which is depicted in Figure
1.3. The population growth rate is 𝑔𝑔(𝑐𝑐 ) − 1, so for levels of 𝑐𝑐 where 𝑔𝑔(𝑐𝑐 ) > 1 the
population is rising , and for levels of 𝑐𝑐 where 𝑔𝑔(𝑐𝑐 ) < 1 the population is falling.
Figure 1.3: Demographics and living standards
18
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
For a low current population 𝑁𝑁, 𝑓𝑓(𝐿𝐿⁄𝑁𝑁) is high and g(z𝑓𝑓(𝐿𝐿⁄𝑁𝑁)) is greater than 1, so 𝑁𝑁 ′ >
𝑁𝑁, which means the population is increasing over time. For a high current population 𝑁𝑁,
𝑓𝑓(𝐿𝐿⁄𝑁𝑁) is low and g(z𝑓𝑓(𝐿𝐿⁄𝑁𝑁)) is less than 1, so 𝑁𝑁 ′ < 𝑁𝑁, which means the population is
falling over time. This tells us that for given parameters of this model, such as the level of
technology 𝑧𝑧, the population converges to a steady state 𝑁𝑁 ∗ .
A steady state is a value of a variable such that once the economy reaches that level of the
variable, there is no further change in that variable over time. The steady state of the
Malthusian model is depicted in Figure 1.5. Since the population converges to a steady
state, per capita income and consumption also reach a steady state 𝑐𝑐 ∗ . For this to result in
zero population growth, it must be the solution of equation 𝑔𝑔(𝑐𝑐 ∗ ) = 1. This solution is
shown using the demographic function in the left panel of the figure. Intuitively, if 𝑐𝑐 > 𝑐𝑐 ∗
then the population rises, pushing down 𝑙𝑙 and 𝑦𝑦 = 𝑐𝑐.
19
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Given the steady state 𝑦𝑦 ∗ = 𝑐𝑐 ∗ for output and consumption per worker, the steady state for
land per worker 𝑙𝑙 ∗ is the solution of 𝑦𝑦 ∗ = 𝑧𝑧𝑧𝑧(𝑙𝑙 ∗ ). This is found using the per worker
production function in the right panel of the figure. Finally, given 𝑙𝑙 ∗ , the population in
steady state is simply 𝑁𝑁 ∗ = 𝐿𝐿⁄𝑙𝑙 ∗ .
As we will see in Section 2.1, the Malthusian model’s prediction of stagnation in living
standards 𝑐𝑐 ∗ is consistent with historical evidence prior to the 19th century.
20
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The pandemic also has significant distributional effects. Let us consider how total output 𝑌𝑌
is distributed among workers and owners of land in the Malthusian model. Workers do not
receive all of 𝑦𝑦 as income unless they own the land they use to produce. If competition
determines factor payments, wages 𝑤𝑤 and rents 𝑥𝑥 are equal to the marginal products of
labour 𝑀𝑀𝑃𝑃𝑁𝑁 and land 𝑀𝑀𝑃𝑃𝐿𝐿 .
With a neoclassical production function, the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 is diminishing
in the population 𝑁𝑁. Given a fixed supply of land 𝐿𝐿, a lower population 𝑁𝑁 means the
marginal product of labour is higher, so wages 𝑤𝑤 = 𝑀𝑀𝑃𝑃𝑁𝑁 rise. What about rents 𝑥𝑥 = 𝑀𝑀𝑃𝑃𝐿𝐿 ?
The per worker production function implies 𝑌𝑌 = 𝑧𝑧𝑁𝑁𝑓𝑓 (𝐿𝐿⁄𝑁𝑁), which can be used to obtain
expressions for the marginal products of land and labour using the chain rule.
21
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
𝜕𝜕𝜕𝜕 𝑁𝑁 𝐿𝐿
𝑀𝑀𝑃𝑃𝐿𝐿 = = 𝑧𝑧 𝑓𝑓 ′ � � = 𝑧𝑧𝑧𝑧′(𝑙𝑙)
𝜕𝜕𝜕𝜕 𝑁𝑁 𝑁𝑁
𝜕𝜕𝜕𝜕 𝐿𝐿 𝐿𝐿 𝐿𝐿
𝑀𝑀𝑃𝑃𝑁𝑁 = = 𝑧𝑧𝑧𝑧 � � − 𝑧𝑧𝑧𝑧 2 𝑓𝑓 ′ � � = 𝑧𝑧(𝑓𝑓(𝑙𝑙 ) − 𝑙𝑙𝑙𝑙′(𝑙𝑙))
𝜕𝜕𝜕𝜕 𝑁𝑁 𝑁𝑁 𝑁𝑁
These equations imply 𝑤𝑤 = 𝑦𝑦 − 𝑥𝑥𝑥𝑥, which says that wages are equal to output per worker
minus rent times land used per worker.
Both marginal products and hence wages and rents depend on the relative supply of land to
labour as measured by land-per-worker 𝑙𝑙 = 𝐿𝐿/𝑁𝑁. It can be seen that 𝑀𝑀𝑃𝑃𝐿𝐿 is diminishing in 𝑙𝑙
because 𝑓𝑓 (𝑙𝑙 ) = 𝐹𝐹(𝑙𝑙, 1), so 𝑓𝑓 ′′ (𝑙𝑙 ) < 0, whereas 𝑀𝑀𝑃𝑃𝑁𝑁 is increasing in 𝑙𝑙 because derivative of
𝑓𝑓(𝑙𝑙 ) − 𝑙𝑙𝑙𝑙′(𝑙𝑙) with respect to 𝑙𝑙 is 𝑓𝑓 ′ (𝑙𝑙 ) − 𝑓𝑓 ′ (𝑙𝑙 ) − 𝑙𝑙𝑓𝑓 ′′ (𝑙𝑙 ) = −𝑙𝑙𝑓𝑓 ′′ (𝑙𝑙 ) > 0. These relationships
are depicted in Figure 1.7. As the pandemic increases land per worker 𝑙𝑙, it causes a rise in
𝑀𝑀𝑃𝑃𝑁𝑁 and hence higher wages, but a fall in 𝑀𝑀𝑃𝑃𝐿𝐿 and hence lower rents.
Figure 1.7: Wages and rents with a lower population
22
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
We use a simple static model (meaning there is no saving or borrowing) to study the choice
of labour supply by households. The supply of labour by a household (in units of time, e.g.
hours) is denoted by 𝑁𝑁. Time the household enjoys as leisure is denoted by 𝑙𝑙. ‘Leisure’ is a
catch-all term for anything other than time spent earning wages, so as well as leisure in the
usual sense, it also includes cooking, cleaning and childcare, activities known as ‘home
production’. While home production includes a component of work, the household benefits
from it in the sense that otherwise the household would have to pay for equivalent services
in the market, for example, eating out in restaurants, hiring a cleaner or a childminder.
A crucial constraint is that the household has a fixed amount of time ℎ available in a given
day, week, or year. Time used for work cannot also be used for leisure:
𝑁𝑁 + 𝑙𝑙 = ℎ
By working (not counting home production), the household is paid a wage 𝑤𝑤 per unit of
time, e.g. an hourly wage. This wage is specified in real terms, as with all other variables in
this chapter. If a household works for 𝑁𝑁 hours then total wage income is 𝑤𝑤𝑤𝑤. The ultimate
purpose of work is to use the income to buy goods and services. Consumption of goods and
services (not counting home production) is denoted by 𝐶𝐶.
Some households may also be able to use non-wage income such as dividend income from
owning shares to buy goods and services. The amount of non-wage income is denoted by 𝜋𝜋.
More broadly, considering the household as a family, 𝜋𝜋 could also be interpreted as the
income of an individual’s partner separate from the amount the individual earns direct.
Our analysis must also consider taxes, which influence how much households are able to
spend of their pre-tax incomes. For now, taxes are assumed to take a ‘lump sum’ form: an
amount of tax 𝑇𝑇 that must be paid irrespective of how much a household earns or
consumes, for example, a poll tax. While this simplifies matters, most taxes are not like this
and we will see what difference it makes by considering income taxes and consumption
taxes later. Note that we allow 𝑇𝑇 to be negative, indicating the household receives a
transfer payment from the government rather than being a taxpayer.
Given wage income 𝑤𝑤𝑤𝑤, non-wage income 𝜋𝜋, and taxes 𝑇𝑇, the maximum amount of
consumption affordable to a household is:
𝐶𝐶 = 𝑤𝑤𝑤𝑤 + 𝜋𝜋 − 𝑇𝑇
As this is a static model, there is no role for saving for the future in this budget constraint, so
households will consume their income. Instead, the purpose of the model is to analyse how
much labour households will supply, which affects their income and, hence, their
consumption.
Our analysis proceeds by combining the two constraints on time and spending power. Since
𝑁𝑁 = ℎ − 𝑙𝑙, we can write a combined constraint in terms of the consumption 𝐶𝐶 and leisure 𝑙𝑙
that the household ultimately values:
𝐶𝐶 + 𝑤𝑤𝑤𝑤 = 𝑤𝑤ℎ + 𝜋𝜋 − 𝑇𝑇
23
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
This budget constraint is plotted in Figure 1.8 with leisure 𝑙𝑙 on the horizontal and
consumption 𝐶𝐶 on the vertical axis. Given the time physically available, it is not possible to
have more than ℎ hours of leisure. If a household chose the maximum amount of leisure 𝑙𝑙 =
ℎ this would mean supplying no labour and thus the ability to consume goods would depend
solely on non-wage income after tax 𝜋𝜋 − 𝑇𝑇. As leisure falls below ℎ, hours of labour supplied
increase and each extra hour of labour adds 𝑤𝑤 to wage income, increasing the ability to
consume by 𝑤𝑤. Therefore, the budget constraint is a downward-sloping straight line with
gradient −𝑤𝑤 that passes through the point (ℎ, 𝜋𝜋 − 𝑇𝑇).
Figure 1.8: Constraint on consumption and leisure
Households like both more consumption 𝐶𝐶 and more leisure 𝑙𝑙 but the constraints imply
there is a trade-off between them. To study the optimal choice of leisure and, hence, the
supply of labour, we need to say more about preferences.
We describe the household’s preferences over 𝐶𝐶 and 𝑙𝑙 using indifference curves added to
the diagram with the budget constraint. Indifference curves are downward sloping because
less of one thing the household likes requires more of the other to compensate. Indifference
curves are also assumed to be convex to the origin, as depicted in Figure 1.9. This shape
reflects a dislike of extremes where the household has very little consumption or leisure but
much more of the other.
24
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The shape of the indifference curves can be described in terms of a diminishing marginal
rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 between consumption 𝐶𝐶 and leisure 𝑙𝑙. The marginal rate of
substitution is how much extra of one good a household needs to be given to compensate
for the loss of one unit of another. The gradient of an indifference curve is −𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 , with
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 representing how much extra consumption a household needs to receive to be no
worse off after giving up a unit of leisure by supplying more labour.
Consumption and leisure are also assumed to be normal goods. This means that when
households are better off and able to reach a higher indifference curve, they choose to have
more consumption and more leisure, holding constant the hourly wage 𝑤𝑤. In the diagram,
this means the line joining points on different indifference curves where the tangent lines
have the same gradient is upward sloping.
A household wants to reach the highest indifference curve subject to the constraints. As
shown in Figure 1.10, there are two general cases to consider. First, where it is optimal to
participate in the labour market (𝑙𝑙 < ℎ), in which case the optimal consumption-leisure
choice is where an indifference curve is tangent to the constraint, mathematically, 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 =
𝑤𝑤. The second case is where the household does not find it optimal to participate in the
labour market (𝑙𝑙 = ℎ) and the optimal choice of 𝐶𝐶 and 𝑙𝑙 is at the corner of constraint.
25
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Participation in the labour market is optimal if the marginal rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 at
zero labour supply (𝑙𝑙 = ℎ) is less than the hourly real wage 𝑤𝑤, i.e. the indifference curve
passing through the corner of the constraint is less steep than the budget constraint to the
left of this point.
This logic indicates participation in the labour market is more likely when wages 𝑤𝑤 are high,
which makes the budget constraint steeper. High taxes 𝑇𝑇 or low transfer payments (the
negative of 𝑇𝑇) increase the likelihood of participation, moving the corner (ℎ, 𝜋𝜋 − 𝑇𝑇)
downwards. Similarly, a low level of other income 𝜋𝜋 increases participation, with 𝜋𝜋 being
low because the household has little wealth or an individual’s partner does not have a high
income. Finally, preferences can also matter, with a strong preference for consumption over
leisure (a low marginal rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 ) making participation more likely.
For those choosing to participate, we can also analyse how the number of hours worked
depends on these considerations.
26
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
are. A higher wage makes a household better off and, since consumption and leisure are
both normal goods, there is desire to enjoy more leisure by choosing to work less.
Income and substitution effects are analysed in Figure 1.11. A higher wage makes the
budget constraint steeper, pivoting it around the corner. Formally, the substitution effect
(SE) is found by considering the effects of the steeper budget constraint gradient,
controlling for whether the household is made better off or worse off. Since the higher
wage makes the household better off, the substitution effect can be isolated by also making
a parallel shift downwards of the budget constraint so that it is tangent to the original
indifference curve. This results in a new tangency point north-west of the original tangency
because this is where the indifference curve is steeper. Leisure falls (labour supply
increases) and consumption rises.
Figure 1.11: Income and substitution effects on labour supply
The income effect (IE) is isolated by removing the hypothetical parallel shift of the budget
constraint used to derive the substitution effect. Hence, the income effect results from a
parallel upward shift of the budget constraint in this case, causing a movement on to a
higher indifference curve in a north-east direction. Leisure and consumption both rise, so
labour supply falls.
Overall, combining the substitution effect and income effect to obtain the combined effect,
consumption must rise but leisure may rise or fall. Thus, the effect of wages on labour
supply is ambiguous. In the diagram, income and substitution effects exactly cancel out for
leisure and labour supply but this is a special case. In general, either the substitution effect
or the income effect could dominate. If the substitution effect dominates, leisure falls and
labour supply rises, while if the income effect dominates, leisure rises and labour supply
falls.
27
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
indifference curve passing through that point needs to be steeper than the budget
constraint for non-participation to be optimal, a sufficiently high wage would cause an
individual to supply some labour. This is shown in Figure 1.12.
Figure 1.12: The effects of wages on labour-market participation
Taking 𝜋𝜋 as given, a higher wage 𝑤𝑤 has no offsetting income effect on the participation
decision because a higher wage does not make non-participants better off if they are not
earning any labour income. However, if 𝜋𝜋 is interpreted more broadly as including family
income from a partner who works, in this case higher wages have an income effect on the
household’s labour supply.
28
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
A labour supply curve can be drawn for an individual or for all households together. For
those participating in the labour market, hours worked increase with higher wages if the
substitution effect is larger than the income effect. For such households, the 𝑁𝑁 𝑠𝑠 curve is
upward sloping if the substitution effect dominates the income effect. For those not
participating initially, high wages make participation more likely and there is no offsetting
income effect until some labour is supplied by a household. An upward-sloping labour
supply curve drawn for all households can also represent more participation at higher wages
as well as those already working choosing to supply more hours.
If all households are participating in the labour market, we can think of the labour supply
curve as representing the optimality condition 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 = 𝑤𝑤 with the marginal rate of
substitution between leisure and consumption rising as more labour is supplied because
leisure falls.
The labour supply curve shifts if there is a change in any variable that matters for optimal
labour supply other than the real wage. Lower non-wage income 𝜋𝜋, or higher taxes 𝑇𝑇, would
make a household worse off, reducing demand for leisure as a normal good and increasing
the supply of labour. This would cause 𝑁𝑁 𝑠𝑠 to shift to the right.
A very long-run perspective on labour supply is provided by the 160 years of data for the UK
shown in Figure 1.14. This graph shows time series of real wages, average hours worked per
week for those who have jobs and the fraction of the whole population who have jobs. Over
the 160 years, UK real wages rise by a factor of 20. Hours per worker, though, fall by around
half over this period. A broad measure of labour-market participation, the ratio of workers
to the total population (not adjusting for those of ‘working age’) does not display any clear
trend. One interpretation of this evidence points to the importance of income effects as UK
workers became significantly better off over this period and chose to work fewer hours.
Figure 1.14: UK wages and labour supply in the long run
29
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 = (1 − 𝜏𝜏)𝑤𝑤
A higher tax rate 𝜏𝜏 reduces the after-tax wage (1 − 𝜏𝜏)𝑤𝑤, so the effects of tax are similar to
those of lower wages.
Since the tax rate 𝜏𝜏 affects tax revenue 𝑇𝑇 = 𝜏𝜏𝜏𝜏𝑁𝑁 𝑠𝑠 indirectly through its impact on
behaviour 𝑁𝑁 𝑠𝑠 as well as directly, the relationship between 𝜏𝜏 and 𝑇𝑇 is not always positive.
This leads to the ‘Laffer curve’ relationship between tax rates and revenue shown in Figure
1.15. To understand the Laffer curve, note that a 0 per cent tax rate obviously generates no
revenue. On the other hand, a 100 per cent tax rate implies no incentive to supply labour
because the after-tax wage is zero, so no tax revenue would be obtained in this case as
𝑁𝑁 𝑠𝑠 = 0. This basic logic indicates there is a tax rate somewhere between 0 per cent and 100
per cent where the Laffer curve peaks and tax revenue is maximised. After this point, higher
tax rates would reduce revenue.
Figure 1.15: A Laffer curve
Although the Laffer curve implies that ever higher tax rates eventually result in lower
revenue, it does not give specific guidance at which tax rate 𝜏𝜏 revenue will start to fall as 𝜏𝜏
rises. This is an empirical question.
30
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
31
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠 , so the desired supply of labour is equal to the amount firms want to demand. This
occurs if the real wage 𝑤𝑤 adjusts to 𝑤𝑤 ∗ where the demand and supply curves cross.
Figure 1.16: Labour market equilibrium
Once equilibrium employment 𝑁𝑁 ∗ is known, the amount of goods and services 𝑌𝑌 produced
by firms is determined because all other factors (land, capital) are in fixed supply. This is the
amount firms supply to the goods market. On the demand side, households buy goods for
consumption 𝐶𝐶 and the government to provide public services 𝐺𝐺, so aggregate demand is
𝐶𝐶 + 𝐺𝐺. The government’s fiscal policy sets 𝐺𝐺 and households choose 𝐶𝐶 subject to their
budget constraint when making the consumption-leisure trade-off that underlies labour
supply. Equilibrium of the goods market requires 𝐶𝐶 + 𝐺𝐺 = 𝑌𝑌.
It turns out that another diagram for the goods market is not necessary given that labour-
market equilibrium has already been found. The amount of non-wage income received by
households is 𝜋𝜋 = 𝑌𝑌 − 𝑤𝑤𝑁𝑁 𝑑𝑑 . This is true irrespective of whether households or firms own
factors of production such as land and capital as households will ultimately receive these
factor payments as the owners of firms.
The household budget constraint is 𝐶𝐶 = 𝑤𝑤𝑁𝑁 𝑠𝑠 + 𝜋𝜋 − 𝑇𝑇, and substituting the government
budget constraint 𝐺𝐺 = 𝑇𝑇 and the equation for 𝜋𝜋, it follows that 𝐶𝐶 = 𝑌𝑌 + 𝑤𝑤(𝑁𝑁 𝑠𝑠 − 𝑁𝑁 𝑑𝑑 ) − 𝐺𝐺.
Writing this as 𝑌𝑌 − (𝐶𝐶 + 𝐺𝐺 ) = 𝑤𝑤(𝑁𝑁 𝑑𝑑 − 𝑁𝑁 𝑠𝑠 ), labour-market equilibrium 𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠 implies
𝐶𝐶 + 𝐺𝐺 = 𝑌𝑌, so the goods market must also be in equilibrium.
32
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
produce with that unit of time. A basic function of an economy is to allow households to
turn their time as work into income that can be spent on goods and services. Here, the
market economy performs this task as well as possible and we say that the market
equilibrium is economically efficient, or Pareto efficient.
Note that we say nothing here about whether the government’s choice of public
expenditure 𝐺𝐺 is optimal and, hence, whether the allocation of resources between private
spending 𝐶𝐶 and public spending 𝐺𝐺 is optimal. The desirability of a particular amount of
public expenditure is taken as given here.
A more careful way to reach the efficiency result is to imagine a hypothetical world where a
government can control all economic decisions (consumption, employment, etc.) without
the need for markets. The government is said to be the ‘social planner’ in this case. Assume
the government acts benevolently with the aim of making the representative household as
well off as possible.
The economy’s ability to produce goods and services is still limited by the production
function 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁), where 𝐾𝐾 is denotes a factor of production other than labour, for
example, capital. The supply of this factor is fixed here. Assume that a particular level of
public expenditure 𝐺𝐺 is desirable. The government faces a resource constraint 𝐶𝐶 + 𝐺𝐺 = 𝑌𝑌
and a constraint 𝑙𝑙 + 𝑁𝑁 = ℎ on households’ time. Combining these and the production
function leads to a single constraint 𝐶𝐶 = 𝑧𝑧𝑧𝑧 (𝐾𝐾, ℎ − 𝑙𝑙 ) − 𝐺𝐺 linking consumption 𝐶𝐶 and
leisure 𝑙𝑙, the two things households ultimately care about. The constraint and the
representative household’s indifference curves are illustrated in Figure 1.17.
The ability to raise consumption by reducing leisure and setting households to work longer
is found by differentiating the constraint with respect to leisure 𝑙𝑙:
𝜕𝜕𝜕𝜕
= −𝑧𝑧𝐹𝐹𝑁𝑁 (𝐾𝐾, ℎ − 𝑙𝑙 ) = −𝑀𝑀𝑃𝑃𝑁𝑁
𝜕𝜕𝜕𝜕
The gradient is the negative of the marginal product of labour, so the constraint is
downward sloping and becomes steeper as 𝑙𝑙 rises (because 𝑁𝑁 falls, increasing 𝑀𝑀𝑃𝑃𝑁𝑁 ). To
make representative-household utility as high as possible subject to the constraint, the
social planner would choose a combination (𝑙𝑙, 𝐶𝐶) such that 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 = 𝑀𝑀𝑃𝑃𝑁𝑁 , where an
indifference curve is tangent to the constraint.
33
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Since the market equilibrium features 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 = 𝑀𝑀𝑃𝑃𝑁𝑁 and satisfies the same constraints
faced by the government, the market equilibrium and the benevolent planner’s choice
coincide. It is not possible to make the representative household better off than in the
market equilibrium, so the equilibrium is efficient.
More generally, without a representative household, the market equilibrium is said to be
Pareto-efficient if no one can be made better off without making someone else worse off. It
is a general result (the first welfare theorem) that an equilibrium is Pareto efficient if:
Box 1.4: Should wages or rents be taxed to pay for public expenditure?
This application addresses the question of how a government should best pay for a given
amount of public services 𝐺𝐺 it needs to provide. Here we assume that lump-sum taxes are
not available. Instead, the government is restricted to taxing different types of income.
Assume firms produce output of goods and services using land 𝐿𝐿 and labour 𝑁𝑁. The
production function is 𝑌𝑌 = 𝑎𝑎𝑎𝑎 + 𝑏𝑏𝑏𝑏, which is linear in both 𝐿𝐿 and 𝑁𝑁. This is not a
neoclassical production function but it is useful for illustration and the arguments
developed here apply more generally. The marginal products of labour and land are
𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑎𝑎 and 𝑀𝑀𝑃𝑃𝐿𝐿 = 𝑏𝑏, where 𝑎𝑎 and 𝑏𝑏 are positive constants. With firms hiring labour
and renting land in competitive factor markets, the pre-tax wage 𝑤𝑤 and rent 𝑥𝑥 must be
equal to these constant marginal products:
𝑤𝑤 = 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑎𝑎 , 𝑥𝑥 = 𝑀𝑀𝑃𝑃𝐿𝐿 = 𝑏𝑏
34
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The economy has a fixed supply of land 𝐿𝐿. The supply of labour 𝑁𝑁 is chosen by a
representative household that owns an equal share of the economy’s land. The
government can only levy proportional taxes on incomes, setting a tax rate 𝜏𝜏𝑤𝑤 on wage
income and a tax rate 𝜏𝜏𝑥𝑥 on rental income. The total amount of tax revenue raised is
𝜏𝜏𝑤𝑤 𝑤𝑤𝑤𝑤 + 𝜏𝜏𝑥𝑥 𝑥𝑥𝑥𝑥. With 𝑤𝑤 and 𝑥𝑥 equal to the constants 𝑎𝑎 and 𝑏𝑏, the budget constraint of
the government is:
𝑎𝑎𝜏𝜏𝑤𝑤 𝑁𝑁 + 𝑏𝑏𝜏𝜏𝑥𝑥 𝐿𝐿 = 𝐺𝐺
The total amount of income (wages and rents) the representative household receives
after tax is (1 − 𝜏𝜏𝑤𝑤 )𝑤𝑤𝑤𝑤 + (1 − 𝜏𝜏𝑥𝑥 )𝑥𝑥𝑥𝑥 and the budget constraint is 𝐶𝐶 = 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )𝑁𝑁 +
𝑏𝑏(1 − 𝜏𝜏𝑥𝑥 )𝐿𝐿. With a given amount of time ℎ available, labour supply is equal to 𝑁𝑁 = ℎ −
𝑙𝑙, where 𝑙𝑙 is the choice of leisure. The combined constraint faced by the household is:
𝐶𝐶 + 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )𝑙𝑙 = 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )ℎ + 𝑏𝑏(1 − 𝜏𝜏𝑥𝑥 )𝐿𝐿
1.6.4 Taxing wages but not rents
Suppose initially that only wage income is taxed, so 𝜏𝜏𝑥𝑥 = 0. The government’s budget
constraint simplifies to 𝑎𝑎𝑎𝑎𝑤𝑤 𝑁𝑁 = 𝐺𝐺 and the household’s budget constraint is 𝐶𝐶 +
𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )𝑙𝑙 = 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )ℎ + 𝑏𝑏𝑏𝑏. The household chooses 𝐶𝐶 and 𝑙𝑙 at the tangency of the
budget constraint and an indifference curve, i.e. where 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 = 𝑎𝑎 (1 − 𝜏𝜏𝑤𝑤 ). The
gradient of the budget constraint is the after-tax wage 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 ). Figure 1.18 depicts
this tangency point (𝑙𝑙1∗ , 𝐶𝐶1∗ ) as the economy’s initial equilibrium.
Figure 1.18: Taxes on rents instead of taxes on wages
35
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The household budget constraint in this case is 𝐶𝐶 + 𝑎𝑎𝑎𝑎 = 𝑎𝑎ℎ + 𝑏𝑏(1 − 𝜏𝜏𝑥𝑥 )𝐿𝐿. This has
gradient −𝑎𝑎 instead of −𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 ), so the budget constraint becomes steeper as the after-
tax wage rises. The budget constraint also shifts down at 𝑙𝑙 = ℎ, reflecting the reduction in
non-wage income after tax.
A crucial observation is that the choice of 𝐶𝐶1∗ and 𝑙𝑙1∗ under the previous tax system remains
affordable under the new one. This is because 𝐶𝐶1∗ and 𝑙𝑙1∗ satisfy 𝑎𝑎𝜏𝜏𝑤𝑤 (ℎ − 𝑙𝑙1∗ ) = 𝑎𝑎𝜏𝜏𝑤𝑤 𝑁𝑁1∗ =
𝐺𝐺 = 𝑏𝑏𝜏𝜏𝑥𝑥 𝐿𝐿 and 𝐶𝐶1∗ + 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )𝑙𝑙1∗ = 𝑎𝑎(1 − 𝜏𝜏𝑤𝑤 )ℎ + 𝑏𝑏𝑏𝑏, so they also consistent with the new
budget constraint 𝐶𝐶1∗ + 𝑎𝑎𝑙𝑙1∗ = 𝑎𝑎ℎ + 𝑏𝑏(1 − 𝜏𝜏𝑥𝑥 )𝐿𝐿. Therefore, the original choice of leisure
and consumption remains on the new budget constraint. Since the budget constraint is now
steeper, it must cut the indifference curve at this point because there is a tangency with a
less steep budget line there.
It is possible to reach a higher indifference curve under the new tax system by choosing less
leisure, a higher labour supply, and more consumption. The analysis indicates a switch from
taxing wages to taxing rents makes the representative household better off. This is because
households’ labour supply choice is distorted by a proportional income tax that
disincentivises work. On the other hand, the supply of land is inelastic and does not respond
to tax. The removal of this distortion to labour supply and production allows the
representative household to reach a higher indifference curve.
As noted, in practice, rental income may not be high enough to shift tax burden completely
away from wages (which would require total pre-tax rents exceed total public expenditure,
𝑏𝑏𝑏𝑏 > 𝐺𝐺). By having a representative household, the analysis also ignores the distributional
consequences of such shifts in the tax system.
Capital 𝐾𝐾 used for production is a stock variable, not a flow. Adding new capital to the
capital stock is a flow variable known as investment 𝐼𝐼. While capital is not immediately used
up in producing other goods, it does not last forever, in other words, there is depreciation.
Depreciation is the loss of capital from wear and tear or obsolescence, or maintenance costs
incurred to avoid this loss. We assume depreciation of capital takes place at a constant rate
𝑑𝑑 over time. The following equation describes the process of capital accumulation:
𝐾𝐾 ′ = (1 − 𝑑𝑑 )𝐾𝐾 + 𝐼𝐼
36
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Next year’s capital stock 𝐾𝐾′ is equal to capital left over (1 − 𝑑𝑑 )𝐾𝐾 after depreciation from the
current time plus investment 𝐼𝐼.
Focusing on capital and labour as the relevant factors of production and ignoring land, the
production function is:
𝑌𝑌 = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝑁𝑁)
GDP is 𝑌𝑌, the labour force is 𝑁𝑁 (the number of workers), the capital stock is 𝐾𝐾, and total
factor productivity (TFP) is 𝑧𝑧. The production function 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) is assumed to be
neoclassical, and the most important of the neoclassical assumptions here is the diminishing
marginal product of capital. The Cobb-Douglas production function 𝑌𝑌 = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 is one
such example of a neoclassical production function. The value of the parameter 𝛼𝛼 could be
estimated using the capital share of income.
How is the capital stock 𝐾𝐾 measured? There are two approaches. First, the perpetual
inventory method. The change in the capital stock from one year to the next can be
calculated using 𝐾𝐾 ′ − 𝐾𝐾 = 𝐼𝐼 − 𝑑𝑑𝑑𝑑. Hence, given an estimate of 𝐾𝐾, the capital stock 𝐾𝐾′ can
be estimated by adding investment 𝐼𝐼 from the national accounts and subtracting an
estimate of the depreciation rate 𝑑𝑑 multiplied by 𝐾𝐾. Starting from some conjectured initial
value, this method can be applied iteratively to construct a time series of capital-stock
estimates.
A second method is based on imputation from capital income. Suppose we have an estimate
of the gross percentage return on capital 𝑅𝑅. Then given a measure of GDP 𝑌𝑌 and the capital
share of GDP 𝛼𝛼, the implied capital stock is 𝐾𝐾 = 𝛼𝛼𝛼𝛼⁄𝑅𝑅 .
In studying capital accumulation, we will mainly be concerned with the output per worker
𝑦𝑦 = 𝑌𝑌/𝑁𝑁, not total output 𝑌𝑌. Since the production function 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) has constant
returns to scale, output per worker is given by:
𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) 𝐾𝐾 𝑁𝑁
𝑦𝑦 = = 𝑧𝑧𝑧𝑧 � , � = 𝑧𝑧𝑧𝑧 (𝑘𝑘, 1) = 𝑧𝑧𝑧𝑧(𝑘𝑘)
𝑁𝑁 𝑁𝑁 𝑁𝑁
This shows that 𝑦𝑦 depends on capital per worker 𝑘𝑘 = 𝐾𝐾/𝑁𝑁 and TFP 𝑧𝑧. The equation 𝑦𝑦 =
𝑧𝑧𝑧𝑧(𝑘𝑘) is called the ‘per worker production function’, where the function 𝑓𝑓 (𝑘𝑘) is defined by
𝑓𝑓 (𝑘𝑘) = 𝐹𝐹(𝑘𝑘, 1). It is an increasing and concave function as shown in Figure 1.19 because
𝑧𝑧𝑓𝑓′(𝑘𝑘) is the marginal product of capital.
37
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
38
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The per worker production function 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘) implies output per worker depends on the
ratio of 𝐾𝐾 to 𝑁𝑁. As 𝑦𝑦 is the ultimate variable of interest, we only need to keep track of the
ratio of capital per worker 𝑘𝑘 = 𝐾𝐾/𝑁𝑁 over time. By combining the equations for 𝐾𝐾 ′ and 𝑁𝑁 ′ :
𝐾𝐾 ′ (1 − 𝑑𝑑 )𝐾𝐾 + 𝑠𝑠𝑠𝑠 (1 − 𝑑𝑑 )𝑘𝑘 + 𝑠𝑠𝑠𝑠
𝑘𝑘 ′ = = =
𝑁𝑁′ (1 + 𝑛𝑛)𝑁𝑁 1 + 𝑛𝑛
Using 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘), the right-hand side of the equation depends on capital per worker 𝑘𝑘 only:
(1 − 𝑑𝑑 )𝑘𝑘 + 𝑠𝑠𝑠𝑠𝑠𝑠(𝑘𝑘)
𝑘𝑘 ′ =
1 + 𝑛𝑛
Subtracting 𝑘𝑘 from both sides leads to an equation for the change in 𝑘𝑘 over time:
𝑠𝑠𝑠𝑠𝑠𝑠 (𝑘𝑘) − (𝑑𝑑 + 𝑛𝑛)𝑘𝑘
𝑘𝑘 ′ − 𝑘𝑘 =
1 + 𝑛𝑛
We conclude from this equation that changes in the amount of capital accumulated per
worker are explained by the difference between two terms. First, 𝑠𝑠𝑠𝑠𝑠𝑠 (𝑘𝑘), the amount of
saving and hence of investment per worker, which is the saving rate 𝑠𝑠 multiplied by the per-
worker production function 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘). Second, (𝑑𝑑 + 𝑛𝑛)𝑘𝑘, the amount of investment per
worker needed to sustain the same level of capital per worker next year. This interpretation
comes from a fraction 𝑑𝑑 of all capital depreciating, so an amount of capital per worker 𝑑𝑑𝑑𝑑
must be replaced to keep capital per worker unchanged. Furthermore, the number of
workers increases by a percentage 𝑛𝑛 each year, so if existing workers use capital 𝑘𝑘 each,
there needs to be investment 𝑛𝑛𝑛𝑛 per existing worker to give future workers the same capital
𝑘𝑘 each as current workers.
• The per worker production function 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘), which is an increasing and concave
function of 𝑘𝑘.
• The ‘saving line’ 𝑠𝑠𝑠𝑠𝑠𝑠(𝑘𝑘), a scaled-down version of 𝑧𝑧𝑧𝑧(𝑘𝑘) because of 0 < 𝑠𝑠 < 1 and,
hence, an increasing and concave function of 𝑘𝑘.
• The ‘effective depreciation line’ (𝑑𝑑 + 𝑛𝑛)𝑘𝑘, an upward-sloping straight line with
gradient given by the effective depreciation rate of capital per worker: the sum of
the depreciation rate plus the growth rate of the labour force.
Starting from a low level of capital per worker, the saving line is above the effective
depreciation line because the production function and saving line are initially very steep,
reflecting a high marginal product of capital. This means 𝑘𝑘 ′ − 𝑘𝑘 > 0, so capital per worker 𝑘𝑘
is increasing over time. This leads to higher output per worker 𝑦𝑦 since 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘), with the
economy moving along the per worker production function over time. Starting from this
position, by adding more capital per worker, workers’ productivity can be increased and the
economy experiences growth in output per worker coming from capital accumulation.
39
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Does the basic Solow model always have such a steady state where growth in per worker
incomes grinds to a halt? If the production function is neoclassical, the gradient of 𝑓𝑓(𝑘𝑘) is
extremely large for 𝑘𝑘 close to zero but declines as 𝑘𝑘 increases and approaches zero as 𝑘𝑘
becomes large. The saving line 𝑠𝑠𝑠𝑠𝑠𝑠(𝑘𝑘), as a multiple of 𝑓𝑓(𝑘𝑘), shares the same properties.
On the other hand, the gradient of the effective depreciation line is constant. Consequently,
there exists a positive steady state 𝑘𝑘 ∗ (and only one) where the saving and effective
depreciation lines cross.
Moreover, the saving line is above the effective depreciation line for 𝑘𝑘 below 𝑘𝑘 ∗ , which
means that 𝑘𝑘 rises over time when it is below 𝑘𝑘 ∗ (and would fall over time if above 𝑘𝑘 ∗ ). The
economy thus converges to 𝑘𝑘 ∗ in the long run, so this point of stagnation is eventually
reached.
The Solow model also always has a steady state with zero capital per worker because some
capital is essential for production with a neoclassical production function. However, the
economy would diverge from this steady state no matter how close it gets, so this does not
need to be taken seriously and is ignored in what follows.
In summary, while the Solow model can explain how a country can become richer starting
from a low level of capital accumulation, it cannot explain long-run growth. Intuitively,
returns to capital are high when capital is initially scarce, so investment leads to large
40
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
increases in income. After that, diminishing returns to capital means that further investment
has lower returns and, with less extra output generated per unit of capital while the
maintenance cost of capital increases proportionately, a point is reached where the capital
stock cannot rise any further.
This process of rapid capital accumulation is what the Solow model predicts for an
economy with a high saving rate and a low initial level of capital per worker. The model
predicts this would lead to a significant increase in income per worker and income per
person as the economy moves along the per worker production function. Data on real
income per person are shown in Figure 1.23 and we see that there is a dramatic
improvement in prosperity.
41
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
Another prediction of the Solow model is that the rate of progress, specifically, the growth
rate of income per person would gradually slow down over time, even if the economy
maintained very high rates of saving. It is not easy to read growth rates from a graph of
income plotted on an ordinary scale against time. By taking logarithms of the data, or
using a logarithmic scale, the gradient of the time series is informative about the growth
rate.
Figure 1.23: Singapore real GDP per person
Figure 1.24 plots the natural logarithm of real income per person against time. Here we
see that the gradient of the graph tends to decrease over time, indicating that economic
growth is slowing down in Singapore. This happens even though the saving rate does not
fall over this period but actually rises. Such a growth slowdown as capital per worker rises
is in line with the Solow model’s prediction, which is a consequence of diminishing returns
to capital.
42
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
The growth slowdown can be seen more directly in Figure 1.25, which plots growth rates of
real income per person averaged over each decade.
Figure 1.25: Singapore average growth rates by decade
While these predictions are consistent with the basic Solow model, we do not necessarily see
evidence that growth rates of income per person are falling all the way to zero. Even in
developed economies that have been experiencing growth for more than a century, we still
typically see growth being positive. In contrast, the basic Solow model predicts that the long-
run growth rate of income per person is zero. The inability to explain long-run growth is one
of the major weaknesses of the Solow model and we will return to this issue in Chapter 2.
43
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
What can we say about real returns empirically over time? It is not easy to measure 𝑟𝑟 in
a consistent way over very long periods. Figure 1.27 plots a time series of real interest
rates on UK government bonds over a 300-year period. This real bond yield is taken as a
proxy for the real return earned by savers (even though government bonds are
44
EC2065 Macroeconomics | Chapter 1: The supply side of the economy
not included in the Solow model, if they were, savers would have a choice of holding
bonds or capital, so the returns on the two assets would be linked).
Real interest rates in the UK have fluctuated over a wide range during those three
centuries, sometimes being as high as 8 per cent and sometimes turning negative.
Looking at the picture as a whole, it appears there is a moderate downward trend in
real interest rates over time.
Figure 1.27: UK real interest rates in the long run
45
EC2065 Macroeconomics | Chapter 2: Economic growth
Essential reading
• Williamson, Chapters 7 and 8.
46
EC2065 Macroeconomics | Chapter 2: Economic growth
47
EC2065 Macroeconomics | Chapter 2: Economic growth
Some countries have experienced much faster growth. In the ‘Asian tiger’ economies,
growth rates were often above 5 per cent in the 1960–90 period. This implies income
doubles in less than 15 years. Other countries have experienced even faster growth rates.
China grows at a rate of 6.3 per cent on average between 1980 and 2000, and even more
rapidly in the following decade. Figure 2.3 shows this led China’s income per person, which
in 1980 was only 5 per cent of the level in the USA, to rise to more than 20 per cent of the
US level by 2010. Income per person in India was also close to 5 per cent of the US level in
1980 but, owing to a lower growth rate than China’s, it rises only to 10 per cent of the USA
by 2016.
Figure 2.3: Why growth matters
48
EC2065 Macroeconomics | Chapter 2: Economic growth
For much of the last 2000 years, the graphs of real GDP per person over time are close to
flat lines as economies stagnated. Only in the last 300 years do we see sustained economic
growth in some countries – modern economic growth – with others only beginning to see
significant growth in the last 40–50 years.
Although GDP per person stagnates until relatively recently, there is sustained population
growth in some parts of the world over a much longer period of time. Figure 2.5 shows plots
of estimated populations over time for a number of countries. This evidence is in line with a
version of the Malthusian model from Section 1.3 where there are improvements in
technology. This technological progress does not lead to sustained rises in living standards
but does increase the population.
Figure 2.5: Rising world population
49
EC2065 Macroeconomics | Chapter 2: Economic growth
Figure 2.6 shows the world income distribution across countries in 2017. Average incomes
per person are expressed in comparable units across countries by adjusting for purchasing
power parity and converting into a US dollar equivalent. Ordering countries by their income
levels, real GDP per person is plotted against the percentiles of the world population
(ignoring any income differences within countries). We see that much of the world
population lives in countries that are far below the income levels achieved in advanced
economies.
Figure 2.6: World income distribution across countries
2.1.5 Convergence
Given the large income differences that have arisen between countries, an important
question is whether poor countries will remain much poorer than rich countries in the
future, or whether they will catch up. For poorer countries to close the gap with richer
countries, the poor must grow faster than the rich. If this happens, we say that those
countries are converging.
50
EC2065 Macroeconomics | Chapter 2: Economic growth
suggest convergence among some groups of countries but certainly not worldwide. This
means there is evidence of conditional convergence, but not absolute convergence.
Figure 2.7 is a scatterplot of the annual average growth rates of real GDP per person
achieved by a large number of countries over the period 1960–2017 against their initial
levels of real GDP per person in 1960 (measured in comparable 2017 US dollars). If there
were absolute convergence, we would expect to see a clear negative relationship between
initial income levels and subsequent growth rates as poorer countries grow faster than
richer countries. However, this is not the case. While some poorer countries grow faster,
there are many examples of poorer countries that grow at slower rates than richer
countries. Overall, the correlation between growth rates and initial incomes is close to zero.
Focusing on a narrower set of countries, Figure 2.8 restricts the sample to OECD (mainly
Europe, North America and Australasia) and East Asian countries. Now, there is a clear
negative relationship between growth rates and initial income levels, providing evidence of
convergence within this group. To the extent that these countries can be seen as having
similar fundamentals, this indicates that conditional convergence is taking place. Observe
that the growth rates in East Asian countries are even higher than the relationship between
growth and initial incomes for the OECD would suggest. Restricting attention to a narrower
group of countries would provide even clearer evidence of conditional convergence within
that group.
Figure 2.7: Limited evidence of convergence worldwide
51
EC2065 Macroeconomics | Chapter 2: Economic growth
One characteristic of some groups of countries is that they save and invest a greater share
of their GDP than the global average. For example, high levels of saving and investment are
a feature of many East Asian countries. Figure 2.9 is a scatterplot of countries’ investment
shares of GDP and their levels of real GDP per person. While the relationship is far from
perfect, overall, we can say that investment rates and levels of real GDP per person are
positively correlated. Differences in the amount of resources countries devote to capital
accumulation is thus one fundamental reason why they might not converge to the same
level of income per person in the long run.
Figure 2.9: Investment rates and income
52
EC2065 Macroeconomics | Chapter 2: Economic growth
we will first explore the implications of the Solow model for the cross-section of countries.
In particular, does the Solow model help in understanding the large differences in growth
rates and levels of income per person across countries?
In the cross-section of countries, the Solow model makes two key predictions. First, if
countries share the same fundamentals such as technologies and saving rates, then they
should converge to the same level of output per worker in the long run, even if they start
from different initial conditions. This means that controlling for a group of countries with
the same fundamentals, the initially poorer countries in the group should grow faster than
the initially richer countries. The Solow model therefore predicts conditional convergence.
The second key prediction is that if saving rates differ across countries, then there will be
differences in output per worker even in the long run. This means that the Solow model
does not predict absolute convergence (except in the unrealistic case where every country
in the world shares the same fundamentals).
We now consider the logic for the first prediction. If countries share the same fundamentals,
then they have the same lines in the Solow model diagram introduced in Section 1.8. A
version of this diagram is reproduced in Figure 2.10. There is a per worker production
function 𝑧𝑧𝑧𝑧(𝑘𝑘), a saving line 𝑠𝑠𝑠𝑠𝑠𝑠(𝑘𝑘), and an effective depreciation line (𝑑𝑑 + 𝑛𝑛)𝑘𝑘. The
fundamentals are the saving rate 𝑠𝑠, the population growth rate 𝑛𝑛, the level of total factor
productivity (TFP) 𝑧𝑧 and the per worker production function 𝑓𝑓(𝑘𝑘) itself.
As the fundamentals are the same for the countries we are considering, all have the same
per worker production function, saving line and effective depreciation line. This implies they
share the same steady state for 𝑘𝑘 and 𝑦𝑦, which is found where the saving line intersects the
depreciation line.
Figure 2.10: Conditional convergence
With the fundamentals being the same, the only differences between countries are the
amounts of capital per worker they have accumulated owing to different histories.
Specifically, those countries that have accumulated less capital begin poorer because a
lower level of 𝑘𝑘 implies less 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘) with 𝑧𝑧 being the same. The more 𝑘𝑘 is below the
53
EC2065 Macroeconomics | Chapter 2: Economic growth
steady state 𝑘𝑘 ∗ , the larger the gap between the saving and effective depreciation lines
relative to the level of 𝑘𝑘. Since this gap determines the change in capital per worker over
time and, hence, output per worker, poorer countries grow faster than richer countries. All
end up converging to the same steady-state income level in the long run.
Now suppose countries are identical in all fundamentals except for their saving rates 𝑠𝑠.
Different levels of 𝑠𝑠 imply different saving lines 𝑠𝑠𝑠𝑠𝑠𝑠(𝑘𝑘) and different saving lines result in
different steady-state levels of capital per worker, hence, output per worker. This is
illustrated in Figure 2.11. A higher saving rate allows a higher stock of capital per worker to
be sustained for the same production technology. With more capital, output per worker is
higher, although not necessarily consumption (see the analysis of the golden rule in Section
2.5). Long-run growth is always zero and is not affected by the saving rate.
Figure 2.11: Differences in saving rates
The logic of the second prediction of the Solow model is that if countries are converging to
different steady states, there is no presumption that starting from a lower level of income
per worker leads to faster subsequent growth. A poor country with a low saving rate might
be closer to its steady state than a richer one with a high saving rate, so the poorer country
grows more slowly. The countries might thus move further apart, indicating a failure of
absolute convergence. However, given the difference in saving rates, the ratio of incomes
across the two countries must stop widening further in the long run.
Box 2.1: Can the Solow model explain large income differences
across countries?
In principle, differences in saving rates and the resulting levels of capital
accumulation in the Solow model could explain why some countries are rich and
others are poor. How large are the differences in income levels predicted by the
Solow model for realistic differences in saving rates across countries? We will focus
on the income differences that will prevail in the long run, i.e. where economies
have reached their steady-state levels of capital per worker.
54
EC2065 Macroeconomics | Chapter 2: Economic growth
With this reasonable parameterisation of the Solow model, very large difference in
rates of saving and investment lead only to modest difference in income levels. This
means the Solow model has a very limited capacity to explain the income
55
EC2065 Macroeconomics | Chapter 2: Economic growth
differences of an order of magnitude of 10–20 between rich and poor countries seen in
Figure 2.6. There must be some other cause of these large income differences, perhaps
total factor productivity 𝑧𝑧.
Box 2.2: How long does convergence to the steady state take in the
Solow model?
A key prediction of the basic Solow model is that there is no growth in income per
person in the long run, i.e. when the economy has converged to its steady-state value of
capital per worker. The relevance of this prediction depends on how long it takes to get
close to the steady state – if this were to take hundreds of years then the basic Solow
model would have the ability to explain economic growth over quite long periods of
time. This exercise presents an example to illustrate the likely timeframe for
convergence to the steady state in the Solow model.
We assume the production function is 𝑌𝑌 = 𝑧𝑧√𝐾𝐾 √𝑁𝑁, which is a Cobb-Douglas form with
𝛼𝛼 = 1/2, implying that the capital and labour income shares are 50 per cent each if
markets are competitive. This is larger than the typical value of 𝛼𝛼 = 1/3, and the use of
a larger 𝛼𝛼 turns out to slow down convergence to the steady state. The per-worker
production function in this case is 𝑦𝑦 = �𝑧𝑧√𝐾𝐾 √𝑁𝑁�⁄𝑁𝑁 = 𝑧𝑧�𝐾𝐾⁄𝑁𝑁 = 𝑧𝑧√𝑘𝑘. The marginal
product of capital can be obtained by differentiating 𝑦𝑦 with respect to 𝑘𝑘, thus 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝜕𝜕𝜕𝜕⁄𝜕𝜕𝜕𝜕 = 𝑧𝑧⁄�2√𝑘𝑘� = 𝑦𝑦⁄(2𝑘𝑘). The steady state for capital per worker 𝑘𝑘 ∗ is the solution
of the equation 𝑠𝑠𝑦𝑦 ∗ = (𝑑𝑑 + 𝑛𝑛)𝑘𝑘 ∗ :
𝑠𝑠𝑠𝑠 𝑠𝑠𝑠𝑠 2
𝑠𝑠𝑠𝑠√𝑘𝑘 ∗ = (𝑑𝑑 + 𝑛𝑛)𝑘𝑘∗ ⇒ √𝑘𝑘 ∗ = ⇒ 𝑘𝑘 ∗ = � �
𝑑𝑑 + 𝑛𝑛 𝑑𝑑 + 𝑛𝑛
The steady state for output per worker 𝑦𝑦 ∗ is then found using 𝑦𝑦 ∗ = 𝑧𝑧√𝑘𝑘 ∗ :
𝑠𝑠𝑠𝑠 𝑠𝑠𝑧𝑧 2
𝑦𝑦 ∗ = 𝑧𝑧 � �=
𝑑𝑑 + 𝑛𝑛 𝑑𝑑 + 𝑛𝑛
Since the marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 is the effect on output per worker of a
marginal increase in capital per worker, the change in output 𝑦𝑦 ′ − 𝑦𝑦 is approximately
equal to 𝑀𝑀𝑃𝑃𝐾𝐾 multiplied by 𝑘𝑘 ′ − 𝑘𝑘. Hence, using 𝑘𝑘 ′ − 𝑘𝑘 = (𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)𝑘𝑘)⁄(1 + 𝑛𝑛):
′
𝑦𝑦 ′ 𝑦𝑦 𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)𝑘𝑘 𝑠𝑠𝑦𝑦 2 ⁄𝑘𝑘 − (𝑑𝑑 + 𝑛𝑛)𝑦𝑦
𝑦𝑦 − 𝑦𝑦 ≅ (𝑘𝑘 − 𝑘𝑘) = � �=
2𝑘𝑘 2𝑘𝑘 1 + 𝑛𝑛 2(1 + 𝑛𝑛)
56
EC2065 Macroeconomics | Chapter 2: Economic growth
Recalling that the steady state is 𝑦𝑦 ∗ = 𝑠𝑠𝑧𝑧 2⁄(𝑑𝑑 + 𝑛𝑛), the change in income per
person is 𝑦𝑦 ′ − 𝑦𝑦 ≅ ((𝑑𝑑 + 𝑛𝑛)⁄(2(1 + 𝑛𝑛)))(𝑦𝑦 ∗ − 𝑦𝑦). The gap between 𝑦𝑦 and its
steady-state value 𝑦𝑦 ∗ is 𝑦𝑦 ∗ − 𝑦𝑦, and the change in this gap over time is (𝑦𝑦 ∗ − 𝑦𝑦 ′ ) −
(𝑦𝑦 ∗ − 𝑦𝑦) = 𝑦𝑦 − 𝑦𝑦 ′ , so:
(𝑑𝑑 + 𝑛𝑛) ∗
(𝑦𝑦 ∗ − 𝑦𝑦 ′ ) − (𝑦𝑦 ∗ − 𝑦𝑦) ≅ − (𝑦𝑦 − 𝑦𝑦)
2(1 + 𝑛𝑛)
Therefore, the dynamics of the gap to steady state 𝑦𝑦 ∗ − 𝑦𝑦 are determined by:
(𝑑𝑑 + 𝑛𝑛)
(𝑦𝑦 ∗ − 𝑦𝑦 ′ ) ≅ �1 − � (𝑦𝑦 ∗ − 𝑦𝑦)
2(1 + 𝑛𝑛)
Taking 𝑑𝑑 ≈ 9 per cent and 𝑛𝑛 ≈ 1 per cent as reasonable values of the depreciation
rate of capital and the population growth rate results in (𝑦𝑦 ∗ − 𝑦𝑦 ′ ) ≈ 0.95(𝑦𝑦 ∗ − 𝑦𝑦).
This means 5 per cent of the gap between 𝑦𝑦 and its steady state 𝑦𝑦 ∗ is closed each
year. The rule-of-70 then implies approximately half of the gap closed in 70⁄5 = 14
years and three quarters of the gap in approximately 28 years. With only a small gap
remaining after four decades, the zero steady-state growth prediction of the Solow
model is reached after an economy has been accumulating capital for a relatively
short space of time.
57
EC2065 Macroeconomics | Chapter 2: Economic growth
58
EC2065 Macroeconomics | Chapter 2: Economic growth
The approximation in this equation uses that 𝑛𝑛𝑛𝑛 is highly likely to be small relative to 𝑛𝑛 and
𝑔𝑔 themselves. The resulting equation has the same form as that for 𝑘𝑘 ′ − 𝑘𝑘 in the basic
Solow model: the numerator is the difference between a ‘saving line’ 𝑠𝑠𝑠𝑠(𝑘𝑘𝑒𝑒 ) and an
‘effective depreciation line’ (𝑑𝑑 + 𝑛𝑛 + 𝑔𝑔)𝑘𝑘𝑒𝑒 . These lines have the same shapes as in the basic
Solow model because 𝑓𝑓(𝑘𝑘𝑒𝑒 ) is also a concave function here, and (𝑑𝑑 + 𝑛𝑛 + 𝑔𝑔)𝑘𝑘𝑒𝑒 is linear.
Capital per effective worker 𝑘𝑘𝑒𝑒 is rising when the saving line is above the depreciation line,
and is at a steady state where the two lines cross. Once 𝑘𝑘𝑒𝑒 is known, output per effective
worker is found using the per-effective-worker production function 𝑦𝑦𝑒𝑒 = 𝑓𝑓(𝑘𝑘𝑒𝑒 ). The
equivalent of the usual Solow model diagram here is shown in Figure 2.13.
Figure 2.13: Modified Solow diagram
The figure reveals there is a steady state for capital and output per effective worker 𝑘𝑘𝑒𝑒 and
𝑦𝑦𝑒𝑒 , and the economy will converge to this steady state in the long run using the same logic
as in the basic Solow model. But this does not mean that there is no economic growth in the
long run. Output per worker is by definition 𝑦𝑦 = 𝑌𝑌/𝑁𝑁 = 𝐴𝐴𝑦𝑦𝑒𝑒 . Since 𝑦𝑦𝑒𝑒 converges to a steady
state while 𝐴𝐴 grows at a constant rate 𝑔𝑔, the long-run growth rate of 𝑦𝑦 is 𝑔𝑔. Similarly, capital
per worker 𝑘𝑘 has long-run growth rate 𝑔𝑔.
Intuitively, this Solow model with constant growth in labour-augmenting technology is able
to generate long-run economic growth because growth in the capital stock in line with
labour-augmenting technology is able to scale up inputs of both capital and labour per
person in the economy. Hence, by constant returns to scale, output per person grows at the
rate of labour-augmenting technological progress. These improvements in technology are
effectively raising the return to capital and offsetting the diminishing returns to capital that
limit long-run growth in the basic version of the Solow model. The economy has a balanced
59
EC2065 Macroeconomics | Chapter 2: Economic growth
growth path where all per-worker variables grow in line with technological progress. On this
balanced growth path, returns to capital remain constant over time.
This version of the Solow model is consistent with the occurrence of long-run growth.
However, that growth comes only from technological progress, which is exogenous in the
model. The Solow model here is not actually explaining the source of long-run economic
growth – it is simply assuming it. In Section 2.6, we look at theories that make long-run
economic growth endogenous.
Using parameter 𝛼𝛼 = 1/3, note that (1 − 𝛼𝛼)⁄𝛼𝛼 = 2, so the implied capital returns across
two countries with 𝑦𝑦𝐴𝐴 ⁄𝑦𝑦𝐵𝐵 = 10 are 𝑅𝑅𝐵𝐵 ⁄𝑅𝑅𝐴𝐴 = 102 = 100. This numerical exercise suggests
the gross return to capital would be 100 times higher in the poorer country compared to
60
EC2065 Macroeconomics | Chapter 2: Economic growth
the rich country. Returns after depreciation 𝑟𝑟 = 𝑅𝑅 − 𝑑𝑑 would also differ by a large amount,
with 𝑟𝑟𝐵𝐵 = 𝑅𝑅𝐵𝐵 − 𝑑𝑑 = 100𝑅𝑅𝐴𝐴 − 𝑑𝑑 = 100𝑟𝑟𝐴𝐴 + 99𝑑𝑑.
However, if the difference in capital returns suggested by this exercise for realistic income
differences were really true then there would be huge incentives for rich countries to lend
to poor countries for investment. But international capital flows have not occurred to the
extent that would be expected if the calculation were correct. We conclude that some
assumptions made in this exercise must be incorrect. There are a number of possibilities:
• Some other factor of production that differs across countries has not been
accounted for, e.g. human capital. If human capital is higher in some countries, then
that raises their marginal product of physical capital 𝑀𝑀𝑃𝑃𝐾𝐾 .
• TFP 𝑧𝑧 differs across countries and differences in TFP imply differences in the
marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 . We explore in Box 2.3 what a difference in TFP
across countries might capture.
• Property rights are weaker in some countries, implying that investors’ actual return
is less than 𝑀𝑀𝑃𝑃𝐾𝐾 .
61
EC2065 Macroeconomics | Chapter 2: Economic growth
Institutions
Institutional differences across countries are one hypothesis for why there are large
differences in TFP. The ‘institutions’ of a country mean the set of rules that govern the
process of decision making and resource allocation in a country. These include the country’s
political and legal systems and economic institutions such as its markets, financial system,
and taxes and regulations. Institutions can affect both incentives to accumulate factors of
production and the efficiency with which those factors of production are allocated to
different uses.
A country with weak property rights, severe corruption and a poorly designed tax system
does not provide strong incentives to save, invest, or start a new business. For example, the
risk of expropriation reduces the expected return on any long-term investments, lowering
capital accumulation. This problem would also make it harder to attract foreign direct
investment, helping to explain the puzzle of why capital does not flow from rich to poor
countries.
Misallocation
Institutions can also affect the efficiency with which factors of production are allocated. For
example, in making loans, corruption might lead banks to favour those with political
connections over entrepreneurs with the best business ideas. Consequently, the economy’s
supply of capital 𝐾𝐾 and labour 𝑁𝑁 is not allocated to the businesses that can add the most to
GDP 𝑌𝑌. This shows up as a lower level of TFP 𝑧𝑧 in a country.
More generally, institutions affect incentives to pursue ‘rent-seeking’ rather than productive
activities. Productive activities are those that increase the total size of the ‘pie’, that is,
investments in physical and human capital, new businesses, and new technologies. Rent-
seeking refers to activities that give someone a larger share of the pie but do not increase its
size. For example, someone might seek a position of political power to benefit from
corruption. Institutions that reward rent-seeking activities thus divert effort from growing
the pie, which leads to lower TFP.
62
EC2065 Macroeconomics | Chapter 2: Economic growth
is maximised by saving nothing. However, that level of income 𝑦𝑦 cannot be sustained with
zero saving because the capital stock needed to produce output would eventually be
depleted.
Instead, we ask what is the highest sustainable level of 𝑐𝑐, and which saving rate is needed to
attain it? A sustainable value of 𝑐𝑐 is one that is a steady state for a particular saving rate 𝑠𝑠.
For extreme values of 𝑠𝑠, we can say immediately what the steady state for 𝑐𝑐 results. Having
𝑠𝑠 = 1 results in 𝑐𝑐 = 0, even though steady-state income 𝑦𝑦 might be very high. Having 𝑠𝑠 = 0
results in 𝑐𝑐 = 0 because the steady-state value of 𝑘𝑘 with zero saving is zero and this implies
that 𝑦𝑦 = 0 because some capital is essential for production. With the extreme values of 𝑠𝑠
leading to the worse possible outcomes for 𝑐𝑐 in steady state, it follows that an intermediate
saving rate with 0 < 𝑠𝑠 < 1 is optimal.
63
EC2065 Macroeconomics | Chapter 2: Economic growth
The golden rule level of capital per worker is the steady state where consumption per
worker is maximised. Mathematically, this can be found by differentiating the expression for
steady-state 𝑐𝑐 with respect to steady-state 𝑘𝑘, and the first-order condition is 𝜕𝜕𝜕𝜕⁄𝜕𝜕𝜕𝜕 = 0:
𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) = 𝑑𝑑 + 𝑛𝑛
Therefore, the golden rule calls for capital to be accumulated up to the point where the
marginal product of capital 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) is equal to the effective depreciation rate 𝑑𝑑 + 𝑛𝑛.
This first-order condition is illustrated geometrically in Figure 2.15. The golden rule is the
steady state for 𝑘𝑘 where the tangent to the per worker production function has the same
gradient as the effective depreciation line. This maximises the difference between the
heights of the production function and the effective depreciation line, with this difference
being the steady-state consumption per worker.
Figure 2.15: Golden rule diagram
Intuitively, the golden rule equation reflects a comparison of the marginal benefits and costs
of a higher level of capital per worker in steady state. The marginal benefit of maintaining an
64
EC2065 Macroeconomics | Chapter 2: Economic growth
extra unit of capital per worker is the additional output per worker 𝑀𝑀𝑃𝑃𝐾𝐾 each year. The
marginal cost of maintaining a stock of capital per worker permanently one unit higher is
the depreciation cost 𝑑𝑑 and the new capital 𝑛𝑛 required for new workers each year. The
difference between the benefit and the cost is extra output that can sustainably be
consumed, so the highest steady state for consumption per worker is where the marginal
benefit equals the marginal cost.
Once the golden rule level of 𝑘𝑘� is found, the saving rate 𝑠𝑠̂ that brings the economy to that
steady state is found from the equation 𝑠𝑠𝑠𝑠𝑠𝑠 (𝑘𝑘) = (𝑑𝑑 + 𝑛𝑛)𝑘𝑘:
(𝑑𝑑 + 𝑛𝑛)𝑘𝑘�
𝑠𝑠̂ =
𝑧𝑧𝑧𝑧(𝑘𝑘�)
However, starting from a steady state with a saving rate above the golden rule, the required
reduction in saving to reach the golden rule raises consumption during the transitional
period as well as the long run. In the short run, consumption per worker is actually higher
than in the long run because capital and output per worker are falling as the economy
approaches the golden rule steady state. This case is depicted in the right panel of Figure
2.16.
An economy with more capital in steady state than the golden rule level is said to be
dynamically inefficient because extra consumption in the long run can be obtained without
any short-run sacrifice. It is possible to raise consumption per worker in all subsequent years
65
EC2065 Macroeconomics | Chapter 2: Economic growth
by saving less. Where this is not true, so that some sacrifice now is necessary for a gain in
the future, the economy is said to be dynamically efficient.
If all households in the economy were alike (a representative household), then it is irrational
to continue to save more than the golden rule because everyone could be better off by
saving less. However, if saving is done by a young generation to accumulate assets for
retirement in old age, then it is possible to have a dynamically inefficient economy where
savers do not want to save less. We discuss dynamic inefficiency further in the ‘overlapping
generations’ economy introduced in Section 4.7.
In an economy with competitive markets, the gross return on capital is equal to the
marginal product of capital, that is, 𝑅𝑅 = 𝑀𝑀𝑃𝑃𝐾𝐾 . The real interest rate received by savers
would be equal to the real rate of return on capital after allowing for depreciation costs,
which means that 𝑟𝑟 = 𝑅𝑅 − 𝑑𝑑. In a steady state for capital per worker, 𝑘𝑘 and 𝑦𝑦 are constant
over time, so total GDP 𝑌𝑌 = 𝑁𝑁𝑁𝑁 grows at the same rate 𝑛𝑛 as the population 𝑁𝑁. If the
economy’s real GDP growth rate 𝑛𝑛 exceeds the real interest rate 𝑟𝑟 then:
𝑛𝑛 > 𝑀𝑀𝑃𝑃𝐾𝐾 − 𝑑𝑑
This implies 𝑀𝑀𝑃𝑃𝐾𝐾 < 𝑑𝑑 + 𝑛𝑛, and therefore the economy is dynamically inefficient because it
has more capital per worker in steady state than the golden rule level that satisfies 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝑑𝑑 + 𝑛𝑛 (recalling that 𝑀𝑀𝑃𝑃𝐾𝐾 diminishes with 𝑘𝑘).
The second test for dynamic inefficiency compares the capital share of income to the
investment share of expenditure. In a competitive economy, gross capital income is equal to
𝑅𝑅𝑅𝑅 = 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾, so the capital share of all gross income is 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾/𝑌𝑌. By definition of 𝑘𝑘 = 𝐾𝐾/𝑁𝑁
and 𝑦𝑦 = 𝑌𝑌/𝑁𝑁, the capital-output ratio can also be written as 𝐾𝐾⁄𝑌𝑌 = 𝑘𝑘⁄𝑦𝑦. Using 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘)
and 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑧𝑧′(𝑘𝑘), it follows that the capital share of income is 𝑧𝑧𝑓𝑓 ′ (𝑘𝑘)𝑘𝑘⁄(𝑧𝑧𝑧𝑧(𝑘𝑘)).
The investment share of total expenditure is 𝐼𝐼/𝑌𝑌, which is equal to 𝑆𝑆/𝑌𝑌 and to the saving
rate 𝑠𝑠 in the basic Solow model. The steady-state level of capital per worker 𝑘𝑘 satisfies the
equation 𝑠𝑠𝑠𝑠𝑠𝑠 (𝑘𝑘) = (𝑑𝑑 + 𝑛𝑛)𝑘𝑘, so the investment share of GDP is 𝑠𝑠 = (𝑑𝑑 + 𝑛𝑛)𝑘𝑘/(𝑧𝑧𝑧𝑧 (𝑘𝑘)). If
the investment share is greater than the capital income share:
(𝑑𝑑 + 𝑛𝑛)𝑘𝑘 𝑧𝑧𝑓𝑓 ′ (𝑘𝑘)𝑘𝑘
>
𝑧𝑧𝑧𝑧(𝑘𝑘) 𝑧𝑧𝑧𝑧(𝑘𝑘)
By cancelling terms in 𝑘𝑘 and 𝑧𝑧𝑧𝑧(𝑘𝑘) from both sides, we that this holds if 𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) < 𝑑𝑑 + 𝑛𝑛,
which is the condition for dynamical inefficiency.
In conclusion, dynamically inefficient economies have low real interest rates relative to real
GDP growth, or high levels of investment expenditure relative to capital income.
66
EC2065 Macroeconomics | Chapter 2: Economic growth
How might economies mitigate these negative economic effects? In particular, should
economies save more to compensate? One benchmark for an appropriate level of the
saving rate 𝑠𝑠 is the golden rule, which says that the goal of the highest sustainable
level of consumption per person is at steady state for capital per worker with 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝑑𝑑 + 𝑛𝑛. A higher depreciation rate 𝑑𝑑 thus reduces the golden rule capital stock. With
67
EC2065 Macroeconomics | Chapter 2: Economic growth
no change in the saving rate, we have already seen that steady-state capital per
worker will decline with higher 𝑑𝑑, so it is not clear whether the golden rule calls for a
higher or lower saving rate 𝑠𝑠.
A higher depreciation rate 𝑑𝑑 reduces steady-state output per worker 𝑦𝑦 ∗ but how large
is this effect? Consider an example we used earlier in Box 2.1 to study the effect of
differences in saving rates across countries. The production function is the Cobb-
Douglas form 𝑌𝑌 = 𝑧𝑧𝐾𝐾 𝛼𝛼 𝑁𝑁 1−𝛼𝛼 with 𝛼𝛼 = 1/3. With a Cobb-Douglas production function,
steady-state output per worker is:
𝛼𝛼
∗
1 𝑠𝑠 1−𝛼𝛼
𝑦𝑦 = 𝑧𝑧 1−𝛼𝛼 � �
𝑑𝑑 + 𝑛𝑛
We can find the effect of going from depreciation rate 𝑑𝑑1 to 𝑑𝑑2 by taking the ratio of
the resulting steady-state levels of 𝑦𝑦2∗ and 𝑦𝑦1∗ holding 𝑧𝑧 fixed:
1 2
𝑦𝑦2∗ 𝑑𝑑1 + 𝑛𝑛 3�3 𝑑𝑑1 + 𝑛𝑛
∗ = � � =�
𝑦𝑦1 𝑑𝑑2 + 𝑛𝑛 𝑑𝑑2 + 𝑛𝑛
Assuming a population growth rate 𝑛𝑛 = 1%, and a depreciation rate that rises just
one percentage point from 𝑑𝑑1 = 9% to 𝑑𝑑2 = 10%, we obtain:
𝑦𝑦2∗ 0.1
∗ =
� ≈ √0.91 ≈ 0.95
𝑦𝑦1 0.11
Therefore, when capital becomes obsolete or is destroyed 11% faster than before
(going from 9% to 10%), output per worker is 5% lower in the long run.
68
EC2065 Macroeconomics | Chapter 2: Economic growth
𝑌𝑌 = 𝐴𝐴𝐴𝐴
The resulting model is known as the ‘AK’ model after the equation for its production
function. In this model, TFP 𝐴𝐴 is assumed to be constant over time, so there is no exogenous
source of long-run growth.
Although the AK production function is not neoclassical, it still has constant returns to scale,
so it can be represented by a per-worker production function as in the Solow model. With
𝑓𝑓(𝑘𝑘) = 𝐹𝐹(𝑘𝑘, 1) = 𝑘𝑘, it follows that the per worker production function is 𝑦𝑦 = 𝐴𝐴𝐴𝐴, which can
also be seen directly by dividing 𝑌𝑌 = 𝐴𝐴𝐴𝐴 by 𝑁𝑁.
The marginal product of capital in the AK model is 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝐴𝐴𝑓𝑓 ′ (𝑘𝑘) = 𝐴𝐴, which is a constant.
This does not change with the amount of capital accumulated because there are no
diminishing returns to capital. It is also constant over time because there is no growth in TFP
𝐴𝐴 in this model.
As all of the other assumptions of the Solow model are maintained, the same equation for
the change over time in capital per worker is applicable here, the only difference being that
the per-worker production function 𝑦𝑦 = 𝐴𝐴𝐴𝐴 is used in place of 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘):
𝑠𝑠𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)𝑘𝑘
𝑘𝑘 ′ − 𝑘𝑘 =
(1 + 𝑛𝑛)
The dynamics of capital per worker and output per worker can thus be analysed using a
diagram similar to the one in the Solow model. Crucially, since the per worker production
function is linear, it and the saving line in the equivalent of the Solow diagram are now
straight lines. The AK model diagram is shown in Figure 2.18.
Figure 2.18: The AK model
The key difference compared to the Solow model diagram is that there is no intersection
between the saving and effective depreciation lines and, hence, no steady state (apart from
the uninteresting steady state at zero). This is because the saving line is a straight line,
69
EC2065 Macroeconomics | Chapter 2: Economic growth
reflecting the absence of diminishing returns to capital, unlike its concave shape in the
Solow model.
If 𝑠𝑠𝑠𝑠 > 𝑑𝑑 + 𝑛𝑛 then the saving line is steeper than the effective depreciation line and
therefore the change in capital per worker 𝑘𝑘 ′ − 𝑘𝑘 is always positive. This implies there is
perpetual growth with no convergence to a steady state. Moreover, the production function
𝑦𝑦 = 𝐴𝐴𝐴𝐴 implies output per worker 𝑦𝑦 grows at same rate as capital per worker 𝑘𝑘:
𝑦𝑦 ′ − 𝑦𝑦 𝑘𝑘 ′ − 𝑘𝑘 𝑠𝑠𝑠𝑠 − (𝑑𝑑 + 𝑛𝑛)
= =
𝑦𝑦 𝑘𝑘 1 + 𝑛𝑛
Moreover, the economy’s rate of growth depends on the saving rate 𝑠𝑠. In contrast to the
Solow model, changing the saving rate has a permanent effect on growth rate, not only on
level of income. An economy can enjoy a faster long-run growth rate by choosing to save a
higher fraction of income.
Although the AK model makes some interesting predictions, the AK production function
itself is problematic. As it completely excludes labour input 𝑁𝑁, the implied capital share of
income is 100 per cent – the AK production function is a special case of a Cobb-Douglas with
𝛼𝛼 = 1. However, including labour input 𝑁𝑁 in the production function while preserving
linearity in capital input 𝐾𝐾 would result in a production function with increasing returns to
scale. Nonetheless, there are alternative models of endogenous growth with similar
predictions to the AK model that make more plausible assumptions about the production
function.
70
EC2065 Macroeconomics | Chapter 2: Economic growth
In contrast, the Solow model predicts that a higher saving rate only raises the long-run
steady-state level of income per worker, without any effect on the long-run growth
rate. This means saving too much has a negative effect on long-run consumption and
there is a golden rule saving rate that maximises long-run consumption. In the AK
model, saving more (as long as the saving rate is less than 100 per cent) always
increases consumption in the long run.
Another important feature of endogenous growth models is that they do not generate
convergence among economies. For countries with the same TFP level, same
population growth and depreciation rates and the same saving rate, the AK model
implies the growth rate of income per worker is the same irrespective of whether a
country starts with more or less capital per worker than others. This is illustrated in
the left panel of Figure 2.20.
Figure 2.20: No convergence, or divergence, across countries
71
EC2065 Macroeconomics | Chapter 2: Economic growth
Moreover, endogenous growth also means that small but persistent differences in
saving rates can explain large gaps in income per worker across countries. This is
because different saving rates imply different growth rates, which when compounded
over time can open up very large gaps in income, as shown in the right panel of Figure
2.20. We have seen that the Solow model could not explain large income differences
even with large differences in saving rates.
Finally, observe that the AK model predicts there is no difference between the return on
capital in poor and rich countries. If countries share the same level of TFP, the marginal
product of capital is the same everywhere. This would resolve the puzzle of why capital
does not flow from rich to poor countries.
Suppose production is done by perfectly competitive firms using production function 𝑌𝑌𝑖𝑖 =
𝐹𝐹(𝐾𝐾𝑖𝑖 , 𝐴𝐴𝑁𝑁𝑖𝑖 ). Firm 𝑖𝑖 uses capital 𝐾𝐾𝑖𝑖 and hires labour 𝑁𝑁𝑖𝑖 to produce output 𝑌𝑌𝑖𝑖 . There is labour-
augmenting technology 𝐴𝐴 that is common to all firms because knowledge is a public good
(non-rival and non-excludable). Unlike the AK model, here the production function
𝐹𝐹(𝐾𝐾𝑖𝑖 , 𝐴𝐴𝑁𝑁𝑖𝑖 ) at the level of individual firms is neoclassical, so there are diminishing returns to
capital.
Each firm 𝑖𝑖 hires labour 𝑁𝑁𝑖𝑖 up to the point where the wage 𝑤𝑤 equals the marginal product of
labour 𝑀𝑀𝑃𝑃𝑁𝑁𝑖𝑖 . As all competitive firms face the same wage, the marginal products of labour
and capital end up being the same across all firms. These marginal products then determine
the distribution of income between labour and capital in the usual way. Given knowledge 𝐴𝐴,
the production function for whole economy 𝑌𝑌 = 𝐹𝐹(𝐾𝐾, 𝐴𝐴𝐴𝐴) is the same as the one for an
individual firm but with 𝑌𝑌 being GDP, and 𝐾𝐾 and 𝑁𝑁 being the total capital stock and labour
force. Assume the aggregate supply of labour 𝑁𝑁 is constant because there is no population
growth (𝑛𝑛 = 0).
72
EC2065 Macroeconomics | Chapter 2: Economic growth
The key assumption of the learning-by-doing model is that economy-wide knowledge 𝐴𝐴 rises
with the economy’s capital stock 𝐾𝐾. Specifically, we assume the two variables are
proportional with 𝜆𝜆 > 0 being a constant:
𝐴𝐴 = 𝜆𝜆𝜆𝜆
By substituting 𝐴𝐴 = 𝜆𝜆𝜆𝜆 into 𝑌𝑌 = 𝐹𝐹(𝐾𝐾, 𝐴𝐴𝐴𝐴), the aggregate production function becomes
𝑌𝑌 = 𝐹𝐹(𝐾𝐾, 𝜆𝜆𝜆𝜆𝜆𝜆). The constant-returns-to-scale property of the production function then
implies 𝑌𝑌 = 𝐹𝐹(1, 𝜆𝜆𝜆𝜆)𝐾𝐾. As 𝐹𝐹(1, 𝜆𝜆𝜆𝜆) is a constant, this has the same form as an ‘AK’
production function 𝑌𝑌 = 𝑧𝑧𝑧𝑧, where the constant 𝑧𝑧 is 𝑧𝑧 = 𝐹𝐹(1, 𝜆𝜆𝜆𝜆).
Intuitively, the spill over effect from the aggregate capital stock to knowledge offsets the
usual diminishing returns to capital. If labour-augmenting technology 𝐴𝐴 is proportional to
capital 𝐾𝐾, the spill over cancels out diminishing returns and implies constant returns to
capital overall. This makes it equivalent to an ‘AK’ production function in the aggregate,
even though individual production functions at the level of firms are neoclassical. Therefore,
it is possible to generate endogenous growth without having directly to assume the unusual
features of the AK production function itself.
However, even if there is a spill over from capital accumulation to knowledge, the model’s
implications can change substantially if knowledge is not proportional to capital. If following
a 1 per cent increase in 𝐾𝐾, knowledge 𝐴𝐴 rises by less than 1 per cent then there are
diminishing returns to aggregate capital and it is not possible to generate endogenous long-
run growth. On the other hand, if 𝐴𝐴 rises more than 1 per cent following the 1 per cent rise
in 𝐾𝐾 then there are increasing returns to aggregate capital. This would imply endogenous
growth but, unrealistically, the economy’s growth rate would increase over time.
Unfortunately, the version of the model with stable endogenous growth resembles a special
case for which there is no clear support.
Even in the case where 𝐴𝐴 and 𝐾𝐾 are proportional, the learning-by-doing model implies what
are known as ‘scale effects’. This means that the economy’s growth rate of income per
person depends positively on the size of the population, a prediction that has a lack of
empirical support. This feature of the learning-by-doing model results from a larger
economy with more people producing more non-rival knowledge from which everyone
benefits. Mathematically, it can be seen from the aggregate production function displaying
increasing returns to capital 𝐾𝐾 and labour 𝑁𝑁 together.
73
EC2065 Macroeconomics | Chapter 2: Economic growth
How might an economy’s stock of human capital be measured? It is possible to look at data
on average years of schooling and the fraction of college graduates in the population.
Combined with research on wages that estimates the effect of, for example, an extra year of
education on workers’ pay, a measure of human capital based on the wage premiums
associated with education/training can be constructed.
Although we have seen models earlier where effective labour input depended on
technology, it is important to note that human capital is different from the notions of ideas
and technology. Human capital is embodied in a worker who has learned a particular skill –
it is not the abstract concept or discovery of the knowledge for the first time.
We now consider a model of human capital accumulation as a potential source of
endogenous long-run growth. The economy’s stock of human capital is denoted by 𝐻𝐻.
Assume the labour force 𝑁𝑁 has a constant size, so a change in 𝐻𝐻 reflects a change in average
human capital per person. Output of final goods and services 𝑌𝑌 is produced with physical
capital 𝐾𝐾 and human capital according to the neoclassical production function:
𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑢𝑢𝑢𝑢)
In the production function, 𝑢𝑢 denotes fraction of human capital 𝐻𝐻 used to produce goods.
As we will see, some existing human capital is dedicated to producing future human capital,
not goods and services in the present. We assume TFP 𝑧𝑧 is constant over time, and continue
to maintain all the other assumptions about capital and goods from the Solow model.
Human capital itself needs to be produced using existing human capital. Given 𝑢𝑢, the
amount of existing 𝐻𝐻 producing new human capital is (1 − 𝑢𝑢)𝐻𝐻. We assume a production
function 𝐺𝐺(∙) for producing new human capital. The only input is (1 − 𝑢𝑢)𝐻𝐻, although it is
possible to generalise this making production of human capital depend on both physical and
human capital.
The equation for next time period’s human capital 𝐻𝐻′ is:
𝐻𝐻′ = (1 − 𝑑𝑑𝐻𝐻 )𝐻𝐻 + 𝐺𝐺((1 − 𝑢𝑢)𝐻𝐻)
The first term represents undepreciated current human capital that remains usable in the
future, assuming the rate at which human capital depreciates is a constant 𝑑𝑑𝐻𝐻 . The
dynamics of human capital are therefore determined by:
The difference between the terms on the right-hand side is the difference between a ‘saving
line’ 𝐺𝐺�(1 − 𝑢𝑢)𝐻𝐻�, the shape of which depends on the function 𝐺𝐺(∙), and a depreciation line
𝑑𝑑𝐻𝐻 𝐻𝐻, which is a straight line. A diagram for human capital accumulation analogous to the
Solow model diagram is displayed in Figure 2.21.
74
EC2065 Macroeconomics | Chapter 2: Economic growth
The predictions of the human capital model depend on whether the returns to producing
human capital are diminishing or constant. The case of diminishing returns is shown in the
right panel of the figure. The human capital production function 𝐺𝐺(∙) is concave, which
implies it intersects the depreciation line and there exists a steady state 𝐻𝐻 ∗ for the stock of
human capital. As the 𝐻𝐻 entering the goods production function does not grow in the long
run, diminishing returns to physical capital in producing goods imply that there is no long-
run economic growth by the usual Solow model logic.
But what happens if there are constant returns to producing human capital? In this case, the
function 𝐺𝐺 (∙) is linear. There is no intersection between the saving and depreciation lines
for human capital (except at zero), so it is possible to have a constant positive growth rate of
𝐻𝐻 in the long run. The implications for the production of goods and services is then exactly
equivalent to the Solow model with labour-augmenting technological progress studied in
Section 2.3. Mathematically, constant growth in workers’ human capital 𝐻𝐻 plays exactly the
same role as constant growth of labour-augmenting technology 𝐴𝐴.
An increase in the amount of resources devoted to education and training can be
represented by a decline in 𝑢𝑢 and an increase in 1 − 𝑢𝑢. If the human capital model
generates positive long-run growth, then higher 1 − 𝑢𝑢 increases the economy’s long-run
growth rate. Although there is certain to be a benefit in the long run, this is not a free lunch
because there is a cost during a transitional period with lower 𝑢𝑢 implying lower output 𝑌𝑌
starting from the initial level of 𝐻𝐻.
If the model does not generate long-run growth, then reducing 𝑢𝑢 has two conflicting effects
on the long-run level of output. First, it leads to a higher steady-state stock of human
capital, which boosts output. Second, it reduces the share of human capital used to produce
goods, which lowers output of goods and services. There is an optimal choice of 𝑢𝑢 to
maximise long-run output level analogous to the Golden Rule for capital accumulation.
75
EC2065 Macroeconomics | Chapter 2: Economic growth
But we see that firms and governments devote considerable resources to research and
development (R&D). R&D activity directly aims to discover new ideas and technologies, with
countries at the frontiers of research often spending 2–3 per cent of GDP on R&D. The R&D
model introduced in this section studies growth through the deliberate accumulation of
ideas and technologies rather than of physical or human capital.
76
EC2065 Macroeconomics | Chapter 2: Economic growth
This means the growth rate (𝐴𝐴′ − 𝐴𝐴)/𝐴𝐴 of the stock of ideas 𝐴𝐴 is (1 − 𝑢𝑢)𝑁𝑁⁄𝜇𝜇. The linearity
of the production function for new ideas implies constant returns to R&D activity, i.e. the
same workforce (1 − 𝑢𝑢)𝑁𝑁 in the R&D sector can produce new ideas at a constant rate.
Given 𝑢𝑢, 𝜇𝜇, and 𝑁𝑁, the R&D model predicts a constant positive growth rate of 𝐴𝐴. The
implications for real income per worker can then be analysed using the framework from
Section 2.3 where there was exogenous labour-augmenting technological progress. With a
constant growth rate of 𝐴𝐴, there is positive long-run growth in output 𝑌𝑌 (and, hence, output
per worker here) at the same rate (1 − 𝑢𝑢)𝑁𝑁⁄𝜇𝜇, so the R&D model generates growth
endogenously. Moreover, shifting resources to the R&D sector, i.e. lowering 𝑢𝑢 and raising
1 − 𝑢𝑢, raises the economy’s long-run growth rate. This comes with the short-run cost of
lower output of goods when workers are diverted from the goods-producing sector to the
R&D sector but before they have produced any new knowledge.
We have not analysed the determinants of the fraction 1 − 𝑢𝑢 of labour allocated to R&D
using the model but we can think about this as resulting from a comparison of benefits and
costs to firms engaged in R&D. The cost to a firm of having more R&D workers is the wages
they need to be paid. The benefit of having them depends on the value to the firm itself of
the knowledge the R&D workers discover, for example, the value of new patents registered.
This value is affected by the strength of patent protection. Stronger protection of
intellectual property rights raises the private benefit of R&D and should lead to a lower
value of 𝑢𝑢.
77
EC2065 Macroeconomics | Chapter 2: Economic growth
78
EC2065 Macroeconomics | Chapter 2: Economic growth
Country 2 can copy ideas from country 1 at a cost. Some labour input is required to
understand and apply the discoveries of country 1 but less than is needed to ‘reinvent the
wheel’ through independent R&D. We write down a ‘production function’ for ideas newly
imitated in country 2 that is also of the form 𝐴𝐴′2 − 𝐴𝐴2 = (1 − 𝑢𝑢2 )𝑁𝑁/𝜇𝜇𝑐𝑐 , but with 𝜇𝜇𝑐𝑐 being
the cost of copying ideas rather than innovating. This cost 𝜇𝜇𝑐𝑐 is lower than the cost of
innovation 𝜇𝜇𝑖𝑖 .
While 𝜇𝜇𝑖𝑖 is a constant parameter, reflecting the assumption of constant returns in R&D for
the frontier country, it is reasonable that the cost of imitation 𝜇𝜇𝑐𝑐 should depend on the size
of the knowledge gap between countries 1 and 2. Hence, 𝜇𝜇𝑐𝑐 is assumed to be a function
𝜇𝜇𝑐𝑐 = 𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) of the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 . The cost function is decreasing in the size of
the knowledge gap 𝐴𝐴1 /𝐴𝐴2 because it is easier to copy more basic ideas discovered by
country 1 further in the past. The cost is close to zero when 𝐴𝐴1 /𝐴𝐴2 is extremely large
because when country 2 starts far behind, picking up some simple knowledge from country
1 should be almost free. Finally, the cost approaches the cost of innovation 𝜇𝜇𝑖𝑖 when the
knowledge gap becomes small (𝐴𝐴1 /𝐴𝐴2 is close to 1). This says that cost of quickly imitating
the most recent discoveries of country 1 is close to the cost of doing innovation itself. A cost
function with these features is depicted in Figure 2.22.
Figure 2.22: Imitation cost function
The growth rate of knowledge 𝐴𝐴1 and GDP in country 1 is (1 − 𝑢𝑢1 )𝑁𝑁/𝜇𝜇𝑖𝑖 , and the growth
rate of knowledge 𝐴𝐴2 and GDP in country 2 is (1 − 𝑢𝑢2 )𝑁𝑁/𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ). The two growth rates
are plotted in Figure 2.23 as a function of the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 . For country 1, its
growth rate is independent of how far country 2 is behind. For country 2, its growth rate is
faster when it is further behind because it becomes cheaper to imitate, i.e. 𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) is
lower. Over time, the ratio 𝐴𝐴1 /𝐴𝐴2 declines to the extent that the growth rate of 𝐴𝐴2 exceeds
the growth rate of 𝐴𝐴1 . We can see from the diagram that there is convergence to a steady
state for the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 where the growth rate lines intersect. Mathematically,
this occurs where:
𝐴𝐴1 (1 − 𝑢𝑢2 )
𝜇𝜇𝑐𝑐 = 𝑐𝑐 � �= 𝜇𝜇 < 𝜇𝜇𝑖𝑖
𝐴𝐴2 (1 − 𝑢𝑢1 ) 𝑖𝑖
79
EC2065 Macroeconomics | Chapter 2: Economic growth
Since 𝐴𝐴1 /𝐴𝐴2 converges to a steady state in the long run, the follower country’s long-run
growth rate equals the growth rate of the frontier country. This means that even if 𝑢𝑢1 and
𝑢𝑢2 are permanently different, the gap between the countries eventually stabilises. The
international technology transfer model thus suggests it is difficult to explain very large
income differences across countries through knowledge gaps because those countries that
have not yet adopted much knowledge from the rest of the world can improve at a
relatively low cost. Note that this does not mean that knowledge gaps will shrink to zero in
the long run. With fewer resources allocated to obtaining knowledge, country 2 cannot
imitate the newest technologies of country 1.
Figure 2.23: The steady-state knowledge gap across countries
If country 2 were to lower 𝑢𝑢2 , allocating more workers to imitating country 1’s ideas, this
will shrink the knowledge gap between the two in the long run but it would not change
country 2’s long-run growth rate (assuming country 2 remains the follower). This reflects
the diminishing returns to imitation because the stock of existing knowledge to copy from
country 1 is necessarily finite.
80
EC2065 Macroeconomics | Chapter 2: Economic growth
The imitation cost 𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) rises for each level of the knowledge ratio 𝐴𝐴1 /𝐴𝐴2 . This
implies the growth rate (1 − 𝑢𝑢2 )𝑁𝑁/𝑐𝑐(𝐴𝐴1 ⁄𝐴𝐴2 ) of 𝐴𝐴2 is lower for each value of 𝐴𝐴1 ⁄𝐴𝐴2 ,
shifting down the growth rate function for country 2 as shown in the left panel of
Figure 2.24.
Firms in the innovating country (1) now gain more from discovering new technologies
and have a greater incentive to do R&D. This increases the fraction of workers
employed in the R&D sector, i.e. 𝑢𝑢1 is lower. The growth rate (1 − 𝑢𝑢1 )𝑁𝑁⁄𝜇𝜇𝑖𝑖 of 𝐴𝐴1 is
now higher, which shifts up the growth rate line for country 1 in the figure (it is a
horizontal line because the growth rate is independent of the levels of 𝐴𝐴1 and 𝐴𝐴2 ).
Figure 2.24: Effects of stronger intellectual property rights
The right panel of the figure depicts the time paths of knowledge (in logarithms) for the
two countries. The gradient of the path for country 1 becomes steeper as the growth
rate of knowledge 𝐴𝐴1 increases. From the perspective of the follower country 2,
stronger intellectual property rights slow down the adoption of new technologies from
the frontier country, which means 𝐴𝐴2 is below the path it would have followed for
some time after the strengthening of intellectual property rights. This implies its output
per worker grows more slowly for some time.
However, since there is a steady state for relative technology 𝐴𝐴1 /𝐴𝐴2 , albeit at a higher
level than before, eventually 𝐴𝐴2 will rise above the path it would have followed
because of the faster growth in 𝐴𝐴1 now country 1 has a greater incentive to innovate. It
follows that country 2 gains from greater protection of intellectual property rights in
the long run, although loses out during a transitional period (which could last for a
considerable amount of time).
81
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The focus on the demand side means we will give more emphasis to fluctuations in the
economy compared to the long-run trends that were the subject of the previous two
chapters. We will look at how macroeconomic data can be detrended to zoom in on the
business-cycle, i.e. the fluctuations of macroeconomic variables around their long-run
trends.
After having studied the determinants of consumption and investment, we will see how to
integrate our analysis of the demand side of the economy with the supply side from earlier
chapters. This entails setting up a dynamic macroeconomic model where demand and
supply factors both matter in understanding changes in GDP.
Essential reading
• Williamson, Chapters 9 and 11.
82
EC2065 Macroeconomics | Chapter 3: Aggregate demand
and persistent negative deviations from trend are ‘recessions’. The turning points of the
business cycle are the peaks and troughs in the deviations of real GDP from its trend. Note
that the definition of a recession here differs from the conventional two consecutive
quarters of decline in real GDP and that an economy can still be in recession if it is still
below its trend while growing.
Figure 3.1: Stylised representation of business cycles
Some terms are widely used to describe the patterns seen in diagrams like Figure 3.1. The
amplitude of the fluctuations in a variable refers to how far above or below trend a variable
goes. The persistence of the fluctuations in a variable refers to how long it typically takes for
the variable to return to its trend line. The frequency of fluctuations refers to how often the
variable switches from being above trend to below trend, or below to above.
3.1.2 Detrending
How is the detrending done to construct a version of the figure with real data? It is not
necessary to know the technical details in this course but it essentially involves putting a line
of best fit through the data points plotted as a time series. If the long-run percentage
growth rate is stable, the trend line for data plotted in logarithms can be estimated as the
best-fitting straight line. In practice, long-run growth rates are not entirely stable, so the
best trend line is not a completely straight line. Researchers often use the Hodrick-Prescott
(HP) filter, which is essentially a trend line that can change ‘smoothly’ over time.
Figure 3.2 shows the HP-filter trend line for the logarithm of US real GDP, plotted as a
quarterly time series from 1947 to 2021. The estimated HP-filter trend line is not completely
straight. For instance, the gradient is steeper in the 1960s and 1990s than the 1970s,
reflecting the ‘productivity slowdown’ experienced by the US economy in the 1970s. We see
that the trend dominates the data, so that even events such the recessions after the 2008
financial crisis and during the COVID pandemic appear as relatively small dips compared to
the secular progress in the US economy over many decades.
83
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.3 shows the percentage deviations from US real GDP from its HP-filter trend line,
and it is calculated as the difference between actual and trend log real GDP multiplied by
100. This is our measure of the business cycle in the USA. The business cycle is typically
ranges from +4% to −4% of trend real GDP, although the recession owing to COVID is
much larger. The period from the mid-1980s up to the 2008 financial crisis is known as the
‘great moderation’ owing to the small ±2% range of fluctuations in real GDP.
Figure 3.2: HP filter applied to US real GDP
84
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The relative volatility of a variable 𝑥𝑥 can be judged by plotting a time series of its percentage
deviations from trend against those of real GDP. If the amplitude of the fluctuations in 𝑥𝑥 is
typically larger than those of real GDP then 𝑥𝑥 has a higher relative volatility. If 𝑥𝑥 has smaller-
85
EC2065 Macroeconomics | Chapter 3: Aggregate demand
amplitude fluctuations than real GDP, as shown for example in Figure 3.6, then 𝑥𝑥 has a
lower relative volatility.
Finally, the dynamic relationship between 𝑥𝑥 and real GDP can be described in terms of leads
or lags. If the fluctuations in 𝑥𝑥 anticipate those of real GDP that follow then 𝑥𝑥 is said to be a
‘leading’ variable. On the other hand, if fluctuations in 𝑥𝑥 follow those of real GDP with some
delay than 𝑥𝑥 is said to be ‘lagging’. These two cases are illustrated in Figure 3.7. If 𝑥𝑥 neither
leads nor lags real GDP then it is said to be ‘coincident’.
Figure 3.6: Relative volatility
3.2 Consumption
Consumption refers to purchases of final goods and services by households. Consumption is
the ultimate purpose of all economic activity and a key determinant of households’ welfare.
We begin by documenting some simple facts about consumption. In advanced economies,
consumption expenditure is typically around two-thirds of GDP. This makes it the largest
component of aggregate expenditure. While it makes up a large fraction of GDP,
86
EC2065 Macroeconomics | Chapter 3: Aggregate demand
consumption is generally less volatile than GDP. This observation is important in motivating
the theories of consumption we will study.
Figure 3.8 shows detrended real consumption expenditure in the United States alongside
detrended real GDP for the period 1947–2021. The peaks and troughs of consumption
closely align with those of real GDP, so consumption is said to be a procyclical and
coincident variable because no leads or lags can be discerned relative to GDP. The most
notable observation is that the absolute value of the peaks and troughs of consumption are
almost always smaller than those of GDP. Consumption therefore has a lower relative
volatility compared to GDP.
Figure 3.8: Empirical evidence on consumption
However, as we will see, it may not always be possible to achieve this consumption
smoothing. In particular, borrowing against the expectation of receiving income in the
future might be very costly or difficult. This is a particular example of what we will refer to
as ‘credit-market imperfections’ and we will see how they place limits on the extent to
which households are able to smooth consumption even if they want to.
87
EC2065 Macroeconomics | Chapter 3: Aggregate demand
3.3.1 Preferences
In the model, households choose consumption plans (𝑐𝑐, 𝑐𝑐 ′ ), where 𝑐𝑐 is current consumption
and 𝑐𝑐′ is future consumption. Preferences over consumption plans can be represented in a
diagram with 𝑐𝑐 and 𝑐𝑐′ on the axes using indifference curves as shown in Figure 3.9.
Indifference curves are downward sloping because more consumption is preferred to less at
each date. Importantly, indifference curves are drawn as convex to origin. This indicates a
dislike of extreme consumption plans, for example, the household does not like to have high
𝑐𝑐 but 𝑐𝑐′ being very low and would prefer a greater balance between 𝑐𝑐 and 𝑐𝑐′. Similarly, the
household would prefer 𝑐𝑐 and 𝑐𝑐′ closer together than very low 𝑐𝑐 but high 𝑐𝑐′. This is how we
represent a preference for consumption smoothing.
The absolute value of the gradient of the indifference curve at a point is known as the
marginal rate of substitution between 𝑐𝑐 and 𝑐𝑐′, denoted by 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐 ′ . Since the indifference
curve gradient indicates how much extra future consumption 𝑐𝑐′ the household would
require to be as well off after losing a unit of current consumption 𝑐𝑐, the marginal rate of
substitution indicates the relative value households put on a unit of current consumption
compared to future consumption. The convex shape of the indifference curves is equivalent
to assuming the marginal rate of substitution is diminishing as current consumption 𝑐𝑐 rises.
Another assumption on preferences is that both 𝑐𝑐 and 𝑐𝑐′ are normal goods. A normal good
is a good the household wants more of when better off. Geometrically, this assumption
means that if points on different indifference curves with the same marginal rate of
substitution were joined up then they would trace out an upward sloping line. This
assumption captures the idea that if more consumption is affordable, the household would
like to spread out the extra expenditure over time.
88
EC2065 Macroeconomics | Chapter 3: Aggregate demand
89
EC2065 Macroeconomics | Chapter 3: Aggregate demand
period. In cases where the household is a borrower, interest has to be paid on debts.
Assume for now that the household can borrow at the same interest rate 𝑟𝑟 that savers
receive. Later financial-market imperfections will explain why borrowers face higher interest
rates than savers in practice.
In the simple two-period model, we suppose that the household begins from a blank slate in
terms of not having any initial assets or debts at the beginning of the current period. Given
saving 𝑠𝑠, the household has financial wealth (1 + 𝑟𝑟)𝑠𝑠 in the future period, which includes
the saving 𝑠𝑠 plus real interest income 𝑟𝑟𝑟𝑟. Note that interest income is counted separately
from future non-financial income 𝑦𝑦′.
In the future period, the household has disposable income 𝑦𝑦 ′ − 𝑡𝑡′ and net financial wealth
(1 + 𝑟𝑟)𝑠𝑠. Net wealth is negative for borrowers. Taking disposable income and financial
wealth together, the household can afford future consumption:
𝑐𝑐 ′ = 𝑦𝑦 ′ − 𝑡𝑡 ′ + (1 + 𝑟𝑟)𝑠𝑠
Given the nature of the two-period model, the future period comprises the whole future the
household is planning for, so there is no reason to save for anything after the future period.
Hence, 𝑐𝑐’ is given by the equation above conditional on the choice of current saving 𝑠𝑠. As
mentioned, saving 𝑠𝑠 can be negative and the only limit imposed on borrowing here is that
debts including interest do not exceed the ability to repay as measured by future disposable
income 𝑦𝑦 ′ − 𝑡𝑡 ′ . It may be reasonable to consider stricter limits on borrowing with financial-
market imperfections and examples of these are considered in Chapter 4.
Dividing both sides of the equation for 𝑐𝑐′ by 1 + 𝑟𝑟 implies
𝑐𝑐 ′ 𝑦𝑦 ′ − 𝑡𝑡′
= + 𝑠𝑠
1 + 𝑟𝑟 1 + 𝑟𝑟
and substituting the definition of saving 𝑠𝑠 = 𝑦𝑦 − 𝑡𝑡 − 𝑐𝑐 and grouping consumption and
disposable income terms together:
𝑐𝑐 ′ 𝑦𝑦 ′ − 𝑡𝑡′
𝑐𝑐 + = 𝑦𝑦 − 𝑡𝑡 +
1 + 𝑟𝑟 1 + 𝑟𝑟
Consumption plans (𝑐𝑐, 𝑐𝑐 ′ ) must satisfy this equation to be affordable to the household.
Geometrically, the budget constraint is a straight line with gradient −(1 + 𝑟𝑟) passing
through the ‘endowment’ point (𝑦𝑦 − 𝑡𝑡, 𝑦𝑦 ′ − 𝑡𝑡 ′ ) as illustrated in Figure 3.10.
90
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The household can always choose neither to be a saver nor a borrower, meaning that 𝑐𝑐 =
𝑦𝑦 − 𝑡𝑡 and 𝑐𝑐 ′ = 𝑦𝑦 ′ − 𝑡𝑡′, so disposable income is simply consumed in both time periods. This
means choosing a consumption plan at the endowment point in the diagram (labelled E).
Choosing a consumption plan to the left of the endowment point means choosing to be a
saver (𝑠𝑠 > 0), while choosing a plan to the right of the endowment point means choosing to
be a borrower (𝑠𝑠 < 0).
For each unit less of income consumed in the current period, an extra unit can be saved,
which means an extra amount 1 + 𝑟𝑟 of financial wealth in the future and thus an extra
amount 1 + 𝑟𝑟 of future consumption is affordable. This is why the budget constraint has
gradient −(1 + 𝑟𝑟). In the absence of financial-market imperfections, the budget line is a
straight line (it has the same gradient in the saving and borrowing regions) and extends all
the way down to the horizontal axis.
The budget constraint is described as a ‘lifetime budget constraint’ because it includes
consumption and income in both periods. However, 𝑐𝑐 and 𝑐𝑐′ and 𝑦𝑦 − 𝑡𝑡 and 𝑦𝑦 ′ − 𝑡𝑡′ cannot
simply be added together because they occur in different periods. Instead, present
discounted values (PDVs) of future consumption 𝑐𝑐 ′ /(1 + 𝑟𝑟) and future income (𝑦𝑦 ′ −
𝑡𝑡 ′ )/(1 + 𝑟𝑟) are added to current 𝑐𝑐 and 𝑦𝑦 − 𝑡𝑡. The present value of a future amount 1 is
equal to 1/(1 + 𝑟𝑟) because if 1/(1 + 𝑟𝑟) were saved in the current period, after including
interest, it would be worth (1 + 𝑟𝑟)/(1 + 𝑟𝑟) = 1 in the future period. The lifetime budget
constraint shows that the present discounted value of all consumption across the two
periods is limited by the present value of all current and future disposable income.
The present discounted value of all income after tax can be interpreted as ‘lifetime’ or
‘human’ wealth, which is denoted by ℎ:
𝑦𝑦 ′ − 𝑡𝑡 ′
ℎ = 𝑦𝑦 − 𝑡𝑡 +
1 + 𝑟𝑟
This is what an asset making payments equal to disposable income in the two periods would
be worth to the household. If the household had any initial financial assets in the current
period, the value of these would be added to ℎ to obtain total wealth in all forms.
91
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Intuitively, the budget constraint gradient 1 + 𝑟𝑟 (in absolute value) represents the market
price of current consumption in terms of future consumption, while the indifference curve
gradient represents how much extra 𝑐𝑐′ is needed to compensate the household for the loss
of one unit of 𝑐𝑐. The tangency point is thus where the subjective value the household puts
on current consumption relative to future consumption is equal to the relative cost of
current consumption given the market interest rate 𝑟𝑟.
92
EC2065 Macroeconomics | Chapter 3: Aggregate demand
A relationship between consumption and current income consistent with the cross-
sectional empirical evidence is sometimes directly assumed in macroeconomics. This is
the Keynesian consumption function relating aggregate consumption 𝐶𝐶 to aggregate
disposable income 𝑌𝑌 − 𝑇𝑇, where 𝑇𝑇 denotes taxes:
𝐶𝐶 = 𝑎𝑎 + 𝑏𝑏(𝑌𝑌 − 𝑇𝑇)
The terms a and b are parameters with a>0 and 0<b<1. The marginal propensity to
consume ∂C/∂Y is the constant b. The average propensity to consume C/Y declines as
income rises because a is positive, that is to say, a higher fraction of income is saved when
income is high. Although this equation is consistent with the cross-sectional evidence, it
actually posits that the same relationship holds between aggregate consumption and
income over time. As we will see, this is not the case. Moreover, the equation is not
derived from a model of rational behaviour by households.
93
EC2065 Macroeconomics | Chapter 3: Aggregate demand
• Stage of life cycle: the young and old have lower incomes on average compared to
the middle-aged
• Temporary shocks: some people experience periods of unemployment, others earn
large bonuses/overtime payments.
First, there are large predictable differences between current income and future income
because households are of different ages. However, the preference for consumption
smoothing means that the optimal consumption plan has 𝑐𝑐 and 𝑐𝑐′ closer together than are
𝑦𝑦 − 𝑡𝑡 and 𝑦𝑦 ′ − 𝑡𝑡′. This is consistent with the cross-sectional empirical evidence.
The left panel of Figure 3.13 illustrates the consumption choice diagram for a young person
with a low current income but expectations of higher income in the future, so the
endowment point 𝐸𝐸 is at a relatively extreme north-west position in the diagram. The right
panel shows a middle-aged person with a high current income but expectations of lower
income in the future owing to retirement, so the endowment point is at a relatively
extreme south-east position. Households with quite different levels of current income
could choose very similar levels of consumption.
Figure 3.13: Life-cycle differences in income
If the life cycle were the only reason for income differences across households then the
predicted consumption-income relationship would be even weaker than found in the data.
However, there are plenty of income differences that do not average out over the life
cycle, for example, people having different careers.
A second source of income variation comes from temporary shocks, such as receiving a
bonus, or a period of unemployment. We can show the predicted effects of such shocks in
the dynamic consumption model by shifting the endowment point 𝐸𝐸 with the change in
income. A temporary shock affects only current income 𝑦𝑦, leaving 𝑦𝑦 ′ unchanged, which
moves the endowment point horizontally. A movement of 𝐸𝐸 causes a parallel shift of the
lifetime budget constraint. Taking the case of a temporary rise in income 𝑦𝑦, Figure 3.14
shows the endowment point move to the right from 𝐸𝐸0 to 𝐸𝐸1 . The budget constraint shifts
to the right, and both 𝑐𝑐 and 𝑐𝑐′ rise in response because they are normal goods. The desire
for consumption smoothing reflected in the rise94of both 𝑐𝑐 and 𝑐𝑐′ means that 𝑐𝑐 rises by less
than 𝑦𝑦. This is also consistent with cross-sectional data on consumption and income.
EC2065 Macroeconomics | Chapter 3: Aggregate demand
95
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Households can also save by holding bonds. Bonds are a particular type of financial asset
that promise a sequence of fixed future payments. For now, assume these payments are
fixed in real terms, rather than in units of money. This would correspond to what is called an
inflation-indexed bond.
The yield or interest rate on a bond (these terms are interchangeable) is the discount rate
that makes the present value of all the bond’s future fixed payments equal to the actual
market price it currently trades at. Since the bond payments are fixed, high bond prices are
equivalent to low yields/interest rates and low bond prices are equivalent to high
yields/interest rates.
The simplest form of bond is a discount bond that makes only one payment (its face value)
at the maturity date of the bond. If a bond is held to maturity, the real return received is
equal to the bond’s real yield.
Consider a discount bond maturing in the next period and assume the face value is worth
one unit of goods. If the bond’s price currently is worth 𝑝𝑝 units of goods, then the real yield
or real interest rate 𝑟𝑟 on the bond is the discount rate that makes the present value of 1
equal to the bond price 𝑝𝑝:
1
𝑝𝑝 =
1 + 𝑟𝑟
If the bond has no risk of default, an investment of 𝑝𝑝 gives a payoff of 1 in the future. An
investor therefore makes a profit of 1 − 𝑝𝑝 on an investment of 𝑝𝑝, so the percentage real
return is (1 − 𝑝𝑝)/𝑝𝑝 = 𝑟𝑟.
96
EC2065 Macroeconomics | Chapter 3: Aggregate demand
gradient changes, the lifetime budget constraint pivots around the endowment point 𝐸𝐸,
which is always affordable.
To understand how the household’s optimal consumption plan (𝑐𝑐 ∗ , 𝑐𝑐 ′∗ ) adjusts, we break
down the response into income and substitution effects. The substitution effect is found by
looking at the response to the change in the budget constraint gradient with a hypothetical
income transfer that shifts the position of the budget constraint so that the original
indifference curve can be reached. This leaves the household no better or worse off. The
substitution effect obtain in this way captures the pure incentive effect of changes in
interest rates, controlling for whether this makes the household better off or worse off.
The income effect is the response to removing the hypothetical income transfer used to
derive the substitution effect. This holds the gradient of the budget constraint constant, so
it is a response to a parallel shift of the budget constraint. The income effect thus captures
the household’s response to being made better off or worse off by the interest rate change.
Figure 3.15: The lifetime budget constraint with a higher interest rate
Figure 3.17 analyses the effects of higher interest rates on a saver. A saver is a household
that initially chooses a consumption plan to the left of the endowment point 𝐸𝐸. The increase
in 𝑟𝑟 pivots the budget constraint around 𝐸𝐸, making it steeper. The household can now reach
a higher indifference curve, so savers are made better off by higher interest rates because
they earn a higher return on their financial wealth. The substitution effect is found by
making a parallel downward shift of the true budget constraint to where it is tangent to the
original indifference curve. This leaves the household no better off. Even so, the household
still has an incentive to shift towards less consumption in the current period and more in the
future because the relative price of current consumption has risen. This substitution effect is
labelled 𝑆𝑆𝑆𝑆 in the diagram. It is movement north-west to where the indifference curve is
steeper.
97
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The income effect is found by removing the parallel downward shift of the true budget
constraint used to derive the substitution effect. Thus, the income effect is the response to
a parallel upward shift of the budget constraint. Since both current and future consumption
are normal goods, the income effect leads the saver to increase both 𝑐𝑐 and 𝑐𝑐′. Intuitively,
the household wants to spread out over time its gains from higher interest rates. The
income effect is labelled 𝐼𝐼𝐼𝐼 in the diagram and is a movement in a north-east direction.
Figure 3.18 performs the same exercise for a borrower. This is a household that initially
chooses a consumption plan to the right of the endowment point. When 𝑟𝑟 rises, the budget
constraint becomes steeper, pivoting around 𝐸𝐸. The initial consumption plan is no longer
affordable, so the borrower is made worse off when access to credit becomes more
expensive. While the hypothetical income transfer used for the substitution effect (𝑆𝑆𝑆𝑆) now
shifts the true budget constraint upwards, the analysis is qualitatively the same as for the
saver. The 𝑆𝑆𝑆𝑆 is a north-west movement in the diagram with lower 𝑐𝑐 and higher 𝑐𝑐′. On the
other hand, the income effect (𝐼𝐼𝐼𝐼) is the response to a parallel downward shift of the
budget constraint. The 𝐼𝐼𝐼𝐼 is therefore a south-west movement that reduces both 𝑐𝑐 and 𝑐𝑐′.
98
EC2065 Macroeconomics | Chapter 3: Aggregate demand
For savers, the income and substitution effects of 𝑟𝑟 on current consumption 𝑐𝑐 go in opposite
directions so the overall response is ambiguous. The diagram shows the special case where
𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 exactly cancel out. For future consumption 𝑐𝑐′, 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 are reinforcing, so 𝑐𝑐′
rises unambiguously. For borrowers, the 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 are reinforcing for 𝑐𝑐, which
unambiguously declines, while 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 are conflicting for 𝑐𝑐′ and the overall response is
ambiguous. The diagram shows the special case where 𝑆𝑆𝑆𝑆 and 𝐼𝐼𝐼𝐼 exactly cancel out for 𝑐𝑐′.
99
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Figure 3.19 shows the percentage deviations from trend of consumption of durables
alongside those of real GDP. An important aspect of the behaviour of aggregate
consumption is that it is less volatile than aggregate income, which motivated our
emphasis on consumption smoothing in the dynamic consumption choice model.
However, durables consumption is much more volatile than income, so we do not
observe smoothing of households’ expenditure on durable goods. This is not
surprising, though, because durable goods do not need to be purchased all the time to
enjoy the benefit of using them.
Figure 3.18: Consumption of durable goods
100
EC2065 Macroeconomics | Chapter 3: Aggregate demand
However, while aggregate consumption is less volatile than income, it is not much less
volatile. Although we cannot investigate this issue quantitatively in the simple two-period
model, for very short-term fluctuations in income relative to the whole future over which
101
EC2065 Macroeconomics | Chapter 3: Aggregate demand
To understand this point, consider the following example. The economy is comprised of 𝑁𝑁
representative households each with exogenous income 𝑦𝑦. Since income is treated as
exogenous, this implicitly assumes no adjustment of households’ supply of labour.
Furthermore, we ignore the use of capital goods in production, which also means
investment is zero (𝐼𝐼 = 0). Real GDP in the economy is 𝑌𝑌 = 𝑁𝑁𝑁𝑁, which is exogenous here.
The government’s fiscal policy is an exogenous level of expenditure is 𝐺𝐺.
Households choose a consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) as in the two-period model. Given 𝑐𝑐,
aggregate consumption is 𝐶𝐶 = 𝑁𝑁𝑁𝑁 because all 𝑁𝑁 households make the same choices given
that they have the same preferences and face the same income and interest rates.
Suppose the economy is closed, so market clearing in the goods market requires 𝑌𝑌 = 𝐶𝐶 + 𝐺𝐺.
The aggregate supply of goods, as given by the economy’s real GDP, must equal aggregate
expenditure 𝐶𝐶 + 𝐺𝐺 (there is no investment or net exports). With 𝑌𝑌 and 𝐺𝐺 being exogenous,
equilibrium is reached through adjustment of the real interest rate 𝑟𝑟. The equilibrium real
interest rate 𝑟𝑟 ∗ ensures that the consumption plan chosen by the representative household
is 𝑐𝑐 ∗ = (𝑌𝑌 − 𝐺𝐺)⁄𝑁𝑁 and 𝑐𝑐 ′∗ = (𝑌𝑌 ′ − 𝐺𝐺 ′ )⁄𝑁𝑁, which are equivalent to goods-market clearing
in the current and future time periods.
Figure 3.22 below shows how 𝑟𝑟 ∗ is determined graphically. At an arbitrary real interest rate
𝑟𝑟, the lifetime budget constraint of the representative household has gradient −(1 + 𝑟𝑟) and
passes through the point ((𝑌𝑌 − 𝐺𝐺)⁄𝑁𝑁 , (𝑌𝑌 ′ − 𝐺𝐺 ′ )⁄𝑁𝑁), labelled 𝐸𝐸. A detailed justification of
102
EC2065 Macroeconomics | Chapter 3: Aggregate demand
the second of these claims is found in Section 4.1 but, for now, just note that the
government could pay for its expenditure using lump-sum taxes 𝑡𝑡 = 𝐺𝐺/𝑁𝑁 and 𝑡𝑡 ′ = 𝐺𝐺 ′ /𝑁𝑁
levied on each household.
The optimal consumption plan (𝑐𝑐 ∗ , 𝑐𝑐 ′∗ ) for a given 𝑟𝑟 will generally not be at
((𝑌𝑌 − 𝐺𝐺 )⁄𝑁𝑁 , (𝑌𝑌 ′ − 𝐺𝐺 ′ )⁄𝑁𝑁), which means that interest rate is not the equilibrium interest
rate. There is a particular interest rate 𝑟𝑟 ∗ at which each household willingly chooses
((𝑌𝑌 − 𝐺𝐺)⁄𝑁𝑁 , (𝑌𝑌 ′ − 𝐺𝐺 ′ )⁄𝑁𝑁). Since the optimal consumption plan is at a tangency point
between an indifference curve and the lifetime budget constraint, the value of 1 + 𝑟𝑟 ∗ is
pinned down by the gradient of the indifference curve passing through the point
((𝑌𝑌 − 𝐺𝐺 )⁄𝑁𝑁 , (𝑌𝑌 ′ − 𝐺𝐺 ′ )⁄𝑁𝑁).
Figure 3.21: Market-clearing real interest rate
103
EC2065 Macroeconomics | Chapter 3: Aggregate demand
This is not consistent with equilibrium. In order for the chosen consumption plan to move
horizontally to the left as much as the lifetime budget constraint shifts, the real interest
rate must rise so that the budget constraint is tangent to the indifference curve passing
through that point. The indifference curve is steeper here than at the initial consumption
plan because of the diminishing marginal rate of substitution. Therefore, the equilibrium
real interest rate will need to rise to achieve equilibrium in the goods market.
104
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Intuitively, after the negative supply shock, at an unchanged real interest rate,
households would smooth consumption by borrowing against unchanged future
income. However, all households are hit by the shock in this example, so all want to
increase borrowing and none wants to save more. The interest rate must rise to
dissuade households from borrowing. More generally, as seen later in economies
where investment or net exports can adjust, the shock causes investment or net
exports to fall in addition to, or in place of, the rise in the equilibrium real interest rate.
3.7 Investment
In the national accounts, investment comprises:
• Business fixed investment: firms purchasing new capital for use over time in
producing goods and services
• Residential investment: purchases of new housing units
• Inventory investment: changes in value of stock of finished goods.
In the USA, investment is typically 15–20 per cent of GDP. Of this total amount of
investment, business fixed investment is around two thirds, residential investment is around
one third, and inventory investment is small but volatile.
Figure 3.24 shows a time series of detrended investment in the USA alongside detrended
real GDP. We see that investment is procyclical, approximately coincident with GDP but is
much more volatile. The percentage deviations of investment from its trend are around 3–4
times larger than those of GDP. This is very different from the pattern of consumption
expenditure (except durables, which behaves more like investment) being less volatile than
GDP.
Figure 3.23: US investment over the business cycle
105
EC2065 Macroeconomics | Chapter 3: Aggregate demand
𝐾𝐾 ′ = 𝐼𝐼 + (1 − 𝑑𝑑 )𝐾𝐾
The future capital stock is the sum of undepreciated current capital (1 − 𝑑𝑑 )𝐾𝐾 and
investment 𝐼𝐼. Investment is purchases of new capital goods, and we treat capital goods as
equivalent to any other goods for simplicity. This means a new unit of capital costs one unit
of goods (its price is 1), and in principle, undepreciated capital goods can be sold off and
used in the same way as newly produced goods.
As in the Solow model, capital depreciates at a constant rate 𝑑𝑑 over time. Note also that the
current capital stock 𝐾𝐾 is determined by decisions made in the past and is not affected by
current investment. Since the model has only two time periods, it is assumed firms sell off
any undepreciated capital (1 − 𝑑𝑑 )𝐾𝐾 ′ that remains after production in the future time
period.
106
EC2065 Macroeconomics | Chapter 3: Aggregate demand
hence, current and future real costs are 𝑤𝑤𝑤𝑤 and 𝑤𝑤′𝑁𝑁′. Real revenues are the same as
production 𝑌𝑌 and 𝑌𝑌′ because all goods are homogeneous. Real profits are thus given by:
𝜋𝜋 = 𝑌𝑌 − 𝑤𝑤𝑤𝑤 and 𝜋𝜋 ′ = 𝑌𝑌 ′ − 𝑤𝑤 ′ 𝑁𝑁′
Note that these profits implicitly include the return on capital owned by the firm, so the
argument in Section 1.2 that perfect competition and a constant-returns-to-scale
production function push profits to zero does not apply here. Observe that the cost of any
investment 𝐼𝐼 is not deducted from profits because purchases of capital mean acquisition of
an asset that is not immediately used in current production. As profits are gross profits,
there is also no deduction of any cost of depreciation. However, both the cost of purchasing
capital and depreciation will be relevant when analysing the optimal level of investment.
107
EC2065 Macroeconomics | Chapter 3: Aggregate demand
First, suppose a firm finances investment using borrowing. The present discounted value of
the dividends it distributes to shareholders in this case is
𝑧𝑧 ′ 𝐹𝐹 (𝐾𝐾 ′ , 𝑁𝑁 ′ ) − 𝑤𝑤 ′ 𝑁𝑁 ′ + (1 − 𝑑𝑑 )𝐾𝐾 ′ − (1 + 𝑟𝑟𝑙𝑙 )(𝐾𝐾 ′ − (1 − 𝑑𝑑 )𝐾𝐾)
𝑉𝑉 = 𝑧𝑧𝑧𝑧 (𝐾𝐾, 𝑁𝑁) − 𝑤𝑤𝑤𝑤 +
1 + 𝑟𝑟
The first-order condition to maximise 𝑉𝑉 with respect to 𝐾𝐾 ′ is:
𝑀𝑀𝑃𝑃𝐾𝐾′ + (1 − 𝑑𝑑 ) − (1 + 𝑟𝑟𝑙𝑙 )
=0
1 + 𝑟𝑟
This can be rearranged to obtain the following condition for the optimal level of investment:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = 𝑟𝑟𝑙𝑙
Alternatively, suppose that retained earnings are available and used to finance investment
instead. The present discounted value of dividends is:
𝑧𝑧 ′ 𝐹𝐹 (𝐾𝐾 ′ , 𝑁𝑁 ′ ) − 𝑤𝑤 ′ 𝑁𝑁 ′ + (1 − 𝑑𝑑 )𝐾𝐾 ′
𝑉𝑉 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) − 𝑤𝑤𝑤𝑤 − (𝐾𝐾 ′ − (1 − 𝑑𝑑 )𝐾𝐾 ) +
1 + 𝑟𝑟
The first-order condition to maximise 𝑉𝑉 with respect to 𝐾𝐾′ is:
𝑀𝑀𝑃𝑃𝐾𝐾′ + (1 − 𝑑𝑑 )
−1 + =0
1 + 𝑟𝑟
This yields an equation for the optimal investment decision:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = 𝑟𝑟
In all cases, the optimal investment decision equates the marginal benefit and marginal cost
of investment. More investment allows the firm to hold an extra unit of capital. The
marginal benefit of this is the extra production of future output it makes possible, which is
worth 𝑀𝑀𝑃𝑃𝐾𝐾′ , the marginal product of capital in the future.
108
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The marginal cost of holding an extra unit of capital includes the loss of a fraction 𝑑𝑑 of the
capital through depreciation. There is also the financing cost of holding more capital. If the
funds to purchase it are borrowed, the financing cost is the interest rate 𝑟𝑟𝑙𝑙 on the loan. If
the funds come from retained earnings, the financing cost is an opportunity cost to the
firm’s shareholders. Funds are returned to shareholders in the future period rather than in
the current period and these funds could have earned shareholders a return 𝑟𝑟 if received
and saved.
The Inada conditions for a neoclassical production function imply that 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 is negative
for very high levels of investment, so 𝐼𝐼 𝑑𝑑 cuts the horizontal axis and falls as low as 𝑟𝑟 = −𝑑𝑑.
Observe also that 𝑀𝑀𝑃𝑃𝐾𝐾′ is finite if investment were zero, even though the Inada conditions
hold. This is because there is existing capital, so 𝐾𝐾′ is not zero even if there were no
109
EC2065 Macroeconomics | Chapter 3: Aggregate demand
investment currently. In principle, investment 𝐼𝐼 could be negative, which would mean selling
off capital goods for other uses. However, much investment may be irreversible in practice,
hence, this possibility is ignored when drawing the investment demand curve.
110
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Observe that Tobin’s Q is greater than 1 when 𝑀𝑀𝑃𝑃𝐾𝐾′ + 1 − 𝑑𝑑 > 1 + 𝑟𝑟, which is
equivalent to 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 > 𝑟𝑟 and would mean investment is too low according to the
theory developed in Section 3.8. If firms were to invest up to the point where 𝑀𝑀𝑃𝑃𝐾𝐾′ −
𝑑𝑑 = 𝑟𝑟, this would mean the same as investing until Tobin’s Q has declined to 1. Thus, the
theory of investment developed earlier is equivalent to the prediction that investment is
higher the further Tobin’s Q is above 1.
In principle, Tobin’s Q is measurable and can be compared to data on investment. In
practice, for many modern firms, intangible capital such as patents and brand reputation
has become more important than physical capital. The replacement cost of intangible
capital is harder to quantify, and makes measurement of the denominator of Tobin’s Q
challenging.
112
EC2065 Macroeconomics | Chapter 3: Aggregate demand
Let 𝑟𝑟 denote the equilibrium after-tax return that investors receive, which we will take as
given here. In the absence of tax, investment would be determined by 𝑅𝑅 − 𝑑𝑑 = 𝑟𝑟. When
the tax on capital income is 𝜏𝜏, firms invest in capital up to the point where:
(1 − 𝜏𝜏)(𝑅𝑅 − 𝑑𝑑 ) = 𝑟𝑟
Here, the cost of depreciation reduces the tax liability, although we can also do the
analysis under the assumption that gross capital income is taxed. Given the required
after-tax real rate of return 𝑟𝑟 on capital, investment and the stock of capital are
determined by:
𝑟𝑟
𝑀𝑀𝑃𝑃𝐾𝐾 − 𝑑𝑑 =
1 − 𝜏𝜏
Assume the production function is 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁), which is 𝑦𝑦 = 𝑧𝑧𝑧𝑧(𝑘𝑘) in per worker
terms. The marginal product of capital is 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑧𝑧′(𝑘𝑘). In a competitive economy, the
real wage received by workers is 𝑤𝑤 = 𝑀𝑀𝑃𝑃𝑁𝑁 . A neoclassical production function has the
feature that 𝑌𝑌 = 𝑀𝑀𝑃𝑃𝐾𝐾 𝐾𝐾 + 𝑀𝑀𝑃𝑃𝑁𝑁 𝑁𝑁, which implies 𝑤𝑤 = 𝑦𝑦 − 𝑀𝑀𝑃𝑃𝐾𝐾 𝑘𝑘 = 𝑧𝑧�𝑓𝑓(𝑘𝑘) − 𝑘𝑘𝑓𝑓 ′ (𝑘𝑘)�.
Taking a given positive value of 𝑟𝑟, a higher tax rate 𝜏𝜏 raises 𝑟𝑟⁄(1 − 𝜏𝜏) and thus 𝑀𝑀𝑃𝑃𝐾𝐾 =
𝑧𝑧𝑧𝑧′(𝑘𝑘) must be higher in equilibrium. Since 𝑓𝑓′(𝑘𝑘) is decreasing in 𝑘𝑘, this means there will
be less investment and lower steady-state 𝑘𝑘. The tax thus reduces output per worker 𝑦𝑦 =
𝑧𝑧𝑧𝑧(𝑘𝑘) and also lowers wages 𝑤𝑤 = 𝑀𝑀𝑃𝑃𝑁𝑁 because 𝑀𝑀𝑃𝑃𝑁𝑁 is lower when there is less capital
per worker 𝑘𝑘. This can be shown by noting 𝑤𝑤 = 𝑧𝑧(𝑓𝑓(𝑘𝑘) − 𝑘𝑘𝑓𝑓 ′ (𝑘𝑘)) is increasing in 𝑘𝑘 since
𝑓𝑓 ′′ (𝑘𝑘) < 0.
Suppose capital income is taxed and used to subsidise the wage income of workers. More
generally, we could ask whether capital income or wages should be taxed to finance
public expenditure. The amount of tax raised per worker is 𝜏𝜏(𝑅𝑅 − 𝑑𝑑 )𝐾𝐾⁄𝑁𝑁 =
𝜏𝜏(𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) − 𝑑𝑑 )𝑘𝑘, so the wage including the subsidy is 𝑤𝑤 + 𝜏𝜏(𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) − 𝑑𝑑 )𝑘𝑘. Using 𝑤𝑤 =
𝑧𝑧(𝑓𝑓 (𝑘𝑘) − 𝑘𝑘𝑓𝑓 ′ (𝑘𝑘)) the wage plus the subsidy is:
𝑧𝑧𝑧𝑧(𝑘𝑘) − 𝑘𝑘𝑘𝑘𝑓𝑓 ′ (𝑘𝑘) + 𝜏𝜏(𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) − 𝑑𝑑 )𝑘𝑘 = 𝑧𝑧𝑧𝑧 (𝑘𝑘) − 𝑘𝑘 (1 − 𝜏𝜏)𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) − 𝜏𝜏𝜏𝜏𝜏𝜏
With 𝑘𝑘 determined by 𝑧𝑧𝑧𝑧′(𝑘𝑘) = 𝑑𝑑 + 𝑟𝑟⁄(1 − 𝜏𝜏), the subsidised wage is given by the
formula 𝑧𝑧𝑧𝑧(𝑘𝑘) − (1 − 𝜏𝜏)𝑑𝑑𝑑𝑑 − 𝑟𝑟𝑟𝑟 − 𝜏𝜏𝜏𝜏𝜏𝜏 = 𝑧𝑧𝑧𝑧(𝑘𝑘) − (𝑟𝑟 + 𝑑𝑑 )𝑘𝑘.
Taking account of the capital tax’s effect on investment, each worker receives 𝑧𝑧𝑧𝑧(𝑘𝑘) −
(𝑟𝑟 + 𝑑𝑑 )𝑘𝑘, where 𝑘𝑘 depends on 𝜏𝜏. This is maximised over 𝑘𝑘 where 𝑀𝑀𝑃𝑃𝐾𝐾 = 𝑧𝑧𝑓𝑓 ′ (𝑘𝑘) = 𝑟𝑟 +
𝑑𝑑. But since 𝑘𝑘 must satisfy 𝑀𝑀𝑃𝑃𝐾𝐾 − 𝑑𝑑 = 𝑟𝑟/(1 − 𝜏𝜏) given 𝜏𝜏, the value of 𝜏𝜏 that is in the
interests of workers is 𝜏𝜏 = 0. Even though the revenue from taxing capital is directly
redistributed to workers, workers prefer no capital tax. This is because the negative effect
of the capital tax on investment and wages is too large.
One important assumption in this analysis is that there is a perfectly elastic supply of
funds for investment at a constant required rate of return 𝑟𝑟, which makes the economy’s
supply of capital perfectly elastic. This makes capital the exact opposite of land, which is
perfectly inelastic in supply.
113
EC2065 Macroeconomics | Chapter 3: Aggregate demand
We have also studied the trade-off between 𝑐𝑐 and 𝑐𝑐′, holding 𝑙𝑙 and 𝑙𝑙′ fixed (which is
equivalent to fixing incomes 𝑦𝑦 and 𝑦𝑦′). The relative price is 1 + 𝑟𝑟 and the optimality
condition is 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐 ′ = 1 + 𝑟𝑟. So far, the optimality conditions have been derived and
114
EC2065 Macroeconomics | Chapter 3: Aggregate demand
explained earlier. The new condition we will look at comes from the trade-off between
current leisure 𝑙𝑙 and future leisure 𝑙𝑙′.
Sacrificing leisure in the current period (reducing 𝑙𝑙) increases labour supply (higher 𝑁𝑁 𝑠𝑠 ) and
raises current income by the wage 𝑤𝑤 for each hour of leisure given up. This income could be
spent now on consumption but it could also be saved. By saving the extra wage 𝑤𝑤, future
financial assets are increased by (1 + 𝑟𝑟)𝑤𝑤. This extra wealth could be spent on consumption
but it could also be used to reduce labour supply in future by replacing future wage income.
For example, earlier retirement with no loss of future consumption becomes possible if the
household has more financial assets.
Mathematically, holding 𝑐𝑐 and 𝑐𝑐′ constant, a reduction of 𝑙𝑙 by one unit means that 𝑙𝑙′ can
increase by (1 + 𝑟𝑟)𝑤𝑤/𝑤𝑤′. Hence, (1 + 𝑟𝑟)𝑤𝑤/𝑤𝑤′ is the relative price of the household’s
current time in terms of its future time and this relative price matters for how much it is
optimal to work now compared to working in the future period. The relative price increases
when the current wage 𝑤𝑤 rises relative to the future wage 𝑤𝑤′, or when the real interest rate
𝑟𝑟 increases. The household’s optimal plan must feature:
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + 𝑟𝑟)𝑤𝑤/𝑤𝑤′
Figure 3.27 draws indifference curves and a lifetime budget constraint in a diagram with
current leisure 𝑙𝑙 on the horizontal axis and future leisure 𝑙𝑙′ on the vertical axis. The range is
limited to leisure between 0 and the maximum available time ℎ where the household would
not participate in the labour market. All other variables 𝑐𝑐 and 𝑐𝑐′ are held fixed. The budget
constraint is downward sloping and has gradient −(1 + 𝑟𝑟)𝑤𝑤/𝑤𝑤′. The diagram shows the
case where it is feasible to have one of, but not both, 𝑙𝑙 and 𝑙𝑙′ equal to ℎ, so the household
could choose not to participate in the labour market in one of the periods. However, given
the indifference curves as shown, the household chooses to participate in both periods to
some extent.
Figure 3.26: Optimal timing of leisure/labour supply
115
EC2065 Macroeconomics | Chapter 3: Aggregate demand
The optimality condition 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + 𝑟𝑟)𝑤𝑤/𝑤𝑤′ implies the real interest rate 𝑟𝑟 changes
optimal labour supply in the same direction as the real wage 𝑤𝑤. A higher 𝑟𝑟 raises the
incentive for work because the extra income obtained can be saved to earn a higher return.
Or in other words, a higher 𝑟𝑟 increases the incentive to save and one way to do this is to
earn more and not to spend it. Since 𝑟𝑟 matters for the labour supply decision, it follows that
the labour supply curve 𝑁𝑁 𝑠𝑠 shifts when 𝑟𝑟 changes. Where labour supply increases with the
real wage 𝑤𝑤, the upward-sloping 𝑁𝑁 𝑠𝑠 curve shifts to the right when 𝑟𝑟 rises.
• Goods market
• Labour market
• Bond market.
For now, we will work with the case of a closed economy. Section 9.1 shows how to adapt
our dynamic macroeconomic model to an open economy.
The consequence of the representative household assumption is that all households will
ultimately make the same choices, even if they act individually. With 𝑁𝑁 households in the
economy, aggregate consumption is 𝐶𝐶 = 𝑁𝑁𝑁𝑁, where 𝑐𝑐 is an individual household’s
consumption choice. The same is true for other aggregates, which are simply scaled-up
versions of each individual’s choices and we will often ignore the distinction between
individual and aggregate variables. While we cannot use the representative-household
framework to study how inequality affects the economy, it is nonetheless a useful starting
point for many other issues.
One simplification that comes from there being a representative household in a closed
economy is that income effects from changes in the wage 𝑤𝑤 or interest rate 𝑟𝑟 can be
ignored. All else being equal, a higher wage makes households better off as workers but
worse off as owners of firms and these two effects cancel out. All else being equal, a higher
116
EC2065 Macroeconomics | Chapter 3: Aggregate demand
interest rate makes households better off as savers but worse off as owners of firms with
higher borrowing costs and worse off as taxpayers if the government has higher borrowing
costs.
With no income effects coming from 𝑤𝑤 or 𝑟𝑟, there are only substitution effects to consider.
A higher wage 𝑤𝑤 implies an incentive to supply more labour, so the labour supply curve 𝑁𝑁 𝑠𝑠
is upward sloping. A higher interest rate 𝑟𝑟 increases saving, which lowers the consumption
demand 𝐶𝐶 𝑑𝑑 . The left panel of Figure 3.28 depicts a downward-sloping consumption demand
curve with 𝑟𝑟 on the vertical axis and 𝐶𝐶 on the horizontal axis. Higher 𝑟𝑟 also increases labour
supply as discussed in the analysis of the dynamics of labour supply. This means the labour
supply curve 𝑁𝑁 𝑠𝑠 shifts to the right when 𝑟𝑟 increases as shown in the right panel of the
figure.
Figure 3.27: Consumption demand and labour supply
It is important to note this logic does not mean all income effects can be neglected. Often,
the exogenous cause of a change in 𝑤𝑤 or 𝑟𝑟 will make households better off or worse off
overall. For example, higher productivity (TFP 𝑧𝑧 or 𝑧𝑧′) makes households better off, raising
either wages, profits, or some combination of the two. The argument is that conditional on
the level of TFP, households are not made better off overall by higher wages. Another
example is when the tax burden falls, which makes households better off, all else equal. It is
not always possible to infer from the direction of change in GDP whether households are
better off or not, as we will see in later examples.
Since consumption and leisure are both normal goods, anything that makes households
better off overall leads to a ‘wealth effect’ that increases consumption demand and reduces
labour supply. These wealth effects shift the 𝐶𝐶 𝑑𝑑 curve to the right and the 𝑁𝑁 𝑠𝑠 curve to the
left. If households are made worse off, the wealth effects are lower consumption demand
𝐶𝐶 𝑑𝑑 and higher labour supply 𝑁𝑁 𝑠𝑠 .
3.11.2 Firms
We assume a representative firm. All firms share the same neoclassical production function
with the same total factor productivity. Firms own the capital they use and all have the
same capital stock, so there is no need to consider a rental market for capital. Investment is
117
EC2065 Macroeconomics | Chapter 3: Aggregate demand
financed by issuing bonds. Firms are owned by the representative household. There is no
need for trade in shares, so we do not consider the stock market in the dynamic
macroeconomic model.
3.11.3 Government
We will analyse the government’s fiscal policies further in Chapter 4. At this stage we simply
note that the government chooses current and future public expenditure 𝐺𝐺 and 𝐺𝐺′, which is
part of the economy’s demand for goods and services along with private expenditure 𝐶𝐶 𝑑𝑑
and 𝐼𝐼 𝑑𝑑 on consumption and investment. The government levies the same lump-sum taxes 𝑡𝑡
and 𝑡𝑡′ on all households, and finances any budget deficit by issuing bonds.
The equilibrium level of employment 𝑁𝑁 ∗ in the labour market has implications for the goods
market through the amount of goods and services firms produce. The production function is
𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁). Given exogenous TFP 𝑧𝑧 and a predetermined current capital stock 𝐾𝐾, firms’
output of goods 𝑌𝑌 rises or falls with employment. The right panel of the figure depicts the
production function and shows how the supply of goods 𝑌𝑌 ∗ is determined by equilibrium
employment 𝑁𝑁 ∗ .
In the goods market, firms’ supply curve for goods is derived from labour-market
equilibrium. The supply of output comes from the production function 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁 ∗ ) given
118
EC2065 Macroeconomics | Chapter 3: Aggregate demand
the equilibrium level of employment 𝑁𝑁 ∗ in the labour market. The derivation of the output
supply curve 𝑌𝑌 𝑠𝑠 is shown in Figure 3.30. The right panel is the goods market diagram where
the quantity of output (real GDP 𝑌𝑌) is on the horizontal axis and the real interest rate 𝑟𝑟 is on
the vertical axis. The interest rate is relevant because 1 + 𝑟𝑟 is relative price of current goods
in terms of future goods.
A higher real interest rate 𝑟𝑟 shifts 𝑁𝑁 𝑠𝑠 (𝑟𝑟) to the right as shown in the left panel. This
increases employment 𝑁𝑁 ∗ leading to a movement up the production function in the middle
panel, which raises the supply of output 𝑌𝑌 𝑠𝑠 . This explains the upward-sloping 𝑌𝑌 𝑠𝑠 curve seen
in the right panel.
Figure 3.29: Derivation of the output supply curve
The analysis of the goods market is completed by deriving the demand curve. This sums up:
• Consumption demand 𝐶𝐶 𝑑𝑑
• Investment demand 𝐼𝐼 𝑑𝑑
• Government expenditure 𝐺𝐺.
The equation describing the aggregate demand 𝑌𝑌 𝑑𝑑 for goods and services is:
𝑌𝑌 𝑑𝑑 = 𝐶𝐶 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺
Consumption demand and investment demand depend negatively on the real interest rate
𝑟𝑟. Summing up the components of aggregate demand implies a downward-sloping demand
curve for goods as shown in Figure 3.31.
119
EC2065 Macroeconomics | Chapter 3: Aggregate demand
With upward-sloping 𝑌𝑌 𝑠𝑠 and downward-sloping 𝑌𝑌 𝑑𝑑 curves, the goods market clears through
adjustment of the real interest rate 𝑟𝑟. A higher real interest rate discourages spending on
consumption and investment, which reduces aggregate demand for goods. A higher real
interest rate also encourages greater labour supply so households can save more, which
raises employment and production. Figure 3.32 shows the equilibrium real interest rate 𝑟𝑟 ∗
where the 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 curves intersect. In this model, notice that the real interest rate is
determined by competitive markets, not by the central bank’s monetary policy. This is
different in the sticky-price New Keynesian model we will study in Chapter 8.
Figure 3.31: Goods market equilibrium
The intersection of the 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 curves is not only a point of goods-market equilibrium. It
is also a labour-market equilibrium because every point on 𝑌𝑌 𝑠𝑠 represents equilibrium in the
labour market by construction. That covers two of the three markets in the dynamic
macroeconomic model. What about the remaining market, the bond market?
It might be expected the interest rate 𝑟𝑟 would adjust to achieve bond-market equilibrium
but it must be at 𝑟𝑟 = 𝑟𝑟 ∗ to clear the goods market. However, it turns out that the bond
market is in equilibrium once the labour and goods markets are both in equilibrium. This
120
EC2065 Macroeconomics | Chapter 3: Aggregate demand
follows from the logic of budget constraints. Spending (demand) in markets other than the
bond market is balanced by earnings (supply) in those markets, so budget constraints then
imply a balance between spending and earning in final market, which is equivalent to
demand equals supply in that market. This result is known as Walras’ law and consequently,
we do not need to consider the bond market separately.
The complete dynamic macroeconomic model can then be summarised by the labour-
market and goods-market diagrams shown in Figure 3.33.
With a leftward shift of 𝑌𝑌 𝑑𝑑 and a rightward shift of 𝑌𝑌 𝑠𝑠 , the equilibrium real interest rate
declines unambiguously. The impact on the other variables 𝑌𝑌, 𝑁𝑁, and 𝑤𝑤 is ambiguous but
this analysis helps to explain the observation of persistently low real interest rates.
121
EC2065 Macroeconomics | Chapter 3: Aggregate demand
122
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Essential reading
• Williamson, Chapter 10.
A key point in the analysis of taxation is that changes in taxes – if not matched by changes in
government expenditure – affect the government’s budget deficit and, in turn, the level of
government debt. The need to repay government debt in the future if a default is to be
avoided has implications for the level of future taxes. These considerations lead us to the
idea that the government itself has a budget constraint.
We will begin by analysing taxes and government debt using the two-period model from
Section 3.3 that was central to our study of households’ consumption decisions. That model
has a current period and a future period.
Government expenditure, or public expenditure, refers to purchases of goods and services
by the government, irrespective of whether these are produced by private firms or are the
result of the government’s own production. This does not include transfer payments, for
example, those associated with the welfare state. Government expenditure in the current
123
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
period is 𝐺𝐺 and 𝐺𝐺′ in the future period. These variables are in real terms, as are all variables
in this chapter.
Suppose the government collects tax revenue 𝑇𝑇 in the current period and expects to collect
𝑇𝑇 ′ in the future. Any transfer payments where the government makes payments to others
rather than receiving tax revenue are counted as negative values of tax revenue.
The government does not have to match current expenditure 𝐺𝐺 and taxes 𝑇𝑇: it can run a
budget deficit or surplus. The government’s budget deficit is defined as 𝐵𝐵 = 𝐺𝐺 − 𝑇𝑇, with a
negative value of 𝐵𝐵 denoting a surplus. If the government runs a deficit it borrows by issuing
bonds with real interest rate 𝑟𝑟. We assume the government begins from a blank slate at the
beginning of the current period with no initial debt outstanding. This means the total stock
of government debt is 𝐵𝐵 and an amount (1 + 𝑟𝑟)𝐵𝐵 including interest falls due for repayment
in the future.
In the future period, the government must repay (1 + 𝑟𝑟)𝐵𝐵 otherwise it defaults on its debt.
The analysis here assumes the government debt is in the form of inflation-indexed bonds
paying a known real interest rate 𝑟𝑟. Later in 0 where money is studied, we will see how
governments can print money and how a ‘soft’ default can occur through inflation. Here, we
can still make our equations consistent with the possibility of a ‘hard’ default by considering
a default as a special form of wealth tax applying to holdings of government bonds.
The budget constraint on the government in the future period is therefore that the budget
surplus 𝑇𝑇 ′ − 𝐺𝐺′ is sufficient to cover the repayment (1 + 𝑟𝑟)𝐵𝐵, because in a two-period
model there is no possibility of rolling over the debt any longer. In accounting for the
government’s interest payments 𝑟𝑟𝑟𝑟, it is possible to distinguish between the primary
budget surplus 𝑇𝑇 ′ − 𝐺𝐺′ that excludes the interest cost, and the budget surplus 𝑇𝑇 − 𝐺𝐺 ′ − 𝑟𝑟𝑟𝑟
that deducts the interest cost.
Dividing both sides of the budget constraint 𝑇𝑇′ − 𝐺𝐺′ = (1 + 𝑟𝑟)𝐵𝐵 by 1 + 𝑟𝑟 implies a
constraint on the amount of bonds the government can issue in the current period:
𝑇𝑇 ′ − 𝐺𝐺 ′
𝐵𝐵 =
1 + 𝑟𝑟
Substituting the budget deficit definition 𝐵𝐵 = 𝐺𝐺 − 𝑇𝑇 yields a present-value constraint:
𝑇𝑇′ 𝐺𝐺 ′
𝑇𝑇 + = 𝐺𝐺 +
1 + 𝑟𝑟 1 + 𝑟𝑟
This equation states that the government must raise tax revenue across the two periods of a
present value sufficient to cover the present value of all government expenditure across the
two periods.
We now want to consider how taxes and government debt affects private expenditure in
the economy. We start by supposing all taxes are lump-sum taxes levied on households. A
lump-sum tax is one where the amount of tax paid by a household does not depend on the
household’s behaviour. Most taxes do not have this form, for example, income taxes or
sales taxes. We consider later taxes where revenue raised depends on choices.
124
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
We also make an assumption that the government is not redistributing when levying taxes
on households. This means that everyone shares the tax burden equally. Assume there are
𝑁𝑁 households in the economy and these households remain unchanged over the two
periods (later we will consider what happens if new generations are born). If 𝑡𝑡 and 𝑡𝑡′ are the
lump-sum amounts of tax collected from a household in the current and future periods then
an equal distribution of taxes means 𝑡𝑡 = 𝑇𝑇/𝑁𝑁 and 𝑡𝑡 ′ = 𝑇𝑇 ′ /𝑁𝑁.
What are the implications of the government’s tax policy for household budget constraints
under the assumptions made so far? We have seen that a household’s present value of
consumption must be equal to its lifetime wealth ℎ or present value of income after tax:
𝑦𝑦 ′ − 𝑡𝑡 ′ 𝑦𝑦′ 𝑡𝑡 ′
ℎ = 𝑦𝑦 − 𝑡𝑡 + = �𝑦𝑦 + � − �𝑡𝑡 + �
1 + 𝑟𝑟 1 + 𝑟𝑟 1 + 𝑟𝑟
The second expression for ℎ above breaks down lifetime wealth into the present value of
pre-tax income and the present value of the taxes a household will face. Using government
budget constraint and the equal distribution of taxes 𝑡𝑡 = 𝑇𝑇/𝑁𝑁 and 𝑡𝑡 ′ = 𝑇𝑇 ′ /𝑁𝑁:
𝑡𝑡′ 1 𝑇𝑇 ′ 1 𝐺𝐺 ′
𝑡𝑡 + = �𝑇𝑇 + � = �𝐺𝐺 + �
1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟
It follows that lifetime wealth can be expressed as the present value of pre-tax income net
of a 1/𝑁𝑁th share of the present value of all government expenditure.
𝐺𝐺 ′
𝑦𝑦′ 1 𝐺𝐺 ′
𝐺𝐺 𝑦𝑦 ′ −
ℎ = �𝑦𝑦 + � − �𝐺𝐺 + � = 𝑦𝑦 − + 𝑁𝑁
1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟
Note that the exact values of taxes 𝑡𝑡 and 𝑡𝑡′ drop out of the equation. It is therefore
sufficient to know the government’s plans for public expenditure because these determine
the present value of taxes given the government’s budget constraint. The exact combination
of 𝑡𝑡 and 𝑡𝑡′ is not relevant in calculating ℎ.
Since the position of a household’s lifetime budget constraint can be determined by
knowing ℎ and 𝑟𝑟, a corollary is that the position of the budget constraint can known simply
with reference to the present value of pre-tax income and the government’s expenditure
plans. Another way to deduce this is to note that the household’s budget constraint must
always pass through the point (𝑦𝑦 − 𝐺𝐺 ⁄𝑁𝑁 , 𝑦𝑦 ′ − 𝐺𝐺 ′ ⁄𝑁𝑁). Figure 4.1 illustrates the combined
household and government budget constraint.
125
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
In the consumption choice diagram in Figure 4.2, the endowment point on the household
budget constraint moves from 𝐸𝐸1 to 𝐸𝐸2 along an unchanged household budget constraint.
We have seen that the position of this budget constraint depends on pre-tax income and
the government’s spending plans but it does not depend on the exact timing of taxes. If
households can choose any point on the standard lifetime budget constraint then there is
no change to the optimal consumption plan. The tax cut does not increase consumption
demand but instead increases the desire to save. Intuitively, the tax cut now means higher
taxes in the future and households must save more now to sustain their optimal choice of
consumption in the future.
126
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
We have shown that the tax cut does not affect households’ lifetime wealth ℎ. But might it
affect the real interest rate 𝑟𝑟, which determines the gradient of the budget constraint? In
the basic consumption choice model, the real interest rate is exogenous. But we have seen
in Chapter 3 how equilibrium interest rates can be found given exogenous incomes and
optimal consumption choices. Would that analysis of interest rates predict that issuing more
government bonds causes the bond price to fall and the interest rate to rise?
First note that the extra supply of bonds is equal to the size of tax cut. Our analysis above
shows that at any arbitrary real interest rate 𝑟𝑟, the optimal consumption plan (𝑐𝑐 ∗ , 𝑐𝑐 ′∗ ) does
not change following the tax cut, so private saving 𝑠𝑠 = 𝑦𝑦 − 𝑡𝑡 − 𝑐𝑐 also increases by an
amount equal to the tax cut. All else being equal, greater private saving means a larger
demand for bonds. With the supply and demand for bonds increasing by the same amount,
no change in 𝑟𝑟 is required for the extra issuance of bonds to be absorbed by the market.
An equivalent argument points out that with no change in 𝑐𝑐 for an arbitrary 𝑟𝑟 following the
tax cut and no change in 𝑦𝑦 or 𝐺𝐺, goods-market equilibrium 𝑐𝑐𝑐𝑐 + 𝐺𝐺 = 𝑦𝑦𝑦𝑦 still prevails at the
original interest rate 𝑟𝑟. Therefore, neither the position nor the gradient of households’
budget constraint changes after the tax cut.
Although the Ricardian equivalence argument is a useful benchmark for the analysis of fiscal
policy, the argument requires strong assumptions. These include the lump-sum nature of
taxes. When taxes are not lump sum, they can have incentive effects, even if the timing of
taxes does not change the present value of all tax revenue that needs to be collected. Lump-
sum taxes matter only through an income effect but this channel is neutralised by the
government’s budget constraint. Other types of taxes also have substitution effects.
Another assumption is that the tax burden is shared equally among households, so taxes do
not redistribute between different households. We will see examples of how taxes with
distributional effects that affect households’ consumption choices. Different from the
analysis above, there are winners and losers whose consumption adjusts. Depending on the
exact behaviour of these groups, aggregate consumption can depend on tax policy. One
127
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
𝜏𝜏 ′ 𝑐𝑐 ′ 1 𝐺𝐺 ′
𝜏𝜏𝜏𝜏 + = �𝐺𝐺 + �
1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟
128
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
How does this type of tax system affect the Ricardian equivalence argument seen
earlier? Consider a deficit-financed cut of current tax rate 𝜏𝜏 and hold constant the
public expenditure 𝐺𝐺 and 𝐺𝐺 ′ planned in the current and future time periods. Just as
before, the government budget constraint implies the future tax rate 𝜏𝜏 ′ must
adjust.
Reducing the current tax rate 𝜏𝜏 implies the household budget constraint’s
horizontal intercept (𝑦𝑦 + 𝑦𝑦 ′ /(1 + 𝑟𝑟))/(1 + 𝜏𝜏) rises and its gradient −(1 + 𝑟𝑟)(1 +
𝜏𝜏)/(1 + 𝜏𝜏 ′ ) is lower. This is illustrated in Figure 4.2. The reduction in the gradient is
similar to what happens when the interest rate 𝑟𝑟 declines and, in both cases,
current consumption becomes relatively cheaper compared to future consumption.
Figure 4.3: Failure of Ricardian equivalence
𝑐𝑐 ′ 𝑦𝑦 ′ 𝜏𝜏 ′ 𝑐𝑐 ′
𝑐𝑐 + = 𝑦𝑦 + − �𝜏𝜏𝜏𝜏 + �
1 + 𝑟𝑟 1 + 𝑟𝑟 1 + 𝑟𝑟
129
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Notice that the representative household’s choice of consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) also
appears in the government budget constraint because it affects how much tax
revenue is collected from given tax rates. Since all households make the same
consumption choices here, the household and government budget constraints can
be combined:
′ 𝐺𝐺 ′
′
𝑐𝑐 𝐺𝐺 𝑦𝑦 −
𝑐𝑐 + = 𝑦𝑦 − + 𝑁𝑁
1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟
The tax rates 𝜏𝜏 and 𝜏𝜏 ′ have been eliminated and the equation has exactly the same
form as what was obtained with lump-sum taxes. Graphically, the equation is a
straight line with gradient −(1 + 𝑟𝑟) passing through the point (𝑦𝑦 − 𝐺𝐺/𝑁𝑁, 𝑦𝑦 ′ −
𝐺𝐺 ′ /𝑁𝑁). This is not the budget constraint faced by any individual household acting
independently of others but implies that after tax rates have been adjusted to
satisfy the government’s budget constraint, the consumption plan chosen by
households must lie on the same line as it must with a lump-sum tax system.
Considering the deficit-financed cut of current tax rate 𝜏𝜏, the budget constraint
faced by individual households becomes flatter. After taking account of the
adjustment of households’ choices and the future tax rate 𝜏𝜏 ′ needed to satisfy the
government’s budget constraint, the consumption plan must lie on combined
budget constraint and, without any changes to public expenditure 𝐺𝐺 and 𝐺𝐺 ′ , this
line does not shift. Hence, overall, the new consumption plan must be tangent to a
flatter individual budget constraint while also lying on the same downward-sloping
combined household-and-government budget constraint. Therefore, the new
consumption plan must feature higher 𝑐𝑐 and lower 𝑐𝑐 ′ as shown in Figure 4.4.
These unambiguous effects can be thought of as the substitution effect of lower
relative price of 𝑐𝑐 compared to 𝑐𝑐 ′ brought about by the tax rate changes, analogous
to the substitution effect of a lower interest rate. However, the income effects of
the tax changes are weakened by accounting for the government budget
constraint, as is the case with lump-sum taxes.
Although changing tax rates is seen to have an impact on consumption, the analysis
so far does not provide a reason for governments to vary tax rates over time. To
understand why, note that if 𝜏𝜏 = 𝜏𝜏 ′ , i.e. tax rates remain the same, the household
budget constraint gradient becomes −(1 + 𝑟𝑟). This allows households to reach the
highest indifference curve conditional on the government satisfying its budget
constraint, because the combined household-and-government budget constraint
has gradient −(1 + 𝑟𝑟). Any choice of 𝜏𝜏 ≠ 𝜏𝜏 ′ would mean the representative
household ends up on a lower indifference curve.
130
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
131
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The higher burden of taxes after the increase in 𝐺𝐺 has a negative wealth effect on
households because they are less able to afford a given amount of private
consumption and leisure. This leads them to reduce their demand for consumption
𝐶𝐶 𝑑𝑑 as it is a normal good, which has the effect of shifting the 𝑌𝑌 𝑑𝑑 curve to the left.
However, the desire for consumption smoothing means that 𝐶𝐶 𝑑𝑑 falls by less than
the tax burden and 𝐺𝐺 rise. Overall, the output demand curve 𝑌𝑌 𝑑𝑑 shifts to the right.
A way of avoiding some of the reduction in consumption is for households to work
more and earn more income. This is what the wealth effect of taxes on labour
supply represents – corresponding to a lower demand for leisure as a normal good.
Note that we are assuming taxes have a lump-sum form, so there are no
disincentive effects here. The labour supply curve 𝑁𝑁 𝑠𝑠 thus shifts to the right as
shown in Figure 4.5. Labour-market equilibrium now occurs at a higher level of
employment and the movement along the production function implies that firms
increase output of goods at the same real interest rate, shifting the output supply
curve 𝑌𝑌 𝑠𝑠 to the right. Since future consumption and leisure are normal goods,
households do not want to raise 𝑌𝑌 𝑠𝑠 by more than increase in tax burden (equal to
the increase in 𝐺𝐺) minus the reduction in consumption expenditure 𝐶𝐶 𝑑𝑑 .
Figure 4.5: Wealth effect on labour supply and output supply
With both the 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 curves shifting rightwards, GDP 𝑌𝑌 rises unambiguously as
shown in Figure 4.6. The overall shift of 𝑌𝑌 𝑑𝑑 is larger than the shift of 𝑌𝑌 𝑠𝑠 because
smoothing of consumption and leisure makes the combined wealth effect on 𝐶𝐶 𝑑𝑑
and 𝑁𝑁 𝑠𝑠 (and hence, 𝑌𝑌 𝑠𝑠 ) smaller than direct effect of 𝐺𝐺 on 𝑌𝑌 𝑑𝑑 . A larger shift of 𝑌𝑌 𝑑𝑑
than 𝑌𝑌 𝑠𝑠 implies the real interest rate 𝑟𝑟 rises unambiguously.
There are two shifts of 𝑁𝑁 𝑠𝑠 to right in the labour market diagram from Figure 4.6.
First, the wealth effect of the higher tax burden leading to a shift from 𝑁𝑁1𝑠𝑠 (𝑟𝑟1 ) to
𝑁𝑁2𝑠𝑠 (𝑟𝑟1 ). Second, the saving incentive effect of the higher real interest rate 𝑟𝑟
producing the shift from 𝑁𝑁2𝑠𝑠 (𝑟𝑟1 ) to 𝑁𝑁2𝑠𝑠 (𝑟𝑟2 ). Hence, we can conclude unambiguously
that 𝑁𝑁 rises and 𝑤𝑤 falls. Note that there is no effect on 𝑁𝑁 𝑑𝑑 – the increase in 𝐺𝐺 does
not boost productivity here, so our analysis might reach different conclusions if we
considered expenditure on infrastructure that increased total factor productivity
and hence, labour demand.
132
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Although we conclude that GDP 𝑌𝑌 rises overall, this analysis shows it increases by
less than 𝐺𝐺 does. This is due to 𝑌𝑌 = 𝐶𝐶 + 𝐼𝐼 + 𝐺𝐺 and both 𝐶𝐶 𝑑𝑑 and 𝐼𝐼 𝑑𝑑 falling.
Consumption falls because of the negative wealth effect and the negative
substitution effect of a higher real interest rate 𝑟𝑟. Investment falls because the
higher real interest rate raises the borrowing cost or opportunity cost of
investment. These displacement effect on private expenditure of higher public
expenditure 𝐺𝐺 are known as ‘crowding out’ effects.
Figure 4.6: Effects of temporary fiscal stimulus
In summary, our model predicts a temporary fiscal stimulus would boost real GDP
but with no ‘multiplier’ effect whereby 𝑌𝑌 rises more than 𝐺𝐺. What is missing? First,
we could imagine that some types of public expenditure are complementary with
production or consumption. Infrastructure expenditure might raise the return to
capital and boost investment by firms. It might also cause households to spend
more on certain types of goods (for example, cars or air travel) that are
complementary to the public expenditure (although it is also possible to envisage
goods and services that are substitutes for public expenditure).
More importantly, our analysis is missing the idea that deficit-financed public
expenditure raises households’ current disposable income through higher GDP. The
argument is that this encourages some households to spend more on consumption
because previously they were unable to pay for this consumption by borrowing
against future income. This supposes that there are credit-market imperfections of
the type we will study in the remainder of the chapter. We return to the analysis of
a fiscal stimulus with credit-market imperfections when studying macroeconomic
policy in Chapter 9.
133
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
We will start by considering a simple borrowing constraint. Households cannot borrow more
than some amount 𝐿𝐿. Example of borrowing constraints could be a credit limit on a credit
card or an overdraft limit. To begin with, we treat the maximum loan size 𝐿𝐿 as exogenous.
Also for now, assume that borrowers face the same interest rate 𝑟𝑟 as savers receive.
We now add the borrowing constraint to the two-period consumption choice model. With
no initial assets, borrowing means a negative level of saving 𝑠𝑠 = 𝑦𝑦 − 𝑡𝑡 − 𝑐𝑐. The borrowing
constraint is thus:
−𝑠𝑠 ≤ 𝐿𝐿
Using the definition of saving 𝑠𝑠, this constraint is equivalent to 𝑐𝑐 ≤ 𝑦𝑦 − 𝑡𝑡 + 𝐿𝐿, which places
an upper limit on current consumption 𝑐𝑐. This additional constraint truncates the lifetime
budget constraint to the right of 𝑐𝑐 = 𝑦𝑦 − 𝑡𝑡 + 𝐿𝐿 as shown in Figure 4.7. Depending on
income and preferences, the constraint may or may not be binding.
Figure 4.7: The borrowing constraint may or may not be binding
The borrowing constraint is not binding when the tangency point between an indifference
curve and the full lifetime budget constraint without the borrowing constraint is to the left
134
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
of where the borrowing constraint is relevant, i.e. where the household wants to borrow
less than the borrowing constraint, or wants to save. The borrowing constraint is binding
when this tangency point lies to the right of where the budget constraint is truncated and,
hence, the choice of that level of borrowing is not feasible. In this case, the next best
consumption plan is the one at the ‘kink’ of the budget constraint where it is truncated by
the borrowing constraint. The household chooses to borrow the maximum amount allowed
by the constraint. In what follows, we shall assume the borrowing constraint is binding,
otherwise it is irrelevant and our analysis of consumption proceeds as it did in Chapter 3.
Binding borrowing constraints cause households’ consumption behaviour to behave in a
very different way from what we saw in Chapter 3 and Section 4.2. First, there is no
consumption smoothing following a temporary increase or decrease in disposable income.
This is illustrated in Figure 4.8. A decline in current income shifts the endowment point
horizontally to the left from 𝐸𝐸1 to 𝐸𝐸2 . Since the budget constraint is truncated at 𝑐𝑐 = 𝑦𝑦 −
𝑡𝑡 + 𝐿𝐿 because of the limit on borrowing, this truncation point also moves to the left by the
same amount.
Figure 4.8: No consumption smoothing
Starting from the borrowing constraint being binding, the household chooses the
consumption plan at the kink of the new budget constraint, which means that current
consumption falls by as much as current income. If current income were to increase, current
consumption would rise by the same amount if the borrowing constraint remains binding
(which it would as long as current income does not rise too much). Therefore, changes in
current disposable income have a one-for-one effect on current consumption if the
borrowing constraint is binding. In contrast, without a binding borrowing constraint,
households would choose to smooth consumption in response to temporary income shocks,
with 𝑐𝑐 moving less than one-for-one with a change in 𝑦𝑦.
A second difference compared to the earlier model of consumption is the failure of
Ricardian equivalence. A deficit-financed tax cut now increases current consumption
spending for those with a binding borrowing constraint as shown in Figure 4.9. The
135
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
endowment point 𝐸𝐸 moves along the downward-sloping budget constraint line because of
the government’s own present-value budget constraint for the same reason explained in
Section 4.2. However, the limit on borrowing truncates the budget constraint horizontally at
disposable income plus maximum debt, so a tax cut that directly affects disposable income
moves the truncation point. Lower taxes today raise current disposable income, moving the
truncation point to the right along the lifetime budget constraint.
A household with a binding borrowing constraint would like to spend more today by
borrowing. If such a household receives more disposable income then they will spend all or
some of it on more current consumption, so Ricardian equivalence fails. If the borrowing
constraint remains binding after the tax cut, the household will spend all the extra
disposable income immediately.
Figure 4.9: Failure of Ricardian equivalence
136
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
As in Section 4.2, the present-value budget constraint is not affected by the timing
of taxes after accounting for government’s own budget constraint. This means that
a deficit-financed tax cut (lower 𝑡𝑡, higher 𝑡𝑡′) does not affect the position of the
constraint. The current generation’s choice of consumption 𝑐𝑐 therefore remains
unchanged as shown in Figure 4.10. Since the endowment point moves to the right,
the implied bequest 𝑠𝑠 rises.
Figure 4.10: Ricardian equivalence with bequests
The present-value budget constraint does not rule out the current generation
choosing a level of its own consumption with 𝑐𝑐 > 𝑦𝑦 − 𝑡𝑡. But having 𝑐𝑐 > 𝑦𝑦 − 𝑡𝑡
means that 𝑠𝑠 < 0, a negative bequest. However, it is usually not possible
137
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The key point here is that a larger planned bequest 𝑠𝑠 gives parents greater control
over their children. As an example, suppose we start from sufficiently large 𝑠𝑠 where
parents fully achieve their goal of influencing children’s behaviour, albeit at the cost
of making a bequest that is too large from a purely altruistic perspective. This case
is depicted in Figure 4.12. The indifference curves take account only of 𝑐𝑐 and
altruism over 𝑐𝑐′, not parents’ desire to influence their children’s behaviour. Hence,
the initial point does not feature a point of tangency between the indifference
curves and the present-value budget constraint because parents are choosing 𝑐𝑐 too
low and 𝑠𝑠 too high from a purely altruistic perspective.
Figure 4.11: Failure of Ricardian equivalence when bequests are zero
138
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Following a deficit-financed tax cut, the endowment point moves along the budget
constraint as usual. If the current generation maintained the same 𝑐𝑐 and increased
bequests 𝑠𝑠 then they would have even greater control over their children but we
started from a point where parents were already achieving their goals in that
respect. Hence, parents would respond by raising 𝑐𝑐 and leaving 𝑠𝑠 unchanged,
resulting in a failure of Ricardian equivalence even though bequests are positive.
Figure 4.12: Strategic bequests and a failure of Ricardian equivalence
139
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
budget constraint increases (in absolute value) when a household switches from saving to
borrowing. This is depicted in Figure 4.13, where the lifetime budget constraint has a ‘kink’
at the endowment point.
Figure 4.13: Lifetime budget constraint with different interest rates for borrowers and savers
Compared to the case of no credit-market imperfections, someone who would have chosen
to borrow if 𝑟𝑟𝑙𝑙 = 𝑟𝑟 now chooses to borrow less or not to borrow at all if 𝑟𝑟𝑙𝑙 > 𝑟𝑟. The case
where a household decides not to borrow as a result of 𝑟𝑟𝑙𝑙 > 𝑟𝑟 is shown in Figure 4.14. If the
interest-rate spread is large enough to dissuade households from borrowing then they
choose a consumption plan at the kink of the lifetime budget constraint. Households in this
position will not smooth consumption after a temporary shock to income. The logic is similar
to the case with a quantitative limit on borrowing. This is easiest to see in the special case of
an extremely high interest-rate spread. The borrowing segment of the budget constraint
becomes so steep it is equivalent to a borrowing limit 𝐿𝐿 of zero.
Figure 4.14: An interest-rate spread causes a household to stop borrowing
140
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The presence of an interest-rate spread for borrower households also leads to a failure of
Ricardian equivalence. Although households who borrow face an interest rate 𝑟𝑟𝑙𝑙 > 𝑟𝑟, the
government is able to borrow at savers’ interest rate 𝑟𝑟 (savers may indeed be receiving 𝑟𝑟 as
a result of holding government bonds). The government present-value budget constraint is:
𝑡𝑡 ′ 1 𝐺𝐺 ′
𝑡𝑡 + = �𝐺𝐺 + �
1 + 𝑟𝑟 𝑁𝑁 1 + 𝑟𝑟
By the logic of Section 4.1, the lifetime wealth ℎ of savers who receive interest rate 𝑟𝑟 is
unaffected by the timing of taxes 𝑡𝑡 and 𝑡𝑡′ conditional on the levels of public expenditure 𝐺𝐺
and 𝐺𝐺′. On the other hand, the timing of taxes affects the lifetime wealth ℎ𝑙𝑙 of borrowers,
defined as the present value of all disposable income discounted at rate 𝑟𝑟𝑙𝑙 :
𝑦𝑦 ′ 1 𝐺𝐺 ′ (𝑟𝑟𝑙𝑙 − 𝑟𝑟)𝑡𝑡 ′
ℎ𝑙𝑙 = 𝑦𝑦 + − �𝐺𝐺 + �+
1 + 𝑟𝑟𝑙𝑙 𝑁𝑁 1 + 𝑟𝑟 (1 + 𝑟𝑟)(1 + 𝑟𝑟𝑙𝑙 )
A deficit-financed tax cut lowers 𝑡𝑡 but raises 𝑡𝑡′, which increases ℎ𝑙𝑙 when 𝑟𝑟𝑙𝑙 > 𝑟𝑟. Intuitively,
private borrowers effectively discount the future at a higher rate 𝑟𝑟𝑙𝑙 than the interest rate 𝑟𝑟
at which the government is able to borrow. Hence, shifting taxes away from the present to
the future actually increases the present value of lifetime income for borrower households.
For unchanged 𝐺𝐺 and 𝐺𝐺′, the tax cut shifts the kink point on households’ lifetime budget
constraint to the right as shown in Figure 4.15. This has no effect on the set of feasible
consumption plans for savers but expands the feasible set for borrowers. Consequently,
credit-constrained households are better off and choose to consume more. If a household is
dissuaded from borrowing by 𝑟𝑟𝑙𝑙 > 𝑟𝑟 then the household spends all of increase in disposable
income on higher current consumption 𝑐𝑐. If the household was still borrowing initially, then
the tax cut leads to an increase in both 𝑐𝑐 and 𝑐𝑐′. Either way, there is a failure of Ricardian
equivalence.
Figure 4.15: Failure of Ricardian equivalence with an interest-rate spread
141
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
In reality, lenders go to considerable lengths in screening and credit scoring applicants for
loans to resolve this informational asymmetry. However, these efforts are imperfect in that
some bad types will still slip through, in which case, the asymmetric information is the
residual uncertainty that remains about an individual’s type after screening. Moreover,
screening and credit scoring entail costs, and accounting for those costs has similar
implications for interest-rate spreads as the original informational asymmetry itself. For this
reason, we will not account for screening and credit scoring explicitly in our analysis.
Good types make up a fraction 1 − 𝑡𝑡 of all borrowers and bad types are a fraction 𝑡𝑡. The
value of 𝑡𝑡 (0 < 𝑡𝑡 < 1) is common knowledge to everyone including lenders.
While lenders cannot directly observe an individual’s type, bad types do not want to reveal
their type indirectly by asking for a different loan size compared to good types. Suppose
good types want to borrow 𝐿𝐿 when the interest rate on loans is 𝑟𝑟𝑙𝑙 . Bad types mimic them, so
all borrowers end up requesting the same loan size 𝐿𝐿.
Loans are provided by financial intermediaries such as banks who fund them exclusively by
taking safe deposits. There is no bank capital or reserves in this analysis – such
considerations are deferred until Chapter 7. A bank needs to take deposits 𝐿𝐿 per loan made
and depositors must be paid an interest rate 𝑟𝑟, the same as the interest rate on government
bonds. We suppose there is a competitive market for loans with free entry of lenders.
For each loan 𝐿𝐿, lenders must repay (1 + 𝑟𝑟)𝐿𝐿 to depositors. Since both good and bad types
obtain loans, with many borrowers, lenders can predict with confidence that a fraction 𝑡𝑡 will
default, repaying nothing. The repayment from the fraction 1 − 𝑡𝑡 of good types is (1 + 𝑟𝑟𝑙𝑙 )𝐿𝐿,
which implies that lender profits per loan are:
𝜋𝜋 = (1 − 𝑡𝑡)(1 + 𝑟𝑟𝑙𝑙 )𝐿𝐿 − (1 + 𝑟𝑟)𝐿𝐿
142
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Competition between lenders with free entry pushes down lending rates 𝑟𝑟𝑙𝑙 until profits are
zero. The equation for profits can be written as 𝜋𝜋 = �(1 − 𝑡𝑡)(1 + 𝑟𝑟𝑙𝑙 ) − (1 + 𝑟𝑟)�𝐿𝐿, so 𝜋𝜋 = 0
implies that (1 − 𝑡𝑡)(1 + 𝑟𝑟𝑙𝑙 ) = 1 + 𝑟𝑟. Solving this equation for the equilibrium loan interest
rate 𝑟𝑟𝑙𝑙 implies for any 𝐿𝐿:
𝑡𝑡(1 + 𝑟𝑟)
𝑟𝑟𝑙𝑙 = 𝑟𝑟 +
1 − 𝑡𝑡
This equation implies 𝑟𝑟𝑙𝑙 > 𝑟𝑟 because 𝑡𝑡 > 0. Asymmetric information therefore provides a
rationale for a positive interest-rate spread between borrowers’ and savers’ interest rates.
Intuitively, to pay savers an interest rate 𝑟𝑟, all borrowers, including the good types can only
receive loans with a higher interest rate than 𝑟𝑟 to compensate for the bad types defaulting.
The interest-rate spread 𝑟𝑟𝑙𝑙 − 𝑟𝑟 = 𝑡𝑡(1 + 𝑟𝑟)⁄(1 − 𝑡𝑡) increases with the fraction 𝑡𝑡 of bad
types.
If information were symmetric, good and bad types would face different offers from
lenders. We can think of there being two separate markets with different interest rates 𝑟𝑟𝑙𝑙 in
which 𝑡𝑡 is known to be either 0 (only good types) or 1 (only bad types). With 𝑡𝑡 = 0, good
types would be offered 𝑟𝑟𝑙𝑙 = 𝑟𝑟, the same interest rate as savers. This corresponds to the
implicit assumption of perfect financial markets in Chapter 3. With 𝑡𝑡 = 1, bad types would
face 𝑟𝑟𝑙𝑙 = ∞, meaning that financial intermediaries refuse to lend to them.
Suppose that because of asymmetric information, firms can only borrow at interest
rate 𝑟𝑟𝑙𝑙 > 𝑟𝑟 to fund investment. Think of the ‘good types’ in the asymmetric
information model as firms with profitable investment opportunities who
undertake investment and will repay loans. The ‘bad types’ are unprofitable firms
that are borrowing to keep operating, in that these firms make no investments and
loans are simply used to pay managers and workers until the firms ultimately
default.
For good types, if they have sufficient current profits to fund investment from
retained earnings, the opportunity cost of investment is 𝑟𝑟, the cost of
143
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
internal funds. But if they do not have sufficient profits, the cost of borrowing for
investment is 𝑟𝑟𝑙𝑙 > 𝑟𝑟, the cost of external funds. For firms with sufficient internal
funds, they prefer to finance investment at the lower cost 𝑟𝑟 < 𝑟𝑟𝑙𝑙 using internal
funds, in which case their level of investment is determined by the equation
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = 𝑟𝑟 as in the case of perfect financial markets.
If a firm must use external funds then investment is determined by 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = 𝑟𝑟𝑙𝑙
instead. Let 𝑥𝑥 = 𝑟𝑟𝑙𝑙 − 𝑟𝑟 denote the spread between cost of external funds
(borrowing) and internal funds (retained earnings). The equation that determines
investment is equivalent to:
𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 − 𝑥𝑥 = 𝑟𝑟
For a firm that was initially sufficiently profitable to finance investment from
internal funds, a decline in profits to the point where the firm must borrow to
invest has the effect of pushing up the funding cost of investment from 𝑟𝑟 to 𝑟𝑟𝑙𝑙 . This
is equivalent to going from 𝑥𝑥 = 0 to 𝑥𝑥 > 0 in the equation above (𝑥𝑥 = 0
corresponds to 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = 𝑟𝑟), implying a downward shift of the investment
demand curve. Hence, investment demand is also increasing in current profits for
some firms, all else equal.
144
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Some empirical support for the prediction of a negative relationship between the
interest-rate spread 𝑥𝑥 and investment 𝐼𝐼 is presented in Figure 4.17. The graph
shows time series of US investment and the spread between the interest rates on
BAA-rated and AAA-rated corporate bonds. The logic here is that lending to firms
with a AAA rating is less subject to problems of asymmetric information (‘bad types’
being included in the pool of borrowers) than lending to firms with a lower BAA
rating. The negative relationship between the spread and investment is particularly
striking during the 2008 financial crisis.
Figure 4.17: Corporate bond spreads and investment (USA)
To the extent that corporate profits decline in a financial crisis, there is also an
additional amplification effect. With lower profits, fewer firms can rely on internal
funds to finance investment, which means more firms have investment determined
by 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 − 𝑥𝑥 = 𝑟𝑟 rather than 𝑀𝑀𝑃𝑃𝐾𝐾′ − 𝑑𝑑 = 𝑟𝑟, further shifting the aggregate
investment demand curve to the left.
145
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
As we will see, financial crises also have important consequences through the effect
of falling asset prices. This reduces the value of collateral and makes it harder to
access credit.
The standard lifetime budget constraint assumes a household can borrow up to a maximum
amount (𝑦𝑦 ′ − 𝑡𝑡 ′ )/(1 + 𝑟𝑟). At this point, all future income would be needed to repay debt.
While it is feasible to repay, a household cannot make a binding and credible commitment
always to do so. Moreover, a household has a strong incentive to default if most future
income is simply used for debt service. Lenders cannot easily seize this labour income, or
force people to work to their full potential to repay debts. But if borrowers know lenders
cannot take their labour income, what stops them from defaulting?
We will see this problem is often resolved by the use of collateral, an asset pledged by a
borrower as security for a loan. For example, house is collateral for a mortgage loan and a
car for a car loan. Collateral assets can be more easily seized by lenders than income. Thus,
146
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
the threat of losing collateral on default can provide discipline for borrowers to encourage
them to repay loans. However, the size of the threat depends on what the collateral is
worth. This value will set a limit on the maximum loan sizes lenders are willing to agree to.
We now add a housing asset to the two-period consumption model. Assume a household
initially owns a house. Housing is illiquid, though, and it is not possible to sell a fraction of
the house in the current period (and selling the whole house is inconvenient). We allow for
the house to be sold in future at an anticipated price = 𝑝𝑝′. Suppose the household has some
amount of existing debt 𝐷𝐷 (including principal and interest) owed at the beginning of the
current period, for example, an existing mortgage. The lifetime budget constraint is now:
𝑐𝑐 ′ 𝑦𝑦 ′ − 𝑡𝑡 ′ + 𝑝𝑝′
𝑐𝑐 + = 𝑦𝑦 − 𝑡𝑡 − 𝐷𝐷 +
1 + 𝑟𝑟 1 + 𝑟𝑟
This adds the future sale value of the house 𝑝𝑝′ to future income, which is equivalent to
adding the present value of the housing asset 𝑝𝑝′ /(1 + 𝑟𝑟) to lifetime wealth. Existing debt 𝐷𝐷
is subtracted from current income. This assumes that all of this debt is due for repayment or
refinancing in the current period. Long-term debt with interest rate 𝑟𝑟̅ fixed in the past would
enter the budget constraint with (1 + 𝑟𝑟̅ )𝐷𝐷 subtracted from future income 𝑦𝑦 ′ − 𝑡𝑡 ′ instead.
Suppose the household borrows a total amount 𝐿𝐿 (including rolling over or refinancing
existing debt 𝐷𝐷 due for repayment) between the current and future time periods. By
definition, 𝐿𝐿 = 𝐷𝐷 − 𝑠𝑠 = 𝑐𝑐 − (𝑦𝑦 − 𝑡𝑡 − 𝐷𝐷). If the interest rate is 𝑟𝑟 then the future repayment
due is (1 + 𝑟𝑟)𝐿𝐿. The housing asset is collateral for this loan. If the household defaults in the
future, this avoids the debt repayment (1 + 𝑟𝑟)𝐿𝐿, but leads to the loss of the house of value
𝑝𝑝′. Therefore, the household gains from default if 𝑝𝑝′ < (1 + 𝑟𝑟)𝐿𝐿, so to avoid borrowers
defaulting, lenders must ensure that the following collateral constraint holds:
𝑝𝑝′
𝐿𝐿 ≤
1 + 𝑟𝑟
The maximum loan size is thus limited by value of the collateral asset 𝑝𝑝′ /(1 + 𝑟𝑟). This
provides a rationale for a borrowing constraint of the form assumed in Section 4.3, with
current consumption limited by 𝑐𝑐 ≤ 𝑦𝑦 − 𝑡𝑡 − 𝐷𝐷 + 𝑝𝑝′ /(1 + 𝑟𝑟), which truncates the lifetime
budget constraint. Note that as long as the collateral constraint holds, borrowers are not
expected to default, so lenders anticipate being repaid in full. Therefore, there is no
interest-rate spread as seen in Section 4.5 and loans are available savers’ interest rate 𝑟𝑟.
In the absence of any collateral and faced with an inability to seize borrowers’ incomes on
default, the limited commitment model implies there would be no lending in equilibrium, so
the household would face the tightest credit constraint 𝑐𝑐 ≤ 𝑦𝑦 − 𝑡𝑡. The presence of suitable
collateral benefits households by giving access to low-cost borrowing up to some limit. This
helps to smooth consumption as shown in Figure 4.19.
147
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The major drawback of lending depending on collateral is that fluctuations in the value of
the collateral cause fluctuations in the availability of credit. As Figure 4.20 shows, a fall in
house prices leads to a tightening of the borrowing constraint and a drop in current
consumption if that constraint is binding (there is also a wealth effect on existing
homeowners). This is problematic because of the volatility of asset prices. Figure 4.21 shows
the sizeable fluctuations in US house prices over time, which can contribute to fluctuations
in consumption demand by tightening or loosening borrowing constraints.
Figure 4.20: Effects of a fall in house prices
148
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
149
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The right panel of the figure shows the case without a collateral constraint (or
equivalently, where it is not binding). We know from Section 3.5 that a higher
interest rate also unambiguously reduces current 𝑐𝑐 because income and
substitution effects are reinforcing. For future consumption 𝑐𝑐′, the two effects are
opposing, and 𝑐𝑐′ remains unchanged in the special case where income and
substitution effects are of exactly the same size. A dominant income effect and
weak substitution effect imply that 𝑐𝑐′ falls, with current 𝑐𝑐 declining by less than
when the two effects were balanced. A dominant income effect means that the
concern for consumption smoothing is more important than responding to
incentives for intertemporal substitution provided by the interest rate.
Therefore, comparing the left and right panels of the figure, in the case where
income effects are dominant and substitution effects weak, a binding collateral
constraint is likely to magnify the effect of 𝑟𝑟 on 𝑐𝑐. This is because the binding
collateral constraint blocks the attempt to smooth consumption that the household
prefers when income effects are strong.
150
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
151
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
equilibrium, the real interest rate 𝑟𝑟 on bonds adjusts so that saving of young 𝑠𝑠 = 𝑦𝑦 − 𝑡𝑡 − 𝑐𝑐
is equal to 𝑏𝑏.
To see how this works, start from the even simpler case of no government intervention.
There are no taxes and transfers (𝑡𝑡 = 0, 𝑡𝑡 ′ = 0) and no debt is issued (𝑏𝑏 = 0). In
equilibrium, saving must be zero, 𝑠𝑠 = 0, which implies the young must consume their
income, 𝑐𝑐 = 𝑦𝑦. As they do not acquire any assets because there are no government bonds,
the same is true when old, 𝑐𝑐 ′ = 𝑦𝑦′. In this no-intervention economy, the real interest rate 𝑟𝑟
adjusts so that 𝑐𝑐 = 𝑦𝑦 and 𝑐𝑐 ′ = 𝑦𝑦′. Diagrammatically, the gradient of the lifetime budget
constraint 𝑐𝑐 + 𝑐𝑐 ′ /(1 + 𝑟𝑟) = 𝑦𝑦 + 𝑦𝑦 ′ /(1 + 𝑟𝑟) adjusts until 𝑐𝑐 = 𝑦𝑦 and 𝑐𝑐 ′ = 𝑦𝑦′ is chosen by the
young generation of households as shown in Figure 4.23. The resulting real interest rate is
given by 1 + 𝑟𝑟 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑐𝑐,𝑐𝑐 ′ at the endowment point.
Figure 4.23: No trade between the generations
Is the market equilibrium of this economy Pareto efficient? In other words, would
government intervention making one person better off require making someone else worse
off? There is none of the usual externalities or market failures that would suggest the
market equilibrium is suboptimal. However, as we will see, it may be possible for
government intervention to make all generations better off.
Suppose there is a social planner who can directly choose consumption for each generation
subject to the total supply of goods available. Can this social planner do better than the
market? Assume the social planner picks the same consumption plan (𝑐𝑐, 𝑐𝑐 ′ ) for youth and
old age of each generation. Given the numbers of young and old people, this plan entails
aggregate consumption 𝐶𝐶 = 𝑁𝑁 ′ 𝑐𝑐 + 𝑁𝑁𝑁𝑁′. The plan is feasible if 𝐶𝐶 = 𝑌𝑌:
𝑁𝑁 ′ 𝑐𝑐 + 𝑁𝑁𝑐𝑐 ′ = 𝑁𝑁 ′ 𝑦𝑦 + 𝑁𝑁𝑁𝑁′
Dividing both sides by 𝑁𝑁′ and using the demographic equation 𝑁𝑁⁄𝑁𝑁′ = 1⁄(1 + 𝑛𝑛):
𝑐𝑐′ 𝑦𝑦′
𝑐𝑐 + = 𝑦𝑦 +
1 + 𝑛𝑛 1 + 𝑛𝑛
152
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
In the diagram with consumption plans (𝑐𝑐, 𝑐𝑐 ′ ), this is a straight line passing through the
endowment point (𝑦𝑦, 𝑦𝑦 ′ ) with gradient −(1 + 𝑛𝑛) as plotted in Figure 4.24. This line
resembles a budget constraint, except that the population growth rate, which is also the
economy’s GDP growth rate, determines the gradient of the constraint instead of an
interest rate.
Figure 4.24: Feasible consumption plans for social planner
When the no-government-intervention market equilibrium has 𝑟𝑟 < 𝑛𝑛, the social planner can
choose a combination of 𝑐𝑐 and 𝑐𝑐′ where 𝑐𝑐′ is higher than the market equilibrium and a
higher indifference curve is reached as illustrated in the figure. As 𝑐𝑐′ is higher, moving to this
plan would benefit the generation of old alive when it is implemented because they only
care about 𝑐𝑐′. But as a higher indifference curve is reached, all current and future
generations of young are also better off. This constitutes a Pareto improvement on the
market equilibrium.
In light of this analysis, we say that the economy is dynamically inefficient when 𝑟𝑟 < 𝑛𝑛, that
is, the real interest rate is below the population growth rate (which is also the GDP growth
rate here). However, when 𝑟𝑟 > 𝑛𝑛, this Pareto improvement is not available. Moving to a
feasible consumption plan on a higher indifference curve would mean reducing 𝑐𝑐′, which
would hurt the initial generation of old. The economy is said to be dynamically efficient in
the case 𝑟𝑟 > 𝑛𝑛.
153
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
and whether it is a defect that the system is not fully funded. Unlike a typical
private pension, the system is not backed by any assets.
Suppose a government sets up a pay-as-you-go pension system. The young are
obliged to make contributions 𝑡𝑡 (a tax). The old are paid a pension 𝑝𝑝 (a transfer,
𝑡𝑡 ′ = −𝑝𝑝). The system has no assets. The payments to the old come from the
contributions collected at the same time from the young. Assuming the system
balances contributions from the 𝑁𝑁′ young and payments to the 𝑁𝑁 old, the system’s
budget constraint is:
𝑁𝑁𝑁𝑁 = 𝑁𝑁 ′ 𝑡𝑡
Dividing both sides by 𝑁𝑁′ and using the demographic equation 𝑁𝑁⁄𝑁𝑁′ = 1⁄(1 + 𝑛𝑛),
the contributions 𝑡𝑡 and pensions 𝑝𝑝 must be related as follows:
𝑝𝑝
𝑡𝑡 =
1 + 𝑛𝑛
The first generation of the old at the time the system is established necessarily gain
from it because they receive the benefit 𝑝𝑝 without having to pay the contributions
𝑡𝑡. Their consumption increases as they spend their additional pension income.
Given a real interest rate 𝑟𝑟, the lifetime budget constraint of current and
subsequent generations of the young is:
𝑐𝑐′ 𝑦𝑦 ′ + 𝑝𝑝
𝑐𝑐 + = 𝑦𝑦 − 𝑡𝑡 +
1 + 𝑟𝑟 1 + 𝑟𝑟
By substituting 𝑡𝑡 = 𝑝𝑝⁄(1 + 𝑛𝑛) from the pension-system budget constraint:
𝑐𝑐′ 𝑝𝑝 𝑦𝑦 ′ + 𝑝𝑝 𝑦𝑦 ′ (𝑛𝑛 − 𝑟𝑟)𝑝𝑝
𝑐𝑐 + = 𝑦𝑦 − + = 𝑦𝑦 + +
1 + 𝑟𝑟 1 + 𝑛𝑛 1 + 𝑟𝑟 1 + 𝑟𝑟 (1 + 𝑟𝑟)(1 + 𝑛𝑛)
The final term is positive and the budget constraint shifts out if 𝑛𝑛 > 𝑟𝑟 holds. This
case is depicted in Figure 4.25.
Figure 4.25: Effect of introducing a pay-as-you-go pension system
154
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
In equilibrium, the real interest rate 𝑟𝑟 rises as consumption falls when young and
rises when old, which means the gradient of the indifference curve is steeper. This
is why the gradient of the budget constraint increases as well as its position shifting
upwards in the diagram.
When 𝑛𝑛 > 𝑟𝑟, current and subsequent generations of young are better off as a
result of the pay-as-you-go pension system being established, in addition to the
current old. This means everyone gains from the introduction of the system. We
know this is possible because the economy’s equilibrium is dynamically inefficient if
𝑟𝑟 < 𝑛𝑛. Resources can be reallocated to make everyone better off, and the pay-as-
you-go pension system is just one way of achieving this.
Intuitively, the market failure and case for government intervention is due to the
difficulty of trade between the generations. There is no pair of generations that can
make a mutually beneficial deal. Observe that the young lose by paying a tax that
goes to the old – they only gain because they in turn get a pension paid for by the
next generation. And that generation only gains because its pension is paid for by
the generation after, and so on. This pattern of trade between the generations is
difficult to arrange privately without government intervention.
If the economy is dynamically efficient (𝑟𝑟 > 𝑛𝑛) then introducing the pay-as-you-go
pension system benefits the currently old generation but makes the young and all
subsequent generations worse off. In the model, 𝑛𝑛 is the GDP growth rate, which
comes only from population growth and 𝑟𝑟 is the real interest rate on bonds. For
everyone to gain from pay-as-you-go, we need real interest rates that are lower
than the economy’s real growth rate. Importantly, this must be true not just
currently but at all future times as well. Pay-as-you-go pension systems were often
established during ‘baby booms’ when GDP growth was high but such favourable
demographics have not been maintained in many countries.
We study fully funded pensions using the two-period model of consumption where
the first period is ‘youth’ and the second period is ‘old age’. Suppose the
155
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
government requires the young to pay an amount 𝑡𝑡 into their pension. These funds
are invested in assets with real return 𝑟𝑟. These could be managed privately or by
the government but we assume that the return 𝑟𝑟 is the same irrespective of who
invests the funds. Assets are worth (1 + 𝑟𝑟)𝑡𝑡 by old age and are used to pay out a
pension 𝑝𝑝 given by the constraint:
𝑝𝑝 = (1 + 𝑟𝑟)𝑡𝑡
This is the budget constraint of the pension system, assuming there are no
administrative costs or intermediary profits taken out. A household’s lifetime
budget constraint is:
𝑐𝑐 ′ 𝑦𝑦 ′ + 𝑝𝑝
𝑐𝑐 + = 𝑦𝑦 − 𝑡𝑡 +
1 + 𝑟𝑟 1 + 𝑟𝑟
By substituting the pension budget constraint 𝑝𝑝 = (1 + 𝑟𝑟)𝑡𝑡:
𝑐𝑐 ′ 𝑦𝑦 ′ 𝑦𝑦 ′
𝑐𝑐 + = 𝑦𝑦 − 𝑡𝑡 + + 𝑡𝑡 = 𝑦𝑦 +
1 + 𝑟𝑟 1 + 𝑟𝑟 1 + 𝑟𝑟
Notice that the pension 𝑝𝑝 and the contributions 𝑡𝑡 cancel out because they are of
the same present value. The argument here is analogous to Ricardian equivalence.
Absent credit-market imperfections, setting up a fully funded pension system has
no effect on households. This is because without credit-market imperfections, the
lifetime budget constraint is the only constraint faced by households, and we have
seen that a fully funded pension system has no impact on this constraint. As shown
in Figure 4.26, the endowment point would move along an unchanged lifetime
budget constraint and there would be no change in households’ consumption plans
or welfare. For the consumption plan to remain the same, households reduce their
own private saving (or borrow) when forced to make pension contributions. Total
savings including the pension contributions are unaffected.
Figure 4.26: Establishing a fully-funded pension system
156
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
However, households may not find it possible to borrow against future pension
income, which means they may not be able to cut private saving to compensate for
being obliged to contribute to the new pension system. Suppose we impose a
borrowing constraint that limits current consumption to 𝑐𝑐 ≤ 𝑦𝑦 − 𝑡𝑡. Without credit-
market imperfections, a fully funded pension system has no consequences for
welfare. With a binding borrowing constraint, a fully funded pension system
reduces current consumption if individuals are forced to save more than they
would have chosen to. In that case, individuals are made worse off by introducing
the pension system, as shown in Figure 4.26.
The analysis so far has failed to demonstrate a case for government intervention in
the pension system. However, if households were myopic and failed to make
adequate provision for the future then paternalistic government intervention might
improve welfare. It is also possible that when there are transaction costs in making
investments, the large scale of the government lowers costs relative to everyone
investing privately.
A further possible justification for imposing a fully funded pension system is that if
the government did not force people to save, political pressure might compel it to
provide public pensions to old people who have not saved enough. If such public
pensions are financed by taxes levied on the young, this would create a de facto
pay-as-you-go pension system. If 𝑟𝑟 > 𝑛𝑛 then the economy is dynamically efficient
and we know pay-as-you-go pensions make the young and subsequent generations
worse off. There is a need to impose a fully funded pension system in this case to
avoid political pressure to establish an inefficient pay-as-you-go system.
A pay-as-you-go pension system is one way that a government can affect the
allocation of consumption between young (𝑐𝑐) and old (𝑐𝑐′) generations. Absent
157
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
The lower population growth rate 𝑛𝑛2 < 𝑛𝑛1 pivots the government’s constraint to
the left around the point (𝑦𝑦, 𝑦𝑦 ′ ). The original pension system 𝑡𝑡1 and 𝑝𝑝1 no longer
satisfies the constraint 𝑡𝑡 = 𝑝𝑝⁄(1 + 𝑛𝑛2 ). This means that either the pension 𝑝𝑝 is cut,
or the contributions 𝑡𝑡 are raised, or some combination of both.
Reducing pensions 𝑝𝑝 makes the current old generation worse off. The current old
clearly prefer contributions 𝑡𝑡 are raised to preserve their pension. However, if 𝑝𝑝 is
not cut then it is now the case that 𝑟𝑟 > 𝑛𝑛2 (the economy started from 𝑟𝑟 = 𝑛𝑛1 ). The
young are worse off if the pension 𝑝𝑝 is maintained at its former level. As the
economy has become dynamically efficient, current and future generations of
young prefer to cut pensions 𝑝𝑝 to some extent. We conclude that, unfortunately,
there is no reform that does not make some generation worse off.
158
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
Note, however, that the young would generally not favour total abolition of the
pay-as-you-go pension system. Reducing contributions 𝑡𝑡 all the way to zero may
imply that 𝑟𝑟 falls below 𝑛𝑛2 , making the economy dynamically inefficient, from
which point a higher 𝑡𝑡 and 𝑝𝑝 would make both young and old better off. Therefore,
the young prefer pensions 𝑝𝑝 are cut until 𝑟𝑟 = 𝑛𝑛2 .
159
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
saving 𝑠𝑠 = 𝑦𝑦 − 𝑐𝑐 of the young must equal 𝑝𝑝. The level of 𝑠𝑠 must also maximise
lifetime utility given the expected asset return 𝑟𝑟. Conditional on 𝑝𝑝 and 𝑟𝑟, since 𝑟𝑟 is
solely a capital gain, the amount 𝑝𝑝′ paid by the next generation is:
1 + 𝑟𝑟
𝑝𝑝′ = � � 𝑝𝑝
1 + 𝑛𝑛
These conditions must continue to hold for all subsequent generations as well.
Figure 4.28 shows that in dynamically inefficient economies with 𝑟𝑟 < 𝑛𝑛, a bubble is
possible with the young paying 𝑝𝑝 each for bubble asset. As long as 𝑝𝑝 is not too
large, the increase in 𝑟𝑟 needed to give the young an incentive to hold the asset
leaves 𝑟𝑟 no more than 𝑛𝑛. The bubble asset is sold on to next generation of young,
with each paying 𝑝𝑝′ but 𝑝𝑝′ is no more than 𝑝𝑝 because 𝑟𝑟 ≤ 𝑛𝑛. The bubble asset
remains affordable to future generations of young because the amount paid per
person does not grow. The total bubble asset value is 𝑉𝑉 = (1 + 𝑛𝑛)𝑝𝑝𝑝𝑝, thus 𝑉𝑉′⁄𝑉𝑉 =
(1 + 𝑛𝑛)(𝑝𝑝′ ⁄𝑝𝑝) ≤ 1 + 𝑛𝑛, so the total value grows no faster than the size of the
economy. The largest possible bubble size pushes 𝑟𝑟 up to 𝑛𝑛 exactly, in which case 𝑝𝑝
is constant and 𝑉𝑉 grows at same rate 𝑛𝑛 as the economy.
In the dynamically efficient economy (𝑟𝑟 ≥ 𝑛𝑛) depicted in Figure 4.29, a bubble
would require 𝑟𝑟 to rise above 𝑛𝑛, or even further above 𝑛𝑛. Since 𝑟𝑟 > 𝑛𝑛, the amount
𝑝𝑝′ that must be paid per person for the bubble asset by the next generation of
young is greater than the 𝑝𝑝 that the current generation of young pays. This per-
person amount paid needs to keep rising, which is eventually impossible given the
incomes of the young. Such a bubble would ‘pop’ and the anticipation of this
prevents it forming if investors behave rationally. Therefore, no bubbles form in a
dynamically efficient economy.
Figure 4.28: A bubble in a dynamically inefficient economy
160
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
161
EC2065 Macroeconomics | Chapter 4: Fiscal policy and credit-market imperfections
payments planned. With 𝑁𝑁 ′′ = (1 + 𝑛𝑛)𝑁𝑁 ′ young people in the future period, if new
bonds are issued to repay the existing debt then the following bond issuance 𝑏𝑏′ per
future young person is required:
(1 + 𝑟𝑟)𝑁𝑁 ′ 𝑏𝑏 1 + 𝑟𝑟
𝑏𝑏′ = ′′
=� � 𝑏𝑏
𝑁𝑁 1 + 𝑛𝑛
Given bond issuance 𝑏𝑏, an equilibrium requires there is an interest rate 𝑟𝑟 at which
the saving 𝑠𝑠 = 𝑦𝑦 − 𝑐𝑐 of the young is equal to 𝑏𝑏. Given this 𝑟𝑟, next period’s bond
issuance is given by the equation above for 𝑏𝑏′ . There must then be a 𝑟𝑟′ such that
𝑠𝑠 ′ = 𝑏𝑏′, and so on for all future generations.
This analysis is mathematically identical to that for ‘bubbles’ in Box 4.10 by simply
replacing the per young person purchases 𝑝𝑝 of the bubble asset with per young
person government bond issuance 𝑏𝑏.
162
EC2065 Macroeconomics | Chapter 5: Unemployment
Chapter 5: Unemployment
The basic function of an economy is to allow people to turn their time as work into
production and incomes that allow them to consume goods and services. The occurrence of
unemployment, meaning those who do not have a job but who are actively seeking work,
suggests economies might be failing in this central task.
Our analysis of the labour market back in Chapter 1 had no place for unemployment. There,
the labour market cleared with the real wage adjusting to bring the demand for labour into
line with the supply of labour. Note that unemployment is defined as people who want to
work at the prevailing wage but who have not yet found a job. This is different from the
earlier notion of non-participation in the labour market, which refers to those who do not
want to work at the prevailing wage.
In this chapter, we will explore why unemployment occurs. Furthermore, we would like to
understand why the extent of unemployment varies over time and why unemployment can
differ significantly between countries. By understanding unemployment better, we will try
to answer the question of whether unemployment is a failure of the economy that can be
corrected with an appropriate policy response.
Essential reading
163
EC2065 Macroeconomics | Chapter 5: Unemployment
164
EC2065 Macroeconomics | Chapter 5: Unemployment
Figure 5.3: The unemployment rate over the business cycle in the USA
How can we understand the occurrence of unemployment? In our earlier supply and
demand analysis of the labour market, there was no unemployment because the real wage
𝑤𝑤 adjusts to a point where labour demand is equal to labour supply. This is depicted in
Figure 5.4, with market clearing at 𝑤𝑤 ∗∗ . At this point, not everyone need be participating the
labour market and working the maximum amount but there is no unemployment in the
sense of someone wanting to work more at wage 𝑤𝑤 ∗∗ but not being able to find extra
employment.
One approach to thinking about unemployment is to suppose there is some impediment to
wage adjustment. Assume that wages are ‘sticky’ downwards, so the labour market cannot
clear. With the real wage stuck at some level 𝑤𝑤 ∗ above 𝑤𝑤 ∗∗ , desired labour supply exceeds
desired labour demand, and unemployment 𝑈𝑈 ∗ is the gap between the two.
Figure 5.4: Sticky wages and unemployment
If wages are sticky, there can be a shortage of demand for labour and, hence, some who
want to work cannot find jobs. But in this case, why do wages not fall to clear the labour
165
EC2065 Macroeconomics | Chapter 5: Unemployment
market in a way that a price would in any other market if there were excess supply? We
could identify wage contracts as creating some rigidity in wage adjustment, along the lines
of the nominal rigidities we will consider in Chapter 8. However, contracts can eventually be
renegotiated, so this in itself does not seem a promising explanation for why there is
positive unemployment even in the long run. We could also think of regulations such as
minimum wages as creating wage rigidity. However, unemployment exists even in countries
with no minimum wages, so this cannot be an important part of the story.
166
EC2065 Macroeconomics | Chapter 5: Unemployment
Another argument for having effective labour input as a function of wages is based on an
asymmetric information problem. Job applicants know more about their skills and abilities,
hence, potential earnings, than firms looking to recruit them. This means that good workers
will not want to apply for low-paying jobs. Even though firms cannot perfectly identify in
advance who will be a good worker, they know the wage they offer will affect the
composition of the group of applicants. Firms then need to offer high wages to attract
applications from good workers. A related argument makes the case that paying high wages
is important for retaining good workers.
As well as effective labour input 𝑒𝑒(𝑤𝑤) being increasing in wages 𝑤𝑤, it also makes sense to
assume 𝑒𝑒(𝑤𝑤) is a convex function for low 𝑤𝑤, which becomes concave for high 𝑤𝑤. Convexity
for low wages justified by there being some minimum level of wages required for people to
apply for jobs and exert effort on the job, so starting from very low 𝑤𝑤, the gradient of 𝑒𝑒(𝑤𝑤)
is initially increasing. Concavity for high wages justified by there being some maximum level
of effective labour input workers can physically put in, so the gradient of 𝑒𝑒(𝑤𝑤) ultimately
flattens out. An 𝑒𝑒(𝑤𝑤) function with these properties is shown in Figure 5.5.
Figure 5.5: Wages and effective labour input
There are other factors that affect workers’ effective labour input beyond just the wage paid
by their employers. This should also depend on market conditions. Higher wages 𝑤𝑤 ∗ in other
jobs weaken the incentives provided by the firm’s own wage 𝑤𝑤, all else being equal.
Workers would be less inclined to apply for a job paying 𝑤𝑤 or try to keep their current job if
wages 𝑤𝑤 ∗ at other firms are high. On the other hand, a higher unemployment rate 𝑢𝑢
strengthens incentives, making workers more inclined to apply for jobs or to try to keep
their current job. In general, we could write down a function 𝑒𝑒(𝑤𝑤, 𝑤𝑤 ∗ , 𝑢𝑢) that is decreasing
in 𝑤𝑤 ∗ and increasing in 𝑢𝑢, although we will often simplify matters by assuming that the
function depends on 𝑤𝑤 only.
How are wages determined when there is not a competitive market for workers who
provide a fixed and known amount of effective labour input? Firms are able to choose
wages 𝑤𝑤 to maximise profits, taking into account the relationship between effective labour
input and wages. A firm maximises profits 𝜋𝜋 = 𝐹𝐹 (𝐸𝐸 ) − 𝑤𝑤𝑤𝑤 with respect to both
167
EC2065 Macroeconomics | Chapter 5: Unemployment
The profit-maximising efficiency wage 𝑤𝑤 ∗ is where this tangency occurs, noting that the
initial convexity and subsequent concavity of 𝑒𝑒(𝑤𝑤) imply such a point exists. The inverse of
the gradient of the tangent and ray from the origin gives the effective cost 𝑤𝑤
� of a unit of
effective labour input.
168
EC2065 Macroeconomics | Chapter 5: Unemployment
Since firms demand labour input 𝐸𝐸 up to the point where 𝐹𝐹 ′ (𝐸𝐸 ) = 𝑤𝑤 �, the implied demand
for workers is 𝑁𝑁 = 𝐸𝐸/𝑒𝑒(𝑤𝑤 ). This is equivalent to finding 𝑁𝑁 where 𝑒𝑒(𝑤𝑤 ∗ )𝐹𝐹 ′ (𝑒𝑒(𝑤𝑤 ∗ )𝑁𝑁) = 𝑤𝑤 ∗ ,
∗
which can be plotted as a downward-sloping labour demand curve with 𝑁𝑁 on the horizontal
axis. With wages set at 𝑤𝑤 ∗ , labour demand is 𝑁𝑁 ∗ , as seen in Figure 5.7. The supply of labour
𝑁𝑁 𝑠𝑠 depends on wages in usual way, but firms have no incentive to adjust the wage 𝑤𝑤 ∗ even
if there are unemployed workers when labour demand is 𝑁𝑁 ∗ . It is not in the interests of a
firm to hire at lower wages because the cost 𝑤𝑤 � of effective labour input would actually be
higher.
Figure 5.7: Efficiency wage and unemployment
Is there any reason to expect the efficiency wage 𝑤𝑤 ∗ to be above the market-clearing real
wage 𝑤𝑤 ∗∗? If not, there would be no unemployment and, since all firms would pay the same
wage 𝑤𝑤 ∗ , all else being equal, no worker would have an incentive to exert effort. This is
because workers could obtain the sae pay in another job with no risk of unemployment risk.
Taking the moral-hazard argument for efficiency wages, there needs to be unemployment in
equilibrium to provide incentives. Note that we argued earlier that 𝑒𝑒(𝑤𝑤, 𝑤𝑤 ∗ , 𝑢𝑢) should be
increasing in unemployment 𝑢𝑢.
With an efficiency wage 𝑤𝑤 ∗ above the market-clearing wage 𝑤𝑤 ∗∗ , the theory explains the
persistence of some amount of unemployment. Conditional on the level of the efficiency
wage, unemployment fluctuates with shifts of labour demand. Figure 5.8 shows the effect of
a decline in labour demand. This leads to a drop in employment but does not change wages
and, consequently, unemployment is higher. A change in one of the determinants of firms’
optimal efficiency wage would also have implications for unemployment, with a higher
efficiency wage causing an increase in unemployment, all else being equal.
169
EC2065 Macroeconomics | Chapter 5: Unemployment
170
EC2065 Macroeconomics | Chapter 5: Unemployment
A decline in 𝑤𝑤 ∗ implies a downward shift in the efficiency wage line in the labour-market
diagram in Figure 5.10. Since 𝑒𝑒(𝑤𝑤 ∗ ) = 1 for the particular functional form considered here,
the change in the monitoring technology does not shift the labour demand curve 𝑁𝑁 𝑑𝑑 . Since
the only change is the decline in 𝑤𝑤 ∗ , unemployment declines from 𝑈𝑈1∗ to 𝑈𝑈2∗ .
Figure 5.10: Labour-market implications of better monitoring technology
171
EC2065 Macroeconomics | Chapter 5: Unemployment
there are individual workers and individual firms who have to search to form mutually
agreeable matches.
This view of the labour market emphasises heterogeneity among potential workers and
among jobs. Workers have different skills and jobs have different requirements. This means
it is not straightforward to match up jobs with suitable workers.
Search theories of the labour market also draw a distinction between stocks and flows.
Stocks refer to the number of people with particular labour-market statuses, such as those
who currently have jobs, or those who are currently unemployed. Flows refer to transitions
between different labour-market statuses, such as those who find a new job, or those who
lose or leave a job and become unemployed. The analysis of stocks and flows is absent from
the usual supply-and-demand approach to the labour market where it is implicitly assumed
flows are so rapid that only stocks (employment and unemployment) need to be
considered.
However, flows are interesting in their own right. Even with no change in overall
employment or unemployment, there is continual ‘churn’ in the labour market with
individuals making transitions between employment and unemployment, and back again.
Owing to the differences among potential workers and jobs, it takes time for firms to find an
‘acceptable’ worker to fill a particular job and it takes time for someone searching for work
to find an ‘acceptable’ job. In other words, the search process takes time because of limited
information. Furthermore, it is often not in interests of firms to accept the first potential
employee they see, or workers the first job they are offered.
Search theory studies this process of search and matching, the rates at which the flows
occur and the unemployment that results from them. The theory explains why both
unemployment and job vacancies coexist, i.e. unfilled jobs alongside people who want to
find jobs.
172
EC2065 Macroeconomics | Chapter 5: Unemployment
labour market. The job separation rate 𝑠𝑠 represents dismissals, redundancies and workers
quitting jobs.
The value of currently having a job with wage 𝑤𝑤 is specified by the value function of
employment 𝑉𝑉𝑒𝑒 (𝑤𝑤). Value refers to the present value of all future payoffs starting from
having a job with wage 𝑤𝑤. The future payments in this present discounted sum are
discounted at some rate 𝑟𝑟. The value function 𝑉𝑉𝑒𝑒 (𝑤𝑤) is increasing in the wage 𝑤𝑤 because it
is better to start from a job with a higher wage, all else equal. An example value function is
sketched in Figure 5.11.
Figure 5.11: The value function of employment
173
EC2065 Macroeconomics | Chapter 5: Unemployment
The decision to accept or reject a job offer can be stated in terms of a reservation wage 𝑤𝑤 ∗
where 𝑉𝑉𝑒𝑒 (𝑤𝑤 ∗ ) = 𝑉𝑉𝑢𝑢 . In the diagram, the reservation wage is found where the value function
of employment 𝑉𝑉𝑒𝑒 (𝑤𝑤) intersects the value of unemployment 𝑉𝑉𝑢𝑢 . A job offer with wage 𝑤𝑤
should be accepted if 𝑤𝑤 ≥ 𝑤𝑤 ∗ and rejected if 𝑤𝑤 < 𝑤𝑤 ∗ because these ranges of 𝑤𝑤
correspond to where 𝑉𝑉𝑒𝑒 (𝑤𝑤) is respectively above and below 𝑉𝑉𝑢𝑢 .
Intuitively, a reservation wage captures the idea that people do not want to accept just any
job when better ones might be found by continuing to search. On the other hand, people do
not want to be so picky that they spend forever searching for a perfect job that they might
never find, forgoing the opportunity to earn a wage in an acceptable but not perfect job.
It is important to understand that the value of unemployment 𝑉𝑉𝑢𝑢 includes not only receiving
unemployment benefits 𝑏𝑏 but also the chance of receiving job offers while searching for
jobs, i.e. the value of search. This means 𝑉𝑉𝑢𝑢 > 𝑉𝑉𝑒𝑒 (𝑏𝑏), so the value of unemployment is more
than simply having a job that pays a wage equal to unemployment benefits 𝑏𝑏. It follows that
𝑤𝑤 ∗ > 𝑏𝑏, so the reservation wage lies above the level of unemployment benefits.
We now consider the implications of rational search behaviour for how quickly on average
people make the transition from unemployment to employment. Recall that job offers are
obtained at rate 𝑝𝑝 over time. The probability of a wage offer 𝑤𝑤 being less than the
reservation wage 𝑤𝑤 ∗ is 𝐹𝐹 (𝑤𝑤 ∗ ) because the cumulative distribution function 𝐹𝐹(𝑤𝑤 ∗ ) gives the
probability of 𝑤𝑤 ≤ 𝑤𝑤 ∗ . Therefore, the probability that an offer is accepted is 1 − 𝐹𝐹(𝑤𝑤 ∗ ). If 𝑓𝑓
denotes the average rate at people find jobs per unit of time then:
𝑓𝑓 = 𝑝𝑝(1 − 𝐹𝐹(𝑤𝑤 ∗ ))
The relationship between the job-finding rate 𝑓𝑓 and the reservation wage 𝑤𝑤 ∗ is illustrated in
Figure 5.13. The job-finding rate is a decreasing function of the reservation wage because a
higher reservation wage means people are more picky when searching for jobs and thus are
less likely to receive an acceptable offer. Mathematically, 𝐹𝐹(𝑤𝑤 ∗ ) increases with 𝑤𝑤 ∗ , so 1 −
𝐹𝐹(𝑤𝑤 ∗ ) declines with 𝑤𝑤 ∗ .
174
EC2065 Macroeconomics | Chapter 5: Unemployment
The expected time taken to find and accept a job is 1/𝑓𝑓. This expected time is longer when
offers are harder to get (low 𝑝𝑝), or when the reservation wage 𝑤𝑤 ∗ is high, meaning that
people are more picky (𝐹𝐹(𝑤𝑤 ∗ ) is low).
Given the job-separation rate 𝑠𝑠, over time there are inflows from employment to
unemployment of 𝑠𝑠(𝐿𝐿 − 𝑈𝑈). This is because inflows from the group of size 𝐿𝐿 − 𝑈𝑈 occur at
rate 𝑠𝑠 per unit of time. Similarly, given the job-finding rate 𝑓𝑓, there are outflows from
175
EC2065 Macroeconomics | Chapter 5: Unemployment
unemployment to employment of 𝑓𝑓𝑓𝑓. Outflows from the group of size 𝑈𝑈 occur at rate 𝑓𝑓
over time.
Between two periods, stock-flow accounting says that the change in the number of people
unemployed is equal to inflows minus outflows. Letting 𝑈𝑈′ denote the number of people
unemployed in the next period, the stock-flow accounting identity is:
𝑈𝑈 ′ − 𝑈𝑈 = 𝑠𝑠 (𝐿𝐿 − 𝑈𝑈) − 𝑓𝑓𝑓𝑓
Generally, we are more interested in the unemployment rate 𝑢𝑢 = 𝑈𝑈/𝐿𝐿 rather than the
number of people unemployed. The stock-flow accounting identity can be transformed in
terms of unemployment rates by dividing both sides by 𝐿𝐿:
𝑈𝑈 ′ − 𝑈𝑈 𝑠𝑠 (𝐿𝐿 − 𝑈𝑈) 𝑓𝑓𝑈𝑈
= −
𝐿𝐿 𝐿𝐿 𝐿𝐿
Since 𝐿𝐿′ = 𝐿𝐿, the unemployment rate is 𝑢𝑢′ = 𝑈𝑈 ′ /𝐿𝐿 next period, and thus:
𝑢𝑢′ − 𝑢𝑢 = 𝑠𝑠(1 − 𝑢𝑢) − 𝑓𝑓𝑓𝑓
This equation states that the inflow to unemployment increases the unemployment rate by
𝑠𝑠(1 − 𝑢𝑢) and the outflow decreases the unemployment rate by 𝑓𝑓𝑓𝑓. The relationship
between the two flows (relative to the size of the labour force) and the unemployment rate
is depicted in Figure 5.14. The outflow 𝑓𝑓𝑓𝑓 is increasing in the unemployment rate because a
greater number of people will leave unemployment for the same 𝑓𝑓 when there are initially
more people unemployed. The inflow 𝑠𝑠(1 − 𝑢𝑢) is decreasing in 𝑢𝑢 because higher 𝑢𝑢 means
fewer people have jobs, so fewer will leave for a given job-separation rate 𝑠𝑠.
Figure 5.14: Inflows and outflows to and from unemployment
176
EC2065 Macroeconomics | Chapter 5: Unemployment
177
EC2065 Macroeconomics | Chapter 5: Unemployment
Since more generous unemployment insurance raises the equilibrium unemployment rate,
why do governments set up welfare states that provide unemployment insurance? While
the diagram above shows that 𝑉𝑉𝑢𝑢 and the average value of 𝑉𝑉𝑒𝑒 (𝑤𝑤) above 𝑤𝑤 ∗ are higher, this
analysis does not account for cost of the insurance, i.e. the higher taxes on wages needed to
pay the unemployment benefits, which must be set against the gains.
Aside from distributional considerations, the main reason for governments to provide
unemployment insurance is because there is a missing market for people to obtain private
insurance against unemployment risk. Such insurance is usually not available because of the
problem of moral hazard. While this analysis lies outside the domain of our search model,
publicly provided unemployment insurance can be good because it substitutes for the
missing market.
Does the search model offer any guidance on the design of unemployment insurance
beyond its role in substituting for a missing market? At first glance, the model points to a
cost of unemployment insurance in making178 people excessively picky when searching for
EC2065 Macroeconomics | Chapter 5: Unemployment
jobs. The argument is that people would already optimally trade off higher wages against
longer search in the absence of insurance, so the insurance is distorting search decisions by
shifting the cost of delaying taking a job on to the taxpayers who fund the unemployment
benefits.
However, this argument may not be valid when the unemployed have little financial wealth
and there are credit-market imperfections that limit private insurance and the ability to
borrow against future income. In this case, people obtain extremely low utility while
unemployed compared to receiving even a very low wage. This leads them to accept lower
paid jobs that are worse matches compared to what they would with perfect credit markets.
Unemployment insurance might then help produce better quality matches between jobs
and workers by giving people the ability to search more thoroughly.
In the job-search model, those searching for jobs receive an offer with probability 𝑝𝑝. An
offer is a wage 𝑤𝑤 drawn from a probability distribution with cumulative distribution function
𝐹𝐹(𝑤𝑤). It is this probability distribution that reflects the dispersion of wages across jobs. With
𝑤𝑤 ∗ denoting the reservation wage, job offers are accepted if 𝑤𝑤 ≥ 𝑤𝑤 ∗ , which has probability
1 − 𝐹𝐹(𝑤𝑤 ∗ ), and the job-finding rate that results is 𝑓𝑓 = 𝑝𝑝(1 − 𝐹𝐹 (𝑤𝑤 ∗ )). The value of being
unemployed 𝑉𝑉𝑢𝑢 includes the expected present value of 𝑉𝑉𝑒𝑒 (𝑤𝑤) conditional on 𝑤𝑤 ≥ 𝑤𝑤 ∗ with
probability 𝑓𝑓 in the next period and the present value of 𝑉𝑉𝑢𝑢 with probability 1 − 𝑓𝑓.
We now consider a more dispersed distribution of wages. Suppose the average wage that is
offered remains the same but low-wage offers are now further below the average and high-
wage offers are further above the average. As a simple example, suppose that job-offers
have one of three possible wages:
179
EC2065 Macroeconomics | Chapter 5: Unemployment
Suppose the reservation wage is initially 𝑤𝑤𝑀𝑀 (𝑤𝑤 ∗ = 𝑤𝑤𝑀𝑀 ), so low-wage offers are rejected
and medium- and high-wage offers are accepted. Since job offers with 𝑤𝑤 = 𝑤𝑤𝐿𝐿 are not
accepted anyway, a lower value of 𝑤𝑤𝐿𝐿 does not worsen the expected payoff of the
unemployed. But as job offers with 𝑤𝑤 = 𝑤𝑤𝐻𝐻 are accepted, an increase in 𝑤𝑤𝐻𝐻 means 𝑉𝑉𝑒𝑒 (𝑤𝑤𝐻𝐻 )
is higher, so the expected payoff of those searching for jobs increases. Figure 5.16 illustrates
how the expected payoff from job search increases when wages are more dispersed. Note
that by drawing the value function 𝑉𝑉𝑒𝑒 (𝑤𝑤) as a straight line, we are assuming that people do
not inherently dislike uncertainty about what job offers they will receive.
Figure 5.16: More wage dispersion increases the value of search
The increase in the value of searching for jobs implies that 𝑉𝑉𝑢𝑢 rises. As seen earlier, a higher
value of 𝑉𝑉𝑢𝑢 leads to a higher reservation wage 𝑤𝑤 ∗ and that reduces the job-finding rate 𝑓𝑓.
Consequently, greater wage dispersion leads to more unemployment. This example thus
illustrates the important role of differences across jobs in explaining unemployment using
the search-and-matching approach.
180
EC2065 Macroeconomics | Chapter 5: Unemployment
USA from 2000 to mid-2021. The vacancy rate is usually lower than the unemployment rate,
being typically around 2–3 per cent. Nonetheless, significant numbers of vacancies co-exist
with significant numbers of people searching for jobs. Although vacancies and
unemployment co-exist, it is also apparent there is almost always a clear negative
relationship between the two.
Figure 5.17: The vacancy rate and the unemployment rate
The negative relationship between the vacancy and unemployment rates is shown as a
scatterplot in Figure 5.18. Most of the time, the points trace out a downward-sloping curve
known as a ‘Beveridge curve’. However, this relationship displays occasional shifts, for
example, in 2009. An even bigger shift occurs with the COVID crisis in 2020 and, as of
writing, it remains to be seen whether subsequent data points will return to the original
Beveridge curve, or a Beveridge curve in a new position.
Figure 5.18: The Beveridge curve
181
EC2065 Macroeconomics | Chapter 5: Unemployment
create and the level of wages, rather than taking these as given as we did earlier in the job-
search model.
The equilibrium search model has three elements:
1. A ‘matching function’: a representation of the frictions in the search and matching
problem of workers and firms.
2. A wage-bargaining problem: there is no competitive labour market to determine
wages, so we must think of wages as coming from bargaining between workers and
firms.
3. A job-creation decision of firms: given the frictions in hiring and the wages that arise
from bargaining, how many jobs do firms want to create?
182
EC2065 Macroeconomics | Chapter 5: Unemployment
both a person receiving a job offer, so the worker is acceptable to the firm, and the offer
being acceptable to the worker. Therefore, the average job-finding rate 𝑓𝑓 is the number of
matches 𝑚𝑚 divided by number of unemployed 𝑢𝑢:
𝑚𝑚 𝜇𝜇𝜇𝜇(𝑢𝑢, 𝑣𝑣)
𝑓𝑓 = =
𝑢𝑢 𝑢𝑢
5.6.3 Explaining the Beveridge curve
To explain the empirical regularity of a negative relationship between vacancies and
unemployment, we combine our earlier stock-flow accounting exercise and its implications
for the steady-state unemployment rate with the newly introduced matching function.
Taking as given the job-separation rate 𝑠𝑠 and the job-finding rate 𝑓𝑓, the steady-state
unemployment rate is:
𝑠𝑠
𝑢𝑢 =
𝑠𝑠 + 𝑓𝑓
We do not distinguish between the actual and steady-state unemployment rates 𝑢𝑢 and 𝑢𝑢∗
because we suppose convergence to the steady state is sufficiently rapid that the difference
can be ignored. We assume the job-separation rate 𝑠𝑠 is exogenous here, as we did earlier in
the job-search model. The job-finding rate is 𝑓𝑓 = 𝑚𝑚/𝑢𝑢, with new matches given by the
matching function 𝑚𝑚 = 𝜇𝜇𝜇𝜇(𝑢𝑢, 𝑣𝑣).
Multiplying both sides of the equation by 𝑠𝑠 + 𝑓𝑓 and noting 𝑓𝑓𝑓𝑓 = 𝑚𝑚, it follows that vacancies
𝑣𝑣 and unemployment 𝑢𝑢 satisfy the equation 𝑠𝑠𝑠𝑠 + 𝜇𝜇𝜇𝜇(𝑢𝑢, 𝑣𝑣) = 𝑠𝑠. The left-hand side is
increasing in both 𝑢𝑢 and 𝑣𝑣 and the right-hand side is a fixed parameter, hence, we deduce
that high vacancies 𝑣𝑣 must be associated with low unemployment 𝑢𝑢. A matching function
𝑚𝑚 = 𝜇𝜇𝜇𝜇(𝑢𝑢, 𝑣𝑣) that is increasing in 𝑢𝑢 and 𝑣𝑣 and the labour-market stock-flow accounting are
sufficient to explain the Beveridge curve. This theoretical Beveridge curve relationship is
depicted in Figure 5.19. A change in the matching function 𝑚𝑚 = 𝜇𝜇𝜇𝜇(𝑢𝑢, 𝑣𝑣) or the parameter
𝑠𝑠 will modify this relationship and cause the Beveridge curve to shift.
Figure 5.19: The model-implied Beveridge curve
The equilibrium search model is consistent with the Beveridge curve (BC). It is able to
explain why vacancies and unemployment co-exist, and why there is usually a negative
183
EC2065 Macroeconomics | Chapter 5: Unemployment
relationship between the two. We now go further and analyse which point on the BC the
economy will reach. This is done by introducing the notion of ‘market tightness’, the ratio of
vacancies to unemployment.
184
EC2065 Macroeconomics | Chapter 5: Unemployment
Suppose a wage 𝑤𝑤 is agreed. A person who would remain or become unemployed if there
were no deal gains 𝑤𝑤 − 𝑏𝑏 per period from a deal, where 𝑏𝑏 is the level of unemployment
benefits. A worker thus has a surplus of 𝑤𝑤 − 𝑏𝑏 from employment. Assume each worker
produces goods of value 𝑦𝑦 per period once employed, hence, a firm gains 𝑦𝑦 − 𝑤𝑤 per period
by employing an extra person. Assume that each period of time a job vacancy is unfilled, a
firm must incur a recruitment cost 𝑐𝑐 to try to fill the position. Firms thus stand to gain an
amount 𝑦𝑦 − 𝑤𝑤 + 𝑐𝑐 𝑣𝑣 ⁄𝑢𝑢 per extra worker they hire, so a firm’s surplus is 𝑦𝑦 − 𝑤𝑤 + 𝑐𝑐𝑐𝑐.
Both surpluses must be positive for a deal to be in the interests of both parties. The total
surplus, the sum of the gains to both parties, is:
(𝑤𝑤 − 𝑏𝑏) + (𝑦𝑦 − 𝑤𝑤 + 𝑐𝑐𝑐𝑐) = 𝑦𝑦 − 𝑏𝑏 + 𝑐𝑐𝑐𝑐
Observe that the wage 𝑤𝑤 cancels out from this expression because it is a transfer between
the two parties. If the total surplus is positive then there are wages 𝑤𝑤 where both parties
would gain from a deal. The total surplus is always positive under the weak requirement
that 𝑦𝑦 > 𝑏𝑏, meaning a worker’s output exceeds the value of unemployment benefits. We
assume this in what follows.
When the total surplus is positive, there are many possible wages that give both parties a
positive surplus. The split of the surplus is resolved by assuming a specific form of bargaining
known as Nash bargaining. This has the bargaining powers 𝛾𝛾 and 1 − 𝛾𝛾 of workers and firms
given by an exogenous parameter 𝛾𝛾 between 0 and 1. The bargaining powers are the shares
of the total surplus received by each party.
The wage 𝑤𝑤 that achieves a split of the total surplus giving workers a share 𝛾𝛾 is given by:
𝑤𝑤 − 𝑏𝑏 = 𝛾𝛾 (𝑦𝑦 − 𝑏𝑏 + 𝑐𝑐𝑐𝑐)
The wage resulting from Nash bargaining is thus:
𝑤𝑤 = (1 − 𝛾𝛾)𝑏𝑏 + 𝛾𝛾𝛾𝛾 + 𝛾𝛾𝛾𝛾𝛾𝛾
This equation is referred to as the wage curve (WC). It implies a positive relationship
between market tightness 𝜃𝜃 and the bargained wage 𝑤𝑤. In Figure 5.20, the upward-sloping
wage curve is drawn in a diagram with 𝜃𝜃 on the horizontal axis and 𝑤𝑤 on the vertical axis.
Intuitively, a tighter labour market makes the cost to a firm of finding a substitute or
replacement for a worker more expensive because it will take longer to fill an open vacancy.
This puts workers in a stronger bargaining position.
It can be seen from the wage curve equation that WC shifts upwards if 𝑏𝑏 increases.
Intuitively, higher unemployment benefits raises the outside option of workers when
bargaining. An increase in productivity 𝑦𝑦 shifts WC upwards because this increases the gains
to a firm from striking a deal.
185
EC2065 Macroeconomics | Chapter 5: Unemployment
186
EC2065 Macroeconomics | Chapter 5: Unemployment
This equation implies a negative relationship between wages 𝑤𝑤 and tightness 𝜃𝜃.
Geometrically, it is represented in Figure 5.21 by a downward-sloping job-creation (JC) curve
drawn on the same axes as the wage curve. Intuitively, lower wages lead to more job
creation and a tighter labour market, all else equal. The JC curve shifts upwards if
productivity 𝑦𝑦 increases, and downwards if recruiting costs 𝑐𝑐 rise.
Figure 5.21: The job-creation curve
The equilibrium search model is completed by putting together the upward-sloping wage
curve (WC) and the downward-sloping job-creation curve (JC) in a diagram with market
tightness 𝜃𝜃 on the horizontal axis and wages 𝑤𝑤 on the vertical axis. This is shown in the left
panel of Figure 5.22. The intersection between WC and JC determines equilibrium wages 𝑤𝑤 ∗
and labour market tightness 𝜃𝜃 ∗ . Once market tightness is known, this determines the
gradient of the ray from the origin to the Beveridge curve and, hence, the equilibrium levels
of vacancies and unemployment in the right panel of the figure.
Figure 5.22: Equilibrium wages and market tightness
187
EC2065 Macroeconomics | Chapter 5: Unemployment
188
EC2065 Macroeconomics | Chapter 5: Unemployment
The shift down of the wage curve reduces wages 𝑤𝑤 but equilibrium market tightness 𝜃𝜃 rises
moving along the job-creation curve (JC). Higher 𝜃𝜃 increases the gradient of the ray from the
origin to the Beveridge curve (BC), so this results in a movement up the Beveridge curve
with firms creating more vacancies 𝑣𝑣 and the unemployment rate 𝑢𝑢 falling.
As well as changing the average values of 𝑤𝑤, 𝜃𝜃, 𝑣𝑣, and 𝑢𝑢, the flatter wage curve also makes
market tightness 𝜃𝜃 more responsive to shifts of the job-creation curve. Movements along
the Beveridge curve would become larger, so unemployment 𝑢𝑢 becomes more volatile
whenever the job-creation curve shifts. This is because wages are tied more closely to
workers’ fixed outside option 𝑏𝑏, so are effectively ‘stickier’ and less responsive to shocks.
189
EC2065 Macroeconomics | Chapter 5: Unemployment
Having a job-support scheme would avoid some or all of these losses, so we can
assess the gains from the scheme by adding up all the losses from its absence. Note
that because 𝑣𝑣 = 𝜃𝜃 ∗ 𝑢𝑢 at all times during the recovery, the loss can be simplified to
(𝑦𝑦 − 𝑏𝑏 + 𝑐𝑐𝜃𝜃 ∗ )(𝑢𝑢 − 𝑢𝑢∗ ) at a point when the unemployment rate is 𝑢𝑢.
The dynamics of unemployment rate are given by 𝑢𝑢′ − 𝑢𝑢 = 𝑠𝑠(1 − 𝑢𝑢) − 𝑓𝑓𝑓𝑓, and the
steady state is the solution of the equation 𝑠𝑠(1 − 𝑢𝑢∗ ) = 𝑓𝑓𝑢𝑢∗ . Hence, the dynamics of
the gap between unemployment 𝑢𝑢 and its steady-state value 𝑢𝑢∗ are:
190
EC2065 Macroeconomics | Chapter 6: Money
Chapter 6: Money
Up to this point, our study of macroeconomics has focused only on real variables. In ignoring
any reference to money, we have implicitly assumed individuals do not suffer ‘money
illusion’ and that they are able to make decisions solely with reference to real values and
relative prices. But are there reasons why money matters that are missing from our earlier
analysis? In other words, why is money important for the functioning of markets?
Essential reading
• Williamson, Chapters 12 and 18.
191
EC2065 Macroeconomics | Chapter 6: Money
person B have a good that the other person wants. In a specialised economy producing a
vast range of goods and services, a double coincidence of wants is rare.
For trade to take place in markets, it must be in interests of both parties. Since double
coincidences of wants are hard to find, barter exchange is difficult. Exchange is made easier
if one party is willing to accept money because then trade requires only a single coincidence
of wants.
192
EC2065 Macroeconomics | Chapter 6: Money
However, unless there is a double coincidence of wants for all trade, this implicitly assumed
a very high degree of coordination, or that short-term credit is freely available and works
without any friction. The reason is that a budget constraint allows for purchases in a period
even though people may not have yet received payment for what they plan to sell. An
alternative approach is known as ‘search’ theory. In a search model, all trade is
decentralised and occurs in meetings between pairs of individuals rather than in centralised
markets.
193
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.2 below modifies the earlier example so that one person can produce a physical
good and one person wants to consume that good. However, there is still no double
coincidence of wants for direct barter exchange. But all trade is possible if everyone accepts
the physical good in exchange for what they produce even if they do not want to consume
it. This enables the physical good to serve as a commodity money.
In the past, precious metals were a common form of commodity money, which satisfy the
first two of these requirements well. The use of coinage and convertible notes or tokens
added extra convenience, helping to satisfy the third and fourth requirements.
The advantage of a system of commodity money is that the limited supply and intrinsic
value of the commodity should give confidence that money will be a good store of value, or
at least not too bad. The disadvantage is that the system ties up valuable goods as money,
which either cannot be used directly, or there must be extra production of the commodity,
which has a cost.
194
EC2065 Macroeconomics | Chapter 6: Money
But IOUs could in principle become money if the initial holder uses it to make a payment to
someone else and then that person might pass it on to someone else as well. Consequently,
the IOU is held by a third party at redemption.
Figure 6.3 returns to the example with three individuals who produce only services. Since
services must usually be consumed at the point they are produced, this rules out the use of
commodity money. However, if everyone is willing to accept someone’s IOU, that IOU can
circulate as money. Through the use of this credit money, all three individuals are able to
purchase the services they desire.
Figure 6.3: Trade with private IOUs
195
EC2065 Macroeconomics | Chapter 6: Money
If banks have created IOUs that are accepted for payments then it is easy to see how these
can be used to facilitate exchange among the three individuals in the earlier example. Thus,
trade can take place using credit money issued by banks even if the individuals in the
economy cannot persuade others to use their own IOUs as money.
A system of credit money has some important advantages. It is efficient system with a low
resource cost because it does not tie up goods with intrinsic value to be used as money.
Even if bank IOUs are claims to commodity money, banks would not need to hold 100 per
cent of deposits as vault cash with intrinsic value. Furthermore, as we will see, banks can
loan deposits to support long-term investment. Some of the return on these investments
can be paid to depositors as interest, making bank deposits a better store of value.
However, credit money also has disadvantages. The biggest of these is that default by banks
on their IOUs may cause a collapse of confidence in the monetary system. Bank runs and
bank failure disrupt trade and trigger financial crises. Defaults by banks may be due to losses
made on their loans, or even caused by a bank run itself. These problems also lead to
pressure for bailouts from the government, creating a problem of moral hazard (‘too big to
fail’).
As we have discussed, individuals’ own IOUs cannot circulate as money. However, it is
possible that some forms of credit can be substitutes for money in making payments, for
example, credit cards. An individual paying with a credit card does not need to hold money
at the time of making a purchase, hence, this payment method acts as a substitute for
money. Essentially, the financial intermediaries that issue credit cards endorse individuals’
IOUs so others can be assured these debts will be repaid. The ability to use credit in this way
reduces frictions in payment because there is less need to hold money. However, such a
system of credit-based payments has costs for financial intermediaries coming from the
need to track credit histories and collect debt repayments.
196
EC2065 Macroeconomics | Chapter 6: Money
Figure 6.4 below shows how trade can take place using fiat money in the earlier example
with three individuals. Initially, someone holds a unit of fiat money and everyone accepts it
for payments. The fiat money circulates among the individuals and everyone can consume
the service they desire. Note that the fiat money remains in circulation after all the
exchanges have taken place. This implicitly assumes the fiat money will go on being used for
future trade.
Fiat money shares the advantage of credit money in being an efficient, low-cost system
because the intrinsically worthless money that is used has a negligible resource cost (there
are still some costs of production for the notes and coins, and costs of handling cash for the
private sector). It is important that individuals can easily recognise units of fiat money as
genuine but this can be achieved to a sufficient degree of accuracy with appropriate anti-
forgery devices.
Figure 6.4: Trade with fiat money
The potential disadvantages of fiat monetary systems also stem from fiat money lacking any
intrinsic value. As currency has a much lower cost for the government to produce it than its
market value and, as there is no obligation to redeem it, there is a temptation to issue more
fiat currency to raise revenue. The abuse of this money-issuing power by governments
results in money being a poor store of value and, at worst, hyperinflation.
Furthermore, because fiat money is not redeemable for anything other than itself, its value
depends on the belief that others will continue to accept it for payments. Note that in Figure
6.4, the fiat money is never withdrawn from circulation. This means that those choosing to
accept money must always believe that others will continue to accept money in the future.
In principle, such beliefs could be subject to self-fulfilling shifts because the belief that
others will not accept fiat money justifies individuals choosing not to accept it.
However, in practice, the concern about self-fulfilling losses of confidence in fiat currency
may be mitigated by the government’s power of taxation. Governments can insist on
payment of taxes in their own currencies, which ensures there is always some demand for
197
EC2065 Macroeconomics | Chapter 6: Money
money. The payment of taxes in fiat money to the government could also be seen as a
mechanism through which fiat money is withdrawn from circulation.
198
EC2065 Macroeconomics | Chapter 6: Money
6.3.1 Inflation
Inflation is defined as a general rise in prices quoted in terms of money. Inflation affects
how good or bad money is as a real store of value.
In previous chapters, we have measured real variables in terms of a homogeneous good or
basket of goods. Let 𝑃𝑃 denote the price of this good, or basket of goods, in terms of units of
money. If the price level in the current period is 𝑃𝑃, the notation for the price level in the
next period is 𝑃𝑃′. The rate of inflation between these time periods is denoted by 𝜋𝜋:
𝑃𝑃′ − 𝑃𝑃
𝜋𝜋 =
𝑃𝑃
Note that this definition of inflation refers to the percentage change in prices between the
current level and the level that will prevail in the future. It is also possible to measure
inflation between the past and current periods and where that inflation rate is relevant, the
notation will be adjusted to accommodate it. Note also that the future price level 𝑃𝑃′ and the
resulting inflation rate are not known in current period. Where the distinction between and
expected inflation is important, the notation 𝜋𝜋 𝑒𝑒 will be used to denote expected inflation.
Figure 6.5 shows data on inflation for the USA in the post-war period. Inflation is volatile at
the end of the 1940s but becomes very low and stable in the 1950s. The 1960s see an
increase in the inflation rate, which reaches double digits in the 1970s. Inflation is brought
under control in the 1980s and remains stable throughout the 1990s. This stability continues
into the 2000s except for the years around the 2007–8 financial crisis and its aftermath.
To calculate the real change in spending power when holding money, the value of money,
plus any interest 𝑖𝑖𝑚𝑚 that accrues, is adjusted for changes in the money prices of goods and
services. Take the amount of money 𝑃𝑃 that currently buys one unit of goods. If held simply
as money then this becomes (1 + 𝑖𝑖𝑚𝑚 )𝑃𝑃 units of money in the future period and with a price
level 𝑃𝑃′ , it would be possible to buy (1 + 𝑖𝑖𝑚𝑚 )𝑃𝑃/𝑃𝑃′ units of goods in the future. The
199
EC2065 Macroeconomics | Chapter 6: Money
definition of the real return 𝑟𝑟𝑚𝑚 on money is that holding an amount of money sufficient to
purchase a unit of goods now yields purchasing power over 1 + 𝑟𝑟𝑚𝑚 units of future goods.
The percentage real return on money 𝑟𝑟𝑚𝑚 is therefore calculated from the equation:
(1 + 𝑖𝑖𝑚𝑚 )𝑃𝑃 1 + 𝑖𝑖𝑚𝑚 1 + 𝑖𝑖𝑚𝑚
1 + 𝑟𝑟𝑚𝑚 = = =
𝑃𝑃′ 𝑃𝑃′⁄𝑃𝑃 1 + 𝜋𝜋
Observe that (1 + 𝑟𝑟𝑚𝑚 )(1 + 𝜋𝜋) = 1 + 𝑖𝑖𝑚𝑚 implies 1 + 𝑟𝑟𝑚𝑚 + 𝜋𝜋 + 𝑟𝑟𝑚𝑚 𝜋𝜋 = 1 + 𝑖𝑖𝑚𝑚 . If 𝑟𝑟𝑚𝑚 𝜋𝜋 is small
compared to 𝑟𝑟𝑚𝑚 and 𝜋𝜋, the real return on money is approximately given by 𝑟𝑟𝑚𝑚 ≈ 𝑖𝑖𝑚𝑚 − 𝜋𝜋. In
the case where no interest is paid on money (𝑖𝑖𝑚𝑚 = 0), for example, when money is
interpreted as cash, then the real return is approximately 𝑟𝑟𝑚𝑚 ≈ −𝜋𝜋. This says that the
inflation rate is approximately the percentage loss of purchasing power of money over time.
Figure 6.5: US inflation
200
EC2065 Macroeconomics | Chapter 6: Money
amount of money returned when the bond matures in the next period is (1 + 𝑖𝑖 )𝑃𝑃. Dividing
this by the future price level 𝑃𝑃′ gives the amount of future goods that can be purchased. The
equation then follows by noting that 𝑟𝑟 is defined so that buying nominal bonds worth a unit
of goods today gives the ability to buy 1 + 𝑟𝑟 units of goods in the future.
Rearranging the equation gives 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 + 𝑟𝑟𝑟𝑟, so if 𝑟𝑟 and 𝜋𝜋 are small, the term 𝑟𝑟𝑟𝑟 is
negligible compared to the other terms, and it follows that 𝑟𝑟 ≈ 𝑖𝑖 − 𝜋𝜋. This equation, which
can also be written as 𝑖𝑖 ≈ 𝑟𝑟 + 𝜋𝜋, is the approximate version of the Fisher equation. It states
that the real return on bonds is the difference between the nominal interest rate and the
inflation rate. This is an approximation, and in contexts where the inflation rate can be very
high, the exact version 1 + 𝑖𝑖 = (1 + 𝑟𝑟)(1 + 𝜋𝜋) will be used.
201
EC2065 Macroeconomics | Chapter 6: Money
If no interest is paid on money (𝑖𝑖𝑚𝑚 = 0), for example where money is physical cash, then the
opportunity cost is simply 𝑖𝑖. In this case, the level of nominal interest rates on bonds is a
measure of the opportunity cost of holding money.
A time series of US real interest rates is shown in Figure 6.6. This shows that real interest
rates are positive but low on average (around 2 per cent). The 1980s featured much higher
real interest rates, which peaked at close to 10 per cent. Real interest rates were positive
but lower in the 1960s and the 1990s. There are also times of negative real interest rates in
the late 1940s, 1970s and, more recently, from the aftermath of the 2008 financial crisis
through to 2021.
Figure 6.6: US real and nominal interest rates
As we will see in this chapter, the nominal interest rate also matters independently of the
level of real interest rates. This is because it affects the relative returns on money and bonds
in a world where at least some forms of money pay no interest, which influences how
households allocate wealth between different assets.
Empirically, US nominal interest rates have almost always been positive, though there have
been long spells where they have been close to zero, most notably after the 2008 financial
crisis. On average, nominal interest rates are higher than real interest rates, reflecting the
positive average rate of inflation. The broad pattern for US nominal interest rates is that
they were very low in the 1940s, increased over the subsequent decades to peak close to 15
per cent in the early 1980s and then declined in through to the time of writing (2021).
202
EC2065 Macroeconomics | Chapter 6: Money
Nominal interest rates are clearly positive on average, indicating a positive opportunity cost
of holding money, particularly cash. In some countries, nominal interest rates have
occasionally turned negative. We will return to that unusual case later in this chapter.
Taking as given 𝑃𝑃 and 𝑌𝑌, the minimum amount of money holdings consistent with the
transaction constraint can only be reduced and thus forgone interest saved, by increasing 𝑋𝑋.
As we now discuss, increasing 𝑋𝑋 has costs that we can compare to forgone interest to derive
households’ and firms’ demand for money.
203
EC2065 Macroeconomics | Chapter 6: Money
The benefit of higher 𝑋𝑋 is a reduction in forgone interest but this uses up time or incurs
transaction costs. These costs in real terms are specified by the function 𝑍𝑍(𝑋𝑋), which is
increasing in 𝑋𝑋. We assume 𝑍𝑍(𝑋𝑋) has the properties 𝑍𝑍(0) = 0, 𝑍𝑍 ′ (𝑋𝑋) > 0, and 𝑍𝑍 ′′ (𝑋𝑋) > 0,
the third of these implying that the marginal cost 𝑍𝑍′(𝑋𝑋) is increasing in 𝑋𝑋.
Conditional on 𝑋𝑋, the lowest money holdings can be is 𝑀𝑀 = 𝑃𝑃(𝑌𝑌 − 𝑋𝑋). An increase of 𝑋𝑋 by 1
reduces the need to hold real money balances 𝑀𝑀/𝑃𝑃 by 1, which reduces the real value of
forgone interest 𝑖𝑖𝑖𝑖/𝑃𝑃 on money holdings by 𝑖𝑖. The marginal benefit of higher 𝑋𝑋 is thus
equal to 𝑖𝑖. The marginal cost of higher 𝑋𝑋 is 𝑍𝑍 ′ (𝑋𝑋), which we will denote by 𝑞𝑞 in what
follows. The optimal choice of 𝑋𝑋 ∗ is where the marginal benefit equals the marginal cost:
𝑖𝑖 = 𝑍𝑍′(𝑋𝑋 ∗ )
The marginal cost function 𝑍𝑍′(𝑋𝑋) is shown as an upward-sloping line in Figure 6.7 with 𝑋𝑋 on
the horizontal axis and the marginal cost 𝑞𝑞 on the vertical axis. The optimal value of 𝑋𝑋 ∗ for a
particular nominal interest rate 𝑖𝑖 is derived by drawing a horizontal line at 𝑞𝑞 = 𝑖𝑖 and finding
where it intersects the marginal cost function. The figure shows that a higher nominal
interest rate 𝑖𝑖 leads to an increase in the optimal 𝑋𝑋 ∗ . Intuitively, if money is a worse store of
value, meaning that the opportunity cost 𝑖𝑖 is higher, it is rational to make more efforts to
avoid holding it.
Figure 6.7: Optimal reduction in money holdings
Having derived 𝑋𝑋 ∗ , agents’ demand for money 𝑀𝑀𝑑𝑑 is the minimum amount allowed by the
transaction constraint (assuming 𝑖𝑖 > 0, so there is a positive opportunity cost):
This a money demand function of the form 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑖𝑖), where real money demand
𝐿𝐿(𝑌𝑌, 𝑖𝑖 ) = 𝑌𝑌 − 𝑋𝑋 ∗ (𝑖𝑖) is increasing in 𝑌𝑌 and decreasing in 𝑖𝑖.
The choice of how much money to hold is related to the following measure of the velocity of
money 𝑉𝑉, which is defined by 𝑀𝑀𝑉𝑉 = 𝑃𝑃𝑃𝑃. This is a measure of how fast a unit of money
circulates in a given period as it is used for multiple transactions. Since 𝑉𝑉 = 𝑃𝑃𝑃𝑃⁄𝑀𝑀𝑑𝑑 =
𝑌𝑌/(𝑌𝑌 − 𝑋𝑋 ∗ (𝑖𝑖 )), velocity is inversely related to 𝑀𝑀𝑑𝑑 and increases with the opportunity cost 𝑖𝑖
204
EC2065 Macroeconomics | Chapter 6: Money
of holding money. Intuitively, money circulates faster with people holding it for shorter
periods when money is a poor store of value.
The demand function 𝑋𝑋 𝑑𝑑 (𝑞𝑞) is plotted alongside the supply function 𝑋𝑋 𝑠𝑠 (𝑞𝑞) in Figure 6.8.
The demand curve shifts vertically if the nominal interest rate 𝑖𝑖 changes, moving upwards if
𝑖𝑖 rises. The equilibrium of the market for credit facilities is at the intersection of the demand
and supply curves. Assuming 𝑋𝑋 ∗ < 𝑌𝑌, so some amount of money is held to make payments,
the equilibrium features 𝑞𝑞 ∗ = 𝑖𝑖, so the credit fee is equal to the nominal interest rate on
bonds. An increase in 𝑖𝑖 shifts the demand function upwards, so the equilibrium 𝑋𝑋 ∗ (𝑖𝑖) rises
with 𝑖𝑖. As it does not make sense to hold more money than required to satisfy the
transaction constraint when 𝑖𝑖 > 0, the money demand function is 𝑀𝑀𝑑𝑑 = 𝑃𝑃�𝑌𝑌 − 𝑋𝑋 ∗ (𝑖𝑖 )�,
which has the same form as seen earlier with the first interpretation of 𝑋𝑋.
205
EC2065 Macroeconomics | Chapter 6: Money
The function 𝐿𝐿(𝑌𝑌, 𝑖𝑖 ) = 𝑌𝑌 − 𝑋𝑋 ∗ (𝑖𝑖 ) for holdings of real money balances 𝑀𝑀𝑑𝑑 /𝑃𝑃 increases in 𝑌𝑌
and decreases in 𝑖𝑖. Nominal money demand 𝑀𝑀𝑑𝑑 is proportional to the price level 𝑃𝑃 for given
real transactions and interest rates because higher prices scale up the need for units of
money to make payments. Money demand 𝑀𝑀𝑑𝑑 increases with 𝑌𝑌 as higher GDP means more
transactions. Money demand 𝑀𝑀𝑑𝑑 decreases with 𝑖𝑖 because a higher opportunity cost
increases incentives to reduce money holdings through various means.
Figure 6.8: The market for credit facilities
The money demand function is plotted against the price level 𝑃𝑃 in the left panel of Figure
6.9 for given values of real GDP 𝑌𝑌 and the nominal interest rate 𝑖𝑖. It is an upward-sloping
straight line because nominal money demand is proportional to the price level. The demand
function pivots to the right if 𝑌𝑌 increases or 𝑖𝑖 falls.
Figure 6.9: The demand for money
The relationship between the nominal interest rate 𝑖𝑖 and real money demand 𝑀𝑀𝑑𝑑 /𝑃𝑃 is
depicted in the right panel of Figure 6.9. The negative relationship reflects the incentive to
reduce money holdings when the opportunity cost 𝑖𝑖 is high. Mathematically, the demand
curve represents the optimality condition 𝑖𝑖 = 𝑍𝑍′(𝑋𝑋), where 𝑋𝑋 = 𝑌𝑌 − (𝑀𝑀⁄𝑃𝑃) using the
206
EC2065 Macroeconomics | Chapter 6: Money
binding transaction constraint. The demand curve shifts to the right if 𝑌𝑌 increases. In the
special case 𝑖𝑖 = 0, there is no forgone interest when holding money and the optimal value
of 𝑋𝑋 is 0. Moreover, there is no incentive to reduce money holdings until the transaction
constraint just holds. Hence, with a zero nominal interest rate, money demand is 𝑀𝑀𝑑𝑑 ≥
𝑃𝑃𝑃𝑃(𝑌𝑌, 0) = 𝑃𝑃𝑃𝑃, which corresponds to a horizontal line at 𝑖𝑖 = 0. Money demand thus
becomes perfectly interest elastic at 𝑖𝑖 = 0.
Finally, we note that when money itself pays interest at rate 𝑖𝑖𝑚𝑚 , all references to 𝑖𝑖 above in
the money demand function should be replaced by the correct opportunity cost 𝑖𝑖 − 𝑖𝑖𝑚𝑚 .
We derive the amount of goods 𝑤𝑤𝑚𝑚 that can be purchased in the same month when a
household supplies an additional hour of labour paid money wage 𝑊𝑊, holding constant the
household’s future plans for consumption and labour supply. The effective real purchasing
power of a household’s wages during the month is 𝑤𝑤𝑚𝑚 , which will generally differ from the
real cost 𝑤𝑤 = 𝑊𝑊/𝑃𝑃 to firms when wages are paid at the end of the month.
Real purchases 𝑤𝑤𝑚𝑚 cost 𝑃𝑃𝑤𝑤𝑚𝑚 units of money. If the household holds extra money 𝑃𝑃𝑤𝑤𝑚𝑚
instead of bonds during the month to make the purchases then this reduces nominal wealth
by (1 + 𝑖𝑖 )𝑃𝑃𝑤𝑤𝑚𝑚 at beginning of next month. To leave future spending plans unchanged, this
needs to be replenished with the extra wages 𝑊𝑊 received at the end of the month, hence,
(1 + 𝑖𝑖 )𝑃𝑃𝑤𝑤𝑚𝑚 = 𝑊𝑊. Dividing both sides by 𝑃𝑃 implies that 𝑤𝑤𝑚𝑚 is related to 𝑤𝑤 as follows:
207
EC2065 Macroeconomics | Chapter 6: Money
𝑤𝑤
𝑤𝑤𝑚𝑚 =
1 + 𝑖𝑖
This equation says that the effective purchasing power of the wages households receive is
reduced by 𝑖𝑖 because spending more requires holding more money, which forgoes interest.
Alternatively, the household could maintain the same average money holdings during the
month and avoid forgoing interest. However, this requires swapping between money and
other assets more frequently (higher 𝑋𝑋) during the month to cover the additional spending.
But this entails transaction costs 𝑞𝑞 = 𝑍𝑍′(𝑋𝑋) per unit of extra spending. Deducting these
from the wage received implies 𝑃𝑃𝑤𝑤𝑚𝑚 = 𝑊𝑊 − 𝑞𝑞𝑞𝑞𝑤𝑤𝑚𝑚 . It follows that 𝑤𝑤𝑚𝑚 = 𝑤𝑤⁄(1 + 𝑞𝑞).
Finally, credit could be used for the extra purchases made during the month. This requires
paying a fee 𝑞𝑞𝑞𝑞𝑤𝑤𝑚𝑚 at the end of month. Deducting that from the wage implies 𝑃𝑃𝑤𝑤𝑚𝑚 = 𝑊𝑊 −
𝑞𝑞𝑞𝑞𝑤𝑤𝑚𝑚 and hence, 𝑤𝑤𝑚𝑚 = 𝑤𝑤⁄(1 + 𝑞𝑞). In Section 6.4, we saw that households’ optimal choice
of money holdings implies 𝑞𝑞 = 𝑍𝑍 ′ (𝑋𝑋) = 𝑖𝑖, so this means that 𝑤𝑤𝑚𝑚 = 𝑤𝑤/(1 + 𝑖𝑖) whichever
way of paying for current consumption that households choose.
Households’ labour supply decision in a monetary economy with the timing restriction on
receiving and spending wages equates the marginal rate of substitution 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 between
leisure and current consumption to the effective current purchasing power of the wage 𝑤𝑤𝑚𝑚 :
𝑤𝑤
𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 =
1 + 𝑖𝑖
The right-hand side of the equation is lower when the opportunity cost 𝑖𝑖 rises, indicating
that money is worse as a store of value. Since working and consuming more depends on
holding money for some time, a positive opportunity cost 𝑖𝑖 works in a way similar to a
proportional tax 𝜏𝜏 on wages. We know from Section 1.5 that a proportional income tax on
wages means the households’ labour supply is determined by 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 = (1 − 𝜏𝜏)𝑤𝑤.
This logic points to one way that money matters for real GDP. If money is worse as a store of
value (a high opportunity cost 𝑖𝑖) then the implicit tax on economic activity rises. This leads
to a lower labour supply, shifting the 𝑁𝑁 𝑠𝑠 curve to the left. All else equal, there is less
employment and lower production, which causes a shift of the 𝑌𝑌 𝑠𝑠 curve to the left.
Households are worse off, which reduces consumption demand and shifts the 𝑌𝑌 𝑑𝑑 curve to
the left as well. Consequently, real GDP 𝑌𝑌 is lower. If 𝑌𝑌 𝑠𝑠 and 𝑌𝑌 𝑑𝑑 shift by the same amount,
then the real interest rate remains unchanged.
208
EC2065 Macroeconomics | Chapter 6: Money
How does the central bank change the money supply? In other words, how does new money
enter circulation or existing money is removed from circulation? We will see that there are
two basic ways this can happen:
• Open-market operations
• Transfers.
Since fiat money is intrinsically worthless, the resource costs of creating new money are
negligible and we ignore them in our analysis.
6.6.1 Open-market operations
An open-market operation is where the central bank buys or sells assets. When the central
bank buys assets, it pays with newly created money, which increases the quantity of money
in circulation. When the central bank sells assets, it receives existing money as payment,
which is effectively removed from circulation.
The central bank can in principle buy any asset in an open market operation or sell any asset
it already holds. It usually transacts with the private sector through its dealings with
commercial banks (rather than buying bonds directly from the government). Traditionally,
open-market operations were in markets for short-term government bonds, or repos
(repurchase/resale agreements) of long-term government bonds.
These assets were chosen because they have low credit risk and a short maturity and thus
protect the central bank from capital losses that would make it harder to reverse an
expansionary open-market operation in the future. But since the 2008 financial crisis, many
central banks have also made outright purchases of long-term bonds or risky assets, for
example, quantitative easing (QE) purchases of mortgage-backed securities in the US.
6.6.2 Transfers
A transfer payment is where the central bank distributes money without acquiring any asset
in return, for example, the payment of central-bank profits to a country’s finance ministry.
These profits often arise as a normal outcome of the central bank’s operations and are
distributed to the finance ministry as the owner of the central bank.
However, in principle – putting aside legal rules – a central bank can create new money and
simply distribute it to the finance ministry or others. This could mean directly paying for
government expenditure, or giving the government money to compensate for lower tax
revenues. The case of a direct payment of new money to households is known as a
‘helicopter drop’ of money, though the same economic effect could be achieved by a
transfer to the government to fund a tax cut for households.
6.6.3 Monetary policy
The decisions the central bank makes that affect the supply of money are described as its
monetary policy. For now, we assume monetary policy is an exogenous supply of money
𝑀𝑀 𝑠𝑠 . This can be represented as a perfectly inelastic money supply curve. This supply curve
shifts if monetary policy changes. We will consider later what monetary policy should be
chosen to meet the objectives of a country’s government.
209
EC2065 Macroeconomics | Chapter 6: Money
210
EC2065 Macroeconomics | Chapter 6: Money
future real GDP growth rate), or the current real interest rate 𝑟𝑟 relative to its future level 𝑟𝑟′,
or 𝜋𝜋 relative to 𝜋𝜋′. All else being equal, this means 𝑌𝑌 = 𝑌𝑌′, 𝑟𝑟 = 𝑟𝑟′, and 𝜋𝜋 = 𝜋𝜋′. Note we have
not ruled out that monetary policy affects the levels of 𝑌𝑌, 𝑟𝑟, or 𝜋𝜋. The equation for money-
market equilibrium then reduces to 𝑀𝑀′⁄𝑀𝑀 = 𝑃𝑃′⁄𝑃𝑃. With definitions 𝑀𝑀′⁄𝑀𝑀 = 1 + 𝜇𝜇 and
𝑃𝑃′ /𝑃𝑃 = 1 + 𝜋𝜋, money-market equilibrium therefore implies:
𝜋𝜋 = 𝜇𝜇
The rate of inflation 𝜋𝜋 is equal to the money-supply growth rate 𝜇𝜇, which means that
inflation is determined by monetary policy through the choice of 𝜇𝜇. Intuitively, increases in
the money supply shift the 𝑀𝑀 𝑠𝑠 curve to the right, which imply that the intersection with 𝑀𝑀𝑑𝑑
occurs at a higher price level 𝑃𝑃 to leave holdings of real money balances unchanged.
211
EC2065 Macroeconomics | Chapter 6: Money
How serious an issue is the instability of money demand? The left panel of Figure 6.12
reports a time series of the quantity of money 𝑀𝑀 𝑠𝑠 in the USA relative to nominal GDP 𝑃𝑃𝑃𝑃.
Note that this uses the M1 measure of the money supply, which is broader than the
monetary base and that we study further in Chapter 7. The measure 𝑀𝑀 𝑠𝑠 /(𝑃𝑃𝑃𝑃) is the
inverse of the velocity of money 𝑉𝑉 = (𝑃𝑃𝑃𝑃)/𝑀𝑀. In equilibrium, it is also equal to
(𝑀𝑀𝑑𝑑 ⁄𝑃𝑃)/𝑌𝑌, which is real money demand relative to real GDP 𝑌𝑌. The scaling by GDP is done
to control for changes in the demand for money owing to transactions rising with GDP.
We see that M1 as a fraction of GDP followed a stable trend prior to the 1980s but has
experienced various shifts in the 1980s, 1990s, and 2000s. The right panel of the figure is a
scatterplot of 𝑀𝑀 𝑠𝑠 /(𝑃𝑃𝑃𝑃) against the nominal interest rate 𝑖𝑖, which should show the
downward-sloping real money demand curve scaled by GDP. However, the plot indicates
this relationship has been unstable.
Figure 6.12: Demand for money according to the M1 measure in the USA
The exercise is repeated for the broader M2 measure of the US money supply in Figure
6.13. The time series in the left panel suggests the demand for M2 (relative to GDP) has
212
EC2065 Macroeconomics | Chapter 6: Money
been more stable than M1. The scatterplot in the right panel comes closer to tracing out
something that resembles a negative relationship between real money demand (scaled by
GDP) and the nominal interest rate 𝑖𝑖, though this relationship still appears to shift at some
points in time.
Overall, the evidence presented here suggests we cannot be confident that regulating the
money-supply growth rate will give tight control over inflation.
213
EC2065 Macroeconomics | Chapter 6: Money
With 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋, and 𝜋𝜋 = 𝜇𝜇 in equilibrium with a constant money-supply growth rate 𝜇𝜇, the
central-bank profits definition of seigniorage can be broken down into:
𝑖𝑖𝑖𝑖 𝑟𝑟𝑟𝑟 𝜋𝜋𝜋𝜋 𝑟𝑟𝑟𝑟 𝜇𝜇𝜇𝜇
= + = +
𝑃𝑃 𝑃𝑃 𝑃𝑃 𝑃𝑃 𝑃𝑃
This is the saving of real interest payments on bonds otherwise issued plus the erosion of
existing money’s real value due to new money being created.
6.8.4 Limits on real seigniorage revenues
As seigniorage arises from money being less good a store of value than other assets, it is an
implicit tax on money. Seigniorage is closely related to the notion of forgone interest we
saw in the analysis of money demand in Section 6.4 and is essentially identical to the total
amount of forgone interest on money.
If money becomes a worse store of value because the nominal interest rate 𝑖𝑖 is higher then
real money demand 𝑀𝑀𝑑𝑑 ⁄𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖) falls. Real seigniorage revenues are 𝑖𝑖𝑖𝑖⁄𝑃𝑃 = 𝑖𝑖𝑖𝑖(𝑌𝑌, 𝑖𝑖),
so there are two conflicting effects of higher 𝑖𝑖. First, the direct effect of money being worse
as a store of value. Second, the indirect effect of falling real money demand reducing the
real value of seigniorage. This means the relationship between real seigniorage revenues
and 𝑖𝑖 is not unambiguously positive. Observe that seigniorage is zero if 𝑖𝑖 = 0 and becomes
zero again for high 𝑖𝑖 if real money demand 𝐿𝐿(𝑌𝑌, 𝑖𝑖) falls towards zero sufficiently fast as 𝑖𝑖
increases. This gives rise to a Laffer curve for real seigniorage revenues as shown in Figure
6.14, indicating there are limits on the amount of real seigniorage government can obtain.
6.8.5 The inflation tax
If there is an unpredictable increase in the inflation rate 𝜋𝜋, the nominal interest rate 𝑖𝑖 on
bonds cannot rise to leave the real return 𝑟𝑟 unchanged. An inflation surprise thus reduces
214
EC2065 Macroeconomics | Chapter 6: Money
both the real value of nominal government bonds as well as existing money. This is reflected
in the ex-post real interest rate 𝑟𝑟 being less than the ex-ante real interest rate 𝑟𝑟 𝑒𝑒 .
The fiscal advantage derived from such surprise inflation is referred to here as an ‘inflation
tax’. A different term is used because the mechanism through which the inflation tax works
is distinct from the source of seigniorage revenue discussed earlier. With the definitions
adopted here, seigniorage revenues derive only from money not bonds and do not depend
on inflation being a surprise. In contrast, the inflation tax depends on inflation that was
unexpected when nominal bonds were first issued. This means that, ex post, the inflation
tax does not have any incentive effects on behaviour – like a lump-sum tax – because it is
completely unexpected.
There is no inflation tax on the real value of nominal bonds when the inflation is anticipated.
In this case, the real return is protected from expected inflation when the nominal interest
rate 𝑖𝑖 adjusts in advance. Inflation-indexed bonds are also protected against surprise
inflation and offer a guaranteed real return.
Figure 6.14: Seigniorage Laffer curve
215
EC2065 Macroeconomics | Chapter 6: Money
In what follows, we take the case of an increase in the money supply 𝑀𝑀 for illustration. We
will see that the model predicts this increase in 𝑀𝑀 has no real effects at all. This result is
shown in the supply-and-demand diagrams for the goods, labour and money markets
depicted in Figure 6.15. But what is the logic for this striking claim?
Figure 6.15: Permanent increase in money supply
216
EC2065 Macroeconomics | Chapter 6: Money
First, and most importantly, prices and wages expressed in units of money are fully flexible
here. With no impediments to price adjustment, the same real wage 𝑤𝑤 and real interest rate
𝑟𝑟 can continue to ensure supply and demand are brought into equilibrium in the labour and
goods markets. Moreover, there is no money illusion – everyone’s decisions depend on
relative prices and real variables.
Second, the policy change does not affect perceptions of how good money is as a store of
value going forwards between the current and future time periods. Since 𝑖𝑖 = 𝑟𝑟, there is no
change in the nominal interest rate 𝑖𝑖 unless 𝑟𝑟 changes. The nominal interest rate 𝑖𝑖 is a
measure of how bad money is as a store of value relative to other assets. This means no
greater tax on economic activity that depends on holding money is expected and, hence,
there is no reason for the labour and output supply curves to shift through the effect of 𝑖𝑖 on
the labour-supply condition 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 = 𝑤𝑤/(1 + 𝑖𝑖).
Third, while existing holdings of money and nominal government bonds are caught by a
surprise inflation tax that reduces the real value of households’ financial assets, the inflation
tax also allows the government to reduce other taxes and still pay for the same level of
public expenditure. These tax cuts offset the reduction in the value of financial assets and
there is no wealth effect overall on households – Ricardian equivalence holds after
accounting for the government budget constraint.
Thus, we conclude there are no reasons for any shifts of the 𝑁𝑁 𝑑𝑑 , 𝑁𝑁 𝑠𝑠 , 𝑌𝑌 𝑑𝑑 , or 𝑌𝑌 𝑠𝑠 curves.
Therefore, the equilibrium values of 𝑤𝑤 ∗ , 𝑟𝑟 ∗ , 𝑁𝑁 ∗ , and 𝑌𝑌 ∗ are unaffected. A permanent change
in the money supply has no real effects. No change in 𝑌𝑌 or 𝑖𝑖 means there is no shift of the
money demand curve 𝑀𝑀𝑑𝑑 . The rightward shift of the money supply curve 𝑀𝑀 𝑠𝑠 thus leads 𝑃𝑃 to
rise in proportion to 𝑀𝑀. Given these predictions, money is said to be ‘neutral’.
6.9.2 A permanent change in the growth rate of the money supply
Alternatively, suppose there is a permanent adjustment of the growth rate of the money
supply 𝜇𝜇. This money-supply growth rate is defined by 𝜇𝜇 = (𝑀𝑀′ − 𝑀𝑀)/𝑀𝑀, hence, the future
money supply is given by 𝑀𝑀′ = (1 + 𝜇𝜇)𝑀𝑀. The change in 𝜇𝜇 is exogenous, unexpected and no
further adjustments of 𝜇𝜇 are expected. Note that there is no change in the initial money
supply 𝑀𝑀 here.
Since the policy change affects expectations of the future money supply, inflation
expectations 𝜋𝜋 = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃 adjust. As we have seen in Section 6.7, the effect on the
equilibrium current and future price levels 𝑃𝑃 and 𝑃𝑃′ is such that 𝜋𝜋 = 𝜇𝜇, so any changes in
money-supply growth are reflected one-for-one in changes in expected inflation.
Let us take the case of faster money growth for illustration. We will see that increasing the
money growth rate does have real effects. These are depicted in the supply-and-demand
diagrams in Figure 6.16.
217
EC2065 Macroeconomics | Chapter 6: Money
The logic for the real effects is that higher money growth 𝜇𝜇 raises expectations of future
inflation 𝜋𝜋. The Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 then implies the nominal interest rate 𝑖𝑖 is higher
for each value of the real interest rate 𝑟𝑟. From the labour-supply equation 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 =
𝑤𝑤/(1 + 𝑖𝑖), higher 𝑖𝑖 has a negative effect on labour supply. Intuitively, because money is a
worse store of value, the implicit tax on economic activity rises, which causes the supply of
labour to decline. Consequently, the output supply curve 𝑌𝑌 𝑠𝑠 shifts to the left and, as this
change is permanent, 𝐶𝐶 𝑑𝑑 falls in line with income, leading to a leftward shift of 𝑌𝑌 𝑑𝑑 of same
size as the shift of 𝑌𝑌 𝑠𝑠 .
If a permanent change in the money supply growth were to have no effects on any real
variables then we would say that money is ‘superneutral’. The term ‘neutrality’ used earlier
refers to there being no real effects of a permanent change to the level of 𝑀𝑀. We see in
Figure 6.16 that the model predicts money is not superneutral. Permanently faster growth
of the money supply reduces real GDP and employment because money is less good as a
store of value. This inflationary policy has a negative real effect on the economy’s supply
side.
In the money market, there is no initial change in 𝑀𝑀, so no shift of the money supply curve
𝑀𝑀 𝑠𝑠 to begin with. The money demand curve 𝑀𝑀𝑑𝑑 pivots to the left as there are fewer
transactions due to lower GDP 𝑌𝑌 and more efforts to economise on holding money or make
use of money substitutes (higher 𝑋𝑋) because of higher 𝑖𝑖. This leads real money balances
𝑀𝑀/𝑃𝑃 to fall as 𝑀𝑀𝑑𝑑 /𝑃𝑃 is lower, which causes an immediate jump up in the level of prices 𝑃𝑃.
Box 6.3: Money supply increases that the central bank announces are
temporary
We have looked at the consequences for prices, inflation and real economic variables of
permanent changes to the quantity of money or the growth rate of the money supply. But
central banks might change the money supply temporarily in some circumstances.
For example, quantitative easing (QE) might increase the money supply but it is the central
bank’s stated intention to unwind the policy in the future. QE expansions of money
218
EC2065 Macroeconomics | Chapter 6: Money
supply have turned out to be persistent in most countries, although this may not have
been expected when they were first begun. There are cases where QE has been
temporary, such as the Bank of Japan QE policy from 2001, which was largely reversed in
2006. Another example of a temporary change is the ‘de-monetization’ experiment in
India in 2016, where there was a temporary decline of the money supply.
To see what difference it makes when a money-supply change is expected to be
temporary, suppose 𝑀𝑀 𝑠𝑠 = 𝑀𝑀 is expected to change for only one time period. Throughout,
we hold the expected future money supply 𝑀𝑀′ constant. Consequently, the equilibrium
future price level 𝑃𝑃′ does not change in this example. The Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 and
the definition of expected inflation 𝜋𝜋 = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃 imply that the nominal interest rate 𝑖𝑖
is:
𝑃𝑃′ − 𝑃𝑃 𝑃𝑃′
𝑖𝑖 = 𝑟𝑟 + = 𝑟𝑟 + − 1
𝑃𝑃 𝑃𝑃
Money-market equilibrium is the equation 𝑀𝑀 = 𝑀𝑀 𝑠𝑠 = 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑖𝑖), and, hence:
𝑃𝑃′
𝑀𝑀 = 𝑃𝑃𝑃𝑃 �𝑌𝑌, 𝑟𝑟 + − 1�
𝑃𝑃
The key point to note is that a higher price level 𝑃𝑃 lowers the nominal interest rate 𝑖𝑖 here,
so the effect of 𝑃𝑃 on money demand is magnified. We ignore here any effect of 𝑖𝑖 on 𝑌𝑌 (but
accounting for that would further boost the impact of 𝑃𝑃 on 𝑀𝑀𝑑𝑑 ). In what follows, we
assume that nominal interest rate 𝑖𝑖 remains positive throughout. Figure 6.17 shows the
relationship between 𝑀𝑀𝑑𝑑 and 𝑃𝑃 in this case for given 𝑌𝑌 and 𝑟𝑟 and the relationship in the
case where changes in the money supply are permanent, in which case money demand
𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑟𝑟) is proportional to 𝑃𝑃 and is thus represented by a straight line in the
diagram.
Figure 6.17: Temporary increase in money supply
219
EC2065 Macroeconomics | Chapter 6: Money
Following temporary increase in 𝑀𝑀, the money supply curve 𝑀𝑀 𝑠𝑠 shifts to the right as usual.
If the policy is expected to be reversed in future, any rise in the price level is also expected
to be reversed. Therefore, a higher price level 𝑃𝑃 would create expectations of future
deflation, reducing the nominal interest rate 𝑖𝑖 and boosting money demand. In the
diagram, 𝑀𝑀𝑑𝑑 is thus less steep than the usual 𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑟𝑟) money-demand function. It follows
that the price level rises by proportionately less than 𝑀𝑀 does, in contrast to the case of a
permanent change where 𝑃𝑃 rises in proportion to 𝑀𝑀.
This different prediction compared to the case of a permanent change in 𝑀𝑀 is likely to be
quantitatively significant. If there were a 25 per cent higher money supply temporarily and
𝑃𝑃 went up by 25 per cent initially then this would require 25 per cent expected deflation
subsequently. But that cannot be an equilibrium because 𝑖𝑖 ≥ 0 implies deflation cannot
exceed the much lower equilibrium value of the real interest rate 𝑟𝑟. The price level 𝑃𝑃 must
therefore rise by far less than 25 per cent.
In respect of transaction costs, we noted that there are costs of using substitutes for money,
or using time and effort to economise on holding money. Those resource costs are
represented by the function 𝑍𝑍(𝑋𝑋) and they constitute a social cost of carrying out
transactions. On the other hand, when holding money, the cost of forgone interest is not a
social cost because the government gains an equal amount of seigniorage revenues.
Forgone interest is simply a transfer from holders of money to issuers of money.
6.10.2 Monetary policy, efficiency, and the Friedman rule
We now consider how the choice of monetary policy affects the efficiency of the economy’s
equilibrium. The demand for labour 𝑁𝑁 𝑑𝑑 is given by 𝑀𝑀𝑀𝑀𝑁𝑁 = 𝑤𝑤 and labour supply 𝑁𝑁 𝑠𝑠 is
determined by 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 = 𝑤𝑤⁄(1 + 𝑖𝑖). This equation for 𝑁𝑁 𝑠𝑠 comes from wages being paid as
money that must be held for some period before it can be spent, as explained in Section 6.5.
When the labour market is in equilibrium (𝑁𝑁 𝑑𝑑 = 𝑁𝑁 𝑠𝑠 ), it follows that 𝑀𝑀𝑀𝑀𝑁𝑁 = 𝑤𝑤 =
(1 + 𝑖𝑖 )𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 . A positive value of the nominal interest rate 𝑖𝑖 implies 𝑀𝑀𝑀𝑀𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑀𝑀𝑙𝑙,𝐶𝐶 , which
corresponds to employment and output being inefficiently low relative to the optimal
outcome with 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 .
220
EC2065 Macroeconomics | Chapter 6: Money
Since 𝜋𝜋 = 𝜇𝜇 = −𝑟𝑟 < 0, the Friedman rule requires deflation. In other words, the inflation
rate needed for economic efficiency is negative. Higher rates of inflation 𝜋𝜋, including zero or
positive rates, imply that 𝑖𝑖 is higher, which means the economy’s equilibrium is further away
from what is efficient. The Friedman rule thus provides a way to understand the costs of
inflation but also suggests that deflation is a good thing.
6.10.4 The fiscal implications of following the Friedman rule
Implementing the Friedman rule has fiscal implications because no seigniorage revenue is
received when 𝑖𝑖 = 0. Governments therefore need to find alternative sources of tax
revenue to continue to pursue their plans for public expenditure.
Another way to think of this is that the negative inflation rate required for the Friedman rule
implies there is a positive real return on non-interest-bearing money coming from its
purchasing power growing over time. This makes it as good a store of value as bonds with
real return 𝑟𝑟. Since money offers the same return as bonds, money is effectively being
treated by the government as a debt liability that must be repaid. The deflation that
supports the Friedman rule is achieved by buying back money to reduce its supply, which
works like repaying a debt. Money can only be repurchased by the central bank selling its
assets or using tax revenue transferred from the government. Hence, other taxes must rise
if government spending is to remain unchanged.
221
EC2065 Macroeconomics | Chapter 6: Money
Note that if other taxes create distortions, for example, income tax, then it may not be
optimal to follow the Friedman rule because this would replace one distortion (positive 𝑖𝑖)
with another distortion (a higher income tax rate). We have implicitly been assuming that
the lost seigniorage revenue could be replaced by lump-sum taxes.
While such effects are theoretically present even at single-digit rates of inflation, they are
likely to be very small in that case. The implicit tax rate on economic activity coming from
inflation is approximately equal to the nominal interest rate 𝑖𝑖 and, to be precise, this is the
nominal interest rate over the period of time people cannot avoid holding on to cash they
receive, not the annual nominal interest rate. Taking that period to be no more than a
month, single-digit annual inflation rates would not generate an implicit tax rate of more
than 1 per cent, resulting in a generally small social cost. However, in a hyperinflation with
monthly inflation rates above 50 per cent, it is easy to see that the implicit tax rate can be
very high, even if people try to shorten the period over which they hold on to cash they
receive. Hence, the social cost of inflation through this mechanism is far larger in a
hyperinflation.
222
EC2065 Macroeconomics | Chapter 6: Money
of the government’s needs. Thus, a hyperinflation can easily spiral out of control with
severe consequences for the economy unless the government can reduce its expenditure
or find alternative sources of tax revenue.
Our analysis of the demand for money in Section 6.4 was based on a comparison of
benefits and costs. The marginal cost of holding higher real money balances 𝑀𝑀𝑑𝑑 /𝑃𝑃 is 𝑖𝑖, the
forgone interest on bonds. The marginal benefit of higher 𝑀𝑀𝑑𝑑 /𝑃𝑃 was 𝑞𝑞 = 𝑍𝑍 ′ (𝑋𝑋)
previously. Now, a marginal value 𝑒𝑒 of the tax-evasion advantage of money is added to this
and the overall marginal benefit of higher 𝑀𝑀𝑑𝑑 /𝑃𝑃 is 𝑞𝑞 + 𝑒𝑒.
The demand for cash is found where the marginal benefit equals the marginal cost, i.e.
where 𝑖𝑖 = 𝑞𝑞 + 𝑒𝑒 = 𝑍𝑍 ′ (𝑋𝑋) + 𝑒𝑒. This is equivalent to the equation 𝑍𝑍 ′ (𝑋𝑋) = 𝑖𝑖 − 𝑒𝑒, so the
nominal interest rate 𝑖𝑖 in the usual money demand function 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖) is replaced by
𝑖𝑖 − 𝑒𝑒 when cash has a tax evasion advantage. The new money demand function 𝑀𝑀𝑑𝑑 /𝑃𝑃 =
𝐿𝐿(𝑌𝑌, 𝑖𝑖 − 𝑒𝑒) is higher at each interest rate 𝑖𝑖, so there is a rightward shift of 𝑀𝑀𝑑𝑑 plotted
against 𝑖𝑖 as shown in Figure 6.18.
While individuals might gain from the use of cash for tax evasion, the marginal private
benefit 𝑒𝑒 is not a social benefit of using money. When tax evasion occurs, other taxes need
to be higher to pay for public expenditure. Furthermore, cash also facilitates criminal
activity, imposing negative externalities on others. These considerations mean it is not
desirable for monetary policy to maximise individuals’ use of cash in the way that it would
if the Friedman rule were followed.
223
EC2065 Macroeconomics | Chapter 6: Money
Since the only social benefit of money is in reducing transaction costs 𝑞𝑞 = 𝑍𝑍′(𝑋𝑋), optimal
monetary policy should aim to push 𝑞𝑞 to zero, ignoring both the private cost of money in
terms of forgone interest 𝑖𝑖 (not a social cost) and the private benefit 𝑒𝑒 when money is
used for tax evasion (not a social benefit). With 𝑖𝑖 = 𝑞𝑞 + 𝑒𝑒 in equilibrium, this suggests
monetary policy should aim for 𝑖𝑖 = 𝑒𝑒 > 0, a positive nominal interest rate 𝑖𝑖 and a higher
rate of inflation 𝜋𝜋 (or less deflation) than what is implied by the Friedman rule. By
following this policy, the implicit tax on money through 𝑖𝑖 > 0 cancels out the tax evasion
advantage. Money being a worse store of value makes tax evasion harder and provides a
way to tax illegal activities.
Technological advances have made transactions by debit card and bank transfers much
cheaper and easier, even for small payments. As the payments system now enables bank
deposits to be used for almost all transactions, it has been suggested that physical cash
can and should be abolished. Since bank deposits lack the anonymity of cash, eliminating
cash would help to reduce tax evasion and criminal activity. But some argue that cash
might occasionally offer greater convenience and, more importantly, the anonymity
offered by cash could be a desirable feature in preserving individuals’ privacy and civil
liberties.
224
EC2065 Macroeconomics | Chapter 6: Money
if money demand is unstable, so a monetary target might not be desirable with these other
objectives in mind. This section explores how monetary policy works if the central bank uses
an interest rate as its operating target.
Rather than setting the money supply 𝑀𝑀 at some target level, the central bank now has a
target for the nominal interest rate 𝑖𝑖. However, market interest rates are not directly
controlled by the central bank, so it needs to vary some policy instrument under its control
to achieve its interest rate target. We assume this means the use of open-market
operations. The central bank must be willing to increase or decrease 𝑀𝑀 so that the money
market is in equilibrium at its target for the nominal interest rate 𝑖𝑖. This makes the supply of
money 𝑀𝑀 become an endogenous variable determined by the equation:
𝑀𝑀
= 𝐿𝐿(𝑌𝑌, 𝑖𝑖 )
𝑃𝑃
What are the implications of conducting monetary policy in this way? We will focus here on
the determination of prices and inflation, assuming that monetary policy does not affect
real GDP 𝑌𝑌 or the real interest rate 𝑟𝑟.
The Fisher equation is 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 ′𝑒𝑒 , where 𝜋𝜋 ′𝑒𝑒 = (𝑃𝑃′𝑒𝑒 − 𝑃𝑃)/𝑃𝑃 explicitly denotes expected
inflation between the current and future time periods, which might not be the same as the
realised inflation rate. In equilibrium, expected inflation must be
𝜋𝜋 ′𝑒𝑒 = 𝑖𝑖 − 𝑟𝑟
Given an exogenous target for 𝑖𝑖 and an equilibrium for 𝑟𝑟 that is independent of monetary
policy, this equation determines a unique equilibrium for expected inflation 𝜋𝜋 ′𝑒𝑒 . A low
nominal rate 𝑖𝑖 is associated with low expected inflation in equilibrium. But what about the
absolute level of prices 𝑃𝑃 and the inflation rate between the past and current period?
With an endogenous money supply 𝑀𝑀, there are many possible price levels 𝑃𝑃 consistent
with 𝑀𝑀⁄𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖). This is because 𝑀𝑀 adjusts to ensure this equation holds at the central
bank’s target level of 𝑖𝑖. It follows that setting a target for interest rates does not by itself
give the economy a ‘nominal anchor’ – the level of prices 𝑃𝑃 in terms of money is
indeterminate. In practice, what this means is that a fixed 𝑖𝑖 policy does not rule out
unexpected fluctuations in inflation, even though expected inflation is determinate.
A traditional monetary target does provide a nominal anchor in the sense of there being a
unique equilibrium for the levels of prices and inflation. Given a demand for real money
balances, an exogenous nominal quantity of money means there is only one possible level of
prices in equilibrium. But we have seen that a fixed money supply not desirable if real
money demand fluctuates. Moving away from targeting the money supply is desirable but
simply setting a fixed target for the nominal interest rate has pitfalls. The way forward is to
consider an alternative approach to monetary policy, the use of an interest-rate feedback
rule, the most famous example of which being the Taylor rule.
225
EC2065 Macroeconomics | Chapter 6: Money
𝑖𝑖 = 𝑟𝑟̂ + 𝜋𝜋 ∗ + 𝜙𝜙(𝜋𝜋 − 𝜋𝜋 ∗ )
This describes how the central bank sets the nominal interest rate 𝑖𝑖. According to the Taylor
rule, the level of 𝑖𝑖 should depend on the actual rate of inflation 𝜋𝜋 that occurs between the
past and current period. The coefficient 𝜙𝜙 indicates how strongly the central bank reacts to
inflation 𝜋𝜋 missing its target 𝜋𝜋 ∗ . If 𝜋𝜋 is one percentage point higher then 𝑖𝑖 is raised by 𝜙𝜙
percentage points. The special case of 𝜙𝜙 = 0 represents an exogenous interest rate target
that is not adjusted to the actual inflation rate. Finally, the term 𝑟𝑟̂ denotes the central
bank’s estimate of the equilibrium real interest rate 𝑟𝑟.
It is argued that a sufficiently strong response to inflation, as measured by the parameter 𝜙𝜙,
ensures the equilibrium inflation rate 𝜋𝜋 is determinate (and will be on target 𝜋𝜋 ∗ if the
estimate of 𝑟𝑟 is correct). To be precise, a sufficiently strong response means that 𝜙𝜙 > 1, so
the nominal interest rate reacts more than one-for-one to inflation. This is known as the
‘Taylor principle’.
To see the argument, we combine the Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 ′𝑒𝑒 written explicitly in terms
of expected inflation 𝜋𝜋 ′𝑒𝑒 = (𝑃𝑃′𝑒𝑒 − 𝑃𝑃)/𝑃𝑃 between the current and future periods and the
Taylor rule 𝑖𝑖 = 𝑟𝑟̂ + 𝜋𝜋 ∗ + 𝜙𝜙(𝜋𝜋 − 𝜋𝜋 ∗ ). By eliminating 𝑖𝑖 from these equations:
𝑟𝑟 + 𝜋𝜋 ′𝑒𝑒 = 𝑟𝑟̂ + 𝜋𝜋 ∗ + 𝜙𝜙(𝜋𝜋 − 𝜋𝜋 ∗ ) = 𝑟𝑟̂ + 𝜋𝜋 + (𝜙𝜙 − 1)(𝜋𝜋 − 𝜋𝜋 ∗ )
This can be rearranged to write an equation for the expected change in the inflation rate
over time:
𝜋𝜋 ′𝑒𝑒 − 𝜋𝜋 = (𝜙𝜙 − 1)(𝜋𝜋 − 𝜋𝜋 ∗ ) − (𝑟𝑟 − 𝑟𝑟̂ )
The Taylor principle 𝜙𝜙 > 1 implies the coefficient on 𝜋𝜋 − 𝜋𝜋 ∗, the extent to which inflation 𝜋𝜋
misses its target 𝜋𝜋 ∗ , is positive. Consequently, higher inflation now would mean that
inflation is expected to rise faster in the future and lower inflation now would mean that
subsequent inflation is expected to fall faster. It follows that there is only one value of
inflation 𝜋𝜋 where subsequent inflation is not expected to keep rising or keep falling
(although this argument ignores the lower bound on 𝑖𝑖 as explained later in Box 6.6).
If 𝜋𝜋 ′𝑒𝑒 = 𝜋𝜋, so inflation is neither expected to rise or fall further in future, then:
𝑟𝑟 − 𝑟𝑟̂
𝜋𝜋 = 𝜋𝜋 ∗ +
𝜙𝜙 − 1
This is the unique stable equilibrium for inflation when 𝜙𝜙 > 1. If the Taylor principle 𝜙𝜙 > 1
is not satisfied then there are many stable paths of inflation over time that are equally
226
EC2065 Macroeconomics | Chapter 6: Money
consistent with the interest-rate rule and equilibrium in the economy. We see from the
equation that inflation is on target if the central bank’s estimate 𝑟𝑟̂ of the equilibrium real
interest rate 𝑟𝑟 is correct. Underestimating this (𝑟𝑟̂ < 𝑟𝑟) leads to inflation above the target
(𝜋𝜋 > 𝜋𝜋 ∗ ).
Our analysis of the Taylor rule suggests that central banks wanting to meet an inflation
target have two key tasks. First, to make a strong reaction to any deviation of inflation from
its target so that the Taylor principle (𝜙𝜙 > 1) is satisfied. Second, to obtain an accurate
estimate of market-clearing real interest rate 𝑟𝑟.
The marginal benefit 𝑍𝑍′(𝑋𝑋) cannot be negative but can be zero if 𝑋𝑋 has already reached 0. If
the nominal interest rate 𝑖𝑖 on bonds were negative, this would mean that cash is a better
store of value than bonds, while also having a non-negative benefit relative to bonds in
saving transaction costs. Hence, 𝑖𝑖 < 0 would imply money is always preferred to bonds,
which is not possible in equilibrium because then there would be no demand for bonds.
6.13.2 The liquidity trap
The liquidity trap is the idea that money and bonds become perfect substitutes at the
margin once the lower bound on nominal interest rates is reached. It implies increases in
the quantity of money might be passively absorbed with no impact on the economy.
At the zero lower bound 𝑖𝑖 = 0, no interest is forgone by holding more money. Furthermore,
once holdings of real money balances 𝑀𝑀𝑑𝑑 /𝑃𝑃 reach 𝑌𝑌 (the amount needed to make all
payments using money without incurring any transaction costs), the marginal benefit of
lowering transaction costs 𝑍𝑍(𝑋𝑋) by holding more money is zero. With a zero marginal cost
and a zero marginal benefit, households and firms are indifferent about whether they hold
higher 𝑀𝑀𝑑𝑑 /𝑃𝑃 or not. But 𝑀𝑀𝑑𝑑 /𝑃𝑃 can be larger than 𝑌𝑌 at 𝑖𝑖 = 0 because money has become as
good a store of value as bonds. Consequently, money demand 𝑀𝑀𝑑𝑑 is perfectly interest
elastic at the interest-rate lower bound 𝑖𝑖 = 0.
Using the same framework as in Box 6.3, we see that a temporary expansion of 𝑀𝑀 𝑠𝑠 has no
effect on either the nominal interest rate 𝑖𝑖 or the price level 𝑃𝑃 once the zero lower bound is
reached. In the left panel of Figure 6.19, the demand for real money balances is perfectly
interest elastic (horizonal) at 𝑖𝑖 = 0. The money market is still in equilibrium at 𝑖𝑖 = 0 after a
227
EC2065 Macroeconomics | Chapter 6: Money
shift of 𝑀𝑀 𝑠𝑠 /𝑃𝑃 to the right with higher 𝑀𝑀 𝑠𝑠 (assuming no change in 𝑃𝑃 for now). This additional
money is willingly held with no change in the interest rate. To confirm that the price level 𝑃𝑃
does not change either, the right panel of the figure shows the money demand function
𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑟𝑟 + (𝑃𝑃′ ⁄𝑃𝑃) − 1), which combines the Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 = 𝑟𝑟 + (𝑃𝑃′ −
𝑃𝑃)/𝑃𝑃 with 𝑀𝑀𝑑𝑑 ⁄𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖). Taking the future price level 𝑃𝑃′ as given (because the change in
𝑀𝑀 𝑠𝑠 is only temporary), money demand is perfectly elastic with respect to the price level 𝑃𝑃
once 𝑖𝑖 = 0 is reached. The horizontal demand curve implies that the rightward of the supply
curve 𝑀𝑀 𝑠𝑠 does not change the equilibrium price level.
Figure 6.19: Money demand at zero nominal interest rate
It is important to note this argument does not apply to permanent expansions of the money
supply 𝑀𝑀 𝑠𝑠 where the central bank can convince people that it will never reverse the policy
change. This should affect expectations of future prices 𝑃𝑃′, which would shift 𝑀𝑀𝑑𝑑 =
𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑟𝑟 + (𝑃𝑃′ ⁄𝑃𝑃) − 1), resulting in a change of 𝑃𝑃 or 𝑖𝑖 or both.
228
EC2065 Macroeconomics | Chapter 6: Money
With 𝜙𝜙 > 1, the Taylor rule equation implies 𝑖𝑖 = 0 is reached for some inflation rate 𝜋𝜋
lying between −𝑟𝑟 and 𝜋𝜋 ∗ , which is where the zero lower bound becomes binding. We can
study what happens to inflation in equilibrium by combing the Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 ′𝑒𝑒
in terms of expected future inflation 𝜋𝜋 ′𝑒𝑒 with the Taylor rule as was done in Section 6.12.
This leads to 𝜋𝜋 ′𝑒𝑒 − 𝜋𝜋 = (𝜙𝜙 − 1)(𝜋𝜋 − 𝜋𝜋 ∗ ) if 𝑖𝑖 ≥ 0 as seen in the earlier analysis of Taylor
rules. But it leads to 𝜋𝜋 ′𝑒𝑒 − 𝜋𝜋 = −𝑟𝑟 − 𝜋𝜋 if 𝑖𝑖 = 0 because the Fisher equation implies 𝜋𝜋 ′𝑒𝑒 =
−𝑟𝑟 at the interest-rate lower bound.
The relationship between the expected change in inflation 𝜋𝜋 ′𝑒𝑒 − 𝜋𝜋 and the current
inflation rate 𝜋𝜋 is plotted in Figure 6.20. The upward-sloping segment at the right of the
diagram where the Taylor principle is satisfied has already been discussed in Section 6.12
and the inflation target 𝜋𝜋 ∗ is a steady state. The downward-sloping segment on the left is
where the lower bound binds and the Taylor principle cannot be satisfied.
Figure 6.20: Multiple equilibria with a deflation trap
The diagram shows that there are paths of inflation falling below the target 𝜋𝜋 ∗ where 𝜋𝜋
eventually becomes negative and converges to another steady state at 𝜋𝜋 = −𝑟𝑟. This
second steady state is the ‘deflation trap’ that the Taylor rule fails to avoid.
229
EC2065 Macroeconomics | Chapter 6: Money
accounts, such as reserves held at the central bank, could in principle have 𝑖𝑖𝑚𝑚 < 0.
However, as long as physical cash exists as a form of money, its zero nominal return is
always available to investors.
We will argue the key point missing here is that switching large amounts of wealth into
physical cash would entail security costs of keeping it safe. Unlike bonds and most assets,
there no register of ownership of physical cash – this is why cash gives anonymity – and so
holders of cash must consider the cost of keeping this physical object secure. Assume that
holding cash entails a security or storage cost that is a proportion ℎ of the amount of cash
held. This is a resource cost of holding cash, in addition to forgone interest 𝑖𝑖 (if any).
Consider again the money demand trade-off from Section 6.4. Raising 𝑋𝑋 to reduce the
amount of cash held and increase bond holdings leads to a gain of 𝑖𝑖 + ℎ per unit increase in
𝑋𝑋, saving security costs plus any forgone interest. Raising 𝑋𝑋 by one unit has a marginal cost
𝑞𝑞 = 𝑍𝑍 ′ (𝑋𝑋), so the optimal 𝑋𝑋 is found where 𝑖𝑖 + ℎ = 𝑞𝑞 = 𝑍𝑍′(𝑋𝑋), not where 𝑖𝑖 = 𝑍𝑍′(𝑋𝑋) as
before. Since 𝑖𝑖 + ℎ replaces 𝑖𝑖 but the equation is otherwise the same, it follows that the
usual money demand function 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖) becomes 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖 + ℎ). As money
demand is decreasing in 𝑖𝑖, adding the positive value of ℎ shifts 𝑀𝑀𝑑𝑑 downwards.
Geometrically, this is a parallel downward shift by an amount ℎ of the real money demand
curve plotted against 𝑖𝑖, as shown in Figure 6.21.
Figure 6.21: Demand for cash with security cost
The diagram shows it is now possible to have an equilibrium with a negative nominal
interest rate 𝑖𝑖 < 0. Bonds with a negative nominal return are willingly held because the
security costs of switching to physical cash holdings are too large. Note that bonds
themselves are not subject to the same security problems of cash because there is a register
of ownership.
Although we can now understand why nominal interest rates can be negative, it turns out
that there is still a lower bound on 𝑖𝑖, only now a negative one. As can be seen from the
diagram, it is not possible to have 𝑖𝑖 be less than −ℎ, that is, 𝑖𝑖 cannot be below the negative
of the security cost as percentage of the value of cash stored. The lower bound is therefore
230
EC2065 Macroeconomics | Chapter 6: Money
𝑖𝑖 ≥ −ℎ. In practice, ℎ should not be much more than 1 per cent for large amounts of
physical cash, which would suggest a lower bound on 𝑖𝑖 of approximately −1 per cent.
6.14.2 Costs of negative interest rates
Although our analysis has explained why a negative nominal interest rate 𝑖𝑖 is possible, it
does not explain why one would be desirable – and indeed the logic suggests that negative
nominal interest rates have social costs.
Consider what is optimal monetary policy here. There are resource costs of both holding
more cash (ℎ) and using substitutes for cash or economising on cash holdings (𝑍𝑍′(𝑋𝑋)). All
means of payment therefore have a social cost. The marginal net social cost of lower 𝑋𝑋 (or
equivalently, higher 𝑀𝑀𝑑𝑑 /𝑃𝑃) is ℎ − 𝑍𝑍 ′ (𝑋𝑋), so economic efficiency requires ℎ = 𝑍𝑍′(𝑋𝑋). Since
𝑖𝑖 + ℎ = 𝑍𝑍′(𝑋𝑋) in equilibrium, the Friedman rule 𝑖𝑖 = 0 therefore achieves efficiency. Note
that it is now efficient to have 𝑋𝑋 > 0, so cash is not used for all transactions. Having a
negative nominal interest rate 𝑖𝑖 < 0 has social costs because it leads to over-use of cash,
which wastes resources on security and storage costs, just as the earlier argument for the
Friedman rule pointed to the waste of resources in finding substitutes for cash when 𝑖𝑖 > 0.
6.14.3 Lowering the lower bound
The analysis suggests there is a negative lower bound on the nominal interest rate but one
not too far below zero in practice. What if governments want to lower interest rates further
below zero?
It is possible to envisage changes to the monetary system that would permit this. For
example, cash might have an ‘expiry date’, with an amount deducted if people need to
convert old, expired cash into new cash (what is sometimes known as a ‘Gesell tax’). If 𝜏𝜏 is
an explicit tax on holding cash, money demand 𝑀𝑀𝑑𝑑 is determined by the equation 𝑖𝑖 + 𝜏𝜏 =
𝑞𝑞 = 𝑍𝑍′(𝑋𝑋). Analogous to having a security cost ℎ, this implies the lower bound on 𝑖𝑖 is now
−𝜏𝜏, which can be lowered by raising 𝜏𝜏. Similar outcomes can be achieved by limits on
convertibility between physical cash and other forms of money such as reserves and bank
deposits. Finally, abolishing physical cash and moving to a system of purely electronic
money would remove the lower bound on nominal interest rates entirely.
None of this analysis explains why governments should do such things and points to
inefficiencies in making it harder or more costly to make use of money for transactions. In
Chapter 9, we see that there might be circumstances where macroeconomic stabilisation
policies work more effectively if the lower bound on nominal interest rates can be
circumvented.
231
EC2065 Macroeconomics | Chapter 7: Banking and finance
Essential reading
• Williamson, Chapter 18.
232
EC2065 Macroeconomics | Chapter 7: Banking and finance
Although commercial banks hold fiat money as reserves (or vault cash), the amount is
normally smaller than the deposits households and firms hold with the commercial banks.
This is what is known as a fractional-reserve banking system, in that bank reserves are less
than 100 per cent of deposits. Not holding reserves equal to deposits is in the interests of
commercial banks because reserves typically offer an inferior return to other assets, for
example, loans made by the commercial banks or other financial assets.
Nonetheless, it is prudent for commercial banks to hold some reserves to cover transfers or
withdrawals of deposits. There may also be regulatory reserve requirements that impose
legal minimum levels of reserve holdings relative to deposits. Even where such reserve
requirements exist, banks usually hold some amount of excess reserves above the
minimum.
With the presence of deposits, cash and reserves, there are now several distinct measures
of the supply of money. The first is quantity of money usable by households or firms for
payments, referred to as the broad money supply.
Broad money supply = Bank deposits + Cash
This corresponds to M1 measure of money supply in the USA. A narrower notion of money
considers only the supply of fiat money, the narrow money supply.
Narrow money supply = Reserves + Cash
This measure is also known as the ‘monetary base’, and it corresponds to what is called M0
in some countries such as the UK. You may also encounter the terms ‘inside money’ and
‘outside money’. Inside money is deposits, which are created by private sector and ‘outside
money’ is fiat money, which is created by governments.
If transfers of funds deposited at commercial banks are treated as equivalent to fiat money
then households and firms can use either deposits or cash to make payments. One
difference between cash and deposits is that no interest is received by holding cash, while
deposits may pay interest at rate 𝑖𝑖𝑚𝑚 , where 𝑖𝑖𝑚𝑚 refers to the interest rate paid on ‘money’ as
distinct from bonds. Assuming no difference in convenience between cash and deposits
when making payments, there would be no demand for cash if 𝑖𝑖𝑚𝑚 > 0. While cash remains
in use for payments to varying degrees around the world, making this assumption allows us
to focus on the demand for bank deposits, which is what is new in this chapter. The demand
for cash has already been studied in Section 6.4.
If the interest rate on bonds (or savings deposits that cannot be transferred to make
payments) is 𝑖𝑖 then interest 𝑖𝑖 − 𝑖𝑖𝑚𝑚 is forgone when wealth is held as money in the form of
demand deposits. A theory of the demand for money can be developed along the same lines
as Section 6.4, with the interest-rate spread 𝑖𝑖 − 𝑖𝑖𝑚𝑚 replacing the opportunity cost 𝑖𝑖 of
holding cash. The demand for deposits by households and firms is given by the equation:
𝑀𝑀𝑑𝑑
= 𝐿𝐿(𝑌𝑌, 𝑖𝑖 − 𝑖𝑖𝑚𝑚 )
𝑃𝑃
233
EC2065 Macroeconomics | Chapter 7: Banking and finance
This equation specifies a demand for real money balances 𝑀𝑀/𝑃𝑃 that depends positively on
real income 𝑌𝑌 and negatively on 𝑖𝑖 − 𝑖𝑖𝑚𝑚 . The downward-sloping demand curve is depicted in
Figure 7.1 for given 𝑌𝑌, where the opportunity cost 𝑖𝑖 − 𝑖𝑖𝑚𝑚 is the variable on the vertical axis.
Figure 7.1: The demand for deposits
1. Open-market operations
2. Standing facilities
3. Reserve requirements.
Central-bank asset purchases raise 𝑅𝑅 as payment is added to the reserve accounts of the
commercial banks from which the financial assets are purchased. Central-bank asset sales
lower 𝑅𝑅 as commercial banks’ reserve balances are reduced when they make payments to
the central bank.
In principle, open-market operations can be conducted in a wide range of financial markets.
These could be outright purchases of long-term government bonds paid for with newly
created reserves. If the central bank continues to hold these bonds then the effect on the
234
EC2065 Macroeconomics | Chapter 7: Banking and finance
supply of reserves persists until the bonds mature. However, the central bank could sell the
bonds before maturity to reverse the increase in 𝑅𝑅. An alternative to outright purchases or
sales of assets is repo or reverse-repo operations. A ‘repo’ is a sale-and-repurchase
agreement where an asset is sold but also agreed to be bought back at a prearranged price
(the percentage increase in the repurchase price relative to the sale price being the interest
rate on the repo). Open-market operations with repos change the supply of reserves 𝑅𝑅 only
temporarily.
It is important to note that, while individual commercial banks can change the quantity of
reserves they hold by buying or selling financial assets, or lending or borrowing, the central
bank is the monopoly supplier of reserves 𝑅𝑅. Transfers of reserves or loans of reserves
among commercial banks do not change the aggregate supply of reserves.
If a commercial bank fails to meet its reserve requirement then it is charged interest at a
penalty rate on the size of any shortfall, or it is obliged to borrow from the central bank’s
235
EC2065 Macroeconomics | Chapter 7: Banking and finance
discount window to make good the shortfall. Here, we suppose the penalty rate is the same
as the discount rate 𝑖𝑖𝑏𝑏 . In the presence of reserve requirements, the central bank can
choose to pay interest at a different rate 𝑖𝑖𝑟𝑟𝑟𝑟 on required reserves, with 𝑖𝑖𝑟𝑟 being the interest
rate on reserves held in excess of reserve requirements.
As well as mandating reserve requirements, central banks could vary the required reserve
ratio 𝑞𝑞 but this is not often done in advanced economies.
If a bank is left with more reserves relative to reserve requirements at a point when it
cannot go back to the interbank market to manage its reserve holdings then it keeps excess
reserves in its account at the central bank and receives interest 𝑖𝑖𝑟𝑟 . If it has insufficient
reserves relative to reserve requirements and borrowing from other banks is not feasible
then it must borrow the shortfall from the central bank at penalty rate 𝑖𝑖𝑏𝑏 . By holding more
reserves by borrowing from other banks when the interbank markets are open, the first of
these outcomes becomes more likely and the second less likely.
Suppose a bank borrows an additional unit of reserves. The cost of this borrowing is
interbank interest rate 𝑖𝑖. The benefit of holding more reserves depends on whether the
bank ends up with excess reserves or a shortfall of reserves. With excess reserves, the
benefit is the extra interest 𝑖𝑖𝑟𝑟 paid on its reserve balance. With a shortfall of reserves, the
benefit of an extra unit of reserves is saving the borrowing cost 𝑖𝑖𝑏𝑏 because the shortfall is
smaller. The overall expected benefit of is a weighted average of 𝑖𝑖𝑟𝑟 and 𝑖𝑖𝑏𝑏 depending on the
probabilities of the two events. When reserves are initially low, the probability of a shortfall
is high, which means more weight is put on the saving 𝑖𝑖𝑏𝑏 when calculating the expected
benefit of borrowing more in the interbank market.
236
EC2065 Macroeconomics | Chapter 7: Banking and finance
It is optimal for commercial banks to borrow reserves up to the point where the cost 𝑖𝑖
equals expected benefit. This implies the demand to borrow reserves decreases with the
cost 𝑖𝑖. With the interbank interest rate 𝑖𝑖 on the vertical axis and the quantity of reserve
holdings on the horizontal axis, there is a downward-sloping reserve demand curve as
shown in Figure 7.2.
Figure 7.2: The demand for reserves
The demand curve is bounded in a range of interbank interest rates between 𝑖𝑖𝑟𝑟 and 𝑖𝑖𝑏𝑏 . This
is because there would be an unlimited demand to borrow reserves if the cost were below
𝑖𝑖𝑟𝑟 , with banks able to profit from arbitraging the difference between 𝑖𝑖 and 𝑖𝑖𝑟𝑟 . There would
be no demand to borrow reserves if the cost were above 𝑖𝑖𝑏𝑏 because commercial banks can
always borrow from central bank instead at interest rate 𝑖𝑖𝑏𝑏 . Since the height of the demand
curve represents the expected benefit of borrowing a unit of reserves, and the probabilities
of having an excess or shortfall of reserves depend on reserve holdings relative to deposits,
the appropriate horizontal-axis variable is the quantity of reserves 𝑅𝑅 relative to amount of
deposits 𝑀𝑀, the ratio 𝑅𝑅⁄𝑀𝑀.
The reserve demand curve shifts vertically if there are changes to the central bank’s
standing-facility interest rates 𝑖𝑖𝑟𝑟 or 𝑖𝑖𝑏𝑏 . Higher reserve requirements 𝑞𝑞 shift the demand
curve to the right. Note that the interest rate on required reserves 𝑖𝑖𝑟𝑟𝑟𝑟 does not affect the
reserve demand curve here because the amount of interest paid on required reserves does
not depend on banks’ decisions that affect the extent of their holdings of excess reserves.
237
EC2065 Macroeconomics | Chapter 7: Banking and finance
shift the reserve supply curve to the right and asset sales shift the supply curve to the left.
Taking deposits 𝑀𝑀 as given for now, the supply of 𝑅𝑅⁄𝑀𝑀 is also inelastic.
Figure 7.3 represents the interbank market equilibrium by putting together the demand and
supply curves for reserves. The equilibrium interbank interest rate 𝑖𝑖 ∗ is at the intersection of
the demand and supply curves 𝑅𝑅𝑑𝑑 and 𝑅𝑅 𝑠𝑠 .
Figure 7.3: Equilibrium in the interbank market
238
EC2065 Macroeconomics | Chapter 7: Banking and finance
We have seen that varying the terms of a pair of standing facilities in parallel allows very
precise adjustments of interest rates in the channel system. It is not necessary to know
anything about the exact position or shape of the reserve demand curve 𝑅𝑅𝑑𝑑 to achieve
this. Furthermore, while there is no guarantee that the equilibrium interest rate is
necessarily in the middle of the channel, a narrow corridor between 𝑖𝑖𝑟𝑟 and 𝑖𝑖𝑏𝑏 means 𝑖𝑖 ∗
cannot stray too far from its target. If the central bank wants to bring 𝑖𝑖 ∗ as close as
possible to the middle of the channel then it can use ‘fine-tuning’ open-market
operations to shift 𝑅𝑅 𝑠𝑠 so that it intersects 𝑅𝑅𝑑𝑑 in the middle of the channel.
Figure 7.4: Raising interest rates using the channel system
239
EC2065 Macroeconomics | Chapter 7: Banking and finance
In principle, the channel used in the channel system could be very narrow but that would
tend to reduce trade in the interbank market and result in frequent use by commercial
banks of central bank’s borrowing facility instead. To keep the interbank market active
and avoid potential credit risk exposure of the central bank, the channel needs to be
wide enough to encourage commercial banks to borrow or lend reserves among each
other in the interbank market.
Like the channel system, we see that the floor system allows interest rates to be
controlled very precisely. Moreover, it does so without the need for any ‘fine-tuning’
open-market operations to steer the market interest rate to the centre of a channel.
240
EC2065 Macroeconomics | Chapter 7: Banking and finance
As we will see in Box 7.3, the floor system has an efficiency advantage because it does
not depend on maintaining an artificial scarcity of reserves. Central banks face no
resource cost in increasing the supply of reserves but reserves have a social benefit
through their implications for commercial banks’ supply of deposits. In the floor system,
reserves are supplied up to the point where commercial banks are ‘satiated’, while in the
channel system, 𝑅𝑅 𝑠𝑠 needs to be limited to keep 𝑖𝑖 ∗ at the centre of channel. Although
decisions about the supply of reserves have no resource costs, they do have implications
for the profitability of central banks and, hence, for seigniorage revenues received by
governments.
241
EC2065 Macroeconomics | Chapter 7: Banking and finance
The real quantity of deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 supplied by commercial banks can be broken down into:
𝑀𝑀 𝑠𝑠 𝑀𝑀 𝑠𝑠 𝑅𝑅
= ×
𝑃𝑃 𝑅𝑅 𝑃𝑃
The term 𝑅𝑅/𝑃𝑃 is the real supply of reserves, with the aggregate quantity of reserves 𝑅𝑅 held
by commercial banks being determined by the reserve supply 𝑅𝑅 𝑠𝑠 chosen by the central
bank. We have seen that the costs to commercial banks of maintaining adequate reserves
imply that the ratio 𝑀𝑀 𝑠𝑠 /𝑅𝑅 increases with 𝑖𝑖 − 𝑖𝑖𝑚𝑚 . Hence, given 𝑅𝑅/𝑃𝑃, the real supply of
deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 increases with the interest margin 𝑖𝑖 − 𝑖𝑖𝑚𝑚 .
Reserve requirements imposed on commercial banks also limit deposit creation and these
impose additional costs when interest 𝑖𝑖𝑟𝑟𝑟𝑟 paid on required reserves is below the market
interest rate 𝑖𝑖. Having 𝑖𝑖𝑟𝑟𝑟𝑟 < 𝑖𝑖 means that a larger interest margin 𝑖𝑖 − 𝑖𝑖𝑚𝑚 is required by
commercial banks when creating deposits, all else equal.
242
EC2065 Macroeconomics | Chapter 7: Banking and finance
243
EC2065 Macroeconomics | Chapter 7: Banking and finance
The banking market can be analysed in a similar way to the money market of Chapter 6, the
only difference being that the quantity of money (deposits here) is endogenous owing to the
upward-sloping supply curve. The demand curve behaves in the same way as in Chapter 6.
The supply curve shifts to the right if there is an increase in the real supply of reserves 𝑅𝑅/𝑃𝑃.
A higher interest-rate penalty 𝑖𝑖𝑏𝑏 − 𝑖𝑖𝑟𝑟 faced by banks that have a shortfall of reserves would
increase the costs of reserve management and lower 𝑀𝑀 𝑠𝑠 /𝑅𝑅, shifting the deposit supply
curve to the left. Similarly, increasing reserve requirements 𝑞𝑞 when the interest rate paid on
required reserves is low (such as if 𝑖𝑖𝑟𝑟𝑟𝑟 = 𝑖𝑖𝑟𝑟 < 𝑖𝑖𝑏𝑏 ) shifts the deposit supply curve to the left.
An increase in binding bank capital requirements would also reduce the supply of deposits.
244
EC2065 Macroeconomics | Chapter 7: Banking and finance
If the central bank chooses 𝑖𝑖𝑟𝑟 = 0 then this leaves it with the ‘traditional’ approach to
implementing monetary policy. In that traditional system, open-market operations are used
to shift the reserve supply curve 𝑅𝑅 𝑠𝑠 . This changes the point of intersection with the reserve
demand curve 𝑅𝑅𝑑𝑑 in the interbank market and hence, affects the equilibrium interest rate
𝑖𝑖 ∗ as seen in Figure 7.9.
Figure 7.9: The traditional system of monetary policy implementation
However, this traditional system is subject to greater practical difficulties to the extent that
the reserve demand curve 𝑅𝑅𝑑𝑑 has an uncertain shape and position, with shocks to banks’
demand for reserves shifting 𝑅𝑅𝑑𝑑 . This means it is not certain what size of open-market
operation is needed to achieve a given change in 𝑖𝑖 ∗ . Moreover, unlike the channel system
that bounds 𝑖𝑖 ∗ in a tight range between 𝑖𝑖𝑟𝑟 and 𝑖𝑖𝑏𝑏 , only the discount-window standing
facility acts to cap 𝑖𝑖 ∗ at the interest rate 𝑖𝑖𝑏𝑏 on loans of reserves from the central bank. And
often even that was not effective because of a ‘stigma’ attached to commercial banks that
were seen to borrow from the central bank through its discount window. The traditional
system can be seen as essentially a very wide channel between 𝑖𝑖𝑟𝑟 = 0 and 𝑖𝑖𝑏𝑏 + stigma.
The cost of banking and the efficiency of money as a medium of exchange
Where money is physical cash, we saw in Section 6.10 that the Friedman rule 𝑖𝑖 = 0 yields
efficiency in the use of money as medium of exchange. Note that real money demand
𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖) is maximised at 𝑖𝑖 = 0 because this means the opportunity cost of holding
cash is zero, which avoids taxing economic activity that depends on using money.
Where money is deposits at commercial banks that pay interest rate 𝑖𝑖𝑚𝑚 , the real demand
for money is 𝑀𝑀𝑑𝑑 /𝑃𝑃 = 𝐿𝐿(𝑌𝑌, 𝑖𝑖 − 𝑖𝑖𝑚𝑚 ), with 𝑖𝑖 − 𝑖𝑖𝑚𝑚 now being the opportunity cost of holding
money. Just as there are no resource costs of central banks creating fiat money, similarly,
there are no resource costs of commercial banks creating deposits. Hence, when money is
bank deposits, the equivalent of the Friedman rule that maximises the use of money and
avoids imposing costs on economic activity that depends on money is to have 𝑖𝑖𝑚𝑚 = 𝑖𝑖.
To achieve 𝑖𝑖𝑚𝑚 = 𝑖𝑖, the real supply of deposits 𝑀𝑀 𝑠𝑠 /𝑃𝑃 must be sufficiently large. As discussed
in Section 7.1, commercial banks need to hold reserves for the payments system to
operate, even when there are no regulatory reserve requirements. Banks’ deposit creation
245
EC2065 Macroeconomics | Chapter 7: Banking and finance
decision studied in Section 7.4 then explains why deposit supply 𝑀𝑀 𝑠𝑠 is discouraged by a low
interest rate 𝑖𝑖𝑟𝑟 paid on reserves (and required reserves 𝑖𝑖𝑟𝑟𝑟𝑟 ) relative to 𝑖𝑖 and also by a high
interest rate 𝑖𝑖𝑏𝑏 for borrowing reserves if there is a shortfall.
If interest 𝑖𝑖𝑟𝑟 is paid on reserves (and 𝑖𝑖𝑟𝑟𝑟𝑟 on required reserves where relevant) then
commercial banks increase their supply of deposits for a given interest margin 𝑖𝑖 − 𝑖𝑖𝑚𝑚 . This
shifts the 𝑀𝑀 𝑠𝑠 curve to right and lowers the spread 𝑖𝑖 − 𝑖𝑖𝑚𝑚 using the banking-market
equilibrium from Section 7.5. Therefore, by paying interest on reserves, the gap between
𝑖𝑖𝑚𝑚 and 𝑖𝑖 becomes smaller and the economy moves closer to efficiency. An expansion of the
real supply of reserves 𝑅𝑅 𝑠𝑠 /𝑃𝑃 also reduces the spread 𝑖𝑖 − 𝑖𝑖𝑚𝑚 because banks paying the
borrowing cost 𝑖𝑖𝑏𝑏 becomes less likely. Paying interest on reserves – and using a floor
system – thus supports efficiency in a similar way to the Friedman rule.
It would still be possible to achieve efficiency by directly following the original Friedman
rule 𝑖𝑖 = 0. However, achieving 𝑖𝑖𝑚𝑚 = 𝑖𝑖 has the advantage that the absolute level of interest
rates 𝑖𝑖 is not restricted, and hence, the Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 does not imply that
monetary policy must generate deflation (𝜋𝜋 < 0) for money to work best as a medium of
exchange.
246
EC2065 Macroeconomics | Chapter 7: Banking and finance
zero, while as maturity increases beyond two years, interest rates are higher, reaching
almost 2 per cent for 30-year maturity bonds.
Figure 7.10: A yield curve for US government bonds
What information is conveyed by the yield curve and the term structure of interest rates?
What difference does it make to the economy whether the yield curve is upward sloping,
downward sloping or flat?
The basics of the term structure of interest rates can be illustrated by considering just two
bonds. First, a bond that pays one unit of money next period. Second, a bond that pays one
unit of money two time periods in future. These simple bonds with one payment at maturity
(no coupon payments before maturity) are known as ‘discount’ bonds. The bonds have
maturities of one and two periods respectively. Another feature of these bonds is that they
are nominal bonds, i.e. ones that specify payments in units of money that are not indexed to
inflation.
The yield to maturity (or just yield) on a bond is defined as the discount rate that makes the
present value of the payments promised by the bond equal to the market price at which the
bond currently trades. Suppose the prices in current units of money of the one- and two-
period bonds are 𝑉𝑉1 and 𝑉𝑉2 respectively, their respective yields 𝑖𝑖 and 𝐼𝐼 are defined by the
price-yield relationships:
1 1
𝑉𝑉1 = and 𝑉𝑉2 =
1 + 𝑖𝑖 (1 + 𝐼𝐼 )2
By definition, there is necessarily an inverse relationship between yields and bond prices. A
high bond yield 𝑖𝑖 means a one-period bond price 𝑉𝑉1 and similarly for the two-period bond.
Suppose a saver is considering which bond to hold. What returns are obtained between the
current and future periods? Here, we look at nominal returns. For real returns, we would
also need to adjust for inflation as discussed in Chapter 6. If a saver uses a unit of money to
buy one-period bonds at price 𝑉𝑉1 then an amount 1/𝑉𝑉1 is purchased. Each bond pays off a
unit of money in the next period, so the return is (1⁄𝑉𝑉1 ) − 1. Using 𝑉𝑉1 = 1/(1 + 𝑖𝑖), the
247
EC2065 Macroeconomics | Chapter 7: Banking and finance
return is equal to the yield 𝑖𝑖. In general, returns and yields are not the same but the return
on a discount bond that is held to maturity is equal to the yield when purchased, as seen in
this example.
If instead the saver buys two-period bonds, a unit of money can purchase a quantity 1/𝑉𝑉2 of
such bonds. By the next period, the bond has not yet matured, so the only return that can
be realised immediately comes from selling it. With one period remaining until maturity,
what was originally a two-period bond is now equivalent to a one-period bond, so its price is
now 𝑉𝑉1 ′, which denotes the price of a one-period bond in the future period. Using 𝑉𝑉1 ′ =
1/(1 + 𝑖𝑖′) and 𝑉𝑉2 = 1/(1 + 𝐼𝐼 )2 , the return on holding the two-period bond for just one
period is:
𝑉𝑉1′ − 𝑉𝑉2 (1 + 𝐼𝐼 )2 1 + 𝐼𝐼
Return = = ′
− 1 = 𝐼𝐼 + (𝐼𝐼 − 𝑖𝑖 ′ ) � �
𝑉𝑉2 1 + 𝑖𝑖 1 + 𝑖𝑖 ′
Observe that this return is not equal to the yield 𝐼𝐼, except in the special case 𝑖𝑖 ′ = 𝐼𝐼 because
this bond has not reached maturity over the period the return is calculated.
In choosing what combination of bonds to hold, savers compare the returns they offer.
These returns depend on the yields 𝑖𝑖 and 𝐼𝐼 and the return on the two-period bond also
depends on the future yield 𝑖𝑖′, which is not known with certainty now. An important point
here is that the two-period bond’s return, if it is held for only one period, is in general riskier
than that of the one-period bond. All else equal, savers are assumed to prefer higher
expected returns and lower risk.
248
EC2065 Macroeconomics | Chapter 7: Banking and finance
𝑖𝑖 = 2𝐼𝐼 − 𝑖𝑖 ′𝑒𝑒
Solving this equation for the long-term interest rate 𝐼𝐼 shows that
𝑖𝑖 + 𝑖𝑖 ′𝑒𝑒
𝐼𝐼 =
2
The theory therefore predicts that the long-term interest rate 𝐼𝐼 is equal to an average of the
current and expected future short-term interest rates 𝑖𝑖 and 𝑖𝑖 ′𝑒𝑒 until the maturity of the
long-term bond. This idea applies to bonds with longer than a two-period maturity, where
the long-term yield is an average on expected short-term interest rates over a longer
horizon.
249
EC2065 Macroeconomics | Chapter 7: Banking and finance
Individuals dislike uncertainty about whether they will get to consume 𝑐𝑐1 or 𝑐𝑐2 . We say they
are ‘risk averse’. More precisely, this means they prefer to get expected consumption 𝑐𝑐 𝑒𝑒 =
𝑡𝑡𝑐𝑐1 + (1 − 𝑡𝑡)𝑐𝑐2 for certain (𝑐𝑐 𝑒𝑒 in both the ‘good’ and ‘bad’ scenarios for the risky asset)
than receive 𝑐𝑐1 with probability 𝑡𝑡 and 𝑐𝑐2 with probability 1 − 𝑡𝑡. We can represent these
250
EC2065 Macroeconomics | Chapter 7: Banking and finance
We would like to add to this diagram the equivalent of a ‘budget constraint’, allowing us to
use the familiar framework of constrained utility maximisation to solve the portfolio choice
problem. This can be done by eliminating 𝑥𝑥 from the equations 𝑐𝑐1 = 1 + 𝑟𝑟1 + 𝑥𝑥(𝑟𝑟𝑓𝑓 − 𝑟𝑟1 )
and 𝑐𝑐2 = 1 + 𝑟𝑟2 − 𝑥𝑥(𝑟𝑟2 − 𝑟𝑟𝑓𝑓 ), showing that different portfolios lead to different
combinations of 𝑐𝑐1 and 𝑐𝑐2 with:
𝑟𝑟2 − 𝑟𝑟𝑓𝑓
𝑐𝑐2 = 1 + 𝑟𝑟2 − � � (𝑐𝑐1 − 1 − 𝑟𝑟1 )
𝑟𝑟𝑓𝑓 − 𝑟𝑟1
This relationship between 𝑐𝑐1 and 𝑐𝑐2 is plotted in Figure 7.13. Since 𝑟𝑟1 < 𝑟𝑟𝑓𝑓 < 𝑟𝑟2 , the
equation is a downward-sloping straight line. It can be seen the line always passes through
the point (1 + 𝑟𝑟𝑓𝑓 , 1 + 𝑟𝑟𝑓𝑓 ), which corresponds to 𝑥𝑥 = 1, a portfolio comprising only safe
assets. The gradient of the line is − (𝑟𝑟2 − 𝑟𝑟𝑓𝑓 )⁄(𝑟𝑟𝑓𝑓 − 𝑟𝑟1 ) and it also passes through the point
(1 + 𝑟𝑟1 , 1 + 𝑟𝑟2 ), which corresponds to 𝑥𝑥 = 0, that is, holding none of the safe asset in the
portfolio. Unlike budget constraints with no credit-market imperfections, we do not extend
this straight line all the way to the horizontal and vertical axes as the points to the right of
(1 + 𝑟𝑟𝑓𝑓 , 1 + 𝑟𝑟𝑓𝑓 ) correspond to 𝑥𝑥 greater than 1 and the points to the left of (1 + 𝑟𝑟1 , 1 + 𝑟𝑟2 )
251
EC2065 Macroeconomics | Chapter 7: Banking and finance
correspond to negative 𝑥𝑥. Such portfolios would mean ‘negative’ or short positions in one of
the two assets, which we rule out.
The gradient of the constraint can be expressed in terms of the expected risky asset return
𝑟𝑟 𝑒𝑒 = 𝑡𝑡𝑟𝑟1 + (1 − 𝑡𝑡)𝑟𝑟2 as follows:
𝑟𝑟2 − 𝑟𝑟𝑓𝑓 𝑡𝑡 1 𝑟𝑟 𝑒𝑒 − 𝑟𝑟𝑓𝑓
= + � �
𝑟𝑟𝑓𝑓 − 𝑟𝑟1 1 − 𝑡𝑡 1 − 𝑡𝑡 𝑟𝑟𝑓𝑓 − 𝑟𝑟1
The term 𝑟𝑟 𝑒𝑒 − 𝑟𝑟𝑓𝑓 is the risk premium, the expected risky-asset return 𝑟𝑟 𝑒𝑒 minus the risk-free
asset’s certain return 𝑟𝑟𝑓𝑓 . We see that the gradient of the constraint depends on the risk
premium 𝑟𝑟 𝑒𝑒 − 𝑟𝑟𝑓𝑓 relative to the amount of risk taken in holding the risky asset. The term in
the denominator is 𝑟𝑟𝑓𝑓 − 𝑟𝑟1 , the size of the loss from holding the risky asset relative to the
safe return on the risk-free asset.
The optimal portfolio choice is found where an indifference curve is tangent to the
constraint, the straight line joining the ‘all risky asset’ portfolio to the ‘all risk-free asset’
portfolio. This is depicted in Figure 7.13. The fractions 𝑥𝑥 and 1 − 𝑥𝑥 allocated to the risk-free
and risky assets can be deduced from how far the ray to the origin from the optimal
portfolio lies between the rays to the origin from the extreme portfolios.
Figure 7.13: Optimal portfolio choice
We can see from the formula for the gradient of the constraint that a positive amount of
risky asset can be held (𝑥𝑥 < 1) only if 𝑟𝑟 𝑒𝑒 > 𝑟𝑟𝑓𝑓 . The risky asset must offer a higher expected
return 𝑟𝑟 𝑒𝑒 than the certain return 𝑟𝑟𝑓𝑓 on the safe asset to be attractive to savers who dislike
risk, creating a trade-off between risk and return. A higher expected return compensates for
taking greater risk.
What happens to the optimal portfolio if the asset returns were to change? If the risk-free
return 𝑟𝑟𝑓𝑓 rises then the gradient of the constraint declines, all else being equal. If the price
of the risky asset rises, which would lower the returns 1 + 𝑟𝑟1 , 1 + 𝑟𝑟2 , and 1 + 𝑟𝑟 𝑒𝑒
proportionately, then the gradient of the constraint also declines. A lower gradient of the
252
EC2065 Macroeconomics | Chapter 7: Banking and finance
constraint in these two cases gives rise to a substitution effect would unambiguously raise
the safe-asset portfolio share 𝑥𝑥. However, the income effect on 𝑥𝑥 is ambiguous. We usually
assume preferences where the substitution effect dominates, in which case portfolio shares
adjust in the same direction as assets’ expected returns, all else equal. An example where
the expected return on the risky asset declines is illustrated in Figure 7.14.
Figure 7.14: Lower expected return on risky asset
253
EC2065 Macroeconomics | Chapter 7: Banking and finance
Observe that there is an important difference in the risk characteristics of the bond returns
over the same period. The nominal return on the one-period bond is known with certainty
when purchased at yield 𝑖𝑖, while the nominal return on the two-period bond (if held for
only one period) is uncertain because 𝑖𝑖′ is not known in advance. This makes the long-term
bond riskier than the short-term bond.
If investors were risk neutral and cared only about expected returns, the average returns
must be same on both bonds. Using the formulas for the returns above, the average return
on the short-term bond is 𝚤𝚤̅ and the average return on the long-term bond is approximately
2𝐼𝐼 ̅ − 𝚤𝚤̅ since �
𝚤𝚤′ = 𝚤𝚤̅. Risk neutrality implies 2𝐼𝐼 ̅ − 𝚤𝚤̅ ≈ 𝚤𝚤̅, and hence, 𝚤𝚤̅ ≈ 𝐼𝐼,̅ which means the
yield curve would be flat on average, contrary to what is observed.
Now suppose investors are risk averse regarding the return in the first period. Since the
long-term bond has a risky return if it is sold before maturity, the portfolio choice model of
Section 7.8 explains why the long-term bond needs to offer a higher average return. This
risk premium for the long-term bond requires 2𝐼𝐼 ̅ − 𝚤𝚤̅ > 𝚤𝚤̅, hence, implying 𝐼𝐼 ̅ > 𝚤𝚤̅. Investors’
dislike of risk can thus explain why the yield curve is upward sloping on average.
One important aspect of this intermediation is that individuals lack knowledge about
creditworthiness of borrowers and would face large costs of monitoring them closely. Banks
as intermediaries between many individual savers and borrowers with whom they have an
established relationship and thus more knowledge about are able to reduce costs arising
from asymmetries of information. A further advantage of banks coming from their scale is
that they can diversify lending much more easily than lending done by individuals, which is
more concentrated when an individual has limited funds to deploy.
Another important aspect of the intermediation done by banks is known as maturity
transformation. This is where a bank takes deposits from savers, offering to pay interest and
give access to funds on demand but uses depositors’ funds to make loans to firms for
longer-term investment projects. In other words, the bank borrows short from depositors
and lends long. This is a valuable service because borrowers typically want long maturity
loans, while savers want quick access to their funds on demand and for those funds to be
safe.
In what follows, we will focus on the role of banks in maturity transformation. We will see
how bank deposits are a more liquid asset for savers to hold than direct equity investment
in firms, so the maturity transformation done by banks has a social value. However, this
maturity transformation also exposes banks to the risk of bank runs.
254
EC2065 Macroeconomics | Chapter 7: Banking and finance
We turn now to what can be done with the initial real wealth individuals have in period 0 to
allow them to consume in the future time periods 1 or 2. We assume wealth can be stored
at no cost between periods 0 and 1, and between periods 1 and 2 but no return is earned
from this storage.
Wealth can be put into a long-term investment starting in period 0. One unit invested in
period 0 yields real wealth of 1 + 𝑅𝑅 in period 2 when the investment reaches maturity,
where 𝑅𝑅 denotes the total return between periods 0 and 2. However, if the investment is
abandoned before maturity in period 1 then no return is earned, although the initial
amount invested can be recovered. This means the long-term investment is fundamentally
illiquid – only long-term investors who can tie up funds between periods 0 and 2 can earn a
return on the investment.
255
EC2065 Macroeconomics | Chapter 7: Banking and finance
Banks will engage in maturity transformation, issuing short-term liabilities (deposits) to fund
long-term assets (lending). Banks help by pooling the risk of individuals needing early access
to funds because they are able to take deposits from a large number of savers.
256
EC2065 Macroeconomics | Chapter 7: Banking and finance
Implicitly, 𝑑𝑑1 and 𝑑𝑑2 define the interest rates 𝑟𝑟 and 𝑟𝑟′ paid on deposits between periods 0
and 1 (with 𝑑𝑑1 = 1 + 𝑟𝑟), and periods 1 and 2 (with 𝑑𝑑2 ⁄𝑑𝑑1 = 1 + 𝑟𝑟′).
An important point is that the bank does not know either ex ante or ex post who is an early
or a late type – think of the true need for funds as being private information. Everyone who
deposits one unit of funds in period 0 has the right to withdraw an amount 𝑑𝑑1 in period 1.
Suppose a large number of individuals 𝑁𝑁 each deposits one unit of funds. Deposit taking is
the only source of funds here for banks – we ignore bank capital because there is no risk
when investments are held to maturity (𝑅𝑅 is known with certainty). A bank chooses the
fraction 𝑥𝑥 of funds that are stored (a liquid asset) and the fraction 1 − 𝑥𝑥 put into the long-
term investment (an illiquid asset). The bank needs to have enough liquid assets to be able
to meet the demand for withdrawals in period 1.
Suppose for now that only early types withdraw 𝑑𝑑1 in period 1, which requires among other
things that 𝑑𝑑2 ≥ 𝑑𝑑1 (i.e. the bank never offers a negative interest rate 𝑟𝑟 ′ < 0), otherwise
everyone wants to withdraw in period 1. As each of the large number 𝑁𝑁 of depositors has
an independent chance of being an early type with probability 𝑡𝑡, the law of large numbers
implies there will be 𝑡𝑡𝑡𝑡 early types in total. Given 𝑥𝑥, the value of liquid assets in period 1 is
𝑥𝑥𝑥𝑥, so 𝑑𝑑1 and 𝑥𝑥 must satisfy:
𝑡𝑡𝑡𝑡𝑑𝑑1 ≤ 𝑥𝑥𝑥𝑥
By period 2, the (1 − 𝑥𝑥 )𝑁𝑁 funds put into long-term investments have earned a return and
now have value (1 + 𝑅𝑅)(1 − 𝑥𝑥 )𝑁𝑁. All early types have withdrawn already and only the
deposits of (1 − 𝑡𝑡)𝑁𝑁 late types remain. The amount 𝑑𝑑2 promised to those withdrawing in
period 2 must satisfy:
(1 − 𝑡𝑡)𝑁𝑁𝑑𝑑2 ≤ (1 + 𝑅𝑅)(1 − 𝑥𝑥 )𝑁𝑁
257
EC2065 Macroeconomics | Chapter 7: Banking and finance
which is true if 𝑑𝑑2 ≥ 𝑑𝑑1 , assuming other late types wait as well. We therefore ignore the
points below the 45∘ line in the figure.
What interest rates 𝑟𝑟 and 𝑟𝑟′ can banks offer? By definition, 𝑑𝑑1 = 1 + 𝑟𝑟 and 𝑑𝑑2 =
(1 + 𝑟𝑟 ′ )𝑑𝑑1 . One possibility is 𝑟𝑟 = 0 and 𝑟𝑟′ = 𝑅𝑅, which is equivalent to 𝑑𝑑1 = 1 and 𝑑𝑑2 = 1 +
𝑅𝑅. However, this deposit contract is not particularly interesting because it simply replicates
the outcome if there were no banks in the economy. This corresponds to the point labelled
N in Figure 7.15. Another possibility is 𝑟𝑟 = (1 − 𝑡𝑡)𝑅𝑅/(1 + 𝑡𝑡𝑡𝑡) and 𝑟𝑟′ = 0, which is
equivalent to 𝑑𝑑1 = 𝑑𝑑2 = (1 + 𝑅𝑅)/(1 + 𝑡𝑡𝑡𝑡). This deposit contract is special because it
eliminates all liquidity risk for depositors – they receive the same amount whether they
need to withdraw in period 1 or period 2. Every depositor earns interest 𝑟𝑟 < 𝑅𝑅 between
periods 0 and 1. It is the point labelled L on the 45∘ line in the figure.
Figure 7.15: Feasible deposit contracts
258
EC2065 Macroeconomics | Chapter 7: Banking and finance
If individuals are sufficiently risk-averse, meaning that their indifference curves have
enough curvature, then banks offer a deposit contract with 0 < 𝑟𝑟 < 𝑅𝑅 and 0 < 𝑟𝑟′ < 𝑅𝑅,
which corresponds to a point in the diagram with 𝑑𝑑1 > 1 and 𝑑𝑑2 < 1 + 𝑅𝑅, that is, one to the
right of N but to the left of L. If depositors are not very risk averse then banks might offer a
point at N or even to the left, while only if depositors have the maximum risk aversion (‘L’-
shaped indifference curves) will banks offer L.
With the equilibrium deposit contract for sufficiently risk-averse depositors, the withdrawals
and consumption levels of the early and late types are 𝑐𝑐1 = 𝑑𝑑1 > 1 and 𝑐𝑐2 = 𝑑𝑑2 < 1 + 𝑅𝑅.
Individuals are able to get to a higher indifference curve than without banks, which
corresponds to the point N where 𝑐𝑐1 = 1 and 𝑐𝑐2 = 1 + 𝑅𝑅.
The optimal deposit contract offered by banks helps with risk sharing. It provides insurance
to those who need access to their funds early by paying some portion of the illiquid
investment return, even though investments have not yet yielded any return. The trade-off
is that late types do not get the full investment return but risk-averse individuals value this
insurance ex ante.
The reason why financial intermediation is valued here is similar to why risk-averse people
value insurance. For example, home insurance and car insurance cover the risk of
unexpectedly needing to find funds to make repairs. Banks provides liquidity insurance –
access on demand to funds without sacrificing all investment returns. A crucial difference is
that insurance policies pay out when an objectively verifiable event occurs, for example, a
fire. However, problems of asymmetric information and moral hazard mean that some risks
are not directly insurable, for example, the risk of unemployment, which is what creates a
need for liquidity insurance instead. While banks help to provide this, we will see that the
unverifiable nature of being an ‘early type’ creates a danger of bank runs.
259
EC2065 Macroeconomics | Chapter 7: Banking and finance
260
EC2065 Macroeconomics | Chapter 7: Banking and finance
7.11.4 The ‘shadow’ banking system and the 2008 financial crisis
The Diamond-Dybvig model focuses on runs on the retail banking system but financial
markets can sometimes face situations similar to bank runs. A version of this was seen in the
so-called ‘shadow’ banking system during the 2008 financial crisis.
Prior to the financial crisis, there was a large expansion in ‘securitisation’, which is where
mortgages and other long-term loans were originated and then sold in packages to other
financial institutions such as investment banks. This ‘shadow’ banking system financed
purchases of securitised assets through short-term borrowing in the money markets, which
required frequent rollovers. In 2008, investment banks such as Bear Stearns and Lehman
Brothers struggled to roll over short-term financing. The belief that these banks and other
institutions could not repay short-term debts through the sale of illiquid assets caused new
financing to dry up, creating a self-fulfilling freeze in the money markets.
261
EC2065 Macroeconomics | Chapter 7: Banking and finance
262
EC2065 Macroeconomics | Chapter 7: Banking and finance
bank equity to total bank assets (sometimes weighted by assets’ riskiness). Funds provided
by a bank’s shareholders and retained profits not paid out to shareholders can absorb some
losses on a bank’s investments without jeopardising depositors’ funds.
263
EC2065 Macroeconomics | Chapter 7: Banking and finance
We know from Section 7.10 that banks have an incentive to offer 𝑑𝑑1 > 1. If 𝑞𝑞 is low, so 1/𝑞𝑞
is high, the reserve requirement is not binding. Reserve requirements ultimately become
binding for higher values of 𝑞𝑞 closer to 1. Once this point is reached, the deposit contract is
limited to 𝑑𝑑1 = 1⁄𝑞𝑞, which is less than 𝑑𝑑1 would otherwise be.
Reserve requirements with 𝑞𝑞 < 1 still result in 𝑑𝑑1 > 1. This means a large enough bank run
in period 1 still causes a bank to fail because 𝑑𝑑1 exceeds the liquidation value of all assets,
although a greater number of requests for withdrawals could be satisfied during a run.
Therefore, less than 100 per cent reserve requirements do not rule out self-fulfilling bank
runs in the Diamond-Dybvig model.
The 100 per cent reserve requirements (𝑞𝑞 = 1) imply that banks must offer 𝑑𝑑1 = 1. In this
case, they can always meet all requests for withdrawals, so self-fulfilling runs cannot occur,
eliminating the bad outcome of the Diamond-Dybvig model. But as shown in Figure 7.17,
the 100% reserve requirements prevent maturity transformation, so the good outcome is
worse.
Figure 7.17: The 100 per cent reserve requirements in the Diamond-Dybvig model
More generally, beyond the Diamond-Dybvig model, reserve requirements reduce the
supply of bank deposits when interest is not paid on required reserves, as explained in
Section 7.4. This reduction in 𝑀𝑀 𝑠𝑠 raises the gap between 𝑖𝑖 and 𝑖𝑖𝑚𝑚 . Having 100 per cent
reserve requirements mean that banks supply deposits up to the point where 𝑖𝑖𝑚𝑚 = 𝑖𝑖𝑟𝑟𝑟𝑟 ,
where 𝑖𝑖𝑟𝑟𝑟𝑟 is interest paid on required reserves. To avoid a reduction in the supply of
deposits 𝑀𝑀 𝑠𝑠 , the central bank must pay interest on required reserves 𝑖𝑖𝑟𝑟𝑟𝑟 and increase the
supply of reserves to the banking system. Maintaining an efficient supply of money to the
economy after imposing 100 per cent reserve requirements thus entails a fiscal cost.
264
EC2065 Macroeconomics | Chapter 7: Banking and finance
Unlike deposits at a commercial bank, there cannot be a ‘run’ on central banks offering
CBDC accounts. This is because fiat money not redeemable for anything but itself. Thus,
CBDC would offer the convenience of electronic payments without the risks associated with
commercial-bank deposits. Moreover, it would offer greater security than the physical cash
that is currently the only way non-banks can hold fiat money, removing the anonymity that
is a feature of cash.
Central banks can pay interest on CBDCs in the same way they can pay interest on reserves.
If they were to do so at a sufficiently high rate, this might lead people to switch from
commercial-bank deposits to CBDCs. That would present a similar challenge to commercial
banks – how to fund bank lending – as is present with 100 per cent reserve requirements.
In some ways, a CBDC has similarities to the proposal for 100 per cent reserve
requirements, which is also designed to make the monetary system free of ‘runs’. The
difference is that with CBDC, there would be a choice of whether to hold the CBDC or a
commercial-bank deposit backed by less than 100 per cent reserves, rather than forcing
everyone to hold fully reserve-backed deposits or cash.
CBDC may ultimately be intended as a replacement for physical cash. As discussed in
Section 6.14, this would open the door to significantly negative nominal interest rates.
265
EC2065 Macroeconomics | Chapter 8: Business cycles
Essential reading
• Williamson, Chapters 13 and 14.
266
EC2065 Macroeconomics | Chapter 8: Business cycles
Leaving a price unchanged saves costs because no new price needs to be determined. Firms’
relationships with their customers might also create barriers to price adjustment. For
example, the coordination problem of no firm wanting to go first with a price increase and
antagonise its customers.
Data on observations of individual prices also provides some support for the relevance of
nominal rigidity. The price data shown in Figure 8.1 suggests individual goods prices quoted
in units of money change infrequently, even during times when the economy experiences
shocks.
Figure 8.1: Evidence on sticky prices
Models of the economy where there is a failure of market clearing because of nominal
rigidity are often labelled ‘Keynesian’ models. We will study extensively what is called a
‘new Keynesian’ model. This model is ‘new’ in the sense that it has many of the features of
modern macroeconomic models seen in earlier chapters but in combination with older
Keynesian ideas about the failure of markets to clear.
The only nominal rigidity in the new Keynesian model is stickiness of goods prices. Having
price stickiness in the model implicitly assumes prices are set by firms, not markets. For this
reason, the model also features an imperfectly competitive goods market.
It is important to note that the reasons for nominal rigidity are unlikely to prevent eventual
price adjustment in the long run. Prices will ultimately adjust to shocks, so markets still clear
in the long run. Consequently, the model will make a distinction between the short run
where prices are sticky and the long run where prices are flexible. The transition from the
short run to the long run is studied in Chapter 9.
There is also an important difference compared to our earlier analysis of the failure of
labour-market clearing owing to efficiency wages in Section 5.2. There, firms have incentives
to pay efficiency wages at all times, so the rigidity of wages is not temporary and does not
disappear in the long run. Moreover, efficiency wages impart rigidity to real wages, which is
conceptually distinct from nominal rigidity.
267
EC2065 Macroeconomics | Chapter 8: Business cycles
With nominal rigidity, if an imperfectly competitive firm does not change the fixed price 𝑃𝑃� of
the good it is selling then it cannot choose how much it sells. If demand falls, shifting the
demand curve for the firm’s product leftwards, it cannot sell as much as before at price 𝑃𝑃�,
and as a result hires less labour. Even if the marginal product of labour exceeds the real
wage, the firm does not hire more labour because it cannot sell the extra output that could
be produced with additional employment. If demand rises, a firm can now sell more at the
same price 𝑃𝑃�, so it hires more labour to meet the additional demand. Strictly speaking, this
is true only as long as 𝑤𝑤 ≤ 𝑀𝑀𝑃𝑃𝑁𝑁 , otherwise the firm would not want to sell more, although
as we will see, cases where this condition fails to hold are not likely to be relevant in
practice.
This logic tells us that the labour demand curve is no longer given by the marginal product of
labour but is instead perfectly wage inelastic as depicted in the right panel of Figure 8.2. The
vertical labour demand curve shifts with the aggregate demand for goods, which we
suppose affects the demand for each individual good. We truncate the vertical labour
demand curve where it goes above 𝑀𝑀𝑃𝑃𝑁𝑁 .
268
EC2065 Macroeconomics | Chapter 8: Business cycles
The wage-inelastic labour demand curve 𝑁𝑁 𝑑𝑑 (𝑌𝑌) is determined using the production function
𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) to find what level of employment 𝑁𝑁 is needed to produce output 𝑌𝑌 sufficient
to meet demand, taking as given the stock of capital 𝐾𝐾 and TFP 𝑧𝑧. This is shown in the left
panel of Figure 8.2. An increase in 𝑌𝑌 shifts labour demand to the right.
How do we analyse outcomes in the labour market when goods prices are sticky? Since
labour demand is wage inelastic, the outcome for employment 𝑁𝑁 is directly determined by
the position of 𝑁𝑁 𝑑𝑑 (𝑌𝑌). For wages 𝑤𝑤, we need to be more specific about how the supply side
of the labour market works.
Let us first suppose that wages are fully flexible. In that case, wages are determined by a
standard upward-sloping labour supply curve 𝑁𝑁 𝑠𝑠 (𝑟𝑟) derived from households’ optimality
conditions 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 = 𝑤𝑤 and 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝑙𝑙′ = (1 + 𝑟𝑟)𝑤𝑤/𝑤𝑤′ as explained in Chapters 1 and 3. With
a representative household, Chapter 3 explains why 𝑁𝑁 𝑠𝑠 (𝑟𝑟) is unambiguously upward sloping
owing to the substitution effect of wages and shifts to the right when the real interest rate 𝑟𝑟
increases (intertemporal substitution). Although we consider a monetary economy, we
ignore here the effect of money’s medium of exchange function on 𝑁𝑁 𝑠𝑠 , that is to say, we
neglect any effects coming from money being less good as a store of value and acting as a
tax on economic activity (see Section 6.5).
With flexible wages, the real wage 𝑤𝑤 adjusts so that the labour market clears with 𝑁𝑁 𝑑𝑑 (𝑌𝑌) =
𝑁𝑁 𝑠𝑠 (𝑟𝑟) as shown in the left panel of Figure 8.3. We will see that employment fluctuates with
aggregate demand, although if there are no impediments to wage adjustment then there is
no unemployment or fluctuations in unemployment. If we want to study unemployment
over the business cycle, we can combine our earlier analysis of efficiency wages from
Section 5.2 with the new Keynesian model. In that case, the real wage is determined by
firms’ desire to pay an efficiency wage and remains constant. Employment is found on the
inelastic labour demand curve but desired labour supply can be higher, so unemployment
exists. Given desired labour supply, unemployment changes in the opposite direction to
changes in employment. This case is depicted in the right panel of Figure 8.3.
269
EC2065 Macroeconomics | Chapter 8: Business cycles
In an economy with flexible prices, the output supply curve in the goods market represent
the production of goods and services by firms given employment at the labour-market
equilibrium. However, with sticky prices, the demand for labour depends on the aggregate
demand for goods. This means there is no independent decision made by firms about how
much to sell. The supply of goods passively accommodates changes in demand and thus
there is no output supply curve relevant for determining outcomes in the goods market as
long as goods prices remain sticky. One caveat to this logic is that demand must not be so
large that firms do not want to meet it because wages are too high. We require that 𝑤𝑤 ≤
𝑀𝑀𝑃𝑃𝑁𝑁 , which in the goods market diagram is equivalent to remaining on the left of the
hypothetical output supply curve with perfect competition and flexible prices (𝑌𝑌� 𝑠𝑠 ), where
the real wage would be exactly equal to the marginal product of labour. The hypothetical
supply curve is depicted as a dashed line in Figure 8.4. This condition is not likely to be of
concern in most applications of the model.
Figure 8.4: The goods market with sticky prices
The demand curve in the goods market simply represents the same aggregate demand for
goods and services that was found in the earlier dynamic macroeconomic model from
Section 3.12. The 𝑌𝑌 𝑑𝑑 curve represents the equation 𝑌𝑌 𝑑𝑑 = 𝐶𝐶 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺 as before and is
270
EC2065 Macroeconomics | Chapter 8: Business cycles
shown in Figure 8.4. But since prices are sticky, the economy does not have to be at the
intersection of the 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 curves. Many points on the 𝑌𝑌 𝑑𝑑 curve can be a goods-market
equilibrium in the short run. Note that in Keynesian models, the 𝑌𝑌 𝑑𝑑 curve was traditionally
known as the 𝐼𝐼𝐼𝐼 (investment = saving) curve. The equation for investment 𝐼𝐼 being equal to
national saving 𝑌𝑌 − 𝐶𝐶 − 𝐺𝐺 is equivalent to 𝑌𝑌 = 𝐶𝐶 + 𝐼𝐼 + 𝐺𝐺.
The point on the output demand curve the economy reaches and the outcome for real GDP
𝑌𝑌 are determined by the level of interest rates. With complete stickiness of prices there is
zero inflation (𝜋𝜋 = 0) and, hence, the Fisher equation 𝑖𝑖 = 𝑟𝑟 + 𝜋𝜋 𝑒𝑒 implies that 𝑟𝑟 = 𝑖𝑖, so the
real interest rate 𝑟𝑟 is the same as the nominal interest rate 𝑖𝑖. As discussed in Chapters 6 and
7, there are various instruments of monetary policy that can control the nominal interest
rate 𝑖𝑖. With sticky prices, monetary policy also effectively sets the real interest rate 𝑟𝑟, which
selects a point on the 𝑌𝑌 𝑑𝑑 curve.
In the goods market diagram, we represent the central bank’s choice of nominal interest
rate 𝑖𝑖 (and, hence, 𝑟𝑟) with a line that we will label 𝑀𝑀𝑀𝑀 (money and monetary policy). The
𝑀𝑀𝑀𝑀 line is usually assumed to be flat or upward-sloping, which is to say that the central
bank either sets some particular interest rate, in which case 𝑀𝑀𝑀𝑀 is horizontal, or we think of
the central bank as systematically adjusting interest rates up and down with GDP. The
intersection between the 𝑌𝑌 𝑑𝑑 (𝐼𝐼𝐼𝐼) curve and the 𝑀𝑀𝑀𝑀 line determines the real interest rate 𝑟𝑟
and output 𝑌𝑌 as seen in Figure 8.4.
Suppose the central bank cuts the nominal interest rate 𝑖𝑖. If prices are completely sticky,
inflation remains zero and, consequently, the real interest rate falls. Supposing the stance of
monetary policy is represented by a horizontal 𝑀𝑀𝑀𝑀 line, the interest rate cut shifts the 𝑀𝑀𝑀𝑀
line downwards. The economy moves along the output demand curve as shown in Figure
8.5, with the lower real interest rate 𝑟𝑟 stimulating consumption and investment demand.
This works through a lower cost of borrowing for firms and a substitution effect on
households’ consumption expenditure plans. GDP 𝑌𝑌 rises with the resulting increase in
aggregate demand.
271
EC2065 Macroeconomics | Chapter 8: Business cycles
In the labour market, the increase in aggregate demand for goods shifts the labour demand
𝑁𝑁 𝑑𝑑 (𝑌𝑌) curve to the right. This is because firms selling goods at a fixed price want to hire
more workers when they are able sell more output. Employment rises as a result.
When wages are flexible, the real wage 𝑤𝑤 increases because of a movement along the
upward-sloping labour supply curve 𝑁𝑁 𝑠𝑠 (𝑟𝑟), which also shifts to the left when 𝑟𝑟 declines. If
real wages are rigid owing to efficiency wages then higher employment and lower desired
labour supply imply a decline in unemployment. These two cases are depicted in Figure 8.6.
272
EC2065 Macroeconomics | Chapter 8: Business cycles
As seen in Figure 8.7, the shift in the Federal Reserve’s monetary policy stance
brought about by Volcker saw interest rates rise from below 10 per cent to peak at
15 per cent in 1981. Inflation started to fall and was below 5 per cent by 1983.
While we cannot yet analyse inflation using our basic new Keynesian model with
completely sticky prices, we will study inflation with partial price adjustment in
Chapter 9. For now, we focus on the effects of real interest rates on aggregate
demand and GDP. Volcker’s tightening of monetary policy led the real interest to
rise from negative levels to more than 5 per cent by 1981. This was followed by a
sharp fall in real GDP during the 1981–82 recession.
Figure 8.7: The Volcker disinflation
273
EC2065 Macroeconomics | Chapter 8: Business cycles
Consider a decline in the expected value of future TFP 𝑧𝑧 ′ as a specific example of a demand
shock. This leads to lower 𝐶𝐶 𝑑𝑑 and 𝐼𝐼 𝑑𝑑 and shifts the 𝑌𝑌 𝑑𝑑 curve to the left. Assuming no
automatic or discretionary monetary policy response, that is to say, a horizontal 𝑀𝑀𝑀𝑀 line in
same position, the real interest rate 𝑟𝑟 remains the same and GDP 𝑌𝑌 declines as shown in
Figure 8.8.
Figure 8.8: Effects of a negative demand shock
In the labour market, lower aggregate demand 𝑌𝑌 reduces labour demand, with 𝑁𝑁 𝑑𝑑 (𝑌𝑌)
shifting to the left and resulting in lower employment 𝑁𝑁. Owing to the negative wealth
effect of lower 𝑧𝑧′, the labour supply curve 𝑁𝑁 𝑠𝑠 (𝑟𝑟) shifts rightwards. With flexible wages, the
real wage 𝑤𝑤 would decline, while with a rigid efficiency wage, higher unemployment would
result. It can also be seen from the production function that the decline in output and
employment would raise average labour productivity 𝑌𝑌/𝑁𝑁, which is the gradient of the ray
from the origin to the production function.
Box 8.2: Can the new Keynesian model match the business-cycle
stylised facts?
It is desirable that any theory of the business cycle is consistent with empirical
evidence on the behaviour of fluctuations of various macroeconomic variables. We
can document a set of business-cycle ‘stylised facts’ using methods described in
Section 3.1. Variables are detrended and their percentage deviations from trend
can be compared to those of real GDP. We have already looked at the behaviour of
fluctuations in consumption and investment in Chapter 3 and unemployment in
Chapter 5. Both consumption and investment are procyclical, meaning that their
deviations from trend are positively correlated with deviations of real GDP from its
trend, while unemployment is countercyclical, i.e. negatively correlated with real
GDP. Consumption is less volatile than GDP – its percentage fluctuations are smaller
than those of real GDP – while investment is more volatile than GDP.
We can also look at a broader range of macroeconomic variables. Figure 8.9 shows
detrended employment, which is procyclical, generally less volatile than
274
EC2065 Macroeconomics | Chapter 8: Business cycles
We can also look at a broader range of macroeconomic variables. Figure 8.9 shows
detrended employment, which is procyclical, generally less volatile than GDP and
slightly lagging. Figure 8.10 shows real wages, for which it is harder to discern a
clear pattern but which is overall weakly procyclical and less volatile than GDP.
Figure 8.11 displays the data for average labour productivity, which is procyclical
and less volatile than GDP. The procyclicality of average labour productivity reflects
the fact that employment typically moves by less in percentage terms than GDP.
Figure 8.9: Fluctuations in employment over the business cycle (USA)
Figure 8.10: Fluctuations in real wages over the business cycle (USA)
Fluctuations of the real interest rate are shown alongside fluctuations of GDP in
Figure 8.12. The cyclicality of the real interest rate appears to change over time,
generally being countercyclical prior to the 1990s and procyclical afterwards. Taking
an overview of the whole period covered by the data, the real interest rate is
weakly countercyclical. Finally, fluctuations of inflation are shown in Figure 8.13.
Here again the relationship with GDP fluctuations appears to have changed over
time. There is a strong countercyclical relationship in the 1970s but at other times
inflation appears procyclical. Overall, we conclude that inflation is weakly
procyclical.
275
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.11: Fluctuations of average labour productivity over the business cycle (USA)
Figure 8.12: Fluctuations of real interest rates over the business cycle (USA)
How do the predictions of the new Keynesian model compare to this evidence on
business-cycle fluctuations? We will consider two different types of shock studied
earlier. First, we will consider a demand shock coming from news that changes
confidence about the future, or a change in the extent of credit-market imperfections.
Second, we will look at a shift in the stance of monetary policy.
Figure 8.13: Fluctuations of inflation over the business cycle (USA)
276
EC2065 Macroeconomics | Chapter 8: Business cycles
Consider the negative demand shock considered earlier that was caused by lower
confidence about the future. A lower expected value of 𝑧𝑧′ shifts the 𝑌𝑌 𝑑𝑑 curve to the
left due to declines in 𝐶𝐶 𝑑𝑑 and 𝐼𝐼 𝑑𝑑 . We assume a horizontal 𝑀𝑀𝑀𝑀 line in an unchanged
position, indicating a passive stance of monetary policy throughout.
Referring to Figure 8.8, the model predicts that GDP 𝑌𝑌 falls and consumption 𝐶𝐶 and
investment 𝐼𝐼 are lower. Prices and inflation do not change because all prices fixed. The
real and nominal interest rates 𝑟𝑟 and 𝑖𝑖 are unchanged because of the passive
monetary policy and the absence of any change in inflation. The production function
implies employment 𝑁𝑁 declines and the leftward shift of 𝑁𝑁 𝑑𝑑 and rightward shift of 𝑁𝑁 𝑠𝑠
in the labour market result in a lower real wage 𝑤𝑤. If we assumed firms are paying
efficiency wages instead then the real wage would remain constant and
unemployment would rise owing to the direction of the shifts of 𝑁𝑁 𝑑𝑑 and 𝑁𝑁 𝑠𝑠 .
Using the production function diagram, we can also see the model’s prediction for the
response of average labour productivity 𝐴𝐴𝐴𝐴𝐴𝐴 = 𝑌𝑌/𝑁𝑁. Average labour productivity is
given the gradient of the ray from the origin to the relevant point on the production
function. Figure 8.14 shows that since the production function has a concave shape
and does not shift with a demand shock, a decline in employment and output raises
the gradient of this ray. Intuitively, the shape of the production function comes from
diminishing returns to labour, so a decline in employment raises labour productivity.
In summary, the new Keynesian model with demand shocks predicts that consumption
and investment are procyclical (they both move in the same direction as real GDP),
employment is procyclical and average labour productivity is countercyclical. Apart
from productivity, these predictions are in line with the business-cycle stylised facts.
With a competitive labour market, the model predicts a strongly procyclical real wage
contrary to the empirical evidence but this cyclicality of wages would be weakened by
integrating the model of efficiency wages into the analysis of the labour market.
Adding efficiency wages also allows the new Keynesian model to match the
countercyclicality of unemployment. These predictions of the model and the
corresponding stylised facts from the data are summarised in Table 8.1. Note that the
model makes similar predictions for other forms of demand shock such as a worsening
of credit-market imperfections that raises interest-rate spreads.
Figure 8.14: Prediction of countercyclical average labour productivity
277
EC2065 Macroeconomics | Chapter 8: Business cycles
As the stance of monetary policy remains completely passive following the shock by
assumption, the model predicts an acyclical real interest rate. Having an upward-
sloping 𝑀𝑀𝑀𝑀 line, as discussed in Box 8.4, would result in the model predicting a
procyclical real interest rate. While the overall pattern in the data is weak
countercyclicality, there are periods where real interest rates appear procyclical, so
the model’s predictions are not too far from the empirical evidence. With completely
sticky prices, the model predicts that inflation is acyclical. We will see in Chapter 9 that
adding partial price adjustment means that inflation would be procyclical in an
economy with demand shocks, which helps to match the data.
Table 8.1: Predictions of new Keynesian model with demand shock to confidence about
the future
Procyclical
Real wage Procyclical (weakly)
(Acyclical with efficiency wage)
Acyclical
Unemployment (Countercyclical with efficiency Countercyclical
wage)
Average labour
Countercyclical Procyclical
productivity
It is also possible to consider business cycles triggered by shifts in monetary policy, for
example, the Volcker disinflation discussed in Box 8.1. Suppose the central bank shifts
the stance of monetary policy towards higher interest rates, shifting the 𝑀𝑀𝑀𝑀 line
upwards. This is the opposite of the case depicted in Figures 8.5 and 8.6 from Section
8.3.
The model predicts that consumption 𝐶𝐶 and investment 𝐼𝐼 fall, moving up the 𝑌𝑌 𝑑𝑑 curve
in the goods-market diagram. Real GDP 𝑌𝑌 falls and the real interest rate 𝑟𝑟 rises. It can
be seen from the production function and labour-market diagrams that employment
𝑁𝑁 and real wages 𝑤𝑤 are lower, with 𝑁𝑁 𝑑𝑑 shifting to the left and 𝑁𝑁 𝑠𝑠 shifting to the right.
With efficiency wages, unemployment 𝑈𝑈 rises and the real wage 𝑤𝑤 is constant. Prices
and inflation do not change because all prices are fixed.
278
EC2065 Macroeconomics | Chapter 8: Business cycles
Table 8.2 summarises the predictions of the model and compares them to the
business-cycle stylised facts. Consumption and investment are procyclical, matching
the data. The real interest rate is countercyclical, which fits the overall pattern weakly
present in the data. Employment is procyclical, matching the data. Real wages are
strongly procyclical but that prediction can be tempered by efficiency wages, which
also allows the model to generate the countercyclical unemployment seen in the data.
Inflation is acyclical but partial price adjustment would change that prediction to
procyclicality. As with demand shocks, the model predicts countercyclical average
labour productivity, contrary to the empirical pattern.
In summary, the new Keynesian model is broadly consistent with most of the business-
cycle stylised facts when the business cycle is caused by demand shocks, including
shifts in monetary policy. The only major failing is in accounting for the procyclicality
of productivity. One potential reconciliation with the productivity data is discussed in
Box 8.3.
Table 8.2: Predictions of the new Keynesian model with a monetary policy shock
Real interest
Countercyclical Countercyclical (weakly)
rate
Procyclical
Real wage Procyclical (weakly)
(Acyclical with efficiency wage)
Acyclical
Unemployment (Countercyclical with efficiency Countercyclical
wage)
Average labour
Countercyclical Procyclical
productivity
279
EC2065 Macroeconomics | Chapter 8: Business cycles
280
EC2065 Macroeconomics | Chapter 8: Business cycles
An issue similar to labour hoarding arises when measuring total factor productivity
(TFP). This is done by calculating a ‘Solow residual’, that is, the change in GDP not
explained by changes in inputs of labour and capital. However, estimate of the
capital stock do not fully capture changes in the usage of capital by firms because
utilisation of capital might vary over time. This is analogous to the change in labour
utilisation by firms when there is labour hoarding and suggests there is bias
towards detecting procyclicality in TFP.
281
EC2065 Macroeconomics | Chapter 8: Business cycles
Given a production function 𝑌𝑌 = 𝑧𝑧𝑧𝑧(𝐾𝐾, 𝑁𝑁) with a particular capital stock 𝐾𝐾 and level of TFP
𝑧𝑧, the decision to supply goods 𝑌𝑌 is equivalent to a decision to hire labour 𝑁𝑁. For an
imperfectly competitive firm, the effect on real revenue of hiring an extra worker is less
than the physical marginal product of labour because the price of its product relative to
other goods needs to be lowered to sell the extra output. The marginal gain in real revenue
from hiring an extra unit of labour is measured by the marginal revenue product of labour
𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 , which is below the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁 . The relationship between the
two is 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = (1 − 𝜖𝜖 −1 )𝑀𝑀𝑃𝑃𝑁𝑁 , where 𝜖𝜖 is the price elasticity of the demand curve for a
firm’s product.
In choosing how much labour to hire and how much output to produce, each firm compares
the real cost of hiring a worker, the real wage 𝑤𝑤, to the marginal benefit 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 . Hence,
firms’ demand for labour is given by 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑤𝑤 instead of 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑤𝑤 with perfect
competition. Workers being paid their marginal revenue product is equivalent to firms
pricing (𝜖𝜖 − 1)−1 per cent above their marginal cost 𝑤𝑤/𝑀𝑀𝑃𝑃𝑁𝑁 of producing a unit of output.
Assuming each individual firm faces a demand curve for its product with a constant price
elasticity 𝜖𝜖 > 1, the marginal revenue product curve 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 is simply a scaling down of the
marginal product curve 𝑀𝑀𝑃𝑃𝑁𝑁 as shown in Figure 8.16. Labour demand thus behaves in the
same way as with perfect competition, just at a lower level, all else being equal. An output
supply curve 𝑌𝑌 𝑠𝑠 can then be derived exactly as earlier in the dynamic macroeconomic model
from Section 3.12. The only difference is that 𝑌𝑌 𝑠𝑠 is lower for any given 𝑟𝑟 because
imperfectly competitive firms restrict supply to raise profits. For comparison, what the
output supply curve would look like with perfect competition is shown labelled as 𝑌𝑌� 𝑠𝑠 in the
figure.
282
EC2065 Macroeconomics | Chapter 8: Business cycles
283
EC2065 Macroeconomics | Chapter 8: Business cycles
between leisure and consumption 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 . The 𝑀𝑀𝑃𝑃𝑁𝑁 is what can be produced if people were
able to work more and the 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 is what value (in terms of goods) people put on their
time. The economy has inefficiently low employment and production if 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶
because the value of people’s time is less than the value they put on the extra goods that
can be produced and consumed if there were more economic activity.
How efficiently the economy is operating can be judged from the goods market diagram by
comparing the outcome for 𝑌𝑌 and 𝑟𝑟 to the hypothetical perfectly competitive output supply
curve 𝑌𝑌� 𝑠𝑠 , on which 𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 . All points to the left of 𝑌𝑌� 𝑠𝑠 have 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 ,
meaning that output 𝑌𝑌 is too low. Inefficiency is thus measured by how far the economy is
to the left of the 𝑌𝑌� 𝑠𝑠 curve.
Why is the market equilibrium in the new Keynesian model generally not efficient? To
simplify the analysis, we ignore some other reasons for inefficiency we have studied
elsewhere that are not central to the new Keynesian model. First, wages are flexible, so the
labour-market equilibrium is always on the labour supply curve. This ignores the persistent
unemployment that results from firms’ incentives to pay ‘efficiency wages’ as seen in
Section 5.2. Second, we ignore the implications for labour supply of needing to use money
as a medium of exchange that were studied in Section 6.5. This neglects any inefficiencies
resulting from a failure of monetary policy to follow the ‘Friedman rule’ as discussed in
Section 6.10. The consequence of these simplifications is that real wages 𝑤𝑤 are always equal
to the marginal rate of substitution 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 between leisure and consumption.
There are two distinct reasons why output is inefficiently low in the New Keynesian model.
First, the natural level of output 𝑌𝑌 ∗ is already too low because imperfect competition gives
firms an incentive to reduce production. Even without nominal rigidity, imperfect
competition would result in 𝑤𝑤 = 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 < 𝑀𝑀𝑃𝑃𝑁𝑁 at 𝑌𝑌 = 𝑌𝑌 ∗ . Second, when prices are sticky, a
negative demand shock pushes GDP 𝑌𝑌 below 𝑌𝑌 ∗ and, as diminishing returns to labour then
implies 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 and 𝑀𝑀𝑃𝑃𝑁𝑁 rise while 𝑤𝑤 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 falls, we have 𝑤𝑤 < 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 and the economy
is even further away from what is efficient.
How can economic policy achieve a better outcome? Let us consider the example studied in
Section 8.4 of a negative demand shock due to a decline in expected future TFP 𝑧𝑧′. As shown
in Figure 8.17, the 𝑌𝑌 𝑑𝑑 curve shifts to the left and output 𝑌𝑌 falls below 𝑌𝑌 ∗ if the 𝑀𝑀𝑀𝑀 line
remains in its original position. Now, instead of leaving monetary policy unchanged, the
central bank lowers the nominal and real interest rates 𝑖𝑖 and 𝑟𝑟. Reducing 𝑟𝑟 moves the
economy along the 𝑌𝑌 𝑑𝑑 curve, raising GDP 𝑌𝑌. Since 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 , the representative
household gains from this policy intervention. Fiscal policy could also be used to provide a
stimulus to demand. An increase in public expenditure 𝐺𝐺 would raise 𝑌𝑌, although its effects
are not exactly equivalent because the composition of aggregate demand 𝐶𝐶 + 𝐼𝐼 + 𝐺𝐺 would
be different compared to the case where monetary policy is used.
What is the best monetary policy to follow? To close the output gap between 𝑌𝑌 and 𝑌𝑌 ∗
exactly, the central bank should set the nominal interest rate 𝑖𝑖 (and, hence, 𝑟𝑟) equal to the
natural rate of interest 𝑟𝑟 ∗ . This moves the 𝑀𝑀𝑀𝑀 line to where it intersects the 𝑌𝑌 𝑑𝑑 curve in the
same place as the output supply curve 𝑌𝑌 𝑠𝑠 as shown in Figure 8.17. Such a monetary policy
284
EC2065 Macroeconomics | Chapter 8: Business cycles
achieves the same economic outcome as if prices were flexible. The policy intervention thus
neutralises the negative consequences of slow price adjustment following a shock. The
central bank adjusting interest rates compensates for the slow pace of price changes.
Figure 8.17: Optimal stabilization policy
To implement this optimal monetary policy, the central bank needs to know the natural rate
of interest 𝑟𝑟 ∗ . This is a practical problem because 𝑟𝑟 ∗ is not directly observable and needs to
be estimated. A more fundamental challenge is that it must be feasible to reduce the
nominal interest rate 𝑖𝑖 if 𝑟𝑟 ∗ falls. As explained in Section 6.13, nominal interest rates are
subject to a lower bound, so the required interest rate cut might not be possible if the lower
bound on 𝑖𝑖 is reached. Chapter 9 discusses alternative policies that could be used if the
lower bound is binding.
The stabilisation policy described here aims to close the output gap between actual real
GDP 𝑌𝑌 and its natural level 𝑌𝑌 ∗ . But that does not mean the policy should aim for GDP to be
stable if 𝑌𝑌 ∗ itself varies over time. Furthermore, it might be wondered why the policy
intervention should stop when 𝑌𝑌 reaches 𝑌𝑌 ∗ . That addresses only one of the two sources of
inefficiency in the new Keynesian model – remember output at 𝑌𝑌 ∗ is still inefficiently low.
Should the central bank continue to push output above 𝑌𝑌 ∗ to where 𝑌𝑌 𝑑𝑑 intersects 𝑌𝑌� 𝑠𝑠 and
𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 hold? While at first glance this appears desirable, we will argue in Chapter 9
that such a policy would be unsustainable and give rise to negative side effects.
285
EC2065 Macroeconomics | Chapter 8: Business cycles
More generally, we consider the 𝑀𝑀𝑀𝑀 (‘money and monetary policy’) line to be a
representation of how the money market and the monetary policy regime imply a
link between interest rates and GDP. We can draw 𝑀𝑀𝑀𝑀 lines for monetary policies
that target the money supply, or an interest rate feedback rule such as a Taylor
rule.
8.6.1 Money supply targets, the LM curve and the IS-LM model
Suppose the central bank’s monetary policy is a target for the money supply 𝑀𝑀 𝑠𝑠 . Assume
the money-supply target 𝑀𝑀 𝑠𝑠 = 𝑀𝑀∗ is exogenous. For this monetary policy regime, the
interest rate 𝑖𝑖 is endogenous. It is still the case that real and nominal interest rates are same
because prices are sticky, hence, 𝑟𝑟 = 𝑖𝑖. With a target for the money supply, the interest
rates 𝑖𝑖 and 𝑟𝑟 and GDP 𝑌𝑌 are determined jointly in the goods and money markets. The
equivalent of the 𝑀𝑀𝑀𝑀 line in this case represents money-market equilibrium and it is often
labelled as the ‘𝐿𝐿𝐿𝐿’ curve for this particular monetary policy.
The 𝑀𝑀𝑀𝑀 line/𝐿𝐿𝐿𝐿 curve for a money-supply target is upward sloping, as shown in the goods
market diagram in Figure 8.18. The 𝐿𝐿𝐿𝐿 curve is upward sloping because higher output
increases the real demand for money for transactions, as shown in the left panel of the
figure representing the money market. With a fixed nominal money supply 𝑀𝑀∗ and a sticky
price level 𝑃𝑃�, there is an inelastic supply of real money balances 𝑀𝑀∗ /𝑃𝑃� and the nominal
interest rate 𝑖𝑖 rises to restore equilibrium in the money market. Money-market equilibrium
thus requires higher real interest rates 𝑟𝑟 when real GDP 𝑌𝑌 is higher, explaining the upward-
sloping 𝐿𝐿𝐿𝐿 curve.
Figure 8.18: The LM curve with a money supply target
Combining the 𝐿𝐿𝐿𝐿 curve (𝑀𝑀𝑀𝑀 line) with the output demand curve 𝑌𝑌 𝑑𝑑 in the goods market
leads to the IS-LM model, which is a special case of our new Keynesian model. What is called
the 𝐼𝐼𝐼𝐼 curve in that model is simply another label for what we call output demand 𝑌𝑌 𝑑𝑑 .
286
EC2065 Macroeconomics | Chapter 8: Business cycles
A change in the money-supply target causes the 𝐿𝐿𝐿𝐿 curve to shift, which has real effects on
the economy. Increasing the money supply 𝑀𝑀∗ implies the supply of real money balances
𝑀𝑀∗ /𝑃𝑃� is larger (the price level remaining constant at 𝑃𝑃�). Given the real demand for money
at a particular level of real GDP 𝑌𝑌, the intersection of money supply and demand occurs at a
lower nominal interest rate 𝑖𝑖 and, hence, also 𝑟𝑟. Since the 𝐿𝐿𝐿𝐿 curve represents
combinations of 𝑌𝑌 and 𝑟𝑟 where the money market is in equilibrium, the 𝐿𝐿𝐿𝐿 curve must shift
downwards and real GDP increases.
Why should the central bank want to choose interest rates that are positively related to 𝑌𝑌?
One argument is that this helps to stabilise an economy that is hit by demand shocks,
avoiding large output gaps between actual GDP 𝑌𝑌 and the natural level of output 𝑌𝑌 ∗ .
Furthermore, if 𝑌𝑌 ∗ is known or estimated, the interest-rate rule can be refined to react to
the gap 𝑌𝑌 − 𝑌𝑌 ∗ , or an estimate of this output gap.
We know from Section 8.6 that the optimal monetary policy is for the central bank to set
𝑖𝑖 = 𝑟𝑟 ∗ , where 𝑟𝑟 ∗ is the natural rate of interest. However, the central bank may not have
perfect information about 𝑟𝑟 ∗ . In that case, Figure 8.20 shows having a positive response of 𝑖𝑖
to 𝑌𝑌 can yield a better outcome for the economy than having monetary policy keep 𝑖𝑖
constant.
287
EC2065 Macroeconomics | Chapter 8: Business cycles
The RBC approach to understanding business cycles looks at how the equilibrium of the
economy in the goods, labour and money markets shown in Figure 8.21 is affected by supply
shocks.
288
EC2065 Macroeconomics | Chapter 8: Business cycles
289
EC2065 Macroeconomics | Chapter 8: Business cycles
Assuming the evidence from the Solow residual correctly indicates that transitory supply
shocks are hitting the economy, what might such shocks represent? One possibility is an
uneven pace of technological progress, where TFP might rise by more than usual in some
years, or by less in others. However, the Solow residual often falls sufficiently far below its
trend that the implied level of TFP actually declines in absolute terms. That is hard to
understand if TFP is representing technology because we would not expect that to go
backwards.
However, there are a number of other possible sources of transitory changes in TFP:
In the goods market, the real interest rate 𝑟𝑟 rises if 𝑌𝑌 𝑠𝑠 shifts more than 𝑌𝑌 𝑑𝑑 . The 𝑌𝑌 𝑑𝑑 shift
becomes larger if the drop in TFP lasts longer because that leads to a greater impact on
consumption and investment demand (less consumption smoothing and a greater impact on
the expected future marginal product of capital). The effect on employment 𝑁𝑁 in the labour
290
EC2065 Macroeconomics | Chapter 8: Business cycles
market is ambiguous because 𝑁𝑁 𝑑𝑑 falls but 𝑁𝑁 𝑠𝑠 can rise. The wealth effect on 𝑁𝑁 𝑠𝑠 is smaller
when TFP is expected to recover quickly. If 𝑁𝑁 𝑑𝑑 shifts to the left and 𝑁𝑁 𝑠𝑠 shifts to the right,
the real wage 𝑤𝑤 increases. A smaller wealth effect on 𝑁𝑁 𝑠𝑠 means that 𝑤𝑤 rises by less. Finally,
in the money market, 𝑀𝑀𝑑𝑑 falls with lower 𝑌𝑌, which leads to a higher price level 𝑃𝑃.
291
EC2065 Macroeconomics | Chapter 8: Business cycles
Real interest
Countercyclical Countercyclical (weakly)
rate
Average labour
Procyclical Procyclical (weakly)
productivity
In spite of this success, the source of transitory supply shocks is not obvious in
many business-cycle episodes. We could consider instead supply shocks with
permanent effects that are more easily interpreted as being due to the uneven
pace of technological progress. However, it is much harder to make the RBC model
consistent with the business-cycle stylised facts when supply shocks are highly
persistent.
A permanent productivity shock changes current and future TFPs 𝑧𝑧 and 𝑧𝑧′ by a
similar amount. Considering the case of permanent positive shock, the RBC model
predicts the following in comparison to a transitory shock:
• Wealth effects are larger, hence, 𝑁𝑁 𝑠𝑠 shifts further to the left, and 𝐶𝐶 𝑑𝑑 rises
by more, shifting 𝑌𝑌 𝑑𝑑 further to the right
• There is a greater incentive for firms to increase investment, and the larger
increase in 𝐼𝐼 𝑑𝑑 means 𝑌𝑌 𝑑𝑑 shifts further to the right
• The effect on 𝑁𝑁 𝑑𝑑 is same for a permanent and transitory shock to 𝑧𝑧.
Since 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 shift in the same direction, the RBC model can still generate
fluctuations in GDP 𝑌𝑌 with permanent shocks to TFP. However, as Figure 8.25
shows, the model will struggle to generate predictions consistent with the stylised
facts. The main problem is that the stronger wealth effect on labour supply 𝑁𝑁 𝑠𝑠
means that employment 𝑁𝑁 might fall when 𝑌𝑌 rises. Employment then becomes
countercyclical, which is clearly contrary to the data. Furthermore, the stronger
shift of 𝑁𝑁 𝑠𝑠 in the opposite direction to 𝑁𝑁 𝑑𝑑 implies much larger fluctuations in the
292
EC2065 Macroeconomics | Chapter 8: Business cycles
real wage, which now becomes very procyclical, contrary to the data. These
predictions are consistent with very long-run trends where hours worked have
fallen even though productivity and wages have risen permanently.
293
EC2065 Macroeconomics | Chapter 8: Business cycles
Why would such positive labour productivity spillover occur? Advocates of coordination-
failure models point to several possibilities. First, there could be complementarities in
employment across firms, for example, if more people are employed writing computer
software, this makes computer hardware more useful, so workers producing the hardware
are now making something of greater value. Another example is that delivery and logistics
are more efficient when such companies have more orders to fulfil.
A second possibility is what are known as ‘thick market’ effects. This is where an increased
amount of activity in a market provides some direct benefit to market participants. For
example, it is harder to attract customers to a shopping mall when not many other shops in
the mall are open. The coordination failure model explores the implications of such thick-
market effects and complementarities. Of course, a priori, it is also possible to envisage
negative spillovers across firms, so the validity of the assumption is debatable.
The spillover effect has important consequences for firms’ demand for labour. If an
individual firm increases its employment 𝑁𝑁𝑖𝑖 , then its marginal product of labour declines, so
the firm-level labour demand curve is downward-sloping as usual. However, if all firms are
increasing employment 𝑁𝑁𝑖𝑖 together, then aggregate employment 𝑁𝑁 rises and 𝑧𝑧(𝑁𝑁)
increases. The rise in 𝑧𝑧(𝑁𝑁) boosts the marginal product of labour in each firm, offsetting the
decline due to higher 𝑁𝑁𝑖𝑖 . This spillover effect might outweigh the declining firm-level
marginal product of labour, so marginal product of labour increases with aggregate
employment 𝑁𝑁. In that case, the aggregate-level labour demand curve becomes upward-
sloping. The firm-level and aggregate-level labour demand curves are plotted in Figure 8.26.
Figure 8.26: Aggregate labour demand in the coordination failure model
In what follows, we assume the positive spillover effect from firms’ employment is
sufficiently strong to make the aggregate labour demand curve upward sloping. Weaker
spillovers would mean 𝑁𝑁 𝑑𝑑 remains downward sloping, only becoming flatter. A strong
enough spillover to make 𝑁𝑁 𝑑𝑑 upward sloping implies increasing returns to labour at the
aggregate level, and results in an aggregate production function with a convex shape as
shown in Figure 8.27. Note that we assume a spillover effect – an externality – rather than
assume firms directly have an increasing-returns production function to maintain the
294
EC2065 Macroeconomics | Chapter 8: Business cycles
framework of perfect competition. Increasing returns at the firm level requires a model with
imperfectly competitive firms, similar to that used earlier in the new Keynesian model with
sticky prices.
The relationship between 𝑟𝑟 and the supply of goods can be reversed with a strong enough
spillover effect in the coordination failure model. High output and high employment
generate a strong productivity-boosting spillover, which means firms are willing to choose
high employment even when a low interest rate 𝑟𝑟 reduces workers’ desire to save by
earning more. As shown in Figure 8.29, the coordination failure model features a
downward-sloping output supply curve 𝑌𝑌 𝑠𝑠 . This profoundly changes the predictions of the
model compared to the standard dynamic macroeconomic model.
295
EC2065 Macroeconomics | Chapter 8: Business cycles
Figure 8.29: The output supply curve in the coordination failure model
296
EC2065 Macroeconomics | Chapter 8: Business cycles
markets because every point on the output supply curve represents a point of labour-
market equilibrium given the way the 𝑌𝑌 𝑠𝑠 curve is constructed). Without more assumptions,
it is not possible to say which equilibrium will be the outcome. Usually, we derive the
predictions of a model from looking at what happens in equilibrium but that is not sufficient
when there are multiple equilibria.
Figure 8.30: Two equilibria in the coordination failure model
What explains why the coordination failure model has multiple equilibria? This feature is
due to the presence of what is known as ‘strategic complementarity’. Strategic
complementarity refers to a situation where one person’s desire to perform an action
increases when others are also performing the same action. In the model, an individual
firm’s marginal product of labour rises when other firms are expanding employment and,
hence, it becomes profitable for firms to expand employment when other firms are doing
so. This creates strategic complementarity in firms’ employment decisions and sufficiently
strong strategic complementarity leads to multiple equilibria because different choices
made by individuals are mutually reinforcing.
While both equilibria in Figure 8.30 are fully consistent with rational optimisation by
individual households and firms, nonetheless, households are generally not indifferent
between them. The high-output equilibrium is good. It features high consumption, and
wages and productivity are high. Leisure is low but that choice makes sense because
productivity is high. The low-output equilibrium is bad. Consumption is low, and wages and
productivity are low, only mitigated by high leisure but that is chosen because productivity
is low. The inefficiency of the low-output equilibrium is due to the productivity spillover
effect that individuals fail to internalise.
Although everyone prefers the equilibrium with high output, there can be a coordination
problem in reaching it. At a low level of output, it makes sense for all firms in the economy
collectively to switch to the high-output equilibrium. But at the low-output equilibrium, it is
individually rational for each firm to choose low output. Therefore, the economy could get
stuck at the equilibrium with low GDP for some time, even though everyone would gain by
297
EC2065 Macroeconomics | Chapter 8: Business cycles
coordinating on the high-GDP equilibrium. A recession can thus result from a coordination
failure.
What ultimately determines which of the two equilibria prevails? If everyone believes the
high-output equilibrium will prevail, then it will, while if everyone believes the low-output
equilibrium will prevail, then it will instead. Thus, the outcome depends on whether people
are optimistic or pessimistic about the economy’s prospects. However, both of these are
consistent with rational expectations. There is an independent role for beliefs, which are not
uniquely determined by requirement of rationality.
298
EC2065 Macroeconomics | Chapter 8: Business cycles
Even if the aggregate 𝑁𝑁 𝑑𝑑 curve is upward sloping, this is not enough. If labour
supply is insufficiently wage elastic, i.e. 𝑁𝑁 𝑠𝑠 (𝑟𝑟) is too steep, then there is a unique
equilibrium even if there are increasing returns to labour in aggregate. This case is
depicted in Figure 8.33.
Figure 8.33: Labour supply insufficiently wage elastic
299
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Essential reading
We now consider a model with partial price adjustment to bridge the two extremes above.
This model also implies a close link between inflation and the state of the real economy, the
Phillips curve.
300
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
trade-off is why firms will not change their prices all the time, nor leave their prices constant
forever.
Moreover, a larger gap between 𝑌𝑌 and 𝑌𝑌 ∗ means that firms’ desired price change is larger
and more firms will prefer to adjust prices after taking account of the costs of doing so.
Intuitively, as output and employment increase, the marginal product of labour 𝑀𝑀𝑃𝑃𝑁𝑁
declines (as does 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ), which increases firms’ marginal cost of production. The wage 𝑤𝑤
must also increase to raise the supply of labour, which additionally adds to the cost of
production. It follows that the direction and size of price changes depends on the ‘output
gap’ between actual GDP 𝑌𝑌 and its natural level 𝑌𝑌 ∗ .
Figure 9.1: Firms' incentives to change prices
Inflation is defined as the rate of change of the price level 𝑃𝑃 over time. The notation we will
use in this chapter is that 𝜋𝜋 is the inflation rate between the past and current time periods
(note this is different from our earlier notation in Chapter 6) and 𝜋𝜋′ is the inflation rate
between the current and future time periods, i.e. 𝜋𝜋 = (𝑃𝑃 − 𝑃𝑃�)/𝑃𝑃� and 𝜋𝜋 ′ = (𝑃𝑃′ − 𝑃𝑃)/𝑃𝑃,
where 𝑃𝑃� is the past level of prices. Mechanically, inflation 𝜋𝜋 is positive when firms are
increasing prices on average, or equivalently, when newly set prices are higher than the
average of past prices 𝑃𝑃�. By the same logic, future inflation 𝜋𝜋′ is expected to be positive
when firms will set higher prices than the current average 𝑃𝑃 in the future.
301
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
9.1.3 Expectations
When there are costs of making price changes and firms do not expect to be adjusting prices
continually, they also need to consider future conditions when setting current prices. If it is
desirable to set higher prices in future time periods then expected future inflation 𝜋𝜋 ′𝑒𝑒 will
be positive. Hence, any firm adjusting its prices in current period will choose a larger price
increase when 𝜋𝜋 ′𝑒𝑒 is higher, so higher expected future inflation 𝜋𝜋 ′𝑒𝑒 leads to more inflation 𝜋𝜋
in the current time period. The effect of 𝜋𝜋 ′𝑒𝑒 on 𝜋𝜋 is less than one-for-one because less
weight is given to future conditions than current conditions when setting prices now.
We refer to the relationship between inflation 𝜋𝜋 and the output gap 𝑌𝑌 − 𝑌𝑌 ∗ as the ‘Phillips
curve’. This is the upward-sloping line or curve 𝑃𝑃𝑃𝑃 in Figure 9.2 with 𝜋𝜋 on the vertical axis
and real GDP 𝑌𝑌 on the horizontal axis. If no future inflation is expected (𝜋𝜋 ′𝑒𝑒 = 0), the Phillips
curve passes through the point with 𝜋𝜋 = 0 when 𝑌𝑌 = 𝑌𝑌 ∗ because there is no inflationary
pressure when output is at its natural level and no future inflation is expected. The Phillips
curve shifts to the right if the natural level of output 𝑌𝑌 ∗ rises and shifts upwards if expected
future inflation 𝜋𝜋 ′𝑒𝑒 is higher.
Figure 9.2: The Phillips curve
The gradient of the Phillips curve indicates how much inflation 𝜋𝜋 rises when output 𝑌𝑌
increases. The Phillips curve would be flatter if there is less inflationary pressure because of
more nominal rigidity, for example, fewer firms being willing to change price because the
costs of price adjustment are larger.
302
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
The Phillips curve is also flatter if there is greater ‘real rigidity’. Real rigidity refers to firms’
desired prices being less sensitive to what happens to real GDP 𝑌𝑌. For example, if the
marginal product of labour curve 𝑀𝑀𝑃𝑃𝑁𝑁 is flatter (and hence, also 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ) because returns to
labour diminish less rapidly, then firms’ cost of production rises by less with output,
reducing inflationary pressure. This also occurs if the labour supply curve 𝑁𝑁 𝑠𝑠 is flatter, so
wages need to rise by less to induce an increase in labour supply. Another source of real
rigidity is the presence of efficiency wage concerns that mean real wages are less sensitive
to economic conditions, for example, when a particular constant real wage level maximises
the amount of effective labour input per unit of wages paid.
303
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
𝑟𝑟 𝑒𝑒 = 𝑖𝑖 − 𝜋𝜋 ′𝑒𝑒
Given a nominal interest rate 𝑖𝑖, higher expected future inflation 𝜋𝜋 ′𝑒𝑒 reduces the expected
real interest rate 𝑟𝑟 𝑒𝑒 . This means there is less incentive to save and a greater incentive to
borrow. With these substitution effects, and ignoring income effects with a representative
household as explained in Section 3.11, higher 𝜋𝜋 ′𝑒𝑒 leads to higher 𝐶𝐶 𝑑𝑑 . It also raises
investment demand 𝐼𝐼 𝑑𝑑 according to the model from Section 3.8. Output demand 𝑌𝑌 𝑑𝑑
therefore depends positively on expected future inflation 𝜋𝜋 ′𝑒𝑒 .
However, this multiplier did not feature in our analysis of the output demand curve 𝑌𝑌 𝑑𝑑 from
the dynamic macroeconomic model of Chapter 3. There we emphasised that higher
304
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
government expenditure 𝐺𝐺 increases the tax burden, reducing the amount of private
consumption 𝐶𝐶 that is affordable. Income 𝑌𝑌 did increase but, as seen in Box 4.2, that was
the result of the decision of households to supply more labour when faced with a higher tax
burden. Higher GDP 𝑌𝑌 caused by higher 𝐺𝐺 did not in itself make households better off.
𝜕𝜕𝐶𝐶 𝑑𝑑
𝑀𝑀𝑀𝑀𝑀𝑀 =
𝜕𝜕𝜕𝜕
The aggregate demand for output is 𝑌𝑌 𝑑𝑑 = 𝐶𝐶 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺. Consumption demand 𝐶𝐶 𝑑𝑑 is now a
function of 𝑌𝑌, and in equilibrium, income is equal to aggregate demand (𝑌𝑌 = 𝑌𝑌 𝑑𝑑 ). Figure 9.3
plots the expenditure function 𝑌𝑌 𝑑𝑑 against aggregate output and income 𝑌𝑌.
Figure 9.3: Output demand with multiplier
305
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
If aggregate expenditure 𝑌𝑌 𝑑𝑑 were to increase, for example, owing to higher 𝐺𝐺, the
expenditure function shifts vertically upwards by the same amount. By finding the new
point of intersection with the 45∘ line, we see that the overall effect on 𝑌𝑌 is larger than the
size of the shift of the 𝑌𝑌 𝑑𝑑 function. A marginal propensity to consume between 0 and 1
(making the expenditure function upward sloping but less steep than the 45∘ line) amplifies
the effects of changes in output demand 𝑌𝑌 𝑑𝑑 because of a positive feedback loop working
through income and consumption.
The consequences of this for the output demand curve 𝑌𝑌 𝑑𝑑 (a relationship between real GDP
𝑌𝑌 and the real interest rate 𝑟𝑟) derived in Section 3.12 is that the 𝑌𝑌 𝑑𝑑 curve becomes flatter,
i.e. more sensitive to 𝑟𝑟 and shifts become larger than they would be in the standard
dynamic macroeconomic model of Chapter 3.
306
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
307
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
The constant 𝑟𝑟 reduces crowding out of private consumption and investment expenditure
when 𝐺𝐺 rises. The increase in real GDP 𝑌𝑌 is equal to the size of the horizontal shift of 𝑌𝑌 𝑑𝑑 and
we have seen that this can be larger than the increase in 𝐺𝐺 when there are sufficiently many
credit-constrained households. It is therefore possible that ‘multiplier’ effects dominate
‘crowding-out’ effects and GDP rises by more than a deficit-financed increase in government
expenditure.
308
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Finally, the Federal Reserve was unable fully to offset the large leftward shift of 𝑌𝑌 𝑑𝑑
by reducing interest rates and shifting down the 𝑀𝑀𝑀𝑀 line. This is because of the
lower bound on the nominal interest rate, an issue we will discuss further in Section
9.7. Assuming the nominal interest rate 𝑖𝑖 cannot fall below zero, the lowest
possible real interest rate was −𝜋𝜋 ′𝑒𝑒 , the negative of expected future inflation, and
inflation expectations were relatively low during the 2000s.
309
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Higher inflation 𝜋𝜋 thus causes an upward shift of the 𝑀𝑀𝑀𝑀 curve. The higher real interest rate
induced by monetary policy causes a reduction in aggregate demand, a movement up the
𝑌𝑌 𝑑𝑑 curve and 𝑌𝑌 is lower. Taking as given expectations of 𝜋𝜋 ′ and 𝑌𝑌 ′ , this implies a negative
demand-side relationship between inflation 𝜋𝜋 and real GDP 𝑌𝑌. We label this the 𝑌𝑌 𝑑𝑑 − 𝑀𝑀𝑀𝑀
line, coming from the combination of the output demand curve and the stance of monetary
policy. Putting this together with the upward-sloping Phillips curve that represents the
supply side of the economy, the intersection between 𝑌𝑌 𝑑𝑑 − 𝑀𝑀𝑀𝑀 and 𝑃𝑃𝑃𝑃 determines
inflation 𝜋𝜋 and real GDP 𝑌𝑌. This is illustrated in Figure 9.6.
Figure 9.6: The Phillips curve, output demand, and monetary policy
The 𝑌𝑌 𝑑𝑑 − 𝑀𝑀𝑀𝑀 curve shifts to the right for the same reasons that cause the output demand
curve 𝑌𝑌 𝑑𝑑 to shift to the right. It shifts to the left if there is an exogenous increase in the
tightness of monetary policy. Using the links between expectations of the future and
310
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
aggregate demand studied in Section 9.2, it also shifts to the right with an increase in
expected future inflation 𝜋𝜋 ′ or expected future GDP 𝑌𝑌 ′ .
311
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
312
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
In the standard new Keynesian model, we have seen that aiming for 𝜋𝜋 = 0 strictly
means accepting real GDP 𝑌𝑌 = 𝑌𝑌 ∗ , which closes the output gap. This suggests a
‘strict’ inflation target is not necessarily bad. One concern might be that real GDP
𝑌𝑌 ∗ is inefficiently low because 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 = 𝑤𝑤 ∗ = 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 implies 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶
(recalling that 𝑀𝑀𝑃𝑃𝑁𝑁 > 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 ). But as we will see in Box 9.5, aiming for 𝑌𝑌
systematically above 𝑌𝑌 ∗ risks an inflation bias.
Putting aside concerns that output 𝑌𝑌 ∗ is too low on average, what about
fluctuations in 𝑌𝑌 ∗ and, hence, in actual real GDP 𝑌𝑌 = 𝑌𝑌 ∗ when 𝜋𝜋 = 0? Will a strict
inflation target cause real GDP to fluctuate too much? Let us consider this point
after when efficiency wages are added to the new Keynesian model. This means the
level of employment 𝑁𝑁 ∗ associated with the natural level of output 𝑌𝑌 ∗ is found
where the efficiency wage 𝑤𝑤 ∗ equals 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 , rather than where 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 intersects
the labour supply curve 𝑁𝑁 𝑠𝑠 . The efficient level of employment 𝑁𝑁� is where 𝑀𝑀𝑃𝑃𝑁𝑁 =
𝑀𝑀𝑀𝑀𝑆𝑆𝑙𝑙,𝐶𝐶 , which lies on the 𝑁𝑁 𝑠𝑠 curve as shown in Figure 9.7.
Figure 9.7: Excessive real GDP fluctuations with a strict inflation target
Now consider a negative supply shock that shifts 𝑀𝑀𝑃𝑃𝑁𝑁 and 𝑀𝑀𝑀𝑀𝑃𝑃𝑁𝑁 to the left. As
seen in the figure, this shock causes employment 𝑁𝑁 ∗ and output 𝑌𝑌 ∗ to fall too much
compared to what is efficient (𝑁𝑁 � ). Rigidities such as efficiency wages can therefore
make the levels of employment and output too volatile if the central bank follows a
strict inflation target resulting in 𝑁𝑁 = 𝑁𝑁 ∗ and 𝑌𝑌 = 𝑌𝑌 ∗ .
In these circumstances, a flexible inflation target generally performs better than a
strict inflation target. Following a temporary negative supply shock, allowing
inflation 𝜋𝜋 to rise means 𝑁𝑁 will be above 𝑁𝑁 ∗ (this is a movement along the Phillips
curve). This is a better outcome because employment drops too much if 𝜋𝜋 = 0,
although there is a trade-off between the cost of the positive inflation rate and the
better outcome for real GDP.
313
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
314
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
distortions is the imperfect competition in the goods market that leads firms to produce
too little as a way of boosting profits. But other distortions such as ‘efficiency wages’ or
taxes that reduce the incentive to work also worsen the inefficiency of the natural level of
output.
In light of GDP being inefficiently low, should monetary policy aim to provide additional
stimulus even when 𝑌𝑌 = 𝑌𝑌 ∗ ? Suppose monetary policy reduces the real interest rate
below the natural interest rate 𝑟𝑟 ∗ to push real GDP above its natural level 𝑌𝑌 ∗ . Taking as
given inflation expectations 𝜋𝜋 ′𝑒𝑒 , this policy moves the economy along a Phillips curve
with output 𝑌𝑌 and inflation 𝜋𝜋 both rising. This seems to improve the outcome for real
GDP, albeit at the cost of some inflation.
However, this policy is not a response to a shock but to economic activity being judged
systematically too low. Hence, the higher inflation that results from it can be anticipated.
This causes inflation expectations to rise, which shifts the Phillips curve upwards. The
upward shift of the Phillips curve worsens the combinations of inflation and real GDP that
can be attained by the central bank. Now, an even higher rate of inflation is required to
achieve a given target for the level of real GDP, and when that is anticipated, this leads to
even higher expected inflation. Eventually, inflation expectations stop rising when further
inflation would be too costly for the central bank to tolerate.
These adverse shifts of the Phillips curve mean that there is higher inflation with only a
limited (or no) gain in terms of real GDP. Monetary policy therefore suffers from an
‘inflation bias’. The freedom to use monetary policy to aim for the efficient level of real
GDP leads to higher inflation, but fails to achieve its original goal. It results in higher costs
of inflation, without gaining much or anything by way of a better outcome for real GDP.
If the central bank were to announce it would target a lower rate of inflation and inflation
expectations fell, the Phillips curve would be in a more favourable position. But then
there would be a temptation to pursue an expansionary monetary policy to raise 𝑌𝑌 above
𝑌𝑌 ∗ . This points to the time inconsistency of announcing a goal of lower inflation.
Owing to the time inconsistency problem, an announcement that the central bank will
pursue low inflation is not credible. But there would be gains from being able to reduce
inflation expectations and obtain a Phillips curve in a more favourable position if the
central bank were able to commit itself to a low-inflation monetary policy.
What institutional mechanisms might enable the central bank to do this? First, giving the
central bank independence might insulate it from political pressure to aim for real GDP 𝑌𝑌
above 𝑌𝑌 ∗. Second, establishing a framework for monetary policy where the central bank
is given the primary task of controlling inflation and is judged by the public on how well it
performs in this might shift the focus of monetary policy away from the level of real GDP.
An independent central bank given the job of meeting an inflation target is the current
consensus on how best to achieve this. The central bank is usually allowed to interpret its
inflation target ‘flexibly’ and consider business-cycle fluctuations in real GDP in its policy
deliberations but to focus only on inflation in the long run.
315
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
The ex-ante Fisher equation implies the real interest rate is 𝑟𝑟 = 𝑖𝑖 − 𝜋𝜋 ′𝑒𝑒 . Taking as given
inflation expectations 𝜋𝜋 ′𝑒𝑒 , if there is a lower bound 𝑖𝑖 ≥ 0 on the nominal interest rate
(taken to be zero here) then the real interest rate is subject to the lower bound 𝑟𝑟 ≥ −𝜋𝜋 ′𝑒𝑒 .
The real interest rate cannot fall below the negative of the rate of inflation that is expected.
The problem is that a large shock to the economy may reduce the natural rate of interest 𝑟𝑟 ∗
below −𝜋𝜋 ′𝑒𝑒 , meaning that it is not feasible to get to 𝑟𝑟 = 𝑟𝑟 ∗ by lowering 𝑖𝑖. This challenge for
monetary policy is illustrated in Figure 9.8.
Figure 9.8: Interest-rate lower bound and the limits to stabilisation policy
The inability to use conventional monetary policy to stabilise the economy once the
interest-rate lower bound is reached has led central banks to use or consider using
unconventional monetary policies instead, such as:
• Quantitative easing
• Forward guidance
• Negative interest rates.
316
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Suppose the central bank creates more money and purchases short-term nominal
government bonds. Since the interest-rate lower bound has been reached, these short-term
nominal government bonds have yield 𝑖𝑖 = 0. But as Section 6.13 has explained, at the
interest-rate lower bound, the demand for money becomes perfectly interest elastic
because money and risk-free nominal bonds are perfect substitutes at the margin – the
‘liquidity trap’. As Figure 9.9 shows, this additional money would be passively absorbed into
money demand but there would be no change in interest rates, prices, or real GDP.
Can quantitative easing be adapted so that expansions of the money supply do have an
effect on the economy? What needs to be done differently?
Figure 9.9: The liquidity trap and the limits to stabilisation policy
It is implicit in the example from Figure 9.9 that the expansion of the money supply is only
temporary and would be reversed once the interest-rate lower bound ceases to bind and QE
is no longer needed. As explained in Box 6.3, the effects of a permanent expansion of the
money supply would be quite different, causing an increase in prices and inflation, and
leading here to a movement along the Phillips curve with higher economic activity. Hence, a
commitment to maintain a monetary expansion even in normal times in the future might in
principle be a more effective form of QE, although it may suffer from a time-inconsistency
problem and lack credibility if people think the central bank will change course in the future.
Another proposal is a ‘helicopter drop’ of money, where a monetary expansion is
transferred to the government and given away in the form of lower taxes or increased
transfer payments for households. In principle, this is very similar to the permanent
expansion of the money supply described above – which would deliver a fiscal gain to the
government from reducing the real value of existing money and nominal debt, and
ultimately show up as lower taxes. But one important difference is that it would be more
difficult to reverse because the central bank has given the money away and does not hold
any additional assets that could be used to buy back the money. While this policy might be
more effective, it is widely seen as setting a dangerous precedent and blurring the lines
between monetary and fiscal policy.
317
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
Rather than claim monetary expansions will be permanent, real-world central banks’ QE has
instead attempted to get around the liquidity trap problem by making purchases of long-
term government bonds with a positive yield, or private-sector assets such as corporate
bonds or mortgage-backed securities. These assets share the feature that they are risky,
unlike short-term government bonds. Private-sector assets are subject to default risk and
even long-term government bonds are risky for those not holding them until maturity (see
Box 7.5). These risky assets are not perfect substitutes for money, unlike short-term
government bonds at interest-rate lower bound.
We can use the model of portfolio choice from Section 7.8 to analyse the effects of
purchases of risky assets by the central bank. Central-bank purchases of risky assets require
in equilibrium that private investors hold a smaller fraction of risky assets and a larger
fraction of risk-free assets (the money the central bank creates) in their portfolios. As shown
in Figure 9.10, a lower risk premium is needed for this portfolio to be chosen, so central-
bank purchases result in a decline in risk premiums. This is known as the ‘portfolio balance
effect’. This then helps an economy at the lower bound by reducing the cost of credit for
risky borrowers, or those borrowing over long periods.
Figure 9.10: The portfolio balance effect
If interest rates can be lowered in future periods then this would be expected to raise future
output 𝑌𝑌 ′ and future inflation 𝜋𝜋 ′ . Higher expectations of 𝜋𝜋 ′ and 𝑌𝑌 ′ both increase output
demand in the current period through the channels explained in Section 9.2, even though
318
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
there is no immediate change in interest rates. Thus, forward guidance might work through
manipulating expectations of the future in a way that improves current economic outcomes.
However, the policy is not a ‘free lunch’ because future outcomes are worse if future
interest rates are set at an inappropriate level for the conditions then prevailing.
This also points to a potential problem with the credibility of forward guidance. Once the
future period is reached, the central bank may not want to keep interest rates too low. The
benefit of announcing this was better economic outcomes in the past but that is now a
bygone, while the cost is worse current economic outcomes. This time-inconsistency
problem can undermine the credibility of a forward guidance announcement. For people to
believe the announcement, the central bank needs to make a binding commitment but lacks
any straightforward way to tie its hands.
Box 9.6: Inflation targeting and the interest-rate lower bound problem
If the lower bound on nominal interest rates jeopardises macroeconomic stability and
unconventional monetary policies are seen as ineffective or costly are there any reforms
to economic policy that would mitigate the lower bound problem? For example, should
the existing framework of inflation targeting that is used in many countries be changed
or abandoned? Below we explore a number of alternatives.
Raising the inflation target
Suppose a higher target 𝜋𝜋 ∗ for inflation is chosen. The Fisher equation with the real
interest rate at its natural rate 𝑟𝑟 ∗ in the long run implies 𝑖𝑖 = 𝑟𝑟 ∗ + 𝜋𝜋 ∗ . A higher inflation
target 𝜋𝜋 ∗ thus means a higher nominal interest rate 𝑖𝑖 on average, which gives a larger
cushion to adjust 𝑖𝑖 downwards. With 𝜋𝜋 ′𝑒𝑒 ≈ 𝜋𝜋 ∗ and a lower bound of zero on the nominal
interest rate 𝑖𝑖, the real interest rate can be reduced to −𝜋𝜋 ∗ , which is lower when 𝜋𝜋 ∗ is
higher.
Therefore, if inflation targets were raised, monetary policy can shift the 𝑀𝑀𝑀𝑀 line further
downwards. This gives the central bank a greater ability to offset larger negative demand
shocks. However, there are costs of having higher 𝜋𝜋 on average, as discussed in Section
9.5.
Average inflation targeting
It would also be possible to replace the standard form of inflation targeting with what is
called ‘average inflation targeting’. This has the central bank aim for a target based on
the inflation rate averaged over a number of years. Average inflation targeting was
adopted by the US Federal Reserve in August 2020.
Consider an example of this policy where the target is for inflation averaged over two
periods. If this is credible, people expect that an average of current inflation 𝜋𝜋 and future
inflation 𝜋𝜋 ′ will remain stable even if 𝜋𝜋 does not. Suppose 𝜋𝜋 falls because of a negative
demand shock that the central bank cannot offset because of the interest-rate lower
bound. Under this policy, there is then an increase in expectations of 𝜋𝜋 ′ – assuming
monetary policy is not also constrained in the future period. As explained in Section 9.2,
higher 𝜋𝜋 ′𝑒𝑒 reduces the real interest rate and boosts demand in the current period.
319
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
320
EC2065 Macroeconomics | Chapter 9: Inflation, expectations and macroeconomic policy
By setting negative nominal interest rates, the central bank can shift the 𝑀𝑀𝑀𝑀 line
further downwards along the 𝑌𝑌 𝑑𝑑 curve, which allows it to stabilise the economy for
a larger range of negative demand shocks.
However, there may be costs of this negative interest rate policy. First, there can be
an inefficiently large use of cash to avoid negative rates, which wastes resources on
security and storage costs (see Section 6.14). Negative interest rates also mean
banks cannot break even without negative interest rates charged on deposits. But if
households and firms were to switch from using bank deposits to using cash as
money then this would eliminate the benefits of financial intermediation provided
by banks (see Section 7.10).
321
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Essential reading
• Williamson, Chapters 16 and 17.
322
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
The terminology related to the financial account (𝐹𝐹𝐹𝐹) has changed over time. You may see it
referred to as the capital account (𝐾𝐾𝐾𝐾) in some textbooks. The financial account comprises
foreign net purchases of domestic assets minus domestic net purchases of foreign assets.
Any purchases or sales of official foreign-exchange reserves are included in 𝐹𝐹𝐹𝐹 in this
course, so the financial account captures all trade in assets, both by the private sector and
the government.
The balance of payments identity is 𝐵𝐵𝐵𝐵 = 0, so the financial and current accounts must in
equilibrium be related according to:
𝐹𝐹𝐹𝐹 = −𝐶𝐶𝐶𝐶
Next, let us see how international trade affects the goods market and the relationship
between saving and investment. Net exports 𝑁𝑁𝑁𝑁 are a component of aggregate
expenditure, so the breakdown of GDP by expenditure is now:
𝑌𝑌 = 𝐶𝐶 + 𝐼𝐼 + 𝐺𝐺 + 𝑁𝑁𝑁𝑁
An open economy also has a separate notion of gross national product (GNP) as well as GDP.
GNP is equal to GDP plus net foreign income:
𝐺𝐺𝐺𝐺𝐺𝐺 = 𝑌𝑌 + 𝑁𝑁𝑁𝑁𝑁𝑁
National saving 𝑆𝑆 is the sum of private and public saving, 𝑆𝑆 = 𝑆𝑆 𝑝𝑝 + 𝑆𝑆 𝑔𝑔 , where private saving
is 𝑆𝑆 𝑝𝑝 = (𝑌𝑌 + 𝐼𝐼𝐼𝐼𝐼𝐼 + 𝑁𝑁𝑁𝑁𝑁𝑁 − 𝑇𝑇) − 𝐶𝐶. This is disposable income minus consumption 𝐶𝐶, where
the private sector receives domestic income 𝑌𝑌, interest on government debt 𝐼𝐼𝐼𝐼𝐼𝐼, net
foreign income 𝑁𝑁𝑁𝑁𝑁𝑁, and pays taxes net of transfers 𝑇𝑇. Public saving is 𝑆𝑆 𝑔𝑔 = 𝑇𝑇 − 𝐼𝐼𝐼𝐼𝐼𝐼 − 𝐺𝐺,
with the budget deficit being −𝑆𝑆 𝑔𝑔 . National saving is therefore 𝑆𝑆 = 𝑌𝑌 + 𝑁𝑁𝑁𝑁𝑁𝑁 − 𝐶𝐶 − 𝐺𝐺 =
𝐺𝐺𝐺𝐺𝐺𝐺 − 𝐶𝐶 − 𝐺𝐺, and the national accounts and balance-of-payments identities imply:
𝑆𝑆 = 𝐼𝐼 + 𝐶𝐶𝐶𝐶
This means that a country can save either through investment in the domestic capital stock,
or through running a current account surplus (𝐶𝐶𝐶𝐶 > 0). Equivalently, a financial account
deficit (𝐹𝐹𝐹𝐹 < 0) implies lending to the rest of the world. A country can borrow by running a
current account deficit (𝐶𝐶𝐶𝐶 < 0), noting that a financial account surplus (𝐹𝐹𝐹𝐹 > 0) implies
borrowing from the rest of the world.
Saving, investment, and the current account are flow variables. Net foreign assets 𝑁𝑁𝑁𝑁𝑁𝑁 is
net stock of international savings, defined as the value of foreign assets owned by domestic
residents minus value of domestic assets owned by foreign residents. The accounting
equation for the link between current and future net foreign assets denoted by 𝑁𝑁𝑁𝑁𝑁𝑁′ is:
𝑁𝑁𝑁𝑁𝐴𝐴′ = 𝑁𝑁𝑁𝑁𝑁𝑁 − 𝐹𝐹𝐹𝐹 + Net capital gains
For the purposes of this course, we will ignore changes in asset valuations. Hence, the
current account gives the change in net foreign assets:
323
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
We will focus on international trade done by a small open economy (SOE). An SOE is not
large enough to affect what happens in other countries or in world markets. This makes the
country a price taker in competitive world markets. The country can be affected by what
happens in the rest of the world but there is no feedback from it to the world. This
assumption is appropriate for some economies, such as Singapore, although not for others.
However, many of the lessons for SOEs carry over to the analysis of large open economies.
324
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Now consider a country open to international trade. We suppose that capital mobility is
perfect. Perfect capital mobility means there are no restrictions on capital flows. Domestic
residents can freely buy or sell foreign assets and foreigners can freely buy or sell domestic
assets. As domestic and foreign bonds are equivalent, the domestic real interest rate 𝑟𝑟 must
equal the foreign real interest rate 𝑟𝑟 ∗ and a small open economy is not big enough to
influence 𝑟𝑟 ∗ . This means the economy is effectively a participant in a perfectly competitive
world market for bonds with 𝑟𝑟 = 𝑟𝑟 ∗ .
325
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Since the international budget constraint always passes through the point (𝑌𝑌 − 𝐺𝐺, 𝑌𝑌 ′ − 𝐺𝐺 ′ )
that is the autarky equilibrium irrespective of the level of 𝑟𝑟 ∗ , it follows that whenever 𝑟𝑟 ∗ ≠
𝑟𝑟𝑎𝑎 , households can get on to a higher indifference curve through trade than they can reach
in autarky. This is because the indifference curve at the autarky point must cut the budget
constraint because the indifference curve has gradient −(1 + 𝑟𝑟𝑎𝑎 ) there, while the budget
constraint has gradient −(1 + 𝑟𝑟 ∗ ) everywhere.
Figure 10.2: Gains from trade
This account of the gains from international trade also points towards some potential
determinants of the current account. By making an analogy with an individual’s life cycle
and consumption choice, we can see that the stage of development of a country can be one
long-run factor that might affect its current account.
For example, a country with high GDP due to extraction of an exhaustible resource is in a
similar position to a middle-aged person planning for retirement. It makes sense to save to
smooth consumption, which for the country as a whole means running a 𝐶𝐶𝐶𝐶 surplus. This
helps to explain the large current-account surpluses of oil exporters such as Saudi Arabia.
Figure 10.3: Long-run explanations for the current account
326
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
On the other hand, a developing country is in a similar position to a young person who
expects a much higher income in the future. It makes sense to borrow to smooth
consumption, which for the country as a whole means running a 𝐶𝐶𝐶𝐶 deficit. However, there
are many examples where developing countries are seen not to behave in this way, running
large current-account surpluses instead. This is related to the puzzle of why capital does not
flow from rich countries to poorer countries.
Although we have seen that there are gains from international trade, changes in the world
interest rate can increase or decrease the size of the gains from trade, although these
always remain positive relative to autarky. The argument is equivalent to how individual
savers and borrowers are affected by changes in the real interest rate. Recalling the
argument from Section 3.5, a higher interest rate implies a substitution effect reducing
current consumption and increasing 𝐶𝐶𝐶𝐶 for both. The higher interest rate makes a country
with a current-account surplus (a saver) better off but a country with a current-account
deficit (a borrower) worse off. The overall effect on the current account is ambiguous when
it is initially in surplus but, if it is initially in deficit, then the deficit should fall.
Figure 10.4: Increase in world interest rate
327
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Intuitively,
Box 10.1:a negative income risk
International shocksharing
leads a country to lower its international
lending or increase its international borrowing. Adjustment of the current account
Suppose a country faces a temporary negative shock to real GDP 𝑌𝑌. In the two-
thus helps households faced with income shocks to smooth consumption. The
period model, current income falls but expected future income remains the same.
model predicts that the current account falls with a negative income shock, so it is
Figure 10.5 shows the leftward shift of the budget constraint passing through (𝑌𝑌 −
predicted to be a procyclical variable. Figure 10.6 shows data on the cyclicality of
𝐺𝐺, 𝑌𝑌 ′ − 𝐺𝐺 ′ ) as 𝑌𝑌 falls. The optimal choice of current consumption 𝐶𝐶 ∗ declines but by
the current account in the USA. Empirically, the US current account is actually
less than 𝑌𝑌 does. This reflects the desire of households to smooth consumption by
countercyclical, although the USA is far from a small open economy and many
choosing a decline in 𝐶𝐶 ′∗ as well to permit a smaller fall in 𝐶𝐶. It follows that the
countries’ own business-cycle fluctuations correlate with what happens in the USA.
current account 𝐶𝐶𝐶𝐶 = (𝑌𝑌 − 𝐺𝐺) − 𝐶𝐶 declines after the income shock leaving the
If global business cycles are correlated across countries but more volatile outside
economy with a smaller 𝐶𝐶𝐶𝐶 surplus or a larger 𝐶𝐶𝐶𝐶 deficit.
the USA, it is less surprising that the USA borrows more from the rest of the world
in booms and lends to the rest of the world (or borrows less) in recessions.
Figure 10.5: Negative shock to current income
328
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
329
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
However, this prediction for the current account is not tightly linked to the budget
deficit. We obtain the same prediction for 𝐶𝐶𝐶𝐶 irrespective of whether current taxes
𝑇𝑇 rise to pay for higher 𝐺𝐺, in which case there is no change in the budget deficit, or
future taxes 𝑇𝑇′ increase and the budget deficit rises. National saving 𝑆𝑆 = 𝑌𝑌 − 𝐶𝐶 − 𝐺𝐺
falls by the same amount in both these cases.
This is because Ricardian equivalence implies that timing of taxes does not affect
households’ consumption choices taking as given the government’s spending plans
(𝐺𝐺, 𝐺𝐺 ′ ). Households’ budget constraint passes through the point (𝑌𝑌 − 𝐺𝐺, 𝑌𝑌 ′ − 𝐺𝐺 ′ ),
which determines its position. This means private saving 𝑆𝑆 𝑝𝑝 will adjust to offset any
changes in public saving 𝑆𝑆 𝑔𝑔 due to taxes and, consequently, national saving 𝑆𝑆 =
𝑆𝑆 𝑝𝑝 + 𝑆𝑆 𝑔𝑔 is unaffected by the timing of taxes. The same is therefore true of the
330
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
331
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
shape. The present-value budget constraint passes through the point (𝑌𝑌 − 𝐵𝐵, 𝑌𝑌 ′ ), where
initial debts 𝐵𝐵 are subtracted from current income 𝑌𝑌.
Now suppose that a nation has the option of (full) default in either the current or future
periods. If a default occurs in the future period then the country wipes out debt 𝐵𝐵′ but
incurs a penalty 𝑣𝑣, the effect of which is measured as equivalent to a reduction in output 𝑌𝑌′.
Total spending is then 𝐶𝐶 ′ + 𝐺𝐺 ′ = 𝑌𝑌 ′ − 𝑣𝑣. The penalty 𝑣𝑣 can either be direct losses or
disruption from international creditors trying to enforce their claims, or a proxy in a two-
period model for the loss from being excluded from world credit markets further in future.
The nation can also default in the current period. In this case, the country wipes out initial
debts 𝐵𝐵 but suffers exclusion from world credit markets in the future period as well as the
same future-period penalty 𝑣𝑣 described above. The default has the effect of returning the
country to autarky, with total expenditure limited by income net of penalties in each time
period, that is, 𝐶𝐶 + 𝐺𝐺 = 𝑌𝑌 and 𝐶𝐶 ′ + 𝐺𝐺 ′ = 𝑌𝑌 ′ − 𝑣𝑣.
Default is a rational choice for the nation if it allows the representative household to reach a
higher indifference curve. In making the default decision, suppose that the government acts
in the interests of the representative household.
Since a nation can choose to default, lenders will only make or roll over current loans if no
default is expected to occur in the future. If there is no default in the second period, the
representative household benefits from expenditure 𝐶𝐶 ′ + 𝐺𝐺 ′ = 𝑌𝑌 ′ − 𝐵𝐵′, while a default
allows it to benefit from 𝐶𝐶 ′ + 𝐺𝐺 ′ = 𝑌𝑌 ′ − 𝑣𝑣. Comparing the two, it follows that a future-
period default is rational if 𝐵𝐵′ > 𝑣𝑣. Consequently, rational lenders make loans in the current
period only if 𝐵𝐵′ ≤ 𝑣𝑣. With this restriction, the current-period budget equation implies:
𝑣𝑣
𝐶𝐶 + 𝐺𝐺 ≤ 𝑌𝑌 − 𝐵𝐵 +
1 + 𝑟𝑟
This is referred to as a limited-commitment constraint on borrowing. It takes the form of a
borrowing constraint analogous to those studied in Chapter 4. The present-value budget
constraint is truncated at 𝐶𝐶 + 𝐺𝐺 = 𝑌𝑌 − 𝐵𝐵 + 𝑣𝑣/(1 + 𝑟𝑟), where the amount that expenditure
can exceed income is 𝑣𝑣 ⁄(1 + 𝑟𝑟) − 𝐵𝐵. Subject to the limited-commitment constraint, if
lending occurs in the current period, debts will be repaid in the future if nothing
unexpectedly changes. When the limited-commitment constraint binds, 𝐶𝐶 ′ + 𝐺𝐺 ′ = 𝑌𝑌 ′ − 𝑣𝑣
because 𝐵𝐵′ = 𝑣𝑣 makes the nation indifferent between defaulting or not.
Since the possibility of default explains why a country faces a borrowing constraint, this
analysis can be used to understand why countries might fail to use international financial
markets to smooth consumption by as much as our earlier analysis would suggest.
Now consider the default decision in the current period, taking as given past lending
decisions 𝐵𝐵 (which might turn out ex post to have been unwise if default is rational). If a
default occurs, the country has no access to international financial markets and only one
expenditure point (𝑌𝑌, 𝑌𝑌′ − 𝑣𝑣) is feasible in autarky. Supposing 𝐵𝐵 ≤ 𝑣𝑣/(1 + 𝑟𝑟), the default
outcome lies to the left of the point where the limited-commitment constraint binds. It
follows that default is not chosen in this case.
332
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
What happens if 𝐵𝐵 > 𝑣𝑣/(1 + 𝑟𝑟)? In this case, the autarky point A lies to the right of the
point B where the limited-commitment constraint binds, so it might be in the nation’s
interest to default. If limited commitment constraint would bind in the absence of default, it
can be seen that default is definitely preferable when 𝐵𝐵 > 𝑣𝑣 ⁄(1 + 𝑟𝑟). If the limited-
commitment constraint is not binding with the country not wanting to borrow the
maximum amount, then there is the value of being able to access world capital markets to
consider as well as the present value of the penalty 𝑣𝑣.
Figure 10.9: Case where default is chosen
This analysis points to factors that make default more likely. First, it is immediate that a
greater debt burden 𝐵𝐵 increases the likelihood of default, all else being equal. Default is also
more likely when the losses or penalties 𝑣𝑣 from default are small. For example, bailouts or
debt forgiveness might allow a nation to return to credit markets after a short spell of
exclusion.
333
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
For a country not already borrowing the maximum amount consistent with the limited-
commitment constraint, a negative shock to income 𝑌𝑌 makes default more likely. However,
𝑌𝑌 becomes irrelevant to the default decision once the maximum borrowing amount is
reached because there is no option value of being able to borrow more to smooth
consumption.
A final crucial factor in the default decision is the interest rate 𝑟𝑟 faced by a country. If this is
high, default is more likely because a greater sacrifice of expenditure is necessary to roll
over existing debts at a higher interest rate. Countries with large debt burdens might not
have an incentive to default as long as interest rates remain low.
The link between interest rates and default decisions points to the possibility of self-fulfilling
expectations of default. Suppose international lenders believe there is a positive probability
of a country defaulting. If 𝑟𝑟 ∗ is the interest rate on a risk-free asset (a bond issued by a
country that will not default), lenders are only willing to lend to the country with default risk
if they receive a risk premium. This risk premium is a spread between 𝑟𝑟 and 𝑟𝑟 ∗ that depends
on the perceived probability of default.
Self-fulfilling defaults could occur because believing the probability of default is high drives
up 𝑟𝑟, making it more likely the condition for default will be satisfied. On the other hand, if
there is a low default probability, 𝑟𝑟 is low and default is not chosen, assuming other
fundamentals do not point to default. This logic shows that lack of commitment can lead to
self-fulfilling defaults in some circumstances.
334
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
𝑌𝑌 𝑑𝑑 = 𝐶𝐶 𝑑𝑑 + 𝐼𝐼 𝑑𝑑 + 𝐺𝐺 + 𝑁𝑁𝑁𝑁
Output demand is 𝑌𝑌 𝑑𝑑 = 𝑌𝑌𝑎𝑎𝑑𝑑 + 𝑁𝑁𝑁𝑁, the sum of domestic demand 𝑌𝑌𝑎𝑎𝑑𝑑 , referred to as
absorption and net exports 𝑁𝑁𝑁𝑁. All else being equal, changes in net exports cause shifts of
the 𝑌𝑌 𝑑𝑑 curve. The second important difference is that perfect mobility of capital in an open
economy aligns the domestic real interest rate 𝑟𝑟 with the foreign interest rate 𝑟𝑟 ∗ . This is a
necessary condition for balance-of-payments equilibrium 𝐵𝐵𝐵𝐵 = 𝐶𝐶𝐶𝐶 + 𝐹𝐹𝐹𝐹 = 0. However, if
𝑟𝑟 = 𝑟𝑟 ∗ for a small open economy that takes 𝑟𝑟 ∗ as given, what ensures the goods market is in
equilibrium if the domestic real interest rate 𝑟𝑟 cannot adjust to where the 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 curves
intersect? The mechanism is now that capital flows occur until 𝑌𝑌 𝑑𝑑 and 𝑌𝑌 𝑠𝑠 cross at 𝑟𝑟 = 𝑟𝑟 ∗.
Adjustment of 𝐹𝐹𝐹𝐹 implies changes in the current account 𝐶𝐶𝐶𝐶 = −𝐹𝐹𝐹𝐹 and, hence, net
exports, which shift the 𝑌𝑌 𝑑𝑑 curve.
Consider the example in Figure 10.10. The hypothetical closed-economy output demand
curve 𝑌𝑌𝑎𝑎𝑑𝑑 intersects 𝑌𝑌 𝑠𝑠 at 𝑟𝑟 = 𝑟𝑟𝑎𝑎 , which is above the foreign real interest rate 𝑟𝑟 ∗ . In an open
economy with perfect capital mobility, 𝑟𝑟 would fall to 𝑟𝑟 ∗ on the horizontal 𝐵𝐵𝐵𝐵 line
representing balance-of-payments equilibrium. At 𝑟𝑟 = 𝑟𝑟 ∗ , the output demand curve 𝑌𝑌𝑎𝑎𝑑𝑑 is to
the right of the output supply curve 𝑌𝑌 𝑠𝑠 . With domestic demand exceeding domestic
production, imports rise, reducing net exports 𝑁𝑁𝑁𝑁, which shifts the open-economy output
demand curve 𝑌𝑌 𝑑𝑑 to the left until it intersects 𝑌𝑌 𝑠𝑠 at 𝑟𝑟 = 𝑟𝑟 ∗ . The fall in the current account
𝐶𝐶𝐶𝐶 is matched by capital inflows that raise 𝐹𝐹𝐹𝐹.
Figure 10.10: Goods market in an open economy
In what follows, we simplify matters by assuming 𝑁𝑁𝑁𝑁𝑁𝑁 = 0, which means that the current
account 𝐶𝐶𝐶𝐶 is the same as net exports 𝑁𝑁𝑁𝑁. We will also consider an economy starting from
a current account close to balance (𝐶𝐶𝐶𝐶 is close to zero). This means that households hold
assets equal to all domestic assets and the economy is neither a net saver or borrower. This
allows us to continue to ignore income effects arising from changes in real wages 𝑤𝑤 and the
real interest rate 𝑟𝑟 (see the discussion in Section 3.11). The consequence of these
simplifications is that the labour demand 𝑁𝑁 𝑑𝑑 and labour supply 𝑁𝑁 𝑠𝑠 curves, and the output
supply curve 𝑌𝑌 𝑠𝑠 , continue to behave as they do in a closed economy. The only change in an
335
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
open economy is the presence of net exports as a component of 𝑌𝑌 𝑑𝑑 . Hence, the adjustment
to balance-of-payments equilibrium (𝑟𝑟 = 𝑟𝑟 ∗ ) in an open economy occurs solely through
shifts of the output demand curve as seen in Figure 10.10.
10.4.2 Examples
Let us consider two types of shock to illustrate how the open-economy real dynamic model
works. We proceed by finding the effects of a shock in a closed economy and then ask what
adjustment of net exports (and, hence, what shift of 𝑌𝑌 𝑑𝑑 ) is required to achieve balance-of-
payments equilibrium in an open economy.
First, consider a temporary negative supply shock (a decline in current TFP 𝑧𝑧) of the kind we
used to explain business cycles using the RBC approach from Section 8.8. In a closed
economy, this shock shifts both the output supply 𝑌𝑌 𝑠𝑠 and output demand 𝑌𝑌 𝑑𝑑 curves to the
left but the leftward shift of 𝑌𝑌 𝑑𝑑 is smaller than that of 𝑌𝑌 𝑠𝑠 owing to the desire for
consumption smoothing. This led the equilibrium real interest rate to rise in a closed
economy. In an open economy, the real interest rate cannot rise above 𝑟𝑟 ∗ and balance-of-
payments equilibrium (𝑟𝑟 = 𝑟𝑟 ∗ ) is restored by net exports 𝑁𝑁𝑁𝑁 declining instead, shifting 𝑌𝑌 𝑑𝑑
further to the left to match the shift of 𝑌𝑌 𝑠𝑠 . This analysis is illustrated in Figure 10.11. The
negative TFP shock results in a larger decline in output in an open economy as households
smooth consumption by importing more and borrowing from the rest of the world.
Figure 10.11: Temporary TFP shock in an open economy
336
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
337
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
because 𝑟𝑟𝑙𝑙 is higher for each real interest rate 𝑟𝑟 on the vertical axis. In a closed
economy, this would lead to a lower domestic real interest rate 𝑟𝑟 in equilibrium.
Figure 10.13: Country with a high return on capital
338
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Taking the example in Section 10.4 of a negative supply shock, Figure 10.15 shows
that GDP 𝑌𝑌 falls by less with full capital controls and the real interest rate 𝑟𝑟 rises.
Although GDP falls by less, households are better off with perfect capital mobility
because they can smooth consumption with cheaper international borrowing at
interest rate 𝑟𝑟 ∗ . The high interest rate 𝑟𝑟 is a reflection of the difficulties of
producing domestically after the negative supply shock.
339
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
340
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Accounting for domestic and foreign output being different types of goods does not
fundamentally affect the determination of net exports 𝑁𝑁𝑁𝑁 in the two-period model of the
current account from Section 10.2. Given the equilibrium value of 𝑁𝑁𝑁𝑁, the terms of trade 𝑞𝑞
adjusts to the level that is consistent with it according to Figure 10.16. For example, the
temporary increase in government spending 𝐺𝐺 from Box 10.2 to leads to a fall in net exports
𝑁𝑁𝑁𝑁 and now to a lower terms of trade 𝑞𝑞, which represents a real appreciation of the terms
of trade, a loss of competitiveness of domestic goods.
In the full real dynamic model, the output demand curve 𝑌𝑌 𝑑𝑑 continues to shift with 𝑁𝑁𝑁𝑁
following shocks to the economy. However, a higher value of 𝑁𝑁𝑁𝑁, associated with a
rightward shift of 𝑌𝑌 𝑑𝑑 requires higher 𝑞𝑞. This real depreciation of the terms of trade reduces
the purchasing power of the real wage 𝑤𝑤 (which is in terms of domestic goods but
households also want to buy imports), implying a leftward shift of labour supply 𝑁𝑁 𝑠𝑠 and
causing a leftward shift of the output supply curve 𝑌𝑌 𝑠𝑠 . Qualitatively, this does not change
the direction of our earlier results but implies smaller adjustments of 𝑁𝑁𝑁𝑁 and GDP 𝑌𝑌 are
needed to restore balance-of-payments equilibrium 𝑟𝑟 = 𝑟𝑟 ∗ . Hence, we shall usually ignore
this effect in practice.
Suppose there are two currencies in the world, one for the domestic economy and one for
the foreign economy. The price of domestic goods in terms of domestic currency is 𝑃𝑃 and
the price of foreign goods in terms of foreign currency is 𝑃𝑃∗ . The exchange rate 𝑒𝑒 between
the domestic and foreign currencies, or to be precise, the nominal exchange rate is defined
as the domestic-currency price of a unit of foreign currency. Note that a higher value of 𝑒𝑒 is
a depreciation of the domestic currency with this definition.
Given the nominal exchange rate 𝑒𝑒 and the prices 𝑃𝑃 and 𝑃𝑃∗ we can calculate the implied real
exchange rate, denoted by 𝑞𝑞. This is defined as the price of a unit of foreign goods in terms
of domestic goods. The real exchange rate 𝑞𝑞 is calculated by comparing the domestic price
𝑃𝑃 of domestic goods to the price of foreign goods 𝑃𝑃∗ converted into domestic currency 𝑒𝑒𝑃𝑃 ∗:
𝑒𝑒𝑃𝑃∗
𝑞𝑞 =
𝑃𝑃
341
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
342
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
money demand function in terms of the exchange rate is 𝑀𝑀𝑑𝑑 = 𝑒𝑒𝑃𝑃∗ 𝐿𝐿(𝑌𝑌, 𝑟𝑟 ∗ + 𝜋𝜋). Given 𝑌𝑌,
𝑟𝑟 ∗ and 𝜋𝜋, this rises proportionally with 𝑒𝑒. Higher 𝑒𝑒 means a depreciation of the domestic
currency that raises domestic prices according to PPP, which increases the need for money
to carry out a given real quantity of transactions.
Figure 10.17 plots the money demand curve with the nominal exchange rate 𝑒𝑒 on the
vertical axis. The demand curve is upward sloping, analogous to the upward-sloping money
demand function in terms of the price level 𝑃𝑃 that were seen in Section 6.4. The money
supply curve is vertical because the quantity of money 𝑀𝑀 𝑠𝑠 is exogenous. The equilibrium
nominal exchange rate 𝑒𝑒 is found at the point of intersection between 𝑀𝑀𝑑𝑑 and 𝑀𝑀 𝑠𝑠 .
Figure 10.17: Equilibrium exchange rate
Under a flexible-exchange rate regime, the market determines the exchange rate 𝑒𝑒, which
will depend on the stance of monetary policy among other things. With a fixed-exchange
rate regime, there is a target value 𝑒𝑒̅ of the exchange rate and the government or central
bank intervenes in the foreign-exchange market to ensure 𝑒𝑒 = 𝑒𝑒̅.
A fixed exchange-rate regime depends on the government or central bank maintaining
sufficient foreign-exchange reserves (foreign-currency assets) to support the value of the
domestic currency if needed and being willing to accumulate more foreign-exchange
reserves if necessary. Foreign-exchange market interventions lead to the domestic money
supply 𝑀𝑀 𝑠𝑠 being endogenous.
343
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
344
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Figure 10.18: Foreign exchange intervention with rise in foreign price level
We now look at what difference it makes if the country has a flexible exchange rate. As
before, the increase in foreign prices 𝑃𝑃∗ implies fall in exchange rate 𝑒𝑒 = 𝑃𝑃/𝑃𝑃∗ consistent
with PPP, leading to pressure for an appreciation of the domestic currency. This is shown in
Figure 10.19, where the rise of 𝑃𝑃∗ pivots 𝑀𝑀𝑑𝑑 rightwards. With domestic monetary policy
unchanged, i.e. with no shift of the money supply curve 𝑀𝑀 𝑠𝑠 , the exchange rate falls from 𝑒𝑒1
to 𝑒𝑒2 . There is an appreciation of the domestic currency and no foreign-exchange
intervention occurs to prevent this. Observe that the nominal exchange rate 𝑒𝑒 adjusts so
that 𝑃𝑃 = 𝑒𝑒𝑃𝑃 ∗ is unaffected (money-market equilibrium 𝑀𝑀 𝑠𝑠 = 𝑀𝑀𝑑𝑑 = 𝑃𝑃𝑃𝑃(𝑌𝑌, 𝑟𝑟 ∗ + 𝜋𝜋) is
reached at the same price level 𝑃𝑃). Inflation in foreign markets does not cause a rise in
domestic prices 𝑃𝑃 under a flexible exchange-rate regime.
Figure 10.19: Flexible exchange rate with rise in foreign price level
345
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
The money supply curve 𝑀𝑀 𝑠𝑠 shifts to the right, and the equilibrium exchange rate 𝑒𝑒 rises in
proportion to the increase in 𝑀𝑀 𝑠𝑠 , implying a depreciation of the domestic currency.
Purchasing power parity 𝑃𝑃 = 𝑒𝑒𝑃𝑃∗ implies the price level 𝑃𝑃 also rises in proportion to 𝑀𝑀 𝑠𝑠 . As
no further inflation expected, there is no shift of 𝑀𝑀𝑑𝑑 through higher 𝑖𝑖 = 𝑟𝑟 ∗ + 𝜋𝜋 and there
are no real effects because prices are fully flexible. This is the open-economy equivalent of
the money neutrality result from Section 6.9. Monetary policy autonomy is still valuable to
the extent that it gives monetary policy control over domestic prices and the inflation rate.
346
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
goods. We assume that net exports 𝑁𝑁𝑁𝑁 are positively related to competitiveness 𝑞𝑞 as
explained in Section 10.5.
Both goods prices 𝑃𝑃 and 𝑃𝑃∗ are sticky in this model for the reasons discussed in Section 8.1.
As explained in the new Keynesian model from Section 8.2, real GDP 𝑌𝑌 becomes demand
determined, i.e. the output supply curve 𝑌𝑌 𝑠𝑠 is not relevant in the short run. Unlike the real
dynamic model in Section 10.4, only the output demand curve 𝑌𝑌 𝑑𝑑 is relevant here.
Even though prices of goods are sticky in the currency of the country in which they are
produced, competitiveness 𝑞𝑞 can change because of fluctuations in the nominal exchange
rate 𝑒𝑒. A depreciation of domestic currency, higher 𝑒𝑒, increases competitiveness, higher 𝑞𝑞,
which raises net exports 𝑁𝑁𝑁𝑁. This shifts output demand 𝑌𝑌 𝑑𝑑 = 𝐶𝐶 + 𝐼𝐼 + 𝐺𝐺 + 𝑁𝑁𝑁𝑁 to the right.
Domestic demand in 𝑌𝑌 𝑑𝑑 depends on the real interest rate 𝑟𝑟 as usual. If prices are expected
to remain constant, the Fisher equation implies the real interest rate 𝑟𝑟 equals the nominal
interest rate 𝑖𝑖 controlled by monetary policy. Although producer prices are sticky, consumer
prices can change with the exchange rate but we assume the share of spending on imports
is sufficiently small that this effect can be ignored when thinking about expected inflation.
1 + 𝑖𝑖 = (1 + 𝑖𝑖 ∗ )(1 + ∆)
This is known as the uncovered interest parity (UIP) condition.
347
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
Assuming ∆ is not too large relative to 𝑖𝑖 ∗ , the UIP equation becomes 𝑖𝑖 ≈ 𝑖𝑖 ∗ + ∆. Ignoring the
approximation, with perfect capital mobility and risk-neutral investors, balance-of-payments
equilibrium requires that the following uncovered interest parity condition holds:
𝑖𝑖 = 𝑖𝑖 ∗ + ∆
This says that the domestic nominal interest rate 𝑖𝑖 must be equal to the foreign nominal
interest rate 𝑖𝑖 ∗ plus any expected depreciation ∆ of the domestic currency relative to the
foreign currency. Given 𝑖𝑖 ∗ , the domestic interest rate 𝑖𝑖 must rise to compensate investors
for a positive expected depreciation ∆ of the domestic currency.
Assuming consumer-price inflation is approximately zero so 𝑟𝑟 ≈ 𝑖𝑖, we can state the UIP
condition in terms of domestic and foreign real interest rates 𝑟𝑟 and 𝑟𝑟 ∗ :
𝑟𝑟 = 𝑟𝑟 ∗ + ∆
In Figure 10.21, suppose domestic output demand 𝑌𝑌𝑎𝑎𝑑𝑑 = 𝐶𝐶 + 𝐼𝐼 + 𝐺𝐺 intersects 𝑀𝑀𝑀𝑀 at 𝑟𝑟𝑎𝑎 ,
which is above the 𝐵𝐵𝐵𝐵 line. The exchange rate appreciates (𝑒𝑒 falls), with lower
competitiveness reducing net exports and shifting 𝑌𝑌 𝑑𝑑 to the left until it intersects 𝑀𝑀𝑀𝑀 on
the 𝐵𝐵𝐵𝐵 line. This process determines equilibrium real GDP 𝑌𝑌0 and the exchange rate 𝑒𝑒0 .
Note that the 𝑀𝑀𝑀𝑀 line is drawn as upward sloping. With perfect capital mobility, an 𝑀𝑀𝑀𝑀
line that is horizontal or close to horizontal might require huge movements in the exchange
rate and real GDP to restore balance-of-payments equilibrium. For this reason, we assume
monetary policy is willing to adjust the nominal interest rate if the changes in real GDP
brought about by exchange-rate movements were too large.
If the government has a fixed exchange rate policy then the central bank must use monetary
policy to support the target 𝑒𝑒 = 𝑒𝑒̅ for the exchange rate. If the exchange rate is kept fixed at
𝑒𝑒̅ then there is no shift of the output demand curve 𝑌𝑌 𝑑𝑑 because of 𝑒𝑒. Moreover, if the
348
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
exchange rate is expected to remain fixed in the future then there is no expected
depreciation or appreciation (∆ = 0) and the 𝐵𝐵𝐵𝐵 line is at 𝑟𝑟 = 𝑟𝑟 ∗. Given the requirement of
balance-of-payments equilibrium, Figure 10.22 shows that keeping the exchange rate fixed
requires the central bank to adjust its stance of monetary policy so that the 𝑀𝑀𝑀𝑀 line
intersects 𝑌𝑌 𝑑𝑑 on the 𝐵𝐵𝐵𝐵 line.
Figure 10.21: Open-economy sticky-price model with flexible exchange rate
With a fixed exchange rate, having the 𝑀𝑀𝑀𝑀 line shift to intersect 𝐵𝐵𝐵𝐵 and 𝑌𝑌 𝑑𝑑 is equivalent to
having a horizontal 𝑀𝑀𝑀𝑀 line in the same position as the 𝐵𝐵𝐵𝐵 line. Intuitively, this represents
the subordination of monetary policy to defending the fixed exchange rate, with the central
bank being forced set the same interest rate 𝑟𝑟 ∗ as the foreign economy.
Figure 10.22: Open-economy sticky-price model with fixed exchange rate
349
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
3. Free capital flows: perfect capital mobility, where the UIP condition 𝑖𝑖 = 𝑖𝑖 ∗ +
∆ holds.
With the second and third of these policies, Figure 10.23 shows how a shift in the
monetary policy stance to meet some domestic objective requires that the
exchange rate 𝑒𝑒 must adjust so 𝑌𝑌 𝑑𝑑 intersects 𝑀𝑀𝑀𝑀 and 𝐵𝐵𝐵𝐵, with free capital flows
tying down the position of the 𝐵𝐵𝐵𝐵 line if no change in the exchange rate is
expected. It is therefore not possible to have the first policy of keeping the
exchange rate fixed as well.
Figure 10.23: Monetary autonomy and free capital flows
Choosing the first and third policies of a fixed exchange rate 𝑒𝑒 = 𝑒𝑒̅ and free capital
flows, the position of the output demand curve is tied down at 𝑌𝑌 𝑑𝑑 (𝑒𝑒̅). We have
seen that the stance of monetary policy must adjust so that the 𝑀𝑀𝑀𝑀 line intersects
𝐵𝐵𝐵𝐵 and 𝑌𝑌 𝑑𝑑 , as shown in Figure 10.24. This means it is not possible to have the
second policy choice, an independent monetary policy, that is, the freedom to
choose the position of the 𝑀𝑀𝑀𝑀 line.
Making the first and second policy choices of a fixed exchange rate and monetary
policy autonomy, Figure 10.25 shows that it is not possible simultaneously to be on
the perfect capital mobility 𝐵𝐵𝐵𝐵 line in general. In the absence of large private
350
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
If the government wants to maintain a fixed exchange rate and allow capital
mobility, are there policies that can substitute for an independent monetary policy?
Figure 10.26 shows how fiscal policy, for example, an increase in government
expenditure 𝐺𝐺, can shift the output demand curve 𝑌𝑌 𝑑𝑑 for a given exchange rate 𝑒𝑒̅.
This affects real GDP because the stance of monetary policy must adjust to ensure
the 𝑀𝑀𝑀𝑀 line intersects both the 𝑌𝑌 𝑑𝑑 curve
351
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
and the 𝐵𝐵𝐵𝐵 line (which can be shown as the 𝑀𝑀𝑀𝑀 line overlapping the 𝐵𝐵𝐵𝐵 line).
Thus, fiscal policy becomes highly effective under a fixed exchange rate regime.
Note that fiscal policy would not be effective under a flexible exchange rate regime
unless accommodated by a change in monetary policy.
Figure 10.26: Fiscal policy with fixed exchange rate and free capital flows
352
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
rate cannot be defended any longer and the exchange rate will float. With the
money supply continuing to increase, the exchange rate will depreciate over time,
with PPP implying this will lead to ongoing inflation. A higher inflation rate 𝜋𝜋 raises
𝑖𝑖 = 𝑟𝑟 ∗ + 𝜋𝜋 and reduces money demand as shown in the right panel of the figure. At
the time of the collapse of the fixed exchange rate, there is sharp reduction in
demand for domestic currency, triggering a large terminal drop in the central
bank’s foreign-exchange reserves when the crisis occurs.
Figure 10.27: Foreign-exchange interventions to defend fixed exchange rate
353
EC2065 Macroeconomics | Chapter 10: Open-economy macroeconomics
If the government is not willing to sacrifice real GDP and employment to defend the
fixed exchange rate then it fails to shift the 𝑀𝑀𝑀𝑀 line, which would cause the
exchange rate to depreciate to restore balance-of-payments equilibrium. However,
this justifies the initial expectation that the fixed exchange rate will be abandoned,
so the shift in expectations of the exchange rate is not irrational. This shows the
collapse of the fixed exchange rate regime can be a self-fulfilling prophecy.
However, if no collapse is expected then the 𝐵𝐵𝐵𝐵 line does not shift, the central
bank does not need to change monetary policy, and neither GDP nor the exchange
rate change, confirming the belief of no collapse is not irrational, so there are
multiple equilibria in this example.
We see that currency crisis can be self-fulfilling when the government will not
prioritise the defence of a fixed exchange rate if that jeopardises other objectives
such as GDP.
354