Blanchardm2 I Een Fil 2015 PDF
Blanchardm2 I Een Fil 2015 PDF
Christian Groth
Preface xvii
iii
iv CONTENTS
35 Outlook 1107
Introduction
1.1 Macroeconomics
1.1.1 The …eld
Economics is the social science that studies the production and distribution of
goods and services in society. Then, what de…nes the branch of economics named
macroeconomics? There are two de…ning characteristics. First, macroeconomics
is the systematic study of the economic interactions in society as a whole. This
could also be said of microeconomic general equilibrium theory, however. The
second de…ning characteristic of macroeconomics is that it aims at understanding
the empirical regularities in the behavior of aggregate economic variables such
as aggregate production, investment, unemployment, the general price level for
goods and services, the in‡ation rate, the level of interest rates, the level of real
wages, the foreign exchange rate, productivity growth etc. Thus macroeconomics
focuses on the major lines of the economics of a society.
The aspiration of macroeconomics is three-fold:
1. to explain the levels of the aggregate variables as well as their movement
over time in the short run and the long run;
2. to make well-founded forecasts possible;
3. to provide foundations for rational economic policy applicable to macroeco-
nomic problems, be they short-run distress in the form of economic recession
or problems of a more long-term, structural character.
3
4 CHAPTER 1. INTRODUCTION
1
These number-of-years …gures are only a rough indication. The di¤erent “runs”are relative
concepts and their appropriateness depends on the speci…c problem and circumstances at hand.
as exogenous.2
Figure 1.1: Quarterly Industrial Production Index in six major countries (Q1-1958 to
Q2-2013; index Q1-1961=100). Source: OECD Industry and Service Statistics. Note:
Industrial production includes manufacturing, mining and quarrying, electricity, gas,
and water, and construction.
2
References to textbooks on economic growth are given in Literature notes at the end of this
chapter.
Denmark
Eurozone
100 United States
90
80
70
Figure 1.2: Indexed real GDP for Denmark, Eurozone and US, 1995-2012 (2007=100).
Source: EcoWin and Statistics Denmark.
Basic categories
Technological constraints.
The institutions and social norms regulating the economic interactions (for-
mal and informal “rules of the game”).
Types of variables
Endogenous vs. exogenous variables.
8. Initial conditions are equations …xing the initial values of the state variables
in a dynamic model
Types of analysis
Statics vs. dynamics. Comparative dynamics vs. study of dynamic e¤ects of
a parameter shift in historical time.
Macroeconomics studies processes in real time. The emphasis is on dynamic
models, that is, models that establishes a link from the state of the economic
system to the subsequent state. A dynamic model thus allows a derivation of
the evolution over time of the endogenous variables. A static model is a model
where time does not enter or where all variables refer to the same point in time.
Occasionally we consider static models, or more precisely quasi-static models. The
modi…er “quasi-”is meant to indicate that although the model is a framework for
analysis of only a single period, the model considers some variables as inherited
from the past and some variables that involve expectations about the future.
What we call temporary equilibrium models are of this type. Their role is to serve
as a prelude to a more elaborate dynamic model dealing with the same elements.
Dynamic analysis aims at establishing dynamic properties of an economic
system: is the system stable or unstable, is it asymptotically stable, if so, is it
globally or only locally asymptotically stable, is it oscillatory? If the system is
asymptotically stable, how fast is the adjustment?
Partial equilibrium vs. general equilibrium:
We say that a given single market is in partial equilibrium at a given point in
time if for arbitrarily given prices and quantities in the other markets, the agents’
chosen actions in this market are mutually compatible. In contrast the concept of
general equilibrium take the mutual dependencies between markets into account.
We say that a given economy is in general equilibrium at a given point in time if
in all markets the actions chosen by all the agents are mutually compatible.
An analyst trying to clarify a partial equilibrium problem is doing partial
equilibrium analysis. Thus partial equilibrium analysis does not take into account
the feedbacks from these actions to the rest of the economy and the feedbacks
from these feedbacks and so on. In contrast, an analyst trying to clarify a
general equilibrium problem is doing general equilibrium analysis. This requires
considering the mutual dependencies in the system of markets as a whole.
Sometimes even the analysis of the constrained maximization problem of a
single decision maker is called partial equilibrium analysis. Consider for instance
the consumption-saving decision of a household. Then the analytical derivation
of the saving function of the household is by some authors included under the
heading partial equilibrium analysis, which may seem natural since the real wage
and real interest rate appearing as arguments in the derived saving function are
arbitrary. Indeed, what the actual saving of the young will be in the end, depends
on the real wage and real interest rate formed in the general equilibrium.
In this book we call the analysis of a single decision maker’s problem partial
analysis, not partial equilibrium analysis. The motivation for this is that trans-
parency is improved if one preserves the notion of equilibrium for a state of a
market or a state of a system of markets .
One of the reasons that confusion of stocks and ‡ows may arise is the tradition
in macroeconomics to use the same symbol, K; for the capital input (the number
of machine hours per year), in (1.1) as for the capital stock in an accumulation
equation like
Kt+1 = Kt + It Kt : (1.4)
Here the interpretation of Kt is as a capital stock (number of machines) at the
beginning of period t; It is gross investment, and is the rate of physical capital
depreciation due to wear and tear (0 1): In (1.4) there is no role for the
rate of utilization of the capital stock, which is, however, of key importance in
(1.1). Similarly, there is a tradition in macroeconomics to denote the number of
heads in the labor force by L and write, for example, Lt = L0 (1 + n)t ; where n
is a constant growth rate of the labor force. Here the interpretation of Lt is as a
stock (number of persons). There is no role for the average rate of utilization in
actual employment of this stock over the year.
This text will not attempt a break with this tradition of using the same symbol
for two in principle di¤erent variables. But we insist on interpretations such that
the notation is consistent. This requires normalization of the utilization rates for
capital and labor in the production function to equal one, as indicated in (1.2)
and (1.3) above. We are then allowed to use the same symbol for a stock and the
corresponding ‡ow because the values of the two variables will coincide.
An illustration of the importance of being aware of the distinction between
stock and ‡ows appears when we consider the following measure of per capita
income in a given year:
GDP GDP #hours of work #employed workers #workers
= ;
N #hours of work #employed workers #workers N
(1.5)
where N; #workers, and #employed workers indicate, say, the average size of the
population, the workforce (including the unemployed), and the employed work-
force, respectively, during the year. That is, aggregate per capita income equals
average labor productivity times average labor intensity times the crude employ-
ment rate times the workforce participation rate.3 An increase from one year to
3
By the crude employment rate is meant the number of employed individuals, without
weighting by the number of hours they work per week, divided by the total number of individuals
in the labor force.
the next in the ratio on the left-hand side of the equation re‡ects the net e¤ect
of changes in the four ratios on the right-hand side. Similarly, a fall in per capita
income (a ratio between a ‡ow and a stock) need not re‡ect a fall in productiv-
ity (GDP=#hours of work, a ratio of two ‡ows), but may re‡ect, say, a fall in
the number of hours per member of the workforce (#hours of work/#workers)
due to a rise in unemployment (fall in #employed workers/workers) or an ageing
population (fall in #workers/N ).
A second conceptual issue concerning the production function in (1.1) re-
lates to the question: what about land and other natural resources? As farming
requires land and factories and o¢ ce buildings require building sites, a third
argument, a natural resource input, should in principle appear in (1.1). In theo-
retical macroeconomics for industrialized economies this third factor is often left
out because it does not vary much as an input to production and tends to be of
secondary importance in value terms.
A third conceptual issue concerning the production function in (1.1) relates to
the question: what about intermediate goods? By intermediate goods we mean
non-durable means of production like raw materials and energy. Certainly, raw
materials and energy are generally necessary inputs at the micro level. Then
it seems strange to regard output as produced by only capital and labor. The
point is that in macroeconomics we often abstract from the engineering input-
output relations, involving intermediate goods. We imagine that at a lower stage
of production, raw materials and energy are continuously produced by capital
and labor, but are then immediately used up at a higher stage of production,
again using capital and labor. The value of these materials are not part of value
added in the sector or in the economy as a whole. Since value added is what
macroeconomics usually focuses at and what the Y in (1.1) represents, materials
therefore are often not explicit in the model.
On the other hand, if of interest for the problems studied, the analysis should,
of course, take into account that at the aggregate level in real world situations,
there will generally be a minor di¤erence between produced and used-up raw
materials which then constitute net investment in inventories of materials.
To further clarify this point as well as more general aspects of how macro-
economic models are related to national income and product accounts, the next
section gives a review of national income accounting.
(very incomplete)
We give here a stylized picture of national income and product accounts with
emphasis on the conceptual structure. The basic point to be aware of is that
national income accounting looks at output from three sides:
The aim of this chapter is threefold. First, we shall introduce this book’s vocabu-
lary concerning …rms’technology and technological change. Second, we shall re-
fresh our memory of key notions from microeconomics relating to …rms’behavior
and factor market equilibrium under simplifying assumptions, including perfect
competition. Finally, to prepare for the many cases where perfect competition
and other simplifying assumptions are not good approximations to reality, we
give an introduction to …rms’behavior under more realistic conditions including
monopolistic competition.
The vocabulary pertaining to other aspects of the economy, for instance house-
holds’preferences and behavior, is better dealt with in close connection with the
speci…c models to be discussed in the subsequent chapters. Regarding the dis-
tinction between discrete and continuous time analysis, most of the de…nitions
contained in this chapter are applicable to both.
where Y is output (value added) per time unit, K is capital input per time unit,
and L is labor input per time unit (K 0; L 0). We may think of (2.1)
as describing the output of a …rm, a sector, or the economy as a whole. It is
in any case a very simpli…ed description, ignoring the heterogeneity of output,
capital, and labor. Yet, for many macroeconomic questions it may be a useful
…rst approach.
Note that in (2.1) not only Y but also K and L represent ‡ows, that is,
quantities per unit of time. If the time unit is one year, we think of K as
17
18 CHAPTER 2. REVIEW OF TECHNOLOGY AND FIRMS
(a) F (K; L) has continuous …rst- and second-order partial derivatives satisfying:
(b) F (K; L) is strictly quasiconcave (i.e., the level curves, also called isoquants,
are strictly convex to the origin).
where the third argument indicates that the production function may shift over
time, due to changes in technology. We then say that F is a neoclassical produc-
tion function if for all t in a certain time interval it satis…es the conditions (a)
and (b) w.r.t its …rst two arguments. Technological progress can then be said to
occur when, for Kt and Lt held constant, output increases with t:
For convenience, to begin with we skip the explicit reference to time and level
of technology.
So M RSKL equals the ratio of the marginal productivities of labor and capital,
respectively.2 The economic interpretation of M RSKL is that it indicates (ap-
proximately) the amount of K that can be saved by applying an extra unit of
labor.
Since F is neoclassical, by de…nition F is strictly quasi-concave and so the
marginal rate of substitution is diminishing as substitution proceeds, i.e., as the
labor input is further increased along a given isoquant. Notice that this feature
characterizes the marginal rate of substitution for any neoclassical production
function, whatever the returns to scale (see below).
Figure 2.1: M RSKL as the absolute slope of the isoquant representing F (K; L) = Y .
In this case, the marginal productivity of either production factor has no upper
bound when the input of the factor becomes in…nitely small. And the marginal
productivity is gradually vanishing when the input of the factor increases without
bound. Actually, (2.6) and (2.7) express four conditions, which it is preferable to
consider separately and label one by one. In (2.6) we have two Inada conditions
for M P K (the marginal productivity of capital), the …rst being a lower, the
second an upper Inada condition for M P K. And in (2.7) we have two Inada
conditions for M P L (the marginal productivity of labor), the …rst being a lower,
the second an upper Inada condition for M P L. In the literature, when a sentence
like “the Inada conditions are assumed”appears, it is sometimes not made clear
which, and how many, of the four are meant. Unless it is evident from the context,
it is better to be explicit about what is meant.
The de…nition of a neoclassical production function we have given is quite
common in macroeconomic journal articles and convenient because of its ‡exibil-
ity. Yet there are textbooks that de…ne a neoclassical production function more
narrowly by including the Inada conditions as a requirement for calling the pro-
duction function neoclassical. In contrast, in this book, when in a given context
we need one or another Inada condition, we state it explicitly as an additional
assumption.
F ( K; L) = F (K; L):
As all inputs are scaled up or down by some factor, output is scaled up or down
by the same factor.4 The assumption of CRS is often defended by the replication
3
After the Japanese economist Ken-Ichi Inada, 1925-2002.
4
In their de…nition of a neoclassical production function some textbooks add constant re-
turns to scale as a requirement besides (a) and (b) above. This book follows the alternative
argument saying that “by doubling all inputs we are always able to double the
output since we are essentially just replicating a viable production activity”.
Before discussing this argument, lets us de…ne the two alternative “pure”cases.
The production function F (K; L) is said to have increasing returns to scale
(IRS for short) if, for all (K; L) 2 R2++ and all > 1,
That is, IRS is present if, when increasing the scale of operations by scaling up
every input by some factor > 1, output is scaled up by more than this factor. One
argument for the plausibility of this is the presence of equipment indivisibilities
leading to high unit costs at low output levels. Another argument is that gains
by specialization and division of labor, synergy e¤ects, etc. may be present, at
least up to a certain level of production. The IRS assumption is also called the
economies of scale assumption.
Another possibility is decreasing returns to scale (DRS). This is said to occur
when for all (K; L) 2 R2++ and all > 1;
That is, DRS is present if, when all inputs are scaled up by some factor, output
is scaled up by less than this factor. This assumption is also called the disec-
onomies of scale assumption. The underlying hypothesis may be that control and
coordination problems con…ne the expansion of size. Or, considering the “repli-
cation argument” below, DRS may simply re‡ect that behind the scene there
is an additional production factor, for example land or a irreplaceable quality
of management, which is tacitly held …xed, when the factors of production are
varied.
EXAMPLE 1 The production function
+ > 1; there are IRS. Note that and must be less than 1 in order not to
violate the diminishing marginal productivity condition.
EXAMPLE 2 The production function
1
Y =A K + (1 )L ; (2.9)
where A; ; and are parameters satisfying A > 0, 0 < < 1; and < 1; 6= 0;
is called a CES production function (CES for Constant Elasticity of Substitution).
For a given choice of measurement units, the parameter A re‡ects e¢ ciency (or
“total factor productivity”) and is thus called the e¢ ciency parameter. The
parameters and are called the distribution parameter and the substitution
parameter, respectively. The latter name comes from the property that the higher
is ; the more sensitive is the cost-minimizing capital-labor ratio to a rise in
the relative factor price. Equation (2.9) gives the CES function for the case of
constant returns to scale; the cases of increasing or decreasing returns to scale
are presented in Chapter 4.5. A limiting case of the CES function (2.9) gives the
Cobb-Douglas function with CRS. Indeed, for …xed K and L;
1
lim A K + (1 )L = AK L1 :
!0
This and other properties of the CES function are shown in Chapter 4.5. The
CES function has been used intensively in empirical studies.
EXAMPLE 3 The production function
than Leontief functions. But sometimes the latter are preferred, in particular in
short-run analysis with focus on the use of already installed equipment where the
substitution possibilities tend to be limited.6 As (2.10) reads, the function has
CRS. A generalized form of the Leontief function is Y = min(AK ; BL ); where
> 0. When < 1; there are DRS, and when > 1; there are IRS.
by the same factor. Hence, the replication argument requires that indivisibilities
are negligible, which is certainly not always the case. In fact, the replication
argument is more an argument against DRS than for CRS in particular. The
argument does not rule out IRS due to synergy e¤ects as scale is increased.
Sometimes the replication line of reasoning is given a more subtle form. This
builds on a useful local measure of returns to scale, named the elasticity of scale.
The elasticity of scale*8 To allow for indivisibilities and mixed cases (for
example IRS at low levels of production and CRS or DRS at higher levels), we
need a local measure of returns to scale. One de…nes the elasticity of scale,
(K; L); of F at the point (K; L); where F (K; L) > 0; as
dF ( K; L) F ( K; L)=F (K; L)
(K; L) = ; evaluated at = 1:
F (K; L) d =
(2.11)
So the elasticity of scale at a point (K; L) indicates the (approximate) percentage
increase in output when both inputs are increased by 1 percent. We say that
8
< > 1; then there are locally IRS,
if (K; L) = 1; then there are locally CRS, (2.12)
:
< 1; then there are locally DRS.
The production function may have the same elasticity of scale everywhere. This
is the case if and only if the production function is homogeneous of some degree
h > 0. In that case (K; L) = h for all (K; L) for which F (K; L) > 0; and h
indicates the global elasticity of scale. The Cobb-Douglas function, cf. Example
1, is homogeneous of degree + and has thereby global elasticity of scale equal
to + :
Note that the elasticity of scale at a point (K; L) will always equal the sum
of the partial output elasticities at that point:
This follows from the de…nition in (2.11) by taking into account that
dF ( K; L)
= FK ( K; L)K + FL ( K; L)L
d
= FK (K; L)K + FL (K; L)L; when evaluated at = 1:
8
A section headline marked by * indicates that in a …rst reading the section can be skipped
- or at least just skimmed through.
LAC(Y )
(K; L) = ; (2.14)
LM C(Y )
where LAC(Y ) is average costs (the minimum unit cost associated with producing
Y ) and LM C(Y ) is marginal costs at the output level Y . The L in LAC and
LM C stands for “long-run”, indicating that both capital and labor are considered
variable production factors within the period considered. At the optimal plant
size, Y ; there is equality between LAC and LM C, implying a unit elasticity
of scale. That is, locally we have CRS. That the long-run average costs are
here portrayed as rising for Y > Y ; is not essential for the argument but may
re‡ect either that coordination di¢ culties are inevitable or that some additional
production factor, say the building site of the plant, is tacitly held …xed.
Anyway, on this basis Robert Solow (1956) came up with a more subtle repli-
cation argument for CRS at the aggregate level. Even though technologies may
di¤er across plants, the surviving plants in a competitive market will have the
same average costs at the optimal plant size. In the medium and long run, changes
in aggregate output will take place primarily by entry and exit of optimal-size
9
By a “…rm” is generally meant the company as a whole. A company may have several
“manufacturing plants” placed at di¤erent locations.
plants. Then, with a large number of relatively small plants, each producing at
approximately constant unit costs for small output variations, we can without
substantial error assume constant returns to scale at the aggregate level. So the
argument goes. Notice, however, that even in this form the replication argument
is not entirely convincing since the question of indivisibility remains. The opti-
mal, i.e., cost-minimizing, plant size may be large relative to the market and
is in fact so in many industries. Besides, in this case also the perfect competition
premise breaks down.
(i) marginal costs are constant and equal to average costs (so the right-hand
side of (2.14) equals unity);
(iii) a production function known to exhibit CRS and satisfy property (a) from
the de…nition of a neoclassical production function above, will automatically
satisfy also property (b) and consequently be neoclassical;
(iv) a neoclassical two-factor production function with CRS has always FKL > 0;
i.e., it exhibits “direct complementarity”between K and L;
(v) a two-factor production function that has CRS and is twice continuously
di¤erentiable with positive marginal productivity of each factor everywhere
in such a way that all isoquants are strictly convex to the origin, must
have diminishing marginal productivities everywhere and thereby be neo-
classical.10
with L > 0; we can under CRS write F (K; L) = LF (K=L; 1) Lf (k); where
k K=L is called the capital-labor ratio (sometimes the capital intensity) and
f (k) is the production function in intensive form (sometimes named the per capita
production function). Thus output per unit of labor depends only on the capital
intensity:
Y
y = f (k):
L
When the original production function F is neoclassical, under CRS the expres-
sion for the marginal productivity of capital simpli…es:
@Y @ [Lf (k)] @k
FK (K; L) = = = Lf 0 (k) = f 0 (k): (2.15)
@K @K @K
And the marginal productivity of labor can be written
@Y @ [Lf (k)] @k
FL (K; L) = = = f (k) + Lf 0 (k)
@L @L @L
= f (k) + Lf 0 (k)K( L 2 ) = f (k) f 0 (k)k: (2.16)
@f 0 (k) @k 1
FKK (K; L) = = f 00 (k) = f 00 (k) :
@K @K L
For a neoclassical production function with CRS, we also have
Indeed, from the mean value theorem 11 we know there exists a number a 2 (0; 1)
such that for any k > 0 we have f (k) f (0) = f 0 (ak)k: From this follows f (k)
f 0 (ak)k = f (0) < f (k) f 0 (k)k; since f 0 (ak) > f 0 (k) by f 00 < 0. In view of
f (0) 0; this establishes (2.17): And from f (k) > f (k) f 0 (k)k > f (0) and
continuity of f follows (2.18).
11
This theorem says that if f is continuous in [ ; ] and di¤erentiable in ( ; ); then there
exists at least one point in ( ; ) such that f 0 ( ) = (f ( ) f ( ))=( ):
In this case standard parlance is just to say that “f satis…es the Inada conditions”.
An input which must be positive for positive output to arise is called an
essential input; an input which is not essential is called an inessential input. The
second part of (2.19), representing the upper Inada condition for M P K under
CRS, has the implication that labor is an essential input; but capital need not
be, as the production function f (k) = a + bk=(1 + k); a > 0; b > 0; illustrates.
Similarly, under CRS the upper Inada condition for M P L implies that capital
is an essential input. These claims are proved in Appendix C. Combining these
results, when both the upper Inada conditions hold and CRS obtain, then both
capital and labor are essential inputs.12
Fig. 2.3 is drawn to provide an intuitive understanding of a neoclassical
CRS production function and at the same time illustrate that the lower Inada
conditions are more questionable than the upper Inada conditions. The left panel
of Fig. 2.3 shows output per unit of labor for a CRS neoclassical production
function satisfying the Inada conditions for M P K. The f (k) in the diagram
could for instance represent the Cobb-Douglas function in Example 1 with =
1 ; i.e., f (k) = Ak : The right panel of Fig. 2.3 shows a non-neoclassical
case where only two alternative Leontief techniques are available, technique 1: y
= min(A1 k; B1 ); and technique 2: y = min(A2 k; B2 ): In the exposed case it is
assumed that B2 > B1 and A2 < A1 (if A2 A1 at the same time as B2 > B1 ;
technique 1 would not be e¢ cient, because the same output could be obtained
with less input of at least one of the factors by shifting to technique 2). If the
available K and L are such that k K=L < B1 =A1 or k > B2 =A2 , some of either
L or K; respectively, is idle. If, however, the available K and L are such that
B1 =A1 < k < B2 =A2 ; it is e¢ cient to combine the two techniques and use the
fraction of K and L in technique 1 and the remainder in technique 2, where
= (B2 =A2 k)=(B2 =A2 B1 =A1 ): In this way we get the “labor productivity
curve” OPQR (the envelope of the two techniques) in Fig. 2.3. Note that for
k ! 0; M P K stays equal to A1 < 1; whereas for all k > B2 =A2 ; M P K = 0:
A similar feature remains true, when we consider many, say n; alternative
e¢ cient Leontief techniques available. Assuming these techniques cover a con-
siderable range w.r.t. the B=A ratios, we get a labor productivity curve looking
more like that of a neoclassical CRS production function. On the one hand, this
gives some intuition of what lies behind the assumption of a neoclassical CRS
production function. On the other hand, it remains true that for all k > Bn =An ;
12
Given a Cobb-Douglas production function, both production factors are essential whether
we have DRS, CRS, or IRS.
Figure 2.3: Two labor productivity curves based on CRS technologies. Left: neoclas-
sical technology with Inada conditions for MPK satis…ed; the graphical representation
of MPK and MPL at k = k0 as f 0 (k0 ) and f (k0 ) f 0 (k0 )k0 are indicated. Right: the
line segment PQ makes up an e¢ cient combination of two e¢ cient Leontief techniques.
Yt = F (At Kt ; Bt Lt ); (2.20)
13
Here we assume the techniques are numbered according to ranking with respect to the size
of B:
Yt = F (Kt ; Tt Lt ): (2.21)
The e¢ ciency of labor, Tt ; is then said to indicate the technology level. Although
one can imagine natural disasters implying a fall in Tt ; generally Tt tends to rise
over time and then we say that (2.21) represents Harrod-neutral technological
progress. An alternative name often used for this is labor-augmenting technolog-
ical progress. The names “factor-augmenting”and, as here, “labor-augmenting”
have become standard and we shall use them when convenient, although they
may easily be misunderstood. To say that a change in Tt is labor-augmenting
might be understood as meaning that more labor is required to reach a given
output level for given capital. In fact, the opposite is the case, namely that Tt
has risen so that less labor input is required. The idea is that the technological
change a¤ects the output level as if the labor input had been increased exactly
by the factor by which T was increased, and nothing else had happened. (We
might be tempted to say that (2.21) re‡ects “labor saving”technological change.
But also this can be misunderstood. Indeed, keeping L unchanged in response to
a rise in T implies that the same output level requires less capital and thus the
technological change is “capital saving”.)
If the function F in (2.21) is homogeneous of degree one (so that the technol-
ogy exhibits CRS w.r.t. capital and labor), we may write
Yt Kt
y~t = F( ; 1) = F (k~t ; 1) f (k~t ); f 0 > 0; f 00 < 0:
Tt Lt Tt Lt
where k~t Kt =(Tt Lt ) kt =Tt (habitually called the “e¤ective” capital intensity
or, if there is no risk of confusion, just the capital intensity). In rough accordance
with a general trend in aggregate productivity data for industrialized countries
we often assume that T grows at a constant rate, g; so that in discrete time Tt
= T0 (1 + g)t and in continuous time Tt = T0 egt ; where g > 0: The popularity
in macroeconomics of the hypothesis of labor-augmenting technological progress
derives from its consistency with Kaldor’s “stylized facts”, cf. Chapter 4.
14
After the English economist Roy F. Harrod, 1900-1978.
Yt = F (Xt ; Tt ); (2.24)
and research institutions to make their own time-series for capital. One approach
to the measurement of Kt is the perpetual inventory method which builds upon
the accounting relationship
Kt = It 1 + (1 )Kt 1 : (2.25)
Assuming a constant capital depreciation rate ; backward substitution gives
X
N
Kt = It 1 + (1 ) [It 2 + (1 )Kt 2 ] = . . . = (1 )i 1 It i + (1 )T Kt N:
i=1
(2.26)
Based on a long time series for I and an estimate of ; one can insert these
observed values in the formula and calculate Kt , starting from a rough conjec-
ture about the initial value Kt N : The result will not be very sensitive to this
conjecture since for large N the last term in (2.26) becomes very small.
Note that even if technological change does not directly appear in the produc-
tion function, that is, even if for instance (2.21) is replaced by Yt = F (Kt ; Lt );
the economy may experience a rising standard of living when Q is growing over
time.
In contrast, disembodied technological change occurs when new technical and
organizational knowledge increases the combined productivity of the production
factors independently of when they were constructed or educated. If the Kt
appearing in (2.21), (2.22), and (2.23) above refers to the total, historically ac-
cumulated capital stock as calculated by (2.26), then the evolution of T in these
expressions can be seen as representing disembodied technological change. All
vintages of the capital equipment bene…t from a rise in the technology level Tt :
No new investment is needed to bene…t.
Based on data for the U.S. 1950-1990, and taking quality improvements into
account, Greenwood et al. (1997) estimate that embodied technological progress
explains about 60% of the growth in output per man hour. So, empirically,
embodied technological progress seems to play a dominant role. As this tends not
to be fully incorporated in national income accounting at …xed prices, there is
a need to adjust the investment levels in (2.26) to better take estimated quality
improvements into account. Otherwise the resulting K will not indicate the
capital stock measured in e¢ ciency units.
For most issues dealt with in this book the distinction between embodied and
disembodied technological progress is not very important. Hence, unless explicitly
speci…ed otherwise, technological change is understood to be disembodied.
omy. Then F (K; L) is called the aggregate production function or the production
function of the representative …rm. It is as if aggregate production is the result
of the behavior of such a single …rm.
A simple example where the aggregate production function is well-de…ned is
the following. Suppose that all …rms have the same production function so that
Yi = F (Ki ; Li ); i = 1; 2; : : : ; n: If in addition F has CRS, we have
where ki Ki =Li : Hence, facing given factor prices, cost-minimizing …rms will
choose
P the same
P capital intensity ki = k for all i: From Ki = kLi then follows
i Ki = k i Li so that k = K=L: Thence,
X X X
Y Yi = Li f (ki ) = f (k) Li = f (k)L = F (k; 1)L = F (K; L):
In this (trivial) case the aggregate production function is well-de…ned and turns
out to be exactly the same as the identical CRS production functions of the
individual …rms. Moreover, given CRS and ki = k for all i; we have @Yi =@Ki
= f 0 (ki ) = f 0 (k) = FK (K; L) for all i: So each …rm’s marginal productivity of
capital is the same as the marginal productivity of capital on the basis of the
aggregate production function.
Allowing for the existence of di¤erent production functions at …rm level, we
may de…ne the aggregate production function as
Here it is no longer generally true that @Yi =@Ki (= FKi (Ki ; Li ) = @Y =@K (=
FK (K; L):
A next step is to allow also for the existence of di¤erent output goods, dif-
ferent capital goods, and di¤erent types of labor. This makes the issue even
more intricate, of course. Yet, if …rms are price taking pro…t maximizers and
face nonincreasing returns to scale, we at least know from microeconomics that
the aggregate outcome is as if, for given prices, the …rms jointly maximize aggre-
gate pro…t on the basis of their combined production technology. The problem
is, however, that the conditions needed for this to imply existence of an aggre-
gate production function which is well-behaved (in the sense of inheriting simple
qualitative properties from its constituent parts) are restrictive.
Nevertheless macroeconomics often treats aggregate output as a single homo-
geneous good and capital and labor as being two single and homogeneous inputs.
There was in the 1960s a heated debate about the problems involved in this,
with particular emphasis on the aggregation of di¤erent kinds of equipment into
one variable, the capital stock “K”. The debate is known as the “Cambridge
controversy”because the dispute was between a group of economists from Cam-
bridge University, UK, and a group from Massachusetts Institute of Technology
(MIT), which is located in Cambridge, USA. The former group questioned the
theoretical robustness of several of the neoclassical tenets, including the propo-
sition that a higher aggregate capital intensity is induced by a lower rate of
interest. Starting at the disaggregate level, an association of this sort is not a
logical necessity because, with di¤erent production functions across the indus-
tries, the relative prices of produced inputs tend to change, when the interest
rate changes. While acknowledging the possibility of “paradoxical”relationships,
the MIT group maintained that in a macroeconomic context they are likely to
cause devastating problems only under exceptional circumstances. In the end this
is a matter of empirical assessment.18
To avoid complexity and because, for many important issues in macroeco-
nomics, there is today no well-tried alternative, this book is about models that
use aggregate constructs like “Y ”, “K”, and “L” as simplifying devices, assum-
ing they are, for a broad class of cases, acceptable in a …rst approximation. Of
course there are cases where some disaggregation is pertinent. When for example
the role of imperfect competition is in focus, we shall be ready to (modestly)
disaggregate the production side of the economy into several product lines, each
producing its own di¤erentiated product (cf. Section 2.5.3).
Like the representative …rm, the representative household and the aggregate
consumption function are simplifying notions that should be applied only when
they do not get in the way of the issue to be studied. The role of budget con-
straints may make it even more di¢ cult to aggregate over households than over
…rms. Yet, if (and that is a big if) all households have the same constant propen-
sity to consume out of income or wealth, aggregation is straightforward and the
representative household is a meaningful simplifying concept. On the other hand,
if we aim at understanding, say, the interaction between lending and borrowing
households, perhaps via …nancial intermediaries, the representative household is
not a useful starting point. Similarly, if the theme is con‡icts of interests between
…rm owners and employees, the existence of di¤erent types of households should
be taken into account. Or if we want to assess the welfare costs of business cycle
‡uctuations, we have to take heterogeneity into account in view of the fact that
exposure to unemployment risk tends to be very unevenly distributed.
18
In his review of the Cambridge controversy Mas-Colell (1989) concluded that: “What the
‘paradoxical’comparative statics [of disaggregate capital theory] has taught us is simply that
modelling the world as having a single capital good is not a priori justi…ed. So be it.”
(a) There is only one produced good, an all-purpose good that can be used for
consumption as well as investment. Physical capital is just the accumulated
amount of what is left of the produced good after consumption. Models
using this simpli…cation are called one-sector models. One may think of
“corn”, a good that can be used for consumption as well as investment in
the form of seed to yield corn next period.
(c) Capital goods become productive immediately upon purchase or renting (so
installation costs and similar features are ignored).
(d) In all markets perfect competition rules and so the economic actors are price
takers, perceiving no constraint on how much they can sell or buy at the
going market price. It is understood that market prices are ‡exible and
adjust quickly to levels required for market clearing.
where technological change is ignored. Although in this book often CRS will be
assumed, we may throw the CRS outcome in relief by starting with a broader
view.
From microeconomics we know that equilibrium with perfect competition is
compatible with producers operating under the condition of locally nonincreasing
returns to scale (cf. Fig. 2.2). In standard macroeconomics it is common to
accept a lower level of generality and simply assume that F is a concave function.
This allows us to carry out the analysis as if there were non-increasing returns
to scale everywhere (see Appendix D).19
Since F is neoclassical, we have FKK < 0 and FLL < 0 everywhere. To
guarantee concavity it is then necessary and su¢ cient to add the assumption
that
D FKK (K; L)FLL (K; L) FKL (K; L)2 0; (2.29)
holds for all (K; L): This is a simple application of a general theorem on concave
functions (see Math Tools).
We consider both K and L as variable production factors. Let the factor
prices be denoted wK and wL ; respectively. For the time being we assume the
…rm rents the machines it uses; then the price, wK ; of capital services is called
the rental price or the rental rate. As numeraire (unit of account) we apply the
output good. So all prices are measured in terms of the output good which itself
has the price 1. Then pro…t, de…ned as revenue minus costs, is
= F (K; L) wK K wL L: (2.30)
We assume both production inputs are variable inputs. Taking the factor prices
as given from the factor markets, the …rm’s problem is to choose (K; L); where
K 0 and L 0, so as to maximize . An interior solution will satisfy the
…rst-order conditions
@
= FK (K; L) wK = 0 or FK (K; L) = wK ; (2.31)
@K
@
= FL (K; L) wL = 0 or FL (K; L) = wL : (2.32)
@L
Since F is concave, so is the pro…t function. The …rst-order conditions are then
su¢ cient for (K; L) to be a solution.
It is now convenient to proceed by considering the two cases, DRS and CRS,
separately.
19
By de…nition, concavity means that by applying a weighted average of two factor combina-
tions, (K1 ; L1 ) and (K2 ; L2 ); the obtained output is at least as large as the weighted average
of the original outputs, Y1 and Y2 . So, if 0 < < 1 and (K; L) = (K1 ; L1 ) +(1 )(K2 ; L2 ),
then F (K; L) F (K1 ; L1 ) +(1 )F (K2 ; L2 ).
K d = K(wK ; wL ); Ld = L(wK ; wL ):
An easy way to …nd the partial derivatives of these functions is to …rst take the
di¤erential20 of both sides of (2.31) and (2.32), respectively:
Then we interpret these conditions as a system of two linear equations with two
unknowns, the variables dK d and dLd : The determinant of the coe¢ cient matrix
equals D in (2.29) and is in this case positive everywhere. Using Cramer’s rule
(see Math Tools), we …nd
@K d FLL @K d FKL
= < 0; = < 0 if FKL > 0; (2.33)
@wK D @wL D
@Ld FKL @Ld FKK
= < 0 if FKL > 0; = < 0; (2.34)
@wK D @wL D
20
The di¤ erential of a di¤erentiable function is a convenient tool for deriving results like
(2.33) and (2.34). For a function of one variable, y = f (x); the di¤erential is denoted dy (or df )
and is de…ned as f 0 (x)dx; where dx is some arbitrary real number (interpreted as the change in
x): For a di¤erentiable function of two variables, z = g(x; y) ; the di¤ erential of the function is
denoted dz (or dg) and is de…ned as dz = gx (x; y)dx +gy (x; y)dy; where dx and dy are arbitrary
real numbers.
D=0
Figure 2.4: Constancy of MRS along rays when the production function is homogeneous
of degree h (the cost-minimizing capital intensity is the same at all output levels).
justment process.23 We imagine that our period is sub-divided into many short
time intervals (t; t + t): In the initial short time interval the factor markets
may not be in equilibrium. It is assumed that no capital or labor is hired out
of equilibrium. To allow an analysis in continuous time, we let t ! 0: A dot
over a variable denotes the time derivative, i.e., x(t)
_ = dx(t)=dt. The adjustment
process assumed is the following:
K_ d (t) = 1 FK (K d (t); Ld (t)) wK (t) ; 1 > 0;
L_ d (t) = 2
d d
FL (K (t); L (t)) wL (t) ; 2 > 0;
d s
w_ K (t) = K (t) K ;
w_ L (t) = Ld (t) Ls ;
where the initial values, K d (0); Ld (0); wK (0); and wL (0); are given. The parame-
ters 1 and 2 are constant adjustment speeds. The corresponding adjustment
speeds for the factor prices are set equal to one by choice of measurement units of
the inputs. Of course, the four endogenous variables should be constrained to be
nonnegative, but that is not important for the discussion here. The system has
a unique stationary state: K d (t) = K s ; Ld (t) = Ls ; wK (t) = KK (K s ; Ls ); wL (t)
= KL (K s ; Ls ):
A widespread belief, even in otherwise well-informed circles, seems to be that
with such adjustment dynamics, the stationary state is at least locally asymptot-
ically stable. By this is meant that there exists a (possibly only small) neigh-
borhood, N , of the stationary state with the property that if the initial state,
(K d (0); Ld (0); wK (0); wL (0)); belongs to N ; then the solution (K d (t); Ld (t);
wK (t); wL (t)) converges to the stationary state for t ! 1?
Unfortunately, however, this stability property is not guaranteed. To bear
1 1
this out, it is enough to present a counterexample. Let F (K; L) = K 2 L 2 ; 1
= 2 = K s = Ls = 1; and suppose K d (0) = Ld (0) > 0 and wK (0) = wL (0) > 0:
All this symmetry implies that K d (t) = Ld (t) = x(t) > 0 and wK (t) = wL (t)
= w(t) for all t 0: So FK (K d (t); Ld (t)) = 0:5x(t) 0:5 x(t)0:5 = 0:5; and similarly
FL (K d (t); Ld (t)) = 0:5 for all t 0: Now the system is equivalent to the two-
dimensional system,
x(t)
_ = 0:5 w(t); (2.40)
w(t)
_ = x(t) 1: (2.41)
Using the theory of coupled linear di¤erential equations, the solution is24
x(t) = 1 + (x(0) 1) cos t (w(0) 0:5) sin t; (2.42)
w(t) = 0:5 + (w(0) 0:5) cos t + (x(0) 1) sin t: (2.43)
23
Tâtonnement is a French word meaning “groping”.
24
For details, see hints in Exercise 2.6.
The solution exhibits undamped oscillations and never settles down at the sta-
tionary state, (1; 0:5); if not being there from the beginning. In fact, the solution
curves in the (x; w) plane will be circles around
p the stationary state. This is
so whatever the size of the initial distance, (x(0) 1)2 + (w(0) 0:5)2 ; to the
stationary point.
The economic mechanism is as follows. Suppose for instance that x(0) < 1
and w(0) < 0:5: Then to begin with there is excess supply and so w will be falling
while, with w below marginal products, x will be increasing. When x reaches its
potential equilibrium value, 1, w is at its trough and so induces further increases
in the factor demands, thus bringing about a phase where x > 1: This excess
demand causes w to begin an upturn. When w reaches its potential equilibrium
value, 0.5, however, excess demand, x 1; is at its peak and this induces further
increases in factor prices, w: This brings about a phase where w > 0:5 so that
factor prices exceed marginal products, which leads to declining factor demands.
But as x comes back to its potential equilibrium value, w is at its peak and drives
x further down. Thus excess supply arises which in turn triggers a downturn of w:
This continues in never ending oscillations where the overreaction of one variable
carries the seed to an overreaction of the other variable soon after and so on.
This possible outcome underlines that the theoretical existence of equilibrium
is one thing and stability of the equilibrium is another. In particular under CRS,
where demand functions for inputs are absent, the issue of stability can be more
intricate than one might at …rst glance think.
ciation or the deterioration rate. When changes in relative prices can occur, this
must be distinguished from the economic depreciation of capital which refers to
the loss in economic value of a machine after one year.
Let pt 1 be the price of a certain type of machine bought at the end of period
t 1: Let prices be expressed in the same numeraire as that in which the interest
rate, r; is measured. And let pt be the price of the same type of machine one
period later. Then the economic depreciation in period t is
pt 1 (1 )pt = pt (pt pt 1 ):
The economic depreciation thus equals the value of the physical wear and tear
minus the capital gain (positive or negative) on the machine.
By holding the machine the owner faces an opportunity cost, namely the
forgone interest on the value pt 1 placed in the machine during period t: If rt is
the interest rate on a loan from the end of period t 1 to the end of period t; this
interest cost is rt pt 1 : The bene…t of holding the (new) machine is that it can be
rented out to the representative …rm and provide the return wKt at the end of
the period. Since there is no uncertainty, in equilibrium we must then have wKt
= rt pt 1 + pt (pt pt 1 ); or
wKt p t + pt pt 1
= rt : (2.44)
pt 1
This is a no-arbitrage condition saying that the rate of return on holding the
machine equals the rate of return obtainable in the loan market (no pro…table
arbitrage opportunities are available).25
In the simple setup considered so far, the capital good and the produced good
are physically identical and thus have the same price. As the produced good
is our numeraire, we have pt 1 = pt = 1: This has two implications. First, the
interest rate, rt ; is a real interest rate so that 1 + rt measures the rate at which
future units of output can be traded for current units of output. Second, (2.44)
simpli…es to
wKt = rt :
Combining this with equation (2.38), we see that in the simple neoclassical setup
the equilibrium real interest rate is determined as
where KtS and Lst are predetermined. Under CRS this takes the form rt = f 0 (kts )
where K is the size of the installed machinery (a …xed factor in the short run)
measured in e¢ ciency units, u is its utilization rate (0 u 1); and A and B
are given technical coe¢ cients measuring e¢ ciency (cf. Section 2.1.2).
So in the short run the choice variables are u and L: In fact, essentially only
u is a choice variable since e¢ cient production trivially requires L = AuK=B:
Under “full capacity utilization” we have u = 1 (each machine is used 24 hours
per day seven days per week). “Capacity” is given as AK per week. Producing
e¢ ciently at capacity requires L = AK=B and the marginal product by increasing
labor input is here nil. But if demand, Y d ; is less than capacity, satisfying this
demand e¢ ciently requires L = Y d =B and u = BL=(AK) < 1: As long as u < 1;
the marginal productivity of labor is a constant, B:
The various e¢ cient input proportions that are possible ex ante may be ap-
proximately described by a neoclassical CRS production function. Let this func-
tion on intensive form be denoted y = f (k): When investment is decided upon
and undertaken, there is thus a choice between alternative e¢ cient pairs of the
technical coe¢ cients A and B in (2.46). These pairs satisfy
f (k) = Ak = B: (2.47)
In contrast, the standard neoclassical setup assumes the same range of sub-
stitutability between capital and labor ex ante and ex post. Then the technology
is called putty-putty. This term may also be used if ex post there is at least some
substitutability although less than ex ante. At the opposite pole of putty-putty
we may consider a technology which is clay-clay. Here neither ex ante nor ex post
is factor substitution possible. Table 1 gives an overview of the alternative cases.
The putty-clay case is generally considered the realistic case. As time pro-
ceeds, technological progress occurs. To take this into account, we may replace
(2.47) and (2.46) by f (kt ; t) = At kt = Bt and Yt = min(At ut Kt ; Bt Lt ); respec-
tively. If a new pair of Leontief coe¢ cients, (At2 ; Bt2 ); e¢ ciency-dominates its
predecessor (by satisfying At2 At1 and Bt2 Bt1 with at least one strict equal-
ity), it may pay the …rm to invest in the new technology at the same time as
some old machinery is scrapped. Real wages tend to rise along with technolog-
ical progress and the scrapping occurs because the revenue from using the old
machinery in production no longer covers the associated labor costs.
The clay property ex-post of many technologies is important for short-run
analysis. It implies that there may be non-decreasing marginal productivity of
labor up to a certain point. It also implies that in its investment decision the
…rm will have to take expected future technologies and future factor prices into
account. For many issues in long-run analysis the clay property ex-post may be
less important, since over time adjustment takes place through new investment.
i = Pi Y i WK Ki WL Li ;
27
We ignore production for inventory holding.
subject to (2.48) and the neoclassical production function Yi = F (Ki ; Li ): For the
purpose of simple comparison with the case of perfect competition as described
in Section 2.4, we return to the case where both labor and capital are variable
inputs in the short run.28 It is no serious restriction on the problem to assume
the monopolist will want to produce the amount demanded so that Yi = D(Pi ):
It is convenient to solve the problem in two steps.
Step 1. Imagine the monopolist has already chosen the output level Yi : Then
the problem is to minimize cost:
min WK Ki + WL Li s.t. F (Ki ; Li ) = Yi :
Ki ;Li
28
Generally, the technology would di¤er across the di¤erent product lines and F should thus
be replaced by F i , but for notational convenience we ignore this.
We have here introduced “total revenue” R(Yi ) = P(Yi )Yi , where P(Yi ) is the
inverse demand function de…ned by P(Yi ) D 1 (Yi ) = [Yi =(Y =n)] 1=" P from
(2.48). The …rst-order condition is
where the left-hand side is marginal revenue and the right-hand side is marginal
cost.
A su¢ cient second-order condition is that 00 (Yi ) = R00 (Yi ) C 00 (Yi ) < 0; i.e.,
the marginal revenue curve crosses the marginal cost curve from above. In the
present case this is surely satis…ed if we assume C 00 (Yi ) 0; which also ensures
existence and uniqueness of a solution to (2.50). Substituting this solution, which
we denote Yis ; cf. Fig. 2.5, into the conditional factor demand functions from
Step 1, we …nd the factor demands, Kid and Ldi : Owing to the downward-sloping
demand curves the factor demands are unique whether the technology exhibits
DRS, CRS, or IRS. Thus, contrary to the perfect competition case, neither CRS
nor IRS pose particular problems.
From the de…nition R(Yi ) = P (Yi )Yi follows
Yi 0 1 " 1
R0 (Yi ) = Pi 1 + P (Yi ) = Pi 1 = Pi :
Pi " "
where Yis = F (Kid ; Ldi ): These conditions follow from (2.49), since the Lagrange
multiplier equals marginal cost (see Appendix A), which equals marginal revenue.
That is, at pro…t maximum the marginal revenue products of capital and labor,
respectively, equal the corresponding factor prices. Since Pi > R0 (Yis ); the factor
prices are below the value of the marginal productivities. This re‡ects the market
power of the …rms.
In macro models a lot of symmetry is often assumed. If there is complete
symmetry across product lines and if factor markets clear as in (2.36) and (2.37)
with inelastic factor supplies, K s and Ls ; then Kid = K s =n and Ldi = Ls =n:
Furthermore, all …rms will choose the same price so that Pi = P; i = 1; 2; : : : ; n:
Then the given factor supplies, together with (2.51) and (2.52), determine the
equilibrium real factor prices:
WK 1 K s Ls
wK = FK ( ; );
P 1+ n n
WL 1 K s Ls
wL = FL ( ; );
P 1+ n n
where we have used that R0 (Yis ) = P=(1+ ) under these circumstances. As under
perfect competition, the real factor prices are proportional to the corresponding
marginal productivities, although with a factor of proportionality less than one,
namely equal to the inverse of the markup. This observation is sometimes used
as a defence for applying the simpler perfect-competition framework for studying
certain long-run aspects of the economy. For these aspects, the size of the pro-
portionality factor may be immaterial, at least as long as it is relatively constant
over time. Indeed, the constant markups open up for a simple transformation of
many of the perfect competition results to monopolistic competition results by
inserting the markup factor 1 + the relevant places in the formulas.
If in the short term only labor is a variable production factor, then (2.51)
need not hold. As claimed by Keynesian and New Keynesian thinking, also the
prices chosen by the …rms may be more or less …xed in the short run because
the …rms face price adjustment costs (“menu costs”) and are reluctant to change
prices too often, at least vis-a-vis changes in demand. Then in the short run only
the produced quantity will adjust to changes in demand. As long as the output
level is within the range where marginal cost is below the price, such adjustments
are still bene…cial to the …rm. As a result, even (2.52) may at most hold “on
average” over the business cycle. These matters are dealt with in Part V of this
book.
In practice, market power and other market imperfections also play a role in
the factor markets, implying that further complicating elements enter the pic-
ture. One of the tasks of theoretical and empirical macroeconomics is to clarify
the aggregate implications of market imperfections and sort out which market
imperfections are quantitatively important in di¤erent contexts.
2.7 Appendix
A. Strict quasiconcavity
Consider a function f : A ! R, where A is a convex set, A Rn .30 Given a
real number a; if f (x) = a, the upper contour set is de…ned as fx 2 Aj f (x) ag
(the set of input bundles that can produce at least the amount a of output). The
function f (x) is called quasiconcave if its upper contour sets, for any constant
a; are convex sets. If all these sets are strictly convex, f (x) is called strictly
quasiconcave.
Average and marginal costs To show that (2.14) holds with n production
inputs, n = 1; 2;. . . , we derive the cost function of a …rm with a neoclassical
production function, Y = F (X1 ; X2 ; : : : ; Xn ): Given a vector of strictly positive
input prices w = (w1 ; : : : ; wn ) >> 0; the …rm faces the problem of …nding a cost-
minimizing way to produce a given positive output level Y within the range of
F: The problem is
Xn
min wi Xi s.t. F (X1 ; : : : ; Xn ) = Y and Xi 0; i = 1; 2; : : : ; n:
i=1
where the third equality comes from the …rst-order conditions, and the last equal-
ity is due to the constraint
Pn F0 (X (Y )) = Y ; which, by taking the total derivative
0
on both sides, gives i=1 Fi (X )Xi (Y ) = 1: Consequently, the ratio of average
to marginal costs is
Pn Pn 0
C(Y )=Y i=1 wi Xi (Y ) i=1 Fi (X )Xi (Y )
= = ;
C 0 (Y ) Y F (X )
30
Recall that a set S is said to be convex if x; y 2 S and 2 [0; 1] implies x + (1 )y 2 S:
31
Since in this section we use a bit of vector notation, we exceptionally mark …rst-order partial
derivatives by a prime in order to clearly distinguish from the elements of a vector (so we write
Fi0 instead of our usual Fi ):
32
See Sydsaeter et al. (2008), pp. 74, 75, and 125.
which in analogy with (2.13) is the elasticity of scale at the point X : This proves
(2.14).
Su¢ cient conditions for strict quasiconcavity The claim (iii) in Section
2.1.3 was that a continuously di¤erentiable two-factor production function F (K; L)
with CRS, satisfying FK > 0; FL > 0; and FKK < 0; FLL < 0; will automatically
also be strictly quasi-concave in the interior of R2 and thus neoclassical.
To prove this, consider a function of two variables, z = f (x, y); that is twice
continuously di¤erentiable with f1 @z=@x > 0 and f2 @z=@y > 0; everywhere.
Then the equation f (x, y) = a; where a is a constant, de…nes an isoquant,
y = g(x); with slope g 0 (x) = f1 (x; y)=f2 (x; y): Substitute g(x) for y in this
equation and take the derivative w.r.t. x. By straightforward calculation we …nd
If the numerator is negative, then g 00 (x) > 0; that is, the isoquant is strictly
convex to the origin. And if this holds for all (x, y), then f is strictly quasi-
concave in the interior of R2 . A su¢ cient condition for a negative numerator is
that f11 < 0, f22 < 0 and f21 0. All these conditions, including the last three
are satis…ed by the given function F: Indeed, FK ; FL , FKK ; and FLL have the
required signs. And when F has CRS, F is homogeneous of degree 1 and thereby
FKL > 0; see Appendix B. Hereby claim (iii) in Section 2.1.3 is proved.
In (2.55) we can substitute FLK = FKL (by Young’s theorem). In view of Claim
2 this shows:
CLAIM 3 The marginal products, FK and FL , considered as functions of K and
L, are homogeneous of degree h 1.
We see also that when h 1 and K and L are positive; then
For h = 1 this establishes the direct complementarity result, (iv) in Section 2.1.3,
to be proved. A by-product of the derivation is that also when a neoclassical
production function is homogeneous of degree h > 1 (which implies IRS), does
direct complementarity between K and L hold.
Remark. The terminology around complementarity and substitutability may eas-
ily lead to confusion. In spite of K and L exhibiting direct complementarity when
FKL > 0; K and L are still substitutes in the sense that cost minimization for a
given output level implies that a rise in the price of one factor results in higher
demand for the other factor.
The claim (v) in Section 2.1.3 was the following. Suppose we face a CRS
production function, Y = F (K; L); that has positive marginal products, FK and
FL ; everywhere and isoquants, K = g(L); satisfying the condition g 00 (L) > 0
everywhere (i.e., F is strictly quasi-concave). Then the partial second derivatives
must satisfy the neoclassical conditions:
The proof is as follows. The …rst inequality in (2.59) follows from (2.53) combined
with (2.55). Indeed, for h = 1; (2.55) and (2.56) imply FKK = FLK L=K
= FKL L=K and FKL = FLL L=K, i.e., FKK = FLL (L=K)2 (or, in the notation
of Appendix A, f22 = f11 (x=y)2 ), which combined with (2.53) gives the conclusion
FKK < 0, when g 00 > 0. The second inequality in (2.59) can be veri…ed in a similar
way.
Note also that for h = 1 the equations (2.55) and (2.56) entail
respectively. By dividing the left- and right-hand sides of the …rst of these equa-
tions with those of the second we conclude that FKK FLL = FKL 2
in the CRS case.
We see also from (2.60) that, under CRS, the implications in (2.57) and (2.58)
can be turned round.
Finally, we asserted in § 2.1.1 that when the neoclassical production function
Y = F (K, L) is homogeneous of degree h, then the marginal rate of substitution
between the production factors depends only on the factor proportion k K=L:
Indeed,
where k K=L: The result (2.61) follows even if we only assume F (K; L) is
homothetic. When F (K; L) is homothetic, by de…nition we can write F (K, L)
'(G(K; L)), where G is homogeneous of degree 1 and ' is an increasing function.
In view of this, we get
Essential inputs In Section 2.1.2 we claimed that the upper Inada condition
for M P L together with CRS implies that without capital there will be no output:
In other words: in this case capital is an essential input. To prove this claim, let
K > 0 be …xed and let L ! 1: Then k ! 0; implying, by (2.16) and (2.18),
that FL (K; L) = f (k) f 0 (k)k ! f (0): But from the upper Inada condition for
M P L we also have that L ! 1 implies FL (K; L) ! 0: It follows that
Since under CRS, for any L > 0; F (0; L) = LF (0; 1) Lf (0); we have hereby
shown our claim.
Similarly, we can show that the upper Inada condition for M P K together
with CRS implies that labor is an essential input. Consider the output-capital
ratio x Y =K: When F has CRS, we get x = F (1; `) g(`); where ` L=K;
g 0 > 0; and g 00 < 0: Thus, by symmetry with the previous argument, we …nd that
under CRS, the upper Inada condition for M P K implies g(0) = 0: Since under
CRS F (K; 0) = KF (1; 0) Kg(0); we conclude that the upper Inada condition
for M P K together with CRS implies
Su¢ cient conditions for output going to in…nity when either input goes
to in…nity Here our …rst claim is that when F exhibits CRS and satis…es the
upper Inada condition for M P L and the lower Inada condition for M P K, then
To prove this, note that Y can be written Y = Kf (k)=k; since K=k = L: Here,
The proof is analogue. So, in combination, the four Inada conditions imply, under
CRS, that output has no upper bound when either input goes to in…nity.
X
2
F (x) Fi0 (x)xi : (2.63)
i=1
Suppose x 2R2++ . Then F (x) > 0 in view of F being neoclassical so that FK > 0
and FL > 0: From (2.63) we now …nd the elasticity of scale to be
X
2
Fi0 (x)xi =F (x) 1: (2.64)
i=1
In view of (2.13) and (2.12), this implies non-increasing returns to scale every-
where.
CLAIM 2 When a neoclassical production function F (K; L) is strictly concave,
it has decreasing returns to scale everywhere.
Proof. The argument is analogue to that above, but in view of strict concavity
the inequalities in (2.63) and (2.64) become strict. This implies that F has DRS
everywhere.
2.8 Exercises
2.1
63
Chapter 3
There exists two main analytical frameworks for analyzing the basic intertemporal
choice, consumption versus saving, and the dynamic long-run implications of
this choice: overlapping generations models and representative agent models. In
the …rst class of models the focus is on (a) the interaction between di¤erent
generations alive at the same time, and (b) the never-ending entrance of new
generations. In the second class of models the household sector is modelled as
consisting of a …nite number of in…nitely-lived agents. One interpretation is that
these agents are dynasties where parents take the utility of their descendants fully
into account by leaving bequests. This approach, which is also called the Ramsey
approach (after the British mathematician and economist Frank Ramsey, 1903-
1930), will be described in Chapter 8 (discrete time) and Chapter 10 (continuous
time).
In the present chapter we introduce the overlapping generations approach
which has shown its usefulness for analysis of questions associated with public
debt problems, taxation of capital income, …nancing of social security (pensions),
design of educational systems, non-neutrality of money, and the possibility of
speculative bubbles. Our focus will be on the overlapping generations model
called Diamond’s OLG model1 after the American economist and Nobel Prize
laureate Peter A. Diamond (1940-).
Among the strengths of the model are:
The life-cycle aspect of human behavior is taken into account. Although
the economy is in…nitely-lived, the individual agents have …nite time hori-
zons. During lifetime one’s educational level, working capacity, income, and
needs change and this is re‡ected in the individual labor supply and saving
behavior. The aggregate implications of the life-cycle behavior of coexisting
individual agents at di¤erent stages in their life is at the centre of attention.
1
Diamond (1965).
65
66 CHAPTER 3. THE BASIC OLG MODEL: DIAMOND
(b) The precautionary motive for saving. Income as well as needs may vary
due to conditions of uncertainty: sudden unemployment, illness, or other
kinds of bad luck. By saving, the individual can obtain a bu¤er against
such unwelcome events.
Horioka and Watanabe (1997) …nd that empirically, the saving motives (a)
and (b) are of dominant importance (Japanese data). Yet other motives include:
(c) Saving enables the purchase of durable consumption goods and owner-occupied
housing as well as repayment of debt.
(e) Saving may simply be motivated by the fact that …nancial wealth may lead
to social prestige and economic or political power.
2. Only the young work. Each young supplies one unit of labor inelastically.
The division of available time between work and leisure is thereby considered
as exogenous.
6. In each period three markets are open, a market for output, a market for
labor services, and a market for capital services. Perfect competition rules
in all markets. Uncertainty is absent; when a decision is made, its conse-
quences are known.
Assumption 7 entails the following. First, the agents are assumed to have
“rational expectations”or, with a better name, “model-consistent expectations”.
2
As to the disregard of money we may imagine that agents have safe electronic accounts in
a …ctional central bank allowing costless transfers between accounts.
This means that forecasts made by the agents coincide with the forecasts that
can be calculated on the basis of the model. Second, as there are no stochastic
elements in the model (no uncertainty), the forecasts are point estimates rather
than probabilistic forecasts. Thereby the model-consistent expectations take the
extreme form of perfect foresight: the agents agree in their expectations about
the future evolution of the economy and these expectations are point estimates
that coincide with the subsequent actual evolution of the economy.
Let the output good be the numeraire and let r^t denote the rental rate for
capital in period t; that is, r^t is the real price a …rm has to pay at the end of
period t for the right to use one unit of someone else’s physical capital through
period t. So the owner of Kt units of physical capital receives a
r^t Kt Kt
real (net) rate of return on capital = = r^t ; (3.1)
Kt
r^t = rt : (3.2)
This no-arbitrage condition indicates how the rental rate for capital and the more
everyday concept, the interest rate, would be related in an equilibrium where
both the market for capital services and a credit market were active. We shall
see, however, that in this model no credit market will be active in an equilibrium.
Nevertheless we will follow the tradition and call the right-hand side of (3.2) the
interest rate.
Table 3.1 provides an overview of the notation. As to our timing convention,
notice that any stock variable dated t indicates the amount held at the beginning
of period t: That is, the capital stock accumulated by the end of period t 1
and available for production in period t is denoted Kt : We therefore write Kt
= (1 )Kt 1 + It 1 and Yt = F (Kt ; Lt ); where F is an aggregate production
function. In this context it is useful to think of “period t”as running from date t
to date t + 1: So period t is the time interval [t; t + 1) on a continuous time axis.
Still, all decisions are made at discrete points in time t = 0; 1; 2; ::: (“dates”). We
imagine that receipts for work and lending as well as payment for the consumption
in period t occur at the end of the period. These timing conventions are common
in discrete-time growth and business cycle theory;3 they are convenient because
they make switching between discrete and continuous time analysis fairly easy.
3
In contrast, in accounting and …nance literature, typically Kt would denote the end-of-
period-t stock that begins to yield its services next period.
The interpretation of the variables is given in Table 3.1 above. We may think
of the “young” as a household consisting of one adult and 1 + n children whose
consumption is included in c1t : Note that “utility” appears at two levels. There
is a lifetime utility function, U; and a period utility function, u:4 The latter is
assumed to be the same in both periods of life (this has no e¤ects on the qualita-
tive results and simpli…es the exposition). The period utility function is assumed
continuous and twice continuously di¤erentiable with u0 > 0 and u00 < 0 (positive,
but diminishing marginal utility of consumption). Many popular speci…cations
4
Other names for these two functions are the intertemporal utility function and the subutility
function, respectively.
of u, e.g., u(c) = ln c; have the property that limc!0 u(c) = 1; then we de…ne
u(0) = 1:
The parameter is called the rate of time preference. It acts as a utility
discount rate, whereas (1 + ) 1 is a utility discount factor. Thus indicates the
degree of impatience w.r.t. the “arrival” of utility. By de…nition, > 1; but
> 0 is often assumed. When preferences can be represented in this additive way,
they are called time-separable. In principle, as seen from period t the interest rate
appearing in (3.5) should be interpreted as an expected real interest rate. But
as long as we assume perfect foresight, there is no need to distinguish between
actual and expected magnitudes.
In (3.5) the interest rate rt+1 acts as a (net) rate of return on saving.5 An
interest rate may also be seen as a discount rate relating to consumption over time.
Indeed, by isolating st in (3.5) and substituting into (3.4), we may consolidate
5
While st in (3.4) appears as a ‡ow (non-consumed income), in (3.5) st appears as a stock
(the accumulated …nancial wealth at the end of period t). This notation is legitimate because
the magnitude of the two is the same when the time unit is the same as the period length.
In real life the gross payo¤ of individual saving may sometimes be nil (if invested in a project
that completely failed). Unless otherwise indicated, it is in this book understood that an interest
rate is a number exceeding 1 as indicated in (3.5). Thereby the discount factor 1=(1 + rt+1 )
is well-de…ned. In general equilibrium, the condition 1 + rt+1 > 0 is always met in the present
model.
the two period budget constraints of the individual into one budget constraint,
1
c1t + c2t+1 = wt : (3.7)
1 + rt+1
In this intertemporal budget constraint the interest rate appears as the discount
rate entering the discount factor converting future amounts of consumption into
present equivalents, cf. Box 3.1.
In view of the sizeable period length in the model, this is de…nitely plausible.
Inserting the two budget constraints into the objective function in (3.3), we get
U (c1t ; c2t+1 ) = u(wt st ) +(1+ ) 1 u((1+rt+1 )st ) U~t (st ); a function of only one
decision variable, st : According to the non-negativity constraint on consumption
in both periods, (3.6), st must satisfy 0 st wt . Maximizing w.r.t. st gives
the …rst-order condition
dU~t
= u0 (wt st ) + (1 + ) 1 u0 ((1 + rt+1 )st )(1 + rt+1 ) = 0: (FOC)
dst
d2 U~t
= u00 (wt st ) + (1 + ) 1 u00 ((1 + rt+1 )st )(1 + rt+1 )2 < 0: (SOC)
ds2t
Hence there can at most be one st satisfying (FOC). Moreover, for a positive
wage income there always exists such an st : Indeed:
LEMMA 1 Let wt > 0 and suppose the No Fast Assumption (A1) applies. Then
the saving problem of the young has a unique solution st = s(wt ; rt+1 ). The
solution is interior, i.e., 0 < st < wt ; and st satis…es (FOC).
Proof. Assume (A1). For any s 2 (0; wt ); dU~t (s)=ds > 1: Now consider the
endpoints s = 0 and s = wt : By (FOC) and (A1),
dU~t
lim = u0 (wt ) + (1 + ) 1 (1 + rt+1 ) lim u0 ((1 + rt+1 )s) = 1;
s!0 ds s!0
dU~t
lim = lim u0 (wt s) + (1 + ) 1 (1 + rt+1 )u0 ((1 + rt+1 )wt ) = 1:
s!w ds s!wt
This is known as an Euler equation, after the Swiss mathematician L. Euler (1707-
1783) who was the …rst to study dynamic optimization problems. In the present
context the condition is called a consumption Euler equation.
Intuitively, in an optimal plan the marginal utility cost of saving must equal
the marginal utility bene…t obtained by saving. The marginal utility cost of
saving is the opportunity cost (in terms of current utility) of saving one more
unit of account in the current period (approximately). This one unit of account
is transferred to the next period with interest so as to result in 1 + rt+1 units of
account in that period. An optimal plan requires that the utility cost equals the
utility bene…t of having rt+1 more units of account in the next period. And this
utility bene…t is the discounted value of the extra utility that can be obtained
next period through the increase in consumption by rt+1 units.
It may seem odd to attempt an intuitive interpretation this way, that is, in
terms of “utility units”. The utility concept is just a convenient mathematical de-
vice used to represent the assumed preferences. Our interpretation is only meant
as an as-if interpretation: as if utility were something concrete. An interpretation
in terms of concrete measurable quantities goes like this. We rewrite (3.8) as
u0 (c1t )
= 1 + rt+1 : (3.9)
(1 + ) 1 u0 (c2t+1 )
The left-hand side measures the marginal rate of substitution, MRS, of consump-
tion as old for consumption as young, evaluated at the point (c1 ; c2 ): MRS is
6
Alternatively, one could use the Lagrange method.
@f
= u00 (c1t ) > 0;
@wt
@f
= (1 + ) 1 [u0 (c2t+1 ) + u00 (c2t+1 )st (1 + rt+1 )] :
@rt+1
Consequently, the partial derivatives of the saving function st = s(wt ; rt+1 ) are
The elasticity of marginal utility, also called the marginal utility ‡exibility,
will generally depend on the level of consumption, as implicit in the notation
(c2t+1 ): There exists a popular special case, however, where the elasticity of
marginal utility is constant.
EXAMPLE 1 The CRRA utility function. If we impose the requirement that
u(c) should have an absolute elasticity of marginal utility of consumption equal
to a constant > 0; then one can show (see Appendix A) that the utility function
must be of the CRRA form:
c1 1
; when 6= 1;
u(c) = 1 ; (3.14)
ln c; when = 1:
It may seem odd that in the upper case we subtract the constant 1=(1 )
1
from c =(1 ): But adding or subtracting a constant from a utility function
does not a¤ect the marginal rate of substitution and consequently not behavior.
Notwithstanding that we could do without this constant, its presence in (3.14)
has two advantages. One is that in contrast to c1 =(1 ); the expression
(c1 1)=(1 ) can be interpreted as valid even for = 1; namely as identical
to ln c: This is because (c1 1)=(1 ) ! ln c for ! 1 (by L’Hôpital’s
rule for “0/0”). Another advantage is that the kinship between the di¤erent
members, indexed by ; of the CRRA family becomes more transparent. Indeed,
by de…ning u(c) as in (3.14), all graphs of u(c) will go through the same point as
the log function, namely (1; 0); cf. Fig. 3.2.
The higher is ; the more “curvature”does the corresponding curve in Fig. 3.2
have. In turn, more “curvature”re‡ects a higher incentive to smooth consumption
across time. The reason is that a large curvature means that the marginal utility
will drop sharply if consumption rises and will increase sharply if consumption
falls. Consequently, not much utility is lost by lowering consumption when it
is relatively high but there is a lot of utility to be gained by raising it when it
is relatively low. So the curvature indicates the degree of aversion towards
variation in consumption. Or we may say that indicates the strength of the
preference for consumption smoothing.9
Suppose the period utility is of CRRA form as given in (3.14). (FOC) then
yields an explicit solution for the saving of the young:
1
st = 1 wt : (3.15)
1 + (1 + )( 1+r
1+
t+1
)
9
The name CRRA is a shorthand for Constant Relative Risk Aversion and comes from the
theory of behavior under uncertainty. Also in that theory does the CRRA function constitute an
important benchmark case. And is in that context called the degree of relative risk aversion.
u(c)
θ = 0
θ = 0.5
θ = 1
θ = 2
θ = 5
0 c
1
−1
We see that the signs of @st =@wt and @st =@rt+1 shown in (3.11) and (3.13), re-
spectively, are con…rmed. Moreover, the saving of the young is in this special
case proportional to income with a factor of proportionality that depends on the
interest rate (as long as 6= 1). But in the general case the saving-income ratio
depends also on the income level.
A major part of the attempts at empirically estimating suggests that > 1:
Based on U.S. data, Hall (1988) provides estimates above 5; while Attanasio and
Weber (1993) suggest 1:25 3:33: For Japanese data Okubo (2011) suggests
2:5 5:0: As these studies relate to much shorter time intervals than the
implicit time horizon of about 2 30 years in the Diamond model, we should be
cautious. But if the estimates were valid also to that model, we should expect
the income e¤ect on current consumption of an increase in the interest rate to
dominate the substitution e¤ect, thus implying sr < 0 as long as there is no
dc2
M RS = j ;
dc1 U =U
that is, M RS at the point (c1 ; c2 ) is the absolute value of the slope of the tangent
to the indi¤erence curve at that point.10 Under the “normal” assumption of
“strictly convex preferences” (as for instance in the Diamond model), M RS is
rising along the curve when c1 decreases (and thereby c2 increases). Conversely,
we can let M RS be the independent variable and consider the corresponding
point on the indi¤erence curve, and thereby the ratio c2 =c1 , as a function of
M RS. If we raise M RS along the indi¤erence curve, the corresponding value of
the ratio c2 =c1 will also rise.
The elasticity of intertemporal substitution in consumption at a given point is
de…ned as the elasticity of the ratio c2 =c1 w.r.t. the marginal rate of substitution
of c2 for c1 ; when we move along the indi¤erence curve through the point (c1 ; c2 ).
Letting the elasticity w.r.t. x of a di¤erentiable function f (x) be denoted E`x f (x);
the elasticity of intertemporal substitution in consumption can be written
(c2 =c1 )
c2 M RS d (c2 =c1 ) c2 =c1
E`M RS = j ;
c1 c2 =c1 dM RS U =U M RS
M RS
of good 1 and good 2, that is, the ratio 1=(1=(1 + r)) given in (3.7). Indeed, from
(3.10) and (3.9), omitting the time indices, we have
dc2 u0 (c1 )
M RS = jU =U = =1+r R: (3.16)
dc1 (1 + ) 1 u0 (c2 )
Letting (c1 ; c2 ) denote the elasticity of intertemporal substitution, evaluated at
the point (c1 ; c2 ); we then have
(c2 =c1 )
R d (c2 =c1 ) c2 =c1
(c1 ; c2 ) = jU =U R
: (3.17)
c2 =c1 dR R
We see that if u(c) belongs to the CRRA class and thereby (c1 ) = (c2 ) = ;
then (c1 ; c2 ) = 1= : In this case (as well as whenever c1 = c2 ) the elasticity of
marginal utility and the elasticity of intertemporal substitution are simply the
inverse of each other.
3.4 Production
Output is homogeneous and can be used for consumption as well as investment
in physical capital. The capital stock is thereby just accumulated non-consumed
output. We may imagine a “corn economy” where output is corn, part of which
is eaten (‡our) while the remainder is accumulated as capital (seed corn).
The speci…cation of technology and production conditions follows the sim-
ple competitive one-sector setup discussed in Chapter 2. Although the Diamond
model is a long-run model, we shall in this chapter for simplicity ignore techno-
logical change.
where Y is output (GNP) per period, K is capital input, L is labor input, and
k K=L is the capital-labor ratio. The derived function, f; is the production
function in intensive form. Capital installation and other adjustment costs are
ignored. Hence pro…t is F (K; L) r^K wL. The …rm maximizes under
perfect competition. This gives, …rst, @ =@K = FK (K; L) r^ = 0; that is,
@ [Lf (k)]
FK (K; L) = = f 0 (k) = r^: (3.20)
@K
Second, @ =@L = FL (K; L) w = 0; that is,
@ [Lf (k)]
FL (K; L) = = f (k) kf 0 (k) = w: (3.21)
@L
The interpretation is that the …rm will in every period use capital up to the point
where the marginal productivity of capital equals the rental rate given from the
market. Similarly, the …rm will employ labor up to the point where the marginal
productivity of labor equals the wage rate given from the market.
Kt d = Kt ; (3.22)
Lt d = Lt = L0 (1 + n)t ; (3.23)
where Kt is the aggregate supply of capital services and Lt the aggregate supply of
labor services. As was called attention to in Chapter 1, unless otherwise speci…ed
it is understood that the rate of utilization of each production factor is constant
over time and normalized to one. So the quantity Kt will at one and the same time
measure both the capital input, a ‡ow, and the available capital stock. Similarly,
the quantity Lt will at one and the same time measure both the labor input, a
‡ow, and the size of the labor force as a stock (= the number of young people).
The aggregate input demands, K d and Ld , are linked through the desired
capital-labor ratio, k d : In equilibrium we have Ktd =Ldt = kt d = Kt =Lt kt , by
(3.22) and (3.23). The k in (3.20) and (3.21) can thereby be identi…ed with the
ratio of the stock supplies, kt Kt =Lt > 0; which is a predetermined variable.
Interpreted this way, (3.20) and (3.21) determine the equilibrium factor prices r^t
and wt in each period. In view of the no-arbitrage condition (3.2), the real interest
rate satis…es rt = r^t , where is the capital depreciation rate, 0 1; and
so in equilibrium we end up with
where causality is from the right to the left in the two equations. In line with
our general perception of perfect competition, cf. Section 2.4 of Chapter 2, it is
understood that the factor prices, r^t and wt ; adjust quickly to the market-clearing
levels.
12
It might seem that k is overdetermined because we have two equations, (3.20) and (3.21),
but only one unknown. This reminds us that for arbitrary factor prices, r^ and w; there will not
exist a k satisfying both (3.20) and (3.21). But in equilibrium the factor prices faced by the
…rm are not arbitrary. They are equilibrium prices, i.e., they are adjusted so that (3.20) and
(3.21) become consistent.
To …x ideas we have assumed that households (here the old) own the physical
capital and rent it out to the …rms. In view of perfect competition and constant
returns to scale, pure pro…t is nil in equilibrium. As long as the model ignores
uncertainty and capital installation costs, the results will be una¤ected if instead
we let the …rms themselves own the physical capital and …nance capital investment
by issuing bonds and shares. These bonds and shares would then be accumulated
by the households and constitute their …nancial wealth instead of the capital
goods themselves. The equilibrium rate of return, rt , would be the same.
Combining this with (3.26) and using the de…nitions of k and f (k); we obtain the
dynamic resource constraint of the economy:
c2t
c1t + = f (kt ) + (1 )kt (1 + n)kt+1 : (3.27)
1+n
where consumption by the young and old, c1t and c2t , respectively, were deter-
mined in Section 3.
By de…nition, aggregate gross investment equals aggregate net investment,
N
It ; plus capital depreciation, i.e.,
It = ItN + Kt N
I1t N
+ I2t + Kt N
S1t N
+ S2t + Kt = st Lt + ( Kt ) + Kt : (3.29)
The …rst equality follows from the de…nition of net investment and the assump-
tion that capital depreciation equals Kt : Next comes an identity re‡ecting that
aggregate net investment is the sum of net investment by the young and net in-
vestment by the old. In turn, saving in this model is directly an act of acquiring
N N
capital goods. So the net investment by the young, I1t ; and the old, I2t ; are
N N
identical to their net saving, S1t and S2t ; respectively. As we have shown, the
net saving by the young in the model equals st Lt : And the net saving by the
old is negative and equals Kt : Indeed, because they have no bequest motive,
the old consume all they have and leave nothing as bequests. Hence, the young
in any period enter the period with no non-human wealth. Consequently, any
non-human wealth existing at the beginning of a period must belong to the old
in that period and be the result of their saving as young in the previous period.
As Kt constitutes the aggregate non-human wealth in our closed economy at the
beginning of period t; we therefore have
st 1 Lt 1 = Kt : (3.30)
Recalling that the net saving of any group is by de…nition the same as the increase
in its non-human wealth, the net saving of the old in period t is Kt : Aggregate
net saving in the economy is thus st Lt + ( Kt ); and (3.29).is thereby explained.
DEFINITION 2 For a given period t with capital stock Kt 0 and labor supply
e
Lt > 0; let the expected real interest rate be given as rt+1 > 1: With kt Kt =Lt ,
a temporary equilibrium in period t is a state (kt ; c1t ; c2t ; wt ; rt ) of the economy
such that (3.22), (3.23), (3.28), and (3.29) hold (i.e., all markets clear) for c1t
e
= wt st and c2t = (kt + rt kt )(1 + n); where st = s(wt ; rt+1 ); as de…ned in
Lemma 1, while wt = w(kt ) > 0 and rt = r(kt ); as de…ned in (3.25) and (3.24),
respectively.
The reason for the requirement wt > 0 in the de…nition is that if wt = 0;
people would have nothing to live on as young and nothing to save from for
retirement. The system would not be economically viable in this case. With
regard to the equation for c2t in the de…nition, note that (3.30) gives st 1 =
Kt =Lt 1 = (Kt =Lt )(Lt =Lt 1 ) = kt (1 + n); which is the wealth of each old at
the beginning of period t. Substituting into c2t = (1 + rt )st 1 , we get c2t =
(1 + rt )kt (1 + n); which can also be written c2t = (kt + rt kt )(1 + n): This last way
of writing c2t has the advantage of being applicable even if kt = 0; cf. Technical
Remark in Section 3.4. The remaining conditions for a temporary equilibrium
are self-explanatory.
PROPOSITION 1 Suppose the No Fast Assumption (A1) applies. Consider a
e
given period t with a given kt 0: Then for any rt+1 > 1;
(i) if kt > 0, there exists a temporary equilibrium, (kt ; c1t ; c2t ; wt ; rt ); and c1t and
c2t are positive;
(ii) if kt = 0, a temporary equilibrium exists if and only if capital is not essential;
in that case, wt = w(kt ) = w(0) = f (0) > 0 and c1t and st are positive (while
c2t = 0);
(iii) whenever a temporary equilibrium exists, it is unique.
Proof. We begin with (iii). That there is at most one temporary equilibrium is
immediately obvious since wt and rt are functions of the given kt : wt = w(kt )
e
and rt = r(kt ): And given wt , rt ; and rt+1 ; c1t and c2t are uniquely determined.
(i) Let kt > 0. Then, by (3.25), w(kt ) > 0: We claim that the state (kt ; c1t ; c2t ; wt ; rt );
e
with wt = w(kt ); rt = r(kt ); c1t = w(kt ) s(w(kt ); rt+1 ); and c2t = (1+r(kt ))kt (1+
n); is a temporary equilibrium. Indeed, Section 3.4 showed that the factor prices
wt = w(kt ) and rt = r(kt ) are consistent with clearing in the factor markets in
period t. Given that these markets clear (by price adjustment), it follows by Wal-
ras’law (see Appendix C) that also the third market, the goods market, clears
in period t. So all criteria in De…nition 2 are satis…ed. That c1t > 0 follows from
w(kt ) > 0 and the No Fast Assumption (A1), in view of Lemma 1. That c2t > 0
follows from c2t = (1 + r(kt ))kt (1 + n) when kt > 0; since r(kt ) > 1 always:
(ii) Let kt = 0. Suppose f (0) > 0: Then, by Technical Remark in Section 3.4,
e
wt = w(0) = f (0) > 0 and c1t = wt s(wt ; rt+1 ) is well-de…ned, positive, and less
e
than wt ; in view of Lemma 1; so st = s(wt ; rt+1 ) > 0. The old in period 0 will
starve since c2t = (0 + 0)(1 + n); in view of r(0) 0 = 0; cf. Technical Remark in
Section 3.4. Even though this is a bad situation for the old, it is consistent with
the criteria in De…nition 2. On the other hand, if f (0) = 0; we get wt = f (0) = 0;
which violates one of the criteria in De…nition 2.
Point (ii) of the proposition says that a temporary equilibrium may exist even
in a period where k = 0: The old in this period will starve and not survive. But if
capital is not essential, the young get positive labor income out of which they will
save a part for their old age and be able to maintain life also next period which
will be endowed with positive capital. Then, by our assumptions the economy is
viable forever.13
Generally, the term “equilibrium”is used to denote a state of “rest”, possibly
only “temporary rest”. The temporary equilibrium in the present model is an
example of a state of “temporary rest” in the following sense: (a) the agents
optimize, given their expectations and the constraints they face; and (b) the
aggregate demands and supplies in the given period are mutually consistent,
i.e., markets clear. The quali…cation “temporary” is motivated by two features.
First, in the next period circumstances may be di¤erent, among other things as a
consequence of the currently chosen actions. Second, the given expectations may
turn out wrong.
s (w (kt ) ; r (kt+1 ))
kt+1 = ; (3.32)
1+n
e e
using that st = s(wt ; rt+1 ); wt = w(kt ), and rt+1 = rt+1 = r (kt+1 ) in a sequence of
temporary equilibria with ful…lled expectations. Equation (3.32) is a …rst-order
di¤erence equation, known as the fundamental di¤erence equation or the law of
motion of the Diamond model.
PROPOSITION 2 Suppose the No Fast Assumption (A1) applies. Then,
13
For simplicity, the model ignores that in practice a certain minimum per capita consumption
level (the subsistence minimum) is needed for viability.
(i) for any k0 > 0 there exists at least one equilibrium path;
(ii) if k0 = 0; an equilibrium path exists if and only if f (0) > 0 (i.e., capital not
essential);
(iii) in any case, an equilibrium path has a positive real wage in all periods and
positive capital in all periods except possibly the …rst;
(iv) an equilibrium path satis…es the …rst-order di¤erence equation (3.32).
Proof. (i) and (ii): see Appendix D. (iii) For a given t; let kt 0: Then,
since an equilibrium path is a sequence of temporary equilibria, we have wt =
e e
w(kt ) > 0 and st = s(w (kt ) ; rt+1 ), where rt+1 = r (kt+1 ) : Hence, by Lemma 1,
e
s(w (kt ) ; rt+1 ) > 0; which implies kt+1 > 0; in view of (3.32). This shows that
only for t = 0 is kt = 0 possible along an equilibrium path. (iv) This was shown
in the text above.
The formal proofs of point (i) and (ii) of the proposition are placed in appendix
because they are quite technical. But the graphs in the ensuing …gures 3.4-3.7
provide an intuitive veri…cation. The “only if” part of point (ii) re‡ects the not
very surprising fact that if capital were an essential production factor, no capital
“now”would imply no income “now”, hence no saving and investment and thus no
capital in the next period and so on. On the other hand, the “if”part of point (ii)
says that when capital is not essential, an equilibrium path can set o¤ even from
an initial period with no capital. Then point (iii) adds that an equilibrium path
will have positive capital in all subsequent periods. Finally, as to point (iv), note
that the fundamental di¤erence equation, (3.32), rests on equation (3.31). Recall
from the previous subsection that the economic logic behind this key equation
is that since capital is the only non-human asset in the economy and the young
are born without any inheritance, the aggregate capital stock at the beginning of
period t + 1 must be owned by the old generation in that period. It must thereby
equal the aggregate saving these people had in the previous period where they
were young.
First, what can we say about the slope of the transition curve? In general a
point on the transition curve has the property that at least in a small neighbor-
hood of this point the equation (3.32) will de…ne kt+1 as an implicit function of
kt .14 Taking the total derivative w.r.t. kt on both sides of (3.32), we get
dkt+1 1 dkt+1
= sw w0 (kt ) + sr r0 (kt+1 ) : (3.33)
dkt 1+n dkt
By ordering, the slope of the transition curve within this small neighborhood can
be written
dkt+1 sw (w (kt ) ; r (kt+1 )) w0 (kt )
= ; (3.34)
dkt 1 + n sr (w (kt ) ; r (kt+1 )) r0 (kt+1 )
when sr (w(kt ); r(kt+1 ))r0 (kt+1 ) 6= 1+n: Since sw > 0 and w0 (kt ) = kt f 00 (kt ) > 0;
the numerator in (3.34) is always positive and we have
dkt+1 1+n
? 0 for sr (w(kt ); r(kt+1 )) ? 0 ;
dkt r (kt+1 )
respectively (recall that r0 (kt+1 ) = f 00 (kt+1 ) < 0):
Figure 3.4: Transition curve and the resulting dynamics in the log-utility Cobb-Douglas
case.
intuition behind this becomes visible by rewriting (3.34) in terms of small changes
in kt and kt+1 : Since kt+1 = kt dkt+1 =dkt for kt “small”, (3.34) implies
Let kt > 0: This rise in kt will always raise wage income and, via the resulting
rise in st ; raise kt+1 ; everything else equal. Everything else is not equal, however,
since a rise in kt+1 implies a fall in the rate of interest. There are four cases to
consider:
Case 1: sr ( ) = 0: Then there is no feedback e¤ect from the fall in the rate of
interest. So the tendency to a rise in kt+1 is neither o¤set nor forti…ed.
Case 2: sr ( ) > 0: Then the tendency to a rise in kt+1 will be partly o¤set
through the dampening e¤ect on saving resulting from the fall in the interest
rate. This negative feedback can not fully o¤set the tendency to a rise in kt+1 .
The reason is that the negative feedback on the saving of the young will only
be there if the interest rate falls in the …rst place. We cannot in a period have
both a fall in the interest rate triggering lower saving and a rise in the interest
rate (via a lower kt+1 ) because of the lower saving. So a su¢ cient condition for
a universally upward-sloping transition curve is that the saving of the young is a
non-decreasing function of the interest rate.
Case 3: (1 + n)=r0 (kt+1 ) < sr ( ) < 0: Then the tendency to a rise in kt+1 will
be forti…ed through the stimulating e¤ect on saving resulting from the fall in the
interest rate.
Case 4: sr ( ) < (1 + n)=r0 (kt+1 ) < 0: Then the expression in brackets on
the left-hand side of (*) is negative and requires therefore that kt+1 < 0 in
order to comply with the positive right-hand side. This is a situation of multiple
temporary equilibria, a situation where self-ful…lling expectations operate. We
shall explore this case in the next sub-section.
Another feature of the transition curve is the following:
LEMMA 2 (the transition curve is nowhere ‡at) For all kt > 0; dkt+1 =dkt 6= 0:
Proof. Since sw > 0 and w0 (kt ) > 0 always, the numerator in (3.34) is always
positive.
The implication is that no part of the transition curve can be horizontal.15
When the transition curve crosses the 45 degree line for some kt > 0, as in
the example in Fig. 3.4, we have a steady state at this kt : Formally:
DEFINITION 4 An equilibrium path f(kt ; c1t ; c2t )g1
t=0 is in a steady state with
capital-labor ratio k > 0 if the fundamental di¤erence equation, (3.32), is satis-
…ed with kt as well as kt+1 replaced by k .
15
This would not necessarily hold if the utility function were not time-separable.
by the young, s(wt ; r(k 00 ); will be such that kt+1 = k 00 ; that is, the expectation
is ful…lled. The fact that also the point (kt ; k 0 ); where k 0 > k 00 , is on transition
curve indicates that also a lower interest rate, r(k 0 ); can be self-ful…lling. By this
is meant that if an interest rate at the level r(k 0 ) is expected, then this expecta-
tion induces more saving by the young, just enough more to make kt+1 = k 0 > k 00 ,
thus con…rming the expectation of the lower interest rate level r(k 0 ): What makes
e
this possible is exactly the negative dependency of st on rt+1 : The fact that also
000 000 00
the point (kt ; k ); where k < k , is on the transition curve can be similarly
interpreted. It is also sr < 0 that makes it possible that less saving by the young
than at P can be induced by an expected higher interest rate, r(k 000 ); than at P.
These ambiguities point to a serious problem with the assumption of perfect
foresight. The model presupposes that all the young agree in their expectations.
Only then will one of the three mentioned temporary equilibria appear. But the
model is silent about how the needed coordination of expectations is brought
about, and if it is, why this coordination ends up in one rather than another of
the three possible equilibria with self-ful…lling expectations. Each single young is
isolated in the market and will not know what the others will expect. The market
mechanism in the model provides no coordination of expectations. As it stands,
the model cannot determine how the economy will evolve in this situation.
This is of course a weakness. Yet the encountered phenomenon itself that
multiple self-ful…lling equilibrium paths are theoretically possible is certainly
of interest and plays an important role in certain business cycle theories of booms
and busts.
For now we plainly want to circumvent non-uniqueness. There are at least
two ways to rule out the possibility of multiple equilibrium paths. One simple
approach is to discard the assumption of perfect foresight. Instead, some kind
of adaptive expectations may be assumed, for example in the form of myopic
foresight, also called static expectations. This means that the expectation formed
by the agents in the current period about the value of a variable next period
is that it will stay the same as in the current period. So here the assumption
e
would be that the young have the expectation rt+1 = rt . Then, given k0 > 0;
a unique sequence of temporary equilibria f(kt ; c1t ; c2t ; wt ; rt )g1
t=0 is generated by
the model. Oscillations in the sense of repetitive movements up and down of kt
are possible. Even chaotic trajectories are possible (see Exercise 3.6).
Outside steady state the agents will experience that their expectations are
systematically wrong. And the assumption of myopic foresight rules out that
learning occurs. This may be too simplistic, although it can be argued that
human beings to a certain extent have a psychological disposition to myopic
foresight.
Another approach to the indeterminacy problem in the Diamond model is
(a) If 0 < 1; then (A2) holds for all kt > 0 along an equilibrium path.
1 1
for all k > 0: In turn, su¢ cient for this is that (1 ) >1 :
That (a) is su¢ cient for (A2) is immediately visible in (3.15). The su¢ ciency
of (b) is proved in Appendix D. The elasticity of substitution between capital
and labor is a concept analogue to the elasticity of intertemporal substitution
in consumption. It is a measure of the sensitivity of the chosen k = K=L with
respect to the relative factor price. The next chapter goes more into detail with
the concept and shows, among other things, that the Cobb-Douglas production
function corresponds to = 0: So the Cobb-Douglas production function will
1
satisfy the inequality (1 ) 1>1 (since > 0); hence also the inequality
(3.36).
With these or other su¢ cient conditions in the back of our mind we shall now
proceed imposing the Positive Slope Assumption (A2). To summarize:
PROPOSITION 3 (uniqueness of an equilibrium path) Suppose the No Fast and
Positive Slope assumptions, (A1) and (A2), apply. Then, if k0 > 0, there exists
a unique equilibrium path.
(i) if k0 > 0, there exists a unique equilibrium path;
(ii) if k0 = 0; an equilibrium path exists if and only if f (0) > 0 (i.e., capital not
essential).
When the conditions of Proposition 3 hold, the fundamental di¤erence equa-
tion, (3.32), of the model de…nes kt+1 as an implicit function of kt ;
kt+1 = '(kt );
for all kt > 0; where '(kt ) is called a transition function. The derivative of this
implicit function is given by (3.34) with kt+1 on the right-hand side replaced by
'(kt ); i.e.,
sw (w (kt ) ; r ('(kt ))) w0 (kt )
'0 (kt ) = > 0: (3.37)
1 + n sr (w (kt ) ; r ('(kt ))) r0 ('(kt ))
The positivity for all kt > 0 is due to (A2). Example 2 above leads to a transition
function.
17
CES stands for Constant Elasticity of Substitution. CES production functions are consid-
ered in detail in Chapter 4.
Proof. Since f 0 > 0; the roof has positive slope. Since f 00 < 0; it can only cross
the 45 line once and only from above. If and only if limk!0 f 0 (k) > 1 + n, then
for small kt , the roof is steeper than the 45 line. Obviously, if f (0) > 0; then
close to the origin, the roof will be above the 45 line.
Figure 3.6: A case where both the roof and the ceiling cross the 45 line, but the
transition curve does not (no steady state exists).
LEMMA 5 Given w(k) = f (k) f 0 (k)k for all k 0, where f (k) satis…es
f (0) 0, f 0 > 0; f 00 < 0; the following holds:
(i) limk!1 w(k)=k = 0;
(ii) the ceiling, C(k) w(k)=(1 + n); is positive and has positive slope for all
k > 0; moreover, there exists k > 0 such that C(k) < k for all k > k:
Proof. (i) In view of f (0) 0 combined with f 00 < 0; we have w(k) > 0 for
all k > 0. Hence, limk!1 w(k)=k 0 if this limit exists. Consider an arbitrary
k1 > 0: We have f 0 (k1 ) > 0: For all k > k1 ; it holds that 0 < f 0 (k) < f 0 (k1 ); in
view of f 0 > 0 and f 00 < 0; respectively. Hence, limk!1 f 0 (k) exists and
We have
w(k) f (k)
lim = lim lim f 0 (k): (3.39)
k!1 k k!1 k k!1
There are two cases to consider. Case 1: f (k) has an upper bound. Then,
limk!1 f (k)=k = 0 so that limk!1 w(k)=k = limk!1 f 0 (k) = 0; by (3.39)
and (3.38), as w(k)=k > 0 for all k > 0. Case 2: limk!1 f (k) = 1: Then,
by L’Hôpital’s rule for “1=1”, limk!1 (f (k)=k) = limk!1 f 0 (k) so that (3.39)
implies limk!1 w(k)=k = 0:
(ii) As n > 1 and w(k) > 0 for all k > 0; C(k) > 0 for all k > 0: From
w0 (k) = kf 00 (k) > 0 follows C 0 (k) = kf 00 (k)=(1 + n) > 0 for all k > 0; that is,
the ceiling has positive slope everywhere. For k > 0; the inequality C(k) < k is
equivalent to w(k)=k < 1 + n. By (i) follows that for all " > 0; there exists k" > 0
such that w(k)=k < " for all k > k" : Now, letting " = 1 + n and k = k" proves
that there exists k > 0 such that w(k)=k < 1 + n for all k > k:
While the roof can be above the 45 line for all kt > 0; the ceiling can not.
Indeed, (ii) of the lemma implies that if for small kt the ceiling is above the 45
line, the ceiling will necessarily cross the 45 line at least once for larger kt :
In view of the ceiling being always an upper bound on kt+1 ; what is the point
of introducing also the roof? The point is that the roof is a more straightforward
construct since it is directly given by the production function and is always strictly
concave. The ceiling is generally a more complex construct. It can have convex
sections and for instance cross the 45 line at more than one point if at all. .
A necessary condition for existence of a (non-trivial) steady state is that the
roof is above the 450 line for small kt : But this is not su¢ cient for also the
transition curve to be above the 450 line for small kt . Fig. 3.6 illustrates this. Here
the transition curve is in fact everywhere below the 450 line. In this case no steady
state exists and the dynamics imply convergence towards the “catastrophic”point
(0; 0): Given the rate of population growth, the saving of the young is not su¢ cient
to avoid famine in the long run. This will for example happen if the technology
implies so low productivity that even if all income of the young were saved, we
would have kt+1 < kt for all kt > 0; cf. Exercise 3.2. The Malthusian mechanism
will be at work and bring down n (outside the model). This exempli…es that even
a trivial steady state (the point (0,0)) may be of interest in so far as it may be
the point the economy is heading to without ever reaching it.
To help existence of a steady state we will impose the condition that either
capital is not essential or preferences and technology …t together in such a way
that the slope of the transition curve is larger than one for small kt . That is, we
assume that either
(i) f (0) > 0 or (A3)
(ii) lim '0 (k) > 1;
k!0
where '0 (k) is implicitly given in (3.37). Whether condition (i) of (A3) holds in
a given situation can be directly checked from the production function. If it does
not, we should check condition (ii). But this condition is less amenable because
the transition function ' is not one of the “primitives” of the model. There
exist cases, though, where we can …nd an explicit transition function and try out
whether (ii) holds (like in Example 2 above). But generally we can not. Then we
have to resort to su¢ cient conditions for (ii) of (A3), expressed in terms of the
“primitives”. For example, if the period utility function belongs to the CRRA
class and the production function is Cobb-Douglas at least for small k, then (ii)
of (A3) holds (see Appendix E). Anyway, as (i) and (ii) of (A3) can be interpreted
as re‡ecting two di¤erent kinds of “early steepness” of the transition curve, we
shall call (A3) the Early Steepness Assumption.18
PROPOSITION 4 (existence and stability of a steady state) Assume that the
No Fast Assumption (A1) and the Positive Slope assumption (A2) apply as well
as the Early Steepness Assumption (A3). Then there exists at least one steady
state k > 0 that is locally asymptotically stable. Oscillations do not occur.
Proof. By (A1), Lemma 3 applies. From Proposition 2 we know that if (i) of
(A3) holds, then kt+1 = st =(1 + n) > 0 even for kt = 0: Alternatively, (ii) of (A3)
is enough to ensure that the transition curve lies above the 45 line for small kt :
By Lemma 4 the roof then does the same. According to (ii) of Lemma 5, for
large kt the ceiling is below the 45 line. Being below the ceiling, cf. Lemma
3, the transition curve must therefore cross the 45 line at least once. Let k
denote the smallest kt at which it crosses. Then k > 0 is a steady state with the
property 0 < '0 (k ) < 1: By graphical inspection we see that this steady state
is asymptotically stable. For oscillations to come about there must exist a steady
state, k ; with '0 (k ) < 0; but this is impossible in view of (A2).
From Proposition 4 we conclude that, given k0 ; the assumptions (A1) - (A3)
ensure existence and uniqueness of an equilibrium path; moreover, the equilibrium
path converges towards some steady state. Thus with these assumptions, for any
k0 > 0; sooner or later the system settles down at some steady state k > 0. For
the factor prices we therefore have
rt = f 0 (kt ) ! f 0 (k ) r ; and
0
wt = f (kt ) kt f (kt ) ! f (k ) k f 0 (k ) w ;
for t ! 1: But there may be more than one steady state and therefore only
local stability is guaranteed. This can be shown by examples, where the utility
function, the production function, and parameters are speci…ed in accordance
with the assumptions (A1) - (A3) (see Exercise 3.5 and ...).
18
In (i) of (A3), the “steepness” is rather a “hop” at k = 0 if we imagine k approaching nil
from below.
Figure 3.7: A case of multiple steady states (and capital being not essential).
Fig. 3.7 illustrates such a case (with f (0) > 0 so that capital is not essential).
Moving West-East in the …gure, the …rst steady state, k1 ; is stable, the second,
k2 ; unstable, and the third, k3 ; stable. In which of the two stable steady states
the economy ends up depends on the initial capital-labor ratio, k0 : The lower
steady state, k1 ; is known as a poverty trap. If 0 < k0 < k2 ; the economy is
caught in the trap and converges to the low steady state. But with high enough
k0 (k0 > k2 ); perhaps obtained by foreign aid, the economy avoids the trap and
converges to the high steady state. Looking back at Fig. 3.6, we can interpret
that …gure’s scenario as exhibiting an inescapable poverty trap.
It turns out that CRRA utility combined with a Cobb-Douglas production
function ensures both that (A1) - (A3) hold and that a unique (non-trivial)
steady state exists. So in this case global asymptotic stability of the steady state
is ensured.19 Example 2 and Fig. 3.4 above display a special case of this, the
case = 1:
This is of course a convenient case for the analyst. A Diamond model sat-
isfying assumptions (A1) - (A3) and featuring a unique steady state is called a
well-behaved Diamond model.
We end this section with the question: Is it possible that aggregate consump-
tion, along an equilibrium path, for some periods exceeds aggregate income? We
19
See last section of Appendix E.
shall see that this is indeed possible in the model if K0 (wealth of the old in the
initial period) is large enough. Indeed, from national accounting we have:
1=(1 )
(1 )A
K0 > :
(2 + )(1 )
saving maintains a capital-labor ratio above the golden-rule value forever. Let us
consider these concepts in detail.
In the present section generally the period length is arbitrary except when
we relate to the Diamond model and the period length therefore is half of adult
lifetime.
Then (3.42) has a solution in k; and it is unique because c00 (k) < 0. The solution
is called kGR so that
f 0 (kGR ) = n:
That is:
PROPOSITION 5 (the golden rule) The highest sustainable consumption level
per unit of labor in society is obtained when in steady state the net marginal
productivity of capital equals the growth rate of the economy.
with, given that you must hand over the same capital-labor ratio to the next
generation?”The appropriate answer is: the golden-rule capital-labor ratio.
r T n , f 0 (k ) T n , k S kGR ; respectively,
in view of f 00 < 0: Hence, a long-run interest rate below the growth rate of the
economy indicates that k > kGR : This amounts to a Pareto-inferior state of
a¤airs. Indeed, everyone can be made better o¤ if by a coordinated reduction of
saving and investment, k is reduced. A formal demonstration of this is given in
connection with Proposition 6 in the next subsection. Here we give an account
in more intuitive terms.
Consider Fig. 3.8. Let k be gradually reduced to the level kGR by refrain-
ing from investment in period t0 and forward until this level is reached. When
this happens, let k be maintained at the level kGR forever by providing for the
needed investment per young, ( + n)kGR : Then there would be higher aggregate
consumption in period t0 and every future period. Both the immediate reduction
of saving and a resulting lower capital-labor ratio to be maintained contribute to
this result. There is thus scope for both young and old to consume more in every
future period.
In the Diamond model a simple policy implementing such a Pareto improve-
ment in the case where k > kGR (i.e., r < n) is to incur a lump-sum tax on
the young, the revenue of which is immediately transferred lump sum to the old,
hence, fully consumed. Suppose this amounts to a transfer of one good from each
young to the old. Since there are 1 + n young people for each old person, every
old receives in this way 1 + n goods in the same period. Let this transfer be
repeated every future period. By decreasing their saving by one unit, the young
can maintain unchanged consumption in their youth, and when becoming old,
they receive 1 + n goods from the next period’s young and so on. In e¤ect, the
“return” on the tax payment by the young is 1 + n next period. This is more
than the 1 + r that could be obtained via the market through own saving.21
21
In this model with no utility of leisure, a tax on wage income, or a mandatory pay-as-you-go
pension contribution (see Chapter 5) would act like a lump-sum tax on the young.
The described tax-transfers policy will a¤ect the equilibrium interest rate negatively. By
choosing an appropriate size of the tax this policy, combined with competitive markets, will
under certain conditions (see Chapter 5.1) bring the economy to the golden-rule steady state
where overaccumulation has ceased and r = n:
A proof that k > kGR is indeed theoretically possible in the Diamond model
can be based on the log utility-Cobb-Douglas case from Example 2 in Section
3.5.3. As indicated by the formula for r in that example, the outcome r < n,
which is equivalent to k > kGR , can always be obtained by making the parameter
2 (0; 1) in the Cobb-Douglas function small enough. The intuition is that a
small implies a high 1 ; that is, a high wage income wL = (1 )K L L
= (1 )Y ; this leads to high saving by the young, since sw > 0: The result is
a high kt+1 which generates a high real wage also next period and may in this
manner be sustained forever.
An intuitive understanding of the fact that the perfectly competitive market
mechanism may thus lead to overaccumulation, can be based on the following
argument. Assume, …rst, that sr < 0. In this case, if the young in period t
expects the rate of return on their saving to end up small (less than n), the
decided saving will be large in order to provide for consumption after retirement.
But the aggregate result of this behavior is a high kt+1 and therefore a low f 0 (kt+1 ):
In this way the expectation of a low rt+1 is con…rmed by the actual events. The
young persons each do the best they can as atomistic individuals, taking the
market conditions as given. Yet the aggregate outcome is an equilibrium with
overaccumulation, hence a Pareto-inferior outcome.
Looking at the issue more closely, we see that sr < 0 is not crucial for this
outcome. Suppose sr = 0 (the log utility case) and that in the current period,
kt is, for some historical reason, at least temporarily considerably above kGR .
Thus, current wages are high, hence, st is relatively high (there is in this case no
o¤setting e¤ect on st from the relatively low expected rt+1 ): Again, the aggregate
result is a high kt+1 and thus the expectation is con…rmed. Consequently, the
situation in the next period is the same and so on. By continuity, even if sr > 0;
the argument goes through as long as sr is not too large.
Consider a technically feasible path f(ct ; kt )g1 t=0 with kt ! k for t ! 1 (the
reference path): Then there exists a t0 such that for t t0 ; kt 2 (k "; k + ");
fn0 (kt ) o < n and f 0 (kt ") < n: Consider an alternative feasible path
1
(^ct ; k^t ) ; where a) for t = t0 consumption is increased relative to the reference
t=0
path such that k^t0 +1 = kt0 "; and b) for all t > t0 ; consumption is such that
k^t+1 = kt ": We now show that after period t0 ; c^t > ct : Indeed, for all t > t0 , by
(3.40),
by (3.40).
Moreover, it can be shown22 that:
PROPOSITION 7 A technically feasible path f(ct ; kt )g1
t=0 such that for t ! 1;
kt ! k kGR ; is dynamically e¢ cient.
Accordingly, a steady state with k < kGR is never dynamically ine¢ cient.
This is because increasing k from this level always has its price in terms of a
decrease in current consumption; and at the same time decreasing k from this
level always has its price in terms of lost future consumption. But a steady state
with k > kGR is always dynamically ine¢ cient. Intuitively, staying forever with
k = k > kGR ; implies that society never enjoys its great capacity for producing
consumption goods.
The fact that k > kGR ; and therefore dynamic ine¢ ciency, cannot be ruled
out might seem to contradict the First Welfare Theorem from the microeconomic
theory of general equilibrium. This is the theorem saying that under certain con-
ditions (essentially that increasing returns to scale are absent are absent, markets
are competitive, no goods are of public good character, and there are no exter-
nalities, then market equilibria are Pareto optimal. In fact, however, the First
Welfare Theorem also presupposes a …nite number of periods or, if the number of
periods is in…nite, then a …nite number of agents. In contrast, in the OLG model
22
See Cass (1972).
there is a double in…nity: an in…nite number of periods and agents. Hence, the
First Welfare Theorem breaks down. Indeed, the case r < n, i.e., k > kGR ; can
arise under laissez-faire. Then, as we have seen, everyone can be made better
o¤ by a coordinated intervention by some social arrangement (a government for
instance) such that k is reduced:
The essence of the matter is that the double in…nity opens up for technically
feasible reallocations which are de…nitely bene…cial when r < n and which a
central authority can accomplish but the market can not. That nobody need
loose by the described kind of redistribution is due to the double in…nity: the
economy goes on forever and there is no last generation. Nonetheless, some kind
of centralized coordination is required to accomplish a solution.
There is an analogy in “Gamow’s bed problem”: There are an in…nite number
of inns along the road, each with one bed. On a certain rainy night all innkeepers
have committed their beds. A late guest comes to the …rst inn and asks for a
bed. “Sorry, full up!” But the minister of welfare hears about it and suggests
that each incumbent guest move down the road one inn.23
Whether the theoretical possibility of overaccumulation should be a matter of
practical concern is an empirical question about the relative size of rates of return
and economic growth. To answer the question meaningfully, we need an extension
of the criterion for overaccumulation so that the presence of technological progress
and rising per capita consumption in the long run can be taken into account. This
is one of the topics of the next chapter. At any rate, we can already here reveal
that there exists no indication that overaccumulation has ever been an actual
problem in industrialized market economies.
A …nal remark before concluding. Proposition 5 about the golden rule can be
generalized to the case where instead of one there are n di¤erent capital goods in
the economy. Essentially the generalization says that assuming CRS-neoclassical
production functions with n di¤erent capital goods as inputs, one consumption
good, no technological change, and perfectly competitive markets, a steady state
in which per-unit-of labor consumption is maximized has interest rate equal to
the growth rate of the labor force when technological progress is ignored (see,
e.g., Mas-Colell, 1989).
them in exchange for some of their goods in order to use them in the next period
for buying from the new young generation and so on. We have here an example of
how a social institution can solve a coordination problem. This gives a ‡avour of
Samuelson’s contribution although in his original article he assumed three periods
of life.
2. Diamond (1965) extended Samuelson’s contribution by adding capital ac-
cumulation. Because of its antecedents Diamonds OLG model is sometimes called
the Samuelson-Diamond model or the Allais-Samuelson-Diamond model. In our
exposition we have drawn upon clari…cations by Galor and Ryder (1989) and
de la Croix and Michel (2002). The last mentioned contribution is an extensive
exploration of discrete-time OLG models and their applications.
3. The life-cycle saving hypothesis was put forward by Franco Modigliani
(1918-2003) and associates in the 1950s. See for example Modigliani and Brum-
berg (1954). Numerous extensions of the framework, relating to the motives (b)
- (e) in the list of Section 3.1, see for instance de la Croix and Michel (2002).
4. A review of the empirics of life-cycle behavior and attempts at re…ning
life-cycle models are given in Browning and Crossley (2001).
5. Regarding the dynamic e¢ ciency issue, both the propositions 6 and 7 were
shown in a stronger form by the American economist David Cass (1937-2008).
Cass established the general necessary and su¢ cient condition for a feasible path
f(ct ; kt )g1
t=0 to be dynamically e¢ cient (Cass 1972). Our propositions 6 and 7 are
more restrictive in that they are limited to paths that converge. Partly intuitive
expositions of the deeper aspects of the theory are given by Shell (1971) and
Burmeister (1980).
6. Diamond has also contributed to other …elds of economics, including search
theory for labor markets. In 2010 Diamond, together with Dale Mortensen and
Christopher Pissarides, was awarded the Nobel price in economics.
From here very incomplete:
The two-period structure of Diamonds OLG model leaves little room for con-
sidering, e.g., education and dissaving in the early years of life. This kind of
issues is taken up in three-period extensions of the Diamond model, see ...
Multiple equilibria, self-ful…lling expectations, optimism and pessimism..
Dynamic ine¢ ciency, see also Burmeister (1980).
Bewley 1977, 1980.
Two-sector OLG: Galor (1992). Galor’s book??
On the golden rule in a general setup, see Mas-Colell (1989).
3.9 Appendix
A. On the CRRA utility function
Derivation of the CRRA function Consider a utility function u(c); de…ned
for all c > 0 and satisfying u0 (c) > 0; u00 (c) < 0: Let the absolute value of
the elasticity of marginal utility be denoted (c); that is, (c) cu00 (c)=u0 (c)
> 0: We claim that if (c) is a positive constant, ; then up to a positive linear
transformation u(c) must be of the form
c1
; when 6= 1;
u(c) = 1 (*)
ln c; when = 1;
The range of the CRRA function Considering the CRRA function u(c)
c1 1 (1 ) 1 for c 2 [0; 1) ; we have:
Thus, in the latter case u(c) is bounded from above and so allows asymptotic
“saturation”to occur.
Referring to Section 3.3, we here show that the de…nition of (c1 ; c2 ) in (3.17)
gives the result (3.18). Let x c2 =c1 and (1 + ) 1 : Then the …rst-order
condition (3.16) and the equation describing the considered indi¤erence curve
constitute a system of two equations
For a …xed utility level U = U these equations de…ne c1 and x as implicit functions
of R; c1 = c(R) and x = x(R). We calculate the total derivative w.r.t. R in both
equations and get, after ordering,
[u00 (c1 ) Ru00 (xc1 )x] c0 (R) Ru00 (xc1 )c1 x0 (R)
= u0 (xc1 ); (3.44)
[u0 (c1 ) + u0 (xc1 )x] c0 (R) = u0 (xc1 )c1 x0 (R): (3.45)
Substituting c0 (R) from (3.45) into (3.44) and ordering now yields
R 0 x+R
x (R) = :
x x (c1 ) + R (xc1 )
Finally, in view of xc1 = c2 and the de…nition of (c1 ; c2 ), this gives (3.18).
C. Walras’law
In the proof of Proposition 1 we referred to Walras’law. Here is how Walras’law
works in each period in a model like this. We consider period t; but for simplicity
we skip the time index t on the variables. There are three markets, a market
for capital services, a market for labor services, and a market for output goods.
Suppose a “Walrasian auctioneer”calls out the price vector (^ r; w; 1); where r^ > 0
and w > 0; and asks all agents, i.e., the young, the old, and the representative
…rm, to declare their supplies and demands.
The supplies of capital and labor are by assumption inelastic and equal to
K units of capital services and L units of labor services. But the demand for
capital and labor services depends on the announced r^ and w: Let the potential
pure pro…t of the representative …rm be denoted : If r^ and w are so that < 0;
the …rm declares K d = 0 and Ld = 0: If on the other hand at the announced r^
and w; = 0 (as when r^ = r(k) + and w = w(k)); the desired capital-labor
ratio is given as k d = f 0 1 (^
r) from (3.20), but the …rm is indi¤erent w.r.t. the
absolute level of the factor inputs. In this situation the auctioneer tells the …rm
to declare Ld = L (recall L is the given labor supply) and K d = k d Ld which is
certainly acceptable for the …rm. Finally, if > 0; the …rm is tempted to declare
in…nite factor demands, but to avoid that, the auctioneer imposes the rule that
the maximum allowed demands for capital and labor are 2K and 2L; respectively:
Within these constraints the factor demands will be uniquely determined by r^
and w and we have
= r; w; 1) = F (K d ; Ld )
(^ r^K d wLd : (3.46)
The owners of both the capital stock K and the representative …rm must be
those who saved in the previous period, namely the currently old. These elderly
will together declare the consumption c2 L 1 = (1 + r^ )K + and the net
investment K (which amounts to disinvestment). The young will declare the
e e
consumption c1 L = wL s(w; r+1 )L and the net investment sL = s(w; r+1 )L: So
e
aggregate declared consumption will be C = (1 + r^ )K + + wL s(w; r+1 )L
e
and aggregate net investment I K = s(w; r+1 )L K: It follows that C + I
= wL + r^K + : The aggregate declared supply of output is Y s = F (K d ; Ld ):
The values of excess demands in the three markets now add to
Z(^
r; w; 1) w(Ld L) + r^(K d K) + C + I Y s
= wLd wL + r^K d r^K + wL + r^K + F (K d ; Ld )
= wLd + r^K d + F (K d ; Ld ) = 0;
by (3.46).
s (w; r (k))
= 1 + n: (3.47)
k
Proof. Note that 1 + n > 0: From Lemma 1 in Section 3.3 follows that for all
possible values of r(k); 0 < s(w; r(k)) < w: Hence, for any k > 0;
s (w; r (k)) w
0< < :
k k
Letting k ! 1 we then have s (w; r (k)) =k ! 0 since s (w; r (k)) =k is squeezed
between 0 and 0 (as indicated in the two graphs in Fig. 3.9).
where the second equality comes from the fact that we are in case 2 and the
third comes from (3.48). But since u0 (c) > 0 and u00 (c) < 0 for all c > 0;
limk!0 u0 (c(w; k)) = 0 requires limk!0 c(w; k) = 1, as was to be shown:
In both Case 1 and Case 2 we thus have that k ! 0 implies s (w; r (k)) =k !
1. Since s (w; r (k)) =k is a continuous function of k; there must be at least one
k > 0 such that (3.47) holds (as illustrated by the two graphs in Fig. 3.14).
Now, to prove (i) of Proposition 2, consider an arbitrary kt > 0: We have
w(kt ) > 0: In (3.47), let w = w(kt ). By Lemma C1, (3.47) has a solution k > 0.
Set kt+1 = k: Starting with t = 0; from a given k0 > 0 we thus …nd a k1 > 0 and
letting t = 1; from the now given k1 we …nd a k2 and so on. The resulting in…nite
sequence fkt g1t=0 is an equilibrium path. In this way we have proved existence of
an equilibrium path if k0 > 0: Thereby (i) of Proposition 2 is proved:
But what if k0 = 0? Then, if f (0) = 0; no temporary equilibrium is possible in
period 0, in view of (ii) of Proposition 1; hence there can be no equilibrium path.
Suppose f (0) > 0: Then w(k0 ) = w(0) = f (0) > 0; as explained in Technical
Remark in Section 3.4. Let w in equation (3.47) be equal to f (0): By Lemma
C1 this equation has a solution k > 0. Set k1 = k: Letting period 1 be the new
initial period, we are back in the case with initial capital positive. This proves
(ii) of Proposition 2.
then implies
w
= k 1 + (1 + ) R(k)1 h(k); (3.49)
1+n
where R(k) 1 + r(k) 1 + f 0 (k) > 0 for all k > 0: It remains to provide a
0
su¢ cient condition for obtaining h (k) > 0 for all k > 0: We have
since (k) kR0 (k)=R(k) > 0; the sign being due to R0 (k) = f 00 (k) < 0: So
h (k) > 0 if and only if 1 (1 ) (k) > (1+ ) R(k) 1 ; a condition equivalent
0
to
1 1
> : (3.51)
(k) 1 + (1 + ) R(k) 1
To make this condition more concrete, consider the CES production function
Transition curve steep for k small Here we specialize further and consider
the CRRA-Cobb-Douglas case: u(c) = (c1 1)=(1 ); > 0; and f (k) = Ak ,
A > 0; 0 < < 1. In the prelude to Proposition 4 in Section 3.5 it was claimed
that if this combined utility and technology condition holds at least for small k;
then (ii) of (A3) is satis…ed. We now show this.
Letting ! 0 in (3.52) gives the Cobb-Douglas function f (k) = Ak (this
is proved in the appendix to Chapter 4). With = 0; clearly (1 ) 1 = 1
1
>1 ; where > 0: This inequality implies that (*) above holds and
so the transition curve is positively sloped everywhere. As an implication there
is a transition function, '; such that kt+1 = '(kt ); '0 (kt ) > 0: Moreover, since
f (0) = 0; we have, by Lemma 5, limkt !0 '(kt ) = 0:
Given the imposed CRRA utility, the fundamental di¤erence equation of the
model is
w(kt )
kt+1 = (3.54)
(1 + n) [1 + (1 + ) R(kt+1 )1 ]
or, equivalently,
w(kt )
h(kt+1 ) = ;
1+n
where h(kt+t ) is de…ned as in (3.49). By implicit di¤erentiation we …nd h0 (kt+1 )'0 (kt )
= w0 (kt )=(1 + n); i.e.,
w0 (kt )
'0 (kt ) = > 0:
(1 + n)h0 (kt+1 )
If k > 0 is a steady-state value of kt ; (3.54) implies
w(k )
1 + (1 + ) R(k )1 = ; (3.55)
(1 + n)k
and the slope of the transition curve at the steady state will be
w0 (k )
'0 (k ) = > 0: (3.56)
(1 + n)h0 (k )
If we can show that such a k > 0 exists, is unique, and implies '0 (k ) < 1; then
the transition curve crosses the 45 line from above, and so (ii) of (A3) follows in
view of limkt !0 = 0.
De…ning x(k) f (k)=k = Ak 1 ; where x0 (k) = ( 1)Ak 2 < 0; and using
that f (k) = Ak ; we have R(k) = 1 + x(k) and w(k)=k = (1 )x(k):
Hence, (3.55) can be written
1
1 + (1 + ) (1 + x )1 = x; (3.57)
1+n
where x = x(k ): It is easy to show graphically that this equation has a unique
solution x > 0 whether < 1; = 1; or > 1: Then k = (x =A)1=( 1) > 0 is
also unique.
By (3.50) and (3.57),
1 1
h0 (k ) = 1 + ( x 1) [1 (1 ) (k )] > 1 + ( x 1)(1 (k ))
1+n 1+n
1
1+( x 1) ;
1+n
where the …rst inequality is due to > 0 and the second to the fact that (k)
1 in view of (3.53) with = 0 and (k) = : Substituting this together with
w0 (k ) = (1 ) x into (3.56) gives
x
0 < '0 (k ) < < 1; (3.58)
1+n+ x
as was to be shown.
3.10 Exercises
3.1 The dynamic accounting relation for a closed economy is
Kt+1 = Kt + S N (*)
where Kt is the aggregate capital stock and StN is aggregate net saving. In the
Diamond model, let S1t be aggregate net saving of the young in period t and
S2t aggregate net saving of the old in the same period. On the basis of (*)
give a direct proof that the link between two successive periods takes the form
kt+1 = st =(1+n); where st is the saving of each young, n is the population growth
rate, and kt+1 is the capital/labor ratio at the beginning of period t + 1. Hint:
by de…nition, the increase in …nancial wealth is the same as net saving (ignoring
gifts).
3.2 Suppose the production function in Diamond’s OLG model is Y = A( K +
(1 )L )1= ; A > 0; 0 < < 1; < 0; and A 1= < 1+n. a) Given k K=L; …nd
the equilibrium real wage, w(k): b) Show that w(k) < (1 + n)k for all k > 0: Hint:
consider the roof. c) Comment on the implication for the long-run evolution of
the economy. Hint: consider the ceiling.
3.3 (multiple temporary equilibria with self-ful…lling expectations) Fig. 3.10
shows the transition curve for a Diamond OLG model with u(c) = c1 =(1 );
p 1=p
= 8; = 0:4; n = 0:2; = 0:6, f (k) = A(bk + 1 b) ; A = 7; b = 0:33;
p = 0:4:
a) Let t = 0: For a given k0 slightly below 1, how many temporary equilibria
with self-ful…lling expectations are there?
b) Suppose the young in period 0 expect the real interest rate on their saving
to be relatively low. Describe by words the resulting equilibrium path in
this case. Comment (what is the economic intuition behind the path?).
c) In the …rst sentence under b), replace “low”by “high”. How is the answer
to b) a¤ected? What kind of di¢ culty arises?
3.4 (plotting the transition curve by MATLAB) This exercise requires compu-
tation on a computer. You may use MATLAB OLG program.27
27
Made by Marc P. B. Klemp and available at the address:
Figure 3.10: Transition curve for Diamond’s OLG model in the case described in Ex-
ercise 3.3.
a) Enter the model speci…cation from Exercise 3.3 and plot the transition
curve.
b) Plot examples for two other values of the substitution parameter: p = 1:0
and p = 0:5: Comment.
c) Find the approximate largest lower bound for p such that higher values of
p eliminates multiple equilibria.
e) The empirical evidence for industrialized countries suggests that 0:4 < <
1:0: Is your from d) empirically realistic? Comment.
3.5 (one stable and one unstable steady state) Consider the following Diamond
model: u(c) = ln c; = 2:3; n = 2:097; = 1:0; f (k) = A(bk p + 1 b)1=p ; A = 20;
b = 0:5; p = 1:0:
http://www.econ.ku.dk/okocg/Computation/main.htm.
a) Plot the transition curve of the model. Hint: you may use either a program
like MATLAB OLG Program (available on the course website) or …rst a
little algebra and then Excel (or similar simple software).
b) Comment on the result you get. Will there exist a poverty trap? Why or
why not?
d) Brie‡y discuss how your results in c) comply with your knowledge of cor-
responding empirical magnitudes in industrialized Western countries?
e) There is one feature which this model, as a long-run model, ought to incor-
porate, but does not. Extend the model, taking this feature into account,
and write down the fundamental di¤erence equation for the extended model
in algebraic form.
f) Plot the new transition curve. Hint: given the model speci…cation, this
should be straightforward if you use Excel (or similar); and if you use MAT-
LAB OLG Program, note that by a simple “trick”you can transform your
new model into the “old”form.
g) The current version of the MATLAB OLG Program is not adapted to this
question. So at least here you need another approach, for instance based on
a little algebra and then Excel (or similar simple software). Given k0 = 10;
calculate numerically the time path of kt and plot the time pro…le of kt ; i.e.,
the graph (t; kt ) in the tk-plane. Next, do the same for k0 = 1: Comment.
3.9 Consider a Diamond OLG model for a closed economy. Let the utility
discount rate be denoted and let the period utility function be speci…ed as
u (c) = ln c:
d) Given the above information, can we be sure that there exists a unique and
globally asymptotically stable steady state? Why or why not?
f) Let k K=L: In the (t; ln k) plane, draw a graph of ln kt such that the
qualitative features of the time path of ln k before and after the shock,
including the long run, are exhibited.
g) How, if at all, is the real interest rate in the long run a¤ected by the shock?
h) How, if at all, is the real wage in the long run a¤ected by the shock?
i) How, if at all, is the labor income share of national income in the long run
a¤ected by the shock?
j) Explain by words the economic intuition behind your results in h) and i).
3.10
A growing economy
123
124
GDP per capita in United States, United KingdomCHAPTER 4. A GROWING ECONOMY
and Japan (1870-2010)
Figure 4.1: GDP per capita in USA, UK, and Japan 1870-2010. Source: Bolt and van
Zanden
Sources: Bolt,(2013).
J. and J. L. van Zanden (2013): The First Update of the Maddison Project; Re-Estimating
Growth Before 1820. Maddison Project Working Paper 4.
4.1 Harrod-neutrality and Kaldor’s stylized facts
Suppose the technology changes over time in such a way that we can write the
aggregate production function as
Yt = F (Kt ; Tt Lt ); (4.1)
where the level of technology is represented by the factor Tt which is growing over
time, and where Yt ; Kt ; and Lt stand for output, capital input, and labor input,
respectively. When technological change takes this purely “labor-augmenting”
form, it is known as Harrod-neutral technological progress.
Figure 4.2: GDP and GDP per capita. Denmark 1870-2006. Sources: Bolt and van
Zanden (2013); Maddison (2010); The Conference Board Total Economy Database
Sources: Bolt, J. and J. L. van Zanden (2013): The First Update of the Maddison Project; Re-Estimating
(2013).
Growth Before 1820. Maddison Project Working Paper 4, Maddison (2010): Statistics on World Population,
GDP and Per Capita GDP, 1-2008 AD, and The Conference Board Total Economy Database (2013).
1. the growth rates in K=L and Y =L are roughly constant;
2. the output-capital ratio, Y =K; the income share of labor, wL=Y; and the
average rate of return, (Y wL K)=K;1 are roughly constant;
3. the growth rate of Y =L can vary substantially across countries for quite
long time.
for that part of Kaldor’s “fact 2” which claims long-run constancy of the labor
income share. The third fact is a fact well documented empirically.2
1
Denmark
USA
0.8
Labor’s share of income
0.6
0.4
0.2
0
1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010
Figure 4.3: Labor’s share of GDP in USA (1950-2011) and Denmark (1970-2011).
Source: Feenstra, Inklaar and Timmer (2013), www.ggdc.net/pwt.
Balanced growth
With Kt , Yt , and Ct denoting aggregate capital, output, and consumption as
above, we de…ne a balanced growth path the following way:
DEFINITION 1 A balanced growth path is a path f(Kt ; Yt ; Ct )g1t=0 along which the
variables Kt ; Yt ; and Ct are positive and grow at constant rates (not necessarily
positive).
At least for a closed economy there is a general equivalence relationship be-
tween balanced growth and constancy of certain key ratios like Y =K and C=Y .
This relationship is an implication of accounting based on the above aggregate
dynamic resource constraint (4.2).
For an arbitrary variable xt 2 R++ ; we de…ne xt xt xt 1 : Whenever
xt 1 > 0; the growth rate of x from t 1 to t; denoted gx (t); is de…ned by gx (t)
xt =xt 1 . When there is no risk of confusion, we suppress the explicit dating
and write gx x=x:
PROPOSITION 1 (the balanced growth equivalence theorem). Let f(Kt ; Yt ; Ct )g1 t=0
be a path along which Kt , Yt ; Ct ; and St ( Yt Ct ) are positive for all t =
0; 1; 2; : : : : Then, given the dynamic resource constraint (4.2), the following holds:
(i) if there is balanced growth, then gY = gK = gC and so the ratios Y =K and
C=Y are constant;
(ii) if Y =K and C=Y are constant, then Y; K; and C grow at the same constant
rate, i.e., not only is there balanced growth but the growth rates of Y; K; and C
are the same.
Proof Consider a path f(Kt ; Yt ; Ct )g1 t=0 along which K, Y; C; and St Y Ct
are positive for all t = 0; 1; 2; : : : :
(i) Suppose the path is a balanced growth path. Then, by de…nition, gY ; gK ;
and gC are constant. Hence, by (4.2), S=K = gK + must be constant, implying6
gS = gK : (*)
By (4.2), Y C + S; and so
Y C S C S C S
gY = = + = gC + gS = gC + gK (by (*))
Y Y Y Y Y Y Y
C Y C C
= gC + gK = (gC gK ) + gK : (**)
Y Y Y
6
The ratio between two positive variables is constant if and only if the variables have the
same growth rate (not necessarily constant or positive). For this and similar simple growth-
arithmetic rules, see Appendix A.
except for the assumed constancy of the capital depreciation rate , no assumption
about the technology is involved, not even that constant returns to scale is present.
Proposition 1 suggests that if one accepts Kaldor’s stylized facts as a rough
description of more than a century’s growth experience and therefore wants the
model to be consistent with them, one should construct the model so that it can
generate balanced growth.
@ F~
Yt = F~ (Kt ; ALt ; t); A > 0; > 0; (4.4)
@t
where F~ is homogeneous of degree one w.r.t. the …rst two arguments (CRS) and
A is a constant that depends on measurement units. The third argument, t;
represents technological progress: as time proceeds, unchanged inputs of capital
and labor result in more and more output. Let the labor force grow at a constant
rate n;
Lt = L0 (1 + n)t ; n > 1; (4.5)
where L0 > 0. The Japanese economist Hirofumi Uzawa (1928-) is famous for
several contributions, not least his balanced growth theorem (Uzawa 1961), which
we here state in a modernized form.
PROPOSITION 2 (Uzawa’s balanced growth theorem). Let f(Kt ; Yt ; Ct )g1 t=0 be a
path along which Yt ; Kt , Ct , and St Yt Ct are positive for all t = 0; 1; 2;. . . ,
and satisfy the dynamic resource constraint (4.2), given the production function
(4.4) and the labor force (4.5). Assume (1 + g)(1 + n) > 1 . Then:
(i) a necessary condition for this path to be a balanced growth path is that along
the path it holds that
Yt = F~ (Kt ; Tt Lt ; 0); (4.6)
where Tt = A(1 + g)t with 1 + g (1 + gY )=(1 + n); gY being the constant growth
rate of output along the balanced growth path;
7
On the other hand we do not imply that CRS is always necessary for a balanced growth
path (see Exercise 4.??).
(ii) for any g 0 such that there is a q > (1 + g)(1 + n) (1 ) with the
property that the production function F~ in (4.4) allows an output-capital ratio
equal to q at t = 0 (i.e., F~ (1; k~ 1 ; 0) = q for some real number k~ > 0), a su¢ cient
condition for F~ to be consistent with a balanced growth path with output-capital
ratio equal to q is that F~ can be written as in (4.6) with Tt = A(1 + g)t .
Proof (i) Suppose the given path f(Kt ; Yt ; Ct )g1t=0 is a balanced growth path.
By de…nition, gK and gY are then constant so that Kt = K0 (1 + gK )t and Yt
= Y0 (1 + gY )t : With t = 0 in (4.4) we then have
Yt (1 + gY ) t
= Y0 = F~ (K0 ; AL0 ; 0) = F~ (Kt (1 + gK ) t ; ALt (1 + n) t ; 0): (4.7)
We see that (4.6) holds for Tt = A(1 + g)t with 1 + g (1 + gY )=(1 + n):
(ii) See Appendix B.
The form (4.6) indicates that along a balanced growth path (BGP from
now), technological progress must be purely labor augmenting, that is, Harrod-
neutral. Moreover, by de…ning a new CRS production function F by F (Kt ; Tt Lt )
F~ (Kt ; Tt Lt ; 0); we see that (i) of the proposition implies that at least along the
BGP, we can rewrite the original production function this way:
COROLLARY Along a BGP with positive gross saving and the technology level
T growing at a constant rate g 0; output grows at the rate (1 + g)(1 + n) 1
( g + n for g and n “small”) while labor productivity, y; capital-labor ratio, k;
and consumption-labor ratio, c; all grow at the rate g:
showing that y grows at the rate g: Moreover, y=k = Y =K; which is constant
along a BGP, by (i) of Proposition 1. Hence k grows at the same rate as y:
Finally, also c=y C=Y is constant along a BGP, implying that also c grows at
the same rate as y.
that t = f 0 (k~ )=q : Moreover, 0 < < 1; since 0 < is implied by f 0 > 0;
and < 1 is implied by the fact that q = Y =K = f (k~ )=k~ > f 0 (k~ ); in view of
f 00 < 0 and f (0) 0: So, by the …rst equality in (4.12), the labor income share
can be written wt Lt =Yt = 1 t = 1 : Consequently, by (4.11), the rate of
return on capital is rt = (1 wt Lt =Yt )Yt =Kt = q :
Although this proposition implies constancy of the factor income shares under
balanced growth, it does not determine them. The proposition expresses the
factor income shares in terms of the unknown constants and q: These constants
will generally depend on the e¤ective capital-labor ratio in steady state, k~ ; which
will generally be an unknown as long as we have not formulated a theory of saving.
This takes us back to Diamond’s OLG model which provides such a theory.
Yt = F (Kt ; Tt Lt ); (4.13)
Tt = T0 (1 + g)t ; g 0: (4.14)
Y K ~ 1) ~
y~ = F( ; 1) = F (k; f (k); f 0 > 0; f 00 < 0;
TL TL
period the representative …rm maximizes pro…t, = F (K; T L) r^K wL: With
respect to capital this leads to the …rst-order condition
h i
@ T Lf ( ~
k)
@Y ~ =r+ ;
= = f 0 (k) (4.15)
@K @K
where is a constant capital depreciation rate, 0 1: With respect to labor
we get the …rst-order condition
h i
@ T Lf ( ~
k) h i
@Y ~ ~ k~ T = w:
= = f (k) f 0 (k) (4.16)
@L @L
In view of f 00 < 0; a k~ satisfying (4.15) is unique. Let us denote its value in
period t; k~td : Assuming equilibrium in the factor markets, this desired e¤ective
capital-labor ratio equals the e¤ective capital-labor ratio from the supply side,
k~t Kt =(Tt Lt ) kt =Tt ; which is predetermined in every period. The equilibrium
interest rate and real wage in period t are thus given by
This property entails that if the value of the “endowment”, here the human wealth
wt ; is multiplied by a > 0, then the chosen c1t and c2t+1 are also multiplied by
this factor (see Appendix C); it then follows that st is multiplied by as well.
Letting = 1=(w( ~ k~t )Tt ); (A4) thus allows us to write
where s^(r(k~t+1 )) is the saving-wealth ratio of the young. The distinctive feature
is that this saving-wealth ratio is independent of wealth (but in general it depends
on the interest rate). By (4.19), the law of motion of the economy reduces to
s^(r(k~t+1 ))
k~t+1 = ~ k~t ):
w( (4.21)
(1 + g)(1 + n)
The equilibrium path of the economy can be analyzed in a similar way as in
the case of no technological progress. In the assumptions (A2) and (A3) from
Chapter 3 we replace k by k~ and 1 + n by (1 + g)(1 + n). As a generalization
of Proposition 4 from Chapter 3, these generalized versions of (A2) and (A3),
together with the No Fast Assumption (A1) and the Homotheticity Assumption
(A4), guarantee that there exists at least one locally asymptotically stable steady
state k~ > 0: That is, given these assumptions, we have k~t ! k~ for t ! 1 and so
the system will sooner or later settle down in a steady state. The convergence of
k~ implies convergence of many key variables, for instance the equilibrium factor
prices given in (4.17) and (4.18). We see that, for t ! 1;
The prediction of the model is now that the economy will in the long run
behave in accordance with Kaldor’s stylized facts. Indeed, in many models, in-
cluding the present one, convergence toward a steady state is equivalent to saying
that the time path of the economy converges toward a BGP. In the present case,
with perfect competition, the implication is that in the long run the economy will
be consistent with Kaldor’s stylized facts.
The claimed equivalence follows from:
Figure 4.4: Transition curve for a well-behaved Diamond OLG model with Harrod-
neutral technical progress.
Let us portray the dynamics by a transition diagram. Fig. 4.4 shows a “well-
behaved” case in the sense that there is only one steady state. In the …gure the
initial e¤ective capital-labor ratio, k~0 ; is assumed to be relatively large. This
need not be interpreted as if the economy is highly developed and has a high
initial capital-labor ratio, K0 =L0 : Indeed, the reason that k~0 K0 =(T0 L0 ) is
large relative to its steady-steady value may be that the economy is “backward”
in the sense of having a relatively low initial level of technology. Growing at a
given rate g; the technology will in this situation grow faster than the capital-
labor ratio, K=L; so that the e¤ective capital-labor ratio declines over time. The
process continues until the steady state is essentially reached with a real interest
rate r = f 0 (k~ ) : This is to remind the reader that from an empirical point of
view, the adjustment towards a steady state can be from above as well as from
below.
The output growth rate in steady state, (1 + g)(1 + n) 1; is sometimes called
the “natural rate of growth”. Since (1 + g)(1 + n) 1 = g + n + gn g + n for
g and n “small”, the natural rate of growth approximately equals the sum of the
rate of technological progress and the growth rate of the labor force. Warning:
When measured on an annual basis, the growth rates of technology and labor
force, g and n; do indeed tend to be “small”, say g = 0:02 and n = 0:005; so
that g + n + gn = 0:0251 0:0250 = g + n: But in the context of models like
Diamond’s, the period length is, say, 30 years. Then the corresponding g and n
will satisfy the equations 1 + g = (1 + g)30 = 1:0230 = 1:8114 and 1 + n = (1 + n)30
= 1:00530 = 1:1614, respectively. We get g + n = 0:973, which is about 10 per
cent smaller than the true output growth rate over 30 years, which is g + n + gn
= 1:104:
We end our account of Diamond’s OLG model with some remarks on a popular
special case of a homothetic utility function.
1
k~t+1 = ~ k~t ):
w(
(1 + g)(1 + n)(2 + )
We see that in the = 1 case, whatever the production function, k~t+1 enters
only at the left-hand side of the fundamental di¤erence equation, which thereby
~ > 0; the transition curve
reduces to a simple transition function. Since w~ 0 (k)
is positively sloped everywhere. If the production function is Cobb-Douglas, Yt
= Kt (Tt Lt )1 ; then w(~ k~t ) = (1 )k~t : Combining this with = 1 yields a
“well-behaved” Diamond model (thus having a unique and globally asymptoti-
cally stable steady state), cf. Fig. 4.4 above. In fact, as noted in Chapter 3,
in combination with Cobb-Douglas technology, CRRA utility results in “well-
behavedness”whatever the value of > 0.
which is a milder condition than the Inada conditions. Considering the second-
~ = f 00 (k)
order condition c~00 (k) ~ < 0; the k~ satisfying (4.24) does indeed maximize
~ By de…nition, this k~ is the golden-rule capital intensity, k~GR : Thus
c~(k):
where the right-hand side is the “natural rate of growth”. This says that the
golden-rule capital intensity is that level of the capital intensity at which the net
marginal productivity of capital equals the output growth rate in steady state.
Abel et al. (1989) study the problem on the basis of a model with uncertainty.
They show that a su¢ cient condition for dynamic e¢ ciency is that gross invest-
ment, I; does not exceed the gross capital income in the long run, that is I
Y wL: They …nd that for the U.S. and six other major OECD nations this seems
to hold. Indeed, for the period 1929-85 the U.S. has, on average, I=Y = 0:15 and
(Y wL)=Y = 0:29: A similar di¤erence is found for other industrialized coun-
tries, suggesting that they are dynamically e¢ cient. At least in these countries,
therefore, the potential coordination failure laid bare by OLG models does not
seem to have been operative in practice.
Y = r^K + wL;
where r^ = r + = f 0 (k) r^(k) is the cost per unit of capital input and w
= f (k) kf 0 (k) w(k) is the real wage, i.e., the cost per unit of labor input.
The labor income share is
w=^
r
wL f (k) kf 0 (k) w(k) wL k
= SL(k) = = ; (4.26)
Y f (k) f (k) r^K + wL 1 + w=^
k
r
where the function SL( ) is the income share of labor function, w=^ r is the factor
price ratio, and (w=^ r)=k = w=(^rk) is the factor income ratio. As r^ (k) = f 00 (k) < 0
0
0 00
and w (k) = kf (k) > 0; the factor price ratio, w=^ r, is an increasing function
of k:
Suppose that capital tends to grow faster than labor so that k rises over time.
Unless the production function is Cobb-Douglas, this will under perfect competi-
tion a¤ect the labor income share. But apriori it is not obvious in what direction.
By (4.26) we see that the labor income share moves in the same direction as the
factor income ratio, (w=^ r)=k: The latter goes up (down) depending on whether
the percentage rise in the factor price ratio w=^ r is greater (smaller) than the
percentage rise in k. So, if we let E`x g(x) denote the elasticity of a function g(x)
w.r.t. x; we can only say that
w
SL0 (k) R 0 for E`k R 1; (4.27)
r^
respectively. In words: if the production function is such that the ratio of the
marginal productivities of the two production factors is strongly (weakly) sensitive
to the capital-labor ratio, then the labor income share rises (falls) along with a
rise in K=L:
Usually, however, the inverse elasticity is considered, namely E`w=^r k (= 1=E`k wr^ ):
This elasticity indicates how sensitive the cost minimizing capital-labor ratio, k;
is to a given factor price ratio w=^r: Under perfect competition E`w=^r k coincides
with what is known as the elasticity of factor substitution (for a general de…n-
ition, see below). The latter is often denoted : In the CRS case, will be a
function of only k so that we can write E`w=^r k = (k): By (4.27), we therefore
have
SL0 (k) R 0 for (k) Q 1;
respectively.
If F is Cobb-Douglas, i.e., Y = K L1 ; 0 < < 1; we have (k) 1; as
shown in Section 4.5. In this case variation in k does not change the labor income
share under perfect competition. Empirically there is not agreement about the
“normal”size of the elasticity of factor substitution for industrialized economies,
but the bulk of studies seems to conclude with (k) < 1 (see below).
wL w=T w~
= :
Y Y =(T L) y~
respectively. We see that if (k) ~ < 1 in the relevant range for k;~ then market
forces tend to increase the income share of the factor that is becoming relatively
more scarce, which is e¢ ciency-adjusted labor, T L; if k~ is increasing. And if
instead (k)~ > 1 in the relevant range for k;
~ then market forces tend to decrease
the income share of the factor that is becoming relatively more scarce.
While k empirically is clearly growing, k~ k=T is not necessarily so because
also T is increasing. Indeed, according to Kaldor’s “stylized facts”, apart from
short- and medium-term ‡uctuations, k~ and therefore also r^ and the labor
income share tend to be more or less constant over time. This can happen
whatever the sign of (k~ ) 1; where k~ is the long-run value of the e¤ective
~ Given CRS and the production function f; the elasticity
capital-labor ratio k.
of substitution between capital and labor does not depend on whether g = 0 or
g > 0, but only on the function f itself and the level of K=(T L).
As alluded to earlier, there are empiricists who reject Kaldor’s “facts” as a
general tendency. For instance Piketty (2014) essentially claims that in the very
long run the e¤ective capital-labor ratio k~ has an upward trend, temporarily
braked by two world wars and the Great Depression in the 1930s. If so, the sign
~
of (k) 1 becomes decisive for in what direction wL=Y will move. Piketty
~ > 1; which means there
interprets the econometric literature as favoring (k)
should be downward pressure on wL=Y . This particular source behind a falling
wL=Y can be questioned, however. Indeed, (k) ~ > 1 contradicts the more general
10
empirical view referred to above.
Immigration
~ Con-
Here is another example that illustrates the importance of the size of (k):
sider an economy with perfect competition and a given aggregate capital stock K
and technology level T (entering the production function in the labor-augmenting
way as above). Suppose that for some reason, immigration, say, aggregate labor
supply, L; shifts up and full employment is maintained by the needed real wage
adjustment. Given the present model, in what direction will aggregate labor in-
come wL = w( ~ L then change? The e¤ect of the larger L is to some extent
~ k)T
o¤set by a lower w brought about by the lower e¤ective capital-labor ratio. In-
~ k~ = kf
deed, in view of dw=d ~ 00 (k)
~ > 0; we have k~ # implies w # for …xed T: So
we cannot apriori sign the change in wL: The following relationship can be shown
(Exercise 4.??), however:
@(wL) ~
(k)
= (1 )w R 0 for (k)
~ Q (k);
~ (4.28)
@L ~
(k)
~
respectively, where a(k) ~ 0 (k)=f
kf ~ (k)~ is the output elasticity w.r.t. capital
which under perfect competition equals the gross capital income share. It follows
that the larger L will not be fully o¤set by the lower w as long as the elasticity
~ exceeds the gross capital income share, (k).
of factor substitution, (k); ~ This
condition seems con…rmed by most of the empirical evidence (see Section 4.5).
a given point (K; L) on the isoquant curve, M RS is given by the absolute value
of the slope of the tangent to the isoquant at that point. This tangent coincides
with that isocost line which, given the factor prices, has minimal intercept with
the vertical axis while at the same time touching the isoquant. In view of F ( )
being neoclassical, the isoquants are by de…nition strictly convex to the origin.
Consequently, M RS is rising along the curve when L decreases and thereby K
increases. Conversely, we can let M RS be the independent variable and consider
the corresponding point on the indi¤erence curve, and thereby the ratio K=L, as a
function of M RS. If we let M RS rise along the given isoquant, the corresponding
value of the ratio K=L will also rise.
Figure 4.6: Substitution of capital for labor as the marginal rate of substitution in-
creases from M RS to M RS 0 .
The elasticity of substitution between capital and labor is de…ned as the elas-
ticity of the ratio K=L with respect to M RS when we move along a given isoquant,
evaluated at the point (K; L). Let this elasticity be denoted ~ (K; L): Thus,
d(K=L)
M RS d(K=L) K=L
~ (K; L) = = dM RS
: (4.29)
K=L dM RS jY =Y M RS jY =Y
factor price ratio, holding the output level unchanged.12 The elasticity of factor
substitution is thus a positive number and re‡ects how sensitive the capital-labor
ratio K=L is under cost minimization to an increase in the factor price ratio w=^ r
for a given output level: The less curvature the isoquant has, the greater is the
elasticity of factor substitution. In an analogue way, in consumer theory one con-
siders the elasticity of substitution between two consumption goods or between
consumption today and consumption tomorrow, cf. Chapter 3. In that context
the role of the given isoquant is taken over by an indi¤erence curve. That is also
the case when we consider the intertemporal elasticity of substitution in labor
supply, cf. the next chapter.
Calculating the elasticity of substitution between K and L at the point (K; L),
we get
FK FL (FK K + FL L)
~ (K; L) = ; (4.30)
KL [(FL )2 FKK 2FK FL FKL + (FK )2 FLL ]
where all the derivatives are evaluated at the point (K; L): When F (K; L) has
CRS, the formula (4.30) simpli…es to
where k K=L:13 We see that under CRS, the elasticity of substitution depends
only on the capital-labor ratio k, not on the output level. We will now consider the
case where the elasticity of substitution is independent also of the capital-labor
ratio:
is known as total factor productivity) and is thus called the e¢ ciency parameter.
The parameters and are called the distribution parameter and the substitution
parameter, respectively. The restriction < 1 ensures that the isoquants are
strictly convex to the origin. Note that if < 0; the right-hand side of (4.32)
is not de…ned when either K or L (or both) equal 0: We can circumvent this
problem by extending the domain of the CES function and assign the function
value 0 to these points when < 0. Continuity is maintained in the extended
domain (see Appendix E).
By taking partial derivatives in (4.32) and substituting back we get
1 1
@Y Y @Y Y
= A and = (1 )A ; (4.33)
@K K @L L
1 1
where Y =K = A + (1 )k and Y =L = A k +1 : The marginal
rate of substitution of K for L therefore is
@Y =@L 1
M RS = = k1 > 0:
@Y =@K
Consequently,
dM RS 1
= (1 )k ;
dk
where the inverse of the right-hand side is the value of dk=dM RS: Substituting
these expressions into (4.29) gives
1
~ (K; L) = ; (4.34)
1
2
Aα
3
2
σ=∞
2 σ=0
A
1
σ = 0.5
σ = 1.5 σ=1
L
0 1 2
2 2
3 4 5 6 7
A A(1−α)
Figure 4.7: Isoquants for the CES function for alternative values of (A = 1:5, Y = 2;
and = 0:42).
in Appendix E.
The marginal productivity of labor is
@Y
MP L = = (1 )A y 1 = (1 )A( k + 1 )(1 )=
w(k);
@L
from (4.33).
Since (4.32) is symmetric in K and L; we get a series of symmetric results by
1=
considering output per unit of capital as x Y =K = A + (1 )(L=K) :
In total, therefore, when there is low substitutability ( < 0); for …xed input
of either of the production factors, there is an upper bound for how much an
unlimited input of the other production factor can increase output. And when
there is high substitutability ( > 0); there is no such bound and an unlimited
input of either production factor take output to in…nity.
The Cobb-Douglas case, i.e., the limiting case for ! 0; constitutes in several
respects an intermediate case in that all four Inada conditions are satis…ed and
we have y ! 0 for k ! 0; and y ! 1 for k ! 1:
y y
1
∆x · Aα β
1 ∆x
5 ∆x · Aα β 5
1 ∆x
A (1 − α) β
1
k A (1 − α) β k
0 5 10 0 5 10
Figure 4.8: The CES production function in intensive form, = 1=(1 ); < 1:
Generalizations
The CES production function considered above has CRS. By adding an elasticity
of scale parameter, , we get the generalized form
Y =A K + (1 )L ; > 0: (4.37)
In this form the CES function is homogeneous of degree : For 0 < < 1; there are
DRS, for = 1 CRS, and for > 1 IRS. If 6= 1; it may be convenient to consider
1=
Q Y 1= = A1= K + (1 )L and q Q=L = A1= ( k + 1 )1= :
where it is understood that not only the output level but also all Xk , k 6= i, j,
are kept constant. Note that ji = ij : In the CES case considered in (4.38), all
the partial elasticities of substitution take the same value, 1=(1 ):
empirical fact that as income rises, the share of consumption expenditures devoted
to agricultural and industrial products declines and the share devoted to services,
hobbies, and amusement increases. Although “economic development”is perhaps
a more appropriate term (suggesting qualitative and structural change), we retain
standard terminology and speak of “economic growth”.
4.8 Appendix
A. Growth and interest arithmetic in discrete time
Let t = 0; 1; 2; : : : ; and consider the variables zt ; xt ; and yt ; assumed positive
for all t. De…ne zt = zt zt 1 and xt and yt similarly. These ’s need not
be positive. The growth rate of xt from period t 1 to period t is de…ned as the
relative rate of increase in x; i.e., xt =xt 1 xt =xt 1 : And the growth factor for
xt from period t 1 to period t is de…ned as 1 + xt =xt 1 :
As we are here interested not in the evolution of growth rates, we simplify
notation by suppressing the t’s. So we write the growth rate of x as gx x=x 1
and similarly for y and z:
PRODUCT RULE If z = xy; then 1 + gz = (1 + gx )(1 + gy ) and gz gx + gy ;
when gx and gy are “small”.
Proof. By de…nition, z = xy; which implies z 1 + z = (x 1 + x)(y 1 +
y): Dividing by z 1 = x 1 y 1 gives 1 + z=z 1 = (1 + x=x 1 )(1 + y=y 1 )
as claimed. By carrying out the multiplication on the right-hand side of this
equation, we get 1 + z=z 1 = 1 + x=x 1 + y=y 1 + ( x=x 1 )( y=y 1 )
1 + x=x 1 + y=y1 when x=x 1 and y=y 1 are “small”. Subtracting 1 on
both sides gives the stated approximation.
So the product of two positive variables will grow at a rate approximately
equal to the sum of the growth rates of the two variables.
1+gx
FRACTION RULE If z = xy ; then 1 + gz = 1+gy
and gz gx gy ; when gx and
gy are “small”.
Proof. By interchanging z and x in Product Rule and rearranging, we get 1 +
z=z 1 = 1+ x=x 1
1+ y=y 1
; as stated in the …rst part of the claim. Subtracting 1 on
both sides gives z=z 1 = x=x 1 y=y 1
1+ y=y 1
x=x 1 y=y 1 ; when x=x 1
and y=y 1 are “small”. This proves the stated approximation.
So the ratio between two positive variables will grow at a rate approximately
equal to the excess of the growth rate of the numerator over that of the denomina-
tor. An implication of the …rst part of Claim 2 is: the ratio between two positive
variables is constant if and only if the variables have the same growth rate (not
necessarily constant or positive).
POWER FUNCTION RULE If z = x ; then 1 + gz = (1 + gx ) :
Proof. 1 + gz z=z 1 = (x=x 1 ) (1 + gx ) .
Given a time series x0 ; x1 ; :::; xn , by the average growth rate per period, or
more precisely, the average compound growth rate, is meant a g which satis…es
xn = x0 (1 + g)n : The solution for g is g = (xn =x0 )1=n 1:
Finally, the following approximation may be useful (for intuition) if used with
caution:
THE GROWTH FACTOR With n denoting a positive integer above 1 but “not
too large”, the growth factor (1 + g)n can be approximated by 1 + ng when g is
“small”. For g 6= 0; the approximation error is larger the larger is n:
Proof. (i) We prove the claim by induction. Suppose the claim holds for a …xed
n 2; i.e., (1 + g)n 1 + ng for g “small”: Then (1 + g)n+1 = (1 + g)n (1 + g)
(1 + ng)(1 + g) = 1 + ng + g + ng 2 1 + (n + 1)g since g “small” implies g 2
“very small”and therefore ng 2 “small”if n is not “too”large. So the claim holds
also for n + 1: Since (1 + g)2 = 1 + 2g + g 2 1 + 2g; for g “small”, the claim does
indeed hold for n = 2:
THE EFFECTIVE ANNUAL RATE OF INTEREST Suppose interest on a
loan is charged n times a year at the rate r=n per year. Then the e¤ective annual
interest rate is (1 + r=n)n 1:
We see that k~t is constant if and only if k~t satis…es the equation f (k~t )=k~t =
[(1 + g)(1 + n) (1 )] =^
s q: By (**) and the de…nition of f; the required
~ ~
value of kt is k; which is thus the steady state for the constructed Solow model.
Letting K0 satisfy K0 = kAL ~ 0 ; where A = T0 ; we thus have k~0 = K0 =(T0 L0 ) = k:
~
So that the initial value of k~t equals the steady-state value. It follows that k~t = k~
for all t = 0; 1; 2; : : : ; and so Yt =Kt = f (k~t )=k~t = f (k)=
~ k~ = q for all t 0: In
addition, Ct = (1 s^)Yt ; so that Ct =Yt is constant along the path P: As both Y =K
and C=Y are thus constant along the path P , by (ii) of Proposition 1 follows that
P is a balanced growth path.
It is noteworthy that the proof of the su¢ ciency part of the theorem is con-
structive. It provides a method for constructing a balanced growth path with a
given technology growth rate and a given output-capital ratio.
The last equality is due to Euler’s theorem saying that when f is homogeneous of
degree 1, then the …rst-order partial derivatives of f are homogeneous of degree
0. Now, (4.39) implies that for a given M RS; in optimum re‡ecting a given
relative price of the two goods, the same consumption ratio, x2 =x1 ; will be chosen
whatever the budget. For a given relative price, a rising budget (rising wealth)
will change the position of the budget line, but not its slope. So M RS will not
change, which implies that the chosen pair (x1 ; x2 ) will move outward along a
given ray in R2+ : Indeed, as the intercepts with the axes rise proportionately with
the budget (the wealth), so will x1 and x2 .
dK FL (K; L)
= M RS = : (4.40)
dL jY =Y FK (K; L)
dM RS dM RS dL
=
dk jY =Y dL jY =Y dk jY =Y
(FL )2 FKK 2FK FL FKL + (FK )2 FLL dL
= (4.41)
FK3 dk jY =Y
dL k dK
dk jY =Y
K k dK
dL jY =Y
dL
dk jY =Y
K kM RS dL
dk jY =Y
K
= = = :
dk jY =Y k2 k2 k2
M RS dk
(K; L)
k dM RS jY =Y
FL =FK (k + FL =FK )k FK3
=
k K [(FL )2 FKK 2FK FL FKL + (FK )2 FLL ]
FK FL (FK K + FL L)
= ;
KL [(FL ) FKK 2FK FL FKL + (FK )2 FLL ]
2
FK FL F (K; L)
(K; L) = (from (2.54) with h = 1)
KL [(FL )2 FKK 2FK FL FKL + (FK )2 FLL ]
FK FL F (K; L)
= (from (2.60))
KLFKL [ (FL )2 L=K 2FK FL (FK )2 K=L]
FK FL F (K; L) FK FL
= 2
= ; (using (2.54) with h = 1)
FKL (FL L + FK K) FKL F (K; L)
which proves the …rst part of (4.31). The second part is an implication of rewriting
the formula in terms of the production in intensive form.
Y =A K + (1 )L ; (4.42)
where A; ; and are parameters satisfying A > 0, 0 < < 1; and < 1;
6= 0; > 0: If < 1; there is DRS, if = 1; CRS, and if > 1, IRS. The
elasticity of substitution is always = 1=(1 ): Throughout below, k means
K=L:
The limiting functional forms We claimed in the text that, for …xed K > 0
and L > 0, (4.42) implies:
lim Y = A(K L1 ) = AL k ; (*)
!0
lim Y = A min(K ; L ) = AL min(k ; 1): (**)
! 1
=
Proof. Let q Y =(AL ): Then q = ( k + 1 ) so that
ln( k + 1 ) m( )
ln q = ; (4.43)
where
k ln k ln k
m0 ( ) = = : (4.44)
k +1 + (1 )k
Hence, by L’Hôpital’s rule for “0/0”,
m0 ( )
lim ln q = lim = ln k = ln k ;
!0 !0 1
so that lim !0 q = k ; which proves (*). As to (**), note that
8
1 < 0 for k > 1;
lim k = lim ! 1 for k = 1;
! 1 ! 1 k :
1 for k < 1:
Hence, by (4.43),
0 for k 1;
lim ln q = m0 ( )
! 1 lim ! 1 1
= ln k = ln k for k < 1;
where the result for k < 1 is based on L’Hôpital’s rule for “1/-1”. Consequently,
1 for k 1;
lim q =
! 1 k for k < 1;
which proves (**).
Continuity at the boundary of R2+ When 0 < < 1; the right-hand side of
(4.42) is de…ned and continuous also on the boundary of R2+ : Indeed, we get
(
A K for L ! 0;
Y = F (K; L) = A K + (1 )L !
A(1 ) L for K ! 0:
When < 0; however, the right-hand side is not de…ned on the boundary. We
circumvent this problem by rede…ning the CES function in the following way
when < 0:
(
Y = F (K; L) = A K + (1 )L when K > 0 and L > 0; (4.45)
0 when either K or L equals 0.
We now show that continuity holds in the extended domain. When K > 0 and
L > 0, we have
Let < 0 and (K; L) ! (0; 0): Then, G(K; L) ! 1; and so Y = ! 1: Since
= < 0; this implies Y ! 0 = F (0; 0); where the equality follows from the
de…nition in (4.45): Next, consider a …xed L > 0 and rewrite (4.46) as
1 1 1 1 1
Y = A K + (1 )L = A L( k + 1 )
1
A L
= 1=
! 0 for k ! 0;
(ak + 1 )
when < 0: Since 1= > 0; this implies Y ! 0 = F (0; L); from (4.45): Finally,
consider a …xed K > 0 and let L=K ! 0: Then, by an analogue argument we get
Y ! 0 = F (K; 0); (4.45): So continuity is maintained in the extended domain.
4.9 Exercises
4.1 (the aggregate saving rate in steady state)
c) Compare your result in a) with the formula for S N =Y in steady state one
gets in any model with the same CRS-production function and no techno-
logical change. Comment.
d) Assume that n = 0. What does the formula from a) tell you about the level
of net aggregate savings in this case? Give the intuition behind the result in
terms of the aggregate saving by any generation in two consecutive periods.
One might think that people’s rate of impatience (in Diamond’s model the
rate of time preference ) a¤ect S N =Y in steady state. Does it in this case?
Why or why not?
g) Consider the statement: “In Diamond’s OLG model any generation saves
as much when young as it dissaves when old.”True or false? Why?
1 a¤ecting resource allocation (provide public goods that would otherwise not
be supplied in a su¢ cient amount, correct externalities and other markets
failures, prevent monopoly ine¢ ciencies, provide social insurance);
The design of …scal policy with regard to the aims 1 and 2 at a disaggregate
level is a major theme within the …eld of public economics. Macroeconomics
studies ways of dealing with aim 3 as well as big-picture aspects of 1 and 2, like
overall policies to maintain and promote sustainable prosperity.
In this chapter we address …scal sustainability and long-run implications of
debt …nance. This relates to one of the conditions that constrain public …nancing
instruments. To see the issue of …scal sustainability in a broader context, Section
6.1 provides an overview of conditions and factors that constrain public …nanc-
ing instruments. Section 6.2 introduces the basics of government budgeting and
Section 6.3 de…nes the concepts of government solvency and …scal sustainability.
In Section 6.4 the analytics of debt dynamics is presented. As an example, the
203
CHAPTER 6. LONG-RUN ASPECTS OF FISCAL POLICY
204 AND PUBLIC DEBT
Stability and Growth Pact of the EMU (the Economic and Monetary Union of
the European Union) is discussed. Section 6.5 looks more closely at the link be-
tween government solvency and the government’s No-Ponzi-Game condition and
intertemporal budget constraint. In Section 6.6 we widen public sector accounting
by introducing separate operating and capital budgets so as to allow for proper
accounting of public investment. A theoretical claim, known as the Ricardian
equivalence proposition, is studied in Section 6.7. The question “is Ricardian
equivalence likely to be a good approximation to reality?”is addressed, applying
the Diamond OLG framework extended with a public sector.
(i) …nancing by debt issue is constrained by the need to remain solvent and
avoid catastrophic debt dynamics;
(iii) time lags in spending as well as taxing may interfere with attempts to
stabilize the economy (recognition lag, decision lag, implementation lag,
and e¤ect lag);
Point (i) is the main focus of sections 6.2-6.6. Point (ii) is brie‡y considered
in Section 6.4.1 in connection with the so-called La¤er curve. In Section 6.6 point
(iii) is brie‡y commented on. The remaining points, (iv) - (v), are not addressed
speci…cally in this chapter. They should always be kept in mind, however, when
discussing …scal policy. Hence some remarks at the end of the chapter.
Now to the speci…cs of government budget accounting and debt …nancing.
We generally perceive the public sector (or the nation state) as consisting of the
national government and a central bank. In economics the term “government”
does not generally refer to the particular administration in o¢ ce at a point in
time. The term is rather used in a broad sense, encompassing both legislation
and administration. The aspects of legislation and administration in focus in
macroeconomics are the rules and decisions concerning spending on public con-
sumption, public investment, transfers, and subsidies on the expenditure side and
on levying taxes and incurring debts on the …nancing side. Within certain limits
the government has usually delegated the management of the nation’s currency
to the central bank, also called the monetary authority. Our accounting treats
“government budgeting” as covering the public sector as a whole, that is, the
consolidated government (including local government) and central bank. Gov-
ernment bonds held by the central bank are thus excluded from what we call
“government debt”. So the terms government debt, public debt, and state debt
are used synonymously.
Symbol Meaning
Yt real GDP (= real GNP if the economy is closed)
Ctg public consumption
Itg public …xed capital investment
Gt Ctg + Itg real public spending on goods and services
Xt real transfer payments
T~t real gross tax revenue
Tt T~t Xt real net tax revenue
Mt the monetary base (currency and bank reserves in the central bank)
Pt price level (in money) for goods and services (the GDP de‡ator)
Dt nominal net public debt
Dt
Bt Pt 1
real net public debt
Bt
bt Yt
government debt-to-income ratio
it nominal short-term interest rate
xt = xt xt 1 (where x is some arbitrary variable)
Pt Pt Pt 1
t Pt 1 Pt 1
in‡ation rate
Pt 1 (1+it ) 1+it
1 + rt Pt 1+ t
real short-term interest rate
Note that Yt ; Gt ; and Tt are quantities de…ned per period, or more generally,
per time unit, and are thus ‡ow variables. On the other hand, Mt ; Dt ; and Bt
are stock variables, that is, quantities de…ned at a given point in time, here at
the beginning of period t. We measure Dt and Bt net of …nancial claims held
by the government. Almost all countries have positive government net debt, but
in principle Dt < 0 is possible.1 The monetary base, Mt ; is currency plus fully
liquid deposits in the central bank held by the private sector at the beginning of
period t; Mt is by de…nition nonnegative.
We shall in this chapter most of the time ignore uncertainty and risk of default.
Then the nominal interest rate on government bonds must be the same as that on
other interest-bearing assets in the economy. For ease of exposition we imagine
that all government bonds are one-period bonds. That is, each government bond
promises a payout equal to one unit of account at the end of the period and
then the bond expires. Given the interest rate, it ; the market value of a bond at
the start of period t is vt = 1=(1 + it ): If the number of outstanding bonds (the
quantity of bonds) in period t is qt ; the government debt has face value (value at
maturity) equal to qt . The market value at the start of period t of this quantity
of bonds will be Dt = qt =(1 + it ). The nominal expenditure to be made at the
1
If Dt < 0, the government has positive net …nancial claims on the private sector and earns
interest on these claims which is then an additional source of government revenue besides
taxation.
end of the period to redeem the outstanding debt can then be written
qt = Dt (1 + it ): (6.1)
This is the usual way of writing the expenditure to be made, namely as if the
government debt were like a given bank loan of size Dt with a variable rate of
interest. We should not forget, however, that given the quantity, qt ; of the bonds,
the value, Dt , of the government debt at the issue date depends negatively on it :
Anyway, the total nominal government expenditure in period t can be written
Pt (Gt + Xt ) + Dt (1 + it ):
By rearranging we have
spending less taxes). The second term, it Dt ; is called the debt service. Simi-
larly, Pt (T~t Xt Gt ) is called the primary budget surplus. A negative value of
a “de…cit” thus amounts to a positive value of a corresponding “surplus”, and
a negative value of a “surplus” amounts to a positive value of a corresponding
“de…cit”.
We immediately see that this accounting deviates from “normal” principles.
Business companies typically have sharply separated capital and operating bud-
gets. In contrast, the budget de…cit de…ned above treats that part of G which
represents government net investment as parallel to government consumption.
Government net investment is attributed as an expense in a single year’s ac-
count; according to “normal” principles it is only the depreciation on the public
capital that should …gure as an expense. Likewise, the above accounting does
not consider that a part of D (or perhaps more than D) may be backed by the
value of public physical capital. And if the government sells a physical asset to
the private sector, the sale will appear as a reduction of the government budget
de…cit while in reality it is merely a conversion of an asset from a physical form
to a …nancial form. So the cost and asset aspects of government net investment
are not properly dealt with in the standard public accounting.3
With the exception of Section 6.6 we will nevertheless stick to the traditional
vocabulary. Where this might create logical di¢ culties, it helps to imagine that:
(a) all of G is public consumption, i.e., Gt = Ctg for all t;
(b) there is no public physical capital.
Now, from (6.2) and the de…nition Tt T~t Xt (net tax revenue) follows that
real government debt at the beginning of period t + 1 is:
Dt+1 Dt Mt+1
Bt+1 = Gt + Xt T~t + (1 + it )
Pt Pt Pt
Dt =Pt 1 Mt+1 1 + it Mt+1
= Gt Tt + (1 + it ) = Gt Tt + Bt
Pt =Pt 1 Pt 1+ t Pt
Mt+1
(1 + rt )Bt + Gt Tt : (6.4)
Pt
We see from the second line that, everything else equal, in‡ation curtails the real
value of the debt and interest payments. Hence, sometimes not only the actual
nominal budget de…cit is recorded but also a measure where t Dt is subtracted.
3
Another anomaly is related to the fact that some countries, for instance Denmark, have
large implicit government assets due to deferred taxes on the part of personal income invested
in pension funds. If the government then decides to reverse the deferred taxation (as the Danish
government did 2012 and 2014 to comply better with the 3%-de…cit rule of the Stability and
Growth Pact of the EMU), the o¢ cial budget de…cit is reduced, but essentially it is just a
matter of replacing one government asset by another.
The last term, Mt+1 =Pt ; in (6.4) is seigniorage, i.e., public sector revenue
obtained by issuing base money (ignoring the diminutive cost of printing money).
To get a sense of this variable, suppose real output grows at the constant rate gY
so that Yt+1 = (1 + gY )Yt : Then the public debt-to-income ratio can be written
Bt+1 1 + rt Gt Tt Mt+1
bt+1 = bt + : (6.5)
Yt+1 1 + gY (1 + gY )Yt Pt (1 + gY )Yt
Apart from the growth-correcting factor, (1+gY ) 1 ; the last term is the seigniorage-
income ratio,
Mt+1 Mt+1 Mt
= :
P t Yt Mt Pt Yt
If in the long run the base money growth rate, Mt+1 =Mt , as well as the nominal
interest rate (i.e., the opportunity cost of holding money) are constant, then the
velocity of money and its inverse, the money-nominal income ratio, Mt =(Pt Yt ),
are also likely to be roughly constant. So is, therefore, the seigniorage-income
ratio.4 For the more developed countries this ratio tends to be a fairly small
number although not immaterial. For emerging economies with poor institutions
for collecting taxes seigniorage matters more.5
The U.S. has a single monetary authority, the central bank, and a single
…scal authority, the treasury. The seigniorage created is immediately transferred
from the …rst to the latter. The Eurozone has a single monetary authority but
multiple …scal authorities, namely the treasuries of the member countries. The
seigniorage created by the ECB is every year shared by the national central banks
of the Eurozone countries in proportion to their equity share in the ECB. And
the national central banks then transfer their share to the national treasuries.
This makes up a Mt+1 term for the consolidated public sector of the individual
Eurozone countries.
In monetary unions and countries with their own currency, government budget
de…cits are thus generally …nanced both by debt creation and money creation, as
envisioned in the above equations. Nonetheless, from now on, for simplicity, in
this chapter we will predominantly ignore the seigniorage term in (6.5) and only
occasionally refer to the modi…cations implied by taking it into account.
4
A reasonable money demand function is Mtd = Pt Yt e i ; > 0; where i is the nominal
interest rate. With clearing in the money market, we thus have Mt =(Pt Yt ) = e i . In view of
1 + i (1 + r)(1 + ); when r and are constant, so is i and, thereby, Mt =(Pt Yt ):
5
In the U.S. over the period 1909-1950s seigniorage ‡uctuated a lot and peaked 4 % of GDP
in the 1930s and 3 % of GDP at the end of WW II. But over the period from the late 1960s
to 1986 seigniorage ‡uctuated less around an average close to 0.5 %.of GDP (Walsh, 2003, p.
177). In Denmark seigniorage was around 0.2 % of GDP during the 1990s (Kvartalsoversigt 4.
kvartal 2000, Danmarks Nationalbank). In Bolivia, up to the event of hyperin‡ation 1984-85,
seigniorage reached 5 % of GDP and more than 50 % of government revenue (Sachs and Larrain,
1993).
Bt+1 Bt = rt Bt + Gt Tt ; (DGBC)
where the right-hand side is the real budget de…cit. This equation is in macro-
economics often called the dynamic government budget constraint (or DGBC for
short). It is in fact just an accounting identity conditional on M = 0. It says
that if the real budget de…cit is positive and there is essentially no …nancing
by money creation, then the real public debt grows. We come closer to a con-
straint when combining (DGBC) with the requirement that the government stays
solvent.
So taxes cover only the primary expenses while interest payments (and debt
repayments when necessary) are …nanced by issuing new debt. That is, the
government attempts a permanent roll-over of the debt including the interest
due for payment. In view of (DGBC), this implies that Bt+1 = (1 + rt )Bt ; saying
that the debt grows at the rate rt . Assuming, for simplicity, that rt = r (a
Bt+1 1 + r Bt 1+r
bt+1 = bt ; b0 > 0;
Yt+1 1 + gY Yt 1 + gY
where we have maintained the assumption of a constant output growth rate, gY .
The solution to this linear di¤erence equation then becomes
1+r t
bt = b0 ( );
1 + gY
where we consider both r and gY as exogenous. We see that the growth-corrected
1+r
interest rate, 1+g Y
1 r gY (for gY and r “small”) plays a key role. There
are contrasting cases to discuss.
Case 1: r > gY : In this case, bt ! 1 for t ! 1: Owing to compound interest,
the debt grows so large in the long run that the government will be unable to …nd
buyers for all the debt. Permanent debt roll-over is thus not feasible. Imagine for
example an economy described by the Diamond OLG model. Here the buyers of
the debt are the young who place part of their saving in government bonds. But
if the stock of these bonds grows at a higher rate than income, the saving of the
young cannot in the long run keep track with the fast-growing government debt.
In this situation the private sector will understand that bankruptcy is threatening
and nobody will buy government bonds except at a low price, which means a high
interest rate. The high interest rate only aggravates the problem. That is, the
…scal policy (6.6) breaks down. Either the government defaults on the debt or T
must be increased or G decreased (or both) until the growth rate of the debt is
no longer higher than gY :
If the debt is denominated in the country’s own currency, an alternative way
out is of course a shift to money …nancing of the budget de…cit, that is, seignior-
age. When capacity utilization is high, this leads to rising in‡ation and thus
the real value of the debt is eroded. Bond holders will then demand a higher
nominal interest rate, thus aggravating the …scal di¢ culties. The economic and
social chaos of hyperin‡ation threatens.6 The hyperin‡ation in Germany 1922-23
peaked in Nov. 1923 at 29,525% per month; it eroded the real value of the huge
government debt of Germany after WW I by 95 percent.
Case 2: r = gY : If r = gY ; we get bt = b0 for all t 0: Since the debt, increas-
ing at the rate r; does not increase faster than national income, the government
has no problem …nding buyers of its newly issued bonds the government stays
6
In economists’ standard terminology “hyperin‡ation” is present when the in‡ation rate
exceeds 50 percent per month. As we shall see in Chapter 18, the monetary …nancing route comes
to a dead end if the needed seigniorage reaches the backward-bending part of the “seigniorage
La¤er curve”.
%
30
Real short-term DK interest rate (ex post)
Real annual DK GDP growth rate
20
10
−10
−20
Average interest rate (1875-2003): 2.9%
Compound annual GDP growth rate (1875-2002):2.7%
−30
1880 1890 1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000
%
30
Real short-term US interest rate (ex post)
Real annual US GDP growth rate
20
10
−10
−20
Average interest rate (1875-2003): 2.4%
Compound annual GDP growth rate (1875-2002):3.4%
−30
1880 1890 1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000
Figure 6.1: Real short-term interest rate and annual growth rate of real GDP in Den-
mark and the US since 1875. The real short-term interest rate is calculated as the
money market rate minus the contemporaneous rate of consumer price in‡ation. Source:
Abildgren (2005) and Maddison (2003).
solvent. Thereby the government is able to …nance its interest payments simply
by issuing new debt. The growing debt is passed on to ever new generations with
higher income and saving and the debt roll-over implied by (6.6) can continue
forever.
Case 3: r < gY : Here we get bt ! 0 for t ! 1; and the same conclusion
holds a fortiori.
In Case 2 as well as Case 3, where the interest rate is not higher than the
growth rate of the economy, the government can thus pursue a permanent debt
roll-over policy as implied by (6.6) and still remain solvent. But in Case 1,
permanent debt roll-over is impossible and sooner or later the interest payments
must be tax …nanced.
Which of the cases is relevant in real life? Fig. 6.1 shows for Denmark (upper
panel) and the US (lower panel) the time paths of the real short-term interest
rate and the GDP growth rate, both on an annual basis. Overall, the levels of
the two are more or less the same, although on average the interest rate is in
Denmark slightly higher but in the US somewhat lower than the growth rate.
(Note that the interest rates referred to are not the average rate of return in the
economy but a proxy for the lower interest rate on government bonds.)
Nevertheless, many macroeconomists believe there is good reason for paying
attention to the case r > gY , also for a country like the US. This is because we live
in a world of uncertainty, with many di¤erent interest rates, and imperfect credit
markets, aspects the above line of reasoning has not incorporated. The prudent
debt policy needed whenever, under certainty, r > gY can be shown to apply
to a larger range of circumstances when uncertainty is present (see Literature
notes). To give a ‡avor we may say that a prudent debt policy is needed when
the average interest rate on the public debt exceeds gY " for some “small”but
positive ":7 On the other hand there is a di¤erent feature which draws the matter
in the opposite direction. This is the possibility that a tax, 2 (0; 1); on interest
income is in force so that the net interest rate on the government debt is (1 )r
rather than r:
Bt+1 1+r
bt+1 = bt ; (6.7)
Yt+1 1 + gY 1 + gY
where we have assumed a constant interest rate, r. There are (again) three cases
to consider.
Case 1: r > gY : As emphasized above this case is generally considered the one
of most practical relevance. And it is in this case that latent debt instability is
present and the government has to pay attention to the danger of runaway debt
dynamics. To see this, note that the solution of the linear di¤erence equation
(6.7) is
t
1+r
bt = (b0 b) +b ; where (6.8)
1 + gY
1
1+r s
b = 1 = ; (6.9)
1 + gY 1 + gY r gY r gY
where s is the primary surplus as a share of GDP. Here b0 is historically given. But
the steady-state debt-income ratio, b ; depends on …scal policy. The important
feature is that the growth-corrected interest factor is in this case higher than 1
and has the exponent t. Therefore, if …scal policy is such that b < b0 ; the debt-
income ratio exhibits geometric growth. The solid curve in the topmost panel in
Fig. 6.2 shows a case where …scal policy is such that < (r gY )b0 whereby we
get b < b0 when r > gY ; so that the debt-income ratio, bt ; grows without bound.
This re‡ects that with r > gY ; compound interest is stronger than compound
growth. The sequence of discrete points implied by our discrete-time model is in
the …gure smoothed out as a continuous curve.
The American economist and Nobel Prize laureate George Akerlof (2004, p.
6) came up with this analogy:
“It takes some time after running o¤ the cli¤ before you begin to fall.
But the law of gravity works, and that fall is a certainty”.
Figure 6.2: Evolution of the debt-income ratio, depending on the sign of b0 b ; in the
cases r > gY (the three upper panels) and r < gY (the two lower panels), respectively.
Delaying the adjustment increases the size of the needed policy action, since
the debt-income ratio, and thereby s^; will become higher in the meantime.
For …xed spending-income ratio ; the minimum tax-to-income ratio needed
for …scal sustainability is
^ = + (r gY )b0 : (6.11)
Given b0 and ; this tax-to-income ratio is sometimes called the sustainable tax
rate. The di¤erence between this rate and the actual tax rate, ; indicates the
size of the needed tax adjustment, were it to take place at time 0, assuming a
given :
Suppose that the debt build-up can be and is prevented already at time
0 by ensuring that the primary surplus as a share of income, ; at least equals
s^ so that b b0 : The solid curve in the midmost panel in Fig. 6.2 illustrates the
resulting evolution of the debt-income ratio if b is at the level corresponding to
the hatched horizontal line while b0 is unchanged compared with the top panel.
Presumably, the government would in such a state of a¤airs relax its …scal policy
after a while in order not to accumulate large government …nancial net wealth.
Yet, the pre-funding strategy vis-a-vis the …scal challenge of population ageing
(referred to above) is in fact based on accumulating some positive public …nancial
net wealth as a bu¤er before the substantial e¤ects of population ageing set in. In
this context, the higher the growth-corrected interest rate, the shorter the time
needed to reach a given positive net wealth position.
Case 2: r = gY : In this knife-edge case there is still a danger of runaway dy-
namics, but of a less explosive form. The formula (6.8) is no longer valid. Instead
the solution of (6.7) is bt = b0 + [( )=(1 + gY )] t = b0 [( )=(1 + gY )] t:
Here, a non-negative primary surplus is both necessary and su¢ cient to avoid
bt ! 1 for t ! 1.
Case 3: r < gY : This is the case of stable debt dynamics. The formula (6.8)
is again valid, but now implying that the debt-income ratio is non-explosive.
Indeed, bt ! b for t ! 1; whatever the level of the initial debt-income ratio
and whatever the sign of the budget surplus. Moreover, when r < gY ;
b = S 0 for T 0: (*)
r gY
So, if there is a forever positive primary surplus, the result is a negative long-run
debt, i.e., a positive government …nancial net wealth in the long run. And if there
is a forever negative primary surplus, the result is not debt explosion but just
convergence toward some positive long-run debt-income ratio. The second from
bottom panel in Fig. 6.2 illustrates this case for a situation where b0 > b and
b > 0; i.e., < 0; by (*). When the GDP growth rate continues to exceed
the interest rate on government debt, a large debt-income ratio can be brought
La¤er curve*
R( ) = f (1 );
1 L 1 L
= f 0 (1 L) E`1 f (1 L ):
L f (1 L)
Rearranging gives
1
L = :
1 + E`1 f (1 L)
“The higher the level of debt, the smaller is the distance between
solvency and default”.11
The background for this remark is the following. There is likely to be an upper
bound for the tax-income ratio deemed politically or economically feasible by the
government as well as the market participants. Similarly, a lower bound for the
spending-income ratio is likely to exist, be it for economic or political reasons. In
the present framework we therefore let the government face the constraints
and ; where is the least upper bound for the tax-income ratio and is
the greatest lower bound for the spending-income ratio. Then the actual primary
surplus, s; can at most equal s :
10
As suggested for the U.S. by Gruber and Saez, 2002.
11
Blanchard (2011).
Suppose that at …rst the situation in the considered country is as in the second
from the top panel in Fig. 6.2. That is, initially,
s= = s^ = (r gY )b0 s ; (6.12)
where s is given in (6.12). The government could possibly increase its primary
surplus, s; but at most up to s; and this will not be enough since the required
primary surplus, s^0 ; exceeds s: The situation would be as illustrated in the top
panel of Fig. 6. 2 with b given as s=(r0 gY ) < b0 :
That is, if the actual interest rate should rise above the critical interest rate,
r; runaway debt dynamics would take o¤ and debt default thereby be threatening.
A fear that it may happen may be enough to trigger a fall in the market price of
government bonds which means a rise in the actual interest rate, r. So …nancial
investors’fear can be a self-ful…lling prophesy. Moreover, as we saw in connection
with (6.13), the risk that r becomes greater than r is larger the larger is b0 .
It is not so that across countries there is a common threshold value for a
“too large” public debt-to-income ratio. This is because variables like ; ; r;
and gY , as well as the net foreign debt position and the current account de…cit
(not in focus in this chapter), di¤er across countries. Late 2010 Greece had
(gross) government debt of 148 percent of GDP and the interest rate on 10-year
government bonds skyrocketed. Conversely Japan had (gross) government debt
of more than 200 percent of GDP while the interest rate on 10-year government
bonds remained very low.
Finer shades
Mt+1 Mt+1 Mt
s^ = (r gY )b = (r gY )b :
Pt Y t Mt P t Yt
12
In a monetary union which is not also a …scal union (think of the eurozone), the situation
is more complicated. A single member country with large government debt (or large debt in
commercial banks for that matter) may …nd itself in an acute liquidity crisis without its own
means to solve it. Indeed, the elevation of interest rates on government bonds in the Southern
part of the eurozone in 2010-2012 can be seen as a manifestation of investors’fear of payment
di¢ culties. The elevation was not reversed until the European Central Bank in September 2012
declared its willingness to e¤ectively act as a “lender of last resort” (on a conditional basis),
see Box 6.2 in Section 6.4.2.
In the SGP, = 0:03. Here we consider the general case: > 0: To see the
implication for the (public) debt-to-income ratio in the long run, let us …rst
imagine a situation where the de…cit ceiling, ; is always binding for the economy
we look at. Then Dt+1 = Dt + Pt Yt and so
Bt+1 Dt+1 Dt
bt+1 = + ;
Yt+1 Pt Yt+1 (1 + )Pt 1 (1 + gY )Yt 1 + gY
assuming constant output growth rate, gY ; and in‡ation rate : This reduces to
1
bt+1 = bt + : (6.15)
(1 + )(1 + gY ) 1 + gY
Assuming that (1 + )(1 + gY ) > 1 (as is normal over the medium run), this linear
di¤erence equation has the stable solution
t
1
bt = (b0 b) + b ! b for t ! 1; (6.16)
(1 + )(1 + gY )
where
(1 + )
b = : (6.17)
(1 + )(1 + gY ) 1
Consequently, if the de…cit rule (6.14) is always binding, the debt-income ratio
tends in the long run to be proportional to the de…cit bound : The factor of
proportionality is a decreasing function of the long-run growth rate of real GDP
and the in‡ation rate. This result con…rms the general tenet that if there is
economic growth, perpetual budget de…cits need not lead to …scal problems.
If on the other hand the de…cit rule is not always binding, then the budget
de…cit is on average smaller than above so that the debt-income ratio will in the
long run be smaller than b .
The conclusion is the following. With one year as the time unit, suppose the
de…cit rule is = 0:03 and that gY = 0:03 and = 0:02 (the upper end of
the in‡ation interval aimed at by the ECB). Suppose further the de…cit rule is
never violated. Then in the long run the debt-income ratio will be at most b
= 1:02 0:03=(1:02 1:03 1) 0:60: This is in agreement with the debt rule
of the SGP according to which the maximum value allowed for the debt-income
ratio is 60%.
Although there is nothing sacred about either of the numbers 0:60 or 0:03;
they are mutually consistent, given = 0:02 and gY = 0:03.
We observe that the de…cit rule (6.14) implies that:
The upper bound, b ; on the long-run debt income ratio is lower the higher
is in‡ation. The reason is that the growth factor [(1 + ) (1 + gY )] 1
for bt in (6.15) depends negatively on the in‡ation rate, . So does therefore
b since, by (6.16), b (1 + gY ) 1 (1 ) 1:
For a given ; the upper bound on the long-run debt income ratio is inde-
pendent of both the nominal and real interest rate (this follows from the
indicated formula for the growth factor for bt and the fact that (1+i)(1+r) 1
= 1 + ):
demand. The pact has therefore sometimes been called the “Instability and De-
pression Pact” it imposes a wrong timing of …scal consolidation.17
2. Since what really matters is long-run …scal sustainability, a de…cit rule
should be designed in a more ‡exible way than the 3% rule of the SGP. A mean-
ingful de…cit rule would relate the de…cit to the trend nominal GDP, which we
may denote (P Y ) . Such a criterion would imply
GBD (P Y ) : (6.18)
Then
GBD (P Y )
:
PY PY
In recessions the ratio (P Y ) =(P Y ) is high, in booms it is low. This has the
advantage of allowing more room for budget de…cits when they are needed
without interfering with the long-run aim of stabilizing government debt below
some speci…ed ceiling.
3. A further step in this direction is a rule directly in terms of the structural
or cyclically adjusted budget de…cit rather than the actual year-by-year de…cit.
The cyclically adjusted budget de…cit in a given year is de…ned as the value the
de…cit would take in case actual output were equal to trend output in that year.
Denoting the cyclically adjusted budget de…cit GBD ; the rule would be
GBD
:
(P Y )
In fact, in its original version as of 1997 the SGP contained an additional rule
like that, but in the very strict form of 0: This requirement was implicit in
the directive that the cyclically adjusted budget “should be close to balance or
in surplus”. By this requirement it is imposed that the debt-income ratio should
be close to zero in the long run. Many EMU countries certainly had and have
larger cyclically adjusted de…cits. Taking steps to comply with such a low
structural de…cit ceiling may be hard and endanger national welfare by getting in
the way of key tasks of the public sector. The minor reform of the SGP endorsed
in March 2005 allowed more contingencies, also concerning this structural bound.
By the more recent reform in 2012, the Fiscal Pact, the lid on the cyclically
17
The SGP has an exemption clause referring to “exceptional”circumstances. These circum-
stances were originally de…ned as “severe economic recession”, interpreted as an annual fall
in real GDP of at least 1-2%. By the reform of the SGP in March 2005, the interpretation
was changed into simply “negative growth”. Owing to the international economic crisis that
broke out in 2008, the de…cit rule was thus suspended in 2009 and 2010 for most of the EMU
countries. But the European Commission brought the rule into e¤ect again from 2011, which
according to many critics was much too early, given the circumstances.
adjusted de…cit-income ratio was raised to 0.5% and to 1.0% for members with a
debt-income ratio “signi…cantly below 60%”. These are still quite small numbers.
Abiding by the 0.5% or 1.0% rule implies a long-run debt-income ratio of at most
10% or 20%, respectively, given structural in‡ation and structural GDP growth
at 2% and 3% per year, respectively.18
4. Regarding the composition of government expenditure, critics have argued
that the SGP pact entails a problematic disincentive for public investment. The
view is that a …scal rule should be based on a proper accounting of public invest-
ment instead of simply ignoring the composition of government expenditure. We
consider this issue in Section 6.6 below.
5. At a more general level critics have contended that policy rules and sur-
veillance procedures imposed on sovereign nations will hardly be able to do their
job unless they encompass stronger incentive-compatible elements. Enforcement
mechanisms are bound to be week. The SGP’s threat of pecuniary …nes to a
country which during a recession has di¢ culties to reduce its budget de…cit seems
absurd (and has not been made use of so far). Moreover, abiding by the …scal
rules of the SGP prior to the Great Recession was certainly no guarantee of not
ending up in a …scal crisis in the wake of a crisis in the banking sector, as wit-
nessed by Ireland and Spain. A seemingly strong …scal position can vaporize fast,
particularly if banks, “too big to fail”, need be bailed out.
Ireland, thus widening bond yield spreads in these countries vis-a-vis Germany in the
midst of a serious economic recession. Moreover, the solvency of big German banks
that were among the prime creditors of Greece was endangered. The major Eurozone
governments and the International Monetary Fund (IMF) reached an agreement to
help Greece (and indirectly its creditors) with loans and guarantees for loans, condi-
tional on the government of Greece imposing yet another round of harsh …scal austerity
measures. The elevated bond interest rates of Greece, Italy, and Spain were not con-
vincingly curbed, however, until in August-September 2012 the president of the ECB,
Mario Draghi, launched the “Outright Monetary Transactions” (OMT) program ac-
cording to which, under certain conditions, the ECB will buy government bonds in
secondary bond markets with the aim of “safeguarding an appropriate monetary policy
transmission and the singleness of the monetary policy” and with “no ex ante quan-
titative limits”. Considerably reduced government bond spreads followed and so the
sheer announcement of the program seemed e¤ective in its own right. Doubts raised by
the German Constitutional Court about its legality vis-à-vis Treaties of the European
Union were …nally repudiated by the European Court of Justice mid-June 2015. At
the time of writing (late June 2015) the OMT program has not been used in practice.
Early 2015, a di¤erent massive program for purchases of government bonds, including
long-term bonds, in the secondary market as well as private asset-backed bonds was
decided and implemented by the ECB. The declared aim was to brake threatening de-
‡ation and return to “price stability”, by which is meant in‡ation close to 2 percent
per year.
So much about the monetary policy response. What about …scal policy? On the
basis of the SGP, the EU Commission imposed “…scal consolidation” initiatives to be
carried out in most EU countries in the period 2011-2013 (some of the countries were
required to start already in 2010). With what consequences? By many observers, partly
including the research department of IMF, the initiatives were judged self-defeating.
When at the same time comprehensive deleveraging in the private sector is going on,
“austerity” policy deteriorates aggregate demand further and raises unemployment.
Thereby, instead of budget de…cits being decreased, the numerator in the debt-income
ratio, D=(P Y ); is decreased. Fiscal multipliers are judged to be large (“in the 0.9 to
1.7 range since the Great Recession”, IMF, World Economic Outlook, Oct. 2012) in
a situation of idle resources, monetary policy aiming at low interest rates, and nega-
tive spillover e¤ects through trade linkages when “…scal consolidation”is synchronized
across countries. The unemployment rate in the Eurozone countries was elevated from
7.5 percent in 2008 to 12 percent in 2013. The British economists, Holland and Portes
(2012), concluded: “It is ironic that, given that the EU was set up in part to avoid
coordination failures in economic policy, it should deliver the exact opposite”.
The whole crisis has pointed to a basic di¢ culty faced by the Eurozone. In spite
of the member countries being economically very di¤erent sovereign nations, they are
t
lim Bt (1 + r) 0: (NPG)
t!1
This condition says that government debt is not allowed to grow in the long
run at a rate as high as (or even higher than) the interest rate.19 That is, a
…scal policy satisfying the NPG condition rules out a permanent debt rollover.
Indeed, as we saw in Section 6.3.1, with B0 > 0; a permanent debt rollover
policy (…nancing all interest payments and perhaps even also part of the primary
government spending) by debt issue leads to Bt B0 (1 + r)t for t = 0; 1; 2; : : : :
Substituting into (NPG) gives limt!1 Bt B0 (1 + r)t (1 + r) t = B0 > 0; thus
violating (NPG).
The designation No-Ponzi-Game condition refers to a guy from Boston, Charles
Ponzi, who in the 1920s made a fortune out of an investment scam based on the
chain-letter principle. The principle was to pay o¤ old investors with money from
new investors. Ponzi was sentenced to many years in prison for his transactions;
he died poor and without friends!
To our knowledge, this kind of …nancing behavior is nowhere forbidden for
the government as it generally is for private agents. But under “normal”circum-
stances a government has to plan its expenditures and taxation so as to comply
with its NPG condition since otherwise not enough lenders will be forthcoming.
As the state is in principle in…nitely-lived, however, there is no …nal date where
all government debt should be over and done with. Indeed, the NPG condition
does not even require that the debt has ultimately to be non-increasing. The
NPG condition “only” says that the debtor, here the government, can not let
the debt grow forever at a rate as high as (or higher than) the interest rate. For
instance the U.K. as well as the U.S. governments have had positive debt for
centuries and high debt after both WW I and WW II.
Suppose Y (GDP) grows at the constant rate gY (actually, for most of the
following results it is enough that limt!1 Yt+1 =Yt = 1 + gY ). We have:
PROPOSITION 1 Let bt Bt =Yt and interpret “solvency”as absence of an for
ever accelerating debt-income ratio. Then:
(ii) if r gY ; the government can remain solvent without (NPG) being satis…ed.
Proof. When bt 6= 0;
Here, by the upper inequality, (NPG) is satis…ed, yet, by the lower inequality
together with (6.19), we have limt!1 bt+1 =bt > 1 so that the debt-income ratio
explodes.
EXAMPLE 1 Let GDP = Y; a constant, and r > 0; so r > gY = 0: Let the
budget de…cit in real terms equal "Bt + ; where 0 " < r and > 0: Assuming
no money-…nancing of the de…cit, government debt evolves according to Bt+1 Bt
= "Bt + which implies a simple linear di¤erence equation:
Bt+1 = (1 + ")Bt + : (*)
Case 1: " = 0: Then the solution of (*) is
Bt = B0 + t; (**)
B0 being historically given. Then Bt (1 + r) t = B0 (1 + r) t + t(1 + r) t ! 0 for
t ! 1: So, (NPG) is satis…ed. Yet the debt-GDP ratio, Bt =Y; goes to in…nity
for t ! 1: That is, in spite of (NPG) being satis…ed, solvency is not present. For
" = 0 we thus get the insolvency result even though the lower strict inequality in
(6.20) is not satis…ed. Indeed, (**) implies Bt+1 =Bt = 1 + =Bt ! 1 for t ! 1
and 1 + gY = 1:
Case 2: 0 < " < r: Then the solution of (*) is
The government intertemporal budget constraint (GIBC), as seen from the begin-
ning of period t; is the requirement
X
1 X
1
(Gt+i + Xt+i )(1 + r) (i+1)
T~t+i (1 + r) (i+1)
Bt : (GIBC)
i=0 i=0
This condition requires that the present value (PV) of current and expected
future government spending does not exceed the government’s net wealth. The
latter equals the PV of current and expected future tax revenue minus existing
P PI
government debt. By the symbol 1 i=0 x i we mean limI!1 i=0 xi : Until further
notice we assume this limit exists.
What connection is there between the dynamic accounting relationship (6.21)
and the intertemporal budget constraint, (GIBC)? To …nd out, we rearrange
(6.21) and use forward substitution to get
Bt = (1 + r) 1 (T~t Xt Gt ) + (1 + r) 1 Bt+1
j
X
= (1 + r) (i+1) (T~t+i Xt+i Gt+i ) + (1 + r) (j+1)
Bt+j+1
i=0
X
1
= (1 + r) (i+1)
(T~t+i Xt+i Gt+i ) + lim (1 + r) (j+1)
Bt+j+1
j!1
i=0
X
1
(1 + r) (i+1)
(T~t+i Xt+i Gt+i ); (6.22)
i=0
if and only if the government debt ultimately grows at a rate less than r so that
(j+1)
lim (1 + r) Bt+j+1 0: (6.23)
j!1
This latter condition is exactly the NPG condition above (replace t in (6.23) by
0 and j by t 1). And the condition (6.22) is just a rewriting of (GIBC). We
conclude:
PROPOSITION 2 Given the book-keeping relation (6.21), then:
Bt
= (r gY ) :
Yt
X
1
T~t X
1
1 + gY
(i+1)
T~t+i (1 + r) (i+1)
= ;
i=0
1 + gY i=0 1+r
which is obtained by rearranging (GIBC) and replacing weak inequality with strict
equality. It is certainly not required that the budget is balanced all the time. The
point is “only” that for a given planned expenditure path, a government should
plan realistically a stream of future tax revenues the PV of which matches the
PV of planned expenditure plus the current debt. If an unplanned budget de…cit
is run so that the public debt rises during a recession, say then higher taxes
than otherwise must be levied in the future.
We may rewrite (GIBC’) as
X
1
T~t+i (Gt+i + Xt+i ) (1 + r) (i+1)
= Bt : (GIBC”)
i=0
This expresses the basic principle that when r > gY ; solvency requires that the
present value of planned future primary surpluses equals the initial debt. If debt
is positive today, then the government has to run a positive primary surplus for
a su¢ ciently long time in the future.
Finer shades
1. If the real interest rate varies over time, all the above formulas remain valid if
(1 + r) (i+1) is replaced by ij=0 (1 + rt+j ) 1 :
where is a (constant) capital depreciation rate. Let the annual (direct) …nancial
return per unit of public capital be rg : This is the sum of user fees and the
like. Net government revenue, T 0 ; now consists of net tax revenue, T; plus the
direct …nancial return rg K g :22 In that now only interest payments and the capital
depreciation, K g ; along with C g ; enter the operating account as “true”expenses,
the “true”budget de…cit is rB + C g + K g T 0 ; where T 0 = T + rg K g :
We impose a rule requiring balancing the “true structural budget”in the sense
that on average over the business cycle
T 0 = rB + C g + K g (6.25)
should hold. The spending on public investment of course enters the debt accu-
mulation equation which now takes the form
B = rB + C g + I g T 0:
B = Ig Kg = Kg; (6.26)
by (13.67). So the balanced “true structural budget” implies that public net
investment is …nanced by an increase in public debt. Other public spending is
tax …nanced.
Suppose that public capital keeps pace with trend GDP, Yt ; thereby growing
at the same constant rate gY > 0: So K g =K g = gY and the ratio K g =Y remains
positive constant at some level, say h: Then (13.69) implies
g
Bt+1 Bt = Kt+1 Ktg = gY Ktg = gY hYt : (6.27)
What is the implication for the evolution of the debt-to-trend-income ratio, ^bt
Bt =Yt , over time? By (6.27) together with Yt+1 = (1 + gY )Yt follows
^bt+1 Bt+1 Bt gY h 1 ^ gY h
= + bt + :
Yt+1 (1 + gY )Yt 1 + gY 1 + gY 1 + gY
This linear …rst-order di¤erence equation has the solution
Additional remarks
1. The de…cit rule described says only that the “true structural budget” should
be balanced “on average”over the business cycle. This invites de…cits in slumps
and surpluses in booms. Indeed, in economic slumps government borrowing is
usually cheap. As Harvard economist Lawrence Summers put it: “Idle workers
+ Low interest rates = Time to rebuild infrastructure”(Summers, 2014).
2. When separating government consumption and investment in budget ac-
counting, a practical as well as theoretical issue arises: where to draw the border
between the two? A sizeable part of what is investment in an economic sense is in
standard public sector accounting categorized as “public consumption”: spending
on education, research, and health are obvious examples. Distinguishing between
such categories and public consumption in a narrower sense (administration, ju-
dicial system, police, defence) may be important when economic growth policy is
on the agenda. Apart from noting the issue, we shall not pursue the matter here.
3. That time lags, cf. point (iii) in Section 6.1, are a constraining factor
for …scal policy is especially important for macroeconomic stabilization policy
aiming at dampening business cycle ‡uctuations. If the lags are ignored, there is
a risk that government intervention comes too late and ends up amplifying the
‡uctuations instead of dampening them. In particular the monetarists, lead by
Milton Friedman (1912-2006), warned against this risk. Other economists …nd
awareness of this potential problem relevant but point to ways to circumvent the
problem. During a recession there is for instance the option of reimbursing a
part of last year’s taxes, a policy that can be quickly implemented. At a more
structural level, legislation concerning taxation, transfers, and other spending can
be designed with the aim of strengthening the automatic …scal stabilizers.
23
This also holds if gY = 0: Indeed, in this case, (6.27) implies Bt+1 = Bt = B0 .
1) r > gY ;
2) …scal policy satis…es the intertemporal budget constraint with strict equal-
ity:
X1 X1
~
Tt (1 + r) (t+1)
= (Gt + Xt )(1 + r) (t+1) + B0 ; (6.28)
t=0 t=0
where the initial debt, B0 ; and the planned path of Gt + Xt are given;
4) at least some of the taxes are lump sum and only these are varied in the
thought experiment to be considered;
5) no money …nancing;
For a given planned time path of Gt + Xt ; equation (6.28) implies that a tax
cut in any period has to be met by an increase in future taxes of the same present
discounted value as the tax cut.
Ricardian equivalence
The Ricardian equivalence view is the conception that government debt is neutral
in the sense that for a given time path of government spending, aggregate private
consumption is una¤ected by a temporary tax cut. The temporary tax cut does
not make the households feel richer because they expect that the ensuing rise
in government debt will lead to higher taxes in the future. The essential claim
is that the timing of (lump-sum) taxes does not matter. The name Ricardian
equivalence comes from a seemingly false association of this view with the
early nineteenth-century British economist David Ricardo. It is true that Ricardo
articulated the possible logic behind debt neutrality. But he suggested several
reasons that debt neutrality would not hold in practice and in fact he warned
against high public debt levels (Ricardo, 1969, pp. 161-164). Therefore it is
doubtful whether Ricardo was a Ricardian.
Debt neutrality was rejuvenated, however, by Robert Barro in a paper entitled
“Are government bonds net wealth [of the private sector]?”, a question which
Barro answered in the negative (Barro 1974). Barro’s debt neutrality view rests
on a representative agent model, that is, a model where the household sector
is described as consisting of a …xed number of in…nitely-lived forward-looking
“dynasties”. With perfect …nancial markets, a change in the timing of taxes
does not change the PV of the in…nite stream of taxes imposed on the individual
dynasty. A cut in current taxes is o¤set by the expected higher future taxes.
Though current government saving (T G rB) goes down, private saving and
bequests left to the members of the next generation go up equally much.
More precisely, the logic of the debt neutrality view is as follows. Suppose, for
simplicity, that the government waits only 1 period to increase taxes and then does
so in one stroke. Then, for each unit of account current taxes are reduced, taxes
next period are increased by (1+r) units of account. The PV as seen from the end
of the current period of this future tax increase is (1+r)=(1+r) = 1: As 1 1 = 0;
the change in the time pro…le of taxation will make the dynasty feel neither richer
nor poorer. Consequently, its current and planned future consumption will be
una¤ected. That is, its current saving goes up just as much as its current taxation
is reduced. In this way the altruistic parents make sure that the next generation
is fully compensated for the higher future taxes. Current private consumption in
society is thus una¤ected and aggregate saving stays the same.24
24
The complete Barro model is presented in Chapter 7.
(a) There is perfect mobility of goods and …nancial capital across borders.
(b) There is no uncertainty and domestic and foreign …nancial claims are perfect
substitutes.
(c) The need for means of payment is ignored; hence so is the need for a foreign
exchange market.
The assumptions (a) and (b) imply real interest rate equality. That is, in
equilibrium the real interest rate in SOE must equal the real interest rate in
the world …nancial market, r. And by saying that SOE is “small” we mean
it is small enough to not a¤ect the world market interest rate as well as other
world market factors. We imagine that all countries trade one and the same
N
where the second to last equality comes from (6.31) and the identity Sgt
GBDt ; while the last equality re‡ects the maintained assumption that bud-
get de…cits are fully …nanced by debt issue.
Firms’behavior
GDP is produced by an aggregate neoclassical production function with CRS:
where Kt and Lt are input of capital and labor, respectively, and kt Kt =Lt .
Technological change is ignored. Imposing perfect competition in all markets,
markets clear so that Lt can be interpreted as both employment and labor supply
(exogenous). Pro…t maximization leads to f 0 (kt ) = r + ; where is a constant
capital depreciation rate, 0 1. When f satis…es the condition limk!0 f 0 (k)
0
> r + > limk!1 f (k), there is always a solution in k to this equation and it is
unique (since f 00 < 0) and constant over time (as long as r and are constant):
Thus,
kt = f 0 1 (r + ) k; for all t: (6.32)
The stock of capital, Kt ; is determined by the equation Kt = kLt :
In view of …rms’pro…t maximization, the equilibrium real wage before tax is
@Yt
wt = = f (k) f 0 (k)k w; (6.33)
@Lt
a constant. GDP will evolve according to
The growth rate of Y thus equals the growth rate of the labor force, i.e., gY = n.
Gt = G0 (1 + n)t ;
where 0 < G0 < F (K0 ; L0 ): It is assumed that the production of Gt uses the
same technology and therefore involves the same unit production costs as the
other components of GDP. As the focus is not on distortionary e¤ects of taxation,
taxes are assumed to be lump sum, i.e., levied on individuals irrespective of their
economic behavior.
To get explicit solutions, we specify the period utility function to be CRRA:
u(c) = (c1 1)=(1 ); where > 0: To keep things simple, the utility of
the public good enters the life-time utility additively so that it does not a¤ect
marginal utilities of private consumption. In addition we assume that the public
good does not a¤ect productivity in the private sector. There is a tax on the
young as well as the old in period t, 1 and 2 ; respectively. Until further notice
these taxes are time-independent. Possibly, 1 or 2 is negative, in which case
there is a transfer to either the young or the old.
The consumption-saving decision of the young will be the solution to the
following problem:
c11t 1 1 c12t+1 1
max U (c1t ; c2t+1 ) = + v(Gt ) + (1 + ) + v(Gt+1 ) s.t.
1 1
c1t + st = w 1;
c2t+1 = (1 + r)st 2;
c1t 0; c1t+1 0;
where the function v represents the utility contribution of the public good. The
implied Euler equation can be written
1=
c2t+1 1+r
= :
c1t 1+
Inserting the two budget constraints and solving for st , we get
1+ 1=
w 1 + 1+r 2
st = ( 1)=
s(w; r; 1; 2 ): (6.34)
1+r
1 + (1 + ) 1+
are the marginal (= average) propensities to consume out of wealth, and where
ht is the after-tax human wealth of the young, i.e., the present value, evaluated
at the end of period t; of disposable lifetime income (the “endowment”). Thus,
2
ht = w 1 h: (6.39)
1+r
Under the given conditions human wealth is thus time-independent. We assume
1 and 2 are such that h > 0: Given r; individual consumption in the …rst as well
as the second period of life is thus proportional to individual human wealth. This
is as expected in view of the homothetic life time utility function. If = r; then
c^1 (r) = c^2 (r) = (1 + r)=(2 + r); that is, there is complete consumption smoothing
as also the Euler equation indicates when = r.25
The tax revenue in period t is Tt = 1 Lt + 2 Lt 1 = ( 1 + 2 =(1 + n))Lt : Let
B0 = 0 and let the “reference path” be a path along which the budget is and
remains balanced for all t; i.e., Tt = Gt = G0 (1 + n)t : In the reference path the
tax code ( 1 ; 2 ) thus satis…es
2
1 + L0 = G0 :
1+n
This gives
X
1
1+n
t
m = x:
t=1
1+r
As r > n; from the rule for the sum of an in…nite geometric series follows that
r n
m= x m: (6.40)
1+n
The needed rise in future taxes is thus higher the higher is the interest rate r.
This is because the interest burden of the debt will be higher. On the other
hand, a higher population growth rate, n; reduces the needed rise in future taxes.
This is because the interest burden per capita is mitigated by population growth.
Finally, a greater tax cut, x; in the …rst period implies greater tax rises in future
periods.
Let the value of the variables along this alternative path be marked with a
prime. In period 0 the tax cut unambiguously bene…ts the old whose increase in
consumption equals the saved tax:
The young in period 0 know that per capita taxes next period will be increased
by m: In view of the tax cut in period 0, the young nevertheless experiences an
increase in after-tax human wealth equal to
+m 2 2
h00 h0 = w 1 +x w 1
1+r 1+r
r n
= 1 x (by (6.40))
(1 + r)(1 + n)
1 + (2 + r)n
= x > 0:
(1 + r)(1 + n)
Consequently, through the wealth e¤ect this generation enjoys increases in con-
sumption through life equal to
by (6.35) and (6.36), respectively. So the two generations alive in period 0 gain
from the temporary budget de…cit. But all future generations are worse o¤. These
generations do not bene…t from the tax relief in period 0, but they have to bear
the future cost of the tax relief by a reduction in individual after-tax human
wealth. Indeed, for t = 1; 2; : : : ;
2+m 2
h0t ht = h01 h=w 1 m w 1
1+r 1+r
m
= m+ < 0: (6.44)
1+r
All things considered, since both the young and the old in period 0 increase
their consumption, aggregate consumption in period 0 rises. Ricardian equiva-
lence thus fails.
thus implying s00 s0 > 0: Since A01 =L0 = s00 > s0 = A1 =L0 ; also aggregate private
…nancial wealth per old at the beginning of period 1 is larger than it would have
been without the temporary tax cut. This might seem paradoxical in view of the
higher aggregate private consumption in period 0. The explanation lies in the
fact that the lower taxation in period 0 means higher disposable income, allowing
both higher private consumption and higher private saving in period 0.
Nevertheless, gross national saving, cf. (6.29), is lower than in the reference
path. Indeed, C00 > C0 implies
A counterpart of the increased private saving is the public dissaving, re‡ecting the
budget de…cit created one-to-one by the reduction in taxation. As the increased
disposable income resulting from the latter partly goes to increased private saving
and partly to increased private consumption, the rise in private saving is smaller
than the public dissaving. Consequently, gross national saving ends up lower
than in the reference path.
Net national saving in the reference path is S0N = S0 K0 : The public
dissaving in the alternative path reduces net national saving by the amount
1 1
V10 = A01 B10 = s00 L0 (1 + )L0 x = (w ( 1 x) c010 x)L0 L0 x
1+n 1+n
1
= (w 1 c^1 (r)h00 ) L0 L0 x
1+n
2+m 1
= w 1 c^1 (r) w 1+x L0 L0 x
1+r 1+n
2 m 1
= w 1 c^1 (r) w 1 L0 c^1 (r) x L0 L0 x
1+r 1+r 1+n
r n 1
= s0 L 0 c^1 (r) 1 L0 x L0 x
(1 + r)(1 + n) 1+n
1 + (2 + r)n 1
= s0 L 0 c^1 (r) +1 L0 x < s0 L0 = V1 : (6.46)
1+r 1+n
We see that national wealth has decreased by an amount equal to the decrease
in net national saving in (6.45), as it should.
Thus, for t = 2; 3; : : : ;
A0t At
Q holds for s0t 1 Q st 1 ; respectively, which in turn holds for
Lt 1 Lt 1
1+r
c^1 (r) Q ; respectively. (6.48)
2+r
as of period t = 2; 3; : : : , is
Vt0 A0t Bt 2+n
= s0t 1 x
Lt 1 Lt 1 Lt 1 1+n
2+r 1
= st 1 1 c^1 (r) (r n) + 2 + n x (by (6.47))
1+r 1+n
1 + (2 + r)n 1 1+r
= s0 c^1 (r) +1 x (1 c^1 (r)) x
1+r 1+n 1+n
1 + (2 + r)n 1
< s0 c^1 (r) +1 x = V10 =L0 (by (6.46))
1+r 1+n
A1 At Vt
< s0 = = = ;
L0 Lt 1 Lt 1
where the second to last inequality is due to c^1 (r) < 1; cf. (6.37), while the two
…rst equalities in the last line are due to the constancy of “per old” variables
along the reference path. The last equality is due to the absence of government
debt along that path. So, like period 1, also the subsequent periods experience a
reduction in national wealth as a consequence of the temporary tax cut in period
0.
Period 1 is special, though. While there is a per capita tax increase by m like
in the subsequent periods, period 1’s old generation still bene…ts from the higher
disposable income in period 0. Hence, in period 2 national wealth per old is even
lower than in period 1 but remains constant henceforth.
The conclusion that under full capacity utilization budget de…cits imply a
burden for future generations may be seen in a somewhat di¤erent light if per-
sistent technological progress is included in the model. In that case, everything
else equal, future generations will generally be better o¤ than current generations.
Then it might seem less unfair if the former carry some public debt forward to the
latter. In particular this is so if a part of Gt represents spending on infrastructure,
education, research, health, and environmental protection. As future generations
directly bene…t from such investment, it seems fair that they also contribute to
the …nancing. This is the “bene…ts received principle”known from public …nance
theory.
A further concern is whether the economy is a state of full capacity utiliza-
tion or serious unemployment. The above analysis assumes the …rst. What if the
economy in period 0 is in economic depression with high unemployment due to
insu¢ cient aggregate demand? Some economists maintain that also in this situa-
tion is a cut in (lump-sum) taxes to stimulate aggregate demand futile because it
has no real e¤ect. The argument is again that foreseeing the higher taxes needed
in the future, people will save more to prepare themselves (or their descendants
through higher bequests) for paying the higher taxes in the future. The opposite
view is, …rst, that the composition-of-tax-base argument speaks against this as
usual. Second, there is in a depression an additional and quantitatively impor-
tant factor. The “…rst-round”increase in consumption due to the temporary tax
cut raises aggregate demand. Thereby production and income is stimulated and
a further (but smaller) rise in consumption occurs in the “second round”and so
on (the Keynesian multiplier process).
This Keynesian mechanism is important for the debate about e¤ects of budget
de…cits because there are limits to how large deviations from Ricardian equiva-
lence the composition-of-tax-base argument alone can deliver. Indeed, taking into
account the sizeable life expectancy of the average citizen, Poterba and Summers
(1987) point out that the composition-of-tax-base argument by itself delivers only
modest deviations if the issue is timing of taxes over the business cycle.
Another concern is that in the real world, taxes tend to be distortionary and
not lump sum. On the one hand, this should not be seen as an argument against
the possible theoretical validity of the Ricardian equivalence proposition. The
reason is that Ricardian equivalence (in its strict meaning) claims absence of
allocational e¤ects of changes in the timing of lump-sum taxes.
On the other hand, in a wider perspective the interesting question is, of course,
how changes in the timing of distortionary taxes is likely to a¤ect resource allo-
cation. Consider …rst income taxes. When taxes are proportional to income or
progressive (average tax rate rising in income), they provide insurance through re-
ducing the volatility of after-tax income. The fall in taxes in a recession thus helps
6.10 Exercises
6.xx In the OLG model of Section 6.6.2, derive (6.37) and (6.38).
6.? In the OLG model of Section 6.6.2, show that for t = 1; 2; 3; : : : ; public debt
along the “alternative path” evolves according to Bt0 = [(2 + n)=(1 + n)] L1 (1 +
n)t 2 x; where x is the temporary per capita tax cut in period 0. Hint: given
the information in Section 6.6.2 you may start by deriving a …rst-order di¤erence
equation in bt Bt =Yt with constant coe¢ cients. The information that the
“reference path" has a balanced budget for all t should be taken into account. In
addition, you should explain - and apply - that the initial condition is b1 = B1 =Y1
= (2 + n)x= [f (k)(1 + n)2 ] :
6.?? Consider the OLG model of Section 6.6.2. a) Show that if the temporary
per capita tax cut, x; is su¢ ciently small, the debt can be completely wiped out
through a per capita tax increase in only periods 1 and 2. b) Investigate how in
this case the burden of the debt is distributed across generations. Compare with
the alternative debt policy described in the text.
The intertemporal
consumption-saving problem in
discrete and continuous time
In the next two chapters we shall discuss and apply the continuous-time ver-
sion of the basic representative agent model, the Ramsey model. As a preparation
for this, the present chapter gives an account of the transition from discrete time
to continuous time analysis and of the application of optimal control theory to
formalize and solve the household’s consumption/saving problem in continuous
time.
There are many …elds in economics where a setup in continuous time is prefer-
able to one in discrete time. One reason is that continuous time formulations
expose the important distinction in dynamic theory between stock and ‡ows in
a much clearer way. A second reason is that continuous time opens up for appli-
cation of the mathematical apparatus of di¤erential equations; this apparatus is
more powerful than the corresponding apparatus of di¤erence equations. Simi-
larly, optimal control theory is more developed and potent in its continuous time
version than in its discrete time version, considered in Chapter 8. In addition,
many formulas in continuous time are simpler than the corresponding ones in
discrete time (cf. the growth formulas in Appendix A).
As a vehicle for comparing continuous time modeling with discrete time mod-
eling we consider a standard household consumption/saving problem. How does
the household assess the choice between consumption today and consumption in
the future? In contrast to the preceding chapters we allow for an arbitrary num-
ber of periods within the time horizon of the household. The period length may
thus be much shorter than in the previous models. This opens up for capturing
additional aspects of economic behavior and for undertaking the transition to
343
CHAPTER 9. THE INTERTEMPORAL CONSUMPTION-
344 SAVING PROBLEM IN DISCRETE AND CONTINUOUS TIME
(a) the contractors face the same interest rate whether borrowing or lending
(that is, monitoring, administration, and other transaction costs are ab-
sent);
(b) there are many contractors on each side and none of them believe to be able
to in‡uence the interest rate (the contractors are price takers in the loan
market);
(c) there are no borrowing restrictions other than the requirement on the part
of the borrower to comply with her …nancial commitments;
(d) the lender faces no default risk (the borrower can somehow cost-less be
forced to repay the debt with interest on the conditions speci…ed in the
contract).
2. any payment stream can be converted into another one with the same
present value;
Consider a payment stream fxt gTt=01 over T periods, where xt is the payment
in currency at the end of period t. Period t runs from time t to time t + 1 for t
= 0; 1; :::; T 1: We ignore uncertainty and so it is the interest rate on a riskless
loan from time t to time t + 1: Then the present value, P V0 , as seen from the
beginning of period 0, of the payment stream is de…ned as1
x0 x1 xT 1
P V0 = + + + : (9.1)
1 + i0 (1 + i0 )(1 + i1 ) (1 + i0 )(1 + i1 ) (1 + iT 1)
If Ms. Jones is entitled to the income stream fxt gTt=01 and at time 0 wishes
to buy a durable consumption good of value P V0 , she can borrow this amount
and use a part of the income stream fxt gTt=01 to repay the debt with interest over
the periods t = 0; 1; 2; :::; T 1. In general, when Jones wishes to have a time
pro…le on the payment stream di¤erent from the income stream, she can attain
this through appropriate transactions in the loan market, leaving her with any
stream of payments of the same present value as the given income stream.
Real versus nominal rate of return In this chapter we maintain the as-
sumption of perfect competition in all markets, i.e., households take all prices as
given from the markets. In the absence of uncertainty, the various assets (real
capital, stocks, loans etc.) in which households invest give the same rate of return
in equilibrium. The good which is traded in the loan market can be interpreted
as a (riskless) bond. The borrower issues bonds and the lender buys them. In
this chapter all bonds are assumed to be short-term, i.e., one-period bonds. For
every unit of account borrowed at the end of period t 1, the borrower pays back
with certainty (1 + short-term interest rate) units of account at the end of period
t: If a borrower wishes to maintain debt through several periods, new bonds are
issued at the end of the current period and the obtained loans are spent rolling
over the older loans at the going market interest rate. For the lender, who lends
in several periods, this is equivalent to o¤ering a variable-rate demand deposit
like in a bank.2
Our analysis will be in real terms, that is, in‡ation-corrected terms. In prin-
ciple the unit of account is a …xed bundle of consumption goods. In the simple
macroeconomic models to be studied in this and most subsequent chapters, such
1
We use “present value” as synonymous with “present discounted value”. As usual our
timing convention is such that P V0 denotes the time-0 value of the payment stream, including
the discounted value of the payment (or dividend) indexed by 0.
2
Unless otherwise speci…ed, this chapter uses terms like “loan market” and “bond market”
interchangeably. As uncertainty is ignored, this is legitimate.
a bundle is reduced to one consumption good. The models simply assume there
is only one consumption good in the economy. In fact, there will only be one
produced good, “the” output good, which can be used for both consumption and
capital investment. Whether our unit of account is seen as the consumption good
or the output good is thus immaterial.
The real (net) rate of return on an investment is the rate of return in units
of the output good. More precisely, the real rate of return in period t; rt ; is the
(proportionate) rate at which the real value of an investment, made at the end
of period t 1; has grown after one period.
The link between this rate of return and the more commonplace concept of a
nominal rate of return is the following. Imagine that at the end of period t 1
you make a bank deposit of value Vt euro. The real value of the deposit when
you invest is then Vt =Pt 1 ; where Pt 1 is the price in euro of the output good at
the end of period t 1: If the nominal short-term interest rate is it ; the deposit is
worth Vt+1 = Vt (1 + it ) euro at the end of period t: By de…nition of rt ; the factor
by which the deposit in real terms has expanded is
where t (Pt Pt 1 )=Pt 1 is the in‡ation rate in period t: So the real (net)
rate of return on the investment is rt = (it t )=(1 + t ) it t for it and t
“small”. The number 1 + rt is called the real interest factor and measures the
rate at which current units of output can be traded for units of output one period
later.
In the remainder of this chapter we will think in terms of real values and
completely ignore monetary aspects of the economy.
The period utility function is assumed to satisfy u0 (c) > 0 and u00 (c) < 0. As
explained in Box 9.1, only linear positive transformations of the period utility
function are admissible.
As (9.3) indicates, the number 1 + tells how many units of utility in the next
period the household insists on “in return”for a decrease of one unit of utility in
the current period. So, a > 0 will re‡ect that if the chosen level of consumption
is the same in two periods, then the individual always appreciates a marginal
unit of consumption higher if it arrives in the earlier period. This explains why
is named the rate of time preference or, even more to the point, the rate of
impatience. The utility discount factor, 1=(1 + )t , indicates how many units of
utility the household is at most willing to give up in period 0 to get one additional
unit of utility in period t.3
It is generally believed that human beings are impatient and that should
therefore be assumed positive.4 There is, however, a growing body of evidence
suggesting that the utility discount rate is typically not constant, but declining
with the time distance from the current period to the future periods within the
horizon. This phenomenon is referred to as “present bias” or, with a more tech-
nical term, “hyperbolic discounting”. Macroeconomics often, as a …rst approach,
3
Multiplying through in (9.3) by (1 + ) 1 would make the objective function appear in a
way similar to (9.1) in the sense that also the …rst term in the sum becomes discounted. At the
same time the ranking of all possible alternative consumption paths would remain una¤ected.
For ease of notation, however, we use the form (9.3) which is more standard. Economically,
there is no di¤erence.
4
If uncertainty were included in the model, (1 + ) 1 might be interpreted as (roughly)
re‡ecting the probability of surviving to the next period. In this perspective, > 0 is de…nitely
a plausible assumption.
ignores the problem and assumes a constant to keep things simple. We will
generally follow this practice.
For many issues the size of is immaterial. Except when needed, we shall
therefore not impose any other constraint on than the de…nitional requirement
in discrete time that > 1:
meaning that the household, having a utility discount factor 1=(1 + ) = 0:95,
0
strictly prefers consuming (c0 ; c1 ) to (c0 ; c01 ) in the …rst two periods, if and only
if the utility di¤erences satisfy the indicated inequality. (The notation x y
means that x is strictly preferred to y:)
Only a linear positive transformation of the utility function u; that is,
v(c) = au(c) + b, where a > 0, leaves the ranking of all possible alternative
T 1
consumption paths, fct gt=0 , unchanged. This is because a linear positive
transformation does not a¤ect the ratios of marginal utilities (the marginal
rates of substitution across time).
ensuring
aT = 0; (9.8)
the terminal optimality condition, necessary when u0 (c) > 0 for all c 0 (satu-
ration impossible).
To obtain …rst-order conditions we put the partial derivatives of U~ w.r.t. at+1 ,
t = 0; 1;. . . ; T 2; equal to 0:
@ U~ t
= (1 + ) u0 (ct ) ( 1) + (1 + ) 1 u0 (ct+1 )(1 + rt+1 ) = 0:
@at+1
Reordering gives the Euler equations describing the trade-o¤ between consump-
tion in two succeeding periods,
in the optimal plan (due to u00 < 0): Absent uncertainty the optimal plan entails
either increasing, constant, or decreasing consumption over time depending on
whether the rate of time preference is below, equal to, or above the rate of return
on saving.
Optimality requires that the left-hand side of this equation vanishes for t = T .
So we can solve for c0 :
" #
1+r X
T 1
1+r
c0 = PT 1 i a0 + (1 + r) (i+1) wi = PT 1 i (a0 + h0 ); (9.14)
i=0 1+r i=0 i=0 1+r
where we have inserted the human wealth of the household (present value of
expected lifetime labor income) as seen from time zero:
X
T 1
(i+1)
h0 = (1 + r) wi : (9.15)
i=0
Thus (9.14) says that initial consumption is proportional to initial total wealth,
the sum of …nancial wealth and human wealth at time 0: To allow for positive
consumption we need a0 + h0 > 0:
In (9.14) is not one of the original parameters, but a derived parameter. To
express the consumption function only in terms of the original parameters, not
that, by (9.14), the propensity to consume out of total wealth depends on:
( T
X
T 1 i 1 ( 1+r )
= 1 1+r
when 6= 1 + r; (9.16)
i=0
1+r T when = 1 + r;
6
In later sections of this chapter we let the time horizon of the decision maker go to in…nity.
To ease convergence of an in…nite sum of discounted utilities, it is an advantage not to have to
bother with additive constants in the period utilities and therefore we write the CRRA function
as c1 =(1 ) instead of the form, (c1 1)=(1 ); introduced in Chapter 3. As implied by
Box 9.1, the two forms represent the same preferences.
where the result for 6= 1 + r follows from the formula for the sum of a …nite
geometric series. Inserting this together with (9.13) into (9.14), we end up with
the expression
8
1=
>
< (1+r)[1 (1+T )= (1+r)
1= (1 )=
] 1+r
1 (1+ ) (1+r) (1 )T = (a 0 + h0 ) when 1+
6= 1 + r;
c0 = 1= (9.17)
>
: 1+r (a0 + h0 ) when 1+r
= 1 + r:
T 1+
There is another approach to the household’s saving problem. With its choice of
consumption plan the household must act in conformity with its intertemporal
budget constraint (IBC for short). The present value of the consumption plan
(c1 ; :::; cT 1 ), as seen from time zero, is
X
T 1
ct
P V (c0 ; c1 ; :::; cT 1) t
: (9.19)
t=0 =0 (1 + r )
This value cannot exceed the household’s total initial wealth, a0 + h0 : So the
household’s intertemporal budget constraint is
X
T 1
ct
t
a0 + h0 : (9.20)
t=0 =0 (1 + r )
In…nite time horizon In the Ramsey model of the next chapter the idea is
used that households may have an in…nite time horizon. One interpretation of
this is that parents care about their children’s future welfare and leave bequests
accordingly. This gives rise to a series of intergenerational links. The household
is then seen as a family dynasty with a time horizon beyond the lifetime of
the current members of the family. Barro’s bequest model in Chapter 7 is an
application of this idea. Given a su¢ ciently large rate of time preference, it is
ensured that the sum of achievable discounted utilities over an in…nite horizon is
bounded from above.
One could say, of course, that in…nity is a long time. The sun will eventually,
in some billion years, burn out and life on earth become extinct. Nonetheless,
there are several reasons that an in…nite time horizon may provide a convenient
substitute for …nite but remote horizons. First, in many cases the solution to
an optimization problem for T “large” is in a major part of the time horizon
close to the solution for T ! 1.7 Second, an in…nite time horizon tends to ease
aggregation because at any future point in time, remaining time is still in…nite.
Third, an in…nite time horizon may be a convenient notion when in any given
7
The turnpike proposition in Chapter 8 exempli…es this.
period there is a always a positive probability that there will be a next period to
be concerned about. This probability may be low, but this can be re‡ected in a
high e¤ective utility discount rate. This idea will be applied in chapters 12 and
13.
We may perform the transition to in…nite horizon by letting T ! 1 in the ob-
jective function, (9.4) and the intertemporal budget constraint, (9.20). On might
think that, in analogy of (9.8) for the case of …nite T; the terminal optimality
condition for the case of in…nite horizon is limT !1 aT = 0: This is generally not
so, however. The reason is that with in…nite horizon there is no …nal date where
all debt must be settled. The terminal optimality condition in the present prob-
lem is simply that the intertemporal budget constraint should hold with strict
equality.
As with …nite time horizon, the saving problem with in…nite time horizon
may alternatively be framed in terms of a series of dynamic period-by-period
budget identities, in the form (9.6), together with the borrowing limit known as
the No-Ponzi-Game condition:
t 1 1
lim at i=0 (1 + ri ) 0:
t!1
ai+1 ai = s i ; a0 given,
a(t)
_ = s(t); a(0) = a0 given, (9.21)
where s(t) is the saving ‡ow (saving intensity) at time t. For t “small”we have
the approximation a(t) a(t) _ t = s(t) t: In particular, for t = 1 we have
a(t) = a(t + 1) a(t) s(t):
As time unit choose one year. Going back to discrete time, if wealth grows at
a constant rate g per year, then after i periods of length one year, with annual
compounding, we have
With t still denoting time measured in years passed since date 0, we have i = nt
periods. Substituting into (9.23) gives
gt
g 1 n
a(t) = ant = a0 (1 + )nt = a0 (1 + )m ; where m :
n m g
where e is a mathematical constant called the base of the natural logarithm and
de…ned as e limm!1 (1 + 1=m)m ' 2.7182818285....
The formula (9.24) is the continuous-time analogue to the discrete time for-
mula (9.22) with annual compounding. A geometric growth factor is replaced by
an exponential growth factor, egt ; and this growth factor is valid for any t in the
time interval ( 1 ; 2 ) for which the growth rate of a equals the constant g ( 1
and 2 being some positive real numbers).
We can also view the formulas (9.22) and (9.24) as the solutions to a di¤erence
equation and a di¤erential equation, respectively. Thus, (9.22) is the solution to
the linear di¤erence equation ai+1 = (1 + g)ai , given the initial value a0 : And
(9.24) is the solution to the linear di¤erential equation a(t)
_ = ga(t); given the
initial condition a(0) = a0 : Now consider a time-dependent growth rate, g(t); a
continuous function of t: The corresponding di¤erential equation is a(t)
_ = g(t)a(t)
and it has the solution Rt
a(t) = a(0)e 0 g( )d ; (9.25)
Rt
where the exponent, 0 g( )d , is the de…nite integral of the function g( ) from 0
to t: The result
Rt
(9.25) is called the accumulation formula in continuous time and
the factor e 0 )d is called the growth factor or the accumulation factor.10
g(
the instantaneous nominal interest rate is i(t); we have V_ (t)=V (t) = i(t) and so,
by the fraction rule in continuous time (cf. Appendix A),
a(t)
_ V_ (t) P_ (t)
r(t) = = = i(t) (t); (9.26)
a(t) V (t) P (t)
where (t) P_ (t)=P (t) is the instantaneous in‡ation rate. In contrast to the
corresponding formula in discrete time, this formula is exact. Sometimes i(t) and
r(t) are referred to as the nominal and real force of interest.
Calculating the terminal value of the deposit at time t1 > t0 ; given its value at
time t0 and assuming no withdrawal in the time interval [t0 ; t1 ], the accumulation
formula (9.25) immediately yields
R t1
r(t)dt
a(t1 ) = a(t0 )e t0
:
When calculating present values in continuous time, we use compound dis-
counting. We reverse the accumulation formula and go from the compounded or
terminal value to the present value, a(t0 ). Similarly, given a consumption plan
(c(t))tt=t
1
0
, the present value of this plan as seen from time t0 is
Z t1
PV = c(t) e rt dt; (9.27)
t0
where K(t) is the capital stock, I(t) is the gross investment at time t and 0
is the (physical) capital depreciation rate. Unlike in discrete time, here > 1 is
conceptually allowed. Indeed, suppose for simplicity that I(t) = 0 for all t 0;
then (9.28) gives K(t) = K0 e t . This formula is meaningful for any 0:
Usually, the time unit used in continuous time macro models is one year (or, in
business cycle theory, rather a quarter of a year) and then a realistic value of
is of course < 1 (say, between 0.05 and 0.10). However, if the time unit applied
to the model is large (think of a Diamond-style OLG model), say 30 years, then
> 1 may …t better, empirically, if the model is converted into continuous time
with the same time unit. Suppose, for example, that physical capital has a half-
life of 10 years. With 30 years as our time unit, inserting into the formula 1=2
= e =3 gives = (ln 2) 3 ' 2:
In many simple macromodels, where the level of aggregation is high, the
relative price of a unit of physical capital in terms of the consumption good is
1 and thus constant. More generally, if we let the relative price of the capital
good in terms of the consumption good at time t be p(t) and allow p(t)
_ 6= 0; then
we have to distinguish between the physical depreciation of capital, ; and the
economic depreciation, that is, the loss in economic value of a machine per time
unit. The economic depreciation will be d(t) = p(t) p(t);
_ namely the economic
value of the physical wear and tear (and technological obsolescence, say) minus
the capital gain (positive or negative) on the machine.
Other variables and parameters that by de…nition are bounded from below
in discrete time analysis, but not so in continuous time analysis, include rates of
return and discount rates in general.
Figure 9.1: With t small the integral of s(t) from t0 to t0 + t the hatched area.
From the point of view of the economic contents, the choice between discrete
time and continuous time may be a matter of taste. Yet, everything else equal, the
clearer distinction between stocks and ‡ows in continuous time than in discrete
time speaks for the former. From the point of view of mathematical convenience,
the continuous time formulation, which has worked so well in the natural sciences,
is preferable. At least this is so in the absence of uncertainty. For problems where
uncertainty is important, discrete time formulations are easier to work with unless
one is familiar with stochastic calculus.11
where the right-hand side of the inequality is the present value of the expected
future primary saving.12 By the de…nition in (9.32), we see that this requirement
is identical to the intertemporal budget constraint (IBC) which consequently
expresses solvency.
This equation in itself is just a dynamic budget identity. It tells how much
and in which direction the …nancial wealth is changing due to the di¤erence
between current income and current consumption. The equation per se does
not impose any restriction on consumption over time. If this equation were the
only “restriction”, one could increase consumption inde…nitely by incurring an
increasing debt without limits. It is not until we add the requirement of solvency
that we get a constraint. When T < 1, the relevant solvency requirement is
a(T ) 0 (that is, no debt is left over at the terminal date). This is equivalent to
satisfying the intertemporal budget constraint (9.31).
When T = 1; the relevant solvency requirement is the No-Ponzi-Game con-
dition Rt
lim a(t)e 0 r( )d 0: (NPG)
t!1
This condition says that the present value of debts, measured as a(t); in…nitely
far out in the future, is not permitted to be positive. We have the following
equivalency:
PROPOSITION 1 (equivalence of NPG condition and intertemporal budget con-
straint) Let the time horizon be in…nite and assume that the integral (9.32)
remains …nite for T ! 1. Then, given the accounting relation (9.34), we have:
(i) the requirement (NPG) is satis…ed if and only if the intertemporal budget
constraint, (IBC), is satis…ed; and
(ii) there is strict equality in (NPG) if and only if there is strict equality in (IBC).
Proof. See Appendix C.
The condition (NPG) does not preclude that the household, or family dynasty,
can remain in debt. This would also be an unnatural requirement as the dynasty
is in…nitely-lived. The condition does imply, however, that there is an upper
bound for the speed whereby debt can increase in the long term. The NPG
condition says that in the long term, debts are not allowed to grow at a rate as
high as (or higher than) the interest rate.
To understand the implication, consider the case with a constant interest rate
r > 0. Assume that the household at time t has net debt d(t) > 0, i.e., a(t)
d(t) < 0. If d(t) were persistently growing at a rate equal to or greater than
the interest rate, (NPG) would be violated.13 Equivalently, one can interpret
(NPG) as an assertion that lenders will only issue loans if the borrowers in the
long run cover their interest payments by other means than by taking up new
_
loans. In this way, it is avoided that d(t) rd(t) in the long run. In brief, the
borrowers are not allowed to run a Ponzi Game.
13 _
Starting from a given initial positive debt, d0 ; when d(t)=d(t) r > 0; we have d(t) d0 ert
so that d(t)e rt d0 > 0 for all t 0. Consequently, a(t)e rt = d(t)e rt d0 < 0 for
all t 0; that is, lim t!1 a(t)e rt < 0; which violates (NPG).
Let I denote the time interval [0; T ] if T < 1 and the time interval [0; 1)
if T = 1: If c(t) and the corresponding evolution of a(t) ful…l (9.36) and (9.37)
for all t 2 I as well as the relevant solvency condition, we call (a(t); c(t))Tt=0 an
admissible path. If a given admissible path (a(t); c(t))Tt=0 solves the problem, it is
referred to as an optimal path.14 We assume that w(t) > 0 for all t: No condition
on the impatience parameter is imposed (in this chapter).
First-order conditions
The solution procedure for this problem is as follows:15
14
The term “path”, sometimes “trajectory”, is common in the natural sciences for a solution
to a di¤erential equation because one may think of this solution as the path of a particle moving
in two- or three-dimensional space.
15
The four-step solution procedure below is applicable to a large class of dynamic optimization
problems in continuous time, see Math tools.
where is the adjoint variable (also called the co-state variable) associated
with the dynamic constraint (9.37).16 That is, is an auxiliary variable
which is a function of t and is analogous to the Lagrange multiplier in
static optimization.
2. At every point in time, we maximize the Hamiltonian w.r.t. the control
variable. Focusing on an interior optimal path,17 we calculate
@H
= u0 (c) = 0:
@c
For every t 2 I we thus have the condition
4. We now apply the Maximum Principle which applied to this problem says:
an interior optimal path (a(t); c(t))Tt=0 will satisfy that there exits a contin-
uous function = (t) such that for all t 2 I; (9.39) and (9.40) hold along
the path, and the transversality condition,
is satis…ed.
16
The explicit dating of the time-dependent variables a, c; and is omitted where not needed
for clarity.
17
A path, (at ; ct )Tt=0 ; is an interior path if for no t 2 [0; T ) ; (at ; ct ) is at a boundary point of
the set of admissible values. In the present case where at is not constrained, except at t = T;
(at ; ct )Tt=0 ; is an interior path if ct > 0 for all t 2 [0; T ) :
r +_
= : (9.41)
This can be interpreted as a no-arbitrage condition. The left-hand side gives the
actual rate of return, measured in utility units, on the marginal unit of saving.
Indeed, r can be seen as a dividend and _ as a capital gain. The right-hand side
is the required rate of return in utility units, : Along an optimal path the two
must coincide. The household is willing to save the marginal unit of income only
up to the point where the actual return on saving equals the required return.
We may alternatively write the no-arbitrage condition as
_
r= : (9.42)
On the left-hand-side appears the actual real rate of return on saving and on
the right-hand-side the required real rate of return. The intuition behind this
condition can be seen in the following way. Suppose Mr. Jones makes a deposit
of V utility units in a “bank”that o¤ers a proportionate rate of expansion of the
utility value of the deposit equal to i (assuming no withdrawal occurs), i.e.,
V_
= i:
V
18
Recall, a shadow price (measured in some unit of account) of a good is, from the point of
view of the buyer, the maximum number of units of account that the optimizing buyer is willing
to o¤er for one extra unit of the good.
This is the actual utility rate of return, a kind of “nominal interest rate”. To
calculate the corresponding “real interest rate”let the “nominal price”of a con-
sumption good be utility units. Dividing the number of invested utility units,
V; by ; we get the real value, m = V = ; of the deposit at time t. The actual real
rate of return on the deposit is therefore
m
_ V_ _ _
r= = =i : (9.43)
m V
Mr. Jones is just willing to save the marginal unit of income if this actual
real rate of return on saving equals the required real rate, that is, the right-hand
side of (9.42); in turn this necessitates that the “nominal interest rate”, i; in
(9.43) equals the required nominal rate, : The formula (9.43) is analogue to the
discrete-time formula (9.2) except that the unit of account in (9.43) is current
utility while in (9.2) it is currency.
The transversality condition (TVC) is a terminal optimality condition. We
could, for the case T < 1; have expressed it on the equivalent form
T
a(T ) (T )e = 0;
since e T > 0 always. This form has the advantage of being “parallel” to the
transversality condition for the case T = 1: More importantly, the transversal-
ity condition has a¢ nity with the principle of complementary slackness in linear
and nonlinear programming. Let us spell out in general terms. Consider the case
T < 1. Interpret the solvency condition a(T ) 0 as just an example of a general
terminal constraint a(T ) aT , where a(T ) is the terminal value of some general
state variable with a nonnegative shadow price (T ); besides, aT is an arbitrary
real number. Continuing this line of thought, interpret (9.35) as an abstract cri-
terion function and c(t) as an abstract control variable with control region R and
with the property that a higher value of c(t) makes a(t)
_ smaller. Then “comple-
mentary slackness”is the principle that given the terminal constraint a(T ) aT ,
the terminal optimality condition must be (a(T ) aT ) (T ) = 0: The intuition
is that if the shadow price (T ) > 0 (a “slackness”), then optimality requires
a(T ) = aT : Indeed, in this case a(T ) > aT has an avoidable positive opportunity
cost. On the other hand, if a(T ) > aT is optimal (another “slackness”), then the
shadow price must be nil, i.e., (T ) = 0: There is “complementary slackness” in
the sense that at most one of the weak inequalities a(T ) aT and (T ) 0 can
be strict in optimum.
Anyway, returning to the household’s saving problem, the transversality con-
dition becomes more concrete if we insert (9.39). For the case T < 1; we then
have
a(T )u0 (c(T ))e T = 0: (9.44)
Since u0 (c(T ))e T is always positive, an optimal plan obviously must satisfy a(T )
= aT = 0. The reason is that, given the solvency requirement a(T ) 0, the only
alternative to a(T ) = 0 is a(T ) > 0: But this would imply that the level of the
consumption path could be raised, and U0 thereby be increased, by allowing a
decrease in a(T ) without violating the solvency requirement.
Now, write the solvency requirement as a(T )e T 0 and let T ! 1: Then
in the limit the solvency requirement takes the form of (NPG) above (replace T
by t), and (9.44) is replaced by
lim a(T )u0 (c(T ))e T
= 0: (9.45)
T !1
This says the same as (TVC) above. Intuitively, a plan that violates this condition
by having “>”instead “=”indicates scope for improvement and thus cannot be
optimal. There would be “purchasing power left for eternity”. This purchasing
power could be transferred to consumption on earth at an earlier date.
Generally, care must be taken when extending a necessary transversality con-
dition from a …nite to an in…nite horizon. But for the present problem, the
extension is valid. To see this, note that by Proposition 1, strict inequality in
the (NPG) condition is (by Proposition1) equivalent to strict inequality in the in-
tertemporal budget constraint (IBC). Such a path can always be improved upon
by raising c(t) a little in some time interval without decreasing c(t) in any other
time interval and without violating the (NPG) and (IBC). Hence, an optimal
plan must have strict equality in both NPG and IBC. This amounts to requiring
that none of these two conditions is “over-satis…ed”. And this requirement can
be shown to be equivalent to the condition (TVC) above. Indeed:
PROPOSITION 2 (the household’s necessary transversality condition with in-
…nite time horizon) Let T ! 1 in the criterion function (9.35) and assume
the human wealth integral (9.32) converges (and thereby remains bounded) for
T ! 1. Provided the adjoint variable, (t), satis…es the …rst-order conditions
(9.39) and (9.40), (TVC) holds if and only if (NPG) holds with strict equality.
Proof. See Appendix D.
In view of this proposition, we can write the transversality condition for T !
1 as the NPG condition with strict equality:
Rt
r( )d
lim a(t)e 0 = 0: (TVC’)
t!1
In view of the equivalence of the NPG condition with strict equality and the IBC
with strict equality, established in Proposition 1, the transversality condition for
T ! 1 can also be written
Z 1 Rt
c(t)e 0 r( )d dt = a0 + h0 : (IBC’)
0
c(t)
_ 1
= (r(t) ); (9.46)
c(t) (c(t))
where (c) is the (absolute) elasticity of marginal utility w.r.t. consumption,
c 00
(c) u (c) > 0: (9.47)
u0 (c)
As in discrete time, (c) indicates the strength of the consumer’s preference for
consumption smoothing. The inverse of (c) measures the instantaneous in-
tertemporal elasticity of substitution in consumption, which in turn indicates the
willingness to accept variation in consumption over time when the interest rate
changes, see Appendix F.
The result (9.46) says that an optimal consumption plan is characterized in
the following way. The household will completely smooth i.e., even out
consumption over time if the rate of time preference equals the real interest rate.
The household will choose an upward-sloping time path for consumption if and
only if the rate of time preference is less than the real interest rate. In this case
the household will have to accept a relatively low level of current consumption
with the purpose of enjoying higher consumption in the future. The higher the
real interest rate relative to the rate of time preference, the more favorable is
it to defer consumption everything else equal. The proviso is important. In-
deed, in addition to the negative substitution e¤ect on current consumption of a
Figure 9.2: Optimal consumption paths for a low and a high constant , given a constant
r> .
higher interest rate there is a positive income e¤ect due to the present value of
a given intertemporal consumption plan being reduced by a higher interest rate
(see (IBC)). On top of this comes a negative wealth e¤ect due to a higher interest
rate causing a lower present value of expected future labor earnings (again see
(IBC)). The special case of a CRRA utility function provides a convenient agenda
for sorting these details out, see Example 1 in Section 9.5.
By (9.46) we also see that the greater the elasticity of marginal utility (that
is, the greater the curvature of the utility function), the greater the incentive to
smooth consumption for a given value of r(t) . The reason for this is that a
strong curvature means that the marginal utility will drop sharply if consumption
increases, and will rise sharply if consumption decreases. Fig. 9.2 illustrates this
in the CRRA case where (c) = ; a positive constant. For a given constant
r > ; the consumption path chosen when is high has lower slope, but starts
from a higher level, than when is low.
The condition (9.46), which holds for all t within the time horizon whether this
is …nite or in…nite, is referred to as the Keynes-Ramsey rule. The name springs
from the English mathematician Frank Ramsey who derived the rule in 1928,
while his mentor, John Maynard Keynes, suggested a simple and intuitive way
of presenting it. The rule is the continuous-time counterpart to the consumption
Euler equation in discrete time.
The Keynes-Ramsey rule re‡ects the general microeconomic principle that
the consumer equates the marginal rate of substitution between any two goods to
the corresponding price ratio. In the present context the principle is applied to a
situation where the “two goods”refer to the same consumption good delivered at
two di¤erent dates. In Section 9.2 we used the principle to interpret the optimal
saving behavior in discrete time. How can the principle be translated into a
continuous time setting?
The evolution of wages and interest rates which prevails in general equilibrium
is not arbitrary, however. It is determined by the requirement of equilibrium.
In turn, of course existence of an equilibrium imposes restrictions on the utility
discount rate relative to the potential growth in instantaneous utility. We shall
return to these aspects in the next chapter.
EXAMPLE 1 (constant elasticity of marginal utility; in…nite time horizon). In
the problem in Section 9.4.2 with T = 1; we consider the case where the elasticity
of marginal utility (c); as de…ned in (9.47), is a constant > 0. From Appendix
A of Chapter 3 we know that this requirement implies that up to a positive linear
transformation the utility function must be of the form:
c1
; when > 0; 6= 1;
u(c) = 1 (9.52)
ln c; when = 1:
This is our familiar CRRA utility function. In this case the Keynes-Ramsey rule
_ = 1 (r(t)
implies c(t) )c(t): Solving this linear di¤erential equation yields
1
Rt
c(t) = c(0)e 0 (r( ) )d
; (9.53)
cf. the general accumulation formula, (9.25).
We know from Proposition 2 that the transversality condition is equivalent
to the NPG condition being satis…ed with strict equality, and from Proposition 1
we know that this condition is equivalent to the intertemporal budget constraint
being satis…ed with strict equality, i.e.,
Z 1 Rt
c(t)e 0 r( )d dt = a0 + h0 ; (IBC’)
0
This result can be used to determine c(0):20 Substituting (9.53) into (IBC’) gives
Z 1 R
t 1
c(0) e 0 [ (r( ) ) r( )]d dt = a0 + h0 :
0
In this special case the marginal propensity to consume is time independent and
equal to the rate of time preference. For a given total wealth, a0 + h0 , current
consumption is thus independent of the expected path of the interest rate. That
is, in the logarithmic case the substitution and pure income e¤ects on current
consumption exactly o¤set each other. Yet, on top of this comes the negative
wealth e¤ect on current consumption of an increase in the interest rate level.
The present value of future wage incomes becomes lower (similarly with expected
future dividends on shares and future rents in the housing market in a more
general setup). Because of this, h0 (and so a0 + h0 ) becomes lower, which adds
to the negative substitution e¤ect. Thus, even in the logarithmic case, and a
fortiori when < 1; the total e¤ect of an increase in the interest rate level is
unambiguously negative on c(0):
21
By an increase in the interest rate level we mean an upward shift in the time-pro…le of the
interest rate. That is, there is at least one time interval within [0; 1) where the interest rate is
higher than in the original situation and no time interval within [0; 1) where the interest rate
is lower.
0 = [( 1)r + ]= ;
a0 + h0 = a0 + w=r:
w c0
a(t) = ert (1 e rz
) (1 e t
) ; (9.59)
r
(cf. 0 in Example 1) and through their impact on the present value of expected
future labor income (or of expected future dividends on shares or imputed rental
income on owner-occupied houses in a more general setup).23
9.8 Appendix
A. Growth arithmetic in continuous time
Let the variables z; x; and y be di¤erentiable functions of time t: Suppose z(t);
x(t); and y(t) are positive for all t: Then:
PRODUCT RULE z(t) = x(t)y(t) ) z(t)=z(t)
_ = x(t)=x(t)
_ + y(t)=y(t):
_
Proof. Taking logs on both sides of the equation z(t) = x(t)y(t) gives ln z(t) =
ln x(t)+ln y(t). Di¤erentiation w.r.t. t, using the chain rule, gives the conclusion.
is expressed in terms of the arithmetic average, also called the arithmetic mean,
of the growth rates in the time interval [0; t]. This average is de…ned as g0;t
Rt
= (1=t) 0 g( )d : So we can write
which has form similar to (9.24). Similarly, let r0;t denote the arithmetic average
Rt
of the (short-term) interest rates from time 0 to time t; i.e., r0;t = (1=t) 0 r( )d :
Then we can write the present value of the consumption stream (c(t))Tt=0 as P V
RT
= 0 c(t)e r0;t t dt:
The arithmetic average growth rate, g0;t ; coincides with the average compound
growth rate from time 0 to time t; that is, the number g satisfying
1 a1 a2 an
ma + + + mg ; (9.63)
n a0 a1 an 1
where strict inequality holds unless all the n growth factors are identical. Indeed,
when the growth factors are not identical, we have, by Jensen’s inequality,
Xn X
n
'( w i xi ) > wi '(xi );
i=1 i=1
Pn
when ' is strictly concave and i=1 wi = 1; wi 0; i = 1; 2; : : : ; n: So, by (9.63),
Xn
1 ai Xn
1 ai 1X
n
ai
ln ma = ln > ln = ln = ln mg ;
i=1
n ai 1 i=1
n ai 1 n i=1 ai 1
an = a0 (1 + G)n ; (9.64)
which is natural if the compounding behind the data is discrete and occurs annu-
ally. If the compounding is much more frequent, it is in principle better to apply
the exponential growth formula,
an = a0 egn ; (9.65)
ln aan0
g= = ln(1 + G) / G
n
for G “small”, where “/”means “close to”(by a …rst-order Taylor approximation
about G = 0) but “less than”except if G = 0. The intuitive reason for “less than”
is that a given growth force is more powerful when compounding is continuous.
To put it di¤erently: rewriting (1 + G)n into exponential form gives (1 + G)n
= (eln(1+G) )n = egn < eGn ; as ln(1 + G) < G for all G 6= 0.
Anyway, the di¤erence between G and g is usually unimportant. If for example
G refers to the annual GDP growth rate, it will be a small number, and the
di¤erence between G and g immaterial. For example, to G = 0:040 corresponds g
0:039: Even if G = 0:10, the corresponding g is 0:0953. But if G stands for the
in‡ation rate and there is high in‡ation, the di¤erence between G and g will be
substantial. During hyperin‡ation the monthly in‡ation rate may be, say, G =
100%, but the corresponding g will be only 69%.24
a(t)
_ = r(t)a(t) + w(t) c(t); (9.66)
RT Rt
r( )d
Integrate from 0 to T > 0 to get 0
c(t)e 0 dt
Z T Rt Z T Rt Z T Rt
r( )d r( )d r( )d
= w(t)e 0 dt a(t)e
_ 0 dt + r(t)a(t)e 0 dt
0 0 0
Z Rt Rt Z T Rt !
T T
r( )d r( )d r( )d
= w(t)e 0 dt a(t)e 0 a(t)e 0 ( r(t))dt
0 0 0
Z T Rt
r( )d
+ r(t)a(t)e 0 dt
0
Z T Rt RT
r( )d r( )d
= w(t)e 0 dt (a(T )e 0 a(0));
0
where the second equality follows from integration by parts. If we let T ! 1 and
use the de…nition of h0 and the initial condition a(0) = a0 , we get (IBC) if and
only if (NPG) holds. It follows that when (NPG) is satis…ed with strict equality,
so is (IBC), and vice versa.
An alternative proof is obtained by using the general solution to a linear
inhomogeneous …rst-order di¤erential equation and then let T ! 1. Since this
is a more generally applicable approach, we will show how it works and use it
for Claim 1 below (an extended version of Proposition 1) and for the proof of
Proposition 2 in the text. Claim 1 will for example prove useful in Exercise 9.1
and in the next chapter.
CLAIM 1 Let f (t) and g(t) be given continuous functions of time, t: Consider
the di¤erential equation
x(t)
_ = g(t)x(t) + f (t); (9.67)
with x(t0 ) = xt0 ; a given initial value: Then the inequality
Rt
g(s)ds
lim x(t)e t0
0 (9.68)
t!1
is equivalent to Z 1 R
g(s)ds
f ( )e t0
d xt0 : (9.69)
t0
Moreover, if and only if (9.68) is satis…ed with strict equality, then (9.69) is
satis…ed with strict equality.
Proof. The linear di¤erential equation, (9.67), has the solution
Rt Z t Rt
g(s)ds g(s)ds
x(t) = x(t0 )e t0
+ f ( )e d . (9.70)
t0
Rt
g(s)ds
Multiplying through by e t0
yields
Rt Z t R
g(s)ds g(s)ds
x(t)e t0
= x(t0 ) + f ( )e t0
d :
t0
Since x(t0 ) = xt0 ; this is the same as (9.69). We also see that if and only if (9.68)
holds with strict equality, then (9.69) also holds with strict equality.
COROLLARY Let n be a given constant and let
Z 1 R
(r(s) n)ds
ht0 w( )e t0 d ; (9.71)
t0
a(t)
_ = (r(t) n)a(t) + w(t) c(t); where a(t0 ) = at0 , (9.72)
it holds that
Rt Z 1 R
t0 (r(s) n)ds t0 (r(s) n)ds
lim a(t)e 0, c( )e d at0 + ht0 ; (9.73)
t!1 t0
where a strict equality on the left-hand side of “,” implies a strict equality on
the right-hand side, and vice versa.
Proof. In (9.67), (9.68) and (9.69), let x(t) = a(t); g(t) = r(t) n and f (t) =
w(t) c(t). Then the conclusion follows from Claim 1.
From the …rst-order condition (9.39) in Section 9.4 we have (t0 ) = u0 (c(t0 )) > 0
so that (t0 ) in (9.74) can be ignored. Thus, (TVC) in Section 9.4 is equivalent
to the condition that (NPG) in that section is satis…ed with strict equality (let
t0 = 0 = n). This proves Proposition 2 in the text.
1
u00 (ct )c_t = rt ;
u0 (c t)
9.9 Exercises
9.1 We look at a household (or dynasty) with in…nite time horizon. The house-
hold’s labor supply is inelastic and grows at the constant rate n > 0: The house-
hold has a constant rate of time preference > n and the individual instantaneous
utility function is u(c) = c1 =(1 ); where is a positive constant. There is
no uncertainty. The household maximizes the integral of per capita utility dis-
counted at the rate n. Set up the household’s optimization problem. Show
where w(t) is the real wage per unit of labor and otherwise the same notation as
in this chapter is used. Hint: apply the corollary to Claim 1 in Appendix C and
the method of Example 1 in Section 9.5. As to h0 ; start by considering
Z 1 Rt
H0 h0 L0 = w(t)Lt e 0 (r( ) n)d dt
0
391
CHAPTER 10. THE BASIC REPRESENTATIVE AGENT
392 MODEL: RAMSEY
model. Towards the end of the chapter we consider the Ramsey framework in a
setting with an “all-knowing and all-powerful”social planner.
10.1 Preliminaries
We consider a closed economy. Time is continuous. We assume households own
the capital goods and hire them out to …rms at a market rental rate, r^. This
is just to have something concrete in mind. If instead the capital goods were
owned by the …rms using them in production and the capital investment by these
…rms were …nanced by issuing shares and bonds, the conclusions would remain
the same as long as we ignore uncertainty.
The variables in the model are considered as (piecewise) continuous and dif-
ferentiable functions of time, t: Yet, to save notation, we shall write them as wt ;
r^t ; etc. instead of w(t); r^(t), etc. In every short time interval (t, t + t); the in-
dividual …rm employs labor at the market wage wt and rents capital goods at the
rental rate r^t . The combination of labor and capital produces the homogeneous
output good. This good can be used for consumption as well as investment. So
in every short time interval there are at least three active markets, one for the
“all-purpose”output good, one for labor, and one for capital services (the rental
market for capital goods). It may be convenient to imagine that there is also a
market for loans. As all households are alike, however, the loan market will not
be active in general equilibrium.
There is perfect competition in all markets, that is, households and …rms are
price takers. Any need for means of payment money is abstracted away.
Prices are measured in units of the output good.
There are no stochastic elements in the model. We assume households under-
stand exactly how the economy works and can predict the future path of wages
and interest rates. In other words, we assume “rational expectations”. In our
non-stochastic setting this amounts to perfect foresight. The results that emerge
from the model are thereby the outcome of economic mechanisms in isolation
from expectational errors.
Uncertainty being absent, rates of return on alternative assets are in equilib-
rium the same. In spite of the not active loan market, it is usual to speak of this
common rate of return as the real interest rate of the economy. Denoting this
rate rt ; for a given rental rate, r^t ; we have
r^t Kt Kt
rt = = r^t ; (10.1)
Kt
where the right-hand side is the rate of return on holding Kt capital goods,
( 0) being a constant rate of capital depreciation. This relationship may be
dynasty.1 Births (into adult life) do not amount to emergence of new economic
agents with independent interests. Births and population growth are seen as
just an expansion of the size of the already existing families. In contrast, in the
Diamond OLG model births imply entrance of new economic decision makers
whose preferences no-one cared about in advance.
In view of (10.2), U0 can be written as
Z 1
U0 = u(ct )e ( n)t dt; (10.3)
0
This condition says that …nancial wealth far out in the future cannot have a
negative present value. That is, in the long run, debt is at most allowed to rise
1
The descrete-time Barro model of Chapter 7 articulated such an altruistic bequest motive.
In that chapter we also discussed some of the conceptual di¢ culties of the dynasty setup.
2
Since the technology exhibits constant returns to scale, in competitive equilibrium the …rms
make no (pure) pro…t to pay out to their owners.
at a rate less than the real interest rate r: The NPG condition thus precludes
permanent …nancing of the interest payments by new loans.3
The decision problem is: choose a plan (ct )1
t=0 so as to achieve a maximum
of U0 subject to non-negativity of the control variable, c; and the constraints
(10.4) and (10.5). The problem is a slight generalization of the problem studied
in Section 9.4 of the previous chapter.
To solve the problem we shall apply the Maximum Principle. This method can
be applied directly to the problem as stated above or to an equivalent problem
with constraints expressed in per capita terms. Let us follow the latter approach.
From the de…nition at At =Lt we get by di¤erentiation w.r.t. t
Lt A_ t At L_ t A_ t
a_ t = = at n:
L2t Lt
Substitution of (10.4) gives the dynamic budget identity in per capita terms:
where is the adjoint variable associated with the di¤erential equation (10.6).
2) Di¤erentiate H partially w.r.t. the control variable, c; and put the result
equal to zero:
@H
= u0 (c) = 0: (10.8)
@c
3) Di¤erentiate H partially w.r.t. the state variable, a, and put the result
equal to minus the time derivative of plus the e¤ective discount rate (appearing
in the integrand of the criterion function) multiplied by :
@H _ +(
= (r n) = n) : (10.9)
@a
4) Apply the Maximum Principle: an interior optimal path (at ; ct )1t=0 will
satisfy that there exists a continuous function = t such that for all t 0;
(10.8) and (10.9) hold along the path and the transversality condition,
( n)t
lim at t e = 0; (10.10)
t!1
is satis…ed.
The interpretation of these optimality conditions is as follows. The condition
(10.8) can be considered a M C = M B condition (in utility terms). It illustrates
together with (10.9) that the adjoint variable, ; constitutes the shadow price,
measured in current utility, of per head …nancial wealth along the optimal path.
In the di¤erential equation (10.9), n cancels out and rearranging (10.9) gives
r +_
= :
This can be interpreted as a no-arbitrage condition. The left-hand side gives the
actual rate of return, measured in utility units, on the marginal unit of saving:
r can be seen as a dividend and _ as a capital gain. The right-hand side is the
required rate of return in utility units, : The household is willing to save the
marginal unit of income only up to the point where the actual return on saving
equals the required return.
The transversality condition (10.10) says that for t ! 1; the present shadow
value of per capita …nancial wealth should go to zero. Combined with (10.8), the
condition is that
lim at u0 (ct )e ( n)t = 0 (10.11)
t!1
must hold along the optimal path. This requirement is not surprising if we
compare with the case where limt!1 at u0 (ct )e ( n)t > 0: In this case there
would be over-saving; U0 could be increased by reducing the “ultimate” at and
thereby, before eternity, consume more and save less. The opposite case, limt!1
at u0 (ct )e ( n)t < 0; will not even satisfy the NPG condition in view of Proposi-
tion 2 of the previous chapter. In fact, from that proposition we know that the
transversality condition (10.10) is equivalent to the NPG condition (10.7) being
satis…ed with strict equality, i.e.,
Rt
lim at e 0 (rs n)ds
= 0: (10.12)
t!1
Recall that the Maximum Principle gives only necessary conditions for an
optimal plan. But since the Hamiltonian is jointly concave in (a; c) for every t,
the necessary conditions are also su¢ cient, by Mangasarian’s su¢ ciency theorem.
The …rst-order conditions (10.8) and (10.9) give the Keynes-Ramsey rule:
c_t 1
= (rt ); (10.13)
ct (ct )
where (ct ) is the (absolute) elasticity of marginal utility,
ct
(ct ) 0
u00 (ct ) > 0: (10.14)
u (c t)
As we know from previous chapters, this elasticity indicates the consumer’s wish
to smooth consumption over time. The inverse of (ct ) is the elasticity of in-
tertemporal substitution in consumption. It indicates the willingness to vary
consumption over time in response to a change in the interest rate.
Note that the population growth rate, n; does not appear in the Keynes-
Ramsey rule. Going from n = 0 to n > 0 implies that rt is replaced by rt n in the
dynamic budget identity and is replaced by n in the criterion function. Hence
n cancels out in the Keynes-Ramsey rule. Yet n appears in the transversality
condition and thereby also in the level of consumption for given wealth, cf. (10.18)
below.
CRRA utility
In order that the model can accommodate Kaldor’s stylized facts, it should be
capable of generating a balanced growth path. When the population grows at
the same constant rate as the labor force, here n; by de…nition balanced growth
requires that per capita output, per capita capital, and per capita consumption
grow at constant rates. At the same time another of Kaldor’s stylized facts is
that the general rate of return in the economy tends to be constant. But (10.13)
shows that having a constant per capita consumption growth rate at the same
time as r is constant, is only possible if the elasticity of marginal utility does not
vary with c: Hence, it makes sense to assume that the right-hand-side of (10.14)
is a positive constant, . We thus assume that the instantaneous utility function
is of CRRA form:
c1
u(c) = ; > 0; (10.15)
1
here, for = 1; the right-hand side should be interpreted as ln c as explained in
Section 3.3 of Chapter 3.
So our Keynes-Ramsey rule simpli…es to
c_t 1
= (rt ): (10.16)
ct
of wealth. We see that the entire expected future evolution of wages and inter-
est rates a¤ects c0 through 0 : Moreover, 0 is less, the greater is the population
growth rate, n:5 The explanation is that the e¤ective utility discount rate, n; is
less, the greater is n. The propensity to save is greater the more mouths to feed in
the future. The initial saving level will be r0 a0 + w0 c0 = r0 a0 + w0 0 (a0 + h0 ):
gt
In case rt = r for all t and wt = w0 e ; where g < r n; we get 0 =
[( 1)r + n] = and a0 + h0 = a0 + w0 =(r n g):
In the Solow growth model the saving-income ratio is a parameter, a given
constant. The Ramsey model endogenizes the saving-income ratio. Solow’s para-
metric saving-income ratio is replaced by two “deeper” parameters, the rate of
impatience, ; and the desire for consumption smoothing, . As we shall see, the
resulting saving-income ratio will not generally be constant outside the steady
state of the dynamic system implied by the Ramsey model. But …rst we need a
description of production.
10.2.2 Firms
There is a large number of …rms. They have the same neoclassical production
function with CRS,
Yt = F (Ktd ; Tt Ldt ) (10.19)
where Yt is supply of output, Ktd is capital input, and Ldt is labor input, all
measured per time unit, at time t. The superscript d on the two inputs indicates
that these inputs are seen from the demand side. The factor Tt represents the
economy-wide level of technology as of time t and is exogenous. We assume there
is technological progress at a constant rate g ( 0) :
Tt = T0 egt ; T0 > 0: (10.20)
Thus the economy features Harrod-neutral technological progress, as is needed
for compliance with Kaldor’s stylized facts.
Necessary and su¢ cient conditions for the factor combination (Ktd ; Ldt ); where
Ktd > 0 and Ldt > 0; to maximize pro…ts under perfect competition are that
F1 (Ktd ; Tt Ldt ) = r^t ; (10.21)
F2 (Ktd ; Tt Ldt )Tt = wt : (10.22)
Factor markets
In the short term, i.e., for …xed t, the available quantities of labor, Lt = L0 ent ;
and capital, Kt ; are predetermined. The factor markets clear at all points in time,
that is,
Ktd = Kt ; and Ldt = Lt; for all t 0: (10.23)
It is the rental rate, r^t ; and the wage rate, wt ; which adjust (immediately) so that
this is achieved for every t. Aggregate output can be written
where k~t kt =Tt Kt =(Tt Lt ) is the technology-corrected capital labor ratio, also
sometimes just called the “capital intensity”. Substituting (10.23) into (10.21)
and (10.22), we …nd the equilibrium interest rate and wage rate:
@(Tt Lt f (k~t ))
rt = r^t = = f 0 (k~t ) ; (10.25)
@Kt
@(Tt Lt f (k~t )) h i
wt = Tt = f (k~t ) k~t f 0 (k~t ) Tt w(
~ k~t )Tt ; (10.26)
@(Tt Lt )
where k~t is at any point in time predetermined and where in (10.25) we have used
the no-arbitrage condition (10.1).
Capital accumulation
From now we leave out the explicit dating of the variables when not needed for
clarity. By national product accounting we have
K_ = Y C K: (10.27)
Let us check whether we get the same result from the wealth accumulation equa-
tion of the household. Because physical capital is the only asset in the economy,
aggregate …nancial wealth, A; at time t equals the total quantity of capital, K;
A_ = K_ = rK + wL cL
= ~
(f 0 (k) ~
)K + (f (k) ~ 0 (k))T
kf ~ L cL (from (10.25) and (10.26))
= ~ L
f (k)T K cL (by rearranging and use of K kT ~ L)
= F (K; T L) K C=Y C K (by C cL):
We get a concise picture of the dynamics by reducing the model to the minimum
number of coupled di¤erential equations. This minimum number is two. The key
endogenous variables are k~ K=(T L) and c~ C=(T L) c=T . Using the rule
for the growth rate of a fraction, we get
k~ K_ T_ L_ K_
= = (g + n) (from (10.2) and (10.20))
k~ K T L K
F (K; T L) C K
= (g + n) (from (10.27))
K
~
f (k) c~
= ( + g + n) (from (10.24)).
k~
6
Whatever …nancial claims on each other the households might have, they net out for the
household sector as a whole.
c~ c_ T_ 1
= = (rt ) g (from the Keynes-Ramsey rule)
c~ c T
1h 0 ~ i
= f (k) g (from (10.25)).
k~ = f (k)
~ c~ ~
( + g + n)k; k~0 > 0 given, (10.28)
h
1 0 ~ i
c~ = f (k) g c~: (10.29)
Fig. 10.1 is an aid for the construction of the phase diagram in Fig. 10.2.
The curve OEB in Fig. 10.2 represents the points where k~ = 0 and is called the
nullcline for the di¤erential equation (10.28). We see from (10.28) that
k~ = 0 for c~ = f (k)
~ ( + g + n)k~ ~
c~(k): (10.31)
f 0 (k~GR ) = g + n: (10.32)
From (10.28) we see that for points above the k~ = 0 locus we have k~ < 0, whereas
for points below the k~ = 0 locus, k~ > 0. The horizontal arrows in the …gure
~
indicate these directions of movement for k.
7
As the graph is drawn, f (0) = 0; i.e., capital is assumed essential. But none of the conclu-
sions we are going to consider depends on this.
We also need the nullcline for the di¤erential equation (10.29). We see from
(10.29) that
~ = + + g
c~ = 0 for f 0 (k) or c~ = 0: (10.33)
Let k~ > 0 satisfy the equation f 0 (k~ ) = + g: Then the vertical line k~ = k~
represents points where c~ = 0 (and so does of course the horizontal half-line
c~ = 0; k~ 0): For points to the left of the k~ = k~ line we have, according to
(10.29), c~ > 0: And for points to the right of the k~ = k~ line we have c~ < 0:
The vertical arrows in Fig. 10.2 indicate these directions of movement for c~. Four
illustrative examples of solution curves (I, II, III, and IV ) are drawn in the …gure.
Steady state
The point E has coordinates (k~ , c~ ) and represents the unique steady state.8
From (10.33) and (10.31) follows that
From (10.34) it can be seen that the real interest rate in steady state is
r = f 0 (k~ ) = + g: (10.36)
The e¤ective capital-labor ratio satisfying this equation is known as the modi…ed-
golden-rule capital intensity, k~M GR . The modi…ed golden rule is the rule saying
that for a representative agent economy to be in steady state, the capital intensity
must be such that the net marginal productivity of capital equals the required
rate of return, taking into account the pure rate of time preference, ; and the
desire for consumption smoothing, .9
We show below that the steady state is, in a speci…c sense, asymptotically sta-
ble. First we have to make sure, however, that the steady state is consistent with
8
As (10.33) shows, if c~t = 0; then c~ = 0: Therefore, mathematically, point B (if it exists) in
Fig. 10.2 is also a stationary point of the dynamic system. And if f (0) = 0; then according to
(10.29) and (10.31) also the point (0; 0) in the …gure is a stationary point. But these stationary
points have zero consumption forever and are therefore not steady states of any economic
system. From an economic point of view they are “trivial” steady states.
9
The of the Ramsey model corresponds to the intergenerational discount rate R of the
Barro dynasty model in Chapter 7. Indeed, in the discrete time Barro model we have 1 + r
= (1+R)(1+g) ; which, by taking logs on both sides and using …rst-order Taylor approximations
of ln(1 + x) around x = 0 gives r ln(1 + r ) = ln(1 + R) + ln(1 + g) R + g: Recall,
however, that in view of the considerable period length (about 25-30 years) of the Barro model,
this approximation may not be good.
ỹ
ỹ = f (k̃)
δ+g+n
δ + ρ + θg
ỹ = (δ + g + n)k̃
0 k̃
k̃M GR k̃GR ¯
k̃
c̃
c̃˙ = 0
V IV
II
˙
k̃ = 0
c̃∗ E
c̃A A
VI
III
k̃
∗
k̃0 k̃ = k̃M GR k̃GR ¯
k̃
which the maximal value of k~ on the k~ = 0 locus is greater than curve II’s maxi-
mal k~ value.12 We continue lowering c~0 until the path’s maximal k~ value is exactly
equal to k~ . The path which emerges from this, namely the path I, starting at
the point A, is special in that it converges towards the steady-state point E. No
other path starting at the stippled line, k~ = k~0 ; has this property. Paths starting
above A do not, as we just saw. Neither do paths starting below A, like path
III. Either this path never reaches the consumption level c~A in which case it can
not converge to E, of course. Or, after a while its consumption level reaches c~A ;
but at the same time it must have k~ > k~0 : From then on, as long as k~ k~ , for
every c~-value that path III has in common with path I, path III has a higher
k~ and a lower c~ than path I (use (10.28) and (10.29)). Hence, path III diverges
from point E.
Had we considered a value of k~0 > k~ , there would similarly be a unique value
of c~0 such that the path starting from (k~0 , c~0 ) would converge to E (see path IV
in Fig. 10.2).
The point E is a saddle point. By this is meant a steady-state point with
the following property: there exists exactly two paths, one from each side of k~ ,
that converge towards the steady-state point; all other paths (at least starting
in a neighborhood of the steady state) move away from the steady state and
asymptotically approach one of the two diverging paths, the stippled North-West
and South-East curves in Fig. 10.2. The two converging paths together make up
what is known as the stable branch (or stable arm); on their own they are referred
to as saddle paths (sometimes referred to in the singular as the saddle path).13
The stippled diverging paths in Fig. 10.2 together make up the unstable branch
(or unstable arm).
12
As an implication of the uniqueness theorem for di¤erential equations (see Math tools), two
solution paths in the phase plane cannot intersect.
13
An algebraic de…nition of a saddle point, in terms of eigenvalues, is given in Appendix A.
There it is also shown that if limk!0
~
~ = 0; then the saddle path on the left side of the steady
f (k)
state in Fig. 10.2 will start out in…nitely close to the origin.
along the saddle path from above. Paths like V and V I in Fig. 10.2 can be ruled
out because they violate the NPG condition and the transversality condition,
respectively. With this we have shown:
PROPOSITION 1 Assume (A1) and (A2). Let there be a given k~0 > 0: Then
the Ramsey model exhibits a unique equilibrium path, characterized by (k~t , c~t )
converging, for t ! 1, toward a unique steady state with a capital intensity k~
satisfying f 0 (k~ ) = + g: In the steady state the real interest rate is given by
the modi…ed-golden-rule formula, r = + g, the per capita consumption path
is ct = c~ T0 egt ; where c~ = f (k~ ) ( + g + n)k~ ; and the real wage path is wt
= w(~ k~ )T0 egt :
A numerical example based on one year as the time unit: = 2; g = 0:02;
n = 0:01 and = 0:01: Then, r = 0:05 > 0:03 = g + n.
So output per capita, yt Yt =Lt y~t Tt ; tends to grow at the rate of techno-
logical progress, g :
in view of k~t ! 0. This is also true for the growth rate of consumption per capita
and the real wage, since ct c~t Tt and wt = w( ~ k~t )Tt :
The intuition behind the convergence lies in the neoclassical principle of di-
minishing marginal productivity of capital. Starting from a low capital intensity
and therefore a high marginal and average productivity of capital, the resulting
high aggregate saving15 will be more than enough to maintain the capital inten-
sity which therefore increases. But when this happens, the marginal and average
productivity of capital decreases and the resulting saving, as a proportion of the
capital stock, declines until eventually it is only su¢ cient to replace worn-out ma-
chines and equip new “e¤ective”workers with enough machines to just maintain
the capital intensity. If instead we start from a high capital intensity, a similar
story can be told in reverse. In the long run the capital intensity settles down
at the steady-state level, k~ ; where the marginal saving and investment yields
a return as great as the representative household’s willingness to postpone the
marginal unit of consumption. Since the adjustment process is based on capital
accumulation, it is slow. The “speed of adjustment”, in the sense of the propor-
tionate rate of decline per year of the distance to the steady state, k~ k~ ; is
generally assessed to be in the interval (0.02, 0.10), assuming absence of distur-
bances to the system during the adjustment.
15
Saving will be high because the negative substitution and wealth e¤ects on current con-
sumption of the high interest rate dominate the income e¤ect.
The steady state of the model is globally asymptotically stable for arbitrary initial
values of the capital intensity (the phase diagram only veri…es local asymptotic
stability, but the extension to global asymptotic stability is veri…ed in Appendix
A). If k~ is hit by a shock at time 0 (say by a discrete jump in the technology level
T0 ), the economy will converge toward the same unique steady state as before. At
…rst glance this might seem peculiar considering that the steady state is a saddle
point. Such a steady state is unstable for arbitrary values of both coordinates in
the initial point (k~0 ; c~0 ). But the crux of the matter is that it is only the initial k~
that is arbitrary. The model assumes that the decision variable c0 ; and therefore
the value of c~0 c0 =T0 ; immediately adjusts to the given circumstances and
the available information about the future. That is, the model assumes that c~0
always takes the value needed for the household’s transversality condition under
perfect foresight to be satis…ed. This ensures that the economy is initially on
the saddle path, cf. the point A in Fig. 10.2. In the language of di¤erential
equations conditional asymptotic stability is present. The condition that ensures
the stability in our case is the transversality condition.
We shall follow the common terminology in macroeconomics and call a steady
state of a two-dimensional dynamic system (locally) saddle-point stable if:
4. there is a boundary condition on the system such that the diverging paths
are ruled out as solutions.
greater even without any current saving. This discourages current saving and we
end up with lower capital accumulation and lower e¤ective capital intensity in
the long run, hence higher interest rate. It is also true that the higher is g; the
higher is the rate of return needed to induce the saving required for maintaining
a steady state and resist the desire for more consumption smoothing.
The long-run interest rate is independent of the particular form of the ag-
gregate production function, f . This function matters for what e¤ective capital
intensity and what consumption level per unit of e¤ective labor are compatible
with the long-run interest rate. This kind of results are speci…c to representative
agent models. This is because only in these models will the Keynes-Ramsey rule
hold not only for the individual household, but also at the aggregate level.
Unlike the Solow growth model, the Ramsey model provides a theory of the
evolution and long-run level of the saving rate. The endogenous gross saving rate
of the economy is
where the inequality follows from our parameter restriction (A1). Indeed, (A1)
implies + g > g + n: The long-run saving rate depends positively on the
following parameters: the elasticity of production w.r.t. to capital, ; the capital
depreciation rate, ; and the population growth rate, n: The long-run saving rate
depends negatively on the rate of impatience, ; and the desire for consumption
smoothing, : The role of the rate of technological progress is ambiguous.16
A numerical example (time unit = 1 year): If n = 0:005; g = 0:015; = 0:025;
= 3; and = 0:07; then s = 0:21: With the same parameter values except
= 0:05; we get s = 0:19:
It can be shown (see Appendix D) that if, by coincidence, = 1=s ; then
0 ~
s (k) = 0; that is, the saving rate st is also outside of steady state equal to s . In
view of (10.40), the condition = 1=s is equivalent to the “knife-edge”condition
= ( + )= [ ( + g + n) g] : More generally, assuming ( + g + n) > g
(which seems likely empirically), we have that if Q 1=s (i.e., Q ); then s0 (k) ~ Q
0; respectively (and if instead ( +g +n) g; then s0 (k) ~ < 0; unconditionally):17
Data presented in Barro and Sala-i-Martin (2004, p. 15) indicate no trend for
the US saving rate, but a positive trend for several other developed countries
since 1870. One interpretation is that whereas the US has for a long time been
close to its steady state, the other countries are still in the adjustment process
toward the steady state. As an example, consider the parameter values = 0:05;
= 0:02; g = 0:02 and n = 0:01: In this case we get = 10 if = 0:33; given
< 10; these other countries should then have s0 (k) ~ < 0 which, according to the
model, is compatible with a rising saving rate over time only if these countries
are approaching their steady state from above (i.e., they should have k~0 > k~ ):
It may be argued that should also re‡ect the role of education and R&D in
production and thus be higher; with = 0:75 we get = 1:75: Then, if > 1:75;
these countries would have s0 (k)~ > 0 and thus approach their steady state from
below (i.e., k~0 < k~ ):
The constant saving rate implies proportionality between consumption and in-
come. In growth-corrected terms per capita consumption is
For the Ramsey model to yield this, the production function must be like in
(10.39) (i.e., Cobb-Douglas) with > s: And the elasticity of marginal utility, ;
must satisfy = 1=s: Finally, the rate of time preference, ; must be such that
(10.40) holds with s replaced by s; which implies = ( + g + n)=s g:
It remains to show that this satis…es the inequality, n > (1 )g, which is
necessary for existence of an equilibrium in the Ramsey model. Since =s > 1;
the chosen satis…es > +g +n g = n+(1 )g; which was to be proved.
Thus, in this case the Ramsey model generates an equilibrium path which implies
an evolution identical to that generated by the Solow model with s = 1= .18
With this foundation of the Solow model, it will always hold that s = s <
sGR , where sGR is the golden rule saving rate. Indeed, from (10.38) and (10.32),
respectively,
We assume ^ > 0 and ^ n > (1 ^)g in line with the assumption (A1) for
^
the market economy above. In case ^ = 1; the expression ct1 =(1 ^) should
be interpreted as ln ct : No market prices or other elements belonging to the spe-
ci…c market institutions of the economy enter the social planner’s problem. The
dynamic constraint (10.43) re‡ects the national product account. Because the
economy is closed, the social planner does not have the opportunity of borrowing
or lending from abroad. Hence there is no solvency requirement. Instead we just
impose the de…nitional constraint (10.44) of non-negativity of the state variable
~
k.
The problem is the continuous time analogue of the social planner’s problem in
discrete time in Chapter 8. Note, however, a minor conceptual di¤erence, namely
that in continuous time there is in the hshort run no upper bound i on the ‡ow
~ ~
variable ct ; that is, no bound like ct Tt f (kt ) ( + g + n)kt : A consumption
intensity ct which is higher than the right-hand side of this inequality will just be
re‡ected in a negative value of the ‡ow variable k~t :20
19
Possible reasons for allowing these two preference parameters to deviate from the corre-
sponding parameters in the private sector are given Section 8.1.1.
20
As usual we presume that capital can be “eaten”. That is, we consider the capital good to
be instantaneously convertible to a consumption good. Otherwise there would be at any time
an upper bound on c, namely c T f (k); ~ saying that the per capita consumption ‡ow cannot
exceed the per capita output ‡ow. The role of such constraints is discussed in Feichtinger and
Hartl (1986).
c1
^ h c i
~
H(k; c; ; t) = ~
+ f (k) ~
( + g + n)k ;
1 ^ T
where is the adjoint variable associated with the dynamic constraint (10.43).
An interior optimal path (k~t ; ct )1
t=0 will satisfy that there exists a continuous
function = (t) such that, for all t 0;
@H ^ ^
= c = 0; i.e., c = ; and (10.45)
@c T T
@H ~ _
= (f 0 (k) g n) = (^ n) (10.46)
@ k~
hold along the path and the transversality condition,
is satis…ed.21
The condition (10.45) can be seen as a M C = M B condition and illustrates
that t is the social planner’s shadow price, measured in terms of current utility,
of k~t along the optimal path.22 The di¤erential equation (10.46) tells us how this
shadow price evolves over time. The transversality condition, (10.47), together
with (10.45), entails the condition
^
lim k~t ct egt e (^ n)t
= 0;
t!1
where the unimportant factor T0 has been eliminated. Imagine the opposite were
^
true, namely that limt!1 k~t ct e[g (^ n)]t > 0. Then, intuitively U0 could be
increased by reducing the long-run value of k~t , i.e., consume more and save less:
By taking logs in (10.45) and di¤erentiating w.r.t. t, we get ^c=c
_ = _= g:
Inserting (10.46) and rearranging gives the condition
c_ 1 _ 1
= (g ~
) = (f 0 (k) ^): (10.48)
c ^ ^
21
The in…nite-horizon Maximum Principle itself does not guarantee validity of such a straight-
forward extension of a necessary transversality condition from a …nite horizon to an in…nite hori-
zon. Yet, this extension is valid for the present problem when ^ n > (1 ^)g, cf. Appendix
E.
22
Decreasing ct by one unit, increases k~t by 1=Tt units, each of which are worth t utility
units to the social planner.
This is the social planner’s Keynes-Ramsey rule. If the rate of time preference, ^;
~
is lower than the net marginal productivity of capital, f 0 (k) , the social planner
will let per capita consumption be relatively low in the beginning in order to attain
greater per capita consumption later. The lower the impatience relative to the
return to capital, the more favorable it becomes to defer consumption.
Because c~ c=T; we get from (10.48) qualitatively the same di¤erential equa-
tion for c~ as we obtained in the decentralized market economy. And the dynamic
resource constraint (10.43) is of course identical to that of the decentralized mar-
ket economy. Thus, the dynamics are in principle unaltered and the phase dia-
gram in Fig. 10.2 is still valid. The solution of the social planner implies that
the economy will move along the saddle path towards the steady state. This
trajectory, path I in the diagram, satis…es both the …rst-order conditions and
the transversality condition. However, paths such as III in the …gure do not
satisfy the transversality condition of the social planner but imply permanent
over-saving. And paths such as II in the …gure will experience a sudden end
when all the capital has been used up. Intuitively, they cannot be optimal. A
rigorous argument is given in Appendix E, based on the fact that the Hamil-
~ c~): Thence, not only is the saddle path an optimal
tonian is strictly concave in (k;
solution, it is the unique optimal solution.
Comparing with the market solution of the previous section, we have estab-
lished:
PROPOSITION 2 (equivalence theorem) Consider an economy with neoclassical
CRS technology as described above and a representative in…nitely-lived household
with preferences as in (10.3) with u(c) = c1 =(1 ): Assume (A1) and (A2).
~
Let there be a given k0 > 0: Then perfectly competitive markets bring about the
same resource allocation as that brought about by a social planner with the same
criterion function as the representative household, i.e., with ^ = and ^ = .
This is a continuous time analogue to the discrete time equivalence theorem of
Chapter 8.
The capital intensity k~ in the social planner’s solution will not converge to-
wards the golden rule level, k~GR ; but towards a level whose distance to the golden
rule level depends on how much ^ + ^g exceeds the natural growth rate, g + n:
Even if society would be able to consume more in the long term if it aimed for
the golden rule level, this would not compensate for the reduction in current con-
sumption which would be necessary to achieve it. This consumption is relatively
more valuable, the greater is the social planner’s e¤ective rate of time preference,
^ n. In line with the market economy, the social planner’s solution ends up in
a modi…ed golden rule. In the long term, net marginal productivity of capital is
determined by preference parameters and productivity growth and equals ^ + ^g
> g + n: Hereafter, given the net marginal productivity of capital, the capital
intensity and the level of the consumption path is determined by the production
function.
is ethically indefensible and arises merely from the weakness of the imagination”
(Ramsey 1928). So Ramsey has n = = 0: Given the instantaneous utility
0 00
function, u; where u > 0; u < 0; and given = 0; Ramsey’s original problem
was: choose (ct )1t=0 so as to optimize (in some sense, see below)
Z 1
W0 = u(ct )dt s.t.
0
ct 0;
_kt = f (kt ) ct kt ;
kt 0 for all t 0:
The Maximum Principle states that an interior overtaking-optimal path will sat-
isfy that there exists an adjoint variable such that for all t 0 it holds along
this path that
@H
= u0 (c) = 0; and (10.51)
@c
@H _:
= (f 0 (k) )= (10.52)
@k
Since = 0; the Keynes-Ramsey rule reduces to
c_t 1 c
= (f 0 (kt ) ); where (c) u00 (c):
ct (ct ) u0 (c)
lim kt t = 0; (10.53)
t!1
is necessary for optimality but, as we will see below, this turns out to be wrong
in this case with no discounting.
Our assumption (A2) here reduces to limk!0 f 0 (k) > > limk!1 f 0 (k) (which
requires > 0): Apart from this, the phase diagram is fully analogue to that in
Fig. 10.2, except that the steady state, E, is now at the top of the k_ = 0 curve.
This is because in steady state, f 0 (k ) = 0: This equation also de…nes kGR in
this case: It can be shown that the saddle path is again the unique solution to
the optimization problem (the method is essentially the same as in the discrete
time case of Chapter 8). The intuitive background is that failing to approach the
golden rule would imply a forgone “opportunity of in…nite gain”.
A noteworthy feature is that in this case the Ramsey model constitutes a
counterexample to the widespread presumption that an optimal plan with in…nite
horizon must satisfy a transversality condition like (10.53). Indeed, by (10.51),
0 0
t = u (ct ) ! u (c ) for t ! 1 along the overtaking-optimal path (the saddle
path). Thus, instead of (10.53), we get
lim kt t = k u0 (c ) > 0:
t!1
Note also that with zero e¤ective utility discounting, there can not be equi-
librium in the market economy version of this story. The real interest rate would
in the long run be zero and thus the human wealth of the in…nitely-lived house-
hold would be in…nite. But then the demand for consumption goods would be
unbounded and equilibrium thus be impossible.
“The problem is not just that perfect foresight into the inde…nite future
is so implausible away from steady states. The deeper problem is that in
practice if there is any practice miscalculations about the equilibrium
path may not reveal themselves for a long time. The mistaken path gives
no signal that it will be ”ultimately“ infeasible. It is natural to comfort
employed, and the clear purity of illumination with which the writer’s mind is
felt by the reader to play about its subject”.
2. The version of the Ramsey model we have considered is in accordance with
the general tenet of neoclassical preference theory: saving is motivated only by
higher consumption in the future. Extended versions assume that accumulation of
wealth is to some extend an end in itself or perhaps motivated by a desire for social
prestige and economic and political power rather than consumption. In Kurz
(1968b) an extended Ramsey model is studied where wealth is an independent
argument in the instantaneous utility function.
Also Tournemaine and Tsoukis (2008) and Long and Shimomura (2004).
3. The equivalence in the Ramsey model between the decentralized market
equilibrium and the social planner’s solution can be seen as an extension of the
…rst welfare theorem as it is known from elementary textbooks, to the case where
the market structure stretches in…nitely far out in time, and the …nite number of
economic agents (family dynasties) face an in…nite time horizon: in the absence of
externalities etc., the allocation of resources under perfect competition will lead
to a Pareto optimal allocation. The Ramsey model is indeed a special case in that
all households are identical. But the result can be shown in a far more general
setup, cf. Debreu (1954). The result, however, does not hold in overlapping
generations models where an unbounded sequence of new generations enter and
the “interests”of the new households have not been accounted for in advance.
4. Cho and Graham (1996) consider the empirical question whether countries
tend to be above or below their steady state. Based on the Penn World Table
they …nd that on average, countries with a relatively low income per adult are
above their steady state and that countries with a higher income are below.
10.8 Appendix
A. Algebraic analysis of the dynamics around the steady state
To supplement the graphical approach of Section 10.3 with an exact analysis of
the adjustment dynamics of the model, we compute the Jacobian matrix for the
system of di¤erential equations (10.28) - (10.29):
2 3
~ k~ @ k=@~
~ c 0 ~
~ c~) = 4 @ k=@
J(k; 5 = f1 (k) ( + g + n) 1
:
00 ~ 1 0 ~
@ c~=@ k~ @ c~=@~
c f (k)~ c (f (k) + g)
Since the product of the eigenvalues of the matrix equals the determinant, the
eigenvalues are real and opposite in sign.
In standard math terminology a steady-state point in a two dimensional
continuous-time dynamic system is called a saddle point if the associated eigen-
values are opposite in sign.25 For the present case we conclude that the steady
state is a saddle point. This mathematical de…nition of a saddle point is equiv-
alent to that given in the text of Section 10.3. Indeed, with two eigenvalues of
opposite sign, there exists, in a small neighborhood of the steady state, a stable
arm consisting of two saddle paths which point in opposite directions. From the
phase diagram in Fig. 10.2 we know that the stable arm has a positive slope.
At least for k~0 su¢ ciently close to k~ it is thus possible to start out on a saddle
path. Consequently, there is a (unique) value of c~0 such that (k~t ; c~t ) ! (k~ ; c~ ) for
t ! 1. Finally, the dynamic system has exactly one jump variable, c~; and one
predetermined variable, k. ~ It follows that the steady state is (locally) saddle-point
stable.
We claim that for the present model this can be strengthened to global saddle-
point stability. Indeed, for any k~0 > 0, it is possible to start out on the saddle
path. For 0 < k~0 k~ , this is obvious in that the extension of the saddle path
towards the left reaches the y-axis at a non-negative value of c~ . That is to say
that the extension of the saddle path cannot, according to the uniqueness theorem
~
for di¤erential equations, intersect the k-axis for k~ > 0 in that the positive part
~
of the k-axis is a solution of (10.28) - (10.29).26
For k~0 > k~ , our claim can be veri…ed in the following way: suppose, contrary
to our claim, that there exists a k~1 > k~ such that the saddle path does not
intersect that region of the positive quadrant where k~ k~1 . Let k~1 be chosen as
the smallest possible value with this property. The slope, d~ ~ of the saddle
c=dk;
~ ~
path will then have no upper bound when k approaches k1 from the left. Instead
c~ will approach 1 along the saddle path. But then ln c~ will also approach 1
along the saddle path for k~ ! k~1 (k~ < k~1 ): It follows that d ln c~=dk~ = (d~ ~ c,
c=dk)=~
computed along the saddle path, will have no upper bound. Nevertheless, we
25
Note the di¤erence compared to a discrete time system, cf. Appendix D of Chapter 8. In
the discrete time system we have next period’s k~ and c~ on the left-hand side of the dynamic
equations, not the increase in k~ and c~; respectively. Therefore, the criterion for a saddle point
looks di¤erent in discrete time.
26
Because the extension of the saddle path towards the left in Fig. 10.1 can not intersect the
c~-axis at a value of c~ > f (0); it follows that if f (0) = 0; the extension of the saddle path ends
up in the origin.
have
d ln c~ d ln c~=dt c~=~c 1 ~
(f 0 (k) g)
= = = :
dk~ ~
dk=dt ~
f (k) c~ ( + g + n)k~
k~
When k~ ! k~1 and c~ ! 1 , the numerator in this expression is bounded, while
the denominator will approach 1. Consequently, d ln c~=dk~ will approach zero
from above, as k~ ! k~1 . But this contradicts that d ln c~=dk~ has no upper bound,
when k~ ! k~1 . Thus, the assumption that such a k~1 exists is false and our original
hypothesis holds true.
n > (1 )g (A1)
R1 1
holds, then the utility integral, U0 = 0 c1 e ( n)t dt; is bounded, from above
as well as from below, along the steady-state path, ct = c~ Tt . The proof is as
follows. Recall that > 0 and g 0: For 6= 1;
Z 1 Z 1
1 ( n)t
(1 )U0 = ct e dt = (c0 egt )1 e ( n)t dt
0 0
Z 1
c0
= c0 e[(1 )g ( n)]t dt = , (10.54)
0 n (1 )g
which by (A1) is …nite and positive since c0 > 0. If = 1; so that u(c) = ln c; we
get Z 1
U0 = (ln c0 + gt)e ( n)t dt; (10.55)
0
which is also …nite, in view of (A1) implying n > 0 in this case (the exponential
( n)t
term, e ; declines faster than the linear term gt increases). It follows that
also any path converging to the steady state will entail bounded utility, when
(A1) holds.
On the other hand, suppose that (A1) does not hold, i.e., n (1 )g:
Then by the third equality in (10.54) and c0 > 0 follows that (1 )U0 = 1 if
6= 0: If instead = 1; (10.55) implies U0 = 1:
from the expected path (i.e., deviate from the expected “average behavior”in the
economy). We will now show this formally.
We …rst consider a path of type III. A path of this type will not be able _to reach
the horizontal axis in Fig. 10.2. It will only converge towards the point (k; ~ 0) for
t ! 1. This claim follows from the uniqueness theorem for di¤erential equations
with continuously di¤erentiable right-hand sides. The uniqueness implies that
two solution curves cannot intersect. And we see from (10.29) that the positive
part of the_x-axis is from a mathematical point of view a solution curve (and
~ 0) is a trivial steady state). This rules out another solution curve
the point (k;
hitting the x-axis. _ _
~ ~
The convergence of k towards k implies limt!1 rt = f (k) 0 ~
< g + n; where
_
the inequality follows from k~ > k~GR . So,
Rt Rt Rt _
0 ~
lim at e 0 (rs n)ds = lim k~t e 0 (rs g n)ds = lim k~t e 0 (f (ks ) g n)ds ~ 1 > 0:
= ke
t!1 t!1 t!1
(10.56)
Hence the transversality condition of the households is violated. Consequently,
the household will choose higher consumption than along this path and can do
so without violating the NPG condition.
Consider now instead a path of type II. We shall …rst show that if the economy
follows such a path, then depletion of all capital occurs in …nite time. Indeed, in
the text it was shown that any path of type II will pass the k~ = 0 locus in Fig.
10.2. Let t0 be the point in time where this occurs. If path II lies above the k~
= 0 locus for all t 0, then we set t0 = 0. For t > t0 , we have
where the inequality comes from k~t < 0 combined with the fact that k~t < k~GR
implies f 0 (k~t ) > f 0 (k~GR ) = g + n: Therefore, there exists a t1 > t0 0
such that Z t1
k~t = k~t +1 k~t dt = 0;
0
t0
as was to be shown. At time t1 ; k~ cannot fall any further and c~t immediately
drops to f (0) and stay there hereafter.
Yet, this result does not in itself explain why the individual household will
deviate from such a path. The individual household has a negligible impact on
the movement of k~t in society and correctly perceives rt and wt as essentially
independent of its own consumption behavior. Indeed, the economy-wide k~ is
not the household’s concern. What the household cares about is its own …nancial
wealth and budget constraint. In the perspective of the household nothing pre-
vents it from planning a negative …nancial wealth, a; and possibly a continuously
declining …nancial wealth, if only the NPG condition,
Rt
lim at e 0 (rs n)ds
0;
t!1
is satis…ed.
But we can show that paths of type II will violate the NPG condition. The
reasoning is as follows. The household plans to follow the Keynes-Ramsey rule.
Given an expected evolution of rt and wt corresponding to path II, this will
imply a planned gradual transition from positive …nancial wealth to debt. The
transition to positive net debt, d~t a
~t at =Tt > 0, takes place at time t1
de…ned above.
The continued growth in the debt will meanwhile be so fast that the NPG
condition is violated. To see this, note that the NPG condition implies the re-
quirement Rt
lim d~t e 0 (rs g n)ds 0; (NPG)
t!1
that is, the productivity-corrected debt, d~t , is allowed to grow in the long run
only at a rate less than the growth-corrected real interest rate. For t > t1 we get
from the accounting equation a_ t = (rt n)at + wt ct that
c~t ( + g + n)k~t
= f 0 (k~t )[s ]: (10.59)
c~t f (k~t )
c~t 1
= (f 0 (k~t ) g): (10.60)
c~t
There will not generally exist a constant, s; such that the right-hand sides of
(10.59) and (10.60), respectively, are the same for varying k~ (that is, outside
steady state). But Kurz (1968a) showed the following:
CLAIM Let ; g; n; ; and be given. If the elasticity of marginal utility is
greater than 1 and the production function is y~ = Ak~ with 2 (1= ; 1), then a
Ramsey model with = ( + g + n) g will generate a constant saving
rate s = 1= : Thereby the same resource allocation and transitional dynamics
arise as in the corresponding Solow model with s = 1= .
Proof. Let 1= < < 1 and f (k) ~ = Ak~ : Then f 0 (k)
~ = A k~ 1
: The right-hand-
side of the Solow equation, (10.59), becomes
( + g + n)k~t
A k~ 1
[s ] = sA k~ 1
( + g + n): (10.61)
Ak~
The right-hand-side of the Ramsey equation, (10.60), becomes
1 + + g
A k~ 1
:
1
= A k~ 1
( + g + n): (10.62)
gives Rt 0
~
lim k~t e 0 (f (ks ) g n)ds = 0; (10.63)
t!1
^
where we have eliminated the unimportant positive factor 0 = c0 T0 :
Hence, H is strictly concave in (k;~ c) and the saddle path is the unique optimal
solution.
It also follows that the transversality condition (10.47) is a necessary optimal-
ity condition when the parameter restriction ^ n > (1 ^)g holds. Note that
we have had to derive this conclusion in a di¤erent way than when solving the
household’s consumption/saving problem in Section 10.2. There we could appeal
to a link between the No-Ponzi-Game condition (with strict equality) and the
transversality condition to verify necessity of the transversality condition. But
that proposition does not cover the social planner’s problem where there is no
NPG condition.
As to the diverging paths in Fig. 10.2, note that paths of type II (those paths
which, as shown in Appendix C, in …nite time deplete all capital) can not be
optimal, in spite of the temporarily high consumption level. This follows from
the fact that the saddle path is the unique solution. Finally, paths of type III
in Fig. 10.2 behave as in (10.56) and thus violate the transversality condition
(10.47), as claimed in the text.
10.9 Exercises
A glimpse of theory of
the “level of interest rates”
This short note provides a brief sketch of what macroeconomics says about the general
level around which rates of return ‡uctuate. We also give a “broad”summary of di¤erent
circumstances that give rise to di¤erences in rates of return on di¤erent assets.
Di¤erent rates of return In simple neoclassical models with perfect competition and
no uncertainty, the equilibrium short-term real interest rate is at any time equal to the
net marginal productivity of capital (r = @Y =@K ): In turn the marginal productivity
of capital adjusts over time, via changes in the capital intensity, to some long-run level
(on this more below). As we saw in Chapter 14, existence of convex capital installation
costs loosens the link between r and @Y =@K. The convex adjustment costs create a
wedge between the price of investment goods and the market value of the marginal unit
of installed capital. Besides the marginal productivity of capital, the possible capital gain
in the market value of installed capital as well as the e¤ect of the marginal unit of installed
capital on future installation costs enter as co-determinants of the current rate of return
on capital.
1
Arithmetic Standard Geometric
average deviation average
----------------- Percent -----------------
Nominal values
Small Company Stocks 17,3 33,2 12,5
Large Company Stocks 12,7 20,2 10,7
Long-Term Corporate Bonds 6,1 8,6 5,8
Long-Term Government Bonds 5,7 9,4 5,3
Intermediate-Term Government Bonds 5,5 5,7 5,3
U.S. Treasury Bills 3,9 3,2 3,8
Cash 0,0 0,0 0,0
Inflation rate 3,1 4,4 3,1
Real values
Small Company Stocks 13,8 32,6 9,2
Large Company Stocks 9,4 20,4 7,4
Long-Term Corporate Bonds 3,1 9,9 2,6
Long-Term Government Bonds 2,7 10,6 2,2
Intermediate-Term Government Bonds 2,5 7,0 2,2
U.S. Treasury Bills 0,8 4,1 0,7
Cash -2,9 4,2 -3,0
Table 1: Average annual rates of return on a range of U.S. asset portfolios, 1926-2001.
Source: Stocks, Bonds, Bills, and In‡ation: Yearbook 2002, Valuation Edition. Ibbotson
Associates, Inc.
When imperfect competition in the output markets rules, prices are typically set as a
mark-up on marginal cost. This implies a wedge between the net marginal productivity
of capital and capital costs. And when uncertainty and limited opportunities for risk
diversi…cation are added to the model, a wide spectrum of expected rates of return on
di¤erent …nancial assets and expected marginal productivities of capital in di¤erent pro-
duction sectors arise, depending on the risk pro…les of the di¤erent assets and production
sectors. On top of this comes the presence of taxation which may complicate the picture
because of di¤erent tax rates on di¤erent asset returns.
Nominal and real average annual rates of return on a range of U.S. asset portfolios for
the period 1926–2001 are reported in Table 1. By a portfolio of n assets, i = 1; 2; : : : ; n
is meant a “basket”, (v1 ; v2 ; : : : ; vn ); of the n assets in value terms, that is, vi = pi xi is
the value of the investment in asset i; the price of which is denoted pi and the quantity
P
of which is denoted xi . The total investment in the basket is V = ni=1 vi : If Ri denotes
the gross rate of return on asset i; the overall gross rate of return on the portfolio is
Pn
vi Ri X
n
R= i = wi Ri ;
V i=1
2
where wi vi =V is the weight or fraction of asset i in the portfolio. De…ning Ri 1 + ri ;
where ri is the net rate of return on asset i; the net rate of return on the portfolio can be
written
X
n X
n X
n X
n
r=R 1= wi (1 + ri ) 1= wi + wi ri 1= wi ri :
i=1 i=1 i=1 i=1
The net rate of return is often just called “the rate of return”.
In Table 1 we see that the portfolio consisting of small company stocks throughout the
period 1926-2001 had an average annual real rate of return of 13.8 per cent (the arithmetic
average) or 9.2 per cent (the geometric average). This is more than the annual rate of
return of any of the other considered portfolios. Small company stocks are also seen to
be the most volatile. The standard deviation of the annual real rate of return of the
portfolio of small company stocks is almost eight times higher than that of the portfolio
of U.S. Treasury bills (government zero coupon bonds with 30 days to maturity), with
an average annual real return of only 0.8 per cent (arithmetic average) or 0.7 per cent
(geometric average) throughout the period. The displayed positive relation between high
returns and high volatility is not without exceptions, however. The portfolio of long-term
corporate bonds has performed better than the portfolio of long-term government bonds,
although they have been slightly less volatile as here measured. The data is historical and
expectations are not always met. Moreover, risk depends signi…cantly on the covariance
of asset returns within the total set of assets and speci…cally on the correlation of asset
returns with the business cycle, a feature that can not be read o¤ from Table 1. Share
prices, for instance, are very sensitive to business cycle ‡uctuations.
The need for means of payment money is a further complicating factor. That is,
besides dissimilarities in risk and expected return across di¤erent assets, also dissimilar-
ities in their degree of liquidity are important, not least in times of …nancial crisis. The
expected real rate of return on cash holding is minus the expected rate of in‡ation and
is therefore negative in an economy with in‡ation, cf. the last row in Table 1. When
agents nevertheless hold cash in their portfolios, it is because the low rate of return is
compensated by the liquidity services of money. In the Sidrauski model of Chapter 17 this
is modeled in a simple way, albeit ad hoc, by including real money holdings directly as an
argument in the utility function. Another dimension along which the presence of money
interferes with returns is through in‡ation. Real assets, like physical capital, land, houses,
etc. are better protected against ‡uctuating in‡ation than are nominally denominated
bonds (and money of course).
3
Without claiming too much we can say that investors facing such a spectrum of rates
of return choose a composition of assets so as to balance the need for liquidity, the wish
for a high expected return, and the wish for low risk. Finance theory teaches us that
adjusted for di¤erences in risk and liquidity, asset returns tend to be the same. This
raises the question: at what level? This is where macroeconomics as an empirically
oriented theory about the economy as a whole comes in.
the net marginal productivity of capital acts as a centre of gravitation for the spec-
trum of asset returns; and
movements of the rates of return are in the long run held in check by the net marginal
productivity of capital.
Though such phrases seem to convey the right ‡avour, in themselves they are not
very informative. The net marginal productivity of capital is not a given, but an endoge-
nous variable which, via changes in the capital intensity, adjusts through time to more
fundamental factors in the economy.
1. Solow’s growth model The Solow growth model leads to the fundamental di¤er-
ential equation (standard notation)
where s is an exogenous and constant aggregate saving-income ratio, 0 < s < 1: In steady
state
r = f 0 (k~ ) ; (1)
4
where k~ is the unique steady state value of the (e¤ective) capital intensity, k;
~ satisfying
In society there is a debate and a concern that changed demography and less growth
in the source of new technical ideas, i.e., the stock of educated human beings, will in the
future result in lower n and lower g; respectively, making …nancing social security more
di¢ cult: On the basis ~
h of the Solow model i 1we …nd by implicit di¤erentiation in (2) @ k =@n
= @ k~ =@g = k~ + g + n sf 0 (k~ ) ; which is negative since sf 0 (k~ ) < sf (k~ )=k~
= + g + n: Hence, by (1),
@r @r @r @ k~ k~
= = = f 00 (k~ ) > 0;
@n @g @ k~ @n + g + n sf 0 (k~ )
since f 00 (k~ ) < 0: It follows that
n # or g #) r # : (3)
A limitation of this theory is of course the exogeneity of the saving-income ratio, which
is a key co-determinant of k~ ; hence of r : The next models are examples of di¤erent ways
of integrating a theory of saving into the story about the long-run rate of return.
2. The Diamond OLG model In the Diamond OLG model, based on a life-cycle
theory of saving, we again arrive at the formula r = f 0 (k~ ) . Like in the Solow model,
the long-run rate of return thus depends on the aggregate production function and on k~ :
But now there is a logically complete theory about how k~ is determined. In the Diamond
model k~ depends in a complicated way on the lifetime utility function and the aggregate
production function. The steady state of a well-behaved Diamond model will nevertheless
have the same qualitative property as indicated in (3).
3. The Ramsey model Like the Solow and Diamond models, the Ramsey model
implies that rt = f 0 (k~t ) for all t: But unlike in the Solow and Diamond models, the net
marginal productivity of capital now converges in the long run to a speci…c value given
by the modi…ed golden rule formula. In a continuous time framework this formula says:
r = + g; (4)
where the new parameter, ; is the (absolute) elasticity of marginal utility of consumption.
Because the Ramsey model is a representative agent model, the Keynes-Ramsey rule holds
5
not only at the individual level, but also at the aggregate level. This is what gives rise to
this simple formula for r .
Here there is no role for n; only for g: On the other hand, there is an alternative
speci…cation of the Ramsey model, namely the “average utilitarianism” speci…cation. In
this version of the model, we get r = f 0 (k~ ) = + n + g; so that not only a lower
g; but also a lower n implies lower r :
Also the Sidrauski model, i.e., the monetary Ramsey model of Chapter 17, results in
the modi…ed golden rule formula.1
4. Blanchard’s OLG model A continuous time OLG model with emphasis on life-
cycle aspects is Blanchard’s model, Blanchard (1985). In that model the net marginal
productivity of capital adjusts to a value where, in addition to the production function,
technology growth, and preference parameters, also demographic parameters, like birth
rate, death rate, and retirement rate, play a role. One of the results is that when = 1;
+g <r < + g + b;
where is the retirement rate (re‡ecting how early in life the “average” person retire
from the labor market) and b is the (crude) birth rate. The population growth rate is the
di¤erence between the birth rate, b; and the (crude) mortality rate, m; so that n = b m:
The qualitative property indicated in (3) becomes conditional. It still holds if the fall in
n re‡ects a lower b; but not necessarily if it re‡ects a higher m.
5. What if technological change is embodied? The models in the list above assume
a neoclassical aggregate production function with CRS and disembodied Harrod-neutral
technological progress, that is,
This amounts to assuming that new technical knowledge advances the combined pro-
ductivity of capital and labor independently of whether the workers operate old or new
machines.
6
not participate in subsequent technological progress. Both intuition and empirics suggest
that most technological progress is of this form. Indeed, Greenwood et al. (1997) estimate
for the U.S. 1950-1990 that embodied technological change explains 60% of the growth in
output per man hour.
So a theory of the rate of return should take this into account. Fortunately, this can
be done with only minor modi…cations. We assume that the link between investment and
capital accumulation takes the form
K_ t = Qt It Kt ; (6)
Qt = Q0 e t ; > 0:
Then, even if no technological change directly appears in the production function, that
is, even if (5) is replaced by
the economy will still experience a rising standard of living.2 A given level of gross
investment will give rise to greater and greater additions to the capital stock K; measured
in e¢ ciency units. Since at time t; Qt capital goods can be produced at the same cost as
one consumption good, the price, pt ; of capital goods in terms of the consumption good
must in competitive equilibrium equal the inverse of Qt ; that is, pt = 1=Qt : In this way
embodied technological progress results in a steady decline in the relative price of capital
equipment.
This prediction is con…rmed by the data. Greenwood et al. (1997) …nd for the U.S.
that the relative price of capital equipment has been declining at an average rate of 0:03
per year in the period 1950-1990, a trend that has seemingly been forti…ed in the wake of
the computer revolution.
Along a balanced growth path the constant growth rate of K will now exceed that
of Y; and Y =K thus be falling. The output-capital ratio in value terms, Y =(pK); will be
constant, however. Embedding these features in a Ramsey-style framework, we …nd the
2
We specify F to be Cobb-Douglas, because otherwise a model with embodied technical progress in
the form (6) will not be able to generate balanced growth and comply with Kaldor’s stylized facts.
7
long-run rate of return to be3
r = + :
1
This is of the same form as (4) since growth in output per unit of labor in steady state is
exactly g = =(1 ):
Adding uncertainty and risk of bankruptcy Although absent from many simple
macroeconomic models, uncertainty and risk of bankruptcy are signi…cant features of
reality. Bankruptcy risk may lead to a con‡ict of interest between share owners and
managers. Managers may want less debt and more equity than the share owners because
bankruptcy can be very costly to managers who loose a well-paid job and a promising
carrier. So managers are unwilling to …nance all new capital investment by new debt in
spite of the associated lower capital cost (there is generally a lower rate of return on debt
than on equity). In this way the excess of the rate of return on equity over that on debt,
the equity premium, is sustained.
A rough behavioral theory of the equity premium goes as follows.4 Firm managers
prefer a payout structure with a fraction, sf ; going to equity and the remaining fraction,
1 sf ; to debt (corporate bonds). That is, out of each unit of expected operating pro…t,
managers are unwilling to commit more than 1 sf to bond owners. This is to reduce the
risk of a failing payment ability in case of a bad market outcome. And those who …nance
…rms by loans de…nitely also want debtor …rms to have some equity at stake.
As a crude adaptation of for instance the Blanchard OLG model to these features, we
interpret the model’s r as an average rate of return across …rms. Let time be discrete
and let aggregate …nancial wealth be A = pK; where p is the price of capital equipment
in terms of consumption goods. In the frameworks 1 to 4 above we have p 1; but in
framework 5 the relative price p equals 1=Q and is falling over time. Anyway, given A
at time t; the aggregate gross return or payout is (1 + r )A. Out of this, (1 + r )Asf
constitutes the gross return to the equity owners and (1 + r )A(1 sf ) the gross return
3
See Exercise 18.??
4
The following is inspired by Baker, DeLong, and Krugman (2005). These authors discuss the implied
predictions for U.S. rates of return in the future and draw implications of relevance for the debate on
social security reform.
8
to the bond owners. Let re denote the rate of return on equity and rb the rate of return
on bonds.
(1 + re )Ash = (1 + r )Asf ;
(1 + rb )A(1 sh ) = (1 + r )A(1 sf ):
Thus,
sf
1 + re = (1 + r ) >1+r ;
sh
1 sf
1 + rb = (1 + r ) <1+r :
1 sh
We may de…ne the equity premium, ; by 1 + (1 + re )=(1 + rb ): Then
sf (1 sh )
= 1 > 0:
sh (1 sf )
Of course these formulas have their limitations. The key variables sf and sh will
depend on a lot of economic circumstances and should be endogenous in an elaborate
model. Yet, the formulas may be helpful as a way of organizing one’s thoughts about
rates of return in a world with asymmetric information and risk of bankruptcy.
There is evidence that in the last decades of the twentieth century the equity premium
had become lower than in the long aftermath of the Great Depression in the 1930s.5 A
likely explanation is that sh had gone up, along with rising con…dence. The computer
and the World Wide Web have made it much easier for individuals to invests in stocks of
shares. On the other hand, the recent …nancial and economic crisis, known as the Great
Recession 2007- , and the associated rise in mistrust seems to have halted and possibly
reversed this tendency for some time (source ??).
5
Blanchard (2003, p. 333).
9
Chapter 11
The Ramsey representative agent framework has, rightly or wrongly, been a work-
horse for the study of many macroeconomic issues. Among these are public …-
nance themes and themes relating to endogenous productivity growth. In this
chapter we consider issues within these two themes. Section 11.1 deals with a
market economy with a public sector. The focus is on general equilibrium e¤ects
of government spending and taxation, including e¤ects of shifts in …scal policy,
both anticipated and unanticipated shifts. In Section 11.2 we set up and analyze
a model of technology growth based on learning by investing. The analysis leads
to a characterization of a “…rst-best policy”.
431
432 CHAPTER 11. APPLICATIONS OF THE RAMSEY MODEL
We see from (11.2) that leisure does not enter the instantaneous utility func-
tion. So per capita labor supply is exogenous. We …x its value to be one unit of
labor per time unit, as is indicated by (11.3).
De…ning k~t Kt =(Tt Lt ) kt =Tt and c~t Ct =(Tt Lt ) ct =Tt ; the dynamic
aggregate resource constraint (11.4) can be written
f (0) = 0; ~ = 1;
lim f 0 (k) ~ = 0:
lim f 0 (k)
~
k!0 ~
k!1
de…nition, k~GR :
In general equilibrium the real interest rate, rt ; equals f 0 (k~t ) : Expressed
in terms of c~; the Keynes-Ramsey rule thus becomes
1h 0 ~ i
c~t = f (kt ) g c~t : (11.8)
The phase diagram of the dynamic system (11.7) - (11.8) is shown in Fig.
11.1 where, to begin with, the k~ = 0 locus is represented by the stippled inverse
U curve. Apart from a vertical downward shift of the k~ = 0 locus, when we
have ~ > 0 instead of ~ = 0; the phase diagram is similar to that of the Ramsey
model without government. Although the per capita lump-sum tax is not visible
in the reduced form of the model consisting of (11.7), (11.8), and (11.9), it is
indirectly present because it ensures that for all t 0; the c~t and k~t appearing
in (11.7) represent exactly the consumption demand and net saving coming from
the households’intertemporal budget constraint (which depends on the lump-sum
tax, cf. (11.11). Otherwise, equilibrium would not be maintained.
We assume ~ is of “moderate size” compared to the productive capacity of
the economy so as to not rule out the existence of a steady state. Moreover, to
as derived in the previous chapter. The upward shift in public spending is accom-
panied by higher lump-sum taxes, 0t = ~ 0 Lt ; forever, implying that ht is reduced,
which in turn reduces consumption.
Had the unanticipated shift in public spending been downward, say from ~ 0 to
~ ; the e¤ect would be an upward jump in consumption but no change in k; ~ that
is, a jump E’to E in Fig. 11.1.
Many kinds of disturbances of a steady state will result in a gradual adjust-
ment process, either to a new steady state or back to the original steady state. It
is otherwise in this example where there is an immediate jump to a new steady
state.
where, if at < 0; the tax acts as a rebate. As above, xt is a per capita lump-sum
transfer. In view of a balanced budget, we have at the aggregate level
Gt + xt Lt = r rt Kt :
As Gt and r are given, the interpretation is that for all t 0; transfers adjust so
as to balance the budget. This requires that xt = r rt kt Gt =Lt = r rt kt ~ Tt :
The No-Ponzi-Game condition is now
Rt
lim at e 0 [(1 r )rs n]ds
0;
t!1
c_t 1
= [(1 r )rt ]:
ct
3
In fact, as we shall see in Section 11.1.4, the key point is not that, to …x ideas, we have
assumed the budget is balanced for every t: It is enough that the government satis…es its
intertemporal budget constraint.
+ g
f 0 (k~ ) = > + g > g + n;
1 r
where the last inequality comes from the parameter condition (A1). Because
f 00 < 0, k~ is lower than if r = 0. Consequently, in the long run consumption
is lower as well.4 The resulting resource allocation is not Pareto optimal. There
exist an alternative technically feasible resource allocation that makes everyone
in society better o¤. This is because the capital income tax implies a wedge
between the marginal rate of transformation over time in production, f 0 (k~t ) ,
and the marginal rate of transformation over time to which consumers adapt,
0 ~
(1 r )(f (kt ) ).
Until time t0 the economy has been in steady state with a tax-transfer scheme
based on some given constant tax rate, r ; on capital income. At time t0 , un-
expectedly, the government introduces a new tax-transfer scheme, involving a
higher constant tax rate, 0r , on capital income, i.e., 0 < r < 0r < 1: The path of
spending on goods and services remains unchanged, i.e., Gt = ~ Tt Lt for all t 0:
The lump-sum transfers, xt ; are raised so as to maintain a balanced budget. We
assume it is credibly announced that the new tax-transfer scheme will be adhered
to forever. So households expect the real after-tax interest rate (rate of return
0
on saving) to be (1 r )rt for all t t0 :
For t < t0 the dynamics are governed by (11.7) and (11.14) with 0 < r < 1:
The corresponding steady state, E, has k~ = k~ and c~ = c~ as indicated in the
4
In the Diamond OLG model a capital income tax, which …nances lump-sum transfers to the
old generation, has an ambiguous e¤ect on capital accumulation, depending on whether < 1
or > 1, cf. Exercise 5.?? in Chapter 5.
phase diagram in Fig. 11.2. The new tax-transfer scheme ruling after time t0
shifts the steady state point to E’with k~ = k~ 0 and c~ = c~ . The new c~ = 0 line
0
and the new saddle path are to the left of the old, i.e., k~ 0 < k~ : Until time t0
the economy is at the point E. Immediately after the shift in the tax on capital
income, equilibrium requires that the economy is on the new saddle path. So
there will be a jump from point E to point A in Fig. 11.2.
This upward jump in consumption is intuitively explained the following way.
We know that individual consumption immediately after the policy shock satis…es
Two e¤ects are present. First, both the higher transfers and the lower after-
tax rate of return after time t0 contribute to a higher ht0 ; there is thereby a
positive wealth e¤ect on current consumption through a higher ht0 . Second, the
propensity to consume, t0 ; will generally be a¤ected. If < 1; the reduction in
the after-tax rate of return will have a positive e¤ect on t0 : The positive e¤ect
on t0 when < 1 re‡ects that the positive substitution e¤ect on ct0 of a lower
after-tax rate of return dominates the negative income e¤ect. If instead > 1;
the positive substitution e¤ect on ct0 is dominated by the negative income e¤ect.
Whatever happens to t0 ; however, the phase diagram shows that in general
equilibrium there will necessarily be an upward jump in ct0 . We get this result
even if is much higher than 1. The explanation lies in the assumption that all
the extra tax revenue obtained by the rise in r is immediately transferred back
to the households lump sum, thereby strengthening the positive wealth e¤ect
0
on current consumption through the lower discount rate implied by (1 r )rz
< (1 r )rz :
In response to the rise in r ; we thus have c~t0 > f (k~t0 ) ( +g +n)k~t0 ; implying
~ which thus begins to fall. This results in lower
that saving is too low to sustain k,
real wages and higher before-tax interest rates, that is two negative feedbacks
on human wealth. Could these feedbacks not fully o¤set the initial tendency for
(after-tax) human wealth to rise? The answer is no, see Box 11.1.
As indicated by the arrows in Fig. 11.2, the economy moves along the new
saddle path towards the new steady state E’. Because k~ is lower in the new
steady state than in the old, so is c~: The evolution of the technology level, T; is
by assumption exogenous; thus, also actual per capita consumption, c c~ T; is
lower in the new steady state.
Box 11.1. A mitigating feedback can not instantaneously fully o¤set the
force that activates it.
Can the story told by Fig. 11.2 be true? Can it be true that the net e¤ect of
the higher tax on capital income is an upward jump in consumption at time
t0 as indicated in Fig. 11.2? Such a jump means that c~t0 > f (k~t0 )
( + g + n)k~t0 and the resulting reduced saving will make the future k lower
than otherwise and thereby make expected future real wages lower and
expected future before-tax interest rates higher. Both feedbacks partly
counteract the initial upward shift in human wealth due to higher transfers
and a lower e¤ective discount rate that were the direct result of the rise in
w : Could the two mentioned counteracting feedbacks fully o¤set the initial
tendency for (after-tax) human wealth, and therefore current consumption, to
rise?
The phase diagram says no. But what is the intuition? That the two feed-
backs can not fully o¤set (or even reverse) the tendency for (after-tax) human
wealth to rise at time t0 is explained by the fact that if they could, then the two
feedbacks would not be there in the …rst place. We cannot at the same
time have both a rise in the human wealth that triggers higher consumption
(and thereby lower saving and investment in the economy) and a neutrali-
zation, or a complete reversal, of this rise in the human wealth caused by
the higher consumption. The two feedbacks can only partly o¤set the initial
tendency for human wealth to rise.
Instead of all the extra tax revenue obtained being transferred back lump sum
to the households, we may alternatively assume that a major part of it is used to
…nance a rise in government consumption to the level G0t = ~ 0 Tt Lt ; where ~ 0 > ~ :5
In addition to the leftward shift of the c~ = 0 locus this will result in a downward
shift of the k~ = 0 locus. The phase diagram would look like a convex combination
of Fig. 11.1 and Fig. 11.2. Then it is possible that the jump in consumption at
time t0 becomes downward instead of upward.
Returning to the case where the extra tax revenue is fully transferred, the
next subsection splits the change in taxation policy into two events.
Until time t0 the economy has been in steady state with a tax-transfer scheme
based on some given constant tax rate, r ; on capital income. At time t0 , unex-
pectedly, the government announces that a new tax-transfer policy with 0r > r
is to be implemented at time t1 > t0 . We assume people believe in this announce-
ment and that the new policy is implemented at time t1 as announced. The shock
to the economy is now not the event of a higher tax being implemented at time
t1 ; this event is expected after time t0 : The shock occurs at time t0 in the form
of the unexpected announcement. The path of spending on goods and services
remains unchanged throughout, i.e., Gt = ~ Tt Lt for all t 0:
The phase diagram in Fig. 11.3 illustrates the evolution of the economy for
t t0 : There are two time intervals to consider. For t 2 [t1 ; 1) ; the dynamics
are governed by (11.7) and (11.14) with r replaced by 0r ; starting from whatever
value obtained by k~ at time t1 :
In the time interval [t0 ; t1 ) ; however, the “old dynamics”, with the lower tax
rate, r ; in a sense still hold. Yet the path the economy follows immediately after
time t0 is di¤erent from what it would be without the information that capital
income will be taxed heavily from time t1 , where also transfers will become higher.
Indeed, the expectation of a lower after-tax interest rate until time t1 ; combined
with higher transfers from time t1 implies higher present value of future labor and
transfer income. Already at time t0 this induces an upward jump in consumption
to the point C in Fig. 11.3 because people feel more wealthy.
Since the low r rules until time t1 ; the point C is below the point A, which
is the same as that in Fig. 11.2. How far below? The answer follows from the
fact that there cannot be an expected discontinuity of marginal utility at time t1 ;
since that would contradict the preference for consumption smoothing over time
It is understood that also ~ 0 is not larger than what allows a steady state to exist. Moreover,
5
the government budget is still balanced for all t so that any temporary surplus or shortage of
tax revenue, 0r rt Kt G0t ; is immediately transferred or collected lump-sum.
implied by u00 (c) < 0 (strict concavity of the instantaneous utility function) and
re‡ected in the Keynes-Ramsey rule. To put it di¤erently: the shift to 0r does
not occur immediately, as in (11.15), but in the future, and as long as the shift
is known to occur at a given time in the future. The shift, when it takes place,
namely at the announced time t1 , will not trigger a jump in human wealth; ht1 :6
Hence, at time t1 ; there will be no jump in consumption, ct1 .
The intuitive background for this is that a consumer will never plan a jump
in consumption. To see this, consider a consumption path in the time inter-
val (t0 ; t2 ); where t2 > t1 : Suppose there is a discontinuity in ct at time t1 . In
view of the strict concavity of the utility function, there would then be gains to
be obtained by smoothing out consumption. Recalling the optimality condition
u0 (ct1 ) = t1 ; we could also say that along an optimal path there can be no ex-
pected discontinuity in the shadow price of …nancial wealth, t1 . This is analogue
to the fact that in an asset market, arbitrage rules out the existence of a generally
expected jump in the price of the asset to occur at some future time t1 . If we
imagine the expected jump is upward, an in…nite positive rate of return could
be obtained by buying the asset immediately before the jump. This generates
excess demand of the asset before time t1 and drives its price up in advance thus
preventing an expected upward jump at time t1 . And if we on the other hand
imagine the expected jump is downward, an in…nite negative rate of return could
be avoided by selling the asset immediately before the jump. This generates ex-
6
Replace t0 in the formula for human wealth in (11.15) by some t 2 (t0 ; t1 ); and consider ht
as the sum of the integrals from t to t1 and from t1 to 1; respectively, and let then t approach
t1 from below.
cess supply of the asset before time t1 and drives its price down in advance thus
preventing an expected downward jump at t1 .
To avoid existence of an expected discontinuity in consumption, the point C
on the vertical line k~ = k~ in Fig. 11.3 must be such that, following the “old
dynamics”, it takes exactly t1 t0 time units to reach the new saddle path. This
dictates a unique position of the point C between E and A. If C were at a lower
position, the journey to the saddle path would take longer than t1 t0 : And if C
were at a higher position, the journey would not take as long as t1 t0 :
Immediately after time t0 , k~ will be decreasing (because saving is smaller than
what is required to sustain a constant k); ~ and c~ will be increasing in view of the
Keynes-Ramsey rule, since the rate of return on saving is above + g as long
as k~ < k~ and r low. Precisely at time t1 the economy reaches the new saddle
path, the high taxation of capital income begins, and the after-tax rate of return
becomes lower than + g: Hence, per-capita consumption begins to fall and the
economy gradually approaches the new steady state E’.
This analysis illustrates that when economic agents’ behavior depend on
forward-looking expectations, a credible announcement of a future change in pol-
icy has an e¤ect already before the new policy is implemented. Such e¤ects are
known as announcement e¤ects or anticipation e¤ects.
Once again we change the scenario. The economy with low capital taxation
has been in steady state up until time t0 . Then a new tax-transfer scheme is
unexpectedly introduced. At the same time it is credibly announced that the high
taxes on capital income and the corresponding transfers will cease at time t1 > t0 .
The path of spending on goods and services remains unchanged throughout, i.e.,
Gt = ~ Tt Lt for all t 0:
The phase diagram in Fig. 11.4 illustrates the evolution of the economy for
t t0 : For t t1 ; the dynamics are governed by (11.7) and (11.14), again with
the old r ; starting from whatever value obtained by k~ at time t1 :
In the time interval [t0 ; t1 ) the “new, temporary dynamics” with the high 0r
and high transfers hold sway. Yet the path that the economy takes immediately
after time t0 is di¤erent from what it would have been without the information
that the new tax-transfers scheme is only temporary. Indeed, the expectation of
a shift to a higher after-tax rate of return and cease of high transfers as of time
t1 implies lower present value of expected future labor and transfer earnings than
without this information. Hence, the upward jump in consumption at time t0 is
smaller than in Fig. 11.2. How much smaller? Again, the answer follows from
the fact that there can not be an expected discontinuity of marginal utility at time
t1 ; since that would violate the principle of smoothing of planned consumption.
Thus the point F on the vertical line k~ = k~ in Fig. 11.4 must be such that,
following the “new, temporary dynamics”, it takes exactly t1 time units to reach
the solid saddle path in Fig. 11.4 (which is in fact the same as the saddle path
before time t0 ). The implied position of the economy at time t1 is indicated by
the point G in the …gure.
Immediately after time t0 , k~ will be decreasing (because saving is smaller than
what is required to sustain a constant k) ~ and c~ will be decreasing in view of the
Keynes-Ramsey rule in a situation with an after-tax rate of return lower than
+ g. Precisely at time t1 , when the temporary tax-transfers scheme based
on 0r is abolished (as announced and expected), the economy reaches the solid
saddle path. From that time the return on saving is high both because of the
abolition of the high capital income tax and because k~ is relatively low. The
general equilibrium e¤ect of this is higher saving, and so the economy moves
along the solid saddle path back to the original steady-state point E.
There is a last case to consider, namely an anticipated temporary shift in r :
We leave that for an exercise, see Exercise 11.??
de…cit:
B_ t = rt Bt + Gt + Xt T~t : (11.16)
As we ignore uncertainty, on its debt the government has to pay the same interest
rate, rt ; as other borrowers.
Along an equilibrium path in the Ramsey model the long-run interest rate
necessarily exceeds the long-run GDP growth rate. As we saw in Chapter 6, to
remain solvent, the government must then, as a debtor, ful…l a solvency require-
ment analogous to that of the households in the Ramsey model:
Rt
rs ds
lim Bt e 0 0: (11.17)
t!1
This NPG condition says that the debt is in the long run allowed to grow at most
at a rate less than the interest rate. As in discrete time, given the accounting
relationship (11.16), the NPG condition is equivalent to the intertemporal budget
constraint Z 1 Z 1
Rt Rt
(Gt + Xt )e 0 rs ds
dt T~t e 0 rs ds dt B0 : (11.18)
0 0
This says that the present value of the credibly planned public expenditure cannot
exceed government net wealth consisting of the present value of the expected
future tax revenues minus initial government debt, i.e., assets minus liabilities.
Assuming that the government does not want to be a net creditor to the
private sector in the long run, it will not collect more taxes than is necessary to
satisfy (11.18). Hence, we replace “ ”by “=”and rearrange to obtain
Z 1 Rt
Z 1 Rt
~
Tt e 0 rs ds
dt = (Gt + Xt )e 0 rs ds dt + B0 : (11.19)
0 0
Thus, for a given path of Gt and Xt ; the stream of the expected tax revenue
must be such that its present value equals the present value of total liabilities on
the right-hand-side of (11.19). A temporary budget de…cit leads to more debt
and therefore also higher taxes in the future. A budget de…cit merely implies a
deferment of tax payments. The condition (11.19) can be reformulated as
Z 1 Rt
(T~t Gt Xt )e 0 rs ds dt = B0 ;
0
showing that if net debt is positive today, then the government has to run a
positive primary budget surplus (that is, T~t Gt Xt > 0) in a su¢ ciently long
time in the future.
We will now show that when taxes are lump sum, then Ricardian equivalence
holds in the Ramsey model with a public sector.7 That is, a temporary tax
7
It is enough that just those taxes that are varied in the thought experiment are lump-sum.
cut will have no consequences for aggregate consumption. The time pro…le of
lump-sum taxes does not matter.
Consider the intertemporal budget constraint of the representative household,
Z 1 Rt
ct Lt e 0 rs ds dt A0 + H0 = K0 + B0 + H0 ; (11.20)
0
where H0 is human wealth of the household. This says, that the present value of
the planned consumption stream can not exceed the total wealth of the household.
In the optimal plan of the household, we have strict equality in (11.20).
Let t denote the lump-sum per capita net tax: Then, T~t Xt = t Lt and
Z 1 Rt
Z 1 Rt
H0 = h0 L0 = (wt t )Lt e 0 rs ds
dt = (wt Lt + Xt T~t )e 0 rs ds dt
0 0
Z 1 Rt
= (wt Lt Gt )e 0 rs ds dt B0 ; (11.21)
0
where the last equality comes from rearranging (11.19). It follows that
Z 1 Rt
B0 + H0 = (wt Lt Gt )e 0 rs ds dt:
0
We see that the time pro…les of transfers and taxes have fallen out. What matters
for total wealth of the forward-looking household is just the spending on goods
and services, not the time pro…le of transfers and taxes. A higher initial debt
has no e¤ect on the sum, B0 + H0 ; because H0 ; which incorporates transfers
and taxes, becomes equally much lower. Total private wealth is thus una¤ected
by government debt. So is therefore also private consumption when net taxes
are lump sum. A temporary tax cut will not make people feel wealthier and
induce them to consume more. Instead they will increase their saving by the
same amount as taxes have been reduced, thereby preparing for the higher taxes
in the future.
This is the Ricardian equivalence result, which we encountered also in Barro’s
discrete time dynasty model in Chapter 7:
In a representative agent model with full employment, rational
expectations, and no credit market imperfections, if taxes are lump
sum, then, for a given evolution of public expenditure, aggregate pri-
vate consumption is independent of whether current public expen-
diture is …nanced by taxes or by issuing bonds. The latter method
merely implies a deferment of tax payments. Given the government’s
intertemporal budget constraint, (11.19), a cut in current taxes has
to be o¤set by a rise in future taxes of the same present value. Since,
with lump-sum taxation, it is only the present value of the stream of
taxes that matters, the “timing”is irrelevant.
“each new machine produced and put into use is capable of changing
the environment in which production takes place, so that learning is
taking place with continually new stimuli”(Arrow, 1962).9
The learning is assumed to bene…t essentially all …rms in the economy. There
are knowledge spillovers across …rms and these spillovers are reasonably fast rel-
ative to the time horizon relevant for growth theory. In our macroeconomic ap-
proach both F and T are in fact assumed to be exactly the same for all …rms in the
economy. That is, in this speci…cation the …rms producing consumption-goods
bene…t from the learning just as much as the …rms producing capital-goods.
The parameter indicates the elasticity of the general technology level, T ;
with respect to cumulative aggregate net investment and is named the “learning
8
For arbitrary units of measurement for labor and output the hypothesis is Tt = BKt ;
B > 0: In (11.23) measurement units are chosen such that B = 1.
9
Concerning empirical evidence of learning-by-doing and learning-by-investing, see Liter-
ature Notes. The citation of Arrow indicates that it was experience from cumulative gross
investment he had in mind as the basis for learning. Yet, to simplify, we stick to the hypothesis
in (11.23), where it is cumulative net investment that matters.
subject to K_ it = Iit Kit : Here wt and It are the real wage and gross investment,
respectively, at time t, rs is the real interest rate at time s; and 0 is the capital
depreciation rate. Rising marginal capital installation costs and other kinds of
adjustment costs are assumed minor and can be ignored. It can be shown, cf.
Chapter 14, that in this case the …rm’s problem is equivalent to maximization of
current pure pro…ts in every short time interval. So, as hitherto, we can describe
the …rm as just solving a series of static pro…t maximization problems.
We suppress the time index when not needed for clarity. At any date …rm i
maximizes current pure pro…ts, i = F (Ki ; T Li ) (r + )Ki wLi : This leads
to the …rst-order conditions for an interior solution:
Behind (11.24) is the presumption that each …rm is small relative to the economy
as a whole, so that each …rm’s investment has a negligible e¤ect on the economy-
wide technology level Tt . Since F is homogeneous of degree one, by Euler’s
theorem,10 the …rst-order partial derivatives, F1 and F2 ; are homogeneous of
degree 0. Thus, we can write (11.24) as
F1 (ki ; T ) = r + ; (11.25)
10
See Math tools.
The resulting consumption-saving plan implies that per capita consumption fol-
lows the Keynes-Ramsey rule,
c_t 1
= (rt );
ct
and the transversality condition that the NPG condition is satis…ed with strict
equality. In general equilibrium of our closed economy with no role for natural
resources and no government debt, at will equal Kt =Lt :
rt = F1 (kt ; Tt ) : (11.27)
That is, whereas in the …rm’s …rst-order condition (11.25) causality goes from rt
to kit ; in (11.27) causality goes from kt to rt : Note also that in our closed economy
with no natural resources and no government debt, at will equal kt :
where we have used (11.22), (11.26), and (11.23) and the assumption that F is
homogeneous of degree one.
rt = f 0 (k~t ) ; (11.30)
Dynamics
From the de…nition k~ K=(T L) follows
k~ K_ T_ L_ K_ K_
= = n (by (11.23))
k~ K T L K K
Y C K y~ c~ k~ C c
= (1 ) n = (1 ) n; where c~ :
K k~ TL T
Multiplying through by k~ we have
k~ = (1 ~
)(f (k) c~) [(1 ~
) + n] k: (11.31)
1 ~ ~ ~ c~:
c~ = (f 0 (k) ) (f (k) c~ k) (11.33)
k~
The two coupled di¤erential equations, (11.31) and (11.33), determine the
evolution over time of the economy.
Phase diagram Fig. 11.5 depicts the phase diagram. The k~ = 0 locus comes
from (11.31), which gives
n
k~ = 0 for c~ = f (k)
~ ( + ~
)k; (11.34)
1
~ k~
c~ = 0 for c~ = f (k) k~ ~
(f 0 (k) )
~ k~
= f (k) k~ gc ~
c(k) (from (11.32)). (11.35)
Steady state In a steady state c~ and k~ are constant so that the growth rate of
_ + n; i.e.,
C as well as K equals A=A
C_ K_ T_ K_
= = +n= + n:
C K T K
Solving gives
C_ K_ n
= = :
C K 1
C_ n n
gc = n= n= gc ; and (11.36)
C 1 1
T_ K_ n
= = = gc : (11.37)
T K 1
~ respectively, will therefore satisfy, by (11.32),
The steady-state values of r and k;
n
r = f 0 (k~ ) = + gc = + : (11.38)
1
To ensure existence of a steady state we assume that the private marginal pro-
ductivity of capital is su¢ ciently sensitive to capital per unit of e¤ective labor,
from now called the “capital intensity”:
~ > + + n ~
lim f 0 (k) > lim f 0 (k): (A1)
~
k!0 1 ~
k!1
The
R t transversality condition of the representative household is that limt!1
at e 0 (rs n)ds = 0; where a is per capita …nancial wealth. In general equilibrium
t
n
n > (1 ) ; (A2)
1
which we assume satis…ed.
As to the slope of the c~ = 0 locus we have from (11.35),
~
1 ~ f 00 (k) 1
0 ~ = f 0 (k)
c (k) ~ (k ~
+ gc ) > f 0 (k) gc ; (11.40)
since f 00 < 0: At least in a small neighborhood of the steady state we can sign
the right-hand side of this expression. Indeed,
1 1 n n n
f 0 (k~ ) gc = + g c gc = + = n (1 ) > 0;
1 1 1
(11.41)
~
by (11.36) and (A2). So, combining with (11.40), we conclude that c (k ) > 0: 0
~
By continuity, in a small neighborhood of the steady state, c0 (k) c0 (k~ ) > 0:
Therefore, close to the steady state, the c~ = 0 locus is positively sloped, as
indicated in Fig. 11.5.
Still, we have to check the following question: In a neighborhood of the steady
state, which is steeper, the c~ = 0 locus or the k~ = 0 locus? The slope of the latter
~
is f 0 (k) n=(1 ); from (11.34): At the steady state this slope is
1
f 0 (k~ ) gc 2 (0; c0 (k~ ));
in view of (11.41) and (11.40). The c~ = 0 locus is thus steeper. So, the c~ = 0
locus crosses the k~ = 0 locus from below and can only cross once.
The assumption (A1) ensures existence of a k~ > 0 satisfying (11.38). As
Fig. 11.5 is drawn, a little more is implicitly assumed namely that there exists a
k^ > 0 such that the private net marginal productivity of capital equals the the
steady-state growth rate of output, i.e.,
^ Y_ T_ L_ n n
f 0 (k) =( ) =( ) + = +n= ; (11.42)
Y T L 1 1
@Y 1 ~ + F2 ( )K L( K
= F1 ( ) + F2 ( ) K L = f 0 (k) 1
) (by (11.29))
@K
~ + (F ( ) F1 ( )K) K 1
= f 0 (k) (by Euler’s theorem)
~ + (f (k)K
= f 0 (k) ~ ~
L f 0 (k)K) K 1
(by (11.29) and (11.23))
~ ~ 0 ~
~ + (f (k)K
= f 0 (k) ~ 1
L ~
f 0 (k)) ~ + f (k) kf (k) > f 0 (k):
= f 0 (k) ~
k~
This, together with (A1) and f < 0, implies existence of a unique k~GR , and in
00
view of our additional assumption (A2), we have 0 < k~ < k^ < k~GR ; as displayed
in Fig. 11.5.
Stability The arrows in Fig. 11.5 indicate the direction of movement as de-
termined by (11.31) and (11.33). We see that the steady state is a saddle point.
The dynamic system has one pre-determined variable, k; ~ and one jump variable,
c~: The saddle path is not parallel to the jump variable axis. We claim that for
a given k~0 > 0; (i) the initial value of c~0 will be the ordinate to the point where
the vertical line k~ = k~0 crosses the saddle path; (ii) over time the economy will
move along the saddle path towards the steady state. Indeed, this time path is
consistent with all conditions of general equilibrium, including the transversality
condition (TVC). And the path is the only technically feasible path with this
property. Indeed, all the divergent paths in Fig. 11.5 can be ruled out as equi-
librium paths because they can be shown to violate the transversality condition
of the household.
In the long run c and y Y =L y~T = f (k~ )T grow at the rate n=(1 );
which is positive if and only if n > 0: This is an example of endogenous growth in
the sense that the positive long-run per capita growth rate is generated through an
internal mechanism (learning) in the model (in contrast to exogenous technology
growth as in the Ramsey model with exogenous technical progress).
counter-balancing role re‡ects the complementarity between K and L), but also
upholds sustained productivity growth.
Note that in the semi-endogenous growth case @gy =@ = n=(1 )2 > 0 for
n > 0: That is, a higher value of the learning parameter implies higher per capita
growth in the long run, when n > 0. Note also that @gy =@ = 0 = @gy =@ ; that
is, in the semi-endogenous growth case preference parameters do not matter for
long-run growth. As indicated by (11.36), the long-run growth rate is tied down
by the learning parameter, ; and the rate of population growth, n: But, like in
the simple Ramsey model, it can be shown that preference parameters matter for
the level of the growth path. This suggests that taxes and subsidies do not have
long-run growth e¤ects, but “only”level e¤ects (see Exercise 11.??).
where we have divided the two arguments of F1 (k; K) by k K=L and again
used Euler’s theorem. Note that the interest rate is constant “from the beginning”
and independent of the historically given initial value of K; K0 . The aggregate
production function is now
and is thus linear in the aggregate capital stock. In this way the general neo-
classical presumption of diminishing returns to capital has been suspended and
replaced by exactly constant returns to capital. So the Romer model belongs to a
class of models known as AK models, that is, models where in general equilibrium
the interest rate and the output-capital ratio are necessarily constant over time
whatever the initial conditions.
The method for analyzing an AK model is di¤erent from the one used for a
diminishing returns model as above.
Dynamics
The Keynes-Ramsey rule now takes the form
c_ 1 1
= (r ) = (F1 (1; L) ) ; (11.44)
c
which is also constant “from the beginning”. To ensure positive growth, we
assume
F1 (1; L) > : (A1’)
And to ensure bounded intertemporal utility (and existence of equilibrium), it is
assumed that
> (1 ) and therefore < + = r: (A2’)
Solving the linear di¤erential equation (11.44) gives
ct = c0 e t ; (11.45)
First, note that the dynamic resource constraint for the economy is
K_ = Y cL K = F (1; L)K cL K;
k_ = [F (1; L) ]k c0 e t : (11.46)
Figure 11.6: Illustration of the fact that for L given, F (1; L) > F1 (1; L).
in the Arrow case and growth peters out, since n = 0: With slightly above 1, it
can be shown that growth becomes explosive (in…nite output in …nite time).12
The Romer case, = 1; is thus a knife-edge case in a double sense. First,
it imposes a particular value for a parameter which apriori can take any value
within an interval. Second, the imposed value leads to theoretically non-robust
results; values in a hair’s breadth distance result in qualitatively di¤erent behavior
of the dynamic system. Still, whether the Romer case - or, more generally, a
fully-endogenous growth case - can be used as an empirical approximation to
its semi-endogenous “counterpart” for a su¢ ciently long time horizon to be of
interest, is a debated question within growth analysis.
It is noteworthy that the causal structure in the long run in the diminishing
returns case is di¤erent than in the AK-case of Romer. In the diminishing returns
case the steady-state growth rate is determined …rst, as gc in (11.36), and then r
is determined through the Keynes-Ramsey rule; …nally, Y =K is determined by the
technology, given r : In contrast, the Romer case has Y =K and r directly given
as F (1; L) and r; respectively. In turn, r determines the (constant) equilibrium
growth rate through the Keynes-Ramsey rule.
In the AK case, that is, the fully endogenous growth case, we have @ =@ < 0 and
@ =@ < 0: Thus, preference parameters matter for the long-run growth rate and
not “only”for the level of the growth path. This suggests that taxes and subsidies
can have long-run growth e¤ects. In any case, in this model there is a motivation
for government intervention due to the positive externality of private investment.
This motivation is present whether < 1 or = 1: Here we concentrate on the
latter case, which is the simpler one. We …rst …nd the social planner’s solution.
The social planner The social planner faces the aggregate production function
Yt = F (1; L)Kt or, in per capita terms, yt = F (1; L)kt : The social planner’s
problem is to choose (ct )1
=0 to maximize
Z 1
c1t t
U0 = e dt s.t.
0 1
ct 0;
_kt = F (1; L)kt ct kt ; k0 > 0 given, (11.50)
kt 0 for all t > 0: (11.51)
12
See Solow (1997).
In view of (11.53), in an interior optimal solution the time path of the adjoint
variable is
[(F (1;L) ]t
t = 0e ;
that is, k grows at the same constant rate as c “from the beginning”: Since y
Y =L = F (1; L)k; the same is true for y: Hence, our candidate for the so-
cial planner’s solution is from start in balanced growth (there is no transitional
dynamics).
The next step is to check whether our candidate solution satis…es a set of
su¢ cient conditions for an optimal solution. Here we can use Mangasarian’s
theorem. Applied to a continuous-time optimization problem like this, with one
control variable and one state variable, the theorem says that the following con-
ditions are su¢ cient:
(a) Concavity: For all t 0 the Hamiltonian is jointly concave in the control
and state variables, here c and k.
(b) Non-negativity: There is for all t 0 a non-negativity constraint on the
state variable; in addition, the co-state variable, ; is non-negative for all
t 0 along the optimal path.
(c) TVC: The candidate solution satis…es the transversality condition
limt!1 kt t e t = 0; where t e t is the discounted co-state variable.
In the present case we see that the Hamiltonian is a sum of concave func-
tions and therefore is itself concave in (k; c): Further, from (11.51) we see that
condition (b) is satis…ed. Finally, our candidate solution is constructed so as to
satisfy condition (c). The conclusion is that our candidate solution is an optimal
solution. We call it an SP allocation.
(1 s)(r + )
per unit of capital per time unit; (ii) …nancing this subsidy by a constant con-
sumption tax rate :
Let us …rst …nd the size of s needed to establish the SP allocation. Firm i
now chooses Ki such that
@Yi
jK …xed = F1 (Ki ; KLi ) = (1 s)(r + ):
@Ki
F1 (k; K) = (1 s)(r + );
where k K=L; which is pre-determined from the supply side. Thus, the equi-
librium interest rate must satisfy
F1 (k; K) F1 (1; L)
r= = ; (11.59)
1 s 1 s
again using Euler’s theorem.
It follows that s should be chosen such that the “right” r arises. What is
the “right” r? It is that net rate of return which is implied by the production
technology at the aggregate level, namely @Y =@K = F (1; L) : If we can
obtain r = F (1; L) ; then there is no wedge between the intertemporal rate of
transformation faced by the consumer and that implied by the technology. The
required s thus satis…es
F1 (1; L)
r= = F (1; L) ;
1 s
so that
F1 (1; L) F (1; L) F1 (1; L) F2 (1; L)L
s=1 = = :
F (1; L) F (1; L) F (1; L)
It remains to …nd the required consumption tax rate : The tax revenue will
be cL; and the required tax revenue is
where we have used that the proportionality in (11.58) between c and k holds for
all t 0: Substituting (11.55) into (11.60), the solution for can be written
The required tax rate on consumption is thus a constant. It therefore does not
distort the consumption/saving decision on the margin, cf. Appendix B.
It follows that the allocation obtained by this subsidy-tax policy is the SP
allocation. A policy, here the policy (s; ); which in a decentralized system in-
duces the SP allocation, is called a …rst-best policy. In a situation where for some
reason it is impossible to obtain an SP allocation in a decentralized way (because
of adverse selection and moral hazard problems, say), a government’s optimiza-
tion problem would involve additional constraints to those given by technology
and initial resources. A decentralized implementation of the solution to such a
problem is called a second-best policy.
Another way in which (11.23) deviates from Arrow’s original ideas is by assum-
ing that technical progress is disembodied rather than embodied, a distinction we
touched upon in Chapter 2. Moreover, we have assumed a neoclassical technology
whereas Arrow assumed …xed technical coe¢ cients.
11.5 Appendix
A. The golden-rule capital intensity in Arrow’s growth model
In our discussion of Arrow’s learning-by-investing model in Section 11.2.2 (where
0 < < 1); we claimed that the golden-rule capital intensity, k~GR ; will be that ef-
fective capital-labor ratio at which the social net marginal productivity of capital
equals the steady-state growth rate of output. In this respect the Arrow model
with endogenous technical progress is similar to the standard neoclassical growth
model with exogenous technical progress. This claim corresponds to a very gen-
eral theorem, valid also for models with many capital goods and non-existence of
an aggregate production function. This theorem says that the highest sustainable
path for consumption per unit of labor in the economy will be that path which
results from those techniques which pro…t maximizing …rms choose under perfect
competition when the real interest rate equals the steady-state growth rate of
GNP (see Gale and Rockwell, 1975).
To prove our claim, note that in steady state, (11.35) holds whereby consump-
tion per unit of labor (here the same as per capita consumption as L = labor
force = population) can be written
~ n
ct c~t Tt = f (k) ( + )k~ Kt
1
~ n n n
= f (k) ( + )k~ K0 e 1 t
(by gK = )
1 1
~ n 1 n Kt K1
= f (k) ( + )k~ ~ 0) 1 e 1
(kL t
(from k~ = = 0 )
1 Kt Lt L0
~ n n n
= f (k) ( + )k~ k~ 1 L0 1 e 1 t ~ 0 1 e1
'(k)L t
;
1
We …nd
~ = ~ n n
'0 (k) f 0 (k) ( + ) k~ 1 ~
+ f (k) ( + )k~ k~ 1 1
1 1 1
" ! #
n ~
f (k) n
~
= f 0 (k) ( + )+ ( + ) k~ 1
1 k~ 1 1
" #
~
f (k) n k~ 1
= (1 ~
)f 0 (k) (1 ) n+ ( + )
k~ 1 1
" #
~
f (k) n k~ 1 ~1
= (1 ~
)f 0 (k) + ~ k
(k) ; (11.61)
k~ 1 1 1
de…ning (k) ~ in the obvious way. The …rst-order condition for the problem,
~ = 0; is equivalent to (k)
'0 (k) ~ = 0: After ordering this gives
~ ~ 0 (k)
~
~ + f (k)
f 0 (k)
kf
=
n
: (11.62)
k~ 1
We see that
~ R 0 for
'0 (k) ~ R 0;
(k)
respectively. Moreover,
~
f (k) ~ 0 (k)
kf ~
0 ~ = (1
(k) ~
)f 00 (k) < 0;
k~2
~ k~ > f 0 (k):
in view of f 00 < 0 and f (k)= ~ So a k~ > 0 satisfying (k) ~ = 0 is the
unique maximizer of '(k):~ By (A1) and (A3) in Section 11.2.2 such a k~ exists
and is thereby the same as the k~GR we were looking for.
The left-hand side of (11.62) equals the social marginal productivity of capital
and the right-hand side equals the steady-state growth rate of output. At k~ = k~GR
it therefore holds that !
@Y Y_
= :
@K Y
Remark about the absence of a golden rule in the Romer case. In the Romer case
the golden rule is not a well-de…ned concept for the following reason. Along any
balanced growth path we have from (11.50),
k_ t ct c0
gk = F (1; L) = F (1; L) ;
kt kt k0
where xt is the per capita lump-sum transfer, exogenous to the household, and
Lt is the size of the representative household.
Assuming CRRA utility with parameter > 0, the instantaneous per capita
utility can be written
c1t (1 + t)
1 1
ct
u(ct ) = = :
1 1
In our standard notation the household’s intertemporal optimization problem is
then to choose (ct )1
t=0 so as to maximize
Z 1 1 1
(1 + t) ct ( n)t
U0 = e dt s.t.
0 1
ct 0;
a_ t = (rt n)at + wt + xt ct ; a0 given,
R1
(rs n)ds
lim at e 0 0:
t!1
From now, we let the timing of the variables be implicit unless needed for
clarity. The current-value Hamiltonian is
1 1
(1 + ) c
H= + [(r n)a + w + x c] ;
1
where is the co-state variable associated with …nancial per capita wealth, a: An
interior optimal solution will satisfy the …rst-order conditions
@H 1 1
= (1 + ) c = 0; so that (1 + ) c = ; (FOC1)
@c
@H _ +(
= (r n) = n) ; (FOC2)
@a
and a transversality condition which amounts to
R1
(rs n)ds
lim at e 0 = 0: (TVC)
t!1
_ c _
( 1) = = r:
1+ c
11.6 Exercises
The models considered so far (the OLG models as well as the representative agent
models) have ignored capital adjustment costs. In the closed-economy version
of the models aggregate investment is merely a re‡ection of aggregate saving
and appears in a “passive” way as just the residual of national income after
households have chosen their consumption. We can describe what is going on by
telling a story in which …rms just rent capital goods owned by the households
and households save by purchasing additional capital goods. In these models
only households solve intertemporal decision problems. Firms merely demand
labor and capital services with a view to maximizing current pro…ts. This may
be a legitimate abstraction in some contexts within long-run analysis. In short-
and medium-run analysis, however, the dynamics of …xed capital investment is
important. So a more realistic approach is desirable.
In the real world the capital goods used by a production …rm are usually
owned by the …rm itself rather than rented for single periods on rental markets.
This is because inside the speci…c plant in which these capital goods are an
integrated part, they are generally worth much more than outside. So in practice
…rms acquire and install …xed capital equipment to maximize discounted expected
earnings in the future.
Tobin’s q-theory of investment (after the American Nobel laureate James To-
bin, 1918-2002) is an attempt to model these features. In this theory,
(a) …rms make the investment decisions and install the purchased capital goods
in their own businesses;
(b) there are certain adjustment costs associated with this investment: in ad-
dition to the direct cost of buying new capital goods there are costs of
573
CHAPTER 14. FIXED CAPITAL INVESTMENT AND
574 TOBIN’S Q
where Y~ ; K; and L are “potential output” (to be explained), capital input, and
labor input per time unit, respectively, while F is a concave neoclassical produc-
tion function. So we allow decreasing as well as constant returns to scale (or a
combination of locally CRS and locally DRS), whereas increasing returns to scale
is ruled out. Until further notice technological change is ignored for simplicity.
Time is continuous. The dating of the variables will not be explicit unless needed
for clarity. The increase per time unit in the …rm’s capital stock is given by
K_ = I K; > 0; (14.1)
where I is gross …xed capital investment per time unit and is the rate of wearing
down of capital (physical capital depreciation). To …x ideas, we presume the
realistic case with positive capital depreciation, but most of the results go through
even for = 0:
Let J denote the …rm’s capital installation costs (measured in units of output)
per time unit. The installation costs imply that a part of the potential output, Y~ ;
is “used up”in transforming investment goods into installed capital; only Y~ J
is “true output”available for sale.
Assuming the price of investment goods is one (the same as that of output
goods), then total investment costs per time unit are I +J; i.e., the direct purchase
costs, 1 I; plus the indirect cost associated with installation etc., J: The q-theory
of investment assumes that the capital installation cost, J; is a strictly convex
function of gross investment and is either independent of or a decreasing function
of the current capital stock. Thus,
J = G(I; K);
for all K and all (I; K); respectively. For …xed K = K the graph is as shown
in Fig. 14.1. Also negative gross investment, i.e., sell o¤ of capital equipment,
involves costs (for dismantling, reorganization etc.). Therefore GI < 0 for I < 0.
The important assumption is that GII > 0 (strict convexity in I); implying that
the marginal installation cost is increasing in the level of gross investment. If the
…rm wants to accomplish a given installation project in only half the time, then
the installation costs are more than doubled (the risk of mistakes is larger, the
problems with reorganizing work routines are larger etc.).
The strictly convex graph in Fig. 14.1 illustrates the essence of the matter.
Assume the current capital stock in the …rm is K and that the …rm wants to
increase it by a given amount K: If the …rm chooses the investment level I >
0 per time unit in the time interval [t; t + t), then, in view of (14.1), K
(I K) t: So it takes t K=(I K) units of time to accomplish
the desired increase K. If, however, the …rm slows down the adjustment and
invests only half of I per time unit, then it takes approximately twice as long
time to accomplish K. Total costs of the two alternative courses of action are
approximately G(I; K) t and G( 12 I; K)2 t; respectively (ignoring discounting
and assuming the initial increase in capital is small in relation to K). By drawing
a few straight line segments in Fig. 14.1 the reader will be convinced that the
last-mentioned cost is smaller than the …rst-mentioned due to strict convexity of
installation costs (see Exercise 14.1). Haste is waste.
On the other hand, there are of course limits to how slow the adjustment
to the desired capital stock should be. Slower adjustment means postponement
of the potential bene…ts of a higher capital stock. So the …rm faces a trade-o¤
between fast adjustment to the desired capital stock and low adjustment costs.
where wt is the wage per unit of labor at time t. As mentioned, the installation
cost G(It ; Kt ) implies that a part of production, F (Kt ; Lt ); is used up in trans-
forming investment goods into installed capital; only the di¤erence F (Kt ; Lt )
G(It ; Kt ) is available for sale.
We ignore uncertainty and assume the …rm is a price taker. The interest rate
is rt , which we assume to be positive, at least in the long run. The decision
problem, as seen from time 0, is to choose a plan (Lt ; It )1 t=0 so as to maximize the
…rm’s market value, i.e., the present value of the future stream of expected cash
‡ows: Z 1 Rt
max1 V0 = Rt e 0 rs ds dt s.t. (14.3) and (14.4)
(Lt ;It )t=0 0
of the corresponding state variable, K; we mean the right-hand time derivative, i.e., K_ t0 =
limt!t+ (Kt Kt0 )=(t t0 ): Mathematically, these conventions are inconsequential, but they
0
help the intuition.
costs are present, current decisions depend on the expected future circumstances.
The …rm makes a plan for the whole future so as to maximize the value of the …rm,
which is what matters for the owners. This is the general neoclassical hypothesis
about …rms’behavior. As shown in Appendix A, when strictly convex installation
costs or similar dependencies across time are absent, then value maximization is
equivalent to solving a sequence of static pro…t maximization problems, and we
are back in the previous chapters’description.
To solve the problem (14.4) (14.7), where Rt is given by (14.3), we apply
the Maximum Principle. The problem has two control variables, L and I, and
one state variable, K. We set up the current-value Hamiltonian:
FL (K; L) = w; (14.9)
1 + GI (I; K) = q: (14.10)
Next, we partially di¤erentiate H w.r.t. the state variable and set the result equal
to rq q,_ where r is the discount rate in (14.4):
@H
= FK (K; L) GK (I; K) q = rq q:
_ (14.11)
@K
Then, the Maximum Principle says that for an interior optimal path (Kt ; Lt ; It )
there exists an adjoint variable q; which is a continuous function of t; written qt ;
such that for all t 0 the conditions (14.9), (14.10), and (14.11) hold and the
transversality condition Rt
lim Kt qt e 0 rs ds = 0 (14.12)
t!1
is satis…ed.
The optimality condition (14.9) is the usual employment condition equalizing
the marginal product of labor to the real wage. In the present context with
strictly convex capital installation costs, this condition attains a distinct role as
labor will in the short run be the only variable input. This is because the strictly
convex capital installation costs imply that the …rm’s installed capital in the
short run is a quasi-…xed production factor. So, e¤ectively there are diminishing
returns (equivalent with rising marginal costs) in the short run even though the
production function might have CRS.
The left-hand side of (14.10) gives the cost of acquiring one extra unit of
installed capital at time t (the sum of the cost of buying the marginal investment
good and the cost of its installation). That is, the left-hand side is the marginal
cost, MC, of increasing the capital stock in the …rm. Since (14.10) is a necessary
condition for optimality, the right-hand side of (14.10) must be the marginal
bene…t, MB, of increasing the capital stock. Hence, qt represents the value to
the optimizing …rm of having one more unit of (installed) capital at time t: To
put it di¤erently: the adjoint variable qt can be interpreted as the shadow price
(measured in current output units) of capital along the optimal path.2
As to the interpretation of the di¤erential equation (14.11), a condition for
optimality must be that the …rm acquires capital up to the point where the
“marginal productivity of capital”, FK GK ; equals “capital costs”, rt qt + ( qt
q_t ); the …rst term in this expression represents interest costs and the second
economic depreciation. In (14.11) the “marginal productivity of capital”appears
as FK GK ; because we should take into account the potential reduction, GK ; of
installation costs in the next instant brought about by the marginal unit of already
installed capital. The shadow price qt appears as the “overall”price at which the
…rm can buy and sell the marginal unit of installed capital. In fact, in view of qt =
1 + GI (Kt ; Lt ) along the optimal path (from (14.10)), qt measures, approximately,
both the “overall” cost increase associated with increasing investment by one
unit and the “overall” cost saving associated with decreasing investment by one
unit. In the …rst case the …rm not only has to pay one extra unit of account
in the investment goods market but must also bear an installation cost equal to
GI (Kt ; Lt ); thereby in total investing qt units of account: And in the second case
the …rm recovers qt by saving both on installation costs and purchases in the
investment goods market. Continuing along this line of thought, by reordering in
(14.11) we get the “no-arbitrage”condition
FK GK q + q_
= r; (14.13)
q
saying that along the optimal path the rate of return on the marginal unit of
installed capital must equal the interest rate.
The transversality condition (14.12) says that the present value of the capital
stock “left over” at in…nity must be zero. That is, the capital stock should not
in the long run grow too fast, given the evolution of its discounted shadow price.
In addition to necessity of (14.12) it can be shown3 that the discounted shadow
2
Recall that a shadow price, measured in some unit of account, of a good, from the point of
view of the buyer, is the maximum number of units of account that he or she is willing to o¤er
for one extra unit of the good.
3
See Appendix B.
price itself in the far future must along an optimal path be asymptotically nil,
i.e.,
Rt
rs ds
lim qt e 0 = 0: (14.14)
t!1
If along the optimal path, Kt grows without bound, then not only must (14.14)
hold but, in view of (14.12), the discounted shadow price must in the long run
approach zero faster than Kt grows. Intuitively, otherwise the …rm would be
“over-accumulating”. The …rm would gain by reducing the capital stock “left
over” for eternity (which is like“money left on the table”), since reducing the
ultimate investment and installation costs would raise the present value of the
…rm’s expected cash ‡ow.
In connection with (14.10) we claimed that qt can be interpreted as the shadow
price (measured in current output units) of capital along the optimal path. A
con…rmation of this interpretation isR obtained by solving the di¤erential equation
t
(14.11). Indeed, multiplying by e 0 (rs + )ds on both sides of (14.11), we get by
integration and application of (14.14),4
Z 1 R
(rs + )ds
qt = [FK (K ; L ) GK (I ; K )] e t d : (14.15)
t
The right-hand side of (14.15) is the present value, as seen from time t; of expected
future increases of the …rm’s cash-‡ow that would result if one extra unit of
capital were installed at time t; indeed, FK (K ; L ) is the direct contribution
to output of one extra unit of capital, while GK (I ; K ) 0 represents the
potential reduction of installation costs in the next instant brought about by the
marginal unit of installed capital. However, future increases of cash-‡ow should
be discounted at a rate equal to the interest rate plus the capital depreciation
rate; from one extra unit of capital at time t there are only e ( t) units left at
time :
To concretize our interpretation of qt as representing the value to the opti-
mizing …rm at time t of having one extra unit of installed capital, let us make
a thought experiment. Assume that a extra units of installed capital at time t
drops down from the sky. At time > t there are a e ( t) units of these still
in operation so that the stock of installed capital is
K0 = K + a e ( t)
; (14.16)
where K denotes the stock of installed capital as it would have been without
this “injection”. Now, in (14.3) replace t by and consider the optimizing …rm’s
4
For details, see Appendix A.
@R ( t)
= [FK (K ; L ) GK (I ; K )] e : (14.17)
@a ja=0
when the …rm moves along the optimal path. The second equality sign comes
from (14.17) and the third is implied by (14.15). So the value of the adjoint
variable, q, at time t equals the contribution to the …rm’s maximized value of a
…ctional marginal “injection” of installed capital at time t: This is just another
way of saying that qt represents the bene…t to the …rm of the marginal unit of
installed capital along the optimal path.
This story facilitates the understanding that the control variables at any point
in time should be chosen so that the Hamiltonian function is maximized. Thereby
one maximizes the properly weighted sum of the current direct contribution to the
criterion function and the indirect contribution, which is the bene…t (as measured
approximately by qt Kt ) of having a higher capital stock in the future.
As we know, the Maximum Principle gives only necessary conditions for an
optimal path, not su¢ cient conditions. We use the principle as a tool for …nding
candidates for a solution. Having found in this way a candidate, one way to pro-
ceed is to check whether Mangasarian’s su¢ cient conditions are satis…ed. Given
the transversality condition (14.12) and the non-negativity of the state variable,
K; the only additional condition to check is whether the Hamiltonian function
is jointly concave in the endogenous variables (here K, L; and I): If it is jointly
concave in these variables, then the candidate is an optimal solution. Owing
to concavity of F (K; L), inspection of (14.8) reveals that the Hamiltonian func-
tion is jointly concave in (K; L; I) if G(I, K) is jointly concave in (I; K): This
condition is equivalent to G(I; K) being jointly convex in (I; K); an assumption
allowed within the con…nes of (14.2); for example, G(I; K) = ( 12 ) I 2 =K as well as
the simpler G(I; K) = ( 12 ) I 2 (where in both cases > 0) will do. Thus, assum-
ing joint convexity of G(I; K); the …rst-order conditions and the transversality
condition are not only necessary, but also su¢ cient for an optimal solution.
GI (It ; Kt ) = qt 1: (14.19)
Combining this with the assumption (14.2) on the installation cost function, we
see that
It T 0 for qt T 1; respectively, (14.20)
cf. Fig. 14.2.5 In view of GII 6= 0; (14.19) implicitly de…nes optimal investment,
It , as a function of the shadow price, qt ; and the state variable, Kt :
It = M(qt ; Kt ); (14.21)
Figure 14.2: Marginal installation costs as a function of the gross investment level, I,
for a given amount, K, of installed capital. The optimal gross investment, It , when
q = qt is indicated.
Recall that GK (It ; Kt ) indicates how much lower the installation costs are as
a result of the marginal unit of installed capital. In the special case (14.22) we
have from Lemma 1
I I I
GK (I; K) = g( ) g0( ) = g(m(q)) (q 1)m(q);
K K K
14.3 Applications
Capital installation costs in a closed economy
Allowing for convex capital installation costs in the economy has far-reaching
implications for the causal structure of a model of a closed economy. Investment
decisions attain an active role in the economy and forward-looking expectations
become important for these decisions. Expected future market conditions and an-
nounced future changes in corporate taxes and depreciation allowance will a¤ect
…rms’investment already today.
The essence of the matter is that current and expected future interest rates
have to adjust for aggregate saving to equal aggregate investment, that is, for the
output and asset markets to clear. Given full employment (Lt = Lt ); the output
market clears when
for the determination of It : This is the …rst time in this book where clearing in the
output market is assigned an active role. In the earlier models investment was just
a passive re‡ection of household saving. Desired investment was automatically
equal to the residual of national income left over after consumption decisions had
taken place. Nothing had to adjust to clear the output market, neither interest
rates nor output. In contrast, in the present framework adjustments in interest
rates and/or the output level are needed for the continuous clearing in the output
market and these adjustments are decisive for the macroeconomic dynamics.
In actual economies there may of course exist “secondary markets” for used
capital goods and markets for renting capital goods owned by others. In view of
installation costs and similar, however, shifting capital goods from one plant to
another is generally costly. Therefore the turnover in that kind of markets tends
to be limited and there is little underpinning for the earlier models’supposition
that the current interest rate should be tied down by a requirement that such
markets clear.
In for instance Abel and Blanchard (1983) a Ramsey-style model integrating
the q-theory of investment is presented. The authors study the two-dimensional
general equilibrium dynamics resulting from the adjustment of current and ex-
pected future (short-term) interest rates needed for the output market to clear.
Adjustments of the whole structure of interest rates (the yield curve) take place
and constitute the equilibrating mechanism in the output and asset markets.
By having output market equilibrium playing this role in the model, a …rst
step is taken towards medium- and short-run macroeconomic theory. We take
further steps in later chapters, by allowing imperfect competition and nominal
price rigidities to enter the picture. Then the demand side gets an active role
both in the determination of q (and thereby investment) and in the determination
of aggregate output and employment. This is what Keynesian theory (old and
new) deals with.
In the remainder of this chapter we will still assume perfect competition in all
markets including the labor market. In this sense we will stay within the neoclas-
sical framework (supply-dominated models) where, by instantaneous adjustment
of the real wage, labor demand continuously matches labor supply. The next
two subsections present examples of how Tobin’s q-theory of investment can be
integrated into the neoclassical framework. To avoid the more complex dynamics
arising in a closed economy, we shift the focus to a small open economy. This
allows concentrating on a dynamic system with an exogenous interest rate.
instantaneously when the interest rate in the world …nancial market changes.
In the standard neoclassical growth model for a small open economy, without
convex capital installation costs, a rise in the interest rate leads immediately to
a complete adjustment of the capital stock so as to equalize the net marginal
productivity of capital to the new higher interest rate. Moreover, in that model
expected future changes in the interest rate or in corporate taxes and deprecia-
tion allowances do not trigger an investment response until these changes actually
happen. In contrast, when convex installation costs are present, expected future
changes tend to in‡uence …rms’investment already today.
We assume:
In this setting the SOE faces an exogenous interest rate, r; given from the
world …nancial market. We assume r is a positive constant. The aggregate pro-
duction function, F (K; L); is neoclassical and concave as in the previous sections.
With L > 0 denoting the constant labor supply, continuous clearing in the labor
market under perfect competition gives Lt = L for all t 0 and
At any time t; Kt is predetermined in the sense that due to the convex installation
costs, changes in K take time. Thus (14.29) determines the market real wage wt :
To pin down the evolution of the economy, we now derive two coupled di¤er-
ential equations in K and q. Inserting (14.24) into (14.6) gives
As r and L are exogenous, the capital stock, K; and its shadow price, q; are
the only endogenous variables in the di¤erential equations (14.30) and (14.31).
Figure 14.4: Phase diagram for investment dynamics in a small open economy (a case
where > 0).
Fig. 14.4 shows the phase diagram for these two coupled di¤erential equations.
Let q be de…ned as the value of q satisfying the equation m(q) = : Since m0 > 0,
q is unique. Suppressing for convenience the explicit time subscripts, we then
have
K_ = 0 for m(q) = ; i.e., for q = q :
As > 0; we have q > 1: This is so because also mere reinvestment to o¤set
capital depreciation requires an incentive, namely that the marginal value to
the …rm of replacing worn-out capital is larger than the purchase price of the
investment good (since the installation cost must also be compensated). From
(14.30) is seen that
0 = (r + )q FK (K; L) + g( ) (q 1) ; (14.34)
q_ 7 0 for points to the left and to the right, respectively, of the q_ = 0 locus,
since FKK (Kt ; L) < 0: The vertical arrows in Fig. 14.4 show these directions of
movement.
Altogether the phase diagram shows that the steady state E is a saddle point,
and since there is one predetermined variable, K; and one jump variable, q; and
the saddle path is not parallel to the jump variable axis, the steady state is
saddle-point stable. At time 0 the economy will be at the point B in Fig. 14.4
where the vertical line K = K0 crosses the saddle path. Then the economy
will move along the saddle path towards the steady state. This solution satis…es
the transversality condition (14.32) and is the unique solution to the model (for
details, see Appendix F).
Figure 14.5: Phase portrait of an unanticipated rise in r (the case > 0).
The basic assumptions are the same as in the previous section except that now
labor supply, Lt , grows at the constant rate n 0; while the technology level, T;
grows at the constant rate 0 (both rates exogenous and constant) and the
production function is neoclassical with CRS. We assume that the world market
real interest rate, r; is a constant and satis…es r > + n: Still assuming full
employment, we have Lt = Lt = L0 ent :
must hold.
The di¤erential equations (14.35) and (14.36) constitute our new dynamic
system. Fig. 14.6 shows the phase diagram, which is qualitatively similar to that
in Fig. 14.4. We have
0 = (r + )q ~ + g(m(q))
f 0 (k) (q 1)m(q):
0 = (r + )q ~ + g( +
f 0 (k) + n) (q 1)( + + n):
10
In our perfect foresight model we in fact have to assume r > +n for the …rm’s maximization
problem to be well-de…ned. If instead r + n; the market value of the representative …rm
would be in…nite, and maximization would loose its meaning.
further loosens the relationship between q a and investment and may help explain
the observed positive correlation between investment and corporate pro…ts.
We might also question that capital installation costs really have the hy-
pothesized strictly convex form. It is one thing that there are costs associated
with installation, reorganizing and retraining etc., when new capital equipment
is procured. But should we expect these costs to be strictly convex in the vol-
ume of investment? To think about this, let us for a moment ignore the role
of the existing capital stock. Hence, we write total installation costs J = G(I)
with G(0) = 0. It does not seem problematic to assume G0 (I) > 0 for I > 0.
The question concerns the assumption G00 (I) > 0. According to this assumption
the average installation cost G(I)=I must be increasing in I:11 But against this
speaks the fact that capital installation may involve indivisibilities, …xed costs,
acquisition of new information etc. All these features tend to imply decreasing
average costs. In any case, at least at the microeconomic level one should ex-
pect unevenness in the capital adjustment process rather than the above smooth
adjustment.
Because of the mixed empirical success of the convex installation cost hypoth-
esis other theoretical approaches that can account for sluggish and sometimes
non-smooth and lumpy capital adjustment have been considered: uncertainty,
investment irreversibility, indivisibility, or …nancial problems due to bankruptcy
costs (Nickell 1978, Zeira 1987, Dixit and Pindyck 1994, Caballero 1999, Adda and
Cooper 2003). These approaches notwithstanding, it turns out that the q-theory
of investment has recently been somewhat rehabilitated from both a theoretical
and an empirical point of view. At the theoretical level Wang and Wen (2010)
show that …nancial frictions in the form of collateralized borrowing at the …rm
level can give rise to strictly convex adjustment costs at the aggregate level yet
at the same time generate lumpiness in plant-level investment. For large …rms,
unlikely to be much a¤ected by …nancial frictions, Eberly et al. (2008) …nd that
the theory does a good job in explaining investment behavior.
In any case, the q-theory of investment is in di¤erent versions widely used
in short- and medium-run macroeconomics because of its simplicity and the ap-
pealing link it establishes between asset markets and …rms’investment. And the
q-theory has also had an important role in studies of the housing market and the
role of housing prices for household wealth and consumption, a theme to which
we return in the next chapter.
11
Indeed, for I 6= 0 we have d[G(I)=I]=dI = [IG0 (I) G(I)]=I 2 > 0; when G is strictly convex
(G00 > 0) and G(0) = 0:
14.6 Appendix
A. When value maximization is - and is not - equivalent with continuous
static pro…t maximization
For the idealized case where tax distortions, asymmetric information, and prob-
lems with enforceability of …nancial contracts are absent, the Modigliani-Miller
theorem (Modigliani and Miller, 1958) says that the …nancial structure of the …rm
is both indeterminate and irrelevant for production outcomes. Considering the
…rm described in Section 14.1, the implied separation of the …nancing decision
from the production and investment decision can be exposed in the following way.
B_ t = Xt (F (Kt ; Lt ) G(It ; Kt ) wt Lt It rt Bt ) :
dividends:
Z 1 Rt
max 1 V~0 = Xt e 0 rs ds
dt s.t.
(Lt ;It ;Xt )t=0 0
Lt 0; It free,
K_ t = It Kt ; K0 > 0 given, Kt 0 for all t;
B_ t = Xt (F (Kt ; Lt ) G(It ; Kt ) wt Lt It rt Bt ) ;
where B0 is given, (14.38)
Rt
rs ds
lim Bt e 0 0: (NPG)
t!1
Let this problem, where the …nancing aspects are ignored, be called Problem
II. When considering the relationship between Problem I and Problem II, the
following mathematical fact is useful.
LEMMA A1 Consider a continuous function a(t) and a di¤erentiable function
f (t). Then
Z t1 Rt R t1
a(s)ds
(f 0 (t) a(t)f (t))e t0 dt = f (t1 )e t0 a(s)ds f (t0 ):
t0
Hence,
Z t1 Rt
a(s)ds
(f 0 (t) a(t)f (t))e t0
dt
t0
R t1
a(s)ds
= f (t1 )e t0
f (t0 ):
CLAIM 1 If (Kt ; Bt ; Lt ; It ; Xt )1 1
t=0 is a solution to Problem I, then (Kt ; Lt ; It )t=0
is a solution to Problem II.
Proof. By (14.38) and the de…nition of Rt , Xt = Rt + B_ t rt Bt so that
Z 1 Rt
Z 1 Rt
V~0 = Xt e 0 rs ds
dt = V0 + (B_ t rt Bt )e 0 rs ds
dt: (14.39)
0 0
where the weak inequality is due to (NPG). Substituting this into (14.39), we
see that maximum
RT
of net worth V~0 is obtained by maximizing V0 and ensuring
r ds
limT !1 BT e 0 s
= 0; in which case net worth equals ((maximized V0 ) B0 );
where B0 is given. So a plan that maximizes net worth of the …rm must also
maximize V0 in Problem II.
Consequently it does not matter for the …rm’s production and investment
behavior whether the …rm’s investment is …nanced by issuing new debt or by
issuing shares of stock. Moreover, if we assume investors do not care about
whether they receive the …rm’s earnings in the form of dividends or valuation
gains on the shares, the …rm’s dividend policy is also irrelevant. Hence, from now
on we can concentrate on the investment-production problem, Problem II above.
The case with no capital installation costs Suppose the …rm has no capital
installation costs. Then the cash ‡ow reduces to Rt = F (Kt ; Lt ) wt Lt It :
CLAIM 2 When there are no capital installation costs, Problem II can be reduced
to a series of static pro…t maximization problems.
Proof. Current (pure) pro…t is de…ned as
Hence, Z Z
1 Rt 1 Rt
V0 = te 0 rs ds
dt + (rt Kt K_ t )e 0 rs ds
dt: (14.41)
0 0
The …rst integral on the right-hand side of this expression is independent of the
second. Indeed, the …rm can maximize the …rst integral by renting capital and
labor, Kt and Lt ; at the going factor prices, rt + and wt ; respectively, such that
t = (Kt ; Lt ) is maximized at each t. The factor costs are accounted for in the
de…nition of t :
The second integral on the right-hand side of (14.41) is the present value of
net revenue from renting capital out to others. In Lemma A1, let f (t) = Kt ;
a(t) = rt ; t0 = 0; t1 = T and consider T ! 1: Then
Z T Rt RT
lim (rt Kt K_ t )e 0 rs ds
dt = K0 lim KT e 0 rs ds
= K0 ; (14.42)
T !1 0 T !1
where the last equality comes from the fact that maximization of V0 requires
maximization of the left-hand side of (14.42)
RT
which in turn, since K0 is given,
rs ds
requires minimization of limT !1 KT e 0 . The latter expression is always
non-negative and can be made zero by choosing any time path for Kt such that
limT !1 KT = 0: (We may alternatively put it this way: it never pays Rthe …rm to
T
accumulate costly capital so fast in the long run that limT !1 KT e 0 rs ds > 0;
that is, to maintain accumulation of capital at a rate equal Rto or higher
Rt than the
1
interest rate:) Substituting (14.42) into (14.41), we get V0 = 0 t e 0 rs ds dt+K0 :
The conclusion is that, given K0 ,12 V0 is maximized if and only if Kt and Lt
are at each t chosen such that t = (Kt ; Lt ) is maximized.
The case with strictly convex capital installation costs Now we rein-
troduce the capital installation cost function G(It ; Kt ); satisfying in particular
the condition GII (I; K) > 0 for all (I; K): Then, as shown in the text, the …rm
adjusts to a change in its environment, say a downward shift in r; by a gradual
adjustment of K; in this case upward, rather than attempting an instantaneous
maximization of (Kt ; Lt ): The latter would entail an instantaneous upward jump
in Kt of size Kt = a > 0, requiring It t = a for t = 0: This would require
It = 1; which implies G(It ; Kt ) = 1; which may interpreted either as such a
jump being impossible or at least so costly that no …rm will pursue it.
12
Note that in the absence of capital installation costs, the historically given K0 is no more
“given” than the …rm may instantly let it jump to a lower or higher level. In the …rst case the
…rm would immediately sell a bunch of its machines and in the latter case it would immediately
buy a bunch of machines. Indeed, without convex capital installation costs nothing rules out
jumps in the capital stock. But such jumps just re‡ect an immediate jump, in the opposite
direction, in another asset item in the balance sheet and leave the maximized net worth of the
…rm unchanged.
where the …rst equality follows from the transversality condition (14.14), which
we repeat here:
Rt
lim qt e 0 rs ds = 0: (*)
t!1
RT
Indeed, since 0; limT !1 (e 0 rs ds e T ) = 0; when (*) holds. Initial time
is arbitrary, and so we may replace 0 and t in (14.44) by t and ; respectively.
The conclusion is that (14.15) holds along an interior optimal path, given the
transversality condition (*). A proof of necessity of the transversality condition
(*) is given in Appendix B.13
B. Transversality conditions
Rt
In view of (14.44), a quali…ed conjecture is that the condition limt!1 qt e 0 (rs + )ds
= 0 is necessary for optimality. This is indeed true, since this condition follows
from the stronger transversality condition (*) in Appendix A, the necessity of
which along an optimal path we will now prove.
Rt
rs ds
Rearranging (14.11) and multiplying through by the integrating factor e 0 ;
we have Rt Rt
(rt qt q_t )e 0 rs ds = (FKt GKt qt ) e 0 rs ds :
In Lemma A1, let f (t) = qt ; a(t) = rt ; t0 = 0; t1 = T . Then
Z T Rt RT
Z T Rt
rs ds rs ds rs ds
(rt qt q_t )e 0 dt = q0 qT e 0 = (FKt GKt qt ) e 0 dt:
0 0
RT
If, contrary to (*), limT !1 qT e 0 rs ds > 0 along the optimal path, then (14.45)
shows that the …rm is over-investing. By reducing initial investment by one unit,
the …rm would save approximately 1 + GI (I0 ; K0 ) = q0 ; by (14.10), which would
be more than the present value of the stream of potential net gains coming from
this marginal unit of installed capital (the …rst term on the right-hand side of
(14.45)). RT
Suppose instead that limT !1 qT e 0 rs ds < 0: Then, by a symmetric argu-
ment, the …rm has under-invested initially.
t
GI (It ; Kt ) = 1;
pIt
Now, g 00 (x) must be positive for the theory to work. But the assumptions (0) =
0; 0 > 0; and 00 0; imposed in p. 153 and again in p. 154 in Barro and
Sala-i-Martin (2004), are not su¢ cient for this (since x < 0 is possible). Since
in macroeconomics x < 0 is seldom, this is only a minor point, of course. Yet,
from a formal point of view the g( ) formulation may seem preferable to the ( )
formulation.
It is sometimes convenient to let the capital installation cost G(I, K) appear,
not as a reduction in output, but as a reduction in capital formation so that
This approach is used in Hayashi (1982) and Heijdra and Ploeg (2002, p. 573 ¤.).
For example, Heijdra and Ploeg write the rate of capital accumulation as K=K _
= '(I=K) ; where the “capital installation function”'(I=K) can be interpreted
as '(I=K) [I G(I; K)] =K = I=K g(I=K); the latter equality comes from
assuming G is homogeneous of degree 1. In one-sector models, as we usually
consider in this text, this changes nothing of importance. In more general models
this installation function approach may have some analytical advantages; what
gives the best …t empirically is an open question. In our housing market model
in the next chapter we apply a speci…cation analogue to (14.46), interpreting K_
as the number of new houses per time unit.
Finally, some analysts assume that installation costs are a strictly convex
function of net investment, I K; not gross investment, I: This agrees well with
intuition if mere replacement investment occurs in a smooth way not involving
R = F (K ; L ) FL L G(I ; K ) I
= A(K ; L ) + FK K + B(I ; K ) GI I GK K I ;
where we have used …rst FL = w and then the de…nitions of A and B above.
Consequently, when moving along the optimal path,
Z 1 R
Vt = V (Kt ; t) = (A(K ; L ) + B(I ; K )) e t rs ds d (14.50)
t
Z 1 R
+ [(FK GK )K (1 + GI )I ]e t rs ds d
Z 1t R
= (A(K ; L ) + B(I ; K ))e t rs ds d + qt Kt ;
t
(FKz GKz )Kz (1 + GIz )Iz = [(rz + )qz q_z ]Kz (1 + GIz )Iz
R
From K_ t = (xt )Kt follows K = Kt e t (xs )ds
. Substituting this into (14.52)
yields Z 1 R
V (Kt ; t) = Kt [f 0 (k ) g(x ) x ]e t (rs xs + )ds
d :
t
In view of (14.24), with t replaced by ; the optimal investment ratio x depends,
for all ; only on q ; not on K ; hence not on Kt : Therefore,
Z 1 R
@V =@Kt = [f 0 (k ) g(x ) x ]e t (rs xs + )ds d = Vt =Kt :
t
14.7 Exercises
14.1 (induced sluggish capital adjustment). Consider a …rm with capital instal-
lation costs J = G(I; K); satisfying
Uncertainty, expectations,
and asset price bubbles
This lecture note provides a framework for addressing themes where expectations
in uncertain situations are important elements. Our previous models have not taken
seriously the problem of uncertainty. Where agent’s expectations about future variables
were involved and these expectations were assumed to be model-consistent (“rational”),
we only considered a special case: perfect foresight. Shocks were treated in a peculiar
(almost self-contradictory) way: they might occur, but only as a complete surprise, a
once-for-all event. Agents’ expectations and actions never incorporated that new shocks
could arrive.
We will now allow recurrent shocks to take place. The environment in which the
economic agents act will be considered inherently uncertain. How can this be modeled
and how can we solve the resultant models? Since it is easier to model uncertainty
in discrete rather than continuous time, we examine uncertainty and expectations in a
discrete time framework.
Our emphasis will be on the hypothesis that when facing uncertainty a dominating
fraction of the economic agents form “rational expectations” in the sense of making prob-
abilistic forecasts which coincide with the forecast calculated on the basis of the “relevant
economic model”. But we begin with simple mechanistic expectation formation hypothe-
ses that have been used to describe day-to-day expectations of people who do not at all
think about the probabilistic properties of the economic environment.
One simple supposition is that expectations change gradually to correct past expectation
errors. Let denote the general price level in period and ≡ ( − −1 )−1
the corresponding inflation rate. Further, let −1 denote the “subjective expectation”,
formed in period − 1 of i.e., the inflation rate from period − 1 to period We may
1
think of the “subjective expectation” as the expected value in a vaguely defined subjective
conditional probability distribution.
The hypothesis of adaptive expectations (the AE hypothesis) says that the expectation
is revised in proportion to the past expectation error,
where the parameter is called the adjustment speed. If = 1 the formula reduces to
This limiting case is known as static expectations or myopic expectations; the subjective
expectation is that the inflation rate will remain the same or at least that it is not more
likely to go up than down.
This says that the expected value concerning this period (period ) is a weighted average
of the actual value for the last period and the expected value for the last period. By
backward substitution we find
The formula (4) can be generalized to the general backward-looking expectations for-
mula,
X
∞ X
∞
−1 = −1− where = 1 (5)
=1 =1
2
If the weights in (5) satisfy = (1 − )−1 = 1 2. . . we get the AE formula (4).
If the weights are
1 = 1 + 2 = − = 0 for = 3 4 . . . ,
we get
−1 = (1 + ) −1 − −2 = −1 + ( −1 − −2 ) (6)
This is called the hypothesis of extrapolative expectations and says:
There are cases where for instance myopic expectations are “rational” (in a sense to
be defined below). Exercise 1 provides an example. But in many cases purely backward-
looking formulas are too rigid, too mechanistic. They will often lead to systematic expec-
tation errors to one side or the other. It seems implausible that people should not then
respond to their experience and revise their expectations formula. And when expectations
are about things that really matter for people, they are likely to listen to professional fore-
casters who build their forecasting on statistical or econometric models. Such models are
based on a formal probabilistic framework, take the interaction between different variables
into account, and incorporate new information about future possible events.
2.1 Preliminaries
We first recapitulate a few concepts from statistics. A sequence { } of random variables
indexed by time is called a stochastic process. A stochastic process { } is called white
noise if for all has zero expected value, constant variance, and zero covariance across
time.1 A stochastic process { } is called a first-order autoregressive process, abbreviated
AR(1), if = 0 + 1 −1 + where 0 and 1 are constants, and { } is white noise;
if | 1 | 1 then { } is called a stationary RA(1) process. A stochastic process { } is
called a random walk if = −1 + where { } is white noise.
1
The expression white noise derives from electrotechnics. In electrotechnical systems signals will often
be subject to noise. If this noise is arbitrary and has no dominating frequence, it looks like white light.
The various colours correspond to a certain wave length, but white light is light which has all frequences
(no dominating frequence).
3
Before defining the term rational expectation, it is useful to clarify a distinction be-
tween two ways in which expectations, whatever their nature, may enter a macroeconomic
model.
where is some endogenous variable (not necessarily ) and are given constant
coefficients, and is an exogenous random variable which follows some specified stochas-
tic process. In line with the notation from Section 1, −1 is the subjective expectation
formed in period −1 of the value of the variable in period The economic agents are in
simple models assumed to have the same expectations. Or, at least there is a dominating
expectation, −1 in the society. What the equation (7) claims is that the endogenous
variable, , depends, in the specified linear way, on the “generally held” expectation of
, formed in the previous period. It is natural to think of the outcome as being the
aggregate result of agents’ decisions and market mechanisms, the decisions being made at
discrete points in time −2 −1 immediately after the uncertainty concerning
the period in question is resolved.
The second equation specifies how the subjective expectation is formed. To fix ideas,
let us assume myopic expectations,
−1 = −1 (8)
as in (2) above. A solution to the model is a stochastic process for such that (7) holds,
given the expectation formation (8) and the stochastic process which follows.
EXAMPLE 1 (imported raw materials and the domestic price level) Let the endogenous
variable in (7) represent the domestic price level (the consumer price index) and let
be the price level of imported raw materials. Suppose the price level is determined
through a markup on unit costs,
1
= ( + )(1 + ) 0 (*)
1+
where is the nominal wage level in period = 0 1 2 , and and are positive tech-
nical coefficients representing the assumed constant labor and raw meterials requirements,
4
respectively, per unit of output; is a constant markup. Assume further that workers in
period − 1 negotiate next period’s wage level, so as to achieve, in expected value, a
certain target real wage which we normalize to 1, i.e.,
= 1
−1
= −1 + 0 = (1 + ) 1 0 = (1 + ) (9)
= −1 + (̄ + ) = 0 1 2
Without shocks, and starting from an arbitrary −1 0 the time path of the price
level would be = (−1 − ∗ )+1 + ∗ where ∗ = ̄(1 − ) Shocks to the price of
imported raw materials result in transitory deviations from ∗ But as the shocks are only
temporary and || 1 the domestic price level gradually returns towards the constant
level ∗ The intervening changes in wage demands in response to the changes in the price
level changes prolong the time it takes to return to ∗ in the absence of new shocks. ¤
Equation (7) can also be interpreted as a vector equation (such that and −1 are
-vectors, is an × matrix, an × matrix, and an -vector). The crucial
feature is that the endogenous variables dated only depend on previous expectations of
date- values of these variables and on the exogenous variables.
Models with past expectations of current endogenous variables will serve as our point
of reference when introducing the concept of rational expectations below.
= +1 + = 0 1 2 (11)
Here +1 is the subjective expectation, formed in period of the value of in period
+ 1. Example: the equity price today depends on what the equity price is expected to be
5
tomorrow. Or more generally: the current expectation of a future value of an endogenous
variable influences the current value of this variable. We name this the case of forward-
looking expectations. (In “everyday language” also −1 in model type 1 can be said to
be a forward-looking variable as seen from period − 1. But the dividing line between
the two model types, (7) and (11), is whether current expectations of future values of the
endogenous variables do or do not influence the current values of these.)
The complete model with forward-looking expectations will include an additional equa-
tion, specifying how the subjective expectation, +1 is formed. We might again impose
myopic expectations, +1 = A solution to the model is a stochastic process for
satisfying (11), given the stochastic process followed by and given the specified ex-
pectation formation and perhaps some additional restrictions in the form of boundary
conditions or similar. The case of forward-looking expectations is important in connec-
tion with many topics in macroeconomics, including the evolution of asset prices, and
issues of asset price bubbles. This case will be dealt with in sections 3 and 4 below.
In passing we note that in both model type 1 and model type 2, it is the mean (in the
subjective probability distribution) of the random variable(s) that enters. This is typical
of simple macroeconomic models which often ignore other measures such as the median,
mode, or higher-order moments. The latter, say the variance of , may be included in
more advanced models where for instance behavior towards risk is important.
The concepts of a rational expectation and model-consistent expectation are closely related,
but not the same. We start with the latter.
Let there be given a stochastic model represented by (7) combined with some given
expectation formation (8), say. We put ourselves in the position of the investigator or
model builder and ask what the model-consistent expectation of the endogenous variable
is as seen from period − 1. It is the mathematical conditional expectation that can
be calculated on the basis of the model and available relevant data revealed up to and
including period − 1. Let us denote this expectation
where is the expectation operator and −1 denotes the information available at time
− 1. We think of period − 1 as the half-open time interval [ − 1 ) and imagine that
the uncertainty concerning the exogenous random variable −1 is resolved at time − 1
6
So −1 includes knowledge of −1 and thereby, via the model, also of −1 2
The information −1 may comprise knowledge of the realized values of and up
until and including period − 1 Instead of (12) we could, for instance, write
Here information (some of which may be redundant) goes back to a given initial period,
say period 0, in which case equals Alternatively, perhaps information goes back to
“ancient times”, possibly represented by = ∞ Anyway, as time proceeds, in general
more and more realizations of the exogenous and endogenous variables become known
and in this sense the information −1 expands with rising . The information −1 may
also be interpreted as “partial lack of uncertainty”, so that an “increasing amount of
information” and “reduced uncertainty” are seen as two sides of the same thing. The
“reduced uncertainty” lies in the fact that the space of possible time paths {( )}+
−
as of time shrinks as time proceeds ( denotes the time horizon as seen from time ).3
Indeed, this space shrinks precisely because more and more realizations of the variables
take place (more information appears) and thereby rule out an increasing subset of paths
that were earlier possible.
Inserting the investigator’s estimated values of the coefficients and the investigator’s
forecast of is obtained.
Unsatisfied with mechanistic formulas like those of Section 1, the American economist
John F. Muth (1961) introduced a radically different approach, the hypothesis of rational
expectations. Muth stated the hypothesis the following way:
I should like to suggest that expectations, since they are informed predictions
of future events, are essentially the same as the predictions of the relevant
2
We refer to −1 as the “available information” rather than the “information set” which is an alterna-
tive term used in the literature. The latter term is tricky, however, and has different meanings in different
branches of economics, hence we are hesitant to use it. The subtleties are accounted for in Appendix B,
dealing with mathematical conditional expectations in general.
3
By “possible” is meant “ex ante feasible according to a given model”.
7
economic theory. At the risk of confusing this purely descriptive hypothesis
with a pronouncement as to what firms ought to do, we call such expectations
’rational’ (Muth 1961).
Muth applied this hypothesis to simple microeconomic problems. The hypothesis was
subsequently extended and applied to general equilibrium theory and macroeconomics by
what since the early 1970s became known as the New Classical Macroeconomics school.
Nobel laureate Robert E. Lucas from the University of Chicago lead the way by a series of
papers starting with Lucas (1972) and Lucas (1973). Assuming rational expectations in a
model instead of, for instance, adaptive expectations may radically change the dynamics
and impact of economic policy.
Assuming the economic agents have rational expectations (RE) is to assume that their
subjective expectation equals the model-consistent expectation, that is, the mathematical
conditional expectation that can be calculated on the basis of the model and available
relevant information about the exogenous stochastic variables. In connection with the
model ingredient (7), assuming the agents have rational expectations thus means that
−1 = ( |−1 ) (13)
i.e., agents’ subjective conditional expectation coincides with the “objective” or “true”
conditional expectation, given the model (7).
Together, the equations (7) and (13) constitute a simple rational expectations model
(henceforth an RE model). We may write the model in compact form as
To solve the model means to find the stochastic process followed by given the sto-
chastic process followed by the exogenous variable For a linear RE model with past
expectations of current endogenous variables, the solution procedure is the following.
1. By substitution, reduce the RE model (or the relevant part of the model) into a
form like (14) expressing the endogenous variable in period in terms of its past
8
expectation and the exogenous variable(s). (The case with multiple endogenous
variables is treated similarly.)
2. Take the conditional expectation on both sides of the equation and solve for the
conditional expectation of the endogenous variable.
In practice there is often a fourth step, namely to express other endogenous variables
in the model in terms of those found in step 3. Let us see how the procedure works by
way of the following example.
With myopic expectations, combined with −1 = ∗ say, the positive shock to import
prices at = 0 will imply 0 = ∗ + (̄ + 0 ) = ∗ + 0 0 +1 = ( ∗ + ) + ̄
= ∗ + 0 + = ∗ + for = 1 2 After 0 there is a systematic positive
forecast error. This is because the mechanical expectation does not consider how the
economy really functions. ¤
9
Returning to the general form (14), without specifying the process { } the second
step gives
( |−1 )
( |−1 ) = (16)
1−
when 6= 14 Then, in the third step we get
10
We shall stick to the term “rational expectation” because it is standard. The term
can easily be misunderstood, however. Usually, in economists’ terminology “rational”
refers to behavior based on optimization subject to the constraints faced by the agent.
So one might think that the RE hypothesis stipulates that economic agents try to get the
most out of a situation with limited information, contemplating the benefits and costs
of gathering more information and using adequate statistical estimation methods. But
this is a misunderstanding. The RE hypothesis presumes that the true model is already
known to the agents. The “rationality” refers to taking this assumed knowledge fully into
account.
Let the forecast of some variable one period ahead be denoted −1 . Suppose the
forecast is determined by some given function, , of realizations of and up to and
including period − 1 that is, −1 = (−1 −2 −1 −2 ) Such a function is
known as a forecast function. It might for instance be one of the mechanistic forecasting
principles in Section 1. At the other extreme the forecast function might, at least theo-
retically, coincide with the a model-consistent conditional expectation. In the latter case
it is a model-consistent forecast function and we can write
The forecast error is the difference between the actually occurring future value, of
a variable and the forecasted value. So, for a given forecast, −1 the forecast error is
≡ − −1 and is itself a stochastic variable.
If the forecast function in (18) complies with the true data-generating process (a big
“if”), then the implied forecasts would have several ideal properties:
(b) the forecast error would be uncorrelated with any of the variable in the information
−1 and therefore also with its own past values; and
11
tation is nil, i.e., ( ) = 0; this is because (( |−1 )) = (0) = 0 at the same time as
(( |−1 )) = ( ) by the law of iterated expectations from statistics saying that the
unconditional expectation of the conditional expectation of a stochastic variable is given
by the unconditional expectation of , cf. Appendix B. Considering the specific model
(7), the model-consistent-forecast error is = − ( |−1 ) = ( − ( |−1 )) by
(16) and (17). An ex post error ( 6= 0) thus emerges if and only if the realization of the
exogenous variable deviates from its conditional expectation as seen from the previous
period.
As to property (b), for = 1 2 let − be some variable value belonging to the
information − . Then, property (b) is the claim that the (unconditional) covariance
between and − is zero, i.e., Cov( − ) = 0 for = 1 2 . This follows from the
orthogonality property of model-consistent expectations (see Appendix C). In particular,
with − = − we get Cov( − ) = 0 i.e., the forecast errors exhibit lack of serial
correlation. If the covariance were not zero, it would be possible to improve the forecast
by incorporating the correlation into the forecast. In other words, under the assumption of
rational expectations economic agents have no more to learn from past forecast errors. As
remarked above, the RE hypothesis precisely refers to a fictional situation where learning
has been completed and underlying mechanisms do not change.
Assuming −1 −1 −2 are jointly normally distributed, then the solution to
the problem of minimizing is to set (·) equal to the conditional expectation ( |−1 )
based on the data-generating model as in (18).5 This is what property (c) refers to.
12
It is worth emphasizing that the “true” conditional expectation can not usually be
known − neither to the economic agents nor to the investigator. At best there can be a
reasonable estimate, probably somewhat different across the agents because of differences
in information and conceptions of how the economic system functions. A deeper model of
expectations would give an account of the mechanisms through which agents learn about
the economic environment. An important ingredient here would be how agents contem-
plate the costs and potential gains associated with further information search needed
to reduce systematic expectation errors where possible. This contemplation is intricate
because information search often means entering unknown territory. Moreover, for a sig-
nificant subset of the agents the costs may be prohibitive. A further complicating factor
involved in learning is that when the agents have obtained some knowledge about the
statistical properties of the economic variables, the resulting behavior of the agents may
change these statistical properties. The rational expectations hypothesis sets these prob-
lems aside. It is simply assumed that the structure of the economy remains unchanged
and that the learning process has been completed.
The notion of perfect foresight corresponds to the limiting case where the variance of
the exogenous variable(s) is zero so that with probability one, = ( |−1 ) for all
. Then we have a non-stochastic model where rational expectations imply that agents’
ex post forecast error with respect to is zero.6 To put it differently: rational expec-
tations in a non-stochastic model is equivalent to perfect foresight. Note, however, that
perfect foresight necessitates the exogenous variable to be known in advance. Real-
world situations are usually not like that. If we want our model to take this into account,
the model ought to be formulated in an explicit stochastic framework. And assumptions
should be stated about how the economic agents respond to the uncertainty. The ra-
tional expectations assumption is a one approach to the problem and has been much
applied in macroeconomics in recent decades, perhaps due to lack of compelling tractable
alternatives.
6
Here we disregard zero probability events.
13
3 Models with rational forward-looking expectations
That is, the expected value of a stochastic variable, + conditional on the information
, will be denoted +
A stochastic difference equation of the form (19) is called a linear expectation difference
equation of first order with constant coefficient .7 A solution is a specified stochastic
process { } which satisfies (19), given the stochastic process followed by . In the
economic applications usually no initial value, 0 , is given. On the contrary, the interpre-
tation is that depends, for all on expectations about the future.8 So is considered
a jump variable that can immediately shift its value in response to the emergence of new
information about the future ’s. For example, a share price may immediately jump to a
new value when the accounts of the firm become publicly known (often even before, due
to sudden rumors).
Due to the lack of an initial condition for there can easily be infinitely many
processes for satisfying our expectation difference equation. We have an infinite forward-
looking “regress”, where a variable’s value today depends on its expected value tomorrow,
this value depending on the expected value the day after tomorrow and so on. Then usu-
ally there are infinitely many expected sequences which can be self-fulfilling in the sense
that if only the agents expect a particular sequence, then the aggregate outcome of their
behavior will be that the sequence is realized. It “bites its own tail” so to speak. Yet, when
7
To keep things simple, we let the coefficients and be constants, but a generalization to time-
dependent coefficients is straightforward.
8
The reason we say “depends on” is that it would be inaccurate to say that is determined (in a
one-way-sense) by expectations about the future. Rather there is mutual dependence. In view of being
an element in the information the expectation of +1 in (19) may depend on just as much as
depends on the expectation of +1 .
14
an equation like (19) is part of a larger model, there will often (but not always) be con-
ditions that allow us to select one of the many solutions to (19) as the only economically
relevant one. For example, an economy-wide transversality condition or another general
equilibrium condition may rule out divergent solutions and leave a unique convergent
solution as the final solution.
We assume 6= 0 since otherwise (19) itself is already the unique solution. It turns
out that the set of solutions to (19) takes a different form depending on whether || 1
or || 1:
The case || 1 In general, there is a unique fundamental solution and infinitely many
explosive bubble solutions.
The case || 1 In general, there is no fundamental solution but infinitely many non-
explosive solutions. (The case || = 1 resembles this.)
In the case || 1 the expected future has modest influence on the present. Here we
will concentrate on this case, since it is the case most frequently appearing in macroeco-
nomic models with rational expectations.
Various solution methods are available. Repeated forward substitution is the most easily
understood method.
Repeated forward substitution consists of the following steps. We first shift (19) one
period ahead:
+1 = +1 +2 + +1
where the second equality sign is due to the law of iterated expectations, which says that
15
see Box 1. Inserting (20) into (19) then gives
The procedure is repeated by forwarding (19) two periods ahead; then taking the condi-
tional expectation and inserting into (22), we get
We continue in this way and the general form (for = 0 1 2 ) becomes
The method of repeated forward substitution is based on the law of iterated expecta-
tions which says that ( +1 +2 ) = +2 as in (21). The logic is the fol-
lowing. Events in period + 1 are stochastic as seen from period and so +1 +2
(the expectation conditional on these events) is a stochastic variable. Then the law
of iterated expectations says that the conditional expectation of this stochastic variable
as seen from period is the same as the conditional expectation of +2 itself as seen
from period So, given that expectations are rational, then an earlier expectation of
a later expectation of is just the earlier expectation of . Put differently: my best
forecast today of how I am going to forecast tomorrow a share price the day after
tomorrow, will be the same as my best forecast today of the share price the day after
tomorrow. If beforehand we have good reasons to expect that we will revise our
expectations upward, say, when next period’s additional information arrives, the
original expectation would be biased, hence not rational.9
X
lim + exists, (24)
→∞
=1
9
A formal account of conditional expectations and the law of iterated expectations is given in Appendix
B.
16
then
X
∞ X
∞
= + = + + ≡ ∗ = 0 1 2 (25)
=0 =1
Proof Assume (24). Then the formula (25) is meaningful. In view of (23), it satisfies
(19) if and only if lim→∞ +1 ++1 = 0 Hence, it is enough to show that the process
(25) satisfies this latter condition.
P∞
In (25), replace by + + 1 to get ++1 = =0 ++1 ++1+ Using the law
of iterated expectations, this yields
X
∞
++1 = ++1+ so that
=0
X
∞ X
∞
+1 +1
++1 = ++1+ = +
=0 =+1
P∞
It remains to show that lim→∞ =+1 + = 0 From the identity
X
∞ X
X
∞
+ = + + +
=1 =1 =+1
follows
X
∞ X
∞ X
+ = + − +
=+1 =1 =1
The solution (25) is called the fundamental solution of (19), often marked by an
asterisk ∗ . The fundamental solution is (for 6= 0) defined only when the condition (24)
holds. In general this condition requires that || 1 In addition, (24) requires that the
absolute value of the expectation of the exogenous variable does not increase “too fast”.
More precisely, the requirement is that | + |, when → ∞, has a growth factor less
than ||−1 As an example, let 0 1 and 0, and suppose that + 0 for
= 0 1 2 and that 1 + is an upper bound for the growth factor of + Then
17
Multiplying by , we get + ≤ (1 + ) By summing from = 1 to
X
X
+ ≤ [(1 + )]
=1 =1
Letting → ∞ we get
X X
(1 + )
lim
+ ≤ lim [(1 + )] = ∞
→∞
=1
→∞
=1
1 − (1 + )
if 1 + −1 using the sum rule for an infinite geometric series.
As noted in the proof of Proposition 1, the fundamental solution, (25), has the property
that
lim + = 0 (26)
→∞
That is, the expected value of is not “explosive”: its absolute value has a growth factor
less than ||−1 . Given || 1 the fundamental solution is the only solution of (19) with
this property. Indeed, it is seen from (23) that whenever (26) holds, (25) must also hold.
In Example 1 below, is interpreted as the market price of a share and as dividends.
Then the fundamental solution gives the share price as the present value of the expected
future flow of dividends.
18
Suppose investors have rational expectations and care only about expected return.
Then the no-arbitrage condition reads
+ +1 −
= 0 (27)
This can be written
1 1
= +1 + (28)
1+ 1+
which is of the same form as (19) with = = 1(1 + ) ∈ (0 1). Assuming dividends do
not grow “too fast”, we find the fundamental solution, denoted ∗ as
1 X X
∞ ∞
1 1 1
∗ = +
+ =
+1 +
(29)
1+ 1 + =1 (1 + ) =0
(1 + )
The fundamental solution is simply the present value of expected future dividends.
If the dividend process is +1 = + +1 where +1 is white noise, then the dividend
process is known as a random walk and + = for = 1 2 Thus ∗ = , by
the sum rule for an infinite geometric series. In this case the fundamental value is thus
itself a random walk. More generally, the dividend process could be a martingale, that is,
a sequence of stochastic variables with the property that the expected value next period
exists and equals the current actual value, i.e., +1 = ; but in a martingale, +1
≡ +1 − need not be white noise; it is enough that +1 = 011 Given the constant
required return we still have ∗ = So the fundamental value itself is in this case a
martingale. ¤
In finance theory the present value of the expected future flow of dividends on an
equity share is referred to as the fundamental value of the share. It is by analogy with
this that the general designation fundamental solution has been introduced for solutions
of form (25). We could also think of as the market price of a house rented out and
as the rent. Or could be the market price of an oil well and the revenue (net of
extraction costs) from the extracted oil in period
Other than the fundamental solution, the expectation difference equation (19) has infi-
nitely many bubble solutions. In view of || 1, these are characterized by violating the
condition (26). That is, they are solutions whose expected value explodes over time.
11
A random walk is thus a special case of a martingale.
19
It is convenient to first consider the homogenous expectation equation associated with
(19). This is defined as the equation emerging when setting = 0 in (19):
and is thus a solution to (30). The “disturbance” +1 represents “new information” which
may be related to movements in “fundamentals”, +1 But it does not have to. In fact,
+1 may be related to conditions that per se have no economic relevance whatsoever.
For ease of notation, from now on we just write even if we think of the whole process
{ } rather than the value taken by in the specific period The meaning should be clear
from the context. A solution to (30) is referred to as a homogenous solution associated
with (19). Let be a given homogenous solution and let be an arbitrary constant.
Then = is also a homogenous solution (try it out for yourself). Conversely, any
homogenous solution associated with (19) can be written in the form (31). To see this,
let be a given homogenous solution, that is, = +1 . Let +1 = +1 − +1 .
Then
+1 = +1 + +1 = −1 + +1
For convenience we here repeat our original expectation difference equation (19):
PROPOSITION 2 Consider the expectation difference equation (*). Let ̃ be a particular
solution to the expectation difference equation (19), where 6= 0 Then:
= ̃ + (33)
(ii) every solution to (*) can be written in the form (33) with being an appropriately
chosen homogenous solution associated with (*).
20
Proof. Let some particular solution ̃ be given. (i) Consider = ̃ + where satisfies
(31). Since ̃ satisfies (*), we have = ̃+1 + + . Consequently, by (30),
saying that (33) satisfies (*). (ii) Let be an arbitrary solution to (*). Define = − ̃ .
Then we have
where the second equality follows from the fact that both and ̃ are solutions to (*).
This shows that is a solution to the homogenous equation (30) associated with (*).
Since = ̃ + , the proposition is hereby proved. ¤
Proposition 2 holds for any 6= 0 In case the fundamental solution (25) exists and
|| 1, it is convenient to choose this solution as the particular solution in (33). Thus,
referring to the right-hand side of (25) as ∗ , we can use the particular form,
= ∗ + (34)
When the component is different from zero, the solution (34) is called a bubble
solution and is called the bubble component. In the typical economic interpretation
the bubble component shows up only because it is expected to show up next period, cf.
(32). The name bubble springs from the fact that the expected value conditional on the
information available in period explodes over time when || 1. To see this, as an
example, let 0 1 Then, from (30), by repeated forward substitution we get
It follows that + = − , and from this follows that the bubble, for going to infinity,
is unbounded in expected value:
½
∞, if 0
lim + = (35)
→∞ −∞ if 0
Indeed, the absolute value of + will for rising grow geometrically towards infinity
with a growth factor equal to 1 1
Let us consider a special case of (*19) that allows a simple graphical illustration of
both the fundamental solution and some bubble solutions.
21
y
II
cx I
1 a
III
Suppose the stochastic process (the “fundamentals”) takes the form = ̄ + where
̄ is a constant and is white noise. Then
It may be instructive to consider the case where all stochastic features are eliminated.
So we assume ≡ ≡ 0. Then we have a model with perfect foresight; the solution (37)
simplifies to
̄
+ 0 −
= (38)
1−
where we have used repeated backward substitution in (31). By setting = 0 we see that
̄
0 = 1−
+ 0 Inserting this into (38) gives
̄ ̄
= + (0 − )− (39)
1− 1−
22
̄(1 − ) and trajectory III, with 0 ̄(1 −) are bubble solutions. Since we have
imposed no boundary condition apriori, one 0 is as good as any other. The interpretation
is that there are infinitely many trajectories with the property that if only the economic
agents expect the economy will follow that particular trajectory, the aggregate outcome of
their behavior will be that this trajectory is realized. This is the potential indeterminacy
arising when is not a predetermined variable. However, as alluded to above, in a
complete economic model there will often be restrictions on the endogenous variable(s)
not visible in the basic expectation difference equation(s), here (36). It may be that
the economic meaning of precludes negative values (a share certificate would be an
example). In that case no-one can rationally expect a path such as III in Fig. 1. Or
perhaps, for some reason, there is an upper bound on (think of the full-employment
ceiling for output in a situation where the “natural” growth factor for output is smaller
than −1 ). Then no one can rationally expect a trajectory like II in the figure.
To sum up: in order for a solution of a first-order linear expectation difference equation
with constant coefficient , where || 1 to differ from the fundamental solution, the
solution must have the form (34) where has the form described in (31). This provides
a clue as to what asset price bubbles might look like.
A stylized fact of stock markets is that stock price indices are quite volatile on a month-to-
month, year-to-year, and especially decade-to-decade scale, cf. Fig. 2. There are different
views about how these swings should be understood. According to the Efficient Market
Hypothesis the swings just reflect unpredictable changes in the “fundamentals”, that is,
changes in the present value of rationally expected future dividends. This is for instance
the view of Nobel laureate Eugene Fama (1970, 2003) from University of Chicago.
In contrast, Nobel laureate Robert Shiller (1981, 2003, 2005) from Yale University,
and others, have pointed to the phenomenon of “excess volatility”. The view is that asset
prices tend to fluctuate more than can be rationalized by shifts in information about
fundamentals (present values of dividends). Although in no way a verification, graphs
like those in Fig. 2 and Fig. 3 are suggestive. Fig. 2 shows the monthly real Standard
and Poors (S&P) composite stock prices and real S&P composite earnings for the period
1871-2008. The unusually large increase in real stock prices since the mid-90’s, which
ended with the collapse in 2000, is known as the “dot-com bubble”. Fig. 3 shows, on a
monthly basis, the ratio of real S&P stock prices to an average of the previous ten years’
23
2000 500
1800 450
1200 300
1000 250
800 200
600 150
400 100
200 50
0 0
1860 1880 1900 1920 1940 1960 1980 2000 2020
Figure 2: Monthly real S&P composite stock prices from January 1871 to January 2008 (left)
and monthly real S&P composite earnings from January 1871 to September 2007 (right). Source:
http://www.econ.yale.edu/~shiller/data.htm.
real S&P earnings along with the long-term real interest rate. It is seen that this ratio
reached an all-time high in 2000, by many observers considered as “the year the dot-com
bubble burst”.
Shiller’s interpretation of the large stock market swings is that they are due to fads,
herding, and shifts in fashions and “animal spirits” (the latter being a notion from
Keynes).
A third possible source of large stock market swings was pointed out by Blanchard
(1979) and Blanchard and Watson (1982). They argued that bubble phenomena need
not be due to irrational behavior and absence of rational expectations. This lead to the
theory of rational bubbles − the idea that excess volatility can be explained as speculative
bubbles arising from self-fulfilling rational expectations.
24
50 20
45 18
40 16
30 12
25 10
20 8
15 6
10 4
5 2
0 0
1860 1880 1900 1920 1940 1960 1980 2000 2020
Price earnings ratio Year Long-term real interes t rate
Figure 3: S&P price-earnings ratio and long-term real interest rates from January 1881
to January 2008. The earnings are calculated as a moving average over the preceding
ten years. The long-term real interest rate is the 10-year Treasury rate from 1953 and
government bond yields from Sidney Homer, “A History of Interest Rates” from before
1953. Source: http://www.econ.yale.edu/~shiller/data.htm.
25
the market price, of the asset from its fundamental value, ∗ :
= − ∗ (40)
An asset price bubble that emerges in a setting where the no-arbitrage condition (27)
holds under rational expectations, is called a rational bubble. It emerges only because
there is an economy-wide self-fulfilling expectation that it will appreciate at a rate high
enough to warrant the overcharge involved. In the definition in (40) and in the discussion
below we ignore that at a less abstract level it is a systematic deviation, rather than just
a temporary noise deviation, of from ∗ which qualifies for an asset price bubble.
EXAMPLE 2 (an ever-expanding rational bubble) Consider again an equity share for
which the no-arbitrage condition is
+ +1 −
= 0 (41)
As in Example 1, the implied expectation difference equation is = +1 + with
= = 1(1 + ) ∈ (0 1) Let the price of the share at time be = ∗ + where ∗ is the
fundamental value and 0 a bubble component following the deterministic process,
+1 = (1+) 0 0 so that = 0 (1+) This is called a deterministic rational bubble.
Agents may be ready to pay a price over and above the fundamental value (whether or
not they know the “true” fundamental value) if they expect they can sell at a sufficiently
higher price later; trading with such motivation is called speculative behavior. If generally
held and lasting for some time, this expectation may be self-fulfilling. Note that (41)
implies that the asset price ultimately grows at the rate . Indeed, let = 0 (1 + )
(if ≤ the asset price would be infinite). By the rule of the sum of an infinite
geometrice series, we then have ∗ = ( −) showing that the fundamental value grows
at the rate Consequently, = (∗ + ) = ∗ + 1 → 1 as It follows that
the asset price in the long run grows at the same rate as the bubble, the rate
We are not acquainted with ever-expanding incidents of that caliber in real world
situations, however. A deterministic rational bubble is implausible. ¤
In some contexts it may not matter whether or not we think of the “rational” market
participants as knowing the probability distribution of the “fundamentals”, hence knowing
∗ (by “fundamentals” is meant any information relating to the future dividend or service
capacity of an asset: a firm’s technology, resources, market conditions etc.). All the same,
it seems common to imply such a high level of information in the term “rational bubbles”.
Unless otherwise indicated, we shall let this implication be understood.
26
While a deterministic rational bubble was found implausible, let us now consider an
example of a stochastic rational bubble which sooner or later bursts.
This bubble satisfies the criterion for a rational bubble. Indeed, (42) implies
1+
+1 = ( )+1 + 0 · (1 − +1 ) = (1 + )
+1
This is of the form (31) with −1 = 1 + and the bubble is therefore a stochastic
rational bubble. The stochastic component is +1 = +1 − +1 = +1 − (1 + )
and has conditional expectation equal to zero. Although +1 must have zero conditional
expectation, it need not be white noise (it can for instance have varying variance). ¤
As this example illustrates, a stochastic rational bubble does not have the implausible
ever-expanding form of a deterministic rational bubble. Yet, under certain conditions
even stochastic rational bubbles can be ruled out or at least be judged implausible. The
next section reviews some arguments.
We concentrate on assets whose services are valued independently of the price.13 Let
be the market price and ∗ the fundamental value of the asset as of time . Even if the
13
This is in contrast to assets that serve as means of payment.
27
asset yields services rather than dividends, we think of ∗ as in principle the same for all
agents. This is because a user who, in a given period, values the service flow of the asset
relatively low can hire it out to the one who values it highest (the one with the highest
willingness to pay). Until further notice we assume ∗ known to the market participants.
(a) Assets which can be freely disposed of (“free disposal”) Can a rational asset
price bubble be negative? The answer is no. The logic can be illustrated on the basis
of Example 2 above. For simplicity, let the dividend be the same constant 0 for all
= 0 1 2 . Then, from the formula (39) we have
where 0 and ∗ = Suppose there is a negative bubble in period 0, i.e., 0 −∗ 0
In period 1, since 1 + 1 the bubble is greater in absolute value. The downward
movement of continues and sooner or later is negative. The intuition is that the
low 0 in period 0 implies a high dividend-price ratio. Hence a negative capital gain
(+1 − 0) is needed for the no-arbitrage condition (41) to hold. Thereby 1 0
and so on.
But in a market with self-interested rational agents, an object which can be freely
disposed of can never have a negative price. A negative price means that the “seller”
has to pay to dispose of the object. Nobody will do that if the object can just be
thrown away. An asset which can be freely disposed of (share certificates for instance)
can therefore never have a negative price. We conclude that a negative rational bubble
can not be consistent with rational expectations. Similarly, with a stochastic dividend,
a negative rational bubble would imply that in expected value the share price becomes
negative at some point in time, cf. (35). Again, rational expectations rule this out.
Hence, if we imagine that for a short moment ∗ , then everyone will want to buy
the asset and hold it forever, which by own use or by hiring out will imply a discounted
value equal to ∗ There is thus excess demand until has risen to ∗
When a negative rational bubble can be ruled out, then, if at the first date of trading
of the asset there were no positive bubble, neither can a positive bubble arise later. Let
28
us make this precise:
PROPOSITION 3 Assume free disposal of a given asset. Then, if a rational bubble in the
asset price is present today, it must be positive and must have been present also yesterday
and so on back to the first date of trading the asset. And if a rational bubble bursts, it
will not restart later.
Proof As argued above, in view of free disposal, a negative rational bubble in the asset
price can be ruled out. It follows that = − ∗ ≥ 0 for = 0 1 2 where = 0 is
the first date of trading the asset. That is, any rational bubble in the asset price must be
a positive bubble. We now show by contradiction that if, for an arbitrary = 1 2 it
holds that 0 then −1 0. Let 0 Then, if −1 = 0 we have −1 = −1
= 0 (from (31) with replaced by − 1), implying, since 0 is not possible, that = 0
with probability one as seen from period − 1 Ignoring zero probability events, this rules
out 0 and we have arrived at a contradiction. Thus −1 0 Replacing by − 1
and so on backward in time, we end up with 0 0. This reasoning also implies that if
a bubble bursts in period , it can not restart in period + 1 nor, by extension, in any
subsequent period. ¤
This proposition (due to Diba and Grossman, 1988) claims that a rational bubble in
an asset price must have been there since trading of the asset began. Yet such a conclusion
is not without ambiguities. If new information about radically new technology comes up
at some point in time, is a share in the firm then the same asset as before? In a legal
sense the firm is the same, but is the asset also the same? Even if an earlier bubble has
crashed, cannot a new rational bubble arise later in case of an utterly new situation?
These ambiguities reflect the difficulty involved in the concepts of rational expectations
and rational bubbles when we are dealing with uncertainties about future developments of
the economy. The market’s evaluation of many assets of macroeconomic importance, not
the least shares in firms, depends on vague beliefs about future preferences, technologies,
and societal circumstances. The fundamental value can not be determined in any objective
way. There is no well-defined probability distribution over the potential future outcomes.
Fundamental uncertainty, also called Knightian uncertainty,14 is present.
(b) Bonds with finite maturity The finite maturity ensures that the value of the bond
is given at some finite future date. Therefore, if there were a positive bubble in the market
14
After the Chicago of University economist Frank Knight who in his book, Risk, Uncertainty, and
Profit (1921), coined the important distinction between measurable risk and unmeasurable uncertainty.
29
price of the bond, no rational investor would buy just before that date. Anticipating this,
no one would buy the date before, and so on. Consequently, nobody will buy in the first
place. By this backward-induction argument follows that a positive bubble cannot get
started. And since there also is “free disposal”, all rational bubbles can be precluded.
From now on we take as given that negative rational bubbles are ruled out. So, the
discussion is about whether positive rational asset price bubbles may exist or not.
(c) Assets whose supply is elastic Real capital goods (including buildings) can be
reproduced and have clearly defined costs of reproduction. This precludes rational bubbles
on this kind of assets, since a potential buyer can avoid the overcharge by producing
instead. Notice, however, that building sites with a specific amenity value and apartments
in attractive quarters of a city are not easily reproducible. Therefore, rational bubbles on
such assets are more difficult to rule out.
Here are a few intuitive remarks about bubbles on shares of stock in an established
firm. An argument against a rational bubble might be that if there were a bubble, the
firm would tend to exploit it by issuing more shares. But thereby market participants
mistrust is raised and may pull market evaluation back to the fundamental value. On
the other hand, the firm might anticipate this adverse response from the market. So the
firm chooses instead to “fool” the market by steady financing behavior, calmly enjoying
its solid equity and continuing as if no bubble were present. It is therefore not obvious
that this kind of argument can rule out rational bubbles on shares of stock.
(d) Assets for which there exists a “backstop-technology” For some articles of
trade there exists substitutes in elastic supply which will be demanded if the price of
the article becomes sufficiently high. Such a substitute is called a “backstop-technology”.
For example oil and other fossil fuels will, when their prices become sufficiently high,
be subject to intense competition from substitutes (renewable energy sources). This
precludes an unbounded bubble process in the price of oil.
On account of the arguments (c) and (d), it seems more difficult to rule out rational
bubbles when it comes to assets which are not reproducible or substitutable, let alone
assets whose fundamentals are difficult to ascertain. For some assets the fundamentals
are not easily ascertained. Examples are paintings of past great artists, rare stamps,
diamonds, gold etc. Also new firms that introduce completely novel products and tech-
nologies are potential candidates. Think of the proliferation of radio broadcasting in the
30
1920s before the wall Street crash in 1929 and the internet in the 1990s before the dotcom
bubble burst in 2000.
What these situations allow for may not be termed rational bubbles, if by definition
this concept requires a well-defined fundamental. Then we may think of a broader class
of real-world bubbly phenomena driven by self-reinforcing expectations.
The above considerations are of a partial equilibrium nature. On top of this, general
equilibrium arguments can be put forward to limit the possibility of rational bubbles. We
may briefly give a flavour of two such general equilibrium arguments. We still consider
assets whose services are valued independently of the price and which, as in (a) above,
can be freely disposed of. A house, a machine, or a share in a firm yields a service in
consumption or production or in the form of a dividend stream. Since such an asset has
an intrinsic value, ∗ equal to the present value of the flow of services, one might believe
that positive rational bubbles on such assets can be ruled out in general equilibrium.
As we shall see, this is indeed true for an economy with a finite number of “neoclassical”
households (to be defined below), but not necessarily in an overlapping generations model.
Yet even there, rational bubbles can under certain conditions be ruled out.
Under free disposal in point (a) we saw that ∗ can not be an equilibrium. We
now consider the case of a positive bubble, i.e., ∗ All owners of the bubble asset
who are users will in this case prefer to sell and then rent; this would imply excess supply
and could thus not be an equilibrium. Hence, we turn to households that are not users,
but speculators. Assuming “short selling” is legal, speculators may pursue “short selling”,
that is, they first rent the asset (for a contracted interval of time) and immediately sell
it at . This results in excess supply and so the asset price falls towards ∗ . Within the
contracted interval of time the speculators buy the asset back and return it to the original
owners in accordance with the loan accord. So ∗ can not be an equilibrium.
Even ruling out “short selling” (which is sometimes outright forbidden), we can ex-
clude positive bubbles in the present setup with a finite number of households. To assume
31
that owners who are not users would want to hold the bubble asset forever as a permanent
investment will contradict that these owners are “neoclassical”. Indeed, their transver-
sality condition would be violated because the value of their wealth would grow at a rate
asymptotically equal to the rate of interest. This would allow them to increase their
consumption now without decreasing it later and without violating their No-Ponzi-Game
condition.
We have to instead imagine that the “neoclassical” households who own the bubble
asset, hold it against future sale. This could on the face of it seem rational enough
if there were some probability that not only would the bubble continue to exist, but
it would also grow so that the return would be at least as high as that yielded on an
alternative investment. Owners holding the asset in the expectation of a capital gain, will
thus plan to sell at some later point in time. Let be the point in time where household
wishes to sell and let
= max{1 2 }
Then nobody will plan to hold the asset after The household speculator, having
= will thus not have anyone to sell to (other than people who will only pay ∗ )
Anticipating this, no-one would buy or hold the asset the period before, and so on. So
no-one will want to buy or hold the asset in the first place.
The same line of reasoning does not, however, go through in an overlapping generations
model where new households − that is, new traders − enter the economy every period.
(f) An economy with interest rate above the output growth rate In an overlap-
ping generations (OLG) model with an infinite sequence of new decision makers, rational
bubbles are under certain conditions theoretically possible. The argument is that with
→ ∞ as defined above is not bounded. Although this unboundedness is a necessary
condition for rational bubbles, it is not sufficient, however.
To see why, let us return to the arbitrage examples 1, 2, and 3 where we have −1 =
1 + so that a hypothetical rational bubble has the form +1 = (1 + ) ++1 where
+1 = 0 So in expected value the hypothetical bubble is growing at a rate equal to
the interest rate, If at the same time is higher than the long-run output growth rate,
the value of the expanding bubble asset would sooner or later be larger than GDP and
aggregate saving would not suffice to back its continued growth. Agents with rational
32
expectations anticipate this and so the bubble never gets started.
This point is valid when the interest rate in the OLG economy is higher than the
growth rate of the economy − which is normally considered the realistic case. Yet, the
opposite case is possible and in that situation it is less easy to rule out rational asset
price bubbles. This is also the case in situations with imperfect credit markets. It turns
out that the presence of segmented financial markets or externalities that create a wedge
between private and social returns on productive investment may increase the scope for
rational bubbles (Blanchard, 2008).
4.5 Conclusion
The empirical evidence concerning asset price bubbles in general and rational asset price
bubbles in particular seems inconclusive. It is very difficult to statistically distinguish
between bubbles and mis-specified fundamentals. Rational bubbles can also have more
complicated forms than the bursting bubble in Example 3 above. For example Evans
(1991) and Hall et al. (1999) study “regime-switching” rational bubbles.
Whatever the possible limits to the plausibility of rational bubbles in asset prices, it is
useful to be aware of their logical structure and the variety of forms they can take as logical
possibilities. Rational bubbles may serve as a benchmark for a variety of “behavioral asset
price bubbles”, i.e., bubbles arising through particular psychological mechanisms. This
would take us to behavioral finance theory. The reader is referred to, e.g., Shiller (2003).
For surveys on the theory of rational bubbles and econometric bubble tests, see Salge
(1997) and Gürkaynak (2008). For discussions of famous historical bubble episodes, see
the symposium in Journal of Economic Perspectives 4, No. 2, 1990, and Shiller (2005).
5 Appendix
In many macroeconomic models with rational expectations the equations are specified as
log-linear, that is, as being linear in the logarithms of the variables. If and are
the original positive stochastic variables, defining = ln , = ln and = ln , a
log-linear relationship between and is a relation of the form
= + + (43)
33
where and are constants. The motivation for assuming log-linearity can be:
(a) Linearity is convenient because of the simple rule for the expected value of a sum:
( + + ) = + () + (), where is the expectation operator. Indeed,
for a non-linear function, ( ) we generally have (( )) 6= (() ()).
(b) Linearity in logs may often seem a more realistic assumption than linearity in any-
thing else.
Proof Let the positive variables , and be related by = (, ), where is a
continuous function with continuous partial derivatives. Taking the differential on both
sides of ln = ln ( ) we get
1 1
ln = + (44)
( ) ( )
= + = + = ln + ln
where and are the partial elasticities of w.r.t. and , respectively. Thus,
defining = ln , = ln and = ln , gives
= + (45)
Now, let us instead start by assuming the log-linear relationship (43). Then,
= = (46)
But (43), together with the definitions of , and implies that
34
from which follows that
1
= so that ≡ =
and
1
= so that ≡ =
That is, the partial elasticities are constant. ¤
So, when the variables are in logs, then the coefficients in the linear expressions are
the elasticities. Note, however, that the interest rate is normally an exception. It is often
regarded as more realistic to let the interest rate itself and not its logarithm enter linearly.
Then the associated coefficient indicates the semi-elasticity with respect to the interest
rate.
The mathematical conditional expectation is a weighted sum of the possible values of the
stochastic variable with weights equal to the corresponding conditional probabilities.
Let and be two discrete stochastic variables with joint probability function ( )
and marginal probability functions () and () respectively. If the conditional probabil-
ity function for given = 0 is denoted ( |0 ) we have ( |0 ) = ( 0 )(0 ) as-
suming (0 ) 0 The conditional expectation of given = 0 denoted ( | = 0 )
is then
X ( 0 )
( | = 0 ) = (47)
(0 )
where the summation is over all the possible values of
When we here let the conditional expectation ( |) play the role of () and sum over
all for which () 0 we get
à !
X X X ( )
(( |)) = ( |)() = () (by (47))
()
à !
X X X
= ( ) = () = ( )
35
This result is a manifestation of the law of iterated expectations: the unconditional
expectation of the conditional expectation of is given by the unconditional expectation
of
Now consider the case where and are continuous stochastic variables with joint
probability density function ( ) and marginal density functions () and () respec-
tively. If the conditional density function for given = 0 is denoted ( |0 ) we have
( |0 ) = ( 0 )(0 ) assuming (0 ) 0 The conditional expectation of given
= 0 is Z ∞
( 0 )
( | = 0 ) = (48)
−∞ (0 )
where we have assumed that the range of is (−∞ ∞) Again, we may view the condi-
tional expectation itself as a stochastic variable and write it as ( |) Generally, for a
function of the continuous stochastic variable say () the expected value is
Z
(()) = ()()
where stands for the range of When we let the conditional expectation ( |) play
the role of () we get
Z Z µZ ∞ ¶
( )
(( |)) = ( |)() = () (by (48))
−∞ ()
Z ∞ µZ ¶ Z ∞
= ( ) = () = ( ) (49)
−∞ −∞
This shows us the law of iterated expectations in action for continuous stochastic
variables: the unconditional expectation of the conditional expectation of is given by
the unconditional expectation of
EXAMPLE Let the two stochastic variables, and follow a two-dimensional normal
distribution. Then, from mathematical statistics we know that the conditional expectation
of given satisfies
Cov( )
( |) = ( ) + ( − ())
Var()
Taking expectations on both sides gives
Cov( )
(( |)) = ( ) + (() − ()) = ( ) ¤
Var()
We may also express the law of iterated expectations in terms of subsets of the original
outcome space for a stochastic variable. Let the event A be a subset of the outcome space
36
for and let B be a subset of A. Then the law of iterated expectations takes the form
In the text of this and the subsequent chapters we consider a dynamic context where
expectations are conditional on dated information − ( = 1 2 ). By a, so far, “informal
analogy” with (49) we then write the law of iterated expectations this way:
That is, the expectation today of the expectation tomorrow, when more may be known,
of a variable the day after tomorrow is the same as the expectation today of the variable
the day after tomorrow. Intuitively: you ask a stockbroker in which direction she expects
to revise her expectations upon the arrival of more information. If the broker answers
“upward”, say, then another broker is recommended.
The notation used in the transition from (50) to (52) might seem problematic, though.
That is why we talk of “informal analogy”. The sets A and B are subsets of the outcome
space and B ⊆ A In contrast, the “information” or “information content” represented by
our symbol will, for the uninitiated, inevitably be understood in a meaning not fitting
the inclusion +1 ⊆ . Intuitively “information” dictates the opposite inclusion, namely
as a set which expands over time − more and more “information” (like “knowledge” or
“available data”) is revealed as time proceeds.
37
more and more realizations occur, that is, more and more of the ex ante random states
( ) become historical data, ( ) Hence, as time proceeds, the subset Ω shrinks
in the sense that Ω+1 ⊆ Ω . The increasing amount of information and the “reduced
uncertainty” can thus be seen as two sides of the same thing. Interpreting this way,
i.e., as “partial lack of uncertainty”, the expression (52) means the same thing as
As in the text of Section 24.2.2, let denote the model-consistent forecast error −
( |−1 ) Then, if −1 represents information contained in −1 ,
( |−1 ) = ( − ( |−1 ) |−1 ) = ( |−1 ) − (( |−1 ) |−1 )
= ( |−1 ) − ( |−1 ) = 0 (53)
where we have used that (( |−1 ) |−1 ) = ( |−1 ) by the law of iterated expec-
tations. With −1 = −1 we have, as a special case,
As to property (b) in Section 24.2.2, for = 1 2 let − be an arbitrary variable
value belonging to the information − . Then, ( − |− ) = − ( |− ) = 0 by
(53) with −1 = − (since − is contained in −1 ). Thus, by the principle (51),
by (54) and (55). In particular, with − = − we get Cov( − ) = 0 This proves that
model-consistent forecast errors exhibit lack of serial correlation.
38
6 Exercises
2. Consider a simple Keynesian model of a closed economy with constant wages and
prices (behind the scene), abundant capacity, and output determined by demand:
= = + ¯ + (1)
= + −1 0 0 1 (2)
= (1 − )̄ + −1 + ̄ 0 0 1 (3)
Suppose expectations are “static” in the sense that expected income in period equals
actual income in the previous period.
a) Solve for .
39
Suppose instead that expectations are rational.
d) Solve for
e) Find the income multiplier with respect to a change in −1 and respectively.
3. Consider arbitrage between equity shares and a riskless asset paying the constant
rate of return 0. Let denote the price at the beginning of period of a share that
at the end of period yields the dividend . As seen from period there is uncertainty
about + and + for = 1 2. . . . Suppose agents have rational expectations and care
only about expected return (risk neutrality).
b) Find the fundamental solution for and let it be denoted ∗ . Hint: given
P
= +1 + the fundamental solution is = + ∞
=1 +
Suppose someone claims that the share price follows the process
= ∗ +
c) What is an asset price bubble and what is a rational asset price bubble?
d) Can the described process be a rational asset price bubble? Hint: a bubble
component associated with the inhomogenous equation = +1 + is a
solution, different from zero, to the homogeneous equation, = +1 .
40
Chapter 16
Money in macroeconomics
Money buys goods and goods buy money; but goods do not buy goods.
Robert W. Clower (1967).
Up to now we have put monetary issues aside. The implicit assumption has
been that the exchange of goods and services in the market economy can be
carried out without friction as mere intra- or intertemporal barter. This is, of
course, not realistic. At best it can provide an acceptable approximation to reality
only for a limited set of macroeconomic issues. We now turn to models in which
there is a demand for money. We thus turn to monetary theory, that is, the study
of causes and consequences of the fact that a large part of the exchange of goods
and services in the real world is mediated through the use of money.
645
646 CHAPTER 16. MONEY IN MACROECONOMICS
Figure 16.1: No direct exchange possible. A medium of exchange, here good 2, solves
the problem (details in text).
(because the value exists through “…at”, a ruler’s declaration). In view of the
absence of intrinsic value, maintaining the exchange value of …at money over
time, that is, avoiding high or ‡uctuating in‡ation, is one of the central tasks of
monetary policy.
2. It is a store of value.
3. It serves as a unit of account in which prices are quoted and books kept
(the numeraire).
On can argue, however, that the last function is on a di¤erent footing com-
pared to the two others. Thus, we should make a distinction between the func-
tions that money necessarily performs, according to our de…nition above, and the
functions that money usually performs. Property 1 and 2 certainly belong to the
essential characteristics of money. By its role as a device for making transactions
money helps an economy to avoid the need for a double coincidence of wants.
In order to perform this role, money must be a store of value, i.e., a device that
transfers and maintains value over time. The reason that people are willing to
exchange their goods for pieces of paper is exactly that these can later be used
to purchase other goods. As a store of value, however, money is dominated by
other stores of value such as bonds and shares that pay a higher rate of return.
When nevertheless there is a demand for money, it is due to the liquidity of this
store of value, that is, its service as a generally accepted medium of exchange.
Property 3, however, is not an indispensable function of money as we have
de…ned it. Though the money unit is usually used as the unit of account in which
prices are quoted, this function of money is conceptually distinct from the other
two functions and has sometimes been distinct in practice. During times of high
in‡ation, foreign currency has been used as a unit of account, whereas the local
money continued to be used as the medium of exchange. During the German
hyperin‡ation of 1922-23 US dollars were the unit of account used in parts of the
economy, whereas the mark was the medium of exchange; and during the Russian
hyperin‡ation in the middle of the 1990s again US dollars were often the unit of
account, but the rouble was still the medium of exchange.
This is not to say that it is of little importance that money usually serves
as numeraire. Indeed, this function of money plays an important role for the
M0 ; i.e., the monetary base, alternatively called base money, central bank
money, or high-powered money. The monetary base is de…ned as fully liquid
claims on the central bank held by the private sector, that is, currency (coins
and notes) in circulation plus demand deposits held by the commercial
banks in the central bank.2 This monetary aggregate is under the direct
control of the central bank and is changed by open-market operations, that
is, by the central bank trading bonds, usually short-term government bonds,
with the private sector. But clearly the monetary base is an imperfect
measure of the liquidity in the private sector.
As we move down the list, the liquidity of the added assets decreases, while
their interest yield increases.4 Currency earns zero interest. When in macroeco-
nomic texts the term “money supply” is used, traditionally M1 or M2 has been
meant; there is, however, a rising tendency to focus on M3 . Along with currency,
the demand deposits in the commercial banks are normally fully liquid, at least
as long as they are guaranteed by a governmental deposit insurance (although
normally only up to a certain maximum per account). The interest earned on
these demand deposits is usually low (at least for “small”depositors) and in fact
often ignored in simple theoretical models.
A related and theoretically important simple classi…cation of money types is
the following:
2. Inside money = money that is not net wealth of the private sector.
1 1 1
M1 = cd 1
= = M0 = mmM0 :
cd+1
+ rd cd+1 cm + rd(1 cm) 1 (1 rd)(1 cm)
(16.3)
The way the central bank controls the monetary base is through open-market
operations, that is, by buying or selling bonds (typically short-term government
bonds) in the amount needed to sustain a desired level of the monetary base. In
the next stage the aim could be to obtain a desired level of M1 or a desired level
of the short-term interest rate or, in an open economy, a desired exchange rate
vis-a-vis other currencies.
5
Blanchard (2003).
The demand for money, by which we mean the quantity of money held by the
non-bank public, should be seen as part of a broader portfolio decision by
which economic agents allocate their …nancial wealth to di¤erent existing
assets, including money, and liabilities. The portfolio decision involves a
balanced consideration of after-tax expected return, risk, and liquidity.
earns no interest at all or at least less interest than other assets. Therefore money
holding involves a trade-o¤ between the need for liquidity and the wish for interest
yield.
The incorporation of a somewhat micro-founded money demand in macro-
models is often based on one or another kind of short-cut:
The shopping-costs approach. Here the liquidity services of money are mod-
elled as reducing shopping time or other kinds of non-pecuniary or pecu-
niary shopping costs. The shopping time needed to purchase a given level
of consumption, ct ; is decreasing in real money holdings and increasing in
ct :
is needed. The short-term nominal interest rate, i, enters because it is the op-
portunity cost of holding cash instead of interest-bearing short-term securities,
for instance government bonds that mature in one year or less. 8 The latter
constitute a close substitute to money because they have a high degree of liquid-
ity. They are standardized and extensively traded in centralized auction markets
and under “normal circumstances” relatively safe. Because of the short term to
maturity, their market value is less volatile than longer-term securities.
Let the money supply in focus be M1 and let P be the general price level in
the economy (say the GDP de‡ator). Then money market equilibrium is present
if
M1 = P L(Y; i); (16.4)
that is, the available amount of money equals nominal money demand. Note that
supply and demand are in terms of stocks (amounts at a given point in time),
not ‡ows. One of the issues in monetary theory is to account for how this stock
equilibrium is brought about at any instant. Which of the variables M1 ; P; Y;
and i is the equilibrating variable? Presuming that the central bank controls M1 ;
classical (pre-Keynesian) monetary theory has P as the equilibrating variable
while in Keynes’monetary theory it is primarily i which has this role.9 Popular
speci…cations of the function L include L(Y; i) = Y i and L(Y; i) = Y e i ;
where and are positive constants.
One may alternatively think of the “money market”in a more narrow sense,
however. We may translate (16.4) into a description of demand and supply for
base money:
P
M0 = L(Y; i); (16.5)
mm
where mm is the money multiplier. The right-hand side of this equation re‡ects
that the demand for M1 via the actions of commercial banks is translated into a
demand for base money.10 If the public needs more cash, the demand for bank
loans rises and when granted, banks’ reserves are reduced. When in the next
round the deposits in the banks increase, then generally also the banks’reserves
8
To simplify, we assume that none of the components in the monetary aggregate considered
earns interest. In practice demand deposits in the central bank and commercial banks may
earn a small nominal interest.
9
If the economy has ended up in a “liquidity trap”with i at its lower bound, 0, an increase in
M1 will not generate further reductions in i. Agents would prefer holding cash at zero interest
rather than short-term bonds at negative interest. That is, the “=”in the equilibrium condition
(16.4) should be replaced by “ ” or, equivalently, L(Y; i) should at i = 0 be interpreted as a
“set-valued function”. The implications of this are taken up later in this book.
10
Although the money multiplier tends to depend positively on i as well as other interest
rates, this aspect is unimportant for the discussion below and is ignored in the notation in
(16.5).
have to increase. To maintain the required reserve-deposit ratio, banks which for
a few days have too little liquidity, borrow from other banks or other institutions
which have too much.
This narrowly de…ned money market is closely related to what is by the practi-
tioners and in the …nancial market statistics called the “money market”, namely
the trade in short-term debt-instruments that are close substitutes to holding
central bank money (think of commercial paper and government bonds with ma-
turity of less than one year). The agents trading in this market not only include
the central bank and the commercial banks but also the mortgage credit institu-
tions, life insurance companies, and other …nancial institutions. What is in the
theoretical models called the “short-term nominal interest rate”can normally be
identi…ed with what is in the …nancial market statistics called the money market
rate or the interbank rate. This is the interest rate (usually measured as a per
year rate) at which the commercial banks provide unsecured loans (“signature
loans”) to each other, often on a day-to-day basis.
Open market operations The commercial banks may under certain condi-
tions borrow (on a secured basis) from the central bank at a rate usually called
the discount rate. This central bank lending rate will be somewhat above the
central bank deposit rate, that is, the interest rate, possibly nil, earned by the
commercial banks on their deposits in the central bank. The interval between the
discount rate and the deposit rate constitutes the interest rate corridor, within
which, under “normal circumstances”, the money market rate, i, ‡uctuates. The
central bank deposit rate acts as a ‡oor for the money market rate and the cen-
tral bank lending rate as a ceiling. Sometimes, however, the money market rate
exceeds the central bank lending rate. This may happen in a …nancial crisis
where the potential lenders are hesitant because of the risk that the borrowing
bank goes bankrupt and because there are constraints on how much and when, a
commercial bank in need of cash can borrow from the central bank.
If the money market rate, i; tends to deviate from what the central bank
aims at (the “target rate”, also called the “policy rate”), the central bank will
typically through open-market operations provide liquidity to the money market
or withhold liquidity from it. The mechanism is as follows. Consider a one-period
government bond with a secured payo¤ equal to 1 euro at the end of the period
and no payo¤s during the period (known as a zero-coupon bond or discount
bond). To …x ideas, let the period length be one month. In the …nancial market
language the maturity date is then one month after the issue date. Let v be the
market price (in euros) of the bond at the beginning of the month. The implicit
monthly interest rate, x; is then the solution to the equation v = (1 + x) 1 ; i.e.,
1
x=v 1:
Translated into an annual interest rate, with monthly compounding, this amounts
to i = (1 + x)12 1 = v 12 1 per year. With v = 0:9975; we get i = 0:03049
per year.11
Suppose the central bank …nds that i is too high and buys a bunch of these
bonds. Then less of them are available for the private sector, which on the other
hand now has a larger money stock at its disposal. According to the Keynesian
monetary theory (which is by now quite commonly accepted), under normal cir-
cumstances the general price level for goods and services is sticky in the short
run. It will be the bond price, v; which responds. In the present case it moves
up, thus lowering i; until the available stocks of bonds and money are willingly
held. In practice this adjustment of v; and hence i; to a new equilibrium level
takes place rapidly.
In recent decades the short-term interest rate has been the main monetary
policy tool when trying to stimulate or dampen the general level of economic
activity and control in‡ation. Under normal circumstances the open market
operations give the central bank a narrow control over the short-term interest rate.
Central banks typically announce their target level for the short-term interest rate
and then adjust the monetary base such that the actual money market rate ends
up close to the announced interest rate. This is what the European Central
Bank (the ECB) does when it announces its target for EONIA (euro overnight
index average) and what the U.S. central bank, the Federal Reserve, does when
it announces its target for the federal funds rate. In spite of its name, the latter
is not an interest rate charged by the U.S. central bank but a weighted average
of the interest rates commercial banks in the U.S. charge each other, usually
overnight.
In the narrowly de…ned “money market”close substitutes to money are traded.
From a logical point of view a more appropriate name for this market would be
the “short-term bond market” or the “near-money market”. This would entail
using the term “market” in its general meaning as a “place” where a certain
type of goods or assets are traded for money. Moreover, speaking of a “short-
term bond market” would be in line with the standard name for market(s) for
…nancial assets with maturity of more than one year, namely market(s) for longer-
term bonds and equity; by practitioners these markets are also called the capital
markets. Anyway, in this book we shall use the term “money market”in its broad
theoretical meaning as an abstract market place where the aggregate demand
for money “meets” the aggregate supply of money. As to what kind of money,
“narrow”or “broad”, further speci…cation is always to be added.
The open-market operations by the central bank a¤ect directly or indirectly
11 i=12 1
With continuous compounding we have v = e so that i = 12 ln v = 0:03004 when v
= 0:9975:
all the equilibrating prices in the …nancial markets as well as expectations about
the future path of these prices. This in‡uence derives from the direct control
over the monetary base, M0 : The central bank has no direct control, however,
over the money supply in the broader sense of M1 ; M2 ; or M3 : These broader
monetary aggregates are also a¤ected by the behavior of the commercial banks
and the non-bank public. The money supply in this broad sense can at most be
an intermediate target for monetary policy, that is, a target that can be reached
in some average-sense in the medium run.
1. How is the level and the growth rate of the money supply (in the M0 sense,
say) linked to:
3. How can monetary policy be designed to stabilize the economy and “smooth”
business cycle ‡uctuations?
16.7 Exercises
The idea that prices of most goods and services are sticky in the short run rests on
the empirical observation that in the short run firms in the manufacturing and service
industries typically let output do the adjustment to changes in demand while keeping
prices unchanged. In industrialized societies firms are able to do that because under
“normal circumstances” there is “abundant production capacity” available in the economy.
Three of the most salient short-run features that arise from macroeconomic time series
1
These number-of-years declarations should not be understood as more than a rough indication. Their
appropriateness will depend on the specific historical circumstances and on the problem at hand.
1
analysis of industrialized market economies are the following (cf. Blanchard and Fischer,
1989, Christiano et al., 1999):
2) Even large movements in quantities are often associated with little or no movement
in relative prices − real price insensitivity. The real wage, for instance, exhibits
such insensitivity in the short run.
These stylized facts pertain to final goods and services. It is not the case that all
nominal prices in the economy are in the short run insensitive vis-a-vis demand changes.
One must distinguish between production of most final goods and services on the one
hand and production of primary foodstuff and raw materials on the other. This leads to
the associated distinction between “cost-determined” and “demand- determined” prices.
Final goods and services are typically differentiated goods (imperfect substitutes).
Their production takes place under conditions of imperfect competition. As a result of
existing reserves of production capacity, generally speaking, the production is elastic w.r.t.
demand. An upward shift in demand tends to be met by a rise in production rather than
price. The price changes which do occur are mostly a response to general changes in costs
of production. Hence the name “cost-determined” prices.
For primary foodstuff and many raw materials the situation is different. To increase
the supply of most agricultural products requires considerable time. This is also true
(though not to the same extent) with respect to mining of raw materials as well as
extraction and transport of crude oil. When production is inelastic w.r.t. demand in
the short run, an increase in demand results in a diminution of stocks and a rise in
price. Hence the name “demand-determined prices”. The price rise may be enhanced by
a speculative element: temporary hoarding in the expectation of further price increases.
The price of oil and coffee − two of the most traded commodities in the world market
− fluctuate a lot. Through the channel of costs the changes in these demand-determined
prices spill over to the prices of final goods. Housing is also an area where, apart from
regulation, demand-determined prices is the rule in the short run.
2
In industrialized economies manufacturing and services are the main sectors, and the
general price level is typically regarded as cost-determined rather than demand deter-
mined. Two further aspects are important. First, many wages and prices are set in
nominal terms by price setting agents like craft unions and firms operating in imperfectly
competitive output markets. Second, these wages and prices are in general deliberately
kept unchanged for some time even if changes in the environment of the agent occurs; this
aspect, possibly due to pecuniary or non-pecuniary costs of changing prices, is known as
nominal price stickiness. Both aspects have vast consequences for the functioning of the
economy as a whole compared with a regime of perfect competition and flexible prices.
The simple model presented below is close to what Paul Krugman named the World’s
Smallest Macroeconomic Model.2 The model is crude but nevertheless useful in at least
three ways:
• the model displays the logic behind the Keynesian refutation of Say’s law.
3
The production function has CRS,
= 0 (1)
The price of the consumption good in terms of money, i.e., the nominal price, is The
wage rate in terms of money, the nominal wage, is We assume that the representative
firm maximizes profit, taking these current prices as given. The nominal profit, possibly
nil, is
Π = − (2)
There is free exit from the production sector in the sense that the representative firm can
decide to produce nothing. Hence, an equilibrium with positive production requires that
profits are non-negative.
The representative household lives only one period, but leaves a bequest for the next
generation. The household supplies labor inelastically in the amount ̄ and receives the
profit obtained by the firm, if any. The household demands the consumption good in the
amount in the current period (since we want to allow cases of non-market clearing,
we distinguish between consumption demand, and realized consumption, . Current
income not consumed is saved for the future. As the output good cannot be stored, the
only non-human asset available in the economy is fiat money, which is thus the only asset
on hand for saving. There is no private banking sector in the economy. So “money”
means the “currency in circulation” (the monetary base) and is on net an asset in the
private sector as a whole. Until further notice the money stock is constant.
̂
= ln + ln 0 1 (3)
where ̂ is the amount of money transferred to “the future”, and is the expected
future price level. The utility discount factor (equal to (1 + )−1 if is the utility
discount rate) reflects “patience”.
Consider the household’s choice problem. Facing and and expecting that the
4
future price level will be the household chooses and ̂ to maximize s.t.
+ ̂ = + + Π ≡ ≤ ̄ (4)
Here, 0 is the stock of money held at the beginning of the current period and is
predetermined. The actual employment is denoted and equals the minimum of the
amount of employment offered by the firm and the labor supply ̄ (the principle of
voluntary trade). The sum of initial financial wealth, and nominal income, + Π
constitutes the budget, 3 Nominal financial wealth at the beginning of the next period
is ̂ = + + Π − i.e., the sum of initial financial wealth and planned saving
where the latter equals + Π − The benefit obtained by transferring ̂ depends
on the expected purchasing power of ̂ hence it is ̂ that enters the utility function.
Presumably the household expects some labor and profit income also in the future and
seemingly ownership rights to the firms’ profit are non-negotiable. How the decision
making is related to such matters is not specified in this minimalist way of representing
that there is a future.
5
In our simple model output demand is the same as the consumption demand So
clearing in the output market, in the sense of equality between demand and actual output,
requires = So, if this clearing condition holds, substituting into (6) gives the
relationship
= (7)
This is only a relationship between and not a solution for any of them since both
are endogenous variables so far. Moreover, the relationship is conditional on clearing in
the output market.
We have assumed that agents take prices as given when making their demand and
supply decisions. But we have said nothing about whether nominal prices are flexible or
rigid as seen from the perspective of the system as a whole.
What Keynes called “classical economics” is nowadays also often called “Walrasian macro-
economics” (sometime just “pre-Keynesian macroeconomics”). In this theoretical tradi-
tion both wages and prices are assumed fully flexible and all markets perfectly competitive.
Firms’ ex ante output supply conditional on a hypothetical wage-price pair ( ) and
the corresponding labor demand will be denoted and , respectively. As we know
from microeconomics, the pair ( ) need not be unique, it can easily be a “set-valued
function” of ( ) Moreover, with constant returns to scale in the production function,
the range of this function may for certain pairs ( ) include (∞ ∞).
The distinguishing feature of the Walrasian approach is that wages and prices are
assumed fully flexible. Both and are thought to adjust immediately so as to clear
the labor market and the output market like in a centralized auction market. Clearing
in the labor market requires that and are adjusted so that actual employment,
equals labor supply, which is here inelastic at the given level ̄ So
= = ̄ = (8)
where the last equality indicates that this employment level is willingly demanded by the
firms.
6
profit requires that the real wage equals labor productivity:
= (9)
With clearing in the labor market, output must equal full-employment output,
= ̄ ≡ = (10)
where the superscript “ ” stands for “full employment”, and where the last equality
indicates that this level of output is willingly supplied by the firms. For this level of
output to match the demand, coming from the households, the price level must be
= ≡ (11)
in view of (7) with = This price level is the classical equilibrium price, hence the
superscript “”. Substituting into (9) gives the classical equilibrium wage
= ≡ (12)
For general equilibrium we also need that the desired money holding at the end of the
period equals the available money stock. By Walras’ law this equality follows automat-
ically from the household’s Walrasian budget constraint and clearing in the output and
labor markets. To see this, note that the Walrasian budget constraint is a special case of
the budget constraint (4), namely the case
+ ̂ = + + Π (13)
where Π is the notional profit associated with the hypothetical production plan ( )
i.e.,
Π ≡ − (14)
The Walrasian budget constraint thus imposes replacement of the term for actual employ-
ment, with the households’ desired labor supply, (= ̄) It also imposes replacement
of the term for actual profit, Π with the hypothetical profit Π (“” for “classical”) cal-
culated on the basis of the firms’ aggregate production plan ( ).
Now, let the Walrasian auctioneer announce an arbitrary price vector ( 1) with
0 0 and 1 being the price of the numeraire, money. Then the values of excess
demands add up to
( − ) + ( − ) + ̂ −
= − + + ̂ − − (by rearranging)
= − + Π (by (13))
= − + Π ≡ 0 (from definition of Π in (14))
7
This exemplifies Walras’ law, saying that with Walrasian budget constraints the aggregate
value of excess demands is identically zero. Walras’ law reflects that when households
satisfy their Walrasian budget constraint, then as an arithmetic necessity the economy
as a whole has to satisfy an aggregate budget constraint for the period in question. It
follows that the equilibrium condition ̂ = is ensured as soon as there is clearing in
the output and labor markets. And more generally: if there are markets and − 1 of
these clear, so does the ’th market.
The intuitive mechanism behind this equilibrium is the following adjustment process.
Imagine that in an ultra-short sub-period − 6= 0 In case − 0 ( 0)
there will be excess supply (demand) in the labor market. This drives down (up). Only
when = and full employment obtains, can the system be at rest. Next imagine
that − 6= 0 In case − 0 ( 0) there is excess supply (demand) in the output
market. This drives down (up). Again, only when = and = (whereby
= ), so that the output market clears under full employment, will the system be at
rest.
This adjustment process is fictional, however, because outside equilibrium the Wal-
rasian supplies and demands, which supposedly drive the adjustment, are artificial con-
structs. Being functions only of initial resources and price signals, the Walrasian supplies
and demands are mutually inconsistent outside equilibrium and can therefore not tell
what quantities will be traded during an adjustment process. The story needs a consider-
able refinement unless one is willing to let the mythical “Walrasian auctioneer” enter the
scene and bring about adjustment toward the equilibrium prices without allowing trade
until these prices are found.
Anyway, assuming that Walrasian equilibrium has been attained, by comparative stat-
ics based on (11) and (12) we see that in the classical regime: (a) and are proportional
to ; and (b) output is at the unchanged full-employment level whatever the level of .
This is the neutrality of money result of classical macroeconomics.
4
To underline its one-period nature, it may be called a Walrasian short-run or a Walrasian temporary
equilibrium.
8
The neutrality result also holds in a quasi-dynamic context where we consider an actual
change in the money stock occurring in historical time. Suppose the government/central
bank at the beginning of the period brings about lump-sum transfers to the households
in the total amount ∆ 0 As there is no taxation, this implies a budget deficit which
is thus fully financed by money issue.5 So (134) is replaced by
+ ̂ = + ∆ + ̄ + Π (15)
Likewise, nominal prices are set in advance by firms facing downward-sloping demand
curves. It is understood that there is a large spectrum of differentiated products, and
and are composites of these. This heterogeneity ought of course be visible in the model
− and it will become so in Section 19.3. But at this point the model takes an easy way
out and ignores the involved aggregation issue.
Let in the current period be given at the level ̄ Because firms have market power,
the profit-maximizing price involves a mark-up on marginal cost, ̄ = ̄ (which
is also the average cost). We assume that the price setting occurs under circumstances
where the chosen mark-up becomes a constant 0, so that
̄
= (1 + ) ≡ ̄ (16)
5
Within the model this is in fact the only way to increase the money stock. As money is the only asset
in the economy, a change in the money stock can not be brought about through open market operations
where the central bank buys or sells another financial asset.
9
While ̄ is considered exogenous (not determined within the model), ̄ is endogenously
determined by the given ̄ and There are barriers to entry in the short run.
Because of the fixed wage and price, the distinction between ex ante (also called
planned or intended) demands and supplies and the ex post carried out purchases and
sales are now even more important than before. This is because the different markets may
now also ex post feature excess demand or excess supply (to be defined more precisely
below). According to the principle that no agent can be forced to trade more than desired,
the actual amount traded in a market must equal the minimum of demand and supply.
So in the output market and the labor market the actual quantities traded will be
respectively, where the superscripts “” and “” are now used for demand and supply in
a new meaning to be defined below. This principle, that the short side of the market
determines the traded quantity, is known as the short-side rule. The other side of the
market is said to be quantity rationed or just rationed if there is discrepancy between
and . In view of the produced good being non-storable, intended inventory investment
is ruled out. Hence, the firms try to avoid producing more than can be sold. In (17) we
have thus identified the traded quantity with the produced quantity,
But what exactly do we mean by “demand” and “supply” in this context where market
clearing is not guaranteed? We mean what is appropriately called the effective demand
and the effective supply (“effective” in the meaning of “operative” in the market, though
possibly frustrated in view of the short-side rule). To make these concepts clear, we need
first to define an agent’s effective budget constraint:
It is the last part, “and quantity signals from the markets”, which is not included in
the concept of a Walrasian budget constraint. The perceived quantity signals are in the
present context the actual employment constraint faced by the household and the profit
expected to be received from the firms and determined by their actual production.6 So
the household’s effective budget constraint is given by (4). In contrast, the Walrasian
6
We assume the perceived quantity signals are deterministic.
10
budget constraint is not conditional on quantity signals from the markets but only on the
“endowment” ( ̄) and the perceived price signals and profit.
DEFINITION 2 An agent’s effective demand in a given market is the amount the agent
bids for in the market, conditional on the perceived price and quantity signals that con-
strains its bidding. By “bids for” is meant that the agent is both able to buy that amount
and wishes to buy that amount, given the effective budget constraint. Summing over all
potential buyers, we get the aggregate effective demand in the market.
DEFINITION 3 An agent’s effective supply in a given market is the amount the agent
offers for sale in the market, conditional on perceived price and quantity signals that
constrains its offering. By “offers for sale” is meant that the agent is both able to bring
that amount to the market and wishes to sell that amount, given the set of opportunities
available. Summing over all potential sellers, we get the aggregate effective supply in the
market.
= ̄
= ≡ ̄ (20)
Indeed, in the aggregate the firms are not able to bring more to the market than full-
employment output , And every individual firm is not able to bring to the market
than what can be produced by “its share” of the labor force. On the other hand, because
of the constant marginal costs, every unit sold at the preset price adds to profit. The
firms are therefore happy to satisfy any output demand forthcoming − which is in practice
testified by a lot of sales promotion.
Firms’ aggregate effective demand for labor is constrained by the perceived output
demand, because the firm would loose by employing more labor. Thus,
= (21)
11
By the short-side rule (17), combined with (20), follows that actual aggregate output
(equal to the quantity traded) is
= min( ) 5
So the following three mutually exclusive cases exhaust the possibilities regarding aggre-
gate output:
Since in this regime, we may say there is “excess supply” in the output market or,
with a perhaps better term, there is a “buyers’ market” situation (sale less than desired).
The reservation regarding the term “excess supply” is due to the fact that we should
not forget that − 0 is a completely voluntary state of affairs on the part of the
price-setting firms.
From (1) and the short-side rule now follows that actual employment will be
= = = ̄ = (24)
̄
Also the labor market is thus characterized by “excess supply” or a “buyers’ market”
situation. Profits are Π = ̄ − ̄ = (1 − ̄ (̄ ))̄ = (1 − (1 + )−1 ) −1 0
where we have used, first, = , then the price setting rule (16), and finally (22).
This solution for ( ) is known as a Keynesian equilibrium for the current period.
It is named an equilibrium because the system is “at rest” in the following sense: (a)
12
agents do the best they can given the constraints (which include the preset prices and
the quantities offered by the other side of the market); and (b) the chosen actions are
mutually compatible (purchases and sales match). The term equilibrium is here not used
in the Walrasian sense of market clearing through instantaneous price adjustment but in
the sense of a Nash equilibrium conditional on perceived price and quantity signals. To
underline its temporary character, the equilibrium may be called a Keynesian short-run
(or temporary) equilibrium. The flavor of the equilibrium is Keynesian in the sense that
there is unemployment and at the same time it is aggregate demand in the output market,
not the real wage, which is the binding constraint on the employment level. A higher
propensity to consume (lower discount factor ) results in higher aggregate demand,
and thereby a higher equilibrium output, . In contrast, a lower real wage due to either
a higher mark-up, or a lower marginal (= average) labor productivity, does not
result in a higher . On the contrary, becomes lower, and the causal chain behind
this goes via a higher ̄ cf. (16) and (22). In fact, the given real wage, ̄ ̄ = (1+)
is consistent with unemployment as well as full employment, see below. It is the sticky
nominal price at an excessive level, caused by a sticky nominal wage at an “excessive”
level, that makes unemployment prevail through a too low aggregate demand, A lower
nominal wage would imply a lower ̄ and thereby, for a given stimulate and thus
raise
In brief, the Keynesian regime leads to an equilibrium where output as well as em-
ployment are demand-determined.
The “Keynesian cross” and effective demand The situation is illustrated by the
“Keynesian cross” in the ( ) plane shown in Fig. 19.1, where = = (1 +
)−1 ( + ̄ ) We see the vicious circle: Output is below the full-employment level
because of low consumption demand; and consumption demand is low because of the low
employment. The economy is in a unemployment trap. Even though at we have Π 0
and there are constant returns to scale, the individual firm has no incentive to increase
production because the firm already produces as much as it rightly perceives it can sell
at its preferred price. We also see that here money is not neutral. For a given = ̄
and thereby a given = ̄ a higher results in higher output and higher employment.
13
d
Y
M
Y
Yd P
1
M
P
1
45
O Y
Yk Yf
Figure 1: The Keynesian regime (̄ (1 + ); and given, ̄ fixed).
The fundamental difference between the Walrasian and the present framework is that
the latter allows trade outside Walrasian equilibrium. In that situation the households’
consumption demand depends not on how much labor the households would prefer to sell
at the going wage, but on how much they are able to sell, that is, on a quantity signal
received from the labor market. Indeed, it is the actual employment, that enters the
operative budget constraint, (4), not the desired employment as in classical or Walrasian
theory.
This regime represents the “opposite” case of the Keynesian regime and arises if and only
if the opposite of (23) holds, namely
̄ (1 + )
14
Yd M
Y
M P
Y
d
Y f
P 1
1
Cf
M
P
1
45
O f Y
Y
Figure 2: The repressed inflation regime (̄ (1 + ); and given, ̄ fixed).
The new element here in that firms perceive a demand level in excess of As the
real-wage level does not deter profitable production, firms would thus prefer to employ
people up to the point where output demand is satisfied. But in view of the short side
rule for the labor market, actual employment will be
= = ̄ =
So there is excess demand in both the output market and the labor market. Presum-
ably, these excess demands generate pressure for wage and price increases. By assumption,
these potential wage and price increases do not materialize until possibly the next period.
So we have a repressed-inflation equilibrium ( ) = ( ̄) although possibly short-
lived.
Fig. 19.2 illustrates the repressed-inflation regime. In the language of the microeco-
nomic theory of quantity rationing, consumers are quantity rationed in the goods market,
as realized consumption = = = consumption demand. Firms are quantity
rationed in the labor market, as . This is the background for the parlance that in
the repressed inflation regime, output and employment are not demand-determined but
supply-determined. Both the output market and the labor market are sellers’ markets
(purchase less than desired). Presumably, the repressed inflation regime will not last long
unless there are wage and price controls imposed by the government, as for instance may
be the case for an economy in a war situation.7
7
As another example of repressed inflation (simultaneous excess demand for consumption goods and
15
2.3.3 The border case between the two regimes: = =
This case arises if and only if ̄ = (1 + ) which is in turn equivalent to ̄ =
(1 + )̄ = ≡ ≡ ( ). No market has quantity rationing and we may
speak of both the output market and the labor market as balanced markets.
There are two differences compared with the classical equilibrium, however. The first
is that due to market power, there is a wedge between the real wage and the marginal
productivity of labor. In the present context, though, where labor supply is inelastic,
this does not imply inefficiency but only a higher profit/wage-income ratio than under
perfect competition (where the profit/wage-income ratio is zero). The second difference
compared with the classical equilibrium is that due to price stickiness, the impact of
shifts in exogenous variables will be different. For instance a lower will here result
in unemployment, while in the classical model it will just lower and and not affect
employment.
2.3.4 In terms of effective demands and supplies Walras’ law does not hold
As we saw above, with Walrasian budget constraints, the aggregate value of excess de-
mands in the given period is zero for any given price vector, ( 1) with 0 and
0 In contrast, with effective budget constraints, effective demands and supplies,
and the short-side rule, this is no longer so. To see this, consider a pair ( ) where
and 6= ≡ ( ) Such a pair leads to either the Keynesian regime or
the repressed-inflation regime. The pair may, but need not, equal one of the pairs (̄ ̄ )
considered above in Fig. 19.1 or 19.2 (we say “need not”, because the particular -markup
relationship between and is not needed). We have, first, that in both the Keynesian
and the repressed-inflation regime, effective output supply equals full-employment output,
= (26)
The intuition is that in view of , the representative firm wishes to satisfy any
output demand forthcoming but it is only able to do so up to the point of where the
availability of workers becomes a binding constraint.
Second, the aggregate value of excess effective demands is, for the considered price
labor) we may refer to Eastern Europe before the dissolution of the Soviet Union in 1991. In response to
severe and long-lasting rationing in the consumption goods markets, households tended to decrease their
labor supply (Kornai, 1979). This example illustrates that if labor supply is elastic, the effective labor
supply may be less than the Walrasian labor supply due to spillovers from the output market.
16
vector ( 1) equal to
( − ) + ( − ) + ̂ −
= ( − ̄) + + ̂ − −
= ( − ̄) + + Π − (by (4))
= ( − ̄) + − (by (2))
½
0 if ( ) and
= ( − ̄) + ( − ) (27)
0 if ( ) and
The aggregate value of excess effective demands is thus not identically zero. As expected, it
is negative in a Keynesian equilibrium and positive in a repressed-inflation equilibrium.8
The reason that Walras’ law does not apply to effective demands and supplies is that
outside Walrasian equilibrium some of these demands and supplies are not realized in the
final transactions.
This takes us to Keynes’ refutation of Say’s law and thereby what Keynes and others
have regarded as the core of his theory.
The classical principle “supply creates its own demand” (or “income is automatically
spent on products”) is named Say’s law after the French economist and business man
Jean-Baptiste Say (1767-1832). In line with other classical economists like David Ricardo
and John Stuart Mill, Say maintained that although mismatch between demand and
production can occur, it can only occur in the form of excess production in some industries
at the same time as there is excess demand in other industries.9 General overproduction
is impossible. Or, by a classical catchphrase:
Every offer to sell a good implies a demand for some other good.
By “good” is here meant a produced good rather than just any traded article, including
for instance money. Otherwise Say’s law would be a platitude (a simple implication of
the definition of trade). So, interpreting “good” to mean a produced good, let us evaluate
8
At the same time, (27) together with the general equations = ̄ and = shows that we have
̂ = in a Keynesian equilibrium (where = ) and ̂ in a repressed-inflation equilibrium
(where = ).
9
There were two dissidents at this point, Thomas Malthus and Karl Marx, two classical economists
that were otherwise not much aggreeing.
17
Say’s law from the point of view of the result (27). We first subtract ( − )
= ( − ̄) on both sides of (27), then insert (26) and rearrange to get
( − ) + ̂ − = 0 (28)
for any 0 Consider the case In this situation every unit produced and
sold is profitable. So any in the interval 0 ≤ is profitable from the supply side
angle. Assume further that = ̄ ≡ ( ) This is the case shown in Fig. 19.1.
The figure illustrates that aggregate demand is rising with aggregate production. So far
so well for Say’s law. We also see that if aggregate production is in the interval 0
then (= ) This amounts to excess demand for goods and in effect, by
(28), excess supply of money. Still, Say’s law is not contradicted. But if instead aggregate
production is in the interval ≤ then (= ) ; now there is general
overproduction. Supply no longer creates its own demand. There is a general shortfall of
demand. By (28), the other side of the coin is that when then ̂ which
means excess demand for money. People try to hoard money rather than spend on goods.
Both the Great Depression in the 1930s and the Great Recession 2008- can be seen in this
light.10
The refutation of Say’s law does not depend on the market power and constant markup
aspects we have adhered to above. All that is needed for the argument is that the agents
are price takers within the period. In addition, the refutation does not hinge on money
being the asset available for transferring purchasing power from one period to the next.
We may imagine an economy where represents land available in limited supply. As
land is also a non-produced store of value, the above analysis goes through − with one
exception, though. The exception is that ∆ in (15) can no longer be interpreted as a
policy choice. Instead, a positive ∆ could be due to discovery of new land.
We conclude that general overproduction is possible and Say’s law thereby refuted.
It might be objected that our “aggregate reply” to Say’s law is not to the point since
Say had a disaggregate structure with many industries in mind. Considering an explicit
disaggregate production sector makes no essential difference, however, as a simple example
will now show.
10
Paul Krugman stated it this way:
“When everyone is trying to accumulate cash at the same time, which is what happened
worldwide after the collapse of Lehman Brothers, the result is an end to demand [for output],
which produces a severe recession” (Krugman, 2009).
18
Many industries Suppose there is still one labor market, but industries with pro-
duction function = where and are output and employment in industry
respectively, = 1 2 Let the preferences of the representative household be given
by
X ̂
= ln + ln 0 = 1 2 0 1
In analogy with (4), the budget constraint is
X X X X
+ ̂ = ≡ + + Π = +
Π = −
19
where ̄ = max [1 ] . This is a situation where people try to save (hoard
money) rather than spend all income on produced goods. It is an example of general
overproduction, thus falsifying Say’s law.
In the special case where all = the situation for each single industry can be
illustrated by a diagram as that in Fig. ??. Just replace and in Fig.
?? by ≡ ( ) and respectively.
Could the evaluation of Say’s law be more favorable if we allow for the existence of
interest-bearing assets? The answer is no, as we shall see in Chapter ??.
We now return to the aggregate setup. Apart from the border case of balanced markets,
we have considered two kinds of “fix-price equilibria”, repressed inflation and Keynesian
equilibrium. Most macroeconomists consider nominal wages and prices to be less sticky
upwards than downwards. So a repressed inflation regime is typically regarded as having
little durability (unless there are wage and price controls imposed by a government). It is
otherwise with the Keynesian equilibrium. A way of thinking about this is the following.
There are basically two kinds of reaction to this situation. One is that wages and
prices are maintained throughout all the sub-periods, while production is scaled down
to the Keynesian equilibrium = ( 0 ̄ ). Another is that wages and prices adjust
downward so as to soon reestablish full-employment equilibrium. Let us take each case
at a time.
20
Wage and price stay fixed: Sheer quantity adjustment For simplicity we have
assumed that the produced goods are perishable. So unsold goods represent a complete
loss. If firms fully understand the functioning of the economy and have model-consistent
expectations, they will adjust production per time unit down to the level as fast as
possible. Suppose instead that firms have naive adaptive expectations of the form
−1 = −1 = 0 1 2
This means that the “subjective” expectation, formed in sub-period − 1 of demand next
sub-period is that it will equal the demand in sub-period − 1 Let the time-lag between
the decision to produce and the observation of the demand correspond to the length of
the subperiods. It is profitable to satisfy demand, hence actual output in sub-period
will be
̄ −1
= −1 = −1 = 0 +
1+ 1 + 0
in analogy with (19). This is a linear first-order difference equation in , with constant
coefficients. The solution is (see Math Tools)
µ ¶
∗01
= (0 − ) + ∗0 ∗0 = = (31)
1 + 0 0
̄
Suppose 0 = 09 say. Then actual production, converges fast towards the steady-state
value . When = the system is at rest. Fig. 19.x illustrates. Although there is
excess supply in the labor market and therefore some downward pressure on wages, the
Keynesian presumption is that the workers’s side in the labor market generally withstand
the pressure.11
The process (31) also applies “in the opposite direction”. Suppose, starting from the
Keynesian equilibrium = ( 0 ̄ ) a reduction in the patience parameter 0 occurs,
such that ( 0 ̄ ) increases, but still satisfies ( 0 ̄ ) Then the initial condition
in (31) is 0 ∗0 and the greater propensity to consume leads to an upward quantity
adjustment.
11
Possible explanations of downward wage stickiness are discussed in Chapter ??.
21
Downward wage and price adjustment Several of Keynes’ contemporaries, among
them A. C. Pigou, maintained that the Keynesian state of affairs with =
could only be very temporary. Pigou’s argument was that a fall in the price level would
take place and lead to higher purchasing power of The implied stimulation of ag-
gregate demand would bring the economy back to full employment. This hypothetically
equilibrating mechanism is known as the “real balance effect” or the “Pigou effect” (after
Pigou, 1943).
Does the argument go through? To answer this, we imagine that the time interval
between different rounds of wage and price setting is as short as our sub-periods. We
imagine the time interval between households’ decision making to be equally short. Given
the fixed markup , an initial fall in the preset ̄ is needed to trigger a fall in the preset
̄ The new classical equilibrium price and wage levels will be
0 = and 0 = 0
0
Both will thus be lower than the original ones − by the same factor as the patience
parameter has risen, i.e., the factor 0 In line with “classical” thinking, assume that
soon after the rise in the propensity to save, the incipient unemployment prompts wage
setters to reduce ̄ and thereby price setters to reduce ̄ Let both ̄ and ̄ after a few
rounds be reduced by the factor 0 Denoting the resulting wage and price ̄ 0 and ̄ 0
respectively, we then have
0 ̄ 0 0
̄ 0 = ̄ 0 = (1 + ) = ≡ 0 ≡ 0
1+
Seemingly, this restores aggregate demand at the full-employment level = ( 0 ̄ 0 )
= .
2. A downward wage-price spiral, i.e., deflation, increases the implicit real interest
rate, ( − +1 )+1 thus tending to dampen aggregate demand rather than the
opposite.
22
(a) the monetary base is in reality only a small fraction of financial wealth, and so
the real balance effect can not be powerful unless the fall in the price level is
drastic;
(b) many firms and households have nominal debt, the real value of which would
rise dramatically, thereby leading to bankruptcies and a worsening of the con-
fidence crisis, thus counteracting a return to full employment.
One should be aware that there are two distinct kinds of “price flexibility”. It can
be “imperfect” or “perfect” (also called “full”). The first kind relates to a gradual price
process, for instance generated by a wage-price spiral as at item 2 above. The latter kind
relates to instantaneous and complete price adjustment as with a Walrasian auctioneer,
cf. Section 2. It is the first kind that may be destabilizing rather than the opposite.
(a) The wage and price setting should be explicitly modelled and in this connection
there should be an explanation of the wage and price stickiness.
(b) It should be made clear how to come from the existence of many differentiated
goods and markets with imperfect competition to aggregate output and income which in
turn constitute the environment conditioning individual agents’ actions.
(c) To incorporate better the role of asset markets, including the primary role of money
as a medium of exchange rather than a store of value, at least one alternative asset should
enter, an interest-bearing asset.
(d) The model should be truly dynamic with forward-looking endogenous expectations
and gradual wage and price changes depending on the market conditions, in particular
the employment situation.
The classical theory of perfectly flexible wages and prices and neutrality of money seems
contradicted by overwhelming empirical evidence. At the theoretical level the theory
23
ignores that the dominant market form is not perfect competition. Wages and prices
are usually set by agents with market power. And there may be costs associated with
changing prices and wages. Here we consider such costs.
The literature has modelled price adjustment costs in two different ways. Menu costs
refer to the case where there are fixed costs of changing price. Another case considered
in the literature is the case of strictly convex adjustment costs, where the marginal price
adjustment cost is increasing in the size of the price change.
The most obvious examples of menu costs are of course costs associated with
But the term menu costs should be interpreted in a broader sense, including pecuniary
as well non-pecuniary costs of:
3. information-gathering,
8. renegotiating contracts.
Menu costs induce firms to change prices less often than if no such costs were present.
And some of the points mentioned in the list above, in particular point 7 and 8, may be
relevant also in the different labor markets.
The menu cost theory is one of the microfoundations provided by modern Keynesian
economics for the presumption that nominal prices and wages are sticky in the short run.
The main theoretical insight of the menu cost theory is the following. There are menu
costs associated with changing prices. Even small menu costs can be enough to prevent
firms from changing their price. This is because the opportunity cost of not changing
price is only of second order, i.e., “small”; this is a reflection of the envelope theorem (see
24
Appendix). But owing to imperfect competition (price MC), the effect on aggregate
output, employment, and welfare of not changing prices is of first order, i.e., “large”.
The menu cost theory provides the more popular explanation of nominal price rigidity.
Another explanation rests on the presumption of strictly convex price adjustment costs.
In this theory the price change cost for firm is assumed to be = ( − −1 )2
0 Under this assumption the firm is induced to avoid large price changes, which
means that it tends to make frequent, but small price adjustments. This theory is related
to the customer market theory. Customers search less frequently than they purchase. A
large upward price change may be provocative to customers and lead them to do search in
the market, thereby perhaps becoming aware of attractive offers from other stores. The
implied “kinked” demand curve can explain that firms are reluctant to suddenly increase
their price.
To incorporate the key role of financial markets for the performance of the macroeconomy,
at least one extra asset should enter in a short-run model, an interest-bearing asset. This
gives rise to the IS-LM model that should be familiar from Blanchard, Macroeconomics.
An extended IS-LM model is presented in the recent editions of the mentioned text
by Blanchard (alone) and in Blanchard et al., Macroeconomics: A European Perspective,
2010, Chapter 20. The advantage of the extended version is that the commercial banking
sector is introduced more explicitly so that the model incorporates both a centralized
bond market and decentralized markets for bank loans.
25
simply by assuming flexible prices and perfect competition. In a realistic model with
imperfect competition and price stickiness in both output and labor markets the natural
rate of unemployment is likely to be higher than in an economy with perfect competition.
And hump-shaped deviations from trend GDP, that is, business cycles, are likely to arise
when the economy is hit by large shocks, for instance a financial crisis.
The third macro course, Macroeconomics 3, deals with short and medium run theory
and emphasizes issues related to monetary policy.
6 Appendix
0 () = ( ())
where denotes the partial derivative of (·) w.r.t. the first argument.
Proof 0 () =
( ()) +
( ()) 0 () =
( ()), since
( ()) = 0 by
definition of (). ¤
That is, when calculating the total derivative of a function w.r.t. a parameter and
evaluating this derivative at an interior maximum w.r.t. a control variable, the envelope
theorem allows us to ignore the terms that arise from the chain rule. This is also the case
if we calculate the total derivative at an interior minimum.12
12
For extensions and more rigorous framing of the envelope theorem, see for example Sydsaeter et al.
(2006).
26