Difference Equations For Economists PDF
Difference Equations For Economists PDF
Klaus Neusser
1
Klaus
c Neusser
i
Preface
There are, of course, excellent books on deterministic difference equations:
for example, Elaydi (2005), Agarwal (2000), or Galor (2007). Colonius and
Kliemann (2014) give a presentation from the perspective of dynamical sys-
tems. These books do, however, not go into the specific problems faced in
economics.
The books makes use of linear algebra. Very good introduction to this
topic is presented by the books of Strang (2003) and Meyer (2000).
Bern,
September 2010 Klaus Neusser
ii
Contents
1 Introduction 1
1.1 Notation and Preliminaries . . . . . . . . . . . . . . . . . . . . 1
iii
iv CONTENTS
v
vi LIST OF FIGURES
vii
viii LIST OF DEFINITIONS
List of Theorems
ix
Chapter 1
Introduction
1
2 CHAPTER 1. INTRODUCTION
The aim of the analysis is to assess the existence and uniqueness of a solution
to a given difference equation; and, in the case of many solutions, to charac-
terize the set of all solutions. In the normal linear case of dimension n = 1,
for example, the set of solutions turns out to be a vector space of dimension
p.
One way to pin down a particular solution is to require that the solution
must satisfy some boundary conditions. In this case, we speak of a boundary
value problem. The simplest way to specify boundary conditions is to require
that the solution X(t) must be equal to p prescribed values x1 , . . . , xp , called
initial conditions, at given time indices t1 , . . . , tp :
Difference equations, like (1.2), transform one sequence (X) = (Xt ) into
another one. The difference equation therefore defines a function or better
an operator on the set of all sequences, denoted by RZ , into itself. Defining
addition and scalar multiplication in the obvious way, RZ forms a vector or
linear space over the real numbers. For many economic applications, it makes
sense to concentrate on the set of bounded sequences, i.e. on sequences (Xt )
for which there exists a real number M such that kXt k ≤ M for all t ∈ Z.
The set of bounded sequences is usually denoted by `∞ . It can be endowed
with a norm k.k∞ in the following way:
k(X)k∞ = sup{kXt k, t ∈ Z}
Φ(L)Xt = (I − Φ1 L − Φ2 L2 − . . . − Φp Lp )Xt
= Xt − Φ1 Xt−1 − Φ2 Xt−2 − . . . − Φp Xt−p
Xt = φXt−1 + Zt , φ 6= 0. (2.2)
5
6 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
equation (2.3)
X1 = φx0
X2 = φX1 = φ2 x0
...
Xt = φXt−1 = φt x0 .
This suggests to take
Xt = φt c (2.4)
as the general solution of the first order linear homogenous difference equa-
tion (2.3). Actually, equation (2.4) provides a whole family of solutions
indexed by the parameter c ∈ R. To each value of c, there corresponds a
trajectory (Xt ) = (φt c). In order to highlight this dependency, we may write
the solutions as Xt (c). Note that the trajectories of two different solutions
Xt (c1 ) and Xt (c2 ), c1 6= c2 , cannot cross.
The parameter c can be pinned down by using a single boundary condition.
A simple form of such a boundary condition requires, for example, that Xt
takes a particular value xt0 in some period t0 . Thus, we require that Xt0 = xt0
in period t0 . In this case we speak of an initial value problem. The value of
c can then be retrieved by solving the equation xt0 = φt0 c for c. This leads
x
to the solution: c = φtt00 . In many instances we are given the value at t0 = 0
so that c = x0 .
Suppose that we are given two solutions of the homogenous equation,
(1) (2)
(Xt ) and (Xt ). Then it is easy to verify that any linear combination of
(1) (2)
the two solutions, a1 (Xt ) + a2 (Xt ), a1 , a2 ∈ R, is also a solution. This
implies that the set of all solutions to the homogenous equation forms a
linear space. In order to find out the dimension of this linear space and its
algebraic structure, it is necessary to introduce the following three important
definitions.
Definition 2.1 (Linear Dependence, Linear Independence). The r sequences
(X (1) ), (X (2) ), . . . , (X (r) ) with r ≥ 2 are said to be linearly dependent for
t ≥ t0 if there exist constants a1 , a2 , . . . , ar ∈ R, not all zero, such that
(1) (2) (r)
a1 Xt + a2 Xt + · · · + ar Xt =0 ∀t ≥ t0 .
This definition is equivalent to saying that there exists a nontrivial linear
combination of the solutions which is zero. If the solutions are not linearly
dependent, they are said to be linearly independent.
Definition 2.2 (Fundamental Set of Solutions). A set of r linearly inde-
pendent solutions of the homogenous equation is called a fundamental set of
solutions.
2.1. FIRST ORDER DIFFERENCE EQUATION 7
or equivalently
!
(1) (2)
Xt Xt a1
(1) (2) 6= 0
φXt φXt a2
Since this must hold for any a1 , a2 , not both equal to zero, the determinant
of the Casarotian matrix (see Definition 2.3)
!
(1) (2)
Xt Xt
C(t) = (1) (2)
φXt φXt
(1) (2) (1) (2)
must be nonzero. However, det C(t) = φXt Xt − φXt Xt = 0. This
is a contradiction to the initial assumption. Thus, there can only be one
independent solution.
8 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
3. The Superposition Principle (see Theorem 2.2) then delivers the gen-
eral solution of the nonhomogeneous equation as the sum of (X (g) ) and
(X (p) ). However, this solution still depends through (X (g) ) on some
1
The superposition principle means that the net response of Xt caused by two or
more stimuli is the sum of the responses which would have been caused by each stimulus
individually. In the first order case one stimulus comes from the general solution to the
homogeneous equation, the other from the particular solution to the nonhomogeneous
equation.
2.1. FIRST ORDER DIFFERENCE EQUATION 9
Before continuing with the theoretical analysis consider the following ba-
sic example.
Amortization of a Loan
One of the simplest settings in economics where a difference equation arises
naturally, is compound interest calculation. Take, for example, the evolution
of debt. Denote by Dt the debt outstanding at the beginning of period t,
then the debt in the subsequent period t + 1, Dt+1 , is obtained by the simple
accounting rule:
where rDt is the interest accruing at the end of period t. Here we are using for
simplicity a constant interest rate r. The debt contract is serviced by paying
the amount Zt at the end of period t. This payment typically includes a
payment for the interest and a repayment of the principal. Equation (2.6)
constitutes a linear nonhomogeneous first order difference equation with φ =
1 + r.
Given the initial debt at the beginning of period 0, D0 , the amount of
debt outstanding in subsequent periods can be computed recursively using
the accounting rule (2.6):
D1 = (1 + r)D0 − Z0
D2 = (1 + r)D1 − Z1 = (1 + r)2 D0 − (1 + r)Z0 − Z1
...
Dt+1 = (1 + r)t+1 D0 − Zt − (1 + r)Zt−1 − · · · − (1 + r)t Z0
Xt
t+1
= (1 + r) D0 − (1 + r)i Zt−i
i=0
t+1
Note
Pt how Dt+1i is determined as the sum of two parts: (1 + r) D0 and
− i=0 (1 + r) Zt−i . The first expression thereby corresponds to the general
solution of the homogeneous equation and the second one to a particular
10 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
Note that the payment Z required to pay back the debt diminishes with
T. If T approaches infinity Z equals rD0 . In this case the payment is just
equal to interest accruing in each period so that there is no repayment of the
principal. In this case the debt is never paid back and equals the initial debt
D0 in each period. If the payment Z exceeds rD0 , the debt is repaid in a
finite amount of time.
Suppose that instead of requiring that the debt must be zero at some
point in time (including infinity), we impose the condition that the present
discounted value of the debt must be non-positive as T goes to infinity:
DT +1
lim ≤ 0. (2.7)
T →∞ (1 + r)T +1
economics). Given the difference equation for the evolution of debt, the NPG
condition with constant payment per period is equivalent to:
DT +1 Z Z
lim T +1
= lim D0 − 1 − (1 + r)−T −1 = D0 − ≤ 0
T →∞ (1 + r) T →∞ r r
which implies that Z ≥ rD0 . Thus, the NPG condition holds if the constant
repayments Z are at least as great as the interest.
This provides an example with two steady states. The steady states are
determined by the equation: X ∗ = µX ∗ (1 − X ∗ ). This equation has two
12 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
0.9
0.8 steady
state 450−line
0.7
0.6
Xt+1
0.5
0.4
f(x)
0.3
0.2
0.1 steady
state
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
X
t
Figure 2.1: Cobweb diagram with steady states of the logistic function: y =
2.5x(1 − x) and x0 = 0.1
Xt − X ∗ = φ(Xt−1 − X ∗ ).
5
In economics, equations are often log-linearized leading to log-deviations from steady
states. In which case, Xt − X ∗ can be interpreted as percentage deviations from steady
states.
2.2. STEADY STATE AND STABILITY 13
The mean value theorem then implies that there exists ξ, X0 < ξ < X ∗ , such
that
|f (X0 ) − f (X ∗ )| = |f 0 (ξ)| |X0 − X ∗ | .
Hence we have
|X1 − X ∗ | ≤ M |X0 − X ∗ | .
|Xt − X ∗ | ≤ M t |X0 − X ∗ | .
For any ε > 0, let δε = min{γ, ε} then |X0 − X ∗ | < δε implies |Xt − X ∗ | < ε
for all t ≥ 0. X ∗ is therefore a stable equilibrium point. In addition, X ∗
is attractive because limt→∞ |Xt − X ∗ | = 0. Thus, X ∗ is asymptotically
stable.
Xt = φXt−1 + Zt
Xt = φ(φXt−2 + Zt−1 ) + Zt = φ2 Xt−2 + φZt−1 + Zt
...
Xt = φt X0 + φt−1 Z1 + φt−2 Z2 + · · · + φZt−1 + Zt
t−1
X
= φt X0 + φj Zt−j
j=0
Taking the absolute value of the difference between Xt and the second term
of the right hand side of the equation leads to:
t−1
X
Xt − φj Zt−j = φt X0 = |φt | |X0 |
j=0
which is just the steady state solution described in equation (2.10) of section
2.2.
The requirement that Zt remains bounded can, for example, be violated
if Zt itself satisfies the homogenous difference equation Zt = ψZt−1 which
implies that Zt = ψ t c for some c 6= 0. Inserting this into equation (2.13)
then leads to ∞ ∞ j
(b)
X
j t−j t
X φ
Xt = φ ψ c=ψ c.
j=0 j=0
ψ
The infinite sum converges only if |φ / ψ| < 1. This shows that besides
the stability condition |φ| < 1, some additional requirements with respect
(b)
to the sequence of the exogenous variable are necessary to render Xt in
equation (2.13) a meaningful particular solution. Usually, we assume that
(Zt ) is bounded, i.e. that (Zt ) ∈ `∞ .
Consider next the case |φ| > 1. In this situation the above iteration is
(b)
no longer successful because Xt in equation (2.13) is not well–defined even
when Zt is constant.7 A way out of this problem is to consider the iteration
forward in time instead of backward in time:
...
Xh
−h −1
= φ Xt+h − φ φ−j+1 Zt+j for h ≥ 1.
j=1
Taking the absolute value of the difference between Xt and the second term
on the right hand side of the equation leads to:
X h
Xt + φ−1 φ−j+1 Zt+j = φ−h Xt+h = |φ−h | |Xt+h |.
j=1
As the economy is expected to live forever, there is no end period and the
forward iteration can be carried out indefinitely into the future. Because
|φ| > 1, the right hand side of the equation converges to zero as h → ∞,
provided that Xt+h remains bounded. This suggests the following particular
solution: ∞
(f )
X
−1
Xt = −φ φ−j+1 Zt+j , |φ| > 1, (2.14)
j=1
7
Note however that, if Zt satisfies itself a homogeneous difference equation of the form
Zt = ψZt−1 , the criterion for convergence is, as in the example above, |φ / ψ| < 1.
18 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
where the superscript (f ) indicates that the solution was obtained by iter-
ating the difference equation forward in time. For this to be a meaningful
choice, the infinite sum must be well-defined. This will be guaranteed if, for
example, Zt remains bounded, i.e. if (Z) ∈ `∞ .
In the case |φ| = 1 neither the backward nor the forward iteration strategy
leads to a sensible solution even when Zt is constant and equal to Z 6=
0. Either an equilibrium point does not exist as in the case φ = 1 or the
equilibrium point exists as is the case for φ = −1, but Xt oscillates forever
between X0 and −X0 + Z so that the equilibrium point is unstable. Most of
the time, we restrict ourself to the case of hyperbolic situations and exclude
the case |φ| = 1.
To summarize, assuming that (Zt ) is bounded, the first order linear dif-
ference equation (2.2) led us to consider the following two representations of
the general solutions:
∞
X
t (b) (b)
X t = φ cb + Xt , whereby Xt = φj Zt−j
j=0
∞
X
(f ) (f ) −1
t
X t = φ cf + Xt , whereby Xt = −φ φ−j+1 Zt+j
j=1
(b)
Note that these equations imply that cb = X0 − X0 , respectively that cf =
(f )
X0 − X0 . Depending on the value of φ, we can distinguish the following
three cases:
|φ| < 1: the backward solution is asymptotically stable in the sense that Xt
(b) (b)
approaches Xt as t → ∞. Any deviation of Xt from Xt vanishes
over time, irrespective of the value chosen for cb . The forward solution,
(f )
usually, makes no sense because Xt does not remain bounded even if
the forcing variable Zt is constant over time.
|φ| > 1: both solutions have an explosive behavior due to the term φt . Even
(b) (f )
small deviations from either Xt or Xt will grow without bounds.
There is, however, one and only one solution which remains bounded.
It is given by cf = 0 which implies that Xt always equals its equilibrium
(f )
value Xt .
|φ| = 1: neither the backward nor the forward solution converge for constant
Zt 6= 0.
Which solution is appropriate depends on the nature of the economic prob-
lem at hand. In particular, the choice of the boundary condition requires
2.4. EXAMPLES OF FIRST ORDER EQUATIONS 19
where φ = − βγ and Zt = − uβt . Due to the negative value of φ, the price oscil-
lates: high prices tend to be followed by low prices which are again followed
by high prices. These price oscillations translate into corresponding quantity
8
The logarithm of the price level is taken to ensure a positive price level.
20 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
1.4
45°−line 1
1.2
p1
0.9
45°−line
1 0.8
P1
0.7
0.8 steady
0.6
Pt+1
steady state
Pt+1
state 0.5
0.6
−(γ/β)p + 1 0.4
t
0.4
0.3
P2
0.2 −(γ/β)pt + 1
0.2
p2 0.1
0 0
0 0.2 0.4 0.6 0.8 1 1.2 1.4 0 0.2 0.4 0.6 0.8 1
P0 P1 p0
Pt Pt p1
1
45°−line
0.9
0.8
p1 0.7
0.6 steady
state
Pt+1
0.5
0.4 −(γ/β)p + 1
t
p2 0.3
0.2
0.1
0
0 0.2 0.4 0.6 p 0.8 1
p0
Pt 1
the difference equation around the steady state. We will exemplify this tech-
nique by studying the famous Solow (see Solow (1956)) growth model. A
simple version of this model describes a closed economy with no technical
progress. Output in period t, denoted by Yt , is produced with two essen-
tial production factors: capital, Kt , and labor, Lt . Production possibilities
of this economy in period t are described by a neoclassical production func-
tion Yt = F (Kt , Lt ). This production function is defined on the nonnegative
orthant of R2 and is characterized by the following properties:
• F is twice continuously differentiable;
∂F (K,L) ∂F (K,L)
• strictly positive marginal products, i.e. ∂K
> 0 and ∂L
> 0;
∂ 2 F (K,L) ∂ 2 F (K,L)
• diminishing marginal products, i.e. ∂K 2
< 0 and ∂L2
< 0;
• F has constant returns-to-scale, i.e. F (λK, λL) = λF (K, L) for all
λ > 0;
• F satisfies the Inada conditions:
∂F (K, L) ∂F (K, L)
lim = 0, lim =0
K→∞ ∂K L→∞ ∂L
∂F (K, L) ∂F (K, L)
lim = ∞, lim =∞
K→0 ∂K L→0 ∂L
The Inada conditions are usually not listed among the properties of a neoclas-
sical production function, however, they turn out to be necessary to guar-
antee a strictly positive steady state. The classic example for a produc-
tion function with these properties is the Cobb-Douglas production function:
F (K, L) = AK (1−α) Lα , A > 0, 0 < α < 1. The above properties have two
important implications summarized by the following lemmata.
Lemma 2.1 (Essential Inputs). Let F be a neoclassical production func-
tion as described above then both inputs are essential, i.e. F (K, 0) = 0 and
F (0, L) = 0.
Proof. Suppose that Y → ∞ as K → ∞ then L’Hôpital’s rule together with
the Inada conditions imply
Y ∂Y /∂K ∂Y
lim = lim = lim = 0.
K→∞ K K→∞ 1 K→∞ ∂K
If on the other hand, Y remains finite when K → ∞, we immediately also
get
Y
lim = 0.
K→∞ K
22 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
The constant returns to scale assumption then implies that, for L > 0 fixed,
Y
lim = lim F (1, L/K) = F (1, 0) = 0.
K→∞ K K→∞
where we have used the result that capital is essential (see the previous
Lemma), i.e. that f (0) = 0. The last equality is a consequence of the Inada
conditions. The proof for limK→∞ F (K, L) = ∞, L > 0 fixed, is analogous.
Starting in period 0 with some positive capital K0 > 0, the system con-
sisting of the two difference equations (2.19) and (2.20) completely describes
the evolution of the economy over time. A first inspection of the two equa-
tions immediately reveals that both labor and capital tend to infinity. Indeed,
as µ > 0 labor grows without bound implying according to Lemma 2.2 that
output also grows without bound. This is not very revealing if one is looking
for steady states and is interested in a stability analysis. In such a situation
it is often advisable to look at the ratio of the two variables, in our case at
K/L. This has two main advantages. First, the dimension of the system
is reduced to one and, more importantly, the singularity at infinity is, at
least in the linear case, eliminated.9 Second, these ratios often have a clear
economic meaning making the economic interpretation of the results more
comprehensible.
We apply this device to the Solow model as described by equations (2.19)
and (2.20). Thus, dividing equation (2.19) by Lt+1 and making use of the
constant returns to scale assumption results in the fundamental equation of
the Solow model:
Kt+1 1−δ s
kt+1 = = kt + f (kt ) = g(kt ) (2.21)
Lt+1 1+µ 1+µ
where kt = K Lt
t
is known as the capital intensity and f (kt ) = F Kt
Lt
, 1 .
The economy starts in period zero with an initial capital intensity k0 > 0.
The nonlinear first order difference equation (2.21) together with the initial
condition uniquely determines the evolution of the capital intensity over time,
and consequently of all other variables in the model. Note that the concavity
of F is inherited by f and thus by g so that we have g 0 > 0 and g 00 < 0.
Moreover, limk→0 g(k) = 0 and limk→∞ g(k) = ∞.
Proposition 2.1. Given the assumptions of the Solow model, the funda-
mental Solow equation (2.21) has two steady states k ∗ = 0 and k ∗ > 0.
Proof. The steady states must satisfy the nonlinear equation:
k ∗ = g(k ∗ ).
0.35
steady state
0.3
k*
0.25
capital intensity
g(k)
0.15
45−degree line
0.1
0.05
0
0 0.05 0.1 0.15 0.2 0.25 k* 0.3
capital intensity
We can therefore study the local behavior of the nonlinear difference equa-
tion (2.21) around the steady state k ∗ > 0 by investigating the properties of
2.4. EXAMPLES OF FIRST ORDER EQUATIONS 25
Proof. Note that g 0 (k) > 0 for all k > 0. Concavity of g implies that g(k) −
k ∗ ≤ g 0 (k ∗ )(k − k ∗ ) for all k > 0. Take k < k ∗ , then g(k) > k. Thus,
g(k) − k ∗ < g 0 (k ∗ )(g(k) − k ∗ ) < 0 so that g 0 (k ∗ ) < 1.
Starting in period zero with an initial capital intensity k0 > 0, the solution
to this initial value problem is:
kt = k ∗ + φt (k0 − k ∗ )
dt + pet+1 − pt
r= ⇔ pet+1 = (1 + r)pt − dt (2.25)
pt
where pet+1 denotes the price expected to prevail in the next period. Assuming
that expectations of the investors are rational which is equivalent to assum-
ing perfect foresight in the context of no uncertainty, the above arbitrage
equation turns into a simple first order difference equation:
where Xt = (1 + r)−1 ∞
(f ) P −j
j=0 (1 + r) dt+j . Note that the forward solution is
only well-defined if the infinite sum converges. A sufficient condition for this
to happen is the existence of a finite index j0 such that |dt+j /(1 + r)j | < M j ,
for j > j0 and some M < 1. This is guaranteed, in particular, by a constant
dividend stream dt+j = d, for all j = 0, 1, 2, . . .
The term (1 + r)t cf is usually called the bubble term because its behavior
(f )
is unrelated to the dividend stream; whereas the term Xt is referred to as
the fundamentals because it is supposed to reflect the “intrinsic value” of the
share.
Remember that we want to figure out the price of a share. Take period
(f )
0 to be the current period and suppose that cf = p0 − X0 > 0. This
means that the current stock price is higher than what can be justified by
the future dividend stream. According to the arbitrage equation (2.25) this
high price (compared to the dividend stream) can only be justified by an
appropriate capital gain, i.e. an appropriate expected price increase in the
next period. This makes the price in the next period even more different
from the fundamentals which must be justified by an even greater capital
gain in the following period, and so on. In the end, the bubble term takes
over and the share price becomes almost unrelated to the dividend stream.
This situation is, however, not sustainable in the long run.10 Therefore, the
(f )
only reasonable current share price p0 is X0 which implies that cf = 0. This
effectively eliminates the bubble term and is actually the only nonexplosive
solution. Thus, we have a unique (determinate) rational equilibrium solution.
This solution is
∞
X
(f ) −1
pt = Xt = (1 + r) (1 + r)−j dt+j (2.28)
j=0
Thus, the price of a share always equals the present discounted value of the
corresponding dividend stream. Such a solution is reasonable in a situation
10 (f )
A similar argument applies to the case cf = p0 − X0 < 0.
2.4. EXAMPLES OF FIRST ORDER EQUATIONS 27
∂pt
= (1 + r)−h−1 h = 0, 1, . . .
∂dt+h
Thus, the effect diminishes the further the change takes place in the fu-
ture. Consider now a permanent change in dividends, i.e. a change where all
dividends increase by some constant amount 4d. The corresponding price
change 4pt equals:
∞
−1
X 4d
4pt = (1 + r) (1 + r)−j 4d =
j=0
r
t+1, pet+1 −pt , where the superscript e stands for expectation. This relation is
negative because households and firms want to hold less money if they expect
the real value of money to deteriorate in the next period due to high inflation
rates. Thus, α < 0. In this model, the central bank perfectly controls the
money stock and sets it independently of the development of the price level.
The model treats the logarithm of the supply of the money stock, mst , where
the superscript s stands for supply, as exogenous. The money stock injected
in the economy is completely absorbed by the economy so that in each point
in time the supply of money equals the demand of money. Combining the
first two equations, i.e. replacing mdt by mt in the first equation, leads to a
portfolio equilibrium condition. As we will see, the behavior of the model
depends crucially on the way in which expectations are formed. Following the
original contribution by Cagan, we postulated that expectations are formed
adaptively, i.e. agents form their expectations by extrapolating past inflation.
The third equation postulates a very simple adaptive expectation formation
scheme: inflation expected to prevail in the next period is just proportional
to the current inflation. Thereby the proportionality factor γ is assumed
to be positive, meaning that expected inflation increases if current inflation
increases. Combining all three equations of the model and solving for pt , we
arrive at the following linear nonhomogeneous first order difference equation:
αγ 1
pt = pt−1 + mt = φpt−1 + Zt (2.29)
1 + αγ 1 + αγ
αγ 1
where φ = 1+αγ and Zt = 1+αγ mt .
From our previous discussion we know that the general solution of this
difference equation is given as the sum of the general solution to the ho-
(p)
mogenous equation and a particular solution, pt , to the nonhomogeneous
equation:
(p)
pt = φt c + pt
One particular solution can be found by recursively inserting into equation
(2.29):
p1 = φp0 + Z1
p2 = φp1 + Z2 = φ2 p0 + φZ1 + Z2
...
pt = φt p0 + φt−1 Z1 + φt−2 Z2 + · · · + φZt−1 + Zt
t−1
X
= φt p0 + φi Zt−i
i=0
of p0 whereas the second one is a weighted sum of past logged money stocks.
In economics there is no natural starting period so that one may iterate the
above equation further, thereby going back into infinite remote past:
∞
X
pt = lim φi pt−i + φi Zt−i
i→∞
i=0
From a mathematical point of view this expression only makes sense if the
limit of the infinite sum exists. Thus, additional assumptions are required.
Suppose that logged money remained constant, i.e. mt = m < ∞ for all
t, then the logic of the model suggests that the logged priceP leveli should
remain finite as well. In mathematical terms this means that ∞ i→∞ φ should
converge. This is, however, a geometric sum so that convergence is achieved
if and only if
αγ
|φ| = <1 (2.30)
1 + αγ
Assuming that this stability condition holds, the general solution of the dif-
ference equation (2.29) implied by the Cagan model is:
∞
X
t
pt = φ c + φi Zt−i (2.31)
i=0
where the constant c can be computed from an initial value condition. Such
an initial condition arises naturally because the formation of adaptive expec-
tations requires the knowledge of the price from the previous period which
can then serve as an initial condition.
The stability condition therefore has important consequences. First, irre-
spective of the value of c, the first term of the solution (the general solution
to the homogenous equation), φt c, becomes less and less important as time
unfolds. Thus, for a large enough t, the logged price level willPbe dominated
by the particular solution to the nonhomogeneous equation, ∞ i
i=0 φ Zt−i . In
this infinite sum, the more recent values of the money stock are more impor-
tant for the determination of the price level. The importance of past money
stocks diminishes as one goes further back into the past. Third, suppose
that money stock is increased by a constant percentage point, ∆m, in every
period, then the effect on the logged price level, ∆pt is given by
∞
X
i 1 1 1
∆pt = φ ∆m = ∆m = ∆m.
i=0
1 + αγ 1 − φ 1 + αγ
Thus, the price level moves up by the same percentage point. Such a once-
and-for-all change is termed a permanent change. In contrast a transitory
30 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
1.5
0.5
−0.5
−1
−1.5
−2
0 2 4 6 8 10 12 14 16 18 20
h
Figure 2.4: Impulse response function of the Cagan model with adaptive
expectations taking α = −0.5 and γ = 0.9
change is a change which occurs only once. The effect of a transitory change
of mt by ∆m in period t on the logged price level in period t + h for some
h ≥ 0 is given by
h
h 1 αγ 1
∆pt+h = φ ∆m = ∆m
1 + αγ 1 + αγ 1 + αγ
...
Xh−1
−h −1
= φ pt+h − φ φ−i Zt+i for h > 0
i=0
The logged price level in period t, pt , now depends on some expected logged
price level in the future, pt+h , and on the development of logged money
expected to be realized in the future. Because the economy is expected to
live forever, this forward iteration is carried on into the infinite future to
yield:
∞
X
−h −1
pt = lim φ pt+h − φ φ−i Zt+i
h→∞
i=0
As 0 < φ−1 < 1, the limit and the infinite sum are well defined, provided
that the logged money stock remains bounded. Under the assumption that
the logged money stock is expected to remain bounded, the economic logic
of the model suggests that the logged price level should remain bounded as
well. This suggests the following particular solution to the nonhomogeneous
equation:
X∞
(p)
pt = −φ−1 φ−i Zt+i
i=0
∞
X
(p) −1
t
pt = φ c + pt t
=φ c−φ φ−i Zt+i (2.34)
i=0
Due to the term φt c, the logged price level grows exponentially without bound
although the logged money stock may be expected to remain bounded, unless
(p)
c = 0. Thus, setting c = 0 or equivalently p0 = p0 guarantees a nonexplosive
rational expectations equilibrium.
32 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
Thus, the superposition principle stated in Theorem 2.2 also holds in the
general case: the general solution to the nonhomogeneous equations can
be represented as the sum of the general solution to the homogeneous and
a particular solution to the nonhomogeneous equation. Thus, we begin the
analysis of the general case by an investigation of the homogeneous equation.
2.5. DIFFERENCE EQUATIONS OF ORDER P 33
1 − φ1 z − φ2 z 2 − · · · − φp z p = 0 (2.36)
distinct roots
In this case we have the following theorem.
Theorem 2.5 (Fundamental Set for equation of order p). If all the roots of
the characteristic equation are distinct, the set {λt1 , . . . , λtp } forms a funda-
mental set of solutions.
Proof. It suffices to show that det C(t) 6= 0 where C(t) is the Casarotian
matrix of {λt1 , . . . , λtp }.
λt1 λt2 ... λtp
λt+1
1 λt+1
2 ... λt+1
p
det C(t) = det
.. .. .. ..
.
. . .
λt+p−1
1 λ2t+p−1 . . . λt+p−1
p
1 1 ... 1
λ1 λ2 . . . λ p
t t t
= λ1 λ2 . . . λp det .. .. ..
..
. . . .
p−1 p−1 p−1
λ1 λ2 . . . λp
34 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
This second
Q matrix is called the Vandermonde matrix whose determinant
equals 1≤i<j≤p (λj − λi ) which is different from zero because the roots are
distinct. Thus, det C(t) 6= 0, because the roots are also different from zero.
The above Theorem thus implies that the general solution to the homo-
(g)
geneous equation Xt is given by
(g)
Xt = c1 λt1 + c2 λt2 + · · · + cp λtp . (2.37)
Using the same technique as in the proof of Theorem 2.1, it is easy to demon-
strate that the set of solutions forms a linear space of dimension p.
multiple roots
When the roots of the characteristic equation are not distinct, the situa-
tion becomes more complicated. Denote the r distinct roots by z1 , · · · , zr ,
r < p, and their corresponding multiplicities by m1 , · · · , mr . Writing the
homogeneous difference equation in terms of the lag operator leads to
(1 − φ1 L − · · · − φp Lp ) Xt
= (1 − λ1 L)m1 (1 − λ2 L)m2 · · · (1 − λr L)mr Xt = 0 (2.38)
(1 − λi L)mi ψt = 0 (2.39)
(1 − L)k ts = 0, 0≤s<k
Lemma 2.4. The set Gi = {λti , tλti , t2 λti , · · · , tmi −1 λti } represents a funda-
mental set of solutions to the equation (2.39).
Proof. Take s, 1 ≤ s ≤ mi − 1, then
(1 − λi L)mi (ts λti ) = λti (1 − L)mi (ts ) = 0
because (1−L)mi ts = 0 according to Lemma 2.3.12 Therefore ts λti is a solution
to (2.39). The set Gi is linearly independent because the set {1, t, t2 , · · · , tmi −1 }
is linearly independent.
It is then easily seen that G = ri=1 Gi is a fundamental set of solutions
S
to the equation (2.38). Thus, the general solution can be written as
r
X
ci0 + ci1 t + ci2 t2 + · · · + ci,mi −1 tmi −1 λti .
Xt = (2.40)
i=1
Higher order equations will add no new qualitative features. Assuming that
1 − φ1 − φ2 6= 0, the unique fixed point of this homogenous equation is 0. The
corresponding characteristic equation is given by the quadratic equation:
1 − φ1 z − φ2 z 2 = 0.
Or in terms of λ = z1 :
p
φ1 ± φ21 + 4φ2
λ1,2 = . (2.42)
2
To understand the qualitative behavior of Xt , we distinguish three cases:
complex roots: The two roots appear as complex conjugate pairs and may
be written as λ1 = α+ıβ and λ2 = α−ıβ with β 6= 0. In terms of polar
coordinates the two roots may alternatively
p be written as λ1 = reıθ ,
respectively λ2 = re−ıθ , where r = α2 + β 2 and θ = tan−1 αβ . The
= 2ρrt cos(θt + ω)
38 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
3 3 3
2 2 2
1 1 1
Xt
Xt
Xt
0 0 0
−1 −1 −1
−2 −2 −2
−3 −3 −3
0 5 10 0 5 10 0 5 10
case 1: λ1 = 1.1 case 2: λ1 = 1 case 3: λ1 = 0.8
3 3 3
2 2 2
1 1 1
Xt
Xt
Xt
0 0 0
−1 −1 −1
−2 −2 −2
−3 −3 −3
0 5 10 0 5 10 0 5 10
case 4: λ1 = −0.8 case 5: λ1 = −1 case 6: λ1 = −1.1
1. r > 1: both roots are outside the unit circle (i.e. the circle of
radius one and centered in the point (0, 0)). Xt oscillates, but
with ever increasing amplitude. The fixed point zero is unstable.
2. r = 1: both roots are on the unit circle. Xt oscillates, but with
constant amplitude.
3. r < 1: both roots are inside the unit circle. The solution oscillates,
but with monotonically decreasing amplitude and converges to
zero as t → ∞. The fixed point zero is asymptotically stable.
20
0
Xt
−20
−40
0 1 2 3 4 5 6 7 8 9 10
case 1: roots outside unit circle (λ = 1+i, λ = 1−i)
1 2
1
t
0
X
−1
0 1 2 3 4 5 6 7 8 9 10
case 2: roots on unit circle (λ1 = (sqrt(2)/2)(1+i), λ2 = (sqrt(2)/2)(1−i))
1
t
0
X
−1
0 1 2 3 4 5 6 7 8 9 10
case 3: roots in unit circle (λ = 0.5(1+i), λ = 0.5(1−i))
1 2
Z
X ∗ = φ1 X ∗ + φ2 X ∗ + Z ⇒ X∗ =
1 − φ1 − φ2
(g)
Thus, Xt converges to its equilibrium if and only if Xt converges to zero as
(g)
t → ∞. Moreover, the solution oscillates around X ∗ if and only if Xt oscil-
lates around zero. Based on the theorem just above, the following theorems
hold.
Theorem 2.7 (Limiting Behavior of Second Order Equation (original pa-
rameters)). Assuming 1 − φ1 − φ2 6= 0, the following statements hold.
(i) All solutions of the nonhomogeneous equation (2.43) oscillate around
the equilibrium point X ∗ if and only if the characteristic equation has
no positive real characteristic root.
(ii) All solutions to the nonhomogeneous equation (2.43) converge to X ∗
(i.e. X ∗ is asymptotically stable) if and only if max{|λ1 |, |λ2 |} < 1.
Theorem 2.8 (Stability Conditions of Second Order Equation (original pa-
rameters)). The equilibrium point X ∗ is asymptotically stable (i.e. all solu-
tions converge to X ∗ ) if and only if the following three conditions are satisfied:
(i) 1 − φ1 − φ2 > 0
(ii) 1 + φ1 − φ2 > 0
(iii) 1 + φ2 > 0
Proof. Assume that X ∗ is an asymptotically stable equilibrium point. Ac-
cording to the previous Theorem (2.6), this means that both λ1 and λ2 must
be smaller than one in absolute value. According to equation (2.42) this
implies that
φ + pφ2 + 4φ φ − pφ2 + 4φ
1 1 2 1 1 2
|λ1 | = < 1 and |λ2 | = <1.
2 2
or, equivalently,
q
−2 − φ1 < φ21 + 4φ2 < 2 − φ1
q
−2 − φ1 < − φ21 + 4φ2 < 2 − φ1
2.5. DIFFERENCE EQUATIONS OF ORDER P 41
Squaring the second inequality in the first line implies: φ21 + 4φ2 <
4 − 4φ1 + φ21 which leads to condition (i). Similarly, squaring the first
inequality in the second line yields: 4 + 4φ1 + φ21 > φ21 + 4φ2 which
results in condition (ii). The assumption |λ1 | < 1 and |λ2 | < 1 imply
that |λ1 λ2 | = | − φ2 | < 1 which gives condition (iii).
complex roots: φ21 + 4φ2 < 0: This implies that 0 < φ21 < −4φ2 . There-
fore
4(1 − φ1 − φ2 ) > 4 − 4φ1 + φ21 = (2 − φ1 )2 > 0
which is equivalent to condition (i). Similarly,
4(1 + φ1 − φ2 ) > 4 + 4φ1 + φ21 = (2 + φ1 )2 > 0
which is equivalent to condition (ii). In order to obtain condition (iii),
note that the two complex conjugate roots are given by
φ1 ı φ1 ı
q q
2
λ1 = + φ1 + 4φ2 and λ2 = − φ21 + 4φ2 .
2 2 2 2
Because |λ1 | < 1 and |λ2 | < 1 by assumption, we have that |λ1 λ2 | =
| − φ2 | < 1 which is condition (iii).
Assume now that the three conditions are satisfied. They immediately imply
that −2 < φ1 < 2 and that −1 < φ2 < 1. If the roots are real then
p p
−2 + φ21 + 4φ2 φ1 + φ21 + 4φ2
−1 < < λ1 =
2 p 2
φ1 + φ21 + 4 − 4φ1
<
q 2
φ1 + (2 − φ1 )2
=
2
φ1 − φ1 + 2
= =1
2
Similarly,
p p
2− φ21 + 4φ2 φ1 − φ21 + 4φ2
1> > λ2 =
2 p 2
2
φ1 − φ1 + 4 + 4φ1
>
q 2
φ1 − (φ1 + 2)2
=
2
φ1 − φ1 − 2
= = −1
2
42 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
2
explosive explosive
oscillations growth
1
asymptotically
stable
φ2
0
-1
explosive oscillations
-2
parabola: φ 21 + 4φ 2 = 0
-3
-3 -2 -1 0 1 2 3
φ1
If the roots are complex, λ1 and λ2 are complex conjugate numbers. Their
φ2 −(φ2 +4φ )
squared modulus then equals λ1 λ2 = 1 41 2 = −φ2 . As −φ2 < 1, the
modulus of both λ1 and λ2 is smaller than one.
The three conditions listed above determine a triangle in the φ1 -φ2 -plane
with vertices (−2, −1), (0, 1) and (2, −1). Points inside the triangle imply
an asymptotically stable behavior whereas points outside the triangle lead
to an unstable behavior. The parabola φ21 + 4φ2 = 0 determines the region
of complex roots. Values of φ1 and φ2 above the parabola lead to real roots
whereas values below the parabola lead to complex roots. The situation is
represented in Figure 2.7.
2.5.4 Examples
Multiplier Accelerator model
A classic economic example of a second order difference equation is the
multiplier-accelerator model originally proposed by Samuelson (1939). It
was designed to demonstrate how the interaction of the multiplier and the
accelerator can generate business cycles. The model is one of a closed econ-
omy and consists of a consumption function, an investment function which
2.5. DIFFERENCE EQUATIONS OF ORDER P 43
In the general case where government expenditures are not constant, but
vary over time, we apply the method of undetermined coefficients to find
(p)
a particular solution, Yt , to equation (2.44). This method conjectures a
certain type of solution and then tries to pin down a solution by inserting it
into the difference equation. In the particular case at hand, the roots of the
characteristic function are all outside the unit circle. Thus, we conjecture a
particular solution of the form:
∞
X
(p)
Yt =c+ ψi Gt−i
i=0
The coefficients d1 and d2 can then be determined from the initial conditions:
ψ0 = 1 = d1 + d2
ψ1 = β + γ = d1 λ1 + d2 λ2
1.4
1.2
0.8
0.6
real roots:
β = 4/5, γ = 1/5
0.4 multiple roots:
β = 3/4, γ = 1/4
0.2
−0.2
complex roots:
−0.4
β = 2/3, γ = 2/3
−0.6
0 2 4 6 8 10 12 14 16 18 20
period
1 − φz + z 2 = 0
This equation implies that the two roots, z1 and z2 , are given by
p
φ ± φ2 − 4
z1,2 =
2
First note that because φ > 2 the roots are real, distinct, and positive.
Second they come in reciprocal pairs as z1 z2 = 1. Thus, one root is smaller
than one whereas the other is necessarily greater than one. Thus, we have
one stable and one explosive root. The solution to the homogenous equation
can therefore be written as
pt = c1 λt + c2 λ−t
reflected in the inventories carried over last period. Thus, we conjecture that
the solution will have both a forward and a backward looking component.
Thus, we seek for a particular solution of the following form:
∞
X
pt = ψj ut−j
j=−∞
···
ut+2 : ψ−1 = φψ−2 − ψ−3
ut+1 : ψ0 = φψ−1 − ψ−2
1
ut : ψ1 = φψ0 − ψ−1 +
α
ut−1 ψ2 = φψ1 − ψ0
ut−2 : ψ3 = φψ2 − ψ1
···
This shows that the ψj ’s follow homogenous second order difference equa-
tions:
ψj = d1 λj + d2 λ−j
ψ−j = e1 λj + e2 λ−j
2.5. DIFFERENCE EQUATIONS OF ORDER P 49
If we impose again the requirement that the price must be finite if the supply
shock has always been constant and is expected to remain constant in the
future, we have to set c1 = 0 and c2 = 0 to get the solution:
∞
α−1 X
pt = λ|j| ut−j (2.47)
λ − λ−1 j=−∞
This implies that {pt } is a bounded sequence, i.e. that (pt ) ∈ `∞ . In this
case the price pt is just a function of all past shocks and all expected future
shocks.
Another way to represent this solution is to express pt as
X∞
pt = λpt−1 − α−1 λ λj ut+j .
j=0
In this expression the double infinite sum is replaced by a single one. This is
due to the fact that the past evolution of supply is now summarized by pt−1
which is supposed to be known in period t. The effect of discounted expected
future supply is just as before.
In order to gain a better understanding of the dynamics, we will analyze
the following numerical example. In this example α = 20 9
and the parameters
β and γ are such that φ = 2.05. This implies that β + γ = 19 . The roots are
then given by λ = 0.8 and λ−1 = 1.25. The bounded solution is then given
by
X∞
pt = − 0.8|j| ut−j
j=−∞
50 CHAPTER 2. LINEAR DIFFERENCE EQUATIONS
Suppose that the supply shock has been constant forever and is expected
to remain constant at u. The above formula then implies that the logged
price level pt equals −9u and that It = 0. Suppose that an unexpected and
transitory positive supply shock of value 1 hits the market in period 0. Then
according to the first panel in Figure 2.9 the price immediately falls by 1.
At the same time inventories rise because prices are expected to move up in
the future due to the transitory nature of the shock. Here we have a typical
price movement: the price falls, but is expected to increase. After the shock
the market adjusts gradually as prices rise to their old level and by running
down inventories.
Consider now a different gedankenexperiment. Suppose that the shock
is not unexpected, but expected to hit the market only in period 5. In
this case, we see a more interesting evolution of prices and inventories. In
period zero when the positive supply shock for period 5 is announced, market
participants expect the price to fall in the future. They therefore want to
get rid of their inventories by trying to selling them already now.13 As a
result, the price and the inventories start to fall already before the supply
shock actually takes place. In period 5 when the supply shock finally hits the
market, market participants expect the price to move up again in the future
which leads to a buildup of inventories. Note that this buildup is done when
the price is low. From period 5 on, the market adjusts like in the previous
case because the supply shock is again assumed to be transitory in nature.
Taylor model
In this example we analyze a simple deterministic version of Taylor’s stag-
gered wage contract model which also has a backward and forward component
(see Taylor (1980) and Ashenfelter and Card (1982)).14 In this model, half
of the wages have to be contracted in each period for two periods. Thus, in
each period half of the wages are renegotiated taking the wages of the other
group as given. Assuming that the two groups are of equal size, wages are
set according to the following rule:
Thus, wage setting in period t takes into account the wages of contracts still
in force, wt−1 , and the expected wage contract in the next period, wt+1 . As
the two groups are of equal size and power, we weight them equally by 0.5.
In addition wages depend on the state of the economy over the length of the
13
In our example they actually go short as I0 < 0.
14
The model could equally well be applied to analyze staggered price setting behavior.
2.5. DIFFERENCE EQUATIONS OF ORDER P 51
−1
0 2 4 6 8 10 12 14 16 18 20
period
expected positive transitory supply shock in period 5
0.5
logged price level / inventory
inventories
0
−0.5
logged price level
−1
0 2 4 6 8 10 12 14 16 18 20
period
The symmetric nature of the polynomial coefficients implies that the roots
appear in pairs such that one root is the inverse of the other.15 This means
that one root, say λ1 , is smaller than one whereas the other one is greater
than one, i.e. λ2 = 1/λ1 . To see this note first that the discriminant is equal
to 4 = −hγ > 0. Thus, the roots are real and second that λ1 λ2 = 1. If we
denote λ1 by λ then λ2 = 1/λ and we have φ = λ + λ−1 .
Applying the superposition principle, the solution becomes
(p)
wt = c1 λt + c2 λ−t + wt (2.52)
(p)
where the coefficients c1 and c2 and a particular solution wt have yet to be
determined. In order to eliminate explosive solutions, we set c2 = 0. The
other constant can then be determined by noting that (wt ) is a predetermined
variable such that the wage negotiations in period one take wages from the
(p)
other group negotiated in period zero as given. Thus, c1 = w0 − w0 . To
find the particular solution, set
∞
X
(p)
wt = ψj Zt−j
j=−∞
and insert this solution into the difference equation (2.52) and perform a
comparison of coefficients as in the previous exercise. This leads again to
two homogeneous difference equations for the coefficients (ψj ) and (ψ−j ),
j ≥ 1 with solutions
ψj = d1 λj + d2 λ−j
ψ−j = e1 λj + e2 λ−j
where the coefficients d1 , d2 , e1 and e2 have still to be determined. The elim-
ination of explosive coefficient sequences leads to d2 = e2 = 0. Furthermore,
both solutions must give the same ψ0 so that d1 = e1 . Denote this value by
d, then comparing the coefficients for Zt and noting that φ = λ + λ−1 leads
to:
ψ1 = φψ0 − ψ−1 + 1 ⇐⇒ dλ = φd − dλ + 1.
Therefore
1
d= < 0.
λ − λ−1
The effect of a shock to aggregate demand in period j ≥ 0 is then
∂wt+j 2h
= ψ−j + ψ−j−1 = − d(1 + λ)λj , j = 0, 1, 2, . . .
∂vt 1 + hγ
15
This conclusion extends to contracts longer than two periods (see Ashenfelter and
Card, 1982).
Chapter 3
3.1 Introduction
This chapter treats systems of linear difference equations. For each variable
X1t , · · · , Xnt , n ≥ 1, we are given a linear nonhomogeneous difference equa-
tion of order p where each variable can, in principle, depend on all other
variables with a lag. Writing each difference equation separately, the system
is given by
(1) (1) (1)
X1t = φ11 X1,t−1 + φ12 X2,t−1 + · · · + φ1n Xn,t−1
(p) (p) (p)
+ · · · + φ11 X1,t−p + φ12 X2,t−p + · · · + φ1n Xn,t−p + Z1t
(1) (1) (1)
X2t = φ21 X1,t−1 + φ22 X2,t−1 + · · · + φ2n Xn,t−1
(p) (p) (p)
+ · · · + φ21 X1,t−p + φ22 X2,t−p + · · · + φ2n Xn,t−p + Z2t
···
(1) (1)
Xnt = φn1 X1,t−1 + φn2 X2,t−1 + · · · + φ(1)
nn Xn,t−1
(p) (p)
+ · · · + φn1 X1,t−p + φn2 X2,t−p + · · · + φ(p)
nn Xn,t−p + Znt
Using matrix notation this equation system can be written more com-
pactly as
53
54 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
(k)
1, 2, . . . , p, denote the matrices Φk = φi,j for k = 1, 2, . . . , p. The
i,j=1,2,...,n
solution of this difference equation is based again on the same principles as
in the univariate case (see page 8). Before doing so we show how to reduce
this p-th order system to a first order system.
Any system of order p can be rewritten as a system of order 1. In order to
see this, define a new variable Yt as the stacked vectors Xt , Xt−1 , · · · , Xt−p+1 .
This new variable then satisfies the following first order system:
Xt Xt−1 Zt
Xt−1 Φ1 Φ2 Φ3 . . . Φp−1 Φp
Xt−2 0
Xt−2 In 0 0 . . . 0 0
Xt−3 0
Yt = .. = 0 In 0 . . . 0 0 .. + ..
. .. .. .. . . .. .. . .
. . . . . .
Xt−p+2 Xt−p+1 0
0 0 0 . . . In 0
Xt−p+1 Xt−p 0
= ΦYt−1 + Zt (3.2)
0
where Zt is redefined to be Zt0 0 0 . . . 0 0 . In denotes the identity
matrix of dimension n. The matrix Φ is an np × np matrix called the com-
panion matrix of (3.1).1 Thus, multiplying out the equation system (3.2) one
can see that the first equation gives again the original equation (3.1) whereas
the remaining p − 1 equations are just identities. The study of a p-th order
system can therefore always be reduced to a first order system.
The Casarotian matrix is closely related to the issue whether or not the
sequences are independent.
Lemma 3.1. If det C(t) of n sequences (X (i) ), 1 ≤ i ≤ n, is different from
zero for at least one t0 ≥ 0, then (X (i) ), 1 ≤ i ≤ n, are linearly independent
for t ≥ 0.
3.2. FIRST ORDER SYSTEM OF DIFFERENCE EQUATIONS 57
Proof. Suppose there exists a t0 such det C(t0 ) = 0. This implies that there
(i)
exists a nonzero vector c such that C(t0 )c = ni=1 ci Xt = 0. Because the
P
(i) (i)
Xt are solutions so is the linear combination Yt = ni=1 ci Xt . For this
P
solution Yt0 = 0 thus Yt = 0 for all t because the uniqueness of the solution.
As the solutions are, however, linearly independent c must be equal to 0
which stands in contradiction to c 6= 0.
We can combine the two Lemmas to obtain the following theorem.
Definition 3.3. Any n×n matrix U(t) which is nonsingular for all t ≥ 0 and
which satisfies the homogenous matrix system (3.8) is called a fundamental
matrix. If in addition the matrix satisfies U(0) = In then it is called a
principal fundamental matrix.
Note that if V(t) is any fundamental matrix then U(t) = V(t)V −1 (0)
is a principal fundamental matrix. Note also if a V(t) is a fundamental
matrix then V(t)C is also a fundamental matrix where C is any nonsingular
matrix. This implies that there are infinitely many fundamental matrices for
a given homogenous matrix system. There is, however, only one principal
fundamental matrix because the matrix difference equation (3.8) uniquely
determines all subsequent matrices once an initial matrix is given. In the
case of a principal fundamental matrix this initial matrix is the identity
matrix. In this monograph we will not pursue the concept of the fundamental
matrix further because it does not payoff in the context of constant coefficient
systems.3 Only note that U(t) = Φt where U(t) is a principal fundamental
matrix. Thus, any solution to the homogenous system (3.5) has the form:
Xt = U(t)c = Φt c (3.9)
Distinct Eigenvalues
If all the eigenvalues, λ1 , · · · , λn of Φ are distinct, then Φ is diagonalizable,
i.e. similar to a diagonal matrix. Thus, there exists a nonsingular matrix
Q such that Q−1 ΦQ = Λ where Λ = diag(λ1 , · · · , λn ). The columns of Q
consist of the eigenvectors of Φ. With this similarity transformation in mind
it is easy to compute Φt :
Thus, the general solution to the homogenous system (3.5) can be written
as n
X
Xt = ci qi λti , t≥0 (3.10)
i=1
Y1,t+1 = λ1 Y1,t
···
Yn,t+1 = λn Yn,t
Repeated Eigenvalues
where
0 1 0 ··· 0 0
0
0 1 ··· 0 0
N = ... .. .. ..
.. .
. . ..
0 0 0 · · · 0 1
0 0 0 ··· 0 0
Jordan segment J t (λi ) is just J t (λi ) = diag J1t (λi ) , · · · , Jtti (λi ) . The t-th
62 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
Jj (λi )t = (λi I + N )t
t t−1 t t−2 2 t
t
= λi I + λ N+ λ N + ··· + λt−k+1 N k−1
1 i 2 i k−1 i
t t t−1 t t−2 t
t−k+1
λi 1 λi λ
2 i
· · · k−1 i
λ
t t−1 t
0
λi t
1
λi · · · k−2 λit−k+2
.. .
.. . .. . .. .
..
= . (3.12)
t
t−1
1
λi
t
0 0 0 ··· λi
where N is the nilpotent matrix of size corresponding to the Jordan block.
Xt = ΦXt−1 + Zt (3.13)
Xt = ΦXt−1 + Zt
Xt = Φ(ΦXt−2 + Zt−1 ) + Zt = Φ2 Xt−2 + ΦZt−1 + Zt
...
Xt = Φt X0 + Φt−1 Z1 + Φt−2 Z2 + · · · + ΦZt−1 + Zt
t−1
X
t
= Φ X0 + Φj Zt−j
j=0
3.3. STABILITY THEORY 63
X ∗ = f (X ∗ , t) for all t ≥ 0
Lemma 3.3. The zero solution of the homogeneous system (3.5) is stable if
and only if there exists M > 0 such that
t
Φ
≤ M for all t ≥ 0. (3.20)
Proof. Suppose that the inequality (3.20) is satisfied then kXt k ≤ M kX0 k.
Thus, for ε > 0, let δ = Mε . Then kX0 k < δ implies kXt k < ε so that the
zero point is stable. Conversely, suppose that the zero point is stable. Then
for all kX0 k < δ
Φ
= sup
Φt ξ
= 1 sup
Φt X0
≤ ε = M
t
kξk≤1 δ kX0 k≤δ δ
where the first inequality is just the definition of the matrix norm correspond-
ing to the norm in Rn . The second equality is a consequence of kX0 k ≤ δ.
The inequality above follows from the assumption that the zero point is a
stable equilibrium.
The condition given in equation (3.20) is equivalent to the condition that all
solutions are bounded.
Theorem 3.4. For the homogeneous system (3.5) the following statements
are true:
(i) The zero solution is stable if and only if ρ(Φ) ≤ 1 and the eigenvalues
on the unit circle are semisimple.
(ii) The zero solution is asymptotically stable if and only if ρ(Φ) < 1. In
this case, the solution is even exponentially stable.
66 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
Proof. According to previous lemma 3.3 we have to prove that kΦt k ≤ M for
some M > 0. Using the Jordan canonical form Φ = QJQ−1 this amounts to
kΦt k = kQJ t Q−1 k ≤ M . But this is equivalent to the existence of a M̃ > 0
M̃
such that kJ t k ≤ M̃ . M may then be taken as M = kQkkQ −1 k . Now the power
Jj (λi )t = (λi I + N )t
t t−1 t t−2 2 t
t
= λi I + λ N+ λ N + ··· + λt−k+1 N k−1
1 i 2 i k−1 i
t t t−1 t t−2 t
t−k+1
λi 1 λi λ
2 i
· · · k−1 λit−k+2
t t t−1 t
0
λi 1
λi · · · k−2 λi
.. .
.. . .. .. ..
= . .
. .
t
t−1
1
λ i
0 0 0 ··· λti
The elements in this matrix become unbounded if |λi | > 1. They also become
unbounded if |λi | = 1 and Jj (λi ) is not a 1 × 1 matrix. If, however, for all
eigenvalues with |λi | = 1 the largest Jordan blocks are 1 × 1, then the Jordan
segment corresponding to λi with |λi | = 1, J (λi ), is just a diagonal matrix
with ones in the diagonal and is therefore obviously bounded. The Jordan
segments, J (λi ), corresponding to eigenvalues |λi | < 1, converge to zero, i.e.
limt→∞ J (λi )t = 0 because tk λti → 0 as t → ∞ by l’Hôpital’s rule.
Theorem 3.4 showed that the stability properties crucially depend on
the location of the eigenvalues relative to the unit circle. If there are no
eigenvalues on the unit circle, small perturbations of Φ will not affect the
location of the eigenvalues relative to the unit circle because they depend
continuously on the entries of Φ. Thus, small perturbations preserve the
stability properties. This motivates to bring out matrices with no eigenvalues
on the unit circle in an own definition.
Definition 3.6 (Hyperbolic Matrix). A hyperbolic matrix is a matrix with
no eigenvalues on the unit circle. If A is a hyperbolic matrix, then the
corresponding linear homogeneous difference equation Xt+1 = AXt is also
called hyperbolic.
Consider a hyperbolic matrix Φ with spectrum σ = {λ1 , . . . , λs }, 1 ≤
s ≤ n.9 Then, partition the spectrum into the stable, respectively unstable
9
The spectrum of a matrix is given by the set of its distinct eigenvalues.
3.3. STABILITY THEORY 67
Proof. The proof follows Elaydi (2005, theorem 4.14). Let X(t, x0 ) be a
solution with x0 ∈ W s . The definition of the (generalized) eigenvector corre-
sponding to some eigenvalue λ ∈ σ implies that ΦEλ = Eλ where Eλ denotes
the space generated by the (generalized) eigenvectors corresponding to λ.
Hence ΦW s = W s and s s
PrX(t, x0 ) ∈ W for all t ≥ 0. Given x0 ∈ W , we can
represent x0 as x0 = j=1 αj ej where r is the number of eigenvalues, distinct
or not, strictly smaller than one and ej denotes the (generalized) eigenvec-
tors corresponding to those eigenvalues. Let J = Q−1 ΦQ be the Jordan form
of Φ. Next, partition the Jordan matrix into blocks of stable, respectively
unstable eigenvalues accordingly:
Js 0
J=
0 Ju
The last equality follows from the fact that the Q−1 ej ’s are of the form
(ξ1j , . . . , ξrj , 0, . . . , 0)0 which is a consequence of the ej ’s being (generalized)
eigenvectors. This implies that X(t, x0 ) → 0 as t → ∞ because Jst → 0 as
t → ∞.
The proof of (ii) is analogous.
Remark 3.3. Theorem 3.5 may be refined by relaxing the assumption of
an hyperbolic matrix and can be sharpened with respect to its conclusions
(Colonius and Kliemann, 2014, theorem 1.5.8).
Linearization
As in the case of one dimensional difference equations (see Theorem 2.4), we
can analyze the stability properties of nonlinear systems via a linear approx-
imation around the steady state. Consider for this purpose the nonlinear
homogeneous system Xt+1 = F (Xt ) with F : Rn −→ Rn and fixed point
X ∗ . Suppose that F is continuously differentiable in an open neighborhood
of X ∗ , then the linearized difference equation is given by
Xt+1 − X ∗ = A(Xt − X ∗ )
where A is the derivative of F evaluated at X ∗ , i.e. A = DF (X ∗ ), the matrix
of partial derivatives. If A is a hyperbolic matrix, then we say that X ∗ is a
hyperbolic fixed point.
The Stable Manifold Theorem 3.5 can be extended, at least locally, to
nonlinear difference equations. This is the subject of the Hartman–Grobman
theorem (see Elaydi (2005) and Robinson (1999, section 5.6)).
Theorem 3.6 (Hartman–Grobman Theorem). If F is C r –diffeomorphism10
with hyperbolic fixed point X ∗ , then there exist neighborhoods V of X ∗ and
W of 0 and a homeomorphism H : W −→ V such that F (H(X)) = H(AX).
Proof. Robinson (See 1999, sections 5.6 and 5.7)
Remark 3.4. The Theorem says that F is topologically equivalent in a
neighborhood of the fixed point X ∗ to the linear map induced by the deriva-
tive evaluated at the fixed point. This fact can be represented graphically as
follows
A
W −−−→ W
Hy yH
F
V −−−→ V
10
This means that the function is r-times continuously differentiable and that it is a
homeomorphism (i.e. one-to-one and onto, continuous with continuous inverse).
3.4. TWO-DIMENSIONAL SYSTEMS 69
Definition 3.7 (Saddle Point). The zero solution of the hyperbolic linear
difference equation Xt+1 = AXt is called a saddle point if there exist at least
two eigenvalues of A, λu and λs , such that |λu | > 1 and |λs | < 1.
Squaring the second inequality in the first line and simplifying gives: tr(Φ) <
1 + det Φ. Squaring the first inequality in the second line gives −1 − det Φ <
tr(Φ). Combining both results gives the first part of the stability condition
(3.21). The second part follows from the observation that det Φ = λ1 λ2 and
the assumption that |λ1,2 | < 1. If the roots are complex, they are conjugate
complex, so that the second part of the stability (3.21) results from det Φ =
70 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
λ1 λ2 = |λ1 | |λ2 | < 1. The first part follows from tr2 (Φ) − 4 det Φ < 0 which
is equivalent to 0 < tr2 (Φ) < 4 det Φ. This can be used to show that
4 (1 + det Φ − tr(Φ)) > 4 + tr2 (Φ) − 4tr(Φ) = (2 − tr(Φ))2 > 0.
which is the required inequality.
Conversely, if the stability condition (3.21) is satisfied and if the roots are
real, we have
p p
−2 + tr2 (Φ) − 4 det Φ tr(Φ) + tr2 (Φ) − 4 det Φ
−1 < < λ1 =
2 p 2
2
tr(Φ) + tr (Φ) + 4 − 4tr(Φ)
<
q 2
tr(Φ) + (2 − tr(Φ))2
= < 1.
2
Similarly, for λ2 . If the roots are complex, they are conjugate complex and we
2 2
have |λ1 |2 = |λ2 |2 = λ1 λ2 = tr (Φ)−tr 4(Φ)+4 det Φ = det Φ < 1. This completes
the proof.
X2 t
schedule: X1 t+1
, - X1 t = 0
region I region II
schedule:
steady X2 t+1
, - X2 t = 0
state
X1 t
region III
region IV
The dynamics of the system in each of the four regions can be figured
out from the signs of the coefficients as follows. Suppose we start at a point
on the X1,t+1 − X1,t = (φ11 − 1)X1,t + φ12 X2,t = 0 schedule then we know
that the first variable does not change. Now increase X2,t a little bit. This
moves us into region I or IV. If φ12 is positive, this implies that X1,t+1 −
X1,t > 0 so that the first variable increases. Thus, we know that above the
X1,t+1 − X1,t = 0 line, X1,t increases and that below this line X1,t decreases.
We can indicate this result in figure 3.1 by arrows from left to right in regions
I and IV and arrows from right to left in regions II and III. If φ12 is negative,
we obtain, of course, the opposite result. Similarly, consider the schedule
X2,t+1 − X2,t = φ21 X1,t + (φ22 − 1)X2,t = 0 where the second variable does
not change. Consider now an increase in X1,t . This moves us into region III
or IV. If the coefficient φ21 is positive, this implies that X2,t+1 − X2,t > 0 so
that the second variable must increase. As before we can infer that below the
X2,t+1 − X2,t = 0 line X2,t increases whereas above this line X2,t decreases.
We can again indicate this behavior by arrows: upward arrows in regions
III and IV and downward arrows in regions I and II. If the sign of φ21 is
negative, the opposite result is obtained. This type of analysis gives us a
phase diagram as in figure 3.1. This diagram shows us the dynamics of the
system starting in every possible point. In this example, the arrows indicate
that wherever we are, we will move closer to the steady state. But this
corresponds exactly to the definition of an asymptotically stable equilibrium
point (see definition 3.18). Thus, we conclude from this diagram that the
steady state is an asymptotically stable equilibrium point.
Of course the situation depicted in figure 3.1 is not the only possible
one. In order to analyze all possible situations which can arise in a two-
dimensional system, we make a variable transformation using the Jordan
canonical form of Φ. If Φ has the Jordan canonical form Φ = QJQ−1 then
we make the variable transformation Yt = Q−1 Xt . This results in a new first
order homogenous difference equation system:
Note that the steady state is not affected by this variable transformation. It
is still the point (0, 0)0 . Let us treat these three cases separately.
0.8
0.6
0.4
0.2
Y2,t
−0.2
steady
state
−0.4
−0.6
−0.8
−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
Y1,t
1000
500
Y2,t
steady
state
−500
−1000
−6 −4 −2 0 2 4 6
Y1,t
0.8
0.6
0.4
0.2
2,t
0
Y
−0.2
steady
−0.4 state
−0.6
−0.8
−1
−6 −4 −2 0 2 4 6
Y
1,t
state. Thus, given an initial value y20 for Y2,t , the requirement that the
solution must be bounded uniquely determines an initial condition for
Y1,t too, which in this reduced setting is just Y1,0 = y10 = 0. Thus,
the solution is given by Y1,t = 0 and Y2,t = λt2 y2,0 for some initial value
y2,0 . Note that the saddle path, in contrast to the other paths, is a
straight line through the origin. This property is carried over when
the system is transformed back to its original variables. In fact, the
solution becomes X1,t = q12 λt2 y2,0 and X2,t = q22 λt2 y2,0 where (q12 , q22 )0
is the eigenvector corresponding to λ2 . Thus, the ratio of X1,t and X2,t
equals q12 /q22 constant.12 As saddle point equilibria are very promi-
nent in economics, we investigate this case in depth in Section 3.5. In
particular, we will go beyond the two-dimensional systems and analyze
the role of initial values.
When there are multiple eigenvalues with two independent eigenvec-
tors, Φ can again be reduced by a similarity transformation to a di-
agonal matrix. The trajectories are then straight lines leading to the
origin if the eigenvalue is smaller than one as in figure 3.5, and straight
lines leading away from the origin if the eigenvalue is larger than one
in absolute terms.
12
If q22 = 0, we take the ratio X2,t /X1,t which again defines a straight line through the
origin.
76 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
0.8
0.6
0.4
0.2
2,t
0
Y
−0.2
steady
−0.4 state
−0.6
−0.8
−1
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
Y1,t
Figure 3.5: Repeated Roots with Asymptotically Stable Steady State (λ1 =
λ2 = 0.8)
When one eigenvalue equals one whereas the second eigenvalue is smaller
than one in absolute value, we arrive at a degenerate situation. Whereas
Y1,t remains at its starting value y10 , Y2,t converges to zero so that the
system converges to (y10 , 0)0 as is exemplified by figure 3.6. Only when
y10 = 0 will there be a convergence to the steady state (0, 0). However,
(0, 0) is not the only steady state because the definition of an eigen-
value implies that A − I is singular. Thus, there exists X ∗ 6= 0 such
that X ∗ = ΦX ∗ .
In case that the first eigenvalue equals minus one, there is no conver-
gence as Y1,t will oscillate between y10 and −y10 .
1.5
0.5
Y2,t
steady
state
−0.5
−1
−1.5
−1.5 −1 −0.5 0 0.5 1 1.5
Y1,t
0.5
steady
state
Y2,t
schedule:
Y1,t+1−Y1,t = 0
−0.5
−1
schedule:
Y1,t+1−Y1,t = 0
1
0.8
0.6
0.4
0.2
0
schedule:
Y2,t+1−Y2,t = 0
−0.2
−0.4
−0.6
−0.8
−1
Figure 3.8: Complex eigenvalues with Stable Steady State (λ1,2 = 0.7 ± 0.2ı)
namics in the case of eigenvalues inside the unit circle. One can clearly
discern the oscillatory behavior and the convergence to the steady state.
Figure 3.9 displays a situation with an unstable steady state where all
trajectories move away from the steady state. Finally figure 3.10 dis-
plays a degenerate case where the eigenvalues are on the unit circle. In
such a situation the system moves around its steady state in a circle.
The starting values are (0.25, 0.25), (0.5, 0.5), (1, 1), and (1.5, 1.5).
10
−5
schedule: schedule:
Y2,t+1−Y2,t = 0 Y1,t+1−Y1,t = 0
−10
−10 −5 0 5 10
Figure 3.9: Complex eigenvalues with Unstable Steady State (λ1,2 = 1±0.5ı)
2.5
1.5
0.5
−0.5
−1
−1.5
−2
−2.5
−2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5
Figure 3.10: Complex eigenvalues on the unit circle (λ1,2 = cos (π/4) ±
ı sin (π/4))
80 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
pin down a unique solution. Thus, one has to resort to additional restrictions.
These restrictions come from the assumption that the solution must remain
bounded. In the well-behaved scenario, this will give just enough additional
initial values to determine a unique solution and the model is said to be
determinate (see also Section 1.1). Geometrically, this solution has the form
of a saddle path and the steady state is a saddle point.
A concise treatment of such models in the context of stochastic difference
equations was first given by Blanchard and Kahn (1980). Klein (2000) and
Sims (2001) provide further insights and solution approaches (see Chapter 5
for further details). As the stochastic setting delivers similar conclusions
with respect to uniqueness, we adopt the Blanchard-Kahn setup to the de-
terministic case by replacing rational expectations by perfect foresight. This
framework delivers first order linear nonautonomous difference equations:
x0 = RX0 (3.27)
or in matrix form
X10
...
x10 1 ... 0 0 ... 0
X
x0 = ... = ... . . . ... ... . . . ... k0 = (Ik , 0k×n−k )X0
Xk+1,0
xk0 0 ... 1 0 ... 0 ...
Xn0
(p)
equation Xt :
(g) (p)
Xt = X t + Xt
(p)
= QΛt Q−1 c + Xt
(p)
where the n-vector c is yet to be determined. Taking t = 0, X0 = c + X0 .
Given a particular solution, the initial values given by equation (3.27) thus
determine c only up to n − k degrees of freedom so that we are lacking n − k
additional boundary conditions.
In order to solve the boundary value problem, we partition the Jordan
form of Φ according to the moduli of the eigenvalues. Let Φ = Q−1 JQ
where J is the Jordan form of Φ and where the columns of Q consist of the
corresponding eigenvectors (generalized eigenvectors). We assume that the Φ
is hyperbolic, i.e. has no eigenvalues on the unit circle. Define the number of
possibly multiple eigenvalues strictly smaller than one by n1 and the number
of eigenvalues strictly larger than one by n2 so that n = n1 + n2 . Assume
that the eigenvalues are ordered in terms of their moduli, then define the
matrices Λ1 and Λ2 as follows:
J1 0 0 0
Λ1 = and Λ2 =
0 0 0 J2
where J1 consists of the Jordan segments corresponding to the eigenvalues
smaller than one and J2 to the segments corresponding to the eigenvalues
greater than one. We focus on the case where the zero solution is a saddle
point, meaning that there exists at least two eigenvalues λ1 and λ2 such that
|λ1 | < 1 and |λ2 | > 1 (see Definition 3.7).14 With this notation, we propose
the following particular solution:
∞
X ∞
X
(p)
Xt = QΛj1 Q−1 Zt−j − QΛ−j −1
2 Q Zt+j (3.28)
j=0 j=1
The reader is invited to verify that this is indeed a solution to equation (3.26).
The solution proposed in equation (3.28) has the property that “variables”
corresponding to eigenvalues smaller than one are iterated backwards whereas
those corresponding to eigenvalues larger than one are iterated forwards. This
(p)
ensures that {Xt } remains bounded whenever {Zt } is. The general solution
therefore is of the form
∞
X X∞
t −1 t −1
Xt = QΛ1 Q c + QΛ2 Q c + j −1
QΛ1 Q Zt−j − QΛ−j −1
2 Q Zt+j . (3.29)
j=0 j=1
14
If n1 or n2 equal zeros then the terms corresponding to the matrices Λ1 and Λ2 are
omitted.
82 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
The final step consists in finding the constant c. There are two types of
restrictions: the first one are the initial values given by equation (3.27); the
second one are the requirement that we are only interested in non-exploding
solutions. Whereas the first type delivers k restrictions because rank(R) =
k, the second type delivers n2 restrictions. Thus, c must be determined
according to the following equation system:
(p)
initial values: R c = x0 − RX0
no explosive solutions: Q(2) c = Q(21) c1 + Q(22) c2 = 0 (3.30)
(11)
−1 Q Q(12) (21) ..
where Q = (2)
, Q = Q .Q (22)
and c = (c01 , c02 )0 and
Q(21) Q(22)
where the partitioning of Q−1 and c conforms to the partitioning of the
eigenvalues. Note that Q(2) is a n2 × n matrix. Depending on whether the
number of independent restrictions is greater, smaller or equal to n, several
situations arise.
Theorem 3.8 (Blanchard-Kahn). Let Φ ∈ GL(n) be hyperbolic then the
nonhomogeneous difference equation (3.26) with initial values given by (3.27)
has a unique nonexplosive solution if and only if
R
rank = n. (3.31)
Q(2)
The solution is given by equation (3.29) with c uniquely determined from (3.30).
A necessary condition is that k = rank(R) = n1 : the number of eigenval-
ues smaller than one, n1 , must be equal to the number of independent initial
conditions, k = rank(R).
In many instances the values of the first k variables at given time, say
time zero, are given. This is the case when there are exactly k predetermined
variables in the system. The initial values then fix the first k values of c:
(p) (p) (p)
c1 = x10 − X10 where x10 and X10 denote the first k values of x0 and X0 .
Corollary 3.1. If R = (Ik , 0k×(n−k) ) with k = rank(R) = n1 , the necessary
and sufficient condition (3.31) reduces to Q(22) is invertible. Given that c1 is
fixed by the initial conditions, c2 = −(Q(22) )−1 Q(21) c1 .
If k = rank(R) > n1 then there are too many restrictions and there is no
nonexplosive solution. We may, however, soften the boundedness condition
and accept explosive solutions in this situation.
If k = rank(R) < n1 there are not enough initial conditions so that it
is not possible to pin down c uniquely. We thus have an infinite amount
3.5. BOUNDARY VALUE PROBLEM 83
15
Sunspot equilibria explore the idea that extraneous beliefs about the state of nature
influence economic activity. The disturbing feature of sunspot equilibria is that economic
activity may change across states although nothing fundamental has changed.
84 CHAPTER 3. SYSTEMS OF DIFFERENCE EQUATIONS
Chapter 4
Examples of Linear
Deterministic Systems of
Difference Equations
where the parameters δ, σ, α, φ, and λ are all positive. The variables are
all expressed in logarithms. The IS-equation represents the dependence of
aggregate demand ytd on the relative price of foreign to home goods and on
the real interest rate. A devaluation of the exchange rate2 , an increase in the
1
Rogoff (Rogoff (2002)) provides an appraisal of this influential paper.
2
The exchange rate et is quoted as the price of a unit of foreign currency in terms of
the domestic currency. An increase in et therefore corresponds to a devaluation of the
home currency.
85
86 CHAPTER 4. EXAMPLES: LINEAR SYSTEMS
foreign price level, p∗ , or a decrease in the domestic price level pt all leads to an
increase in aggregate demand. On the other hand, an increase in the domestic
nominal interest rate, rt , or a decrease in the expected inflation rate, pt+1 −pt ,
lead to a reduction in aggregate demand. A crucial feature of the Dornbusch
model is that prices are sticky and adjust only slowly. In particular, the price
adjustment pt+1 − pt is proportional to the deviation of aggregate demand
from potential output y. If aggregate demand is higher than potential output,
prices increase whereas, if aggregate demand is below potential output, prices
decrease. In particular, the price level is treated as predetermined variable
whose value is fixed in the current period. The LM-equation represents
the equilibrium on the money market. The demand for real balances, m −
pt , depends positively on potential output3 and negatively on the domestic
nominal interest rate. The model is closed by assuming that the uncovered
interest parity holds where r∗ denotes the foreign nominal interest rate. In
contrast to the price level, the exchange rate is not predetermined. It can
immediately adjust within the current period to any shock that may occur.
For simplicity, the exogenous variables y, m, r∗ , and p∗ are assumed to remain
constant.
This system can be reduced to a two-dimensional system in the exchange
rate and the price level:
1
et+1 − et = (φy + pt − m) − r∗ (4.1)
λ
α h σ i
pt+1 − pt = δ (et + p∗ − pt ) − y − (φy − m + pt ) (4.2)
1 − ασ λ
The first equation was obtained by combining the LM-equation with the
UIP. The second equation was obtained by inserting the IS-equation into
the price adjustment equation, replacing the nominal interest rate using the
LM-equation and then solving for pt+1 − pt .
The steady state of this system is obtained by setting et = ess and pt = pss
for all t and solving for this two variables:
steady state:
1
ss
1 ss
et+1 − e λ
σ et − e
= αδ α(δ + λ )
pt+1 − pss 1− pt − pss
1 − ασ 1 − ασ
ss
e −e
= Φ t (4.5)
pt − pss
• trΦ < 2 implies that the sum of the eigenvalues is less than 2;
88 CHAPTER 4. EXAMPLES: LINEAR SYSTEMS
• P(−1) > 0 finally implies that one eigenvalue, say µ1 , is larger than 1
whereas the second eigenvalue, µ2 , lies between −1 and 1.
Because the eigenvalues are distinct, we can diagonalize Φ as Φ = QΛQ−1
where Λ is a diagonal matrix with µ1 and µ2 on its diagonal. The column of
the matrix Q consist of the eigenvectors of Φ. Multiplying the system (4.5)
by Q−1 , we obtain the transformed system:
ss
ss
êt+1 −1 et+1 − e −1 −1 et − e
= Q = Q ΦQQ
p̂t+1 pt+1 − pss pt − pss
µ1 0 êt
= (4.6)
0 µ2 p̂t
êt = c1 µt1
p̂t = c2 µt2
and φ12 > 0, q12 and q22 must be of opposite sign. The same conclusion is
reached using the second equation. This reasoning also shows that q22 6= 0.
Suppose that q22 = 0 then q12 must also be zero because (φ11 − µ2 ) > 0.
This, however, contradicts the assumption that (q12 , q22 )0 is an eigenvector.
Note that although the eigenvector is not uniquely determined, its direction
and thus the slope of the saddle path is.
The Dornbusch model is most easily analyzed in terms of a phase diagram
representing the price level and the exchange rate as in figure 4.1. The graph
consists of two schedules: pt+1 − pt = 0 and et+1 − et = 0. Their intersection
determines the steady state denoted by S. These two schedules correspond
to the equations (4.4) and (4.3). The et+1 − et = 0 schedule does not depend
on the exchange rate and is therefore horizontal intersecting the price axis
at pss . Above this schedule the exchange rate depreciates whereas below
this schedule the exchange appreciates, according to equation (4.5). This
is indicated by arrows pointing to the right, respectively to the left. The
pt+1 − pt = 0 schedule is upward sloping. To its left, prices are decreasing
whereas to its right prices are increasing, according to equation (4.5). The
two schedules divide the e-p-quadrant into four regions: I, II, III, and IV. In
each region the movement of e and p is indicated by arrows. In the Dornbusch
model the price level is sticky and considered to be a predetermined variable.
Suppose that in period 0 its level is given by p0 . The exchange rate in this
period is not given, but endogenous and has to be determined by the model.
Suppose that the exchange rate in period 0 is at a level corresponding to
point A. This point is to the left of the pt+1 − pt = 0 schedule and above
the et+1 − et = 0 schedule and therefore in region I. This implies that the
price level has to fall and the exchange rate to increase. The path of e and
p will continue in this direction until they hit the et+1 − et = 0 schedule.
At this time the system enters region IV and the direction is changed: both
the price level and the exchange rate decrease. They will so forever. We are
therefore on an unstable path. Consider now an exchange rate in period 0
corresponding to point B. Like A, this point is also in region I so that the
exchange rate increases and the price level decreases. However, in contrast to
the previous case, the path starting in B will hit the pt+1 −pt = 0 schedule and
move into region II. In this region, both the price level and the exchange rate
increase forever. Again this cannot be a stable path. Thus, there must be an
exchange rate smaller than the one corresponding to point B, but higher than
the one corresponding to point A, which sets the system on a path leading
to the steady steady. This is exactly the exchange rate which corresponds
to the saddle path given by equation (4.9). In this way the exchange rate in
period 0 is pinned down uniquely by the requirement that the path of (et , pt )0
converges.
90 CHAPTER 4. EXAMPLES: LINEAR SYSTEMS
schedule:
p pt+1 - pt = 0
I
saddle path
A B
p0
II
S
ss schedule:
p et+1 - et = 0
IV
III
saddle path
e
ss e
p schedule:
pt+1 - pt = 0
saddle path
Snew
ss
pnew
schedule:
et+1 - et = 0
ss
pold
saddle path
ss ss
eold enew e0 e
The constant β is called the subjective discount rate. The period utility func-
tion U : R+ → R is continuously differentiable, increasing, strictly concave,
and, in order to avoid corner solutions, fulfills limc→0 U 0 (c) = ∞.5
The rest of the specification is exactly the same as for the Solow model
(see section 2.4.2): Output is produced according to a neoclassical aggregate
production satisfying the Inada conditions. Recognizing that investment in
period t, It equals It = Kt+1 − (1 − δ)Kt the national accounting identity
becomes:
Ct + It = Ct + Kt+1 − (1 − δ)Kt = F (Kt , Lt )
respectively,
The first order condition for the optimum is given by the Euler-equation,
sometimes also called the Keynes-Ramsey rule:
U 0 (ct ) = βh0 (kt+1 )U 0 (ct+1 ) (4.12)
Thus, the Euler-equation equates the marginal rate of transformation, 1/h0 (kt+1 ),
to the marginal rate of substitution, βU 0 (ct+1 )/U 0 (ct ).
The equation system consisting of the transition equation (4.11) and the
Euler-equation (4.12) constitutes a nonlinear difference equation system. The
analysis of this system proceeds in the usual manner. First, we compute
the steady state(s). Then we linearize the system around the steady state.
This gives a linear homogeneous difference equation in terms of deviations
from steady state. We find the solution of this difference equation using
the superposition principle. Finally, we select, if possible, one solution using
initial conditions and boundedness arguments.
saddle path
c Dc=0
c
* E
Dk=0
c0
0 k0 k
*
k
**
k
max
k
Figure 4.3: Phase diagram of the optimal growth model
4.2. OPTIMAL GROWTH MODEL 95
From this equation we can deduce that ct+1 ≥ ct when kt+1 < k ∗ and vice
versa. Thus, to left of the (∆c = 0)-schedule consumption rises whereas to
the right consumption falls. Similarly, the transition equation (4.11) implies
that kt+1 ≥ kt when ct is lower than the corresponding c∗ implied by the
(∆k = 0)-schedule. Thus, the two schedules divide the nonnegative orthant
of the (k, c)-plane in four regions. The dynamics in these four regions is
indicated in figure 4.3 by orthogonal arrows. In the region to the left of
the (∆c = 0)-schedule and above the (∆k = 0)-schedule, i.e. the north-
west region, consumption would increase whereas the capital intensity would
decrease. This dynamics would continue until the c-axis is hit. When this
happens, the economy has no capital left and therefore produces nothing but
consumes a positive amount. Such a situation is clearly infeasible. Paths with
this property have therefore to be excluded. Starting at k0 , there is, however,
one path, the saddle path, where the forces which lead to explosive paths,
respectively infeasible paths, just offset each other and lead the economy to
the steady state. This is the red line in figure 4.3.
The algebraic analysis requires the linearization of the nonlinear equation
system (4.11) and (4.12). This leads to:
kt+1 − k ∗
−1
kt − k ∗
1+µ 0 β −1
=
βU 0 (c∗ )h00 (k ∗ ) U 00 (c∗ ) ct+1 − c∗ 0 U 00 (c∗ ) ct − c∗
where we used the fact that βh0 (k ∗ ) = 1. Given that µ > 0 and U 00 (c∗ ) < 0,
the matrix on the left hand side is invertible. This then leads to the following
linear first order homogenous system:
kt+1 − k ∗ β −1 kt − k ∗
1 −1
= −1 −1
ct+1 − c∗ 1 + µ RA (c∗ )h00 (k ∗ ) −βRA (c∗ )h00 (k ∗ ) + (1 + µ) ct − c∗
kt − k ∗
=Φ (4.16)
ct − c∗
96 CHAPTER 4. EXAMPLES: LINEAR SYSTEMS
where RA (c∗ ) equals −U 00 (c∗ )/U 0 (c∗ ) > 0, the absolute risk aversion coeffi-
cient evaluated at the steady state.6 As discussed in section 3.3 and 3.4, the
dynamics of the system is determined by the eigenvalues of Φ. Denote the
characteristic polynomial of Φ by P(λ) and the corresponding eigenvalues by
λ1 and λ2 , then we have
with
−1 ∗ 00 ∗
trΦ = λ1 + λ2 = 1 + [β(1 + µ)]−1 − βRA (c )h (k )/(1 + µ) > 2
1
det Φ = λ1 λ2 = >1
β(1 + µ)
4 = (trΦ)2 − 4 det Φ = [1 − (β(1 + µ))−1 ]2
−1 ∗ 00 ∗ −1 ∗ 00 ∗
βRA (c )h (k ) βRA (c )h (k ) 2
+ −2− >0
1+µ 1+µ β(1 + µ)
−1 ∗ 00 ∗
βRA (c )h (k )
P(1) = 1 − trΦ + det Φ = <0
1+µ
where 4 denotes the discriminant of the quadratic equation and where we
used the fact that h00 < 0. The above inequalities have the following impli-
cations for the two eigenvalues:
• P(0) = det Φ > 1 finally implies that one eigenvalue, say λ1 , is larger
than 1 whereas the second eigenvalue, λ2 , lies between 0 and 1.
Next, we show that the saddle path is upward sloping, i.e. that qq12 22
> 0 as
shown by the red line in Figure 4.3. This can be verified by manipulating the
defining equations for the eigenvector corresponding to the second eigenvalue
λ2 . They are given by (φ11 −λ2 )q12 +φ12 q22 = 0 and φ21 q12 +(φ22 −λ2 )q22 = 0.
Because (φ11 −λ2 ) > 0 and φ12 < 0, q12 and q22 are of the same sign. The same
conclusion is reached using the second equation. This argument also shows
that q12 6= 0. Suppose that q12 = 0 then q22 must also be zero because φ12 =
−1
1+µ
< 0. This, however, contradicts the assumption that (q12 , q22 )0 is an
eigenvector. Note that although the eigenvector is not uniquely determined,
its direction and thus the slope of saddle path is.
Taxation of Capital
Suppose that the government levies a proportional tax on the gross return
to capital. For simplicity, we assume that the revenues from the tax are just
wasted. Therefore only the Euler equation (4.12) is affected. The new Euler
equation then becomes
where τ is the tax rate with 0 < τ < 1. The transition equation for capital
is not altered. The new ∆c = 0 schedule then is
new Dc=0
c schedule saddle path corresponding
to new steady state
c0
*
c old Eold Dk=0
*
c
Enew
0
*
k new k0 k
max
k
Figure 4.4: Phase diagram of the optimal growth model with distortionary
taxation of capital
This implies that an increase in the tax rate τ will lower the steady state
capital stock. The tax is thus distortionary.
∆k = 0 : c∗ = f (k ∗ ) − (δ + µ)k ∗ − g.
unstable path
* E0 old: Dk=0
c0
c1 g
c0
* new: Dk=0
c1
E1
0 k1 k
*
k
Figure 4.5: Phase diagram of the optimal growth model with government
expenditures
4.3. THE NEW KEYNESIAN MODEL 101
where yt , πt , and it denote income, inflation and the nominal interest rate, all
measured as deviations from the steady state. ut is an exogenous cost-push
shock. Furthermore, we assume that σ > 0, κ > 0, and 0 < β < 1. In
addition, we take an aggressive central bank, i.e. φ > 1.
This system can be solved for (yt+1 , πt+1 )0 by inserting the Taylor-rule
and Phillips-curve into the IS-equation:
πt+1 1 1 −κ πt −ut /β
Xt+1 = = +
yt+1 β (φβ − 1)/σ β + κ/σ yt ut /(σβ)
= ΦXt + Zt+1 (4.22)
with
1 κ
trΦ = λ1 + λ2 = 1 + + >2
β σβ
1 κφ
det Φ = λ1 λ2 = + >1
β σβ
2
2 1 κ κ 2
4 = (trΦ) − 4 det Φ = 1 − + + 2 + − 4φ
β σβ σβ β
κ
P(1) = (1 − λ1 )(1 − λ2 ) = (φ − 1) > 0 if φ > 1
σβ
First assume that φ is high such that 4 < 0. In this case we have two complex
conjugate roots. Because det Φ > 1, they are located outside the unit circle.8
Alternatively assume that φ is small enough such that 4 > 0. In this case
both eigenvalues are real. Using the assumption φ > 1, P(1) > 0. Thus,
both roots are either greater or smaller than one. They cannot be smaller
than one because trΦ > 2. Thus, in both cases we reach the conclusion
that the eigenvalues are outside the unit circle. As both variables are non-
predetermined, the boundedness condition (3.30), Qc = 0 which is equivalent
to c = 0, then determines the unique solution:
∞ −j
X λ1 0 −1 ut−1+j /β
Xt = Q Q
j=1
0 λ−j2 −ut−1+j /(σβ)
e = λ1 + λ2 = 1 + 1 + κ > 2
trΦ
β σβ
1
det Φe = λ1 λ2 = > 1
β
2
2 1 κ κ 2
4 = (trΦ) − 4 det Φ = 1 −
e + +2+ >0
β σβ σβ β
κ
P(1) = (1 − λ1 )(1 − λ2 ) = − < 0.
σβ
The discriminant is now unambiguously positive so that both eigenvalues are
real. Moreover, P(1) < 0 so that one eigenvalue is smaller than one and the
other bigger than one. Thus, the boundedness condition does not determine
a unique solution. Instead there is a continuum of solutions indexed by c1
and we are faced with the case of indeterminacy. The implications of this
indeterminacy for monetary policy and possible remedies are discussed in
Galı́ (2011).
8
Another way to reach this conclusion is by observing that the real part of the roots is
trΦ
2 > 1.
Chapter 5
Φ1 Et Xt+1 = Φ0 Xt + Zt , t = 0, 1, 2 . . . , (5.1)
where {Xt } and {Zt } are real-valued n-dimensional stochastic processes de-
fined on some probability space (Ω, F, P). The random variables Xt and
Zt are measurable with respect to the σ-algebra Ft = σ{(Xs , Zs ) : s ≤ t}.
This makes the sequence {Ft } a filtration adapted to {Xt } and {Zt }. In eco-
nomics Ft is also called the information set. Et then denotes the conditional
expectation with respect to Ft . As expectations are based on current and
past Xt ’s and Zt ’s only and not on extraneous variables, we have eliminated
the possibility of sunspot solutions.
Expectational difference equations of the type (5.1) arise typically in the
context of rational expectations models. Starting with the seminal paper by
Blanchard and Kahn (1980), an extensive literature developed which analyzes
the existence and nature of its solutions. The most influential papers, at least
for the present exposition, are Gourieroux et al. (1982), Klein (2000), Sims
(2001), among others. There is no loss of generality involved by confining
the analysis to first order equations as higher order equations can be reduced
to first order ones by inflating the dimension of the process (see Binder and
Peseran, 1994).
In the following, Φ1 is not necessarily invertible. Thus, we allow for the
possibility that some equations do not involve expectational terms. The
103
104 CHAPTER 5. STOCHASTIC DIFFERENCE EQUATION
stochastic theory developed below is therefore more general than its deter-
ministic counterpart.
To make the problem tangible, we consider only a certain class of so-
lutions. In particular, we require that the exogenous input process {Zt } is
bounded in Lp , p > 1.1 This means that supt kZt kp < ∞. This assumption
implicitly restricts the solution processes to be bounded as well.
Remark 5.1. The class of stationary processes is the prime example of such
processes as the expected value and the variance remain constant. In many
applications, {Zt } is specified as an ARMA-process.2
Throughout the analysis we follow King and Watson (1998) and assume
that the linear matrix pencil Φ1 z + Φ0 is regular:3
As was already pointed out by Blanchard and Kahn (1980), the notion of
a predetermined variable or process is key for understanding the nature of the
solution.4 Following Klein (p.1412, 2000), we adopt the following definition.
(i) The process of expectational errors {ηt } with ηt+1 = Kt+1 − Et Kt+1 is
an exogenous martingale difference process;
(ii) K0 ∈ F0 is exogenously given.
This definition is more general than the one given in Blanchard and Kahn
(1980) who require that Kt+1 = Et Kt+1 .
Finally, note that the superposition principle still applies in this context.
(1) (2)
Suppose that there exists two solutions {Xt } and {Xt } then {Xt } =
(1) (2)
{Xt − Xt } satisfies the homogeneous stochastic expectational difference
equation
Φ1 Et Xt+1 = Φ0 Xt .
Thus, the general solution of equation (5.1) is
(g) (p)
X t = Xt + Xt
(g)
where Xt denotes the general solution to the homogeneous equation and
(p)
Xt a particular solution to the nonhomogeneous equation.
|φ| < 1: In this case, the representation (5.4) implies that the solution fol-
lows an autoregressive process of order one. This representation admits
a causal representation with respect to the
P expectational errors. This
representation is given by Xt = φt X0 + t−1 j=0 φj
η t−j and is bounded.
However, if {Xt } is not predetermined, there is no starting value X0
and we are faced with a situation of indeterminacy because any mar-
tingale difference sequence defined with respect to Ft would satisfy the
difference equation.
one obtains:
As we are looking for solutions which remain bounded, this suggests to take
∞
X
−1
Xt = −φ φ−j Et Zt+j (5.6)
j=0
This solution, however, makes only sense when {Xt } is predetermined so that
X0 is given.
α−1 mt
Et pt+1 = pt + , α < 0, (5.7)
α α
108 CHAPTER 5. STOCHASTIC DIFFERENCE EQUATION
where {mt } is now a stochastic process and where the expected price level is
replaced by the conditional expectation of the logged price level.
Given that α < 0, the coefficient of pt , φ = (α − 1)/α, is positive and
strictly greater than one. Therefore the only bounded (stationary) solution
(g)
to the homogeneous equation is {Xt } = 0 and a particular solution can be
found by forward iteration. Thus the solution is:
∞ j
1 X α
pt = Et mt+j . (5.8)
1 − α j=0 α − 1
This shows that the relation between the price level and money supply de-
pends on the conduct of monetary policy, i.e. it depends on the autoregressive
coefficient a. Thus whenever the monetary authority changes its rule, it af-
fects the relation between pt and mt . This cross-equation restriction is viewed
by Hansen and Sargent (1980) to be the hallmark of rational expectations.
It also illustrates that a simple regression of pt on mt can not be considered a
structural equation, i.e. cannot uncover the true structural coefficients (α in
our case), and is therefore subject to the so-called Lucas-critique (see Lucas
(1976)).
A similar conclusion is reached if money supply follows a moving average
process of order one instead of an autoregressive process of order one:
εt = mt − θmt−1 + θ2 mt−2 − . . .
and
Φ1 invertible
The invertibility of Φ1 implies that we can rewrite equation (5.1) as
Et Xt+1 = ΦXt + Z̃t , t = 0, 1, 2 . . .
where Φ = Φ−1 −1
1 Φ0 and Z̃t = Φ1 Zt . Let us further assume that Φ is diagonal-
izable with Φ = QΛQ−1 , Λ diagonal. As in the discussion of the deterministic
case in Section 3.5, we partition Λ as
Λ1 0
Λ=
0 Λ2
110 CHAPTER 5. STOCHASTIC DIFFERENCE EQUATION
such that the eigenvalues in Λ1 are strictly inside the unit circle whereas those
in Λ2 are strictly outside the unit circle. We disregard the case of eigenvalues
on the unit circle.
We make the following assumption with respect to the dimension of Λ1
and Λ2 .
Following the logic of the discussion in Section 5.2, the unique bounded
(2)
solution for {Yt } is
∞
X ∞
X
(2)
Λ−j
(2)
Yt = −Λ−1
2
−1
2 Et Z̄t+j = −Λ2 Λ−j
2 Et Q(21) (1)
Z̃ t+j + Q(22) (2)
Z̃ t+j
j=0 j=0
X∞
= −Λ−1 Λ−j
2 2 Q(21) Q(22) Et Z̃t+j . (5.9)
j=0
5.3. THE MULTIVARIATE CASE 111
Turn next to the first part of the decoupled equation. Note that the
(1)
predetermined variable Xt satisfies the identity:
(1) (1) (1) (1) (2) (2)
Xt+1 − Et Xt+1 = Q11 Yt+1 − Et Yt+1 + Q12 Yt+1 − Et Yt+1 = ηt+1 .
(1)
Inserting this equation into the equation for Yt+1 , we get
(1) (1) (1) (1)
Yt+1 = Et Yt+1 + Yt+1 − Et Yt+1
(1) (2) (2)
= Λ1 Yt + Z̄t (1) + Q−1
11 η t+1 − Q −1
11 Q 12 Yt+1 − Et Y t+1
(1)
= Λ1 Yt + Z̄t (1) + Q−1 −1
11 ηt+1 − Q11 Q12 εt+1 (5.10)
(2) (2)
where εt+1 = Yt+1 − Et Yt+1 is an exogenous martingale difference process.
Thus, equation (5.10) is a first order autoregressive scheme with starting
value given by
(1) (k) (2)
Y0 = Q−1 11 X 0 − Q 12 Y 0 .
Equations (5.10) and (5.9) determine the solution for Yt . This step in the
derivation is only valid if Q11 is invertible. Otherwise, we could not determine
(1) (1)
the initial values of Yt from those of Xt and there would be a lack of initial
values for Yt .6 Hence, Assumption 5.3 is not sufficient for the uniqueness of
the solution. In addition, we need the following assumption.
Assumption 5.4. Q11 is nonsingular.
Finally, the solution for Yt can be turned back into a solution for Xt by
multiplying Yt by Q.
We can get further insights into the nature of the solution by assuming
that {Z̃t } is a causal autoregressive process of order one:
Z̃t+1 = AZ̃t + ut+1 , ut ∼ WN(0, σ 2 ) and kAk < 1
where {ut } is exogenous. This specification implies that Et Z̃t+j = Aj Z̃t ,
j = 1, 2, . . . Inserting this into equation (5.9) we find that
∞
X
(2)
−Λ−1 Λ−j Q(21) Q(22) Aj Z̃t = M Z̃t
Yt = 2 2
j=0
−1
P∞ −j (21) (22)
j (1)
where M = −Λ2 j=0 Λ 2 Q Q A . The solution to Yt then can
be written as
(1) (1)
Yt+1 = Λ1 Yt + Z̄t (1) + Q−1 −1
11 ηt+1 − Q11 Q12 M ut+1 .
6
See Klein (2000, section 5.3.1) and King and Watson (2002) for details and examples.
112 CHAPTER 5. STOCHASTIC DIFFERENCE EQUATION
Remark 5.3. The above derivation remains valid even if the matrix Φ is not
diagonalizable. In this case, we will have to work with the Jordan canonical
form instead (see Section 3.2.2).
Remark 5.4. The derivation excluded the possibility of roots on the unit
circle.
Φ1 singular
In many practical applications, Φ1 is not invertible so that the procedure just
outlined is not immediately applicable. This, for example, is the case when
a particular equation contains no expectations at all which translates into a
corresponding row of zeros in Φ1 . One way to deal with this problem is to
take a generalized inverse of Φ1 and proceed as explained above.
The most appropriate type of generalized inverse in the context of dif-
ference equations is the Drazin-inverse (see Campbell and Meyer, 1979, for
a comprehensive exposition). This generalized inverse can be obtained for
any n × n matrix A in the following manner. Denote by IndA the smallest
nonnegative integer k such that rankAk = rankAk+1 . This number is called
the index of A. Then the following Theorem holds (see Theorem 7.2.1 in
Campbell and Meyer, 1979).
With this Theorem in mind, we can now decouple the system in two
parts. The first part will be similar to the case when Φ1 is invertible. The
second one will correspond to the singular part and will in some sense solve
5.3. THE MULTIVARIATE CASE 113
out the expectations. Multiply for this purpose equation (5.1) from the left
by (zΦ1 + Φ0 )−1 where assumption 5.2 guarantees that the inverse exists for
some number z. Thus, we get
Applying the law of iterated expectations and multiplying the equation from
the left by N k−1 gives
e (2) = (In2 − zN )2 N k−2 Et X
0 = N k Et X e (2)
t+k t+k−2
3 k−3
= (In2 − zN ) N Et X e (2)
t+k−3
= ...
et(2)
= (In2 − zN )k Et X
et(2) .
= (In2 − zN )k X
Complex Numbers
These two operations will turn C into a field where (0, 0) and (1, 0) play
the role of 0 and 1.1 The real numbers R are embedded into C because we
identify any a ∈ R with (a, 0) ∈ C.
The number ı = (0, 1) is of special interest. It solves the equation x2 +1 =
0, i.e. ı2 = −1. The other solution being −ı = (0, −1). Thus any complex
number (a, b) may be written as (a, b) = a + ıb where a, b are arbitrary real
numbers.2
1
Substraction and division can be defined accordingly:
2
A more detailed introduction of complex numbers can be found in Rudin (1976) or
any other mathematics textbook.
115
116 APPENDIX A. COMPLEX NUMBERS
z=a+ib
b
i
1
r
imaginary part
−1 θ a
0
1
−1
−i
−b
unit circle: z=a−ib
a2 + b2 = 1
−2
−2 −1 0 1 2
real part
z = a + ıb Cartesian coordinates
ıθ
= re = r(cos θ + ı sin θ) polar coordinates.
eıθ + e−ıθ a
cos θ = = ,
2 r
eıθ − e−ıθ b
sin θ = = .
2ı r
Further implications are de Moivre’s formula and the Pythagoras’ theorem
(see Figure A.1):
n
de Moivre’s formula reıθ = rn eınθ = rn (cos nθ + ı sin nθ)
Pythagoras’ theorem 1 = eıθ e−ıθ = (cos θ + ı sin θ)(cos θ − ı sin θ)
= cos2 θ + sin2 θ
3
The notation with “−φj z j ” instead of “φj z j ” was chosen to conform to the notation
of AR-models.
118 APPENDIX A. COMPLEX NUMBERS
Appendix B
Matrix Norm
Thus the induced matrix norm is the maximum amount a vector on the unit
sphere can be stretched. The matrix norm induced by the Euclidian vector
norm is given by: p
kAk = max kAxk = ρ(A0 A)
kxk=1
where ρ(A0 A) is the spectral radius of A0 A, i.e. ρ(A0 A) = max{|λ| : λ is an eigenvalue of A0 A}.
Another convenient matrix norm is the Frobenius norm, sometimes also
called the Hilbert-Schmidt or the Schur norm. It is defined as follows:
n
X n
X
kAk2 = |aij |2 = tr(A0 A) = λi
i,j i
where λi are the eigenvalues of A0 A. Thus the Frobenius norm stakes the
columns of A into a long n2 -dimensional vector and takes its Euclidian norm.
The matrix norm has the following properties:
119
120 APPENDIX B. MATRIX NORM
Bibliography
121
122 BIBLIOGRAPHY
H. Moore. Economic Cycles: Their Law and Cause. MacMillan, New York,
1914.