GMM Estimation and Inference in Dynamic Panel Data
GMM Estimation and Inference in Dynamic Panel Data
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
http://about.jstor.org/terms
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted
digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about
JSTOR, please contact [email protected].
Cambridge University Press is collaborating with JSTOR to digitize, preserve and extend access to
Econometric Theory
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
Econometric Theory, 25, 2009, 1348-1391. Printed in the United States of America.
doi: 10. 1017/S026646660809053 1
Hugo Kruiniger
Queen Mary, University of London
1. INTRODUCTION
I thank Steve Bond and Frank Windmeijer for kindly making one of their computer programs available to me.
I also thank Liúdas Giraitis, three anonymous referees, a co-editor, and the editor for very helpful comments. This
research was funded by the ESRC under grant R000239 139. Address correspondence to Hugo Kruiniger, Department
of Economics, Queen Mary, University of London, Mile End Road, London El 4NS, United Kingdom; e-mail:
[email protected].
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1349
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1350 HUGO KRUINIGER
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1351
are based on System estimators that use a restricted estimator of the weight ma-
trix that is optimal under the null have correct size. This paper therefore offers
a theoretical justification for using LM tests in the context of panel AR models.
Finally the paper proposes two LM-type panel unit root tests and studies their
properties.
The paper is organized as follows. In Section 2 we review GMM estimation of
the panel AR(1) model. In Section 3 we derive local asymptotic approximations to
the finite-sample distributions of the AB, Arbov, and System estimators under var-
ious assumptions about the initial conditions, i.e., under various asymptotic plans
for S and N. Section 4 proposes two LM-type panel unit root tests and inves-
tigates their properties both analytically and through Monte Carlo experiments.
Section 5 concludes. Proofs are given in the Appendix.
A few words on notation. We use the symbol - > to signify convergence in dis-
tribution, the symbol plim to signify convergence in probability, and the symbol
q-%' to signify convergence in quadratic mean. To state multi-index asymptotic
results we make use of the following notation (for definitions of the underlying
concepts, see also Phillips and Moon, 1999): 'ims,N-^oo,seq N~] S/Li */,* = xt is
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1352 HUGO KRUINIGER
Var(v/,i -///) =
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1353
Note that the FS model does not rule out that mean-stationarity, i.e., assumption
(8), holds.
Arellano and Bond (1991) proposed a GMM estimator that exploits the following
m = (T - l)(T - 2)/2 linear moment conditions:
where rriAB,sAP) = yiA&yu ~P&yi,t-') and Ay,-,r = y¡,t -yi,t-'- These mo-
ment conditions are implied by assumptions (4)-(6). Note that they do not identify
the unit root because E(yZ)i A;y/,,_i) = E( j/^e/^-i) = 0 when p = 1.
Arellano and Bover (1995) noted that if mean-stationarity, i.e., assumption (8),
holds also, one can add T - 2 linear moment conditions to those in (10):
where m Ar bov,s, tip) = (yi,t - W,/-i)Ay/fJ. A GMM estimator that only exploits
the latter m moment conditions will be referred to as an Arbov estimator.
Lety/ = [y/,i ... yi%t' and let Z/ =diag(j/,...,y/r"2) bea (T - 2) x m block-
diagonal matrix. Then we can write the set of AB moment conditions in (10)
as E(ZjAi),-) = 0 where Av¡ = v¡ - o/,-i = [Ao/,3 ... Avij]'. Under our as-
sumptions, E(Au¿ Av¡)/a2 = //, where // = //r-2 is a (7 -2) band-diagonal
matrix with 2's on the main diagonal, - l's on the first sub- and superdiagonal,
and zeros elsewhere. It follows that the AB GMM estimator that uses Wm,ab' =
(N~l X/Li Z¡HZi)~l as weight matrix is an optimal one-step GMM estimator.
This estimator is denoted as pab'> In this paper {W^} denotes an arbitrary se-
quence of PD weight matrices with plimyv^oo^v = W, where W is PD. An AB
estimator based on Wn is simply denoted as pab-
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1354 HUGO KRUINIGER
There exist no feasible optimal one-step weight matrices for the Arbov
and System estimators, except when ajj = 0. Let Ayj = [Aj/,2 ... A >>/,,], let
Z' = y¡, 1 It -2, where It -2 is an (T - 2) identity matrix, let Z77 = diag( Ayf, . . . ,
Ayf~l) be a (T - 2) x m block-diagonal matrix, and let Zf = diag(Z/, Z/7) be
a 2(T - 2) x (r - 2 + ra) block-diagonal matrix. When a2 = 0, optimal one-step
weight matrices for the Arbov and System estimators are given by W^^rboui =
(AT^Z^'Z/7)-1 and Wn,Sysi = (N~l 1^, Z?'AZfy' respectively,
where A is given by
'H C
[C It-i'
where C = Ct-2 is a (T - 2) x (T - 2) matrix with ones on the main diagonal,
- l's on the first subdiagonal, and zeros elsewhere. Note that C = E(uz Av¡)'/a2.
The one-step GMM estimators based on W^^Arbov] and Wn^sysì will be re-
ferred to as the Arbov 1 estimator and the SYS1 estimator, respectively. Finally,
let WN9sYSib = (N-1 Iti Zi'ZfTX-
The "optimal" two-step System (S YS2) estimator is based on the weight matrix
Wn,sys2(Pi) = (N-] Iti Zi''A»'i S/]'[a5 v¡]Zf)-' with Ait = ¿¿¡(pi) =
Ay, - pi A y¡y-' , Si = y¡ - piy¡9-', and, unless stated otherwise, p¡ = p' , where
^1 is an initial V^-consistent estimator for p, i.e., 'ÍÑ(p' - p) - » /C # 0. We
define the "optimal" two-step AB (AB2) and Arbov (Arbov2) estimators and
their weight matrices analogously. The W^^YSk matrices (k = 1,2) can be parti-
tioned as
'WN,SYSk,'' WnjYSIcM
Wu,SYSk =
|_WyV,SrS*,21 WN,SYSk,22_
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1355
Here we investigate the asymptotic properties of various AB, Arbov, and System
estimators when p is close to unity under two different asymptotic schemes:
5, Af - » oo sequentially with S - > oo first (the CS model) and S, N -> oo
simultaneously with S/Nd - » c* > 0, where ¿/ > 0. The latter scheme includes
N -> oo, and 5 is fixed (the FS model) as special case for d = 0. Note that the
model assumes that T is fixed.
We will see that the estimators suffer from some kind of weak instruments
problem when p is close to unity, with the exact nature of the problem depend-
ing on the distributional properties of the initial conditions. Thus p is not only
the parameter of interest but also determines the "strength" of the instruments.
Conventional fixed parameter asymptotics yields a poor approximation to the
finite-sample distribution of the estimators when p is close to unity because of
the fact that the weak instruments problem vanishes when Af increases. To ensure
that the weak instruments problem is captured by the asymptotics, we let p tend
to unity as a function of Af when AT -> oo in such a way that asymptotically p
is local-to-nonidentified. That is, our asymptotics will make use of parameter se-
quences such as p = 1 - X N~g where X > 0 and '<g<'. The rate Af£ at which
p tends to unity depends on the distributional properties of the initial conditions.
Note that the parameter sequences p = 1 - 1 N~8 are introduced to obtain better
asymptotic approximations to the finite-sample distribution of the estimators for
p when the instruments are weak and not because of interest in local-to-unit-
root models. Thus the motivation for considering these parameter sequences is
different from that in the time series literature. Moon and Phillips (2000) have
considered consistent panel data estimators for the localizing parameter y < 0 in
p - exp(y / T) assuming that T grows large.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1356 HUGO KRUINIGER
In the analysis the ratios Var(yift - ///)/cr2 play an important role. The panel
AR(1) model implies the following expression for Var( j/,i - /¿,-) when 'p' < 1:
It is easily verified for any 5 and p > 0 that d[Var(;y/,i - jUi)]/dp > 0
and limpai Var(y/fi - Mi) = Var(yits -//;) + 0 + S)ff2. Moreover, lims_>oo
Var( vfji - /ii) = <t2/(1 - p2). It follows that when 5 is large and p tends to one,
the ratio Var(y/i - ¡ii)lo2 becomes large. Indeed, lim^ilims^ooVariy/,! -
m) = 00. However, when 5 is fixed lim^i Var( v/j - ju¡) < 00.
When the CS model holds, lim^i Var(y/,r)/cr2 = lim^i [Var( y¡tt - jui)/a2 +
rf/a2] = limpTi[l/(l - p2)] + a2/(T2 = 00 for t = 1,...,7' Furthermore,
E[(A^_i)2] = 2<72/0 +p) and Ei-^Av^-i) = (1 - /Op'-^EKy/,, -
Mi + Mi)(yi,s - /*/)]'= <720 - P)P1~2~' 7(1 - P2) when s < t - 2. The latter
equalities imply that 'imp<t'E(dmAB,sj/dp) = limp>piE(- v/^Av/^-i) = tr2/2
for s <t - 2 and lim^i E(dmArbou,s,t/dp) = limati E(- y,-f/_i Ay,-,,) = -cr2/2
when 5 < / - 1 .
Consider now the simple first-stage regression A yiit-i = tí v/)iS +cy; (5 < í - 2),
which corresponds to a typical AB moment condition. Then we find that
limp^iplim^^oo^r = lim^iplim^^ooŒi^)"1!/^^/,/-! =0. Further-
more, if p = 1 - XN~' the first-stage F-statistic FAB = n2 N X£¿ y2s) / '£/
(Ay/f/-i -îy/tJ)2 = Op(l) and *- = [EÍ^W^Íy/,, Ay/,/-i) = O^"1). The
expression Fab - 1 provides an estimate of the so-called concentration param-
eter, which is a measure of the strength of the instrument(s); see, e.g., Stock,
Wright, and Yogo (2002) for details. The fact that FAb = Op(l) suggests
that there is a potential weak instruments problem. Note that if one would
choose the parameter sequence p = 1 - ÀN~Ì/2, then n = O(N~l/2) but
Fab = Op(N1/2). Thus when the CS version of the model applies and p is suf-
ficiently close to unity, the AB GMM estimator suffers from some sort of weak
instruments problem, albeit not from one of the Staiger and Stock (1997) type
where Fab = Op(') when n = O(N~1/2). In this case the weak instruments
problem arises because lim^iEiv2^ - > 00, whereas lim^iEiy/^Ay/^-i) =
-<72/2 t¿ 0. Considering the multiple first-stage regressions Ay;,,_i = X^L2
Xkyi,k + (Oi for t = 3, . . . , T leads to the same conclusion because T is fixed.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1357
Next consider the simple first-stage regression y¿yí-' = 7cAyiiS + a>i (s <
t - 1), which corresponds to a typical Arbov moment condition. Clearly, lim^i
plimw->oo7? = 1/2 7^0. Nonetheless, if p = 1 - kN~' the first-stage F-statistic
FArbov = ^2^I/(Aj/,,)2/I/(^/,/-i -*&yi,s)2 = Op(l), even though n =
0(1). The reason for this finding is that when 5 - > oo, E(yft_{) oc 1/(1 - p) and
hence plim;v->ooN~2Z/(}7,í-i -nàyi,s)2 = cr2/(2k) if p = 1 -kN~K However,
lim^i E( v/,í_i Av/,5) = <72/2 ^ 0.
The preceding analysis suggests that when the CS model holds and p ap-
proaches one the finite-sample distributions of the AB, Arbov, and System esti-
mators are affected by exploding higher order moments and that one should not
use the local-to-zero asymptotics of Staiger and Stock (1997) to obtain approxi-
mations to these distributions. Under covariance stationarity not only Variy^i) a
1/(1 - p) but also Var(y/,iAv/,2) oc 1/(1-/9), Varíy^Ay^) oc 1/(1 -p),
Var(y/jA£/,3) oc 1/(1 - p) and Cov(y/,i Ae/,3, y/,iAy/,2) oc 1/(1 - p); see
Lemma A.2 in the Appendix. In this case one can obtain local (-to-nonidenti-
fication) approximations to the distributions of the estimators by choosing a
parameter sequence such that the variances of the sample covariances of the
instruments and the differenced regressors become 0(1), i.e., by choosing
p = '-ÀN~l. Under this sequence Var(yzj) = O(N). A large value of Var(y;5i)
may or may not be an accurate description of the data.3
We have the following local (-to-nonidentification) asymptotic results for the
AB and Arbov estimators.
THEOREM 1. Let the CS model hold, T = 3,andp = '- AN~l with X > 0.
Then, as N -» oo,
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1358 HUGO KRUINIGER
a =b = 2.25, the density at the second mode is only about 0.2% of the density
at the primary mode. A recent discussion of the possible bimodality of the (limit)
density of instrumental variables (IV) estimators can be found in the papers by
Forchini (2006), Hillier (2006), and Phillips (2006).
The local asymptotic results of Theorem 1 have been obtained under the
assumption of covariance stationarity. However, if we take k - 0 in the local
asymptotic distribution of the AB estimator, we obtain its distribution for p = 1 ,
despite the fact that we need to condition on the initial conditions in this case.4
Let p = 1 and T = 3. Then it can easily be shown that as (S, N - > oo)^,5
and
LEMMA 1 . Let the CS model hold, T >3,andp = '- XN~l with I > 0. Let
N -> oo. Then
(0 (D N-^^Zl'AviÂXsu
N-WZ^zP'AviÂXsiand
N-l%tLiZ¡>'Ayi9-iA Xawith
X5 = (X'5l r52y - tf(0, ¿55), Xe = (X'6l X'62y^ W(¿6, 166), and
E(X5X¿) = ¿56 t¿ 0, where ¡L^y ¿55, I56, and ¿66 ore given in the
proof;
(2) whenT = 3, X5 = X5Ì =a2X'/VJand X6 = X6] =a2X2/'/X where
X' and X2 are defined in Theorem 1.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1359
THEOREM 2. Leí í/*¿ C5 model hold, T >3,andp = '- ÀN~{ with X > 0.
Létf N -> 00. r/z^w
PAB-pl X'6ìWnXSÌ/X'6ìWuX6ì.
Parts (ii) of Lemma 1 and Theorem 2 imply that because X% is Gaussian and
hence Xj _L Xg, i.e., X7 and Xg are independent, the correct local asymptotic stan-
dard errors of Arbov estimators are given by (E(X¿ WÉ77 WXS/(X^WXS)2))0-5.
Thus when the data are CS the conventional asymptotic standard errors of Arbov
estimators are inconsistent under p = '-AN~l. Moreover, because the truly local
asymptotically optimal weight matrix is proportional to I^1 , the second result in
Lemma l(iv) implies that the conventional estimator of the optimal weight matrix
for the Arbov estimator is also inconsistent under these local asymptotics.6 How-
ever, the Arbov estimator that uses WM,Arbov2(p') as weight matrix is still consis-
tent, although when T > 3 its local asymptotic distribution is different from that of
the truly optimal Arbov estimator. Finally, if W 1 (Xf7 X's)' and p - 1 - kN~l 9
the local asymptotic distribution of the Arbov estimator is symmetric and, pro-
vided T > 3, the Arbov estimator is unbiased under these local asymptotics. In
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1360 HUGO KRUINIGER
contrast, when the data are CS and p = 1 - kN~' E(X$Xf6) / 0; i.e., the cross-
products of the instruments and the regressors in the AB estimator remain corre-
lated with the cross-products of the instruments and the dependent variables when
N - > oo, thereby causing the bias of this estimator.
Note that the set of m + T - 2 System moment conditions in E(Zf '[Av¡ v'Í) = 0
can be rewritten as E(Zf'[Aü¡ v¡]') = 0, where Zf = diag(Z^, Z[) is a
2(T - 2) x (m + T - 2) matrix. We have the following results for the System
estimator.
THEOREM 3. Let the CS model hold, let T > 3, and let p = 1 - ÁN~l with
A>0: Let N -> oo. Let Wsys* = diag(Or_2, Im), where Ot-2 is a (T -2) x
(T - 2) null matrix. Finally, let D = E[diag(e?, . . . , ej~x)fi A^J/a2. Then
^(psYS2-p)^X'nWSYS2Xi0/X'uWsYS2Xn
with Xio = (^5! X'-j)' and Xn = (0 X^)r, w^re ¿77, Í51, X7> flnrf X8 «^
defined in Lemma 1.
(iv) assuming that psYS exploits E(¿f'[Av¡ v¡]') = 0 in lieu of
E(Zf'[AD>;]') = 0,
PSYS-p^(X'6lWuX5ì+X'SÌW3lX5ì)/X'nWXì2
with X'2 = (X'6l 0 X'siy, where X51, À61, û«J Xgi «r^ defined in Lemma 1.
Part (iv) of Theorem 3 implies that when the data are CS, a System estimator
that uses a weight matrix estimator that has a PD probability limit is inconsis-
tent under p = 1 - ÀN~l. Moreover, the conventional estimator for the optimal
weight matrix for the System estimator, i.e., Wm,sys2ÌP'), and the conventional
asymptotic standard errors of System estimators are also inconsistent under
such asymptotics. Nevertheless, the System estimators that use Wn^ysi and
Wm,sys2ÌP'), respectively, as weight matrix are still consistent under such
asymptotics although the local asymptotic distribution of the two-step System
estimator is asymmetric because E(X5'XfSÌ) ^ 0. The truly optimal System es-
timator has the same local asymptotic distribution as the truly optimal Arbov
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1361
estimator, which is symmetric. Thus the local asymptotic distribution of the truly
optimal System estimator is not affected by the AB moment conditions.
The preceding results suggest that Wald tests based on the usual two-step Sys-
tem estimator will have incorrect size when the data are CS and p is close to one
as a consequence of its asymmetric distribution. On the other hand, LM tests that
are based on Arbov or System estimators that use a restricted estimator of the
weight matrix that is optimal under the null, e.g., Wu^rboviip) and W^^YSiip)^
respectively, have a standard normal asymptotic distribution under the null both
when first-order fixed-parameter asymptotics and when local asymptotics are em-
ployed because of the fact that X-j _L X%? Therefore, such LM tests will have
rejection probabilities very close to their nominal size.
THEOREM4. LetT = 3 andlet p = 1-XN~* with X > 0 andO < g < '.Let
0 < d < g and Agd = 1/2(1 - d) - (g - d). Furthermore, let N -> oo, S -> oo
simultaneously with S/N -> c > 0, S/N^ -> c+ > 0, and S/Nd -> c* > 0. Let
í(A,c*) = 2A(S+l) ifd = O, q(X,c*) = 2Xc* ifO <d <g, and q(X,c*) =
q(X,c*) if d = g. Finally, let crMy = E(y/,i///) and o] = E( y?j). Then
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1362 HUGO KRUINIGER
Note that Agd = 1/2(1 +d) -g and that Ag£/ < 1/2. When the values of d and
g are such that Agd = 0, p is weakly identified by the AB moment condition(s).
When Agd > 0, p is nearly weakly identified, and when A¿¿ < 0, p is nearly
nonidentified by the AB moment condition(s).
When 0 < d < 1 and Ag£/ = 0, there are two extreme cases: if d = 0, then one
obtains the local-to-nonidentification asymptotic distribution of pab for g = 1/2.
This case corresponds to local-to-zero asymptotics; cf. Staiger and Stock (1997).
On the other hand, if d = 1 , then one obtains the local-to-nonidentification asymp-
totic distribution of pab for g = 1. When Agd = 0 and the value of d increases
from zero to one, both the signal, E(y¡t' Ay/2) = O{Nd~g), and the noise of the
AB moment function, (Var(y/i Ay/,2))1^2 = O(N^^d), become stronger.
Some of the diagonal path local (-to-nonidentification) asymptotic results of
Theorem 4 are very similar to the sequential local asymptotic results of Theorem
1, which were obtained for g = 1. If d = 1 and Agd = 0 (so that g = 1), then
q(À, c*) = q(À, c) and the sequential local (-to-nonidentification) asymptotic dis-
tributions of pab and pArbov as (5, Af - » 00)^ could be obtained from Theorem 4
by letting c - > 00 so that q(À, c*) = q(À, c) = q(À, c+) = 1 . Furthermore, when
d > 1 diagonal path local (-to-nonidentification) asymptotic distributions of pab
and PArbov are still obtained for g = 1, and they can also be shown to be equal
to the sequential local (-to-nonidentification) asymptotic distributions of pab and
PArbov as (5, Af - > cx))5^ for g = 1. On the other hand, the first-order large Af
fixed S asymptotic distribution of pArbov for p = 1 can be obtained from Theo-
rem 4 by assuming that c+ = 0, i.e., d < g < 1, so that q(Á, c+) = q(À, c) = 0.
Finally, note that ôq(Á, c*)/(dc*) > 0.
If Ag¿ = 0 and d = 0, the limit density of pab given in Theorem 4 is bi-
modal if and only if Mcr^y - Gy'/{ay(r) > 2, whereas if Ag¿ = 0 and d > 0,
the limit density of pab is bimodal if and only if y/Xq(À9c*)/2 > 2. In both
cases the second mode, if it exists, is not of practical importance. When Agd <
0, the limit density of pab is unimodal. The limit density of PArbov is always
unimodal.
To see that the preceding diagonal path local asymptotic results can yield rea-
sonable approximations, consider the following example: let p = 0.95, Af = 100,
and S = 10. Choose d = g = 1. Then k = 5, c = 1/10, and q(k,c)N/(2k) =
(1 -exp(-l)) x 10 = 6.32, whereas (1 -p2(1+5>)/(l - p2) = 6.93.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1363
LEMMA 2. Let T > 3, 0 < d < 1, and g = 1/2(1 +d). Furthermore, let p =
1 - i Af~~£ vv/i/z A > 0. Let N - > oo, 5 - » oo simultaneously with S/N - » c > 0 and
S/#¿ _> c* > o. Lei ^(A, c*) = 2/1(5 + 1) i/d = 0, q(l,c*) = 2Àc* ifO < d < 1,
andq(l9cm)=q(X,c*) ifd=l. Finally, let oj = o2(G2q(k,c*)/(2X)+ '[d =
0]als). Then
N-]^=lZPfAviAX52,and
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1364 HUGO KRUINIGER
THEOREM 5. Let T > 3, 0 < d < 1, and g = 1/2(1 + d). Furthermore, let
p = 1 -AN'8 with X > 0. Let N -> oo, S -> oo simultaneously with S/N -> c > 0
and S/Nd -> c* > 0. 77i£?/z
PAB-pl X'hWXu/X'hWXh
wiíA Xio = (X'5X X'7y and Xn = (0 X¿)7, wA^ £77, X51, Xlt and X8 are
defined in Lemma 2.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1365
Nl/2V-dHpsYS-p)^ X[6WXÌ5/X'6WXÌ6
with Xl5 = (Xf5{ '{d = 0}X'7Y and Xl6 = ('{d = l}X'6l X'S)', where X5i,
X(,', Xj, and X% are defined in Lemma 2.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1366 HUGO KRUINIGER
by the behavior of the sum that involves the initial conditions, which dominates
the sum(s) that only involve(s) the sample period innovations; see the proof of
Theorem 7 or note 5. Also when d = 1, the asymptotic distribution of pArbov is
affected by the behavior of the initial conditions. Phillips and Magdalinos (2008)
observe similar "tail wagging the dog" phenomena in the asymptotic behavior of
the LS estimator for a simple unit root AR model with a so-called infinite past
initialization and the related i-ratio. Note that although the initial conditions also
play a dominating role in the derivation of the asymptotic distribution of fi ab when
5, N -> oo with S/Nd -> c* > 0 and d > 1 or (S, N -> oo)seq, the distribution
obtained is the same as the one found for any d with 0 < d < 1.
Observe that ''mx'nq(k,c*) = 0 and limante (¿,<0/2ii) = lim^oiU-
exp(-2Ac))/(2A)) = linu¿o(2cexp(-2Ac)/2) = c. From these observations
and the results in Theorems 4 and 7 we conclude that the diagonal path local
asymptotic distributions of the Arbov and System estimators are left-continuous
atA=0(/? = l) provided that 0 < d < 1. A similar continuity result holds for
the local-to-nonidentification asymptotic distribution of the AB estimator for any
d>0.
The limit densities of fi ab and fi Arbov in Theorem 7 are both unimodal.
The following lemma and theorem extend the results in Theorem 7 to T > 3
and the System estimator.
LEMMA 3. Let T > 3 and let p = 1. Furthermore, let N -> oo, S -> oo
simultaneously with S/Nd -> c* > 0 and S/Nd -> c > 0 where d > 0 and d =
max(ú?, 1). Leto*r =o2(c*a2 + '{d = 0}(als-'-a2)). Finally, letfi'b bean initial
N (X/2W -consistent estimator for p, i.e., Nil/2)ã(pib - p) 4 Kb #0. Then
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1367
070diag(^-|/2/m-(r-2),^(-l/2)¿/r-2)I,/L,(¿//'[A^(pi)";'(pi)]'
diag(^-1/2/m_(r-2),^-1/2/r-2)I,'Ll(Z//'tAt>;(p,ft)^(^iè)]'[At;;(yolfc)
V¡(pib)]Z;')diag(N-l'2Im-{T-2),N-V2lT-2) 4 ^^Ãrbovlb'
diasWl-WWlT-i.N-W^T-iy^-WlT-í)
xIìLl&f' [Aoí(pi) o!(fii)]'[Av'i(fit) Ô!(fiO]Zf)
xdiag(yv(-l/2)('+rf)/r-2, N-Wlm-(T-v, N^'^h-i) -4 a4^52>
diag(^(-'/2)O^)/r_2> A^-'/2/m_(r_2), Ar'/2/7--2)
x X,N=l (Zf [ Av'iipu,) v¡(p]b)]' [ A»í(pit) v¡(fiib)]Zf)
PAB-P^X'4WXÌ3/X'HWXH
V7J(pSYS2ÍPl)-p) -» ■^iiW'5K52-^10/An^SyS2-ïll,
w/fÄ Xio = (*5, Ì52 1W < l)^7i)'. Xiofr = (X^ X^)', am/ in = (0 *£)'.
f/vj (1) if d < ì, plimN->oo(psYS-p) = 0.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1368 HUGO KRUINIGER
N(i/2)(d-d)(ßsYs_p) 4 x'6WXì5/X'6WXì6
with Xl5 = (X'5Ì '{d = 0}X'7y and X16 = ('{d> 1}X'6{ X's)f.
When p = 1 and T > 3, the diagonal path asymptotic distribution of the pab'
estimator is equal to both its first-order large N fixed S asymptotic distribution and
its sequential as (5, N -> oo)s^ asymptotic distribution for all d > 0. However,
in general, these equalities no longer hold simultaneously for any d > 0 when the
AB estimator uses an arbitrary sequence of PD weight matrices. Then one needs
to distinguish between two cases: the diagonal path asymptotic distribution of
the AB estimator is equal to its first-order large N fixed S asymptotic distribution
when d = 0, and equal to its sequential as (5, N -» oo)^ asymptotic distribution
when d > 0. Finally, when p = 1 and T > 3, the Arbov estimator that uses an
arbitrary sequence of PD weight matrices has a normal asymptotic distribution
that is equal to its first-order large N fixed S asymptotic distribution for d < 1 ,
whereas it has a nonnormal asymptotic distribution for d > 1, just as in the case
p = 1 and T = 3; cf. the discussion following Theorem 7. Moreover, for d > 1,
the diagonal path asymptotic distribution of the Arbov estimator is equal to its
sequential as (S, N -> oo)5^ asymptotic distribution.
4.1. LM Tests
In this section we propose two LM-type panel unit root (UR) test statistics that are
based on an Arbov estimator and a System estimator that use a weight matrix that
is optimal under the null, and on restricted conventional estimators of their first-
order fixed-parameter asymptotic standard errors. The critical values for these
LM tests can be taken from the standard normal distribution irrespective of the
assumptions made regarding the initial observations.
Let pArbov2,R and psysi^r be GMM estimators that use WNiArbou2(l) and
WNisYS2(l) as weight matrix, respectively. Let SE(pArbov2,R) = {N~lI,?=i
(^-iZ/Ot^^wil)]!^!^/7^,-!)}"172 and SE(psys2,r) = {AT1!^
[(Ay;., ^_1)Zf][^,5y52(l)]ir=i[zf/(A^-i^-i)/]}"1/2- Then we have
the following results.
(i) If p = 1 and if N -> oo, S -> oo simultaneously with S/Nd -> c* > Ofor
some d > 0, then
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1369
and
(ii) If p = '- XN~i/2 with X > 0 and */ S -> oo, tf -» oo sequentially, then
d f X ,- '
(pArbov2,R - l)/SE(pArboo2,R) "> N I--V/», ,- 1 1
and
(Hi) Ifp = l-XN~1/2 with X > 0 am/ // tf - > oo, S - > oo simultaneously with
S/Nl/2 -*c>0and S/Nd -> c* > 0 w/rcrc d > 0, /Ae/i
In the last decade various other panel unit root tests have been proposed. For
instance, Breitung and Meyer (1994) proposed a test statistic that is based on an
ordinary least squares estimator for p in a model for deviations from the initial
observations. Harris and Tzavalis (1999) discussed an LM-type panel unit root
test that is based on the bias-corrected least squares dummy variables (LSDV)
estimator for p. Finally, Kruiniger (2008b) discussed a Wald-type panel unit root
test that is based on the first difference maximum likelihood estimator (FDMLE)
for p in the covariance stationary panel AR(l)/unit root model.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1370 HUGO KRUINIGER
1 . In most cases considered the GMM-based tests have correct size. However
nonnormality of the errors affects the size of the tests.
2. The power of the test based on the System estimator is greater than or equal
to the power of the test based on the Arbov estimator. However, in many
cases the power of both tests is roughly the same and equal to the power of
the FDMLE.
3. When the variance of the y¿j - pa is larger than the value implied by co-
variance stationarity, the power of the test based on the System estimator
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1371
A. <72= 1 and<72 =0
6 100 ARBOV2-LMa 0.035 0.030 0.155 0.355 0.846
ARBOV2-LM 0.058 0.056 0.228 0.470 0.906
SYS2-LM0 0.029 0.030 0.143 0.370 0.881
SYS2-LM 0.050 0.054 0.215 0.477 0.930
B.<72 = 1 and<r2 = l
10 100 ARBOV2-LM* 0.030 0.029 0.261 0.663 0.987
ARBOV2-LM 0.059 0.054 0.370 0.756 0.993
SYS2-LM* 0.024 0.028 0.209 0.592 0.981
SYS2-LM 0.049 0.047 0.306 0.700 0.989
6 100 ARBOV2-LM* 0.035 0.030 0.145 0.339 0.844
ARBOV2-LM 0.058 0.056 0.217 0.459 0.901
SYS2-LM* 0.029 0.030 0.139 0.341 0.856
SYS2-LM 0.050 0.054 0.209 0.453 0.916
FDML-W 0.056 0.056 0.200 0.466 0.925
FEML-W-SA 0.049 0.049 0.118 0.162 0.319
LSDV-LM-SA 0.050 0.050 0.153 0.327 0.812
6 500 ARBOV2-LM* 0.029 0.029 0.446 0.939 1.000
ARBOV2-LM 0.055 0.053 0.553 0.968 1.000
SYS2-LM* 0.027 0.028 0.446 0.958 1.000
SYS2-LM 0.050 0.052 0.567 0.979 1.000
is greater than the power of the tests based on the Arbov estimator and the
FDMLE.
4. The power of the GMM-based tests decreases with an increase of the value
of 4
5. In the cases considered the GMM-based tests have greater power than either
the test that is based on the LSDV estimator or the test that is based on the
FEMLE.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1372 HUGO KRUINIGER
5. CONCLUDING REMARKS
In this paper we considered GMM-based estimation and inference for the panel
AR(1) model when the data are persistent and the time dimension of the panel is
fixed. We derived local asymptotic approximations to the finite-sample distribu-
tions of the AB, Arbov, and System estimators, respectively, under a variety of
distributional assumptions about the initial observations. Among other things we
found that the nature of the weak instruments problem of the AB estimator de-
pends on the distributional properties of the initial observations. Moreover, when
p = 1 - À/N and when either the data are co variance stationary or both S and Af
grow large with S/Nd -> c* > 0 and d > 1, then both the Arbov and the two-
step "optimal" System estimator have nonnormal local asymptotic distributions,
and the estimators of the optimal weight matrices for the Arbov estimator and the
System estimator and their asymptotic standard errors are no longer consistent.
We also argued that in these cases one should use LM tests and not Wald tests.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1373
Two LM-type panel unit root tests that we proposed were found to have good size
and power properties.
NOTES
2. Note that E(e, , y¡tS ) = 0 for all t and all s < t and E(eitte^s ) = 0 for all t ^ s implies E(eijt rji ) =
0 for all t > -S + 2 but not E(siy-S+' li) = °-
3. In Section 3.2 we discuss a range of alternative local asymptotic distributions based on S/Nd - >
c* > 0 and p = 1 - XN~g with 0 < d < g < 1 so that Var( v/i) = O(Nd); cf. Lemma A.4 in the
Appendix. The value of d determines the order of magnitude of Var(^/j) in terms of N. One can
expect that if 5 or Var()>; j) is small relative to N, then a local asymptotics based on S/Nd - » c*
with d close to zero gives a better approximation to the distributions of the AB and Arbov estimators
compared to local asymptotics based on the CS model, which implies that Var(y/i) is large relative
to TV.
REFERENCES
Ahn, S.C. & P. Schmidt (1995) Efficient estimation of models for dynamic panel data. Journal of
Econometrics 68, 5-28.
Ahn, S.C. & P. Schmidt (1997) Efficient estimation of dynamic panel data models: Alternative as-
sumptions and simplified estimation. Journal of Econometrics 76, 309-321.
Arellano, M. & S. Bond (1991) Some tests of specification for panel data: Monte Carlo evidence and
an application to employment equations. Review of Economic Studies 58, 277-297.
Arellano, M. & O. Bover (1995) Another look at the instrumental variable estimation of error-
components models. Journal of Econometrics 68, 29-51.
Blundell, R.W. & S. Bond (1998) Initial conditions and moment restrictions in dynamic panel data
models. Journal of Econometrics 87, 1 15-143.
Bond, S.R., C. Nauges, & F. Windmeijer (2005) Unit Roots: Identification and Testing in Micro Panels.
CEMMAP Working paper 07/05, Institute for Fiscal Studies, London.
Breitung, J. & W. Meyer (1994) Testing for unit roots using panel data: Are wages on different bar-
gaining levels cointegrated? Applied Economics 26, 353-361.
Fieller, E.C. (1932) The distribution of an index in a normal bivariate population. Biometrika 24,
428-440.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1374 HUGO KRUINIGER
Forchini, G. (2006) On the bimodality of the exact distribution of the TSLS estimator. Econometric
Theory 22, 932-946.
Hahn, J., J. Hausman, & G. Kuersteiner (2001) Bias Corrected Instrumental Variables Estimation for
Dynamic Panel Models with Fixed Effects. Working paper, M.I.T.
Han, C. & P.C.B. Phillips (2006) GMM with many moment conditions. Econometrica 14, 147-192.
Han, C. & P.C.B. Phillips (2007) GMM Estimation for Dynamic Panels with Fixed Effects and Strong
Instruments at Unity. Cowles Foundation Discussion paper 1 599, Yale University.
Harris, R.D.F. & E. Tzavalis (1999) Inference for unit roots in dynamic panels where the time dimen-
sion is fixed. Journal of Econometrics 91, 201-226.
Hillier, G.H. (2006) Yet more on the exact properties of IV estimators. Econometric Theory 22, 913-
931.
Hsiao, C, M.H. Pesaran, & A.K. Tahmiscioglu (2002) Maximum likelihood estimation of fixed effects
dynamic panel models covering short time periods. Journal of Econometrics 109, 107-150.
Kruiniger, H. (2003) On the Estimation of Panel Regression Models with Fixed Effects. Revised ver-
sion of Working paper 450, Queen Mary, University of London.
Kruiniger, H. (2008a) GMM Estimation and Inference in Dynamic Panel Data Models with Persistent
Data. Revised version of Working paper 428, Queen Mary, University of London. Available at
http://webspace.qmul.ac.uk/hkruiniger/.
Kruiniger, H. (2008b) Maximum likelihood estimation and inference methods for the covariance sta-
tionary panel AR(l)/unit root model. Journal of Econometrics 144, 447-464.
Madsen, E. (2003) Using GMM when Testing for a Unit Root in Panels Where the Time-Dimension
is Fixed. CAM Working paper 2003- 1 1 , University of Copenhagen.
Marsaglia, G. (1965) Ratios of normal variables and ratios of sums of uniform variables. Journal of
the American Statistical Association 60, 193-204.
Moon, H.R. & P.C.B. Phillips (2000) Estimation of autoregressive roots near unity using panel data.
Econometric Theory 16, 927-997.
Phillips, P.C.B. (2006) A remark on bimodality and weak instrumentation in structural equation esti-
mation. Econometric Theory 22, 947-960.
Phillips, P.C.B. & C. Han (2008) Gaussian inference in AR(1) time series with or without a unit root.
Econometric Theory 24, 631-650.
Phillips, P.C.B. & T. Magdalinos (2008) Unit Root and Cointegrating Limit Theory when Initialization
is in the Infinite Past. Cowles Foundation Discussion paper 1655, Yale University.
Phillips, P.C.B. & H.R. Moon (1999) Linear regression limit theory for nonstationary panel data.
Econometrica 67, 1 057- 1 1 1 1 .
Staiger, D. & J.H. Stock (1997) Instrumental variables regression with weak instruments. Economet-
rica 65, 557-586.
Stock, J.H, J.H. Wright, & M. Yogo (2002) A survey of weak instruments and weak identification in
generalized method of moments. Journal of Business & Economic Statistics 20, 518-529.
Woglom, G. (2001) More results on the exact small sample properties of the instrumental variable
estimator. Econometrica 69, 1381-1389.
LEMMA A. 1 . Let [y¡ , } be a stationary process and let u¡tl - y¡t - fi¡. Then
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1375
('+p)20+p2) , ^ 1+p
cr,A E[(Aw, ,) v4, ]=
EK,_l(A«,v)3]
EK,_l(A«,v)3] ,. ,.,3l
,3l =
(l-p)[K('-p2) + 6p2] 4 3 4
Proof of Lemma A.I. Note that u¡j = pult-' +e,-if and Am/?í = (p - ')uit-' +
£/,. Moreover {wzi} is a stationary process. Then verification of the first two claims is
straightforward. The other claims are proved as follows:
E(<,)=E(pV,í-l+6pM/-le2,r+eu)=/E(</) + ^Í+^4
n/ 4 ^ k<74 6gy (i-p2)*+6/>2 4
«b("'V> n/ ^ - ! _p4 + (1 .^d .^4) - (, _p2)2(i +p2)ff '
a2 a2
E(>v,i Ay/,2) = ---- , E(y,-,2Ay/,2) = y- ,
v / a , 2a2l*2 + ei('-p2)]
v Var(,;,,A£,3)= / a , (i+p)0_p) .
2(T2[íJ2 + ít2(1-/9)2]
Var[fe,3 H- (1 -/>)///) Ay/,2] =
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1376 HUGO KRUINIGER
v , a * pHk('-p2) + 6p2) 4 , oA
2(7^2(1 -/>)
Cov[(e/,3 + (l -/J^iOAy/^.y/^Ay/^] =
Proof of Lemma A.2. Noting that >>;}/ = utj + //,- , application of Lemma A. 1 yields
-?
<7
2 *2
E( y/,2Ay/,2)=E[y/jAy/f2 + ( Allori 2 = yt-.
2^4 2 2 2^2[a2+^(l-/>2)]
2c72[(72 + ^2(l-p)2]
r ^2 2I
Variai A^2) = E (m/,i Am/,2 + /ìiAiìi,2 + - - )
L ^ J
, 2a4 2<72<r2
= E[KlA""2) , ÕW+TT7 2a4 2<72<r2
, ^4 1
(i+p)2J
_ (K(l-p2) + 6p2)g4 g4
(l+p)2(l+p2) +(1+/>)(1-P)
(T2[2g2(l+p)-ff2]
+ (l+/>)2
+ (T^)2+2"'.'(A"'.2)3
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1377
lo1 lo2 ,
- 77- (w/,1 A11/.2) - - - (Am/,2) , ]
+ Cov(/i/Ae/f3,/i/A«/f2)
= E(M?1Aei,3AM|i2) + <T^E(Ae/>3Aii|f2)
= -E(«?tlCi2f2)-^E(c22)
¿72[<72+<72(l-p2)]
(l-p)(l+/>) ' *"
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1378 HUGO KRUINIGER
LEMMA A.3. Lei {^r} be a stationary process, let wz?i = yit - ///, let /, p, oazú? g
be positive integers, and let Kq = E(ef t)/aq < oo. Then assuming that the sum of the
powers ofsik, ui,s> ui,t> and ^ui,t I<J less tnan or equal to 8, we have
(i) 'impV('-p)PE(ulPt)<oo,
(ii) limpT1(l-y9^|E(^+1)| <oo,
(Hi) 'imp<tiE[(AuiJ)P] = Kp(jP,
(iii) limn, E[(Aii/if )^] = limpîl E[((p - 1 )«/,/_! +^,i)/?] = E[(e/,f )^] = /c^îj/7,
(iv) Hmpîl(l-p)*|E[i«^
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1379
Proof of Lemma 1. Part (i): The proof of the first two results is similar to the proof of
the first part of Theorem 1 :
For/? = 1 -XN~X 'imN->ooE(yiìiAyis-ì) = 'imN->OoPt-3E(yiAAyiy2) = ~'o2
and limyv->oo AT^Varí^i Aü/ff) = lim^^oo N~lE[(y¡9' A^,)2] = lim/v-^oo^"1
xE[(j/JA^3)2]) = lim^^oo2^2N-lE(>;?1) = lim^^oo2a2^-1E(W?1) = ^^.
Note that Var(y/fl Ayiyt-') = E[(uiA AiilV_i +^/Aii/ff_i + y^p'~3)2], M,-,/ =
puij-'+£itt, and A«/if = (p-l)w/ir_i +£/,/• Then it is easily seen that for/? = l-AN~l
limn-too N-^ariyijAyij-i) = 'imN^ooN~ìE[(uiìiAuift^i)2] = lim^^oo^"1
E{[(p - l)w?! +K/,ie/f,-i]2} = limyv^ooyV-^tÍM/,!^-!)2] = a4/(2A); that if
s <t- 1 and/? = l-AA^~1limyv->oo^~1Cov(y/?1A^jJ,y/iA^i_i) = lim^^oo^"1
Eíii^Aii/^Aii^-i) = lim^^oo^'^ÍM^Aiii^Aii/^+i) = . lim^^oo^"^^?!
[(/? - 1)m5_i + es][(p - 1)(pmj-i + ej)]} = 0; lim^^oo^'^ovi^i Au/,/,
yÌ9'Ayu-l) = YimN^ooN-ÌEiyf^AeijAuij-O = -tr4/(2A); and finally that
'imN^Oo^~ìC0'(yiiiAvij9yiAAyij) = 'imN^0ON-ìE(ylìA£iìiAuij) = (74/(2A).
Thus for/? = 1-/1^- * limyv^oo^-1E(Z//At;/Ay;z/) = lim^^ooi727V-1E(>'?1)
xf/ = aV(2A)//JimN^oo^-1E(Z//Ay/í_1A>;;_lZ/) = lim^^ooíT2^-1E(^1)/
= í74/(2^)/, and lim;v->oo W-'EÍZ/'A^-! Aü¡Z¡) = -MmN^oo^N^Eiyf^C =
-g4/(2à)C, and hence
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1380 HUGO KRUINIGER
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1381
Part (iv): Because limw_>ooVar([A;yz?r]2) < oo Vf, it is easily seen that for p =
1 - AN'1, N~l ZfL^zV'Zf1) q^' lim^oo E(Z/7/z/7) = a2 1 and hence plim^^
[AT-1Z^1(Z///Z//)] = ^2/.
To establish the second claim of part (iv), note that v¡ = y; - pìy/,-1 =«/+(/?- p')
xy/,-i and Avi = Ay¡ -p' AyZì_i = Ao/ + (/>-pi)Ayif_i.
Lemma A.3 implies that lim;v->ocE(Aw/,/)8 < oo, lim^-^Oo^~1E[M?J(AM/>f)6] <
cxD^im^^ooA^^ElM^ÍAM^)4] < oo, limA^ooE^Aw;,,)4] < oo, lim^oo |E[n/fJ
x (Am/^)3] < oo, and ìim^^oQ N~^E[uj s(Auiìt)2] < oo Vs,r. It follows from the
first result, our model assumptions, and the Cauchy-Schwarz inequality that lim^^oo
Var([uiìSAyiìt]2) < oo Vs,t and hence N'^fL^Zf1'^ v¡][Av¡ oJ]Z/7) q™'
'imN^oo K^EiZf^Viü'iZf^K11' = o*KuKIlf. Similarly it follows that AT2
x SÍLiíZpyí.-i^.!^) ^ lim^ooiV-^ÍZ^-i^.^/«) = ^4/(2A)/,
Ar2L*Li (Z/>' AWfli A^_, z/> 0 q5' 0, and ^-2^=1 (z^/'-i ^l-x*?') '$' 0.
Lemma A.3 and our model assumptions also imply that lim;v->oo ^[vi,kui,s(^ui,t)2] <
oo and limyv^oo N~lE[vf ku2 s(Auij)4] < oo Vk,s,t. It follows from these results, our
model assumptions, and the Cauchy-Schwarz inequality that N~3/2 X-^j (Z^'d/^ _{ z/O
Í5- O, tf"3/2 It^=1 (Zpo¿ A>if _! Z/>) ^- 0, AT3/2 1^=1 (Zpy/,-1 AdJ z/>) q4' 0, and
N-3/2S^=1 (z/> > A^_! Ay;Zzö) ^- 0.
We conclude from the preceding results that AT"1 I^j ¿/7/[AÏ- SJ]'[Ao- ^]Z/7 4
a*KllKw + o*/(2k)K2diag{Om-{T-1)9 ¡T-2). ■
Proof of Theorem 2. The proof relies on results in (the proof of) Lemma 1 .
Part (i): Recall that Z' = KABzfB' where ZfB = [Z/ ZfP] and rank(KAB) = m.
Therefore pab' is equal to a GMM estimator that exploits E(Z^'Aü;) = 0 and uses the
weight matrix (W1 X/^i zfB'HZfB)~l. Consider the local-to-nonidentification limit-
ing behavior of this weight matrix. Because limw^oo N~2Var(j2j) < oo, lim^^oo ^-1
Variai Ayij) < oo, and limyv-^oo VarilA^^]2) < oo, it is easily seen that for p = 1 -
XN-' o2N-2^{Z''HZ{)q™' ¿55,11 and^A^X^Z/'/ZZ^
owcva2N-lJäfLi(zPfHZP)q^' t55a2- Therefore p'imN^ooc72N-2^fLi(Z¡fHZIi)
= £55f „, plim^ooa2^"1 IJL^ZP'HZP) = ¿55,22, and plim^ooa^-3/2!^
(ZJ'HZP) = ¿55,12 = 0. Furthermore ¿55 is PD. It follows from the preceding results
and (the proof of) Lemma 1 that pAB' - p -* X'6 ¿^ X5/X¿ ¿^ X6. Assuming that pAB
exploits E(Zf Bf Avi ) = 0, we also obtain pAB~P^ *61 wl 1 ^5 1 /¿¿I W" *61 •
Finally we show that pAB' is asymptotically biased downward when T > 3:
Note that ¿As! = p + {I^I^j^jZ^ZÍZO-^íy.!,,.!]}-1!^1!^!^^,
x(Z;Z0-1Z;Dí].BecauseI^21[^1'í_1Z,(Z;Z0-1z;yí] = (I^1A);;j_
{Zf'HZf^^Zf'Avi ^à'^[yf_u_xZt{ZftZt)-xZfty.u^] = <£*LX
A^ _j Zf 5)[I^! (ZfB'HZ?B)rl ZfLi ZfBfAyi^h it follows from part (i) of the proof
of Lemma 1 that both the numerator and denominator of pAB' - p converge in distribution.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1382 HUGO KRUINIGER
We first show that 'imN^ooE[y'_] ,_, Zt (Z'tZt)~x Z'tvt] < 0 for t = 2, . . . , T - 1. Note
that limyv^ocEI^j^jZ^ZÍZr)-^;^] = lim^^E^ [?,•_!,,_!;; t(Z'tZtyx
x Zißij]} = limyv-^cx)I/ii [E(z'Ì3l(Z'tZt)-xZi,t) x E(5vs_i^_ie/s/)].
Now, Eiy/.-^-je/^ar -0/(7-^
x - I A?, + ^7 S I AM (A.1)
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1383
M? U-
Let MU = N-izfLi&f'HZf), Mn = M'2X = W"1 ^(z/'CZ/7), and M22 =
N"1 S^j (Z/7/Z/7). From parts (i) and (iv) of the proofs of Theorem 2 and Lemma 1
we have pUm^-^oQ N~^ M'' = tf~2 ¿55,11 = (a2/(2À))H and plimyv->oo^22 = ^ 2/-
Moreover, because limjv->ooE(;y/,i àyij) = -a2/2 and lim^^oo TV"1 Var(>>/ ' Ay/,,) =
cr4/(2A), wehaveplim^^oo^-1/2M12 = plim^_,ooN-1/2M^1 =0.
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1384 HUGO KRUINIGER
LEMMA A.4. Let g, d, c*, and À be constants such that 0 < g < l,0<d < g,0 <c* <
oo, andO < I < oo. Furthermore, let p = 1 -ÀN~8. Then lim^^^ s/Nd^>c* Ng~d(' -
p2V+s)) = q(l9c*) <oo, when$(l,c*) = 2X(S+l)ifd = 0;q(l,c*) = 2Xc* ifO<
d < g; andq(Á,c*)=q(Á,c*) if d = g.
A.4. Proof of Theorem 4. Note that lim^i Var(y/ _5 - ///) = 0. Then we obtain for
the parameter sequence p = 1 - ÀN~8 that lim^^^ ^S/Nd->c* N~dVax(yi9' - Hi) =
lim^oo,5/tf'-*c**2*"<'0 -p2(1+5))/0 -P2) = Hm^oo^/yv^c^2^"^1 -
xp2(H-5))/(2/i) = i72^a,c*)/(2/l) by Lemma AA It follows that Hm^^^^^,*
N-dVar(yÌ9ÌAyÌ92) = K™N->oo9S/N<'-+c<> N~d^(yiA£i,l) = cj2{a2q{X,c*)/{2X) +
1{J = 0}a2s). Furthermore, if 0 < d < g, we have HmA^OOíS/A,</_>c.[Af«""í/
x E(y/flA^2)l = -Wm^^oo.í/^^c^^^V^íyiM ~ /*/) = -^2^U,c*)/2.
Finally, if d = 0, we obtain that lim^^ iS/Nd^c< N8-dE(yiAAyia) = A(trMy - a2).
Note that when d = 0,v2 = (T2q(Ã,c*)/(2À) + (T2s.
When T = 3, pAB = p + (X^ w>1 Ae/)3)/(Xf=1 w>1 A^2) and pArbov = p +
ŒJL^c/.a + O-p^iOA^VŒiL^.^A^).
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1385
A.5. Proof of Theorem 7. When T = 3 and p = 1 , pAB = /> + (2/Li >7, 1 Ae/,3)A£/!l
^ iei)2) and pArbov = P + (I/Li e/,3e/f2)/Œ/Li 3>i,2*i'2)-
Let us define X, = ^-1/2(1+£/)I^i ^/,1 A^,3, X2 = iV1/2(l+^I^l ^ l£/j 2,
X3 = A^"1/2!^! C/>3Cif2. and X4 = N~^1^ I^=1 y/,2^,2. Let <r4r = .t2(cV2 +
'{d = 0}(ít15 +(J2)). Then it is easily seen that as N, S -> oo with S/Nd -> c* > 0 and
5/N^ -> c > 0 where ¿i > 0 and d = max(d, 1), one obtains that X' -> A^(0, 2o*r), X2 ->
N(0,a¿r), X3 4 A^(0,iJ4), and X4 4 A^(1{J < '}ct2,ccj4). In addition, one obtains that
limN_>OOíS/^_>c*Cov(X1,X2) = -í74r and lim;v_>oo5/yVi/_>c*Cov(X3,X4) = a
Parts (i) and (ii) of Theorem 7 now follow straightforwardly from the preceding results. ■
let Kq = E{eqit)laq < oo. Then assuming that the sum of the powers of "e,-^, m/,5, m;,,,
flttd Aujj is less than or equal to 8, we have
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1386 HUGO KRUINIGER
(vii) limA^ootf-^lEIe^^
■
A.á. Proo/s of Lemma 3 and Theorem 8. In these proofs (p)limjv->oo is snort f°r
(pJlim^^QQ iS/Nd-*c*- Note that when P = ^ °/,i = ei,r» Ayi,t = e/,f » A^/ = e|» and
Z/^diagic?,...,^-1). ■
Proof of Lemma 3.
Part (i): The proof of the first two results is similar to the proof of the first part of
Theorem 7. As N, S -> oo with S/Nd -> c* > 0 and S/Nd -> c > 0 where d > 0 and
¿ = max(¿/, 1), we obtain that
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1387
I/l,(z/l/[^;(?,)^(?i)]'[^(pi)S/(pl)]Z/l)diag(JV-1/2/m_(7-_2),W-1/2j/7--2)
4 oAKUK>" + c<r4/C2diag(Om_(7-_2),/r-2); ^"' I/li (¿/"tAÏ^Plft) ?/(piè)]'
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1388 HUGO KRUINIGER
Proof of Theorem 8. The proof relies on results in (the proof of) Lemma 3.
Part (i): Recall that Z' = KABzfB' where ZfB = 'z' ZP] and rank(KAB) = m.
Therefore /5¡4£i is equal to a GMM estimator that exploits E(ZfBfAv¡) = 0 and uses the
weight matrix (N"1 2/Li ZfBfHZfB)~l. Consider the diagonal path limiting behavior
of this weight matrix. Since lim^-x» N~2dVar(yf { ) < oo by Lemma A.5, limyv->oo N~d
Var(^ij Aj/r) < oo, and limyv->cxD Var([AjZ)/]2) < oo, it is easily seen that ¿j2A^~^1+^
results and the proof of Lemma 3 that pABi - p -> X'6 t~5l X5/X'6 t^5l X6. Assuming that
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1389
From part (iii) of Lemma 3 we have AT1/2!^ z'%i 4 X71 - N(0, 1 77,22) and
A^(-l/2)(l+J)I^i z'-'yi-x ^ ¿81 ~ N{'[d < '}a2i,ca4l).
Recall that zf'[Av' ü¡] = ((Z/'Ai/)' (Z/7'[A¿- S?]')')' = KsZ?'[Av¡ v¡]' where
z/7/[Ay; »<]' = KnZ¡Ifüi and AT5 =diag(/r_2, tf77).
Considernow * 5 W~sr51 AT5' = AT1 X^ Kszf'AZfKSf and KsW~XSYS{bKSf =
N~x I^j Kszf'zfK*f. It is easily verified that plim^oo aH1+¿) ^ (Z/'//z/) =
KV*2)//, Plim^00^-Il^1(^//Z//'Z//^770 = a2^7^7^ = <r-2£77, and
plimN^ooNÍ-WV+VzíLiiZfCZl1 K11') = 0. We also have plimyv^oJAH1*^
xI^1(Z//z/)] = K4r/i72)/.
We conclude from the preceding results that plim/v-^ocíÂsrs* - p) = 0 and
W(1/2)<Ws*-/>)4 X'iirfXj/X'tirfXt, fork= 1, 1^.
It immediately follows from part (iii) of Lemma 3 and the preceding results that
Note that when /)= 1, t>(),=e()í, A^,=e,)f, A>>? =s?, and Z¡' =diag(e?,...,e(^ ').
Recall the distributional results from parts (i) and (ii) of the proof of Lemma 3 and in
particular that Xj ± Xg.
Note that WN<Arbov2(l) = (AT'S,!, z/^oíz/7)-1 and W^.sy^d) =
(A^-'ljLjZf'IAi»; v'iÚAv'i v'¡]Zf)-]. Consider now ^"^'^^(l)^7" and
KsW^lSYS2(l)KSl. Recall that o*KIlK11' = £77. Then it is easy to verify that
1- Plim^oo.^^e.iV-C^SJt^/'AoiAoiZ/ =ff*./#,
2. plim^ocs/^. AH If=1 K"z!"v,v',z!'K'" = £77, and
3- Plta^oo.^-.c^*-1^2^^, K"Z¡»o,Av¡Zl =0.
We conclude from the preceding results that P^^M^oo,s/Nd-^c* (PArbov2,R - 1) = 0,
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
1390 HUGO KRUINIGER
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms
GMM ESTIMATION IN DYNAMIC PANEL DATA MODELS 1391
N~l £A^ z/^ü/üJz/7 = a4 1. Similar arguments show that plim^^^ yS/Nd_>c* N~x
I^/'^A^Z/'^4/.
Because lims^-x»,«? N~lE(eflkuf s(Auijt)2cl) < oo Vk9s9t and V/,<? > 0 by
Lemma A.3, we have a fortiori 'imN^œS/Nd_+c* N~lVar([yijAyij]2) < oo and
'imN^oo s^Nd^c* N~lWar([yi'Aüit]2) < oo Vi when ¿/ > 0. Moreover, when
d = 0, we have lim^^ tS/Nd^c* Varfly/j Ayiit]2) < oo and lim^^^ iS/Nd-+c*
Var([y,M ¿K,]2) < oo Vf! It follows that plim^TOfS/^c, ATÍ^S^Z/'
This content downloaded from 129.252.86.83 on Tue, 12 Jul 2016 22:03:31 UTC
All use subject to http://about.jstor.org/terms