Professional Documents
Culture Documents
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=ims.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Institute of Mathematical Statistics is collaborating with JSTOR to digitize, preserve and extend access to The
Annals of Statistics.
http://www.jstor.org
TheAnnals of Statistics
2006, Vol. 34,No. 3, 1049-1074
DOI: 10.1214/009053606000000245
? InstituteofMathematical Statistics, 2006
processes are established. The conditions are shown to hold in case of expo
nential and hyperbolic decay in the ARCH weights, though in the latter case
a faster decay rate is required for the central limit theorem than for the law of
equations
(1) xt=rjt?t,
oc
where
oo
(3) coo>0, foj>0 (j > 1), ^^0y<OO,
= 1
7
=
(5) yt Ato+ xt
Key words and phrases. ARCH(oo) models, pseudo-maximum likelihood estimation, asymptotic
inference.
1049
1050 P.M. ROBINSON AND P. ZAFFARONI
Empirical evidence ofWhistler [35] and Ding, Granger and Engle [10] has sug
gested the possibility of long memory autocorrelation in the squares of financial
data. Taking [contrary to the first requirement in (3)] coo= 0, such long memory in
was considered by Robinson [29], the \//oj being the au
xf driven by (1) and (2)
= 1;
toregressive weights of a fractionally integrated process, implying YlJLi ^0j
see also Ding and Granger [9]. For such x//oj, and the same objective function as
was employed to generate the tests of Robinson [29], Koulikov [20] established
asymptotic statistical properties of estimates of fo- On the other hand, under our
[18] to be necessary for existence of an xt satisfying (1) and (2). The inter
Leipus
mediate in (3) is sufficient but not necessary for a.s. positivity of
requirement a},
and is imposed here to facilitate a clearer focus on the xf/oj, which decay, possibly
slowly, but never vanish.
We wish to estimate the (r + 2) x 1 vector #o = (coo, Mo, fo)' on the basis of ob
servations yt, t = 1,..., T, the prime denoting transposition. The case when /xo is
for = 0, is covered a version of our treatment. If
known, example, /xo by simplified
the yt were instead unobserved errors, we have /xo = 0, but would then
regression
need to replace xt by residuals in what follows; the details of this extension would
be relatively straightforward. Another relatively straightforward extension would
cover simultaneous estimation of the regression parameters coo and fo, after re
/xo by a more general parametric function; as in (1), (2) and (5), efficiency
placing
gain is afforded by simultaneous estimation.
Under stronger restrictions than < 1, Giraitis and Robinson [14] con
YlJLi i^Oj
sidered discrete-frequency Whittle estimation of fo, based on the squared obser
vations yf (with /xo known to be zero), this being asymptotically equivalent to
=
xt(v) yt -M,
oo
= 1
7
t-\
j=i
for f > 1, where 1() denotes the indicator function. Define also
= ?
?t zrgmin Qt(9), 9j argmin Qt(9),
0e? 0e?
where @ is a prescribed compact subset of Er+2. The quantities with over-bar are
introduced due to yt being unobservable for t < 0; 9j is uncomputable. Because
we do not assume in the we refer to as a
Gaussianity asymptotic theory, ?t pseudo
maximum likelihood estimate (PMLE).
We establish of 9j and asymptotic of Txl2(9j ?
strong consistency normality
?
#o), as T oo, for a class of ^(f) sequences. In the case of the first prop
erty this is accomplished by first showing of 9j and then
strong consistency
that 9t ?9t ?
0, a.s. In the case of the second we likewise first show it for
1052 P.M. ROBINSON AND P. ZAFFARONI
-
TXI2(9T do) and then show that 9T-9T= but the latter property, and
op(T~x/2),
?
thus the asymptotic normality of Tx/2(9j #o)> is achieved only under a restricted
set of possible fo values, and this seems of practical concern in relation to some
> 2. The =
Assumption A(q), q et are i.i.d. random variables with Eso 0,
= I, < oo and probability
Es^ E\so\q density function f(s) satisfying
= as?^0,
f(s) 0(L(\8\-x)\8\b)
?
for b > 1 and a function L that is slowly varying at the origin.
Assumption B. There exist coi,cou, lil, l^u such that 0 < col < cou < oo,
?oo < /xl < /X(/ < oo, and a compact set T e Rr such that 0 = [col, <ou] x
[Mi,jXtf] x T.
(6) inf^(f)>0;
~l~x
(7) sup \jrj(f) < Kj for some d > 0;
(9)pe{(d+irx,l),
(10) ?|jt0|2p<oo.
ESTIMATION OF ARCH(oo) MODELS 1053
(ID
r^-s^K)1-'
for all r\ > 0 and all //*= 1,..., r, h = 1,..., &, k < I.
rank{*Ul,...Jr)(f)}=r,
where
(12) ^ > \
such that
(13) irQj<Kj-x-d\
and (10) holds for
Assumption A(g) allows some asymmetry in ?r, but implies the less primitive
condition (which does not even require existence of a density) employed in a sim
ilar context by Berkes, Horvath and Kokoszka [3]. Assumptions B and C are stan
dard. Inequalities (7) and (13) together imply do > d, while (8) with (3) ismilder
than monotonicity but implies =
o(j~l) as j ? oo. We take n > 0 in As
i/roj
sumption F(Z) because < 1 for all G
i/j(0 large enough j, by (7). Assumption
is crucial to the proof of consistency, being used in Lemmas 9 and 10 to show
that in the limit 9o globally minimizes Qt(9)\ it also ensures nonsingularity of
the matrix Ho in Proposition 2 and Theorem 2 below. This and other assumptions
are discussed in Sections 3 and 4 in connection with some parameterizations of
interest.
Proof. The proof follows as in, for example, [17], Theorem 6, from uniform
a.s. convergence over 0 of Qt(9) to Q(9) ? Eqo(0) established in Lemma 7, the
fact thatQT(9T) < Qt(0), and Lemma 10.
Theorem 1. For some 8>0, let Assumptions A(2 + 8), B, C, D, E, F(l) and
G hold. Then
-
G0 = (2+ k4)M 2k3(N + N') + P, Ho = M + \P,
where
Proof. Write
=
<?<?(*) =
^ r-?f>(*).
where
-
ut(9) = r,(0)(l x?m) + cj-2(0)vt(9),
with
of(9) o0
=
(16) 0= Q{t\9t) Q{t\Oo) + Ht(Ot-Oo),
where has as its /th row the /th row of HT(9) = T~l evaluated
HT J2f=\ nt(0)
- -
at 9 =
9%\
where ht(9) = (d2/d9d9')QT(9), \\9^ 90\\ < \\9T 90\\, where
ESTIMATION OF ARCH(oo) MODELS 1055
-
we define ||A||
=
{tr(A'A)}1/2 for any real matrix A. Now ut(00) = rt(90)(l
?
s2) 2e2st/ot is, by Lemmas 2, 3 and 7, a stationary ergodic martingale dif
ference vector with finite variance, so from Brown [6] and the Cramer-Wold de
vice, T1/2Q{j}(Oo) -*d N(0, Go) as T -> oo. Lemma 7 and Theorem 1,
Finally, by
Ht -+p Ho, whence the proof is completed in standard fashion.
Define
T ^ T
GT(9) = T~x J>(0), HT(9) = T~x?ft,(0).
t=\ t=\
Proof. We have
= -
O=Q{t\0t) Qt\Oo) + Ht(9t Oo),
where Qj\0) = (d/dO)QT(0) and HT has as its ith row the ir/z row of HT(9)
evaluated at 0 = of, for - -
\\of #0|| < ||0r 0O||.Thus, from (16),
- - -
0T -0t = (H~x H-x)Q^(Oo) H~x\Qf(Oo) Q(Tl)(90)},
where the inverses a.s. for all sufficiently
exist large T by Lemma 9. In view of
Proposition 2 and Lemma 8, (17) follows on showing that
Q{t1\Oo)-Q(t)(9o)=op(T-x?2).
The left-hand side can be written (B\j + B2? + B^t)/T', where
T T t
= = - =
BXT ?>2fci? B2T l)b2t, B3T
-?>2 -2e2J2?tb3t,
t=\ t=\ t=\
with
-2(1)/ 2 -2\ 2(1) -2(1) 2 - -2
otK\of-af) V = or/' = a,2 erf
=-?a-, o\t b2t ?5-ry-, fc3, -?r-,
erferf ofcrfcrt
for of = = a2(1) = a2(1)(c90), with o^(9) =
of(9o), a2(1) a2(>0),
= = =
(d/d9)of(9), a2(1)(0) (d/d9)df(9). We show thatBiT op(Tx'2), i 1,2,3.
For the remainder of this proof, we drop the zero subscript in
\Jsoj.
1056 P.M. ROBINSON AND P. ZAFFARONI
\ 7=1 7= 1 I
where =
^j V*,- (fo)- From Assumption F(l),
< 1+ 2
|a,2(I) | ? *, |*,_,-1+ a:E ^'"xlj,
7=1 7=1
?K*t,
E^7^-7l<^E^-7J (Y,+j)
so since dt>(oi> 0,
r-l
= 1
7
From (8),
7= 1
It follows that
(19) \\a^\\/a^K^\
On the other hand, by the cr-inequality ([23], page 157) and (10),
oo oo
-
(20) E(af a}Y <Kj^ VjE\x,-j \2p< K? Vj
j=t j=t
n< ?
choosing 1 l/{p(do + 1)}, which (14) enables. Applying the cr-inequality
again,
T
E\\BlTr <Kj^E\80\2pE\\bltr.
t=\
ESTIMATION OF ARCH(oo) MODELS 1057
< we
Applying (21), this is 0(1) when p > 2/(do + 1),while when p 2/(do + 1),
may choose r\ so small to bound it by
< ^rp/2-{l+2(db+l)(l-//)}[p/2-2/{l+2(db+l)(l-i;)}] =
KT2-p(dQ+\)(\-r)) 0(Tp/2^
of n. Thus, =
using (12) [which requires (13)] and arbitrariness B\j op(Tx/2) by
Markov's inequality.
Consider Z^r- By independence of st and ?>2r,by the cr-inequality when p
<\,
and by the inequality of von Bahr and Esseen [34] and the fact that the 82 are i.i.d.
with mean 1when p > \,
T T
< <
E\\B2T\\2p K?(?|?0|4p + \)E\\b2t\\2p
KY,(E\\b4t\\2p
+ E\\b5t\\2p),
t=\ t=\
where
^n2E*;
L [j=t J +\Uhjl)\\2MxlJ\
J Ete2-;
J [j=t J
[j=t
/ oo
(oo \ ]/2 \ V2 j
r r oo i i/2-i
< Kir**'2 + I
If^j-i^mi-i,)^
L [j=t J J
so
oo
< Kt~pd? + KY^ /-(^o+1)p(1-27?) < ^/i-(^0+i)p(i-2^)
?||^k||2p
for sufficiently small n. Thus, E\\b4t\\2p is (9(1) for p > 2/(d0 + 1), while
?f=i
for p < 2/(do + 1), it is bounded by
KT2-(do+\)p(\-2rj) < KTp-(dQ+2){p-2/(do+2)}+2(do+\)pr]
=Q^JP^
J2E\\b5t\\2p=o(Tp),
1058 P.M. ROBINSON AND P. ZAFFARONI
applying the cr -inequality when p < and von Bahr and Esseen [34] when p >
\ ^.
< - so from (20),
Now b3t (erf of)xl2&-2,
T oo
E\\B3T\\2p<Kj2T,^j
t=ij=t
< =
K{t{p > 2/(do+ 1))+ (InT)l(p 2/(do+ 1))
+ T2-P{d?+Xh(p< 2/(db + D)}
=
o(Tp),
much as before. =
Thence, B3j op(Tx/2).
It remains to consider the last statement of the theorem, which follows on stan
dard application of Propositions 1 and 2, Theorem 1 and Lemmas 7 and 8.
In these references coo= 0 was assumed in (2), but we assume coo > 0 and general
ize (23) as follows. Introduce the functions aj = fl/(f)> bj = &/(?) and, for m > 1,
n > 0, n + m > r,
m n
-
(25) a(z', f) = X>7V, Mz; C)= 1 J^bjz'Hn > 1);
y=i 7=i
= = l,...,n;
(26) 0/> 0, j \,...,m; bj>0, j
(30) de(0,\).
We call xt based on (29) a fractional GARCH, FGARCH(rc, do, rn) process, for
=
do d(t;o).
Then (15) is true. Let also d and the aj, bj be thrice continuously dijferentiable
and do > Then (17) is true.
\.
Yl{ZocJ-kdk, .7> 1- From [3], the Cj are bounded above and below by posi
tive, exponentially when n > 1, and are all nonnegative when
decaying sequences
n = 0. Since the are all positive, it follows that (6) holds. Also, Stirling's approx
dj
imation indicates that j~d~x/K < < so the same
dj Kj~d~x, V^/(f) satisfy the
of Y, smoothness of d, and > d, to
inequalities. Compactness (30), imply d(t;)
check (7). The above argument indicates that tyoj < Kj~d?~x < Kk~d?~x <
Kxjrok
> k > so to
for j 1, (8) holds, and thus Assumption D. With regard (11), note
that (d/dd)^(z\ O = -{a(z; 0/b(z; - -
0}z-1(l z/ln(l z)9 where the coef
- - < <
ficient of zJ in -z_1(l z)d\n(l z) is K(lnj)j-d-x
EJk=ik~xdj-k
1060 P.M. ROBINSON AND P. ZAFFARONI
9^(z) j-Uc^ = 1
?-=zJ 0(z), j l,...,m,
daj
??=ZJ =
y(z)c/)(z), j l,...,n,
dbj
?f(z) Y(z) - d -
?ZT~ =-(1 z)a log(l z).
ad z
Choose ji (f)
= i for i =
1,..., m + n, ? e T, leaving jm+n+\ (t;) to be determined
subsequently. Fix ? and write U = partitioning it in the ratio m + n:l
x^(ji,...jr)(0^
and calling its (i, j)th submatrix We first show that the (m + n) x (m + n)
[/,-y.
matrix U\\ is nonsingular. Write R for the n x (m + n) matrix with (/, j)th element
x (m + a) matrix with (/, y)th element where
Yj-i, and S for the (m + n) 07_/+i,
= = 0 for < 0, and for > 0, 0y and Yj are
0y y^r j j respectively given by
OO 00
n-\
- 1.
X0Yj ^2^iYj-i
=0' J =m,...,m+n-
1=1
But it follows from (25) and (27) that they are generated by the linear difference
equation
n-\
-
Yj J2 biyj-i =7lh j =m,...,m+n-l,
i=\
ESTIMATION OF ARCH(oo) MODELS 1061
= = = ?
where nm am + for m + 1,..., m + n 1. Since
bnym-n, iij bnyj-n j
though Baillie, Bollerslev and Mikkelsen [1] indicated that they can be obtained.
Also, unlike FGARCH, FIGARCH xt never has finite variance.
The requirement do > for the central limit theorem in Corollary 1would also
\
be imposed in a corresponding result for FIGARCH. This is automatically satisfied
in GARCH models but if only do e (0, in (13) is possible in the general setting
^]
of Section 3, it appears that the asymptotic bias in 9j is of order at least T~xl2,
whereas that for 9j is always o(T~xl2). Assumption H copes with the replacement
of cr2(r^) by (9), the truncation error varying inversely with do- Inspection of the
of
proof of Theorem 2 indicates that this bias problem is due to the term H~1B\t.
?
The factor
of of
in b\t is nonnegative, and if j~do~x is an exact rate for
ifroj,
? as t ->
exceeds t~d?/K 00 with probability approaching one. So far as
of of
the factor ot is concerned, the second element of o2^ [see (18)] has
/of mb\t
zero mean, but the first is positive, and though the \/f- can have elements of either
sign, whenever do < \ it seems unlikely that the last r elements of B\j can be
1062 P.M. ROBINSON AND P. ZAFFARONI
<K+ K
aj ? ? Vo;, ihj,slj1eljl-h ~$-h-...-X
?(/=1 Vl = l 7/= l /
so by the cr -inequality,
00 / oo oo
'=1 Vi = l 7/=l
v
X Ip \2p \p -\2p\ I
\bt-J\-J2\ \kt-J\-Ji\
Thus, (10) holds if there is a p satisfying (9) and (32). Recursive substitution
and the cr-inequality were also used by Nelson ([27], Corollary) to upper-bound
in the GARCH(1,1) case, but he employed the simple dynamic structure
E\at\2p
available there, and (32) does not reduce to his necessary and sufficient condition.
If V(l; ?o) < 1> (32) adds nothing because we already know that Exq < oo
here, but if i/r(l; fo) > 1, the second factor on the left-hand side of (32) exceeds 1
and increases with p; the question is whether the first factor, which is less than
1 and decreases with p [due to Assumption A(g)], can over-compensate. Ana
lytic verification of (32) for given fo> P seems in general infeasible, and numerical
verification when the However, consider the
highly problematic x//j decay slowly.
family of densities
for y > 0, where a(y) = {r(y)/T(3y)}1/2 (also used by Nelson [28] tomodel
the innovation of the exponential GARCH We = 1
have E8o =
model). 0, E8\
as necessary, Assumption A(q) is satisfied for all q > 0, and E\8o\2p = T((2p +
(34) = + iyxeid^x^e-di,
x/,j(0 Y,r(f
i=\
(36) d e (0,oo),
(37) e{ > 0, i= 1,.. .,m,
tyj(0 > 0, all j > 1. By choosing m large enough in (34) or (35), any finite
\/f(l;i;) can be arbitrarily well approximated, but (34) and (35) can also achieve
For real x > 1, x^e~dx and (lnx)^x~d~x if
parsimony. decay monotonically
= 0, and for > 0, have maxima at and
/ / single f/d e//^+1\ respectively. Thus,
with m ? l and f\ ? 0, we have monotonic decay in (34) and (35); otherwise,
both can exhibit lack of monotonicity, while eventually decaying exponentially or
hyperbolically. The scale factors in (34) and (35) are so expressed because x^e~dx
and (lnx)fx~d-x integrate over (0, oo) to r(/ + l)/df+x and T(f + I)Id, re
so that ? in both but the
spectively, V"0; K) J2T=\ ei cases, approximation may
1064 P.M. ROBINSON AND P. ZAFFARONI
not be very close and the "integrated" case is less easy to distinguish than in
GARCH and FGARCH models (though it would be possible to alternatively scale
the weights by infinite sums to achieve equality).
The following corollary covers (34) and (35) and implies the
simultaneously,
special case when the f are specified a priori, for example, to be nonnegative
integers; strictly speaking, when the true value of f\ is unknown, Assumption C
prevents it from being zero.
COROLLARY2. Let^r(z\t;)be given by (22) and (34) or (35) with ra > 1 and
let d and the et, f be continuously differentiable. For some 8 > 0, let Assump
tionsA(2 + 8), B, C and E hold, with all f e Y satisfying (36)-(38) and
rank = r.
?(<?i, /i, ...,em, fm,d)
Then (15) is true. Let also d and the et, f be thrice continuously differentiable
and Assumption A(4) hold, and do = d(t^o) > in case of (35). Then (17) is true.
\
PROOF. Given (36)-(38) and the proofs of Corollaries 1 and 2, the verification
of Assumptions D and F(/) is straightforward. We check Assumption G for (35)
only, a very similar type of proof holding for (34). We have
1
wlE(u\j,...,u'mjy\d^
L vj J
where
= - i= l,...,r,
uu (lnlntf + 1) (d/dfi)lnr(fi+l), l)'ln^(; + 1),
m
= + 1^
Vj -j2eir(fi+iyllnfi+lu
(41) < K.
?sup|lnar2(^)|
(42)E\T8j\ <oo.
Jo
J?
(43) r\ roo
< tk~xdt+
/ tk-xMn2(-t)dt.
JO J\/ ?o
1066 P.M. ROBINSON AND P. ZAFFARONI
It suffices to show that the last integral is bounded. For all 8 > 0, there exists r\ > 0
<
such that L(s~l) ?~s, e e (0, rj), so
2 rT] 1 1
e~te /(e) ds<K e-'e eb-s ds + 2e~tri .
/OO-oo JO
The last integral is bounded by
f?? < Kt0-b-l)/2m
Kt(8-b-l)/2 e-e?(8-b-l)/2d?
JO
- -
Thus, (43) is finite if k + n(8 b l)/2 < 0, that is, since 8 is arbitrary, if A:<
(fc+l)n/2. D
E sup ( <*}
' V <
~ K < oo.
eeeWW)'
Proof. We have
oo
The proof can now be completed much as in the proof of Lemma 5.1 of [3], us
ing Holder's inequality as there but employing our Lemma 4 and taking M >
2pq/[(b+l)(q-2p)]. D
?-;^'
90,-, 90,-t ^
where = Now
?,-(?) 9*^(?)/9??,-2 90,-2.
I OO I OO
'7=1 I 7=1
so using Lemma 1,
\ P/P
oo r oo ( oo \ 1-P/P
<
E it;(f)ix2_,- =E i?y(f)ip/p^(f),-w"^ E =wk2.,
1= 1 7 17 J 1.7 J 1
SO
E sup ?-=-
{ } < oo.
oee\ o*2{6) I
The above proof implies that also
1p
E 1^(011 <??,
7=1
{00 1
whence, the proof of (44) with i\ > 3 is concluded. Next take i\ = 2. If ii > 2,
~
dko}{9)
<4?)
?in-r-1^ii0xi_lW,
1068 P.M. ROBINSON AND P. ZAFFARONI
where now = if i2 =
f;(?) 3*-V;(?)/9?i2-2 9^-2, while 2, i3 > 2,
Define
T
gt(9) = ut(9)uft(9), GT(9) = T~x J>(0).
t=\
LEMMA 7. For some 8 > 0, under Assumptions A(2 +8), B, C, D,E and F(l),
- 0.5. as T -
(47) sup \QT(9) Q(9)\ -* 0 00,
6>e0
Thus, by ergodicity
QT(9)^Q(9) a.s.,
for all 9 0. Then uniform convergence follows on establishing the equicontinu
ity property
as s -> 0, and continuity of Q(9). By the mean value theorem it suffices to show
that
SUP
?^? + , SUP~^~
\\dQT(9)\\ |3G(0)||< ?? a*s-'
0 II SO II e II 39 ||
<
\\ut(9)\\ + l^l-^T+ l
4lM#)||(l I +^-?k!
L erf(9)\ crt(9) JI
r r ip-il/pr
a2 -|l-l/p
Esup||ro(0)||+ Esup -^M Esupllro^)!!^^-"
0 L 0 Yofj(9)\ J L 0 J
+ ?supj-^-j + l
for all p > 1.On choosing p < 1+ 5/2, this is finite by Lemmas 5 and 6. (Our use
of Lemmas 5 and 6 is similar to Berkes, Horvath and Kokoszka's [3] use of their
Lemmas 5.1 and 5.2 in the GARCH(n, m) case.) This completes the proof of (47).
Then (48) and (49) follow by applying analogous arguments to those above, and so
we omit the details; indeed, (48) and (49) are only used in the proof of consistency
of Gt(9t), Ht(9t) for Go, Ho, where convergence over only a neighborhood of
#o would suffice.
Because
oo
\AT(9)\<KT-xJ2i^(0)-^m
t=\
T oo
<KT-'J2J2*jWxljM
r=l j=t
oo f t+T \
<KT~lJ2 E fjiOUUn).
t=oij=t+\ J
Now from (7),
t+T t+T
supY MS)<K E F^1 <Kmm(t + \,T)(t + l)-^~x.
SeT j=t+\ j=t+\
Thus,
T oo
From the cr-inequality, (9) and (10), J^iO + l)~-~Xx2Lt has finite pth moment,
and thus, by Loeve ([23], page 121), is a.s. finite. Thus, the second term of (53)
?
tends to zero a.s. as T oo, while the first does so for the same reasons combined
with the Kronecker lemma. Next,
T oo
\BT(e)\< KT-^xtmY^tm^jiiD
(54) T oo
< KT-X + D
Ex^)E;~-_1(*r2-7
t=\ j=t
-+ EXo(0)< +
T-'Y^XtiO) k\e(^-\ \\<K
by ergodicity and Lemma 5. Thus, (54) ?? 0 a.s. by the Toeplitz lemma. The con
vergence is uniform in 9 because, from the proof of Lemma 7, for all 9 e &,
as s ?> 0. This completes the proof of (50). We omit the proofs of (51) and (52) as
they involve the same kind of arguments.
Tta)(9)= ^-lnof(9)
oco
=
ot-2(9),
=? =
M0) lna2(0) -2at-2(6) T ^;(?)*,-;(/*),
o CO
= =
^(9) --lnof(9) o-2(9) J2 IrfhOxf-ifa),
a? 7=1
so that rt(9) = (Ttco(9), k = (X\,X2,
rtfl(9), t^(9))'. Write Xf3)\
where A.i and X2
are scalar and A.3 is r x 1. Consider first the case = =
k\ X2 0, A.3^ 0. Suppose
(55) does not hold. Then we must have
00
= 0 a.s.
J^X.'3iff\Oxf_j(fi)
;=i
If i- 0, it follows that
A.^j(1)(?)
00
-
(56) (<r,_ie,_i + jio vf = ? ^f\t)x-M)
~{^j^tt)}"1
7=2
Since ot-\ > 0 a.s., the left-hand side involves the nondegenerate random vari
able st-\, which is independent of the right-hand side, so (56) cannot hold. Thus,
^ = 0- of this argument indicates
3^7 (?) Repeated application that, for all f,
= = l,..., This is contradicted
Xf3\//j(t;) 0,j jr(0- by Assumption G, so (56)
cannot hold. Next consider the case X\ = 0, X2 / 0, A3 = 0. If (56) does not hold,
we must have
00
(57) = 0 a.s.
?lMO*?-y(/*)
;=i
1072 P.M. ROBINSON AND P. ZAFFARONI
- - -
{2^(0 + /xo li)}{cft-k?t-k + Mo M}
X'3\l/(kX)(t;)(crt-k8t-k
00
= a.s.
E {^?\Oxt-j(v) -2^j(0}xt-j(li)
7=^+1
The left-hand side is a.s. nonzero and involves the nondegenerate random variable
8t-k, which is independent of the right-hand side, so (58) cannot
hold. We are left
? =
with the cases where X\ ^ 0. Taking X\ = 1 and noting that cr2(9)rta)(9) 1,
the preceding arguments indicate that there exist no X2 and A3 such that
= 1 a.s.
^2cr2(9)rt^(9) + Xf3a2(9)rH(9)
LEMMA 10. For some 8 > 0, under Assumptions A(2 + 8), B, C, D, E, F(l)
andH,
Proof. We have
^v )- Q(9o)
Q(0) mko) = E -^-lnj-^?[-1 ^
+(/z-/z0)2? } ?,? .
[cup) \a2(9)\ J lcr2(9).
term on the right-hand side is zero only when =
The second \i /xo and is positive
? ? > > ?
otherwise. Because x hut 1 0 for x 0, with equality only when x 1, it
remains to show that
REFERENCES
[1] Baillie, R. T., Bollerslev, T. and Mikkelsen, H. O. (1996). Fractionally inte
[21] Lee, S. and Hansen, B. (1994). Asymptotic theory for the GARCH(1,1) quasi-maximum
likelihood estimator. Econometric Theory 10 29-52. MR 1279689
[22] Ling, S. and McAleer, M. (2003). Asymptotic for a vector ARMA-GARCH model.
theory
Econometric Theory 19 280-310. MR1966031
[23] LOEVE,M. (1977). Probability Theory 1, 4th ed. Springer, New York. MR0651017
[24] Lumsdaine, R. (1996). Consistency and asymptotic of the quasi-maximum likeli
normality
hood estimator in IGARCHQ, 1) and covariance GARCH(1,1) models. Econo
stationary
metrica 64 575-596. MR1385558
[25] MlKOSCH, T. and Starica, C. (2000). Limit theory for the sample autocorrelations and
extremes of a GARCH(1, 1) process. Ann. Statist. 28 1427-1451. MR1805791
[26] MlKOSCH, T. and Straumann, D. (2002). Whittle estimation in a heavy-tailed
GARCH(1, 1)model. Stochastic Process. Appl 100 187-222. MR1919613
[27] NELSON, D. (1990). Stationary and persistence in the GARCH(1, 1)models. Econometric
Theory 6 318-334. MR1085577
[28] Nelson, D. (1991). Conditional heteroskedasticity in asset returns: A new Econo
approach.
metrica 59 347-370. MR1097532
[29] Robinson, P. M. (1991). Testing for strong serial correlation and dynamic conditional het
[31] Robinson, P. M. and Zaffaroni, P. (1998). Nonlinear time series with long memory:
A model for stochastic volatility. J. Statist. Plann. Inference 68 359-371. MR 1629599
[34] vonBahr,B. and Esseen, C.-G. (1965). Inequalities for the rth absolute moment of a sum
of random variables, 1 < r < 2. Ann. Math. Statist. 36 299-303. MR0170407
[36] Zaffaroni, P. (2003). Gaussian inference on certain long-range dependent volatility models.
J. Econometrics 115 199-258. MR1984776
[37] ZAFFARONI, P. (2004). Stationarity and memory of ARCH(oo) models. Econometric Theory
20 147-160. MR2028355