
Linear Algebra and its Applications 354 (2002) 223–229

www.elsevier.com/locate/laa

Third and fourth moment matrices of vec X in multivariate analysis
Heinz Neudecker^a, Götz Trenkler^b,*

^a Department of Economics, University of Amsterdam, Roetersstraat 11, NL-1018 WB Amsterdam, The Netherlands
^b Department of Statistics, University of Dortmund, Vogelpothsweg 87, D-44221 Dortmund, Germany

* Corresponding author. E-mail addresses: heinz@fee.uva.nl (H. Neudecker), trenkler@statistik.uni-dortmund.de (G. Trenkler).

Received 1 June 2001; accepted 13 February 2002


Submitted by S. Puntanen

Abstract
We consider the problem of estimating a function of the multivariate mean, where to a good approximation this function can be written as the sum of a linear and a quadratic form. For a natural class of estimators we obtain the mean and the variance. Since normality is not assumed, the variance turns out to be a rather involved quantity, depending on third and fourth moment matrices.
© 2002 Elsevier Science Inc. All rights reserved.
AMS classification: 15A63; 62H12

Keywords: Multivariate analysis; Nonlinear function of the mean; Estimation; Third and fourth moment
matrices

1. Introduction

Suppose $x_1, x_2, \ldots, x_n$ represent a random sample from a $p$-dimensional distribution with mean $\mu = E(x_i)$ and covariance matrix $\Sigma = E[(x_i - \mu)(x_i - \mu)']$. Let $X$ denote the corresponding sample matrix, i.e. $X = (x_1, \ldots, x_n)$. Consider now the problem of estimating a function $f(\mu)$ of $\mu$, where to a good approximation this function can be written as $f(x) = a_0 + a'x + x'Ax$, where $a_0$ is a real constant, $a$ is

a $p \times 1$ vector and $A$ is a symmetric matrix of type $p \times p$. Then a natural estimator for $f(\mu)$ should be

$$T = f_0 + f'\,\mathrm{vec}\,X + (\mathrm{vec}\,X)'F\,\mathrm{vec}\,X \tag{1.1}$$

with some suitably chosen nonstochastic quantities $f_0$, $f$ and $F$. The vector $\mathrm{vec}\,X$ is obtained from the matrix $X$ by stacking its columns one underneath the other, i.e. $\mathrm{vec}\,X = (x_1', \ldots, x_n')'$.
Since $E(\mathrm{vec}\,X) = 1_n \otimes \mu$ and $\mathrm{Cov}(\mathrm{vec}\,X) = I_n \otimes \Sigma$, we immediately have

$$E(T) = f_0 + f'(1_n \otimes \mu) + \mathrm{tr}\,F(I_n \otimes \Sigma) + (1_n \otimes \mu)'F(1_n \otimes \mu) \tag{1.2}$$

(cf. [4, p. 13]), where $1_n$ is the $n \times 1$ vector of ones. However, when calculating $\mathrm{Var}(T)$ we need a relationship between the third and fourth moment matrices of the $x_i$ and of $\mathrm{vec}\,X$, respectively. This relationship will be established in the following section.
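As an illustration of (1.2), the following sketch (Python/NumPy; the particular $f_0$, $f$, $F$, $\mu$ and $\Sigma$ are arbitrary choices of ours, and normal sampling is used only for convenience, since (1.2) itself does not require normality) compares a Monte Carlo estimate of $E(T)$ with the right-hand side of (1.2):

```python
import numpy as np

rng = np.random.default_rng(0)
p, n = 2, 3
mu = np.array([1.0, -0.5])
Sigma = np.array([[1.0, 0.3], [0.3, 0.5]])

# Arbitrary (hypothetical) choices of f0, f and F.
f0, f = 0.7, rng.normal(size=n * p)
F = rng.normal(size=(n * p, n * p))

# Right-hand side of (1.2): E(vec X) = 1_n kron mu, Cov(vec X) = I_n kron Sigma.
m_vec = np.kron(np.ones(n), mu)
c_vec = np.kron(np.eye(n), Sigma)
rhs = f0 + f @ m_vec + np.trace(F @ c_vec) + m_vec @ F @ m_vec

# Monte Carlo estimate of E(T): each row of V is one realisation of vec X,
# i.e. the n sample vectors stacked one underneath the other.
reps = 200_000
V = rng.multivariate_normal(mu, Sigma, size=(reps, n)).reshape(reps, n * p)
T = f0 + V @ f + np.einsum('ri,ij,rj->r', V, F, V)
print(rhs, T.mean())   # should agree up to simulation noise
```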

2. Results

Consider the independent random vectors $z_i = x_i - \mu$, $i = 1, \ldots, n$. Then $E(z_i) = 0$ and $E(z_i z_i') = \mathrm{Cov}(x_i) = \Sigma$. Assume that the following third and fourth moment matrices exist:

$$\Phi = E(z_i \otimes z_i z_i'), \tag{2.1}$$

$$\Psi = E(z_i z_i' \otimes z_i z_i'). \tag{2.2}$$

Let now $z = (z_1', z_2', \ldots, z_n')'$, so that $z = \mathrm{vec}\,X - E(\mathrm{vec}\,X) = \mathrm{vec}\,X - 1_n \otimes \mu$, and consequently $E(z) = 0$, $E(zz') = \mathrm{Cov}(\mathrm{vec}\,X) = I_n \otimes \Sigma$.
Our objective is to determine

$$\Phi^* = E(z \otimes zz') \tag{2.3}$$

and

$$\Psi^* = E(zz' \otimes zz') \tag{2.4}$$

in terms of $\Phi$, $\Psi$ and other quantities.
We now state our main result:

Theorem 1. The following identities hold:

(i) $\Phi^* = (I_n \otimes K_{pn} \otimes I_p)(G \otimes \Phi)$,

where $G = (E_{11}, \ldots, E_{nn})'$ with $E_{ii} = e_i e_i'$, $e_i$ being the $i$th member of the canonical basis of $\mathbb{R}^n$, and commutation matrix $K_{p,n} = K_{pn} = \sum_{i=1}^{p}\sum_{j=1}^{n}(H_{ij} \otimes H_{ij}')$, where $H_{ij}$ is the $p \times n$ matrix with 1 in its $(i,j)$th position and zeros elsewhere.

(ii) $\Psi^* = (I_{n^2p^2} + K_{np,np})(I_n \otimes \Sigma \otimes I_n \otimes \Sigma) + [\mathrm{vec}(I_n \otimes \Sigma)][\mathrm{vec}(I_n \otimes \Sigma)]'$
$\qquad + (I_n \otimes K_{pn} \otimes I_p)\{\tilde{K}_{nn} \otimes [\Psi - (I_{p^2} + K_{pp})(\Sigma \otimes \Sigma) - (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)']\}(I_n \otimes K_{np} \otimes I_p)$,

where $\tilde{K}_{nn} = \sum_{i=1}^{n}(E_{ii} \otimes E_{ii})$.
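Although not part of the original paper, the following short Python/NumPy sketch may help readers who wish to experiment with these objects: it builds $K_{pn}$ from the matrices $H_{ij}$ exactly as defined above, checks the well-known defining property $K_{pn}\,\mathrm{vec}\,A = \mathrm{vec}(A')$ for a $p \times n$ matrix $A$, and constructs $G$ and $\tilde{K}_{nn}$; the helper names (`H`, `commutation`, `vec`) are ours, not the authors'.

```python
import numpy as np

def H(i, j, p, n):
    """p x n matrix with a 1 in position (i, j) and zeros elsewhere."""
    M = np.zeros((p, n))
    M[i, j] = 1.0
    return M

def commutation(p, n):
    """K_{pn} = sum_{i,j} H_ij (kron) H_ij', as in Theorem 1."""
    return sum(np.kron(H(i, j, p, n), H(i, j, p, n).T)
               for i in range(p) for j in range(n))

def vec(A):
    """Stack the columns of A one underneath the other."""
    return A.reshape(-1, order="F")

p, n = 2, 3
K = commutation(p, n)
A = np.arange(float(p * n)).reshape(p, n)
assert np.allclose(K @ vec(A), vec(A.T))       # K_{pn} vec A = vec(A')

# G = (E_11, ..., E_nn)' (the E_ii stacked) and K~_nn = sum_i E_ii kron E_ii.
E = [np.outer(np.eye(n)[i], np.eye(n)[i]) for i in range(n)]
G = np.vstack(E)                               # n^2 x n
K_tilde = sum(np.kron(Eii, Eii) for Eii in E)  # n^2 x n^2
```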

Proof. (i) Consider

$$z \otimes zz' = \begin{pmatrix} z_1 \otimes zz' \\ \vdots \\ z_n \otimes zz' \end{pmatrix} = (I_n \otimes K_{p,np})\begin{pmatrix} zz' \otimes z_1 \\ \vdots \\ zz' \otimes z_n \end{pmatrix}. \tag{2.5}$$

This follows from the identity

$$z_i \otimes zz' = K_{p,np}(zz' \otimes z_i), \quad i = 1, \ldots, n \tag{2.6}$$

(cf. [2, Theorem 3.1(viii)]).
Obviously we have

$$E(z_j z_k' \otimes z_i) = 0 \tag{2.7}$$

unless $i = j = k$. Since $E(z_i \otimes z_i z_i') = \Phi$ and $E(z_i z_i' \otimes z_i) = K_{pp}\Phi$, Eq. (2.7) yields

$$E(zz' \otimes z_i) = E_{ii} \otimes K_{pp}\Phi, \quad i = 1, \ldots, n. \tag{2.8}$$

Hence we obtain

$$E(z \otimes zz') = (I_n \otimes K_{p,np})\,E\begin{pmatrix} zz' \otimes z_1 \\ \vdots \\ zz' \otimes z_n \end{pmatrix} \tag{2.9}$$

$$= (I_n \otimes K_{p,np})(G \otimes K_{pp}\Phi). \tag{2.10}$$

Now we have (cf. [3])

$$K_{m,rs} = (K_{mr} \otimes I_s)(I_r \otimes K_{ms}), \tag{2.11}$$

which implies, since $K_{pp}K_{pp} = I_{p^2}$:

$$(I_n \otimes K_{p,np})(G \otimes K_{pp}\Phi) = (I_n \otimes K_{pn} \otimes I_p)(G \otimes \Phi). \tag{2.12}$$
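Since the proof of part (i) is purely combinatorial, it can be confirmed by exact enumeration on a small example. The sketch below (Python/NumPy; our own illustration with an arbitrarily chosen mean-zero three-point distribution, not taken from the paper) computes $\Phi$ and $\Phi^*$ exactly and compares $\Phi^*$ with the right-hand side of part (i); here `commutation(m, n)` builds $K_{mn}$ directly as the permutation satisfying $K_{mn}\,\mathrm{vec}\,A = \mathrm{vec}(A')$.

```python
import numpy as np
from itertools import product

def commutation(m, n):
    """K_{mn}: the permutation matrix with K_{mn} vec A = vec(A'), A m x n."""
    K = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            K[i * n + j, j * m + i] = 1.0
    return K

p, n = 2, 2
# Arbitrary mean-zero three-point distribution for the z_i (hypothetical).
support = np.array([[1.0, 0.0], [0.0, 2.0], [-1.0, -1.0]])
prob = np.array([0.5, 0.3, 0.2])
support = support - prob @ support                     # enforce E(z_i) = 0

# Exact third moment matrix Phi = E(z_i kron z_i z_i'), a p^2 x p matrix.
Phi = sum(q * np.kron(v[:, None], np.outer(v, v)) for v, q in zip(support, prob))

# Exact Phi* = E(z kron zz'), enumerating all n-tuples of independent draws.
m = n * p
Phi_star = np.zeros((m * m, m))
for idx in product(range(len(prob)), repeat=n):
    z = np.concatenate([support[k] for k in idx])
    w = np.prod([prob[k] for k in idx])
    Phi_star += w * np.kron(z[:, None], np.outer(z, z))

# Right-hand side of Theorem 1(i).
E = [np.outer(np.eye(n)[i], np.eye(n)[i]) for i in range(n)]
G = np.vstack(E)
rhs = np.kron(np.eye(n), np.kron(commutation(p, n), np.eye(p))) @ np.kron(G, Phi)
assert np.allclose(Phi_star, rhs)
```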
(ii) Consider

$$zz' \otimes zz' = \begin{pmatrix} z_1 z_1' \otimes zz' & \cdots & z_1 z_n' \otimes zz' \\ \vdots & & \vdots \\ z_n z_1' \otimes zz' & \cdots & z_n z_n' \otimes zz' \end{pmatrix}. \tag{2.13}$$

Since $z_i z_j' \otimes zz' = K_{p,np}(zz' \otimes z_i z_j')K_{np,p}$, it follows that

$$zz' \otimes zz' = (I_n \otimes K_{p,np})\begin{pmatrix} zz' \otimes z_1 z_1' & \cdots & zz' \otimes z_1 z_n' \\ \vdots & & \vdots \\ zz' \otimes z_n z_1' & \cdots & zz' \otimes z_n z_n' \end{pmatrix}(I_n \otimes K_{np,p}). \tag{2.14}$$

Clearly, only $E(z_i z_i' \otimes z_i z_i')$ and $E(z_i z_i' \otimes z_j z_j')$, $i \neq j$, are different from $0$ $(i, j = 1, \ldots, n)$. Furthermore, for $i \neq j$ we have

$$E(z_i z_i' \otimes z_j z_j') = \Sigma \otimes \Sigma \tag{2.15}$$

such that along with (2.2) we arrive at

$$E(zz' \otimes z_i z_i') = \begin{pmatrix} \Sigma \otimes \Sigma & & & & 0 \\ & \ddots & & & \\ & & \Psi & & \\ & & & \ddots & \\ 0 & & & & \Sigma \otimes \Sigma \end{pmatrix}$$
$$= I_n \otimes \Sigma \otimes \Sigma + E_{ii} \otimes (\Psi - \Sigma \otimes \Sigma), \quad i = 1, \ldots, n, \tag{2.16}$$

where $\Psi$ occupies the $i$th diagonal block.

In addition, for $i \neq j$ we get

$$E(zz' \otimes z_i z_j') = E_{ij} \otimes (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)' + E_{ji} \otimes K_{pp}(\Sigma \otimes \Sigma), \tag{2.17}$$

i.e. the only nonvanishing blocks are $(\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)'$ in the $(i,j)$-block and $K_{pp}(\Sigma \otimes \Sigma)$ in the $(j,i)$-block; here $E_{ij} = e_i e_j'$ and $E_{ji} = e_j e_i'$ are of type $n \times n$ $(i \neq j)$. To show (2.17), we used the identities

$$z_i z_j' \otimes z_i z_j' = (z_i \otimes z_i)(z_j \otimes z_j)' = (\mathrm{vec}\,z_i z_i')(\mathrm{vec}\,z_j z_j')' \tag{2.18}$$

and

$$z_j z_i' \otimes z_i z_j' = (z_j \otimes z_i)(z_i \otimes z_j)' = K_{pp}(z_i \otimes z_j)(z_i \otimes z_j)' = K_{pp}(z_i z_i' \otimes z_j z_j'). \tag{2.19}$$

This implies

$$\begin{aligned}
& (I_n \otimes K_{np,p})\,E(zz' \otimes zz')\,(I_n \otimes K_{p,np}) \\
& = \begin{pmatrix}
I_n \otimes \Sigma \otimes \Sigma + E_{11} \otimes (\Psi - \Sigma \otimes \Sigma) & \cdots & E_{1n} \otimes (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)' + E_{n1} \otimes K_{pp}(\Sigma \otimes \Sigma) \\
\vdots & & \vdots \\
E_{n1} \otimes (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)' + E_{1n} \otimes K_{pp}(\Sigma \otimes \Sigma) & \cdots & I_n \otimes \Sigma \otimes \Sigma + E_{nn} \otimes (\Psi - \Sigma \otimes \Sigma)
\end{pmatrix} \\
& = I_{n^2} \otimes \Sigma \otimes \Sigma + \sum_{i=1}^{n}[E_{ii} \otimes E_{ii} \otimes (\Psi - \Sigma \otimes \Sigma)] \\
& \quad + \sum_{i \neq j} E_{ij} \otimes E_{ij} \otimes (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)' + \sum_{i \neq j}(E_{ij} \otimes E_{ji}) \otimes K_{pp}(\Sigma \otimes \Sigma) \\
& = I_{n^2} \otimes \Sigma \otimes \Sigma + \tilde{K}_{nn} \otimes (\Psi - \Sigma \otimes \Sigma) + (\mathrm{vec}\,I_n)(\mathrm{vec}\,I_n)' \otimes (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)' \\
& \quad + K_{nn} \otimes K_{pp}(\Sigma \otimes \Sigma) - \tilde{K}_{nn} \otimes (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)' - \tilde{K}_{nn} \otimes K_{pp}(\Sigma \otimes \Sigma) \\
& = (I_{n^2p^2} + K_{nn} \otimes K_{pp})(I_{n^2} \otimes \Sigma \otimes \Sigma) \\
& \quad + \tilde{K}_{nn} \otimes [\Psi - (I_{p^2} + K_{pp})(\Sigma \otimes \Sigma) - (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)'] \\
& \quad + (\mathrm{vec}\,I_n \otimes \mathrm{vec}\,\Sigma)(\mathrm{vec}\,I_n \otimes \mathrm{vec}\,\Sigma)',
\end{aligned} \tag{2.20}$$

where we used formula (2.11) from [2].


Our next step is to premultiply (2.20) by $I_n \otimes K_{p,np}$ and postmultiply by $I_n \otimes K_{np,p}$ in order to obtain $E(zz' \otimes zz')$. By using again (2.11), which yields

$$I_n \otimes K_{np,p} = (I_{n^2} \otimes K_{pp})(I_n \otimes K_{np} \otimes I_p), \tag{2.21}$$

for the first term in (2.20) we find

$$\begin{aligned}
& (I_n \otimes K_{pn} \otimes I_p)(I_{n^2} \otimes K_{pp})(I_{n^2p^2} + K_{nn} \otimes K_{pp})(I_{n^2} \otimes \Sigma \otimes \Sigma)(I_{n^2} \otimes K_{pp})(I_n \otimes K_{np} \otimes I_p) \\
& = (I_n \otimes K_{pn} \otimes I_p)(I_{n^2} \otimes K_{pp} + K_{nn} \otimes I_{p^2})[I_{n^2} \otimes K_{pp}(\Sigma \otimes \Sigma)](I_n \otimes K_{np} \otimes I_p) \\
& = (I_n \otimes K_{pn} \otimes I_p)[I_{n^2} \otimes \Sigma \otimes \Sigma + K_{nn} \otimes K_{pp}(\Sigma \otimes \Sigma)](I_n \otimes K_{np} \otimes I_p) \\
& = (I_n \otimes K_{pn} \otimes I_p)(I_{n^2p^2} + K_{nn} \otimes K_{pp})(I_{n^2} \otimes \Sigma \otimes \Sigma)(I_n \otimes K_{np} \otimes I_p) \\
& = (I_{n^2p^2} + K_{np,np})(I_n \otimes K_{pn} \otimes I_p)(I_n \otimes I_n \otimes \Sigma \otimes \Sigma)(I_n \otimes K_{np} \otimes I_p) \\
& = (I_{n^2p^2} + K_{np,np})(I_n \otimes \Sigma \otimes I_n \otimes \Sigma),
\end{aligned} \tag{2.22}$$

where use was made of

$$K_{pq,mn} = (I_p \otimes K_{qm} \otimes I_n)(K_{pm} \otimes K_{qn})(I_m \otimes K_{pn} \otimes I_q)$$

(cf. [3, formula (3.2)]).
Further,

$$\begin{aligned}
& (I_n \otimes K_{pn} \otimes I_p)(I_{n^2} \otimes K_{pp})\{\tilde{K}_{nn} \otimes [\Psi - (I_{p^2} + K_{pp})(\Sigma \otimes \Sigma) - (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)']\}(I_{n^2} \otimes K_{pp})(I_n \otimes K_{np} \otimes I_p) \\
& = (I_n \otimes K_{pn} \otimes I_p)\{\tilde{K}_{nn} \otimes [\Psi - (I_{p^2} + K_{pp})(\Sigma \otimes \Sigma) - (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)']\}(I_n \otimes K_{np} \otimes I_p),
\end{aligned} \tag{2.23}$$

as $K_{pp}\Psi K_{pp} = \Psi$ and $K_{pp}\,\mathrm{vec}\,\Sigma = \mathrm{vec}\,\Sigma$. Finally we have

$$\begin{aligned}
& (I_n \otimes K_{pn} \otimes I_p)(I_{n^2} \otimes K_{pp})(\mathrm{vec}\,I_n \otimes \mathrm{vec}\,\Sigma)(\mathrm{vec}\,I_n \otimes \mathrm{vec}\,\Sigma)'(I_{n^2} \otimes K_{pp})(I_n \otimes K_{np} \otimes I_p) \\
& = (I_n \otimes K_{pn} \otimes I_p)(\mathrm{vec}\,I_n \otimes \mathrm{vec}\,\Sigma)(\mathrm{vec}\,I_n \otimes \mathrm{vec}\,\Sigma)'(I_n \otimes K_{np} \otimes I_p) \\
& = [\mathrm{vec}(I_n \otimes \Sigma)][\mathrm{vec}(I_n \otimes \Sigma)]'
\end{aligned} \tag{2.24}$$

by virtue of [3, Theorem 3.1(i)].
Collecting the terms (2.22)–(2.24) then gives part (ii) of the theorem. □
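As with part (i), part (ii) can be checked by exact enumeration on a small discrete example. The following sketch (again Python/NumPy with the same hypothetical three-point distribution; purely illustrative) compares $\Psi^* = E(zz' \otimes zz')$ with the right-hand side of Theorem 1(ii).

```python
import numpy as np
from itertools import product

def commutation(m, n):
    """K_{mn}: the permutation matrix with K_{mn} vec A = vec(A'), A m x n."""
    K = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            K[i * n + j, j * m + i] = 1.0
    return K

def vec(A):
    """vec A as a column vector."""
    return A.reshape(-1, order="F")[:, None]

p, n = 2, 2
support = np.array([[1.0, 0.0], [0.0, 2.0], [-1.0, -1.0]])
prob = np.array([0.5, 0.3, 0.2])
support = support - prob @ support                     # E(z_i) = 0

Sigma = sum(q * np.outer(v, v) for v, q in zip(support, prob))
Psi = sum(q * np.kron(np.outer(v, v), np.outer(v, v)) for v, q in zip(support, prob))

# Exact Psi* = E(zz' kron zz') by enumerating all n-tuples of draws.
m = n * p
Psi_star = np.zeros((m * m, m * m))
for idx in product(range(len(prob)), repeat=n):
    z = np.concatenate([support[k] for k in idx])
    w = np.prod([prob[k] for k in idx])
    Psi_star += w * np.kron(np.outer(z, z), np.outer(z, z))

# Right-hand side of Theorem 1(ii).
Kpp = commutation(p, p)
E = [np.outer(np.eye(n)[i], np.eye(n)[i]) for i in range(n)]
K_tilde = sum(np.kron(Eii, Eii) for Eii in E)
L = np.kron(np.eye(n), np.kron(commutation(p, n), np.eye(p)))  # I_n kron K_pn kron I_p
R = np.kron(np.eye(n), np.kron(commutation(n, p), np.eye(p)))  # I_n kron K_np kron I_p

InS = np.kron(np.eye(n), Sigma)
core = Psi - (np.eye(p * p) + Kpp) @ np.kron(Sigma, Sigma) - vec(Sigma) @ vec(Sigma).T
rhs = ((np.eye(m * m) + commutation(m, m)) @ np.kron(InS, InS)
       + vec(InS) @ vec(InS).T
       + L @ np.kron(K_tilde, core) @ R)
assert np.allclose(Psi_star, rhs)
```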

Note that under normality we have

$$\Phi = \Phi_N = 0 \tag{2.25}$$

and

$$\Psi = \Psi_N = (I_{p^2} + K_{pp})(\Sigma \otimes \Sigma) + (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)'. \tag{2.26}$$

Identity (2.25) is trivial, since $z_i \sim N(0, \Sigma)$ implies $-z_i \sim N(0, \Sigma)$, which yields $E(z_i \otimes z_i z_i') = -E(z_i \otimes z_i z_i')$.
Identity (2.26) follows from [2, Theorem 4.3(iv)] by inserting $z_i$ for $x$.
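For readers who want a concrete check of (2.26): under $z \sim N(0, \Sigma)$, Isserlis' theorem gives $E(z_a z_b z_c z_d) = \sigma_{ab}\sigma_{cd} + \sigma_{ac}\sigma_{bd} + \sigma_{ad}\sigma_{bc}$, so $\Psi_N$ can be assembled entrywise and compared with the right-hand side of (2.26). A small sketch (Python/NumPy; the particular $\Sigma$ is an arbitrary choice of ours):

```python
import numpy as np

def commutation(m, n):
    """K_{mn}: the permutation matrix with K_{mn} vec A = vec(A'), A m x n."""
    K = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            K[i * n + j, j * m + i] = 1.0
    return K

p = 3
Sigma = np.array([[2.0, 0.5, 0.1],
                  [0.5, 1.0, 0.3],
                  [0.1, 0.3, 1.5]])

# Entry [(i,k),(j,l)] of zz' kron zz' is z_i z_j z_k z_l; fill Psi_N
# entrywise from Isserlis' theorem for centred Gaussian vectors.
Psi_N = np.zeros((p * p, p * p))
for i, j, k, l in np.ndindex(p, p, p, p):
    Psi_N[i * p + k, j * p + l] = (Sigma[i, j] * Sigma[k, l]
                                   + Sigma[i, k] * Sigma[j, l]
                                   + Sigma[i, l] * Sigma[j, k])

# Right-hand side of (2.26).
Kpp = commutation(p, p)
vecS = Sigma.reshape(-1, order="F")[:, None]
rhs = (np.eye(p * p) + Kpp) @ np.kron(Sigma, Sigma) + vecS @ vecS.T
assert np.allclose(Psi_N, rhs)
```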
Hence we obtain from Theorem 1:

Corollary. Under normality we have:

(i) $\Phi^* = \Phi_N^* = 0$,

(ii) $\Psi^* = \Psi_N^* = (I_{n^2p^2} + K_{np,np})(I_n \otimes \Sigma \otimes I_n \otimes \Sigma) + [\mathrm{vec}(I_n \otimes \Sigma)][\mathrm{vec}(I_n \otimes \Sigma)]'$.

Remark 1. The preceding result is in accordance with Theorem 4.3(iv) in [2] when replacing $x$ by $z$. Then $z \sim N(0, I_n \otimes \Sigma)$, $E(z \otimes z) = \mathrm{vec}(I_n \otimes \Sigma)$ and $\mathrm{Cov}(z \otimes z) = (I_{n^2p^2} + K_{np,np})(I_n \otimes \Sigma \otimes I_n \otimes \Sigma)$. On the other hand, $\mathrm{Cov}(z \otimes z) = E[(z \otimes z)(z' \otimes z')] - E(z \otimes z)E(z' \otimes z') = \Psi_N^* - [\mathrm{vec}(I_n \otimes \Sigma)][\mathrm{vec}(I_n \otimes \Sigma)]'$, which coincides with identity (ii) of the corollary after solving for $\Psi_N^*$.

Remark 2. Theorem 1 along with the corollary admits an interesting representation of $\Psi^*$, namely

$$\Psi^* = \Psi_N^* + (I_n \otimes K_{pn} \otimes I_p)[\tilde{K}_{nn} \otimes (\Psi - \Psi_N)](I_n \otimes K_{np} \otimes I_p). \tag{2.27}$$

This follows at once from Theorem 1(ii), since by (2.26) the bracketed term $\Psi - (I_{p^2} + K_{pp})(\Sigma \otimes \Sigma) - (\mathrm{vec}\,\Sigma)(\mathrm{vec}\,\Sigma)'$ equals $\Psi - \Psi_N$.

Remark 3. Frauendorf and Trenkler [1] considered the problem under which conditions the statistics

$$T_1 = a_0 + a'\bar{x} + \bar{x}'A\bar{x}, \qquad \text{where } \bar{x} = \frac{1}{n}X1_n, \tag{2.28}$$

and

$$T_2 = a_0 + a'\bar{x} + \frac{1}{n}\sum_{i=1}^{n} x_i'Ax_i \tag{2.29}$$

coincide.
Observe that both $T_1$ and $T_2$ are special cases of the statistic $T$ of (1.1). In the above-mentioned paper it was shown that $T_1$ and $T_2$ differ in general, and

$$E(T_1) = f(\mu) + \frac{1}{n}\,\mathrm{tr}(A\Sigma), \tag{2.30}$$

$$E(T_2) = f(\mu) + \mathrm{tr}(A\Sigma). \tag{2.31}$$

Thus $T_1$ and $T_2$ are biased estimators of $f(\mu)$. On the basis of the preceding derivations it is possible to decide which of the two estimators is preferable in terms of their mean squared error. These conditions will be reported in a forthcoming paper.
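A quick Monte Carlo sketch (Python/NumPy; the numerical values of $a_0$, $a$, $A$, $\mu$ and $\Sigma$ are arbitrary illustrations of ours, and normal sampling is used only for convenience) makes the biases (2.30) and (2.31) visible:

```python
import numpy as np

rng = np.random.default_rng(1)
p, n = 2, 5
mu = np.array([1.0, 2.0])
Sigma = np.array([[1.0, 0.4], [0.4, 2.0]])
a0, a = 0.5, np.array([1.0, -1.0])
A = np.array([[2.0, 0.3], [0.3, 1.0]])            # symmetric

f_mu = a0 + a @ mu + mu @ A @ mu                  # f(mu)
reps = 200_000
X = rng.multivariate_normal(mu, Sigma, size=(reps, n))   # reps x n x p
xbar = X.mean(axis=1)
T1 = a0 + xbar @ a + np.einsum('ri,ij,rj->r', xbar, A, xbar)
T2 = a0 + xbar @ a + np.einsum('rni,ij,rnj->r', X, A, X) / n

trAS = np.trace(A @ Sigma)
print(T1.mean() - f_mu, trAS / n)   # bias of T1, cf. (2.30)
print(T2.mean() - f_mu, trAS)       # bias of T2, cf. (2.31)
```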

Acknowledgement

The authors would like to thank the referee for his helpful comments.

References

[1] E. Frauendorf, G. Trenkler, On the exchangeability of transformations and the arithmetic mean,
J. Statist. Comput. Simulation 61 (1998) 305–307.
[2] J.R. Magnus, H. Neudecker, The commutation matrix: some properties and applications, Ann. Statist.
7 (1979) 381–394.
[3] H. Neudecker, T. Wansbeek, Some results on commutation matrices, with statistical applications,
Canad. J. Statist. 11 (1983) 221–231.
[4] G.A.F. Seber, Linear Regression Analysis, John Wiley, New York, 1977.
