Professional Documents
Culture Documents
Abstract
The paper concerns the probabilistic eval-
uation of plans in the presence of unmea-
sured variables, each plan consisting of sev-
eral concurrent or sequential actions. We
establish a graphical criterion for recogniz-
ing when the effects of a given plan can
be predicted from passive observations on
measured variables only. When the crite-
rion is satisfied, a closed-form expression is
provided for the probability that the plan
will achieve a specified goal.
Our problem is to evaluate an unconditional plan2, control problem shown in Figure 1. First, we will show
namely, to compute P ( y J x l , 22,. . . , xn) which repre- that P ( y ) f1, x2) cannot be identified without measur-
sents the impact of the plan (f 1 , . . . ,2,) on the out- ing Z , namely, the sequence Z1 = (01, Z2 = (0) would
come variable Y. The expression P(y 121, f 2 , . . . , i n ) not satisfy conditions (1)-(2). The two d-separation
is said to be identifiable in G if, for every assign- tests encoded in (2) are:
ment (xl, x2, . . . ,x,), the expression can be deter-
mined uniquely from the joint distribution of the ob-
servable~{X, Y, 2). A control problem is said to be
identifiable whenever P(ylP1, x2, . . . , kn) is identifi- The two subgraphs associated with these tests are
P
able. shown in Figure 2, (a) and (b), respectively. We see
Our main identifiability criteria are presented in Theo-
rems 1 and 3 below. These invoke d-separation tests on Ul Ul
various subgraphs of G, defined as follows. Let X , Y, . .?,
..' ..
, ,*?,
.,' ..
,I
and Z be arbitrary disjoint sets of nodes in a DAG G.
We use the expression (X 11 Y JZ)G to denote that X, t -..,, ,....-:.?
the set Z d-separates X f r o m y in G. We denote by ' 2 ----- --...-
1-.-.--
..--.--..vu2
Ga the graph obtained by deleting from G all arrows +- .... ........,,
pointing to nodes in X . Likewise, we denote by Gx the Z. .a. T,'
graph obtained by deleting from G all arrows emerging
from nodes in X . To represent the deletion of both
.."
xz,. XZ,..,'
incoming and outgoing arrows, we use the notation
A ".,
GFz. Finally, the expression P(y(k,z) =
P(y, zlx)/P(zJx) stands for the probability of Y = y Y Y
given that Z = z is observed and X is held constant
at x. Figure 2: The two subgraphs of G used in testing the
identifiability of (&, x2); (a) Gx x2and (b) Gx .
-1 '
-2
3 ADMISSIBLE MEASUREMENTS
that, while (Y 1) X l ) holds in Gx y2,(Y 11 X21X1)
Theorem 1 P(ylxl, . . . , itn) is identifiable if for ev- fails to hold in&-. Thus, in o z i r to passthe test,
ery 1 5 k 5 n there exists a set 21,of covariates sat-
isfying: we must have eith; Z1 = {Z) or Z2 = {Z) but, since
Z is a descendant of X I . onlv" the second alternative
c
Zk Nk, (1) A ,
21
= C P ( y l z 1 , x 1 , ~ 2 ., . , i n ) P ( ~ l ) II X I I 0)GLl , and (Y -
(Y - II X21Xl. 0 ) ~ ~ ~ ~ ~
21
n
= ~ ( Y I P ~ ~ PY () ~ ~ I ~ ~(6)~ L ~ )
k=l
.
on the l k ' s The identity of (6) and (5) follows from
the independence
P(lkIparLk) = P(lkI11,. . ., h-1, x i , . . . , ~ k - i ) (7)
X,
Upon explicating the lk's in (5), we may find that some
factors contain latent variables. When this happens we
may try to use the conditional independencies encoded
in the graph to eliminate those latent variables and,
if we succeed, the plan would be identifiable and the
resulting formula would give the desired causal effect.
Let us demonstrate this method on the of Figure 4: Illustrating non-uniqueness of minimal ad-
Figure 1. The Lk sequence 's given by L1 = {Ul} and missible sets: Z1 and z: are each minimal and admis-
L2 = {Z, U2}. Substituting in the G-formula yields. sible.
k=l
P(wklw1,. k - 1 1 ,- 1 (14)
can substitute Ai for Zi in Eq. (11). This implies that
Ai satisfies (2) and Zi is non-minimal, which is a con-
tradiction. We are now assured that Z1,. . . , Zk-1 are 6 GENERALIZATIONS
in Ak n Nk. Likewise, since {XI, . . . , Xk-l) is also in
Ak n Nk, (10) can be rewritten as 6.1 Y AND Z NON-DISJOINT
In practice, we will often be interested in a vector out-
come Y with components of Y being ancestors of con-
To prove that (10) is false, contrast (12) with the trol variables Xk for some k. For instance, in our AIDS
assumption that there exists an admissible sequence example, we may be interested in survival Y not only
Z;, . . .,Z;. Let Z* = Z; U:~(Z: U {Xi)). Admis- at a time after subjects have received treatment X2 but
sibility states that (2) is satisfied by 21,= 22, hence, also at a time after receiving treatment X1 but before
(Y 11 Xk ~ Z * ) G , .By (9), we can intersect the condi- receiving X2. If a component of Y is both an ancestor
tioning set Z* with Ak, yielding (Y - 11 XklZ* n Ak). of a control variable Xk and of a later component of
Finally, since Z* Nk, we have Y, it is necessary to regard the former component as a
confounding variable that must be adjusted for to esti-
mate the effect of the plan on Y. To do so, we no longer
impose the assumption that Y is a descendent of Xn
But (12) and (13) together contradicts (8), because (8) and that Y and Z are disjoint. Rather, we shall only
asserts that whenever we add to the conditioning set require that Y C Z where, henceforth, Z represents all
members of Ak, we preserve independencies. &ED observed non-control variables. With this redefinition
Theorem 2 now provides an effective decision proce- of Y , with the understanding that (Y - 11 XIX)G VX,
dure for testing G-identifiability: we prove below that
A key step in the proof of Theorem 5 is the following to calculate that (i) for any x;, h(y I x , f l , . . . ,ek) =
Lemma proved in [Robins & Pearl 19951. Pkx(y I X1 = 21,...,Xk - 1 = xk-1,Xk = x;,
L1 = el, . . . , Lk = t k ) , and (ii) the conditional dis-
Lemma 3 If a sequence Zk is G-admissible, then the tributions of L1, . . . , Lk given (Zl, . . . , Zk, X I , . . . ,Xk)
sequence Zk U Yk is also G-admissible. are the same under P and Pkr. Hence, the premise of
Lemma (4) is equivalent to the conditional indepen-
We shall also need this Lemma 3 in Section 6.3 below. dence under the distribution Pk, of Y and Xk given
(Z1,. . . , Z k , X 1= 21,. . .,Xk-l = xk-1).
Proof of Theorem 4: Given a plan x = (XI,. . . , x,),
define We note that the premise of Lemma (4) is a non-
graphical condition that is weaker than the graphical
premise of Theorem 4 and yet implies identifiability
by the G-formula based on 2 1 , . . ., Z n . However, as a
non-graphical condition, the premise of Lemma (4) is
much more difficult to check that the graphical premise
of Theorem 4.
To prove Theorem 4, we shall use the following Lemma 6.2 Xk+j NEED NOT BE A DESCENDENT
which is an easy consequence of the corollary to The- O F Xk
orem (AD.l) in Robins (1987).
In this subsection, we relax the assumption that Xk+j
Lemma 4 If, for each k, Zk c Nk and the expression is a descendent of Xk for all k, j > 0. As in Sec. 6.1,
Z remains the set of all observed non-control vari-
ables. Given X C V, we say X = ( X I , . . ., X n )
is consistent ordering of X in G if, for each k,
Xk is a non-descendent of {Xk+1,. . . , X,). Hence-
forth, given a consistent ordering of X , we redefine
Nk to be the set of observed non-control variables
does not depend on x; then Eq. (3) is true. that are non-descendents of any element in the set
{ X k ,Xk+1,.. . , Xn). Robins and Pearl (1995) proved
One can also prove Lemma (4) directly by using induc-
tion on n to show that the right hand side of Eq. (5) Theorem 6 Given a consistent ordering
plus the premise of the lemma imply the right hand ( X I , . . . , X,) of X with Xk not necessarily an ancestor
side of Eq. (3). of Xk+j, Theorems 4 and 5 remain true.
To complete the proof of Theorem 4, we shall show
that (i) the premise of Lemma (4) is equivalent to the Theorem 6 is an immediate corollary of Theorems 4
statement that and 5, and the following Theorem proved in Robins
and Pearl (1995) characterizing arrows that can be
added into and out of the Xk without destroying
when probabilities are computed under a particular Eqs. (1) and (2). Given a graph G , a consistent order-
joint distribution Pkxfor the variables V in G and (ii) ing (XI, . . . , X,) of X , and sets Z1, . . . , Z,, Zk C Nk,
Pkxis represented by the DAG Gk G, F ~ + ~ - let graph G* be the graph in which, for each k, all ar-
, , , , ,,~
[i.e., by definition, Pkx(v) = nj -4'
Pkx(vj I pajk) where rows
the
are included (i) from Xk both to each member of
set {Xk+1,.. . , X,) and to each variable (observed
P a j k are the parents of 1.;. on Gk and pajk is the value
of P a j k when V = v]. It then follows that Eqs. (1) and or unobserved) that is a descendent of some member
(2) imply the premise of Lemma (4), proving Theorem of {Xk+1,. . . , X,) and (ii) from each member of the
4. set Z I U . . . U Zk t o X k .
with a sustained exposure period - appli- of X , P(yli) gives the probability of Y = y induced by
cations to control of the healthy workers the action do(X = x).
survivor effect. Mathematical Modelling,
7, 1393-1512. If causal knowledge is organized as a set T of structural
[Robins 19871 Robins, J.M. (1987) Addendum to "A equations
new approach to causal inference in mor-
tality studies with a sustained exposure
period - applications to control of the where pai are the parents of Xi in G, fi are (un-
healthy workers survivor effect". Com- specified) deterministic functions and ~i are mutually
puters and Mathematics with Applica- independent disturbances [Pearl & Verma 19911, then
tions, 14, 923-945. the joint distribution of the observed variables has the
product from
[Robins 19891 Robins J.M. (1989) The control of
confounding by intermediate variables.
Statistics in Medicine,8 , 679-701.
[Robins 19931 Robins J.M. (1993) Analytic methods independent of the fi's in T. In such a process-based
for estimating HIV treatment and co- theory, the effect of the action d o ( 4 = vv(i) amounts to
factor effects. In: Methodological Issues overruling the process governed by fi and substituting
of AIDS Mental Health Research (eds. the process 6 = v: instead. Consequently, the induced
D.G. Ostrow, R. Kessler) New York: distribution in the mutilated theory T,,. would be
Plenum Publishing, 213-290. 3
Definition 2 (causal effect) Given two disjoint sets Rule 2 Action/observation exchange
of variables, X and Y , the causal effect of X on Y,
denoted P(ylx), is a function from X to the space of
probability distributions on Y. For each realization x
Plan Evaluation 453
R u l e 3 Insertion/deletion of actions
II ZIX, WIG=
P(yl2,2, w) = P(yl2, w) if (Y -
(20)
where Z(W) is the set of Z-nodes that are not
ancestors of any W-node in GX.