Professional Documents
Culture Documents
AS MARKOV CHAINS
Phillip E. Pfeifer
Robert L. Carraway
f
INTRODUCTION
The lifetime value of a customer is an important and useful
concept in interactive marketing. Courtheaux (1986) illustrates
its usefulness for a number of managerial problemsthe most
obvious if not the most important being the budgeting of marketing expenditures for customer acquisition. It can also be used
to help allocate spending across media (mail vs. telephone vs.
television), vehicles (list A vs. list B), and programs (free gift vs.
special price), as well as to inform decisions with respect to
retaining existing customers (see, e.g., Hughes, 1997). Jackson
(1996) even argues that its use helps firms to achieve a strategic
competitive advantage.
Dwyer (1989) helped to popularize the lifetime value (LTV)
concept by illustrating the calculation of LTV for both a customer
retention and a customer migration situation. Customer retention refers to situations in which customers who are not retained
are considered lost for good. In a customer retention situation,
nonresponse signals the end of the firms relationship with the
customer. In contrast, customer migration situations are those in
which nonresponse does not necessarily signal the end of the
relationship. Besides articulating this distinction between customer retention and migration, Dwyer listed several impediments
to the use of LTV.
More recently, Berger and Nasr (1998) argue for the impor 2000 John Wiley & Sons, Inc. and
Direct Marketing Educational Foundation, Inc.
f
JOURNAL OF INTERACTIVE MARKETING
VOLUME 14 / NUMBER 2 / SPRING 2000
43
44
FIGURE 1
called the Markov property. The Markov property is a necessary condition for a stochastic
system to be a Markov chain. It is a property of
the Berger and Nasr models, the Dwyer models,
and the Blattberg and Deighton model. Because all these models exhibit the Markov property with constant probabilities, they can all be
represented as Markov chains. Figure 1 is a
graphical representation of the MCM for the
firms relationship with Jane Doe.
The probabilities of moving from one state to
another in a single period are called transition
probabilities. For Jane Doe, the following 5 5
transition probability matrix summarizes the
transition probabilities shown graphically in
Figure 1. The last row of this matrix reflects the
assumption that if Jane is at recency 5 in any
future period, she will remain at recency 5 for
the next and all future periods. In the language
of Markov chains, r 5 or former customer is
an absorbing state. Once Jane enters that state
she remains in that state.
p1 1 p1
0
0
0
p2
0
1 p2
0
0
0
0
1 p3
0
P p3
p4
0
0
0
1 p4
0
0
0
0
1
45
the (i, j) element of matrix Pt (found by multiplying P by itself t times) is the probability Jane
Doe will be at recency j at the end of period t
given that she started at recency i. In essence, Pt
is a tidy way both to summarize and to calculate
the probability forecast of Jane Does recency at
any future time point t.
Now that we have a probability forecast for
ABCs future relationship with Jane Doe, we
turn to the economic evaluation of that relationship. The cash flows received by the firm in
any future period will be a function of Jane
Does recency. R, a 5 1 column vector of
rewards, summarizes these cash flows:
P tR
(1)
t0
where VT is the 5 1 column vector of expected present value over T periods. The elements of VT correspond to the five possible
initial states of the Jane Doe relationship. The
top element of VT corresponding to r 1 (a
Jane Doe relationship that starts with a purchase at t 0) will be of particular interest. This
is the expected present value of the cash flows
from the firms proposed relationship with Jane
Doe. It is, in other words, the expected LTV for
Jane Doe. Notice that this value accounts for NC
from the initial purchase at t 0 as well as a
trailing expenditure of M at time T if Jane is an
active customer at T.
Rather than selecting some finite horizon T
over which to evaluate its relationship with Jane
Doe, the firm might decide to select an infinite
horizon. (Because expected cash flows from a
customer relationship usually become quite
small at large values of t, the numerical differences between selecting a long but finite horizon and an infinite horizon are usually negligible. The infinite horizon approach has the
advantage of being computationally simpler.)
For an infinite horizon, it can be shown that
NC M
M
M
R
M
0
1 d
T
VT
V lim V T
T3
I 1 d 1P 1R
(2)
$36
$4
R $4
$4
$0
46
0.3
0.2
P 0.15
0.05
0
0.7
0
0
0
0
0
0.8
0
0
0
0
0
0.85
0
0
0
0
0
0.95
1
0.2300
0.1800
P 2 0.0875
0.0150
0
0.2100
0.1400
0.1050
0.0350
0
0.5600
0
0
0
0
0
0.6800
0
0
0
0
0
0.8075
0.9500
1
0.1950
0.1160
P 3 0.0473
0.0115
0
0.1610
0.1260
0.0613
0.0105
0
0.1680
0.1120
0.0840
0.0280
0
0.4760
0
0
0
0
0
0.6460
0.8075
0.9500
1
0.1397
0.0768
P 4 0.0390
0.0098
0
0.1365
0.0812
0.0331
0.0081
0
0.1288
0.1008
0.0490
0.0084
0
0.1428
0.0952
0.0714
0.0238
0
0.4522
0.6460
0.8075
0.9500
1
$52.320
$5.554
V $1.251
$1.820
$0
The differences in the results reflect the expected present value of cash flows after t 4.
Using an infinite horizon, the expected LTV for
Jane Doe increases to $52.32.
While all the expected present values have
improved under the longer horizon, the expected present value is still negative at recency
4. Given the economic and probabilistic assumptions of the model, the firm would do
better if it curtailed its relationship with Jane
Doe at recency 4 rather than recency 5.
It is a simple matter to modify the Markov
decision model to evaluate this proposed change
in policy. One way would be to reformulate the
model using four rather than five states. A quicker
short cut is to modify the existing five-state model
$50.115
$4.220
4
V $0.592
$1.980
$0
47
$53.149
$6.621
V $2.644
$0
$0
State
11
12
13
14
2
3
4
5
p 11
p 12
p 13
p 14
0
0
0
0
0
0
0
0
p2
0
0
0
0
0
0
0
0
p3
0
0
0
0
0
0
0
0
p4
0
P
1 p 11
0
0
0
1 p 12
0
0
0
1 p 13
0
0
0
1 p 14
0
0
0
0
1 p2
0
0
0
0
1 p3
0
0
0
0
1 p4
0
0
0
1
48
R
NC 11 M 11
NC 12 M 12
NC 13 M 13
NC 14 M 14
M 2
M 3
M 4
0
P
p 1p
0
1
R
NC M
0
V Prospect A 1 d 1p aV Customer
V Customer NC M/1 p/1 d
V Former Customer 0
This example illustrates that while it is possible
to extend the MCM to include prospecting,
there is usually little reason to do so. The transition from prospect to customer is usually simple enough that it can be treated algebraically,
without recourse to the MCM.
Next, consider a customer retention situation
where purchase amounts, repurchase probabilities, and remarketing expenditures all depend
on frequencythe number of times the customer has purchased from the firm. For the
purposes of this illustration, suppose these variables change for frequencies 1, 2, and 3, but
remain constant for frequencies of 4 or more.
The MCM of this situation requires five states:
frequencies 1, 2, 3, and 4 together with former
customer. The frequency 4 state might be better labeled frequency 4 or greater. The transition matrix and reward vector are as follows:
P
0 p1
0 0
0 0
0 0
0 0
R
A
NC M
0
P
0 pa 1 pa
0 p 1p
0 0
1
0
p2
0
0
0
49
0
0
p3
p4
0
1 p1
1 p2
1 p3
1 p4
1
R
NC 1 M 1
NC 2 M 2
NC 3 M 3
NC 4 M 4
0
FIGURE 2
50
TABLE 1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
0.103
0.076
0.059
0.045
0.038
0.035
0.030
0.027
0.025
0.021
0.021
0.020
0.017
0.017
0.016
0.015
0.014
0.013
0.013
0.012
0.012
0.011
0.011
0.010
0.121
0.090
0.069
0.053
0.045
0.041
0.035
0.032
0.029
0.025
0.024
0.024
0.020
0.020
0.019
0.018
0.017
0.016
0.015
0.014
0.014
0.013
0.012
0.012
0.143
0.106
0.081
0.062
0.053
0.049
0.041
0.038
0.035
0.030
0.028
0.028
0.024
0.024
0.022
0.021
0.020
0.018
0.018
0.017
0.016
0.015
0.015
0.014
0.151
0.112
0.086
0.066
0.056
0.051
0.043
0.040
0.037
0.031
0.030
0.030
0.025
0.025
0.024
0.022
0.021
0.019
0.019
0.018
0.017
0.016
0.015
0.015
0.163
0.121
0.093
0.071
0.061
0.056
0.047
0.043
0.040
0.034
0.033
0.032
0.027
0.027
0.026
0.024
0.022
0.021
0.020
0.019
0.018
0.017
0.017
0.016
51
R r,f
NC M
M
0
r1
r 2, 3, . . . , 24
r 25
where M is equal to the present value of remarketing expenditures or $1/(1.03)1/2. Notice the
very simple structure of these cash flows. As
mentioned earlier, useful extensions to this
model would be to consider net contributions
that vary with frequency and marketing expenditures that vary with both recency and frequency. These extensions are easy to implement
because they involve changes to the reward vector only.
We are now in a position to evaluate the
firms current policy of curtailing the customer
relationship after recency 24. Using equation
(2) and the inversion of a 121 121 matrix
allows us to calculate the 121 1 column vector
V. The first element of V is calculated to be
$89.264. The expected present value of the catalog firms relationship with the customer is
$89.264. Included in this number is the $60 net
contribution from the initial purchaseso that
$29.264 of the present value of the relationship
comes from cash flows after the initial purchase.
Examining the calculated expected values for
the remaining possible states of the firms relationship tells us that the policy of curtailing the
relationship after recency 24 is a fairly good
one. Only one state, state (24, 1) has a negative
JOURNAL OF INTERACTIVE MARKETING
52
TABLE 2
Calculated V for Catalog-Firm Policy (24, 24, 24, 24, 24) NC $60, M $2, and d 0.03
Frequency
Recency
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
$ 69.470
$ 4.069
$ 0.184
$ (2.563)
$ (4.371)
$ (5.715)
$ (6.861)
$ (7.597)
$ (8.153)
$ (8.553)
$ (8.651)
$ (8.682)
$ (8.679)
$ (8.426)
$ (8.142)
$ (7.765)
$ (7.257)
$ (6.646)
$ (5.940)
$ (5.168)
$ (4.295)
$ (3.343)
$ (2.310)
$ (1.194)
$0
$ 79.194
$ 12.697
$ 7.903
$ 4.415
$ 2.027
$ 0.172
$ (1.473)
$ (2.634)
$ (3.598)
$ (4.384)
$ (4.808)
$ (5.169)
$ (5.512)
$ (5.547)
$ (5.565)
$ (5.480)
$ (5.247)
$ (4.909)
$ (4.460)
$ (3.953)
$ (3.331)
$ (2.625)
$ (1.833)
$ (0.953)
$ 88.052
$ 20.710
$ 15.169
$ 11.050
$ 8.155
$ 5.843
$ 3.750
$ 2.204
$ 0.868
$ (0.282)
$ (1.016)
$ (1.689)
$ (2.362)
$ (2.685)
$ (2.996)
$ (3.197)
$ (3.243)
$ (3.174)
$ (2.984)
$ (2.737)
$ (2.362)
$ (1.902)
$ (1.354)
$ (0.715)
$ 92.381
$ 24.632
$ 18.737
$ 14.320
$ 11.185
$ 8.660
$ 6.351
$ 4.613
$ 3.090
$ 1.771
$ 0.882
$ 0.056
$ (0.772)
$ (1.231)
$ (1.685)
$ (2.035)
$ (2.213)
$ (2.270)
$ (2.210)
$ (2.098)
$ (1.850)
$ (1.521)
$ (1.097)
$ (0.585)
$ 95.821
$ 27.837
$ 21.704
$ 17.069
$ 13.751
$ 11.057
$ 8.576
$ 6.692
$ 5.027
$ 3.554
$ 2.538
$ 1.581
$ 0.611
$ 0.035
$ (0.546)
$ (1.021)
$ (1.315)
$ (1.494)
$ (1.544)
$ (1.547)
$ (1.411)
$ (1.189)
$ (0.878)
$ (0.473)
53
TABLE 3
Calculated V for Catalog-Firm Policy (3, 6, 9, 12, 14) NC $60, M $2, and d 0.03
Frequency
Recency
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
$ 71.487
$ 6.281
$ 2.578
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$ 80.085
$ 13.702
$ 9.011
$ 5.620
$ 3.321
$ 1.554
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$ 88.337
$ 20.987
$ 15.440
$ 11.318
$ 8.424
$ 6.113
$ 4.021
$ 2.478
$ 1.146
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$ 92.781
$ 25.063
$ 19.197
$ 14.809
$ 11.702
$ 9.207
$ 6.927
$ 5.219
$ 3.728
$ 2.441
$ 1.584
$ 0.792
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
$ 96.131
$ 28.159
$ 22.038
$ 17.417
$ 14.113
$ 11.433
$ 8.969
$ 7.101
$ 5.454
$ 3.999
$ 3.000
$ 2.063
$ 1.114
$ 0.559
$0
$0
$0
$0
$0
$0
$0
$0
$0
$0
You will notice that we have evaluated the proposed change in policy using the values calculated for the current policy. While this evaluation will not hold if we make more than one
change to the current policy, it is as prescribed
by the policy improvement algorithm. After
performing similar calculations for all states
not contacted under the current policy, we
find several that project a profitable contact.
The improved policy that results is (8, 12, 15,
16, 17).
The expected customer lifetime value for the
(8, 12, 15, 16, 17) policy is calculated to be
$74.519. All contacted states have positive ex-
54
REFERENCES
This paper introduced a general class of mathematical models, Markov Chain Models, which
are appropriate for modeling customer relationships and calculating LTVs. A major advantage of this class of models is its flexibility. This
flexibility was demonstrated by showing how the
MCM could handle the wide variety of the customer relationship situations previously modeled in the literature. The MCM is particularly
useful in modeling complicated customer-relationship situations for which algebraic solutions
will not be possible.
A second advantage of the MCM is that it is a
probabilistic model. It incorporates the language of probability and expected valuelanguage that will help marketers talk about relationships with individual customers.
The MCM is also supported by a well-developed theory about how these models can be
used for decision making. The use of this theory
was demonstrated by a comprehensive numerical example in which a catalog company adjusted its contact policy in response to increased
mailing costs.
55