Professional Documents
Culture Documents
delivered by
Dr Bob Crossman
Workbook
The birth of probability theory can be traced back to 1654 and rather fittingly
owes its inspiration to games of chance. The story goes that Antoine
Gombard, the Chevalier de Méré, variously described as a philosopher, a
gambler and a man of penetrating mind, approached mathematician Blaise
Pascal to solve some problems arising in gambling games.
• betting even money that one could throw at least one six in four throws
of a die ; with
• betting even money that one could throw at least one double six in 24
throws of two dice
• Geometry
• Mechanics
• Weather
• Economics
• etc
inductive
procedures
experiment mathematical
or modify methods and
observation computation
deductive
procedures
data prediction
First we define a
Random Experiment (RE)
This is an experiment where repeated trials of the experiment may be
performed under the same set of conditions such that:
1. The result of each trial of the experiment results in exactly one of a
number of identifiable outcomes. After a trial is performed, It must be
possible to say, which of the possible outcomes has occurred;
2. It is not possible to predict in advance the outcome of an individual
trial; and
3. The result of a single trial is independent of the result of previous
trials.
The next essential step in the model is the process of assigning a number to
each of the possible outcomes in the sample space of a random experiment.
This number could be denoted by
Pr(ei ) = pi
1. 0 ≤ pi ≤ 1 and
n
2. ∑p
i =1
i = p1 + p2 + p3 ........ + pn = 1
In other words;
Notes:
The choice of 1 for the sum of the probabilities is purely one of convenience in
arithmetic. Any other choice would amount to a matter of scaling or
normalising.
n
The symbol ∑ p is just a shorthand notation, which represents the sum of the
i =1
i
n
n numbers; ∑p
i =1
i = p1 + p2 + p3 ........ + pn .
Any subset of the set of possible outcomes can be used to define an event
associated with the outcome of a trial of a random experiment.
The event is said to occur if the outcome of the trial of the random experiment
is one of the outcomes in the subset that defines the event.
The probability of an event is the sum of the probabilities of the outcomes that
comprise the event.
Clearly, if the outcomes are equally likely, then the probability of an event E
is just given by
or equivalently
RE#3
RE#4
RE#6
RE#7
RE#8
n
Pr = (n − 0) × (n − 1) × (n − 2) × .......... × (n − (r − 1))
r terms
7
P3 = 7 × 6 × 5 = 420
8
P2 = 8 × 7 = 56
n
A convenient expression for Pr is given by
n n!
Pr =
( n − r )!
n ! = n × ( n − 1) × ( n − 2) × ....... × 3 × 2 ×1
product of n integers
n!
n
n Pr (n − r )! n!
Cr = = =
r! r! (n − r )!× r !
52 52! 52 × 51× 50 × 49 × 48
C5 = = = 2,598,960
( 52 − 5 )!5! 5 × 4 × 3 × 2 ×1
Events can be used in various ways to form other events. The probabilities of
the resulting events follow some simple rules.
Pr ( E ) + Pr ( E ) = 1 .
Pr ( A or B occurs) = Pr ( A ∪ B ) = Pr ( A) + Pr ( B )
Here we have used the symbol A ∪ B , which comes from the algebra of sets,
to represent the event (A or B or possibly both A and B occurs). For mutually
exclusive events, it is not possible for “both A and B to occur”
If events A and B have some outcomes in common, then they are not
mutually exclusive and both can occur simultaneously on the same trial. The
corresponding expression for the probability is
Pr ( A or B occurs) = Pr ( A ∪ B ) = Pr ( A) + Pr ( B ) − Pr( A ∩ B )
Two events that are associated with different trials of a random experiment
are necessarily independent.
For two independent events A and B, the probability that they both occur is
the product of their individual probabilities.
Pr ( A ∩ B ) = Pr ( A) × Pr ( B )
Pr ( A ∩ B ) = Pr ( A) × Pr ( B | A)
An example
We will illustrate the use of these rules in several of the examples that follow
in our treatment of House Edge for particular bets and games.
For now, we can revisit Pascal’s problem as posed by the Chevalier de Mere.
Random Variables
The value will be a number. This number may be one of a discrete set of
possible numbers or it may lie anywhere within a continuous interval of
numbers.
For example:
RV1: let X be the number of heads that appear when 3 coins are tossed.
RV2: Let Y be the amount of money a punter has after betting 1 unit straight
up on the spin of a roulette wheel.
RV3: Let T be the number of times a punter wins, when he has 4 successive
bets on 1st dozen at roulette.
RV4: Let F be the number of times a punter wins, when he has 50 successive
bets on Black at roulette.
RV5: Let H be the height of the next person to walk into the room.
We will initially focus on discrete Random Variables where the answer to the
first question is one of a discrete set of possible numbers.
We might label the values x1, x2, x3, x4, ........xn denote the associated probabilities
by p1, p2, p3, p4, ........ pn and display them in tabular form.
For example with RV1 above, the answers to the two questions could be
displayed as:
X
Value xi 0 1 2 3
1 3 3 1
Probability pi 8 8 8 8
Distribution Parameters
µ = E ( X ) = p1 × x1 + p2 × x2 + p3 × x3 + ...... + pn × xn
in words, multiply the corresponding values and probabilities and add them
up. The mean as defined is a measure of the centre of the distribution.
n
Using summation notation it can be written in a shorthand form: µ = ∑ pi xi
i =1
σ 2 = Var ( X ) = E (( X − µ ) 2 )
= p1 × ( x1 − µ )2 + p2 × ( x2 − µ )2 + ...... + pn × ( xn − µ ) 2
• take the distance that the value is from the centre (the mean);
• square it to treat values on either side of the mean in the same way;
• multiply by the corresponding probability; and
• add them up.
σ 2 = ( p1 x12 + p2 x2 2 + ..... + pn xn 2 ) − µ 2
n
σ 2 = ∑ pi xi 2 − µ 2
i =1
The use and interpretation of these parameters will become clearer as we use
them for various distributions of interest.
From the variance, we obtain a parameter that has the same dimensions as
the mean
σ = σ2
RV1
1 3 3 1 3
µ = × 0 + ×1 + × 2 + × 3 =
8 8 8 8 2
Create the corresponding tables and graphs for the Random Variables RV2
and RV3 above. Work out their mean and standard deviation.
RV2: Let Y be the amount of money a punter has after betting 1 unit straight
up on the spin of a roulette wheel.
RV3: Let T be the number of times a punter wins, when he has 4 successive
bets on 1st dozen at roulette.
RV4: Let F be the number of times a punter wins, when he has 50 successive
bets on Black at roulette.
0.1
0.08
probability
0.06
0.04
0.02
0
15
25
35
successes
Note that the probabilities for all values between 0 and 50 are non-zero, but
some are too small to register on the graph.
RV5: Let H be the height of the next person to walk into the room.
0.05
prob density
0.04
0.03
0.02
0.01
0
151
175
183
191
150
159
167
199
height
RV3 and RV4 above are particular examples of the discrete random variable
known as the Binomial Distribution.
Then the probability of getting exactly j successes (and n-j failures) is given by
n n− j
Pr ( B = j ) = j p j × (1 − p )
×
prob of
prob of
number of sequences j successes n-j failures
with j successes
n n− j
Pr ( B = j ) = p j (1 − p )
j
µ = np and σ = np (1 − p ) .
Notes:
The area under the curve between two points represents the probability that
the variable takes a value between those two points.
-5 -4 -3 -2 -1 0 1 2 3 4 5
standard deviations
0.05
prob density
0.04
0.03
0.02
0.01
151
175
183
191
150
159
167
199
height
Above is the Normal Variable with mean µ = 175 and standard deviation σ = 8 .
The variable above could, for example, represent the distribution of heights of
people.
Areas under any Normal curve N ( µ , σ ) can be related to areas under the
standard Normal variable N (0,1) by shifting and scaling. We need to ask the
question “How many standard deviations is the point to the right or left of
the mean?”
For example, the area under the second curve between 175 and 183 is equal
to the area under the standard Normal curve between 0 and 1, and the area
between 171 and 175 equates to the area between –0.5 and 0.
68.3%
-5 -4 -3 -2 -1 0 1 2 3 4 5
standard deviations
95.5%
-5 -4 -3 -2 -1 0 1 2 3 4 5
standard deviations
Calculating the House Edge and Standard Deviation for typical casino
bets
The events on which punters are able to wager can be easily divided into 2
categories:
Each can be described as a simple discrete Random Variable and the House
Edge and Standard Deviation readily obtained from the probabilities and
Dividends associated with the wager.
RTP, House Edge and Standard Deviation for a single Dividend wager
Let RTP be the amount of money a punter has after outlaying 1 unit on the
result of an event for which his chance of success is p and the dividend
received, if successful, is D.
Note that D includes the original stake of 1 unit which is returned to the punter
if the wager is successful.
RTP is a simple discrete Random Variable which takes one of two values
RTP
Value D 0
Probability p 1-p
µ = E ( RTP ) = p × D + (1 − p ) × 0 = pD
House Edge = 1 − µ = 1 − pD
σ 2 = p × D 2 + (1 − p) × 02 − ( pD) 2
= D 2 p (1 − p )
so σ = D p (1 − p )
RTP = µ = pD
σ = D p (1 − p )
House Edge = 1 − pD
Notes:
1. The Standard Deviation for the House Edge is exactly the same as that
for the RTP but it is simpler algebraically to work it out for the RTP
2. The Dividend D = Odds + 1 where the wager is described as paying odds
of Odds to 1
3. It is usually algebraically simpler to work with the Dividend rather than
the Odds
4. For a fair bet the House Edge =0, and the expected value of RTP is 1
1
5. This occurs if D =
p
6. Another advantage of working with Dividends rather than Odds is that
the Dividend for a successful parlay of two bets with individual
Dividends D1 and D2 is simply the product D = D1 D2 .
RTP
Value D1 D2 ..... Dn 0
Probability p1 p2 ..... pn 1 − p1 − p2 ... − pn
n n
The Variance is given by σ 2 = ∑ pi Di 2 − (∑ pi Di )2
i =1 i =1
n
RTP = µ = ∑ pi Di
i =1
n n
σ= ∑pDi =1
i i
2
− (∑ pi Di ) 2
i =1
n
House Edge =1 − ∑ pi Di
i =1
Work out House Edge and Standard Deviation for a bet of one unit on Straight
Up
Work out House Edge and Standard Deviation for a bet of one unit on Black
Roulette
Note that the dividends for Roulette are such that each of the commonly
36
available bets has the same expectation ( RTP = ) and hence the same
37
1 ) The important way in which the bets differ
house edge( Edge = = 2.702%
37
is in their volatility, and the standard deviation provides a way of measuring
this, and of comparing the different bets.
• "Dragon" means the two coins coming to rest on the floor of the ring
with the Dragon side uppermost on each;
• "Phoenix" means the two coins coming to rest on the floor of the ring
with the Phoenix side uppermost on each; and
• "Odds" means the two coins laying on the floor of the ring with one coin
Dragon side uppermost and the other coin Phoenix side uppermost;
If the Odds paid are 1 to 1, what is the House Edge and Standard Deviation?
At MBS the same bet is paid at Odds of 31 to 1. What is the House Edge and
Standard Deviation there?
This is an example of a multiple Dividend Bet. Work out the House Edge and
Standard Deviation.
(Note that in the same bet at MBS, the odds for all three dice are 12 to 1
instead of 3 to 1)
• “1 wins 18”
• “1 pays 31”
What sort of pattern bets do punters commonly place on the Sic Bo layout?
13.1. Where a player's hand qualifies for a progressive jackpot payout, the
amount of the payout shall be as follows:
12.3. The amount required to make a wager on the progressive jackpot shall
be displayed on a sign at the table;
12.6. The casino operator shall determine the rate of increment to the prize
pool(s) and may elect to retain a portion of each wager on the progressive
jackpot as gaming revenue;
From the player’s perspective, what is the House Edge and Standard
Deviation for this side-bet if the price of the side-bet is $25?
For what value of the jackpot pool is the RTP equal to 100%?
Referring to clause 12.6 above; If the initial seed is $250,000 and 17% is the
rate of increment to the prize pool(s), what is the “portion of each wager on
the progressive jackpot as gaming revenue” that the casino elects to retain?
10 × ( − 4) 10,200 0.3925%
1
Triplets 13 12 4 4 4
1 × 2 × 3× 1×1 54,912 2.1128%
Two Pairs 13 11 4 4 4
2 × 1 × 2× 2× 1 123,552 4.7539%
One Pair 13 12 4 4 4 4
1 × 3 × 2×1 × 1 × 1 1,098,240 42.2569%
No Pair 13 4
5
Total 52
5 2,598,960 100%
n
Note that is the number of combinations of n things taken r at a time.
r
SLOT MACHINE #2
payout hits
0 879816
2 300
5 300
10 300
25 400
40 400
50 500
100 800
150 600
200 400
300 400
500 300
1000 200
2000 10
3000 6
5000 3
10000 1
cycle 884736
How do we work out the RTP and the standard deviation for the slot machine
whose payout table is given above?.
How do we answer questions like “What is the chance that the major prize
goes off more than twice in the next 2 million spins?
If either or both of the Player and Banker initial 2 cards have a point total of 8
or 9, no further cards are drawn
After the Player has acted, the Banker acts according to the table
Each completed coup will use the next 4, 5 or 6 cards in sequence from the
shoe. To evaluate the distribution of the final Banker and Player hands we
need to systematically consider each of the 1,000,000 different (with respect
to point value) 6 card sequences that can be drawn without replacement from
a complete shoe. We could in practice, reduce this to 250,000 sequences
because the play of the coup treats the first and third cards (player initial
hand) collectively and also the second and fourth cards (banker initial hand)
together.
For this sequence, the Player total (first and third cards) is 0+1 = 1 and the
player draws a card (the 5th card) which is the 7.
The Banker initial total (2nd and 4th cards) is 0 + 3 = 3 and after the Player
hand has drawn the 7, the Banker draws the 6th card, viz the 8.
This sequence thus results in a win for the Player hand with a final total of 8
compared to the final Banker total of 1.
128 127 32 32 32 32
× × × × × = 0.00034102% .
416 415 414 413 412 411
absolute probabilities
probability banker bet wins 45.8597%
probability player bet wins 44.6247%
probability tie bet wins 9.5156%
total 100%
The probability that a resolved hand of baccarat results in a win for a bet on
banker is
573,063,141,609,472
= 50.6825%
130,692,712,257,536
The relative probabilities of success for banker and player bets are given by
relative probabilities
probability banker bet wins 50.6825%
probability player bet wins 49.3175%
total 100%
When considering bets on “tie” bets, one uses absolute probabilities since the
bet is always resolved.
House Edge and Standard Deviation
The House Edge and Standard Deviation for these bets can be determined
from the probability of success ( p ) and the Dividend D (Odds+1) paid for a
successful bet of 1 unit.
absolute bet parameters
probability Dividend House Standard
Edge Deviation
p D µ σ
banker 45.8597% 1.95 1.0579% 0.9274
player 44.6247% 2 1.2351% 0.9512
tie 9.5156% 9 14.3596% 2.6409
The RTP depends on the strategy adopted by the player given his two starting
cards and the dealer first card. All the possible cards or sequence of cards
that the player can draw and the subsequent possible resolution of the
dealer’s hand, given the cards drawn by the player, are considered for each of
the possible strategy decisions available to the player.
The strategy decision which results in the largest return to player given his
two starting cards and the dealer’s first card is deemed to be the correct basic
strategy.
When considering the standard deviation, one has to take into account the
number of different dividends or losses that are possible given doubling,
splitting and the blackjack payout.
Blackjack
Bet Basic
Strategy
µ 0.9984
σ 1.15
House
Edge 0.16%
In practice, the House Edge will be different to this figure because all players
will not play optimal Basic Strategy
So far we have calculated the House Edge and associated standard deviation
for a single bet of one unit for a variety of bets.
The standard deviation becomes very useful when we extend our analysis to
various patterns of possible bets. The expressions below are presented
without proof and use the properties of linear combinations of independent
random variables.
Here let µ represent the expected amount won (the House Edge) by the
House from 1 bet of one unit and let σ represent the standard deviation.
We will use the variable A to represent the amount won by the casino for a
related bet pattern and denote the corresponding parameters by
E ( A) = µ A and Var(A) =σ A2
We will use the variable R to represent the percentage won by the casino for
the related bet pattern and denote the corresponding parameters by
E ( R) = µ R and Var(R) =σ R2
amount won A
R = percentage won = =
amount bet (turnover) amount bet
Bet pattern µA σA µR σR
2. In every case the expected value for A, the amount won, is given by
the total amount wagered multiplied by µ
4. Note that in cases (a) and (b), the expressions for µ R and σ R are the
same. This can also be observed by comparing µ R and σ R in cases (c)
and (d). This illustrates the point made above that the spread of the
expected percentage win about the theoretical house advantage
depends on the number of bets and not on the size of the bets
(provided all bets are the same size) The standard deviation for the
1
percentage win is proportional to
N
The above table gives the expected value and the standard deviation for the
amount won, A and the percentage won, R for a given pattern of bets.
It does not tell us anything about the shape of the distributions of A and R