Professional Documents
Culture Documents
Chapter 10
Sampling distributions
Introduction
In real life, calculating the parameters
of populations is prohibitive because
populations are very large.
Rather than investigating the whole
population, we take a sample, calculate
a statistic related to the parameter of
interest and make an inference.
The sampling distribution of the statistic
is the tool that tells us how close the
statistic is to the parameter.
3
6/02/2014
Sampling Distributions
A sampling distribution is created by, as the
name suggests, sampling.
The method we will employ to derive the
sampling distribution uses the rules of
probability and the laws of expected value
and variance.
For example, consider the roll of one and two
dice
4
1/6
1/6
1/6
1/6
1/6
1/6
6/02/2014
is shown below:
P( )
Compare
Compare the distribution of X
Generalise
We can generalise the mean and variance of the
sampling of two dice:
to n-dice:
The standard deviation of
the sampling distribution of
the sample mean is called
the standard error:
9
6/02/2014
Notice that
is smaller
than x. The larger the sample 1
size the smaller . Therefore,
tends to fall closer to, as the
sample size increases.
10
Population
1.5
2.5
3
222
1.5
2.5
1.5
2
2.5
1.5
2
2.5
2.5
Compare1.5
the variability
of the
2
1.5
2.5 population
1.5
2.5 mean.
Let us take samples to the variability
of 22the sample
1.5
2.5
1.5
2.5
of two observations.
2
1.5
2.5
1.5
2
2.5
1.5
2
2.5
1.5
2
2.5
1
Also,
Expected value of the population = (1 + 2 + 3)/3 = 2
Expected value of the sample mean = (1.5 + 2 + 2.5)/3 = 2
11
12
6/02/2014
13
14
15
6/02/2014
Example 10.1
The weight of each 32g chocolate bar is
normally distributed with a mean of 32.2 g
and a standard deviation of 0.3 g.
Find the probability that, if a customer buys
one chocolate bar, that bar will weigh more
than 32 g.
Solution
The random variable X is the weight of a
chocolate bar.
0.7486
x = 32
= 32.2
18
6/02/2014
Solution
The random variable here is the mean
weight per chocolate bar, . We want
0.9082
0.7486
x = 32
= 32.2
19
Example
The average weekly income of graduates one
year after graduation is $600. Suppose the
distribution of weekly income has a standard
deviation of $100. What is the probability that
25 randomly-selected graduates have an
average weekly income of less than $550?
Solution
Let X be the weekly income of graduates one
year after graduation.
20
Example
The average weekly income of graduates
one year after graduation is $600. Suppose
the distribution of weekly income has a
standard deviation of $100.
1. What is the probability that 25 randomlyselected graduates have an average weekly
income of less than $550?
2. If a random sample of 25 graduates actually
had an average weekly income of $550, what
would you conclude about the validity of the
claim that the average weekly income is
$600?
21
6/02/2014
Solution
Let X be the weekly income of graduates one
year after graduation.
1.
- Z.025
Z.025
23
0.025
0.025
1.96
Normal distribution of
1.96
0.025
0.025
24
6/02/2014
Conclusion
There is a 95% chance that the sample
mean falls within the interval [560.8,
639.2] if the population mean is 600.
Since the sample mean was 550, the
population mean is probably not 600.
25
In general
26
6/02/2014
n=2
Mean = 3.495
Stand. dev. = 0.749
n=5
n = 10
Mean = 3.494
Stand. dev. = 0.544
28
Excel
Creating a
simulated
distribution
of the mean
Create samples
of size two.
Create a histogram
for the distribution
of the mean.
29
10
6/02/2014
31
Hence:
and
32
33
11
6/02/2014
Example
Approximate the binomial probability
P(X = 10) when n = 20 and p = 0.5.
The parameters of the normal
distribution used to approximate the
binomial are:
= np; = np(1 p)
36
12
6/02/2014
9.5
10
10.5
8.5
13.5
14
38
13
6/02/2014
Example
The Laurier companys brand has a market
share of 30%. In a survey, 1 000
consumers were asked which brand they
prefer. What is the probability that more
than 32% of the respondents say they
prefer the Laurier brand?
40
Solution
The number of respondents who prefer Laurier
is binomial with n = 1000 and p = 0.30.
Also, np = 1000(0.3) = 300 > 5
n(1 p) = 1000(1 0.3) = 700 > 5.
Therefore,
is normal with mean p = 0.30
and standard error
Hence
41
42
14
6/02/2014
The distribution of
is normal
with mean 1 2 and standard
deviation of
if
the two samples are independent
the original populations are normally
distributed.
43
44
Example
The mean salaries of MBA graduates from
two universities (1 and 2), after 5 years,
are $62 000 (stand. dev. = $14 500) and
$60 000 (stand. dev. = $18 300).What is
the probability that a sample mean of
University-1 graduates will exceed the
sample mean of University-2 graduates?
(n1 = 50; n2 = 60)
Solution
As the sample sizes are more than 30, the
distribution of
is approximately
normal.
45
15
6/02/2014
47
48
16
6/02/2014
49
Individual
50
-----
Statistic
51
17
6/02/2014
Statistic
------
Parameter
53
18