Professional Documents
Culture Documents
\
|
=
2
2
2
) (
2
1
exp
2
1
) (
o
to
x x f
) 96 . 1 , 96 . 1 ( o o +
o
=
X
Y
Effects of Varying Parameters
Infinite Number of Normal
Distribution Tables
Normal distributions differ by mean and
standard deviations
Each distribution would require its own table
Thats an infinite number of tables!
Standard Normal Distribution
If X is a Normal RV with mean and variance
2
has a normal distribution with mean 0
and variance 1, or standard normal distribution.
o
=
X
Z
Example
Values of the std normal CDF, ,
are tabulated in Appendix Table 1
To compute probabilities for a normal RV, it
must be standardized by subtracting its mean
and dividing by standard deviation
Example: Suppose Y ~ N(2,16), and we need
P(Y<0)
) ( ) ( z Z P z s = u
) 1 , 0 ( ~
4
2
N
Y
Z
=
3085 . 0 )
2
1
( )
4
2 0
( ) 0 ( = s =
s = s Z P Z P Y P
26
Moments: Skewness, Kurtosis
skewness =
( )
3
3
Y
Y
E Y
o
(
= measures asymmetry in a
distribution
- The larger the skewness (by absolute value), the more
asymmetric is distribution
- skewness = 0: distribution is symmetric
- skewness > (<) 0: distribution has long right (left) tail
kurtosis =
( )
4
4
Y
Y
E Y
o
(
- measure the thickness of tails
- kurtosis = 3: normal distribution
- kurtosis > 3: heavy tails (leptokurtotic), i.e. extreme events are
more likely to occur
27
Central Limit Theorem
Important Continuous Distributions
All derived from normal
distribution
2
distribution: arises from
sum of squared normal
random variables
t distribution: arises from
ratios of normal and
2
variables
F distribution: arises from
ratios of
2
variables.
Hypothesis Testing
Identifying Hypotheses
1. Formulate the question, e.g. test that the
population mean is equal to 3
2. State the question statistically (H
0
: = 3)
3. State its alternative statistically (H
1
: 3)
4. Choose level of significance
Typical values are 0.01, 0.05, 0.10
Rejection region of sampling distribution: the
unlikely values of sample statistic if null
hypothesis is true
Identifying Hypotheses: Examples
1. Is the population average amount of TV
viewing 12 hours?
H
0
: = 12
H
1
: 12
Choosing the Level of Significance:
Type I and Type II Errors
Type I Error: Reject a true null hypothesis.
Type II Error: Do not reject a false null.
We would like probabilities of both errors to be
small. BUT, we cannot make both very small at
the same time.
In statistics, we fix the probability of Type I error
at a significance level (e.g. 5%) and minimize the
probability of Type II error.
Hypothesis Testing: Basic Idea
Method 1: Compare Test Statistic to
Critical Value from the Table
1. Convert Sample Statistic (e.g., ) to standardized
Z variable
2. Compare to Critical Value from the table
If Z-test statistic falls in the rejection region,
reject H0;
Otherwise do not reject H0
Two-Sided Test: Rejection Regions
One-Sided Test: Rejection Region
Method 2: Compute the P-value
Probability of obtaining a test statistic more
extreme ( or ) than actual sample value given
H
0
is true.
The lowest significance level at which we reject H
0
Compute p-value for the test
Use this p-value to make rejection decision:
If p value , do not reject H
0
If p value < , reject H
0
P-values for 2-sided and 1-sided tests
Two-sided test: H
0
: =
0
H
1
:
0
One-sided test:
a. H
0
: >
0
b. H
0
: <
0
H
1
: <
0
H
1
: >
0
|
|
.
|
\
|
u =
|
|
.
|
\
|
> =
Y Y
Y Y
Z P value p
o
o
0 0
2 | |
|
|
.
|
\
|
u =
Y
Y
value p
o
0
1
|
|
.
|
\
|
u =
Y
Y
value p
o
0
P-value for a 2-Sided Test
Method 3: Confidence Intervals
Confidence interval: set of values that contains
the true population mean with a pre-specified
probability, say 95%.
This pre-specified probability is called confidence
level.
A 95% confidence interval for
Y
contains the true
value of
Y
in 95% of repeated samples.
The 90% CI is:
The 95% CI is:
The 99% CI is:
41
) ( 96 . 1 Y SE Y
) ( 58 . 2 Y SE Y
) ( 645 . 1 Y SE Y
Jarque-Bera Test for Normality
Assesses whether a given sample of data is
normally distributed
Aggregates information in the data about both
skewness and kurtosis
Test of the hypothesis that S = 0 and K = 3
The 5% critical value is 5.99; if JB > 5.99, reject
the null of normality.