Professional Documents
Culture Documents
C. Measures of dispersion:
Range
General characteristics:
a. simple to obtain (+)
b. influenced by extreme scores (-)
c. only a function of two of the scores (-)
d. unstable from sample to sample (-)
e. can be used with any type of measurement scale
C. Measures of dispersion:
H spread
2. H spread
H = Q3 Q1
H = 18.0833 13.1250 = 4.9583
H spread relies on the difference between the third and first quartiles, Q3 Q1.
H is short for hinge, developed by Tukey.
This is also known as the interquartile range.
H measures the range of the middle 50% of the distribution.
The larger the value, the greater is the spread in the middle of the distribution.
C. Measures of dispersion:
H spread
General characteristics:
a. unaffected by extreme scores (+)
b. not a function of all of the scores (-)
c. not very stable from sample to sample (-)
d. can be used with any type of measurement scale except nominal
C. Measures of dispersion:
Deviation scores
3. Deviation scores
The difference between a particular raw score and the mean of the collection of scores (i.e.,
population or sample).
Summing the deviation scores will always equal 0.
Thus, any measure involving simple deviation scores will be useless in that the sum of the
deviation scores will always be zero, regardless of the spread of scores.
Because it sums to zero, it is rarely used in statistics.
C. Measures of dispersion:
Deviation scores
Notation
Deviation
di = Xi m
Sum of deviation scores
C. Measures of dispersion:
Deviation scores
Table 3.3
Credit card data
C. Measures of dispersion:
Population variance
4. Population variance
measure of the area of a distribution
Definitional formula:
C. Measures of dispersion:
Population variance
It is conceptually how you define variance.
Conceptually, the variance is a measure of the area of a distribution.
That is, the more spread out the scores, the more area or space the distribution takes up and the
larger is the variance.
C. Measures of dispersion:
Population standard deviation
5. Population standard deviation
C. Measures of dispersion
General characteristics of variance and standard deviation:
a. are a function of every score (+)
b. affected by extreme scores (-)
c. quite useful for deriving other statistics (+)
d. can be used with interval and ratio measurement scales
Comparison of central tendency measures with dispersion measures
a. mode and range share certain characteristics
b. median shares certain characteristics with H spread
c. mean shares many characteristics with the variance and standard deviation
Why cant we just convert everything to sample statistics?
The sample mean may not be the same as the population mean. In most samples, the sample
mean will be somewhat different than the population mean.
You cant use the population mean anyways because it is unknown. Because they are different,
the deviations will be affected. The sample variance that would be obtained would be a biased
estimate of the population variance. The sample variance obtained would be systematically too
small.
In order to get an unbiased sample estimate of the population variance, a slight change in the
computational formula has to be made.
C. Measures of dispersion:
Sample variance
6. Sample variance
Definitional formula:
If n is used in the denominator, then the estimate of the population variance would
be biased (would be too small).
So we have to make an adjustment in the denominator to obtain an unbiased
estimate of the population variance.
C. Measures of dispersion:
Sample variance
7. Sample standard deviation
Summary of measures of dispersion
1. The range is the only appropriate measure for ordinal data. (Likert?)
2. The H spread, variance, and standard deviation can be used with interval or ratio
measurement scales.