SUMMARIZING DATA
Descriptive Analysis
[Figure: dot plots on a number line running from 0 to 14, comparing the mean and the median. Labels: Mean = 5, Median = 5; Mean = 6, Median = 5; Median = 6.]
• If the number of observations (n) is odd, the median is the middle value of
the ordered data, that is, the [(n+1)/2]th observation.
• If n is even, the median is usually calculated as the average of the two
middle values, that is, the average of the (n/2)th observation and
the [(n/2) + 1]th observation.
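The two rules above can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides; the function name `median` and the sample values are assumptions chosen for the example.

```python
def median(values):
    """Median via the odd/even position rules for ordered data."""
    ordered = sorted(values)          # the rules apply to ordered data
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the [(n+1)/2]th observation (1-based) is index n//2 (0-based).
        return ordered[mid]
    # Even n: average of the (n/2)th and [(n/2)+1]th observations (1-based).
    return (ordered[mid - 1] + ordered[mid]) / 2


# Illustrative values only (not taken from the slides).
print(median([3, 1, 7, 5, 9]))  # odd n = 5: middle of 1 3 5 7 9
print(median([3, 1, 7, 5]))     # even n = 4: average of 3 and 5
```

Sorting first matters: the position rules refer to the ordered array, not to the order in which the values were recorded.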
© 2002 Prentice-Hall, Inc. Chap 3-8
[Figure: dot plots on a number line running from 0 to 14, illustrating the mode. Labels: No Mode; Mode = 9; Mode = 5, 12 (two modes).]
Introduction to Biostatistics Using SPSS
Measures of Variability
• Measures of central tendency alone are not sufficient to describe a
data set.
• We also need to determine how dispersed (spread out) the data
are.
• Measures of variability (spread) include:
– Range
– Percentile / Quartile
– Deviation / Standard Deviation (Malay: sisihan piawai)
– Variance
– Coefficient of variation
[Figure: two dot plots, each on a number line running from 7 to 12. For both data sets, Range = 12 - 7 = 5.]
Quartiles
• Split Ordered Data into 4 Quarters
[Diagram: ordered data divided into four parts of 25% each by Q1, Q2 and Q3]
• Position of i-th Quartile: position of $Q_i = \frac{i(n+1)}{4}$
Data in Ordered Array: 11 12 13 16 16 17 18 21 22
Position of $Q_1 = \frac{1(9+1)}{4} = 2.5$, so $Q_1 = \frac{12+13}{2} = 12.5$
• $Q_1$ and $Q_3$ are Measures of Noncentral Location
• $Q_2$ = Median, A Measure of Central Tendency
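The position rule i(n+1)/4 can be checked against the ordered array given on this slide. The sketch below is an illustration rather than the slides' own procedure: the function name `quartile` is hypothetical, and the use of linear interpolation for fractional positions is an assumption (at a position of 2.5 it coincides with averaging the 2nd and 3rd values, as in the worked example).

```python
def quartile(values, i):
    """i-th quartile (i = 1, 2, 3) using the position i(n+1)/4."""
    if i not in (1, 2, 3):
        raise ValueError("i must be 1, 2 or 3")
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("quartiles of an empty data set are undefined")
    pos = i * (n + 1) / 4        # 1-based position in the ordered array
    lower = int(pos)             # whole part of the position
    frac = pos - lower           # fractional part of the position
    # Guard the ends for very small n (assumption: clamp to the extremes).
    if lower < 1:
        return float(ordered[0])
    if lower >= n:
        return float(ordered[-1])
    # Assumed rule: interpolate linearly between the two neighbouring values.
    return ordered[lower - 1] + frac * (ordered[lower] - ordered[lower - 1])


data = [11, 12, 13, 16, 16, 17, 18, 21, 22]   # ordered array from the slide
print(quartile(data, 1))   # position 2.5 -> halfway between 12 and 13
print(quartile(data, 2))   # position 5   -> the median
print(quartile(data, 3))   # position 7.5 -> halfway between 18 and 21
```

Statistical packages implement several different quartile conventions, so results from software may differ slightly from this rule for some data sets.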
Skewness
• The relationship between the mode, median, mean and trimmed
mean reflects the skewness of the data.
• Skewness measures whether the data are distributed symmetrically
or with a longer tail on one side.
• Zero Skewness
– symmetrical ( Mode = Median = Mean)
• Positive Skewness
– skewed to the right ( Mode < Median < Mean )
• Negative Skewness
– skewed to the left ( Mode > Median > Mean )
[Figure: three distribution shapes with the quartiles Q1, Q2 and Q3 marked on each. Panel labels: Mode < Median < Mean (positively skewed); Mean < Median < Mode (negatively skewed); Symmetrical, Mode = Median = Mean.]
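The ordering of mean and median described for each kind of skewness can be checked numerically. The following is a rough sketch only: the function name `skew_direction` and the sample data are assumptions made for illustration, and comparing mean with median is a quick indicator rather than a formal skewness statistic (equal mean and median do not by themselves prove symmetry).

```python
from statistics import mean, median


def skew_direction(values):
    """Classify skewness by comparing the mean with the median."""
    m = mean(values)
    med = median(values)
    if m > med:
        return "positive (skewed to the right)"
    if m < med:
        return "negative (skewed to the left)"
    return "zero (mean equals median)"


# Illustrative data sets (not from the slides).
right_tail = [1, 2, 2, 3, 3, 3, 4, 10]   # one large value pulls the mean up
left_tail = [1, 7, 8, 8, 9, 9, 9]        # one small value pulls the mean down
balanced = [1, 2, 3, 4, 5]

for name, data in [("right_tail", right_tail),
                   ("left_tail", left_tail),
                   ("balanced", balanced)]:
    print(name, "->", skew_direction(data))
```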
Variance and Standard Deviation
• The variance of a set of n measurements $x_1, x_2, \dots, x_n$ with
mean $\bar{x}$ is the sum of the squared deviations from the mean divided by n – 1:
$$s^2 = \frac{\sum (x - \bar{x})^2}{n - 1} = \frac{\sum x^2 - \frac{(\sum x)^2}{n}}{n - 1}$$
• The standard deviation of a set of measurements is
defined to be the positive square root of the variance:
$$s = \sqrt{s^2}$$
• Both measure how far the data are spread out around the mean.
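As a rough check of these definitions, the sketch below computes the sample variance in two ways (from squared deviations, and from the equivalent shortcut using the sum of squares) and then takes the square root. The function names are hypothetical, and the data values are an assumption: they were chosen so that the mean is 15.5 and s is about 3.338, matching the summary shown for Data A, because the actual plotted values are not recoverable from the text.

```python
import math


def sample_variance(xs):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two observations")
    x_bar = sum(xs) / n
    return sum((x - x_bar) ** 2 for x in xs) / (n - 1)


def sample_variance_shortcut(xs):
    """Equivalent form: (sum of x^2 - (sum of x)^2 / n) / (n - 1)."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two observations")
    return (sum(x * x for x in xs) - sum(xs) ** 2 / n) / (n - 1)


def sample_std(xs):
    """Positive square root of the sample variance."""
    return math.sqrt(sample_variance(xs))


# Assumed values (not given in the text) with mean 15.5.
data = [11, 12, 13, 16, 16, 17, 18, 21]
print(sum(data) / len(data))                 # mean
print(sample_variance(data))                 # definition form
print(sample_variance_shortcut(data))        # shortcut form, same value
print(round(sample_std(data), 3))            # standard deviation
```

The shortcut form is convenient for hand calculation, but the deviation form is usually the safer choice in floating-point code because it avoids subtracting two large, nearly equal numbers.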
[Figure: three dot plots on a number line running from 11 to 21, all with the same mean but different spread.]
Data A: Mean = 15.5, s = 3.338
Data B: Mean = 15.5, s = .9258
Data C: Mean = 15.5, s = 4.57
– Stock B: $\bar{X} = \$100$, $S = \$5$
$CV = \left(\frac{S}{\bar{X}}\right) 100\% = \left(\frac{\$5}{\$100}\right) 100\% = 5\%$
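The coefficient of variation for Stock B can be reproduced with a one-line calculation. This is a minimal sketch; the function name `coefficient_of_variation` is an assumption, and the inputs are the mean of $100 and standard deviation of $5 given for Stock B.

```python
def coefficient_of_variation(std_dev, mean):
    """CV = (S / mean) * 100%, the standard deviation relative to the mean."""
    if mean == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return 100 * std_dev / mean


# Stock B from the slide: mean price $100, standard deviation $5.
print(coefficient_of_variation(5, 100))  # 5.0, i.e. 5%
```

Because the CV is a percentage of the mean, it allows the variability of data sets with different means or different units to be compared.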
• How/where to get descriptive statistics in SPSS?
– Analyze -> Reports, or Analyze -> Descriptive Statistics