Professional Documents
Culture Documents
2012-2013
LAB SESSION 9
Topic TWO: Statistical Software
Learning Content:
(A)
Introduction of Statistical Software
(B)
Introduction of PSPP
(C)
(D)
(E)
(F)
Data Handling
Displaying and Summarizing Data
Modifying Data
Review on Statistics
(G)
Analyzing Data
1.
One Sample T Test
2.
Paired Samples T Test
3.
4.
5.
6.
7.
Page 1
2012-2013
2.
Paired Samples T Test
3.
Independent Sample T Test
4.
One Way ANOVA
5.
Chi Square Test of Independence
It tests the association between two categorical variables.
variables are independent, there is no relationship between them.
When two
If there is a
Page 2
2012-2013
We often need to identify which cell or cells are the major contributors to
the significant chi-square test by examining the pattern of column
percentages.
Based on the column percentages, we would identify cells on the Pepsi
row and the Spirit row as the ones producing the significant result
because they show the largest differences: 30.1% on the Coke row
(53.2%-23.3%) and 19.3% on the Pepsi row (46.7%-27.4%)
As the significance value (p-value) is .001 (< .05), we would reject the
Null Hypothesis and conclude that there is a significant relationship
between the two variables.
It is better to follow up the analysis with a post hoc test.
Step #1
Open the System File (Lab07_Answers.sav).
Step #2
From the menus, choose Analyse
Descriptive Statistics Crosstabs.
Step #3
Select Marital Status, and move it into the Rows list. Then, select Gender,
and move it into the Columns list.
Page 3
2012-2013
Step #4
Click Format and a dialog box is prompted.
Step #5
Select Print tables, Pivot, Ascending and Label, and then click Continue.
Step #6
Click Statistics and a dialog box is prompted.
Step #7
Select Chisq, and then click Continue.
Step #8
Click OK to run the procedure.
As the significance value (p-value) is .46 (> .05), we cant reject the Null
Hypothesis and conclude that there is a no significant relationship
between the two variables.
Page 4
2012-2013
6.
Bivariate Correlation
Ittests the correlation(Pearson R) between two categorical variables, by
showing the magnitude and the direction.
e.g. The relationship between head circumference at age 6 to 14 months and
the cerebral grey matter measurement at age 2 to 5 years
e.g. The relationship between the lead content of soil and the distance from a
major highway
e.g. The relationship between the vocabulary size in English and age in early
childhood education
e.g. Consumption of hot chocolate is negatively correlated with crime rate (high
values of hot chocolate tend to be paired with lower crime rates), but both are
responses to cold weather. (CAUSATION / CAUSALITY)
Assumption:
The level of measurement for both variables is scale or ordinal.
Both variables should be normally distributed.
It is a measure of linear association ONLY.
If the relationship is not approximately linear (e.g. complicated curved
relationship, such as quadratic, exponential), we need to examine the
scatterplot.
Hypotheses:
Null: There is no association between the two variables.
Alternate: There is an association between the two variables.
The correlation coefficient is a number between +1 and -1. This number
tells us about the magnitude and direction of the association between
two variables.
The MAGNITUDE is the strength of the correlation.
If the correlation is 0 or very close to zero, there is no association
between the two variables being tested.
For the correlation coefficient between -0.5 and +0.5, there is a weak
correlation and usually means there is not really any relationship
between the two variables being tested.
Page 5
2012-2013
For the correlation coefficient between -0.5 and -0.8 OR between +0.5
and +0.8, there is a moderate correlationoccurring between the two
variables being tested.
For the correlation coefficient between -0.8 and -1 OR between +0.8 and
+1, there is a strong correlation occurring between the two variables
being tested.
The DIRECTION of the correlation tells us how the two variables are
related.
If the correlation is positive, the two variables have a positive
relationship (as one increases, the other also increases).
If the correlation is negative, the two variables have a negative
relationship (as one increases, the other decreases).
Example 2
Letsread the output of a Pearson correlation between the Rosenberg
Self-Esteem Scale and the Assessing Anxiety Scale.
Page 6
2012-2013
Step #1
Open the System File (Lab09_Children.sav).
Step #2
Background:We conducted a study on the relationship between Age (month)and
Height (cm) of children; in other words, predicting Height (cm) from Age (month), if
the two variables are correlated.
Step #3
From the menus, choose Analyse Bivariate Correlation.
Step #4
Select Age (month)and Height (cm) (both variables), and move them into the list
on the right text area.
Step #5
Select Two-tailed (OR One-tailed) in Test of Significance.
Step #6
Click OK to run the procedure.
Interpretation:
The Pearson correlation coefficient (0.68) shows that there is a positive,
moderate relationship between Age (month) and Height (cm).
As age increases, height also increases.
Page 7
2012-2013
7.
Simple Linear Regression
Ittests whether there is a linear relationship between two quantitative variables.
Assumptions:
The data are linear (If you look at a Scatterplot of the data, and the data
seem to be moving in a straight line, it's a good indication that the data are
linear).
The dependent variable should be normally distributed.
When the two variables are perfectly correlated,the prediction is perfect; the
less correlatedthe variables, the less accurate the prediction.
It takes the form y = a + bx where y is the response (dependent)
variable, x is the explanatory or predictor (independent) variable and a
is the intercept term of the model, b is the slope/gradient of the linear
model.
Hypotheses:
The hypotheses for regression focus on the slope/gradient of the regression
line.
Null: The slope/gradient equals zero (there is no slope/gradient)
Alternate: The slope/gradient is not equal to zero
Example 3
There are 3 key pieces of information in the output:
The R Square value
The significance of the regression
The values of the constant and slope
Letsread the outputs of a simple linear regression between Assessing
Prejudice (independent variable) and Comfort with Inter-Ethnic Situations
(dependent variable).
Page 8
2012-2013
Page 9
2012-2013
ANOVA
Coefficients
Interpretation:
In the Model Summary table, the Pearson correlation coefficient (0.68)
shows that there is a positive, moderate relationship between Age
(month) and Height (cm).
The R Square (0.46) shows 46% of the variance in Height (cm) can be
explained by differences in levels of Age (month).
In the ANOVA table, the significance value (p-value) is .00 (< .05), we
would reject the Null Hypothesis and conclude that at least one
coefficient is not equal to zero.
Page 10
2012-2013
Page 11