Professional Documents
Culture Documents
Association
KEY CONCEPTS
H. Ghaedamini (PhD)
(Ali)
stahgh@nus.edu.sg
Instructor
Department of Statistics and Applied Probability
1
1-Relationship between two variables
Deterministic Relationship Categorical variables
The value of the dependent Contingency table
variable can be determined with Odds Ratio and Risk Ratio
the value of the independent
variable: Numerical variables
Y=3X+2 Scatter diagram
Linear correlation coefficient
Statistical Relationship For more information on types
The average pattern of one of variables:
variable can be described with https://statistics.laerd.com/stat
the value of the other variable istical-guides/types-of-
variable.php
2
2- Association between categorical
variables, 2*2 Contingency Table
Outcome
O1 O2 Total
E1 a c a+c
Exposure
E2 b d b+d
Total a+b c+d a+c+b+d
Odds of O1 among E1 = a/c
= Odds Ratio
Odds of O1 among E2 = b/d
3
Exam
3-Design of the study Point!
Representative Line
The sum of distance of each point to this line is minimum
5
5-Linear Correlation Coefficient
Ecological correlation
Correlation based on aggregated data such as group average or rate
Association will be overstated based on the aggregated data
Ecological Fallacy
Deduce the inferences on correlation about individuals based on aggregated data
Atomistic fallacy
Generalize the correlation based on individuals towards the aggregate-level correlation
Attenuation Effect
Due to range restriction in one variable, the correlation coefficient obtained tends to understate the
strength of association between two variables
Regression towards mediocrity
In virtual test-retest situations the bottom group on the first test will on average show some improvement on
the second test; and the top group will, on average, fall back
6
Disclaimer
This document serves as supplementary reading material, and is not to be seen as any
modules syllabus
This document is subjected to errors and changes at any time, and author is not liable for any
effects of the change