You are on page 1of 10

1

CorrelationalResearch
Indroduction
Corelationrefersusrelationshipbetweentwovariables.Forexample,wecouldlookatthe
relationshipbetweenageandhealthandaskwhetherhealthimprovesordeclineswithage.We
couldalsolookattherelationshipbetweenageandcognitivedeclineandseeifthesetwo
variablesarerelated.Belowisanotherexamplethatshowsarelationshipbetweentwovariables:

ThisexampleshowsthatthereisarelationshipbetweenwatchingviolenceonTVand
aggressivebehavior.However,whenwelookatcorrelationswemustbeawareofthreeissues:
Correlationsdoesnotequalcausation
Wecannotinferacausalrelationship.WecannotsaythatwatchingviolentTVprograms
causespeopletobemoreaggressive.Allwecansayisthatthereisarelationshipbetweenthese
twovariables.
Correlationsarebidirectional
Thereasonwecannotinfercausationisthatcorrelationsarebidirectional(asrepresentedby
thetwoarrowheadsbetweenTVviolenceandaggression).ItisnotclearwhetherwatchingTV
causesaggressionorwhetheraggressivepeoplesimplyliketowatchviolentprograms.
ThirdvariableorAlternativeExplanations
Ascanbeseenintheabovediagram,arelationshipbetweentwovariablesdoesnotruleout
thepossibilitythatothervariablesmaybeimportantindeterminingTVwatchingandaggression.

2
Othervariablessuchasfamilybackground,parentingstyles,socioeconomicstatus,theperson's
personality,orpeersmaybeaffectingthisrelationship.
Correlational studies are used to show the relationship between two variables. Unlike
experimental studies, however, correlational studies can only show that two variables are related
they cannot determine causation (which variable causes a change in the other). A correlational
study serves only to describe or predict behavior, not to explain it. In psychological research, it is
important to remember that correlation does not imply causation; the fact that two variables are
related does not necessarily imply that one causes the other, and further research would need to
be done to prove any kind of causal relationship.

Background and Purpose of Correlation Research


The importance of corre ation research has been emphasised by authors such as Woodworth
who published a book entitled Experimental Psychology in 1938 (Woodworth 1938) and Lee
Cronbach who published an article entitled The Two Disciplines of Scientific Psychology
(Cronbach 1957) woodworth established two major distinctions in quantitative research
methods; the distinction between (a) independent and dependent variables and (b) experimental
and correlational methods. According to Woodworth, experimental designs/methods are
characterised by manipulation of variables while a correlational design/method measures two or
more characteristics from the same person and then calculates the correlation between the
characteristics. In Woodworths view these two research approaches had equal value
correlational research must be distinguished from the experimental method, but standing on
apar with it in value, rather than above or below..... (Woodworth 1938:3). Cronbach was also
concerned about the fact that correlational research had second-rate status in scientific
3psychology. He believed that asynthesis should take place with advocates of each design
valuing them equally, and utilising both strategies. As Cronbach puts it It is not enough for
each discipline to borrow from the other.
Correlational psychology studies only variance among organisms; experimental psychology
studies only variance among treatments. A united discipline will study both of these, but it will
also be concerned with the otherwise neglected interactions between organismic and treatment
variables. (Cronbach 1957:681). Whether this synthesis happened remains debatable but
issue/topic is certainly worthy of further discussion elsewhere. Interestingly many text books
including nursing books provide little discussion on correlational research, with several
providing one to three pages in chapters entitled.

Data Collection in Correlational Research

3
Feature of correlational research is that neither variable is manipulated. It does not matter how or
where the variables are measured. A researcher could have participants come to a laboratory to
complete a computerized backward digit span task and a computerized risky decision-making
task and then assess the relationship between participants scores on the two tasks. Or a
researcher could go to a shopping mall to ask people about their attitudes toward the
environment and their shopping habits and then assess the relationship between these two
variables. Both of these studies would be correlational because no independent variable is
manipulated. However, because some approaches to data collection are strongly associated with
correlational research, it makes sense to discuss them here. The two we will focus on are
naturalistic observation and archival data. A third, survey research, is discussed in its own
chapter.

Naturalistic Observation
Naturalistic observation is an approach to data collection that involves observing peoples
behavior in the environment in which it typically occurs. Thus naturalistic observation is a type
of field research (as opposed to a type of laboratory research). It could involve observing
shoppers in a grocery store, children on a school playground, or psychiatric inpatients in their
wards. Researchers engaged in naturalistic observation usually make their observations as
unobtrusively as possible so that participants are often not aware that they are being studied.
Ethically, this is considered to be acceptable if the participants remain anonymous and the
behavior occurs in a public setting where people would not normally have an expectation of
privacy. Grocery shoppers putting items into their shopping carts, for example, are engaged in
public behavior that is easily observable by store employees and other shoppers. For this reason,
most researchers would consider it ethically acceptable to observe them for a study. On the other
hand, one of the arguments against the ethicality of the naturalistic observation of bathroom
behavior discussed earlier in the book is that people have a reasonable expectation of privacy
even in a public restroom and that this expectation was violated.

Types
There are three types of correlations that are identified:
1. Positive correlation: Positive correlation between two variables is when an increase in
one variable leads to an increase in the other and a decrease in one leads to a decrease in
the other. For example, the amount of money that a person possesses might correlate
positively with the number of cars he owns.

2. Negative correlation: Negative correlation is when an increase in one variable leads to a


decrease in another and vice versa. For example, the level of education might correlate
negatively with crime. This means if by some way the education level is improved in a
country, it can lead to lower crime. Note that this doesn't mean that a lack of education

causes crime. It could be, for example, that both lack of education and crime have a
common reason: poverty.
3. No correlation: Two variables are uncorrelated when a change in one doesn't lead to a
change in the other and vice versa. For example, among millionaires, happiness is found
to be uncorrelated to money. This means an increase in money doesn't lead to happiness.

A correlation coefficient is usually used during a correlational study. It varies between +1 and -1.
A value close to +1 indicates a strong positive correlation while a value close to -1 indicates

ROLE OF CORRELATIONAL RESEARCH


Correlational research has played an important role in the history of educational and
psychological research. Early on, the bivariate correlation was used in heredity research and then
eventually expanded into all areas of educational and psychological inquiry. Subsequently more
sophisticated multivariate extensions enabled researchers to examine multiple variables
simultaneously. Correlational research has had and will continue to have an important role in
quantitative research in terms of exploring the nature of the relations among a collection of
variables. In part, unrelated variables can be eliminated from further consideration, thereby
allowing the researcher to give more serious consideration to related variables.
Correlational research can also play an important role in the development and testing of
theoretical models. Once the nature of bivariate relations has been determined, this information
can then be used to develop theoretical models. The idea here is to attempt to explain the nature
of the bivariate correlations rather than to simply report them. At this point, methods such as
factor analysis, path analysis and structural equation modeling can come into play.

Statistical Significance
A statistically significant relationship is one that is large enough to be unlikely to have occurred
in the sample if there's no relationship in the population. The issue of whether a result is unlikely
to happen by chance is an important one in establishing cause-and-effect relationships from
experimental data. If an experiment is well planned, randomization makes the various treatment
groups similar to each other at the beginning of the experiment except for the luck of the draw
that determines who gets into which group. Then, if subjects are treated the same during the
experiment (e.g. via double blinding), there can be two possible explanations for differences
seen: 1) the treatment(s) had an effect or 2) differences are due to the luck of the draw. Thus,
showing that random chance is a poor explanation for a relationship seen in the sample provides
important evidence that the treatment had an effect.

5
The issue of statistical significance is also applied to observational studies - but in that case there
are many possible explanations for seeing an observed relationship, so a finding of significance
cannot help in establishing a cause-and-effect relationship. For example, an explanatory variable
may be associated with the response because:

Changes in the explanatory variable causes changes in the response;


Changes in the response variable causes changes in the explanatory variable;
Changes in the explanatory variable contribute, along with other variables, to changes in
the response;
A confounding variable or a common cause affects both the explanatory and response
variables;
Both variables have changed together over time or space; or
The association may be the result of coincidence (the only issue on this list that is
addressed by statistical significance).

Pearson's Correlation Coefficient


A correlation coefficient is a statistical measure of the degree to which changes to the value of
one variable predict change to the value of another. In positively correlated variables, the value
increases or decreases in tandem. In negatively correlated variables, the value of one increases as
the value of the other decreases.
Correlation coefficients are expressed as values between +1 and -1. A coefficient of +1 indicates
a perfect positive correlation: A change in the value of one variable will predict a change in the
same direction in the second variable. A coefficient of -1 indicates a perfect negative correlation:
A change in the value of one variable predicts a change in the opposite direction in the second
variable. Lesser degrees of correlation are expressed as non-zero decimals. A coefficient of zero
indicates there is no discernable relationship between fluctuations of the variables.
Correlation is a technique for investigating the relationship between two quantitative, continuous
variables, for example, age and blood pressure. Pearson's correlation coefficient (r) is a measure
of the strength of the association between the two variables.
The first step in studying the relationship between two continuous variables is to draw a scatter
plot of the variables to check for linearity. The correlation coefficient should not be calculated if
the relationship is not linear. For correlation only purposes, it does not really matter on which
axis the variables are plotted. However, conventionally, the independent (or explanatory) variable
is plotted on the x-axis (horizontally) and the dependent (or response) variable is plotted on the
y-axis (vertically).
The nearer the scatter of points is to a straight line, the higher the strength of association between
the variables. Also, it does not matter what measurement units are used.

Values of Pearson's correlation coefficient


Pearson's correlation coefficient (r) for continuous (interval level) data ranges from -1 to +1:

r = -1

data lie on a perfect straight line with a negative slope

r=0

no linear relationship between the variables

r = +1

data lie on a perfect straight line with a positive slope

Positive correlation indicates that both variables increase or decrease together, whereas negative
correlation indicates that as one variable increases, so the other decreases, and vice versa.

RANGE OF THE CORRELATION COEFFICIENT

The graphic shows that the correlation coefficient (little r) can range from a -1 through 0
to a +1. If you get a value outside that range you have made a mistake calculating little
r.
Notice that values of r less than 0 indicate a negative or inverse relationship between
variables.
Notice that values of r greater than 0 indicate a positive or direct relationship between
variables.
Finally, notice that a value of r = 0 indicates no relationship between variables. The
farther the r value is from zero, the greater the relationship.

Benefits of Correlational Research


An experiment is not always the most appropriate approach to answering a research question.
Sometimes it is not possible to carry out a true experiment for practical or ethical reasons
because it is impossible to manipulate the independent variable. If a researcher was to look at the
psychological effects of long-term ecstasy use, it would not be ethical to randomly assign
participants to a condition of long-term ecstasy use. An experiment is also not feasible when
examining the effects of personality and individual differences since participants cannot be
randomly assigned into these categories. Correlational research allows a researcher to determine
if there is a relationship between two variables without having to randomly assign participants to
conditions.
The strength of correlational research is its predictive capabilities. With a large sample size, you
can use one variable to predict the likelihood of the other when there is a strong correlation
between the two. For instance, you could take two measurements from 1,000 familieswhether
the father is an alcoholic and whether a son is an alcoholicand calculate the correlation. If
there is a strong correlation between the two measurements, it will allow you to predict, within

8
certain limits of probability, what the chances are that the son of an alcoholic father will also
have a problem with alcohol.
1. Most of the variables show some kind of relationship. For instance, there is relationship
between price and supply, income and expenditure etc. With the help of correlation analysis we
can measure in one figure the degree of relationship.
2. Once we know that two variables are closely related, we can estimate the value of one variable
given the value of another. This is known with the help of regression.
3. Correlation analysis contributes to the understanding of economic behavior, aids in locating
the critically important variables on which others depend.
4. Progressive development in the methods of science and philosophy has been characterized by
increase in the knowledge of relationship. In nature also one finds multiplicity of interrelated
forces.
5. The effect of correlation is to reduce the range of uncertainty. The prediction based on
correlation analysis is likely to be more variable and near to reality.

Limitations of Correlational Research


A correlational study serves only to describe or predict behavior, not to explain it. Always
remember that correlation does not imply causation. Since there is no random assignment to
conditions, a researcher cannot rule out the possibility that there is a third variable affecting the
relationship between the two variables measured. Even if there is no third variable, it is
impossible to tell which factor is influencing the other. Only experimental research can
determine causation. In the above example, while a research could predict the likelihood of an
alcoholic father having an alcoholic son, they could not describe why this was the case.
An excellent example used by Li (1975) to illustrate the "third variable" problem is the positive
correlation in Taiwan in the 1970's between the use of contraception and the number of electric
appliances in one's house. Of course, using contraception does not induce you to buy electrical
appliances or vice versa. Instead, the third variable of education level affects both.
Another popular example is that there is a strong positive correlation between ice cream sales
and murder rates in the summer. As ice cream sales rise, so do murder rates. Is this because
eating ice cream makes us want to murder people? The actual explanation is that when the
weather is hot, more people buy ice cream, but they also go out more, drink more, and socialize
more, leading to an increase in murder rates. Extreme temperatures observed in the summer also
have been shown to increase aggression. In this case, there are many other variables at play that
feed the correlation between murder rates and ice cream sales.

ISSUES IN CORRELATIONAL RESEARCH

9
When consuming or conducting correlational research, there are a number of issues to consider,
with some issues being positive and others negative in nature. On the positive side, once
descriptive research has helped to identify the important variables, correlational research can
then be used to examine the relations among those important variables. For example, researchers
may be interested in determining which variables are most highly related to a particular outcome,
such as student achievement. This can then lead into experimental research in which the causal
relations among those key variables can be examined under more tightly controlled conditions.
Here one independent variable can be manipulated by the researcher (e.g., method of
instruction), with other related variables being controlled in some fashion (e.g., grade, level of
school funding). This then leads to a determination of the impact of the independent variable on
the outcome variable, allowing a test of strong causal inference.
On the negative side, a limitation of correlational research is that it does not allow tests of strong
causal inference. For example, if researchers find a high bivariate correlation between amount of
instructional time (X) and student achievement (Y), then they may ask if this correlation
necessarily implies that more instructional time causes higher achievement. The answer is not
necessarily. Two variables X and Y can be highly correlated for any of the following reasons and
others: (a) X causes Y; (b) Y causes X; (c) Z causes both X and Y, but X and Y are not causally
related; (d) X and Y both cause Z, but X and Y are not causally related; and (e) many other
variables might be involved. In addition, for a causal relationship X must occur before Y. Thus a
bivariate correlation coefficient gives information about the nature of the relations between two
variables, but not why they are related. Theoretical models of educational and psychological
phenomena tend to be rather complex, certainly involving more than simply two variables. More
sophisticated correlational methods, such as factor analysis, path analysis, or structural equation
modeling, have the ability to examine the underlying relations among many variables and can,
therefore, be used as a basis to argue for causal inference.
Another limitation of correlational methods is they commonly suggest that the variables are
linearly related to one another. For example, variables X and Y can be shown to have a linear
relationship if the data can be nicely fitted by a straight line. When variables are not linearly
related, correlational methods will reduce the strength of the relationship (in other words, the
linear relation will be closer to zero). Therefore, nonlinear relationships will result in smaller
linear correlations, possibly misleading the researcher and the field of inquiry. Outliers,
observations that are quite a bit different from the remaining observations, will also reduce the
strength of the relationship. It is wise for researchers to examine their data to see if (a) variables
are linearly related (e.g., by the use of scatterplots), and (b) there are any influential observations
(i.e., outliers).

Conculusion
Findings from correlational research can be used to determine prevalence, relationships among
variables and to forecast events from current data and knowledge. In spite of its many uses
however, Prudence is required when using the design and analysing data . To assist researchers

10
in reducing mistakes key issues were sing led out for discussion and several options put forward
for analysing data.
A final limitation of correlational research occurs when a researcher seeks to consider the
relations among every possible variable. The idea is if researchers examine the relations among
enough variables, then certainly some variables will be significantly related. While there is an
exploratory consideration here, in terms of seeing which variables are related, there is a statistical
consideration as well. That is, if researchers examine enough bivariate correlations, they will find
some variables that are significantly related by chance alone. For example, if they examine 100
correlations at the .05 level of significance, then they expect to find five correlations that appear
to be significantly different from zero, even though these correlations are not truly different from
zero. In this case, the more sophisticated multivariate correlational methods can be useful in that
fewer tests of significance tend to be done than in the bivariate case.
Correlational methods of inquiry have been popular in educational and psychological research
for quite some time in part because they are foundational in nature in terms of their ability to
examine the relations among a number of variables. Also, correlational methods can be used to
develop and test theoretical models (e.g., factor analysis, path analysis, structural equation
modeling). Despite the limitations of correlational research described here, these methods will
continue to be used. Additional information on correlational methods can be found in Grimm and
Yarnold (1995, 2000), Lomax (2007), and Schumacker and Lomax (2004).

You might also like