LESSON 19:
DATA ANALYSIS

Data analysis is a set of methods and techniques that can be used to obtain information and insights from data. Data analysis turns a mass of collected data into defensible, actionable sets of conclusions and reports.

Need for data analysis:
- Data analysis can lead the researcher to information and insights that otherwise would not be available.
- It can help avoid erroneous judgments and conclusions.
- It can provide a background that helps in interpreting and understanding analyses conducted by others.
- Knowledge of the power of data analysis techniques can constructively influence research objectives and research design.

The purpose of data analysis is to produce information that will help to address the problem at hand.
Several factors influence the selection of the appropriate techniques for data analysis. These include:
- Type of data
- Research design
- Assumptions underlying the test statistic and related considerations

Type of data
The type of data plays a central role in the choice of a statistical technique to be employed in data analysis. A useful classification of data includes:
- Nominal data
- Ordinal data
- Interval data
- Ratio data
Nominal data are the most primitive form of data from the perspective of data analysis. They are just numbers assigned to objects, based only on the fact that the objects belong to particular categories. The only meaningful measure of central tendency is the mode.
Ordinal data offer a higher level of measurement than nominal data. The median and the mode are now both legitimate measures of central tendency. Most nonparametric tests can be performed on ordinal data.
Interval and ratio data are the highest forms of measurement. A wide range of parametric and nonparametric tests can be performed on them, and the mean, median and mode are all legitimate measures of central tendency.

Research Design
A second consideration that affects the choice of analysis technique is the research design used to generate the data. Some of the decisions the analyst has to face involve the dependency of observations, the number of observations per object, the number of groups analyzed and the control exercised over the variables of interest.

Assumptions underlying the test statistic
The assumptions underlying the test statistic also affect the choice of a statistical method of analysis. For example, the assumptions of a two-sample t-test are as follows:
- The samples are independent.
- The characteristic of interest in each population has a normal distribution.
- The two populations have equal variances.
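As an illustration of checking these assumptions before the test is run, the following is a minimal sketch, assuming Python with NumPy and SciPy and purely hypothetical score data:

```python
# Minimal sketch: checking two of the two-sample t-test assumptions with SciPy.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
group_a = rng.normal(loc=50, scale=10, size=30)   # hypothetical scores, group A
group_b = rng.normal(loc=55, scale=10, size=30)   # hypothetical scores, group B

# Normality of each sample (Shapiro-Wilk): a p-value above 0.05 gives no
# evidence against the normality assumption.
stat_a, p_a = stats.shapiro(group_a)
stat_b, p_b = stats.shapiro(group_b)
print("Shapiro-Wilk p-values:", p_a, p_b)

# Equality of variances (Levene's test): a p-value above 0.05 gives no
# evidence against the equal-variance assumption.
stat_lev, p_lev = stats.levene(group_a, group_b)
print("Levene p-value:", p_lev)

# Independence of the samples is a property of the research design
# (separate, unrelated respondents) and cannot be tested from the data alone.
```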

Statistical Techniques
Statistical tests have been developed to permit comparisons regarding the degree to which qualities of one group of data differ from those of another group. Each statistical test is based on certain assumptions about the population(s) from which the data are drawn. If a particular statistical test is used to analyze data collected from a sample that does not meet the expected assumptions, then the conclusions drawn from the results of the test will be flawed.
A statistical test can never establish the truth of a hypothesis with 100-percent certainty. Typically, the hypothesis is specified in the form of a null hypothesis, i.e., the score characterizing one group of measurements does not differ (within an acceptable margin of measurement error) from the score characterizing another group. Note that the hypothesis does not state the two scores are the same; rather, it states that no significant difference can be detected. Performing the statistical procedure yields a test result that helps one reach a decision that:
- The scores are not different (the hypothesis is confirmed), or
- The difference in the scores is too great to be explained by chance (the hypothesis is rejected).


Rejecting the hypothesis when it actually is true is called a Type-I
error. Failure to reject the hypothesis when it is false is termed a
Type-II error. For convenience and simplicity, a 5-percent risk of
making a Type-I error has become conventional; one should be
correct 95 out of 100 times when using the listed value in the
probability tables to accept or reject the hypothesis.
In statistics, robustness is the degree to which a test's assumptions can be violated before the confidence you can place in the result of the test changes.
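In practice, the conventional 5-percent Type-I risk becomes a simple rule applied to the p-value returned by the test. A minimal sketch, assuming Python with NumPy and SciPy and hypothetical data:

```python
# Sketch of the reject / fail-to-reject decision at the conventional
# 5-percent level, using a two-sample t-test from SciPy.
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
group_a = rng.normal(loc=50, scale=10, size=30)   # hypothetical scores
group_b = rng.normal(loc=55, scale=10, size=30)

result = stats.ttest_ind(group_a, group_b)        # assumes equal variances
alpha = 0.05                                      # accepted Type-I error risk

if result.pvalue < alpha:
    print("Reject the null hypothesis: the difference is too large "
          "to be explained by chance.")
else:
    print("Fail to reject the null hypothesis: no significant "
          "difference detected.")
```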
The entire gamut of statistical techniques can be broadly classified into univariate and multivariate techniques, based on the nature of the problem:
1. Univariate techniques
2. Multivariate techniques

Univariate Techniques


These techniques are appropriate when there is a single measurement of each of the n sample objects, or when there are several
measurements of each of the n observations, but each variable
is analyzed in isolation.


Univariate techniques are further classified, based on the type of data, as:
- Parametric techniques
- Nonparametric techniques

The two classes of statistical tests are called parametric and nonparametric. The word parametric comes from metric, meaning to measure, and para, meaning beside or closely related; the combined term refers to the assumptions about the population from which the measurements were obtained. Nonparametric data do not meet such rigid assumptions. Nonparametric tests are sometimes referred to as distribution-free; that is, the data can be drawn from a sample that may not follow the normal distribution.
Before a parametric test can be undertaken, it must be ascertained that: 1) the samples are random (i.e., each member of the population has an equal chance of being selected for measurement); 2) the scores are independent of each other; 3) the experiments are repeatable with constancy of measurements from experiment to experiment; 4) the data are normally distributed; and 5) the samples have similar variances (1,2).
Parametric statistics use mean values, standard deviation and variance to estimate differences between measurements that characterize particular populations.

Uses of Nonparametric Tests
Nonparametric tests are used in the behavioral sciences when there is no basis for assuming certain types of distributions.


Siegel has advocated that nonparametric tests be used for nominal and ordinal levels of measurement, while parametric tests be used for analyzing interval and ratio data. On the other hand, researchers have argued that statistical tests are selected to meet certain goals or to answer specific questions rather than to match certain levels of measurement with parametric or nonparametric procedures. This view currently is prevalent among many statisticians.
In practice, levels of measurement sometimes are
downgraded from ratio and interval scales to ordinal or
nominal scales for the convenience of a measuring
instrument or interpretation. For example, muscular
strength (measured with a force gauge and considering the
length of the lever arm through which the force is acting) is a
variable that yields ratio data because a true zero point exists
in the level of measurement. Nonparametric tests should
not be substituted for parametric tests when parametric tests
are more appropriate.
Nonparametric tests should be used when the assumptions of parametric tests cannot be met, when very small numbers of data are used, and when no basis exists for assuming certain types or shapes of distributions.
Nonparametric tests are used if data can only be classified, counted or ordered; for example, rating staff on performance or comparing results from manual muscle tests.

Parametric tests require data from which means and variances can be calculated, i.e., interval and ratio data. Some statisticians also support the use of parametric tests with ordinal-scaled values because the distribution of ordinal data often is approximately normal. As long as the actual data meet the parametric assumptions, regardless of the origin of the numbers, parametric tests can be conducted. As is the case with all statistical tests of differences, the researcher must interpret parametric statistical conclusions based on ordinal data in light of their clinical or practical implications.
Nonparametric tests should not be used in determining the precision or accuracy of instruments because the tests are lacking in both areas.
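Where the data can only be ordered, a rank-based test is the natural choice. The following is a minimal sketch, assuming Python with SciPy and hypothetical staff ratings; the Mann-Whitney U test is one of the rank sum tests that appear in the classification later in this lesson:

```python
# Sketch: when the data are ordinal ratings (or the parametric assumptions
# cannot be met), a rank-based test such as the Mann-Whitney U test can be
# used instead of the two-sample t-test.
from scipy import stats

# Hypothetical performance ratings (1 = poor ... 5 = excellent) for staff
# trained under two different programmes.
ratings_a = [3, 4, 2, 5, 4, 3, 4, 5, 3, 4]
ratings_b = [2, 3, 2, 3, 4, 2, 3, 3, 2, 3]

result = stats.mannwhitneyu(ratings_a, ratings_b)
print("Mann-Whitney U:", result.statistic, "p-value:", result.pvalue)
```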

Nonparametric tests can be used with data that are of the
nominal (e.g., characteristics such as right and left, male and
female) and ordinal (e.g., mild, moderate and severe) levels of
measurement, which may not follow the normal distribution
curve or comply with other assumptions required of data
analyzed by parametric statistical methods. However, the results
of analyzing data with these nonparametric statistical methods
can yield important information about the degree to which
qualities of one group of data differ from those of another
group


Advantages
Nonparametric tests usually can be performed quickly and easily without automated instruments (calculators and computers).
They are designed for small numbers of data, including counts, classifications and ratings.

They are easier to understand and explain.


Calculations of nonparametric tests generally are easy to perform and apply, and they have a certain intuitive appeal as shortcut techniques.
Nonparametric tests are relatively robust and can be used effectively for determining relationships and significance of differences using behavioral research methods.
Disadvantages
Parametric tests are more powerful than nonparametric tests and deal with continuous variables, whereas nonparametric tests often deal with discrete variables.
Using results from analyses of nonparametric tests for
making inferences should be done with caution because
small numbers of data are used, and no assumptions about
parent populations are made.


The ease of calculation and reduced concern for assumptions have led nonparametric tests to be referred to as "quick and dirty" statistical procedures.
For both nonparametric and parametric data, the next level of classification involves determining whether a single sample or multiple samples are involved. Further, with multiple samples, the appropriate statistical test depends on whether the samples are independent or dependent.

Classification of Univariate Statistical Techniques

Nonparametric statistics
- One sample: chi-square, Kolmogorov-Smirnov, runs test
- Two or more samples, independent: chi-square, rank sum tests, Kolmogorov-Smirnov, ANOVA
- Two or more samples, dependent: sign test, Wilcoxon, McNemar, Cochran Q

Parametric statistics
- One sample: t-test, z-test
- Two or more samples, independent: t-test, z-test, ANOVA
- Two or more samples, dependent: paired t-test
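To make the classification concrete, the following is a minimal sketch of the two one-sample cells, assuming Python with SciPy and hypothetical data:

```python
# Sketch of the two "one sample" cells of the classification above.
from scipy import stats

# Nonparametric, one sample: chi-square test of observed brand choices
# (nominal counts) against the default null hypothesis of an even split.
observed = [45, 30, 25]                      # hypothetical counts per brand
chi2 = stats.chisquare(observed)
print("Chi-square:", chi2.statistic, "p-value:", chi2.pvalue)

# Parametric, one sample: t-test of mean satisfaction scores (interval data)
# against a hypothesised population mean of 3.5.
scores = [3.8, 4.1, 3.2, 4.5, 3.9, 3.7, 4.0, 3.6]   # hypothetical scores
t = stats.ttest_1samp(scores, popmean=3.5)
print("t-test:", t.statistic, "p-value:", t.pvalue)
```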
Multivariate Statistical Techniques
These are defined as a collection of procedures for analyzing the association between two or more sets of measurements that are made on each object in one or more samples of objects.
Depending on the types of variables involved, multivariate techniques are divided into:
- Dependence techniques
- Interdependence techniques
Dependence techniques are appropriate when one or more variables can be identified as dependent variables and the remaining ones as independent variables. The appropriate choice of dependence technique further depends on whether one or more dependent variables are involved in the analysis.
In interdependence techniques, the variables are not classified as dependent or independent; rather, the whole set of interdependent relationships is examined. Interdependence techniques can be further classified as focusing on variables or on objects, that is, as variable interdependence or interobject similarity techniques.
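As a sketch of a dependence technique with a single dependent variable, the following fits a multiple regression of hypothetical sales on price and advertising spend, assuming Python with NumPy (ordinary least squares):

```python
# Sketch of a dependence technique: multiple regression with one dependent
# variable (sales) and two independent variables (price, advertising spend).
import numpy as np

price = np.array([10.0, 12.0, 9.0, 11.0, 13.0, 8.5])
advertising = np.array([200.0, 250.0, 150.0, 300.0, 220.0, 180.0])
sales = np.array([520.0, 480.0, 560.0, 600.0, 450.0, 540.0])

# Design matrix with an intercept column.
X = np.column_stack([np.ones_like(price), price, advertising])
coefficients, *_ = np.linalg.lstsq(X, sales, rcond=None)
print("intercept, price effect, advertising effect:", coefficients)
```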

Classification of Multivariate Statistical Techniques

Dependence techniques
- One dependent variable: ANOVA, multiple regression, discriminant analysis, conjoint analysis
- Multiple dependent variables: MANOVA and MANCOVA, canonical correlation

Interdependence techniques
- Focus on variables: factor analysis
- Focus on objects: cluster analysis, multidimensional scaling
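As a sketch of an interdependence technique focused on objects, the following groups hypothetical respondents by interobject similarity using hierarchical cluster analysis, assuming Python with NumPy and SciPy:

```python
# Sketch of an interdependence technique: hierarchical cluster analysis
# grouping objects (here, hypothetical respondents described by two
# standardised attitude scores) by interobject similarity.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

respondents = np.array([
    [0.9, 1.1], [1.0, 0.8], [1.2, 1.0],        # one apparent segment
    [-0.8, -1.0], [-1.1, -0.9], [-0.9, -1.2],  # another apparent segment
])

Z = linkage(respondents, method="ward")          # build the cluster tree
labels = fcluster(Z, t=2, criterion="maxclust")  # cut it into two clusters
print("cluster membership:", labels)
```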

