Professional Documents
Culture Documents
DISTRIBUTIONS
FREQUENCY DISTRIBUTIONS
Consider the following example
How How How How old is John? old is Mary? old is Frank?
old am I?
4 40 2 40
= 10%
3 40
= 5%
= 7.5%
SPECIAL CASE
This frequency distribution has two (nearly equal) peaks: Bi-modal distribution
After sorting
Here:
850+875 2
= 862.5
10
Symbol
: SUM
values
Formula
n = number of values
Xcel function: AVERAGE (ranges)
NB: In many textbooks the average is called the mean. This gives the honest average a poor image, so it is not used in this course.
11
12
QUICK QUIZ
You made a survey on 10 different families to see how many children they have. You obtained the following observations: 0, 0, 1, 1, 2, 2, 2, 3, 4, 5 Indicate whether each statement is true or false.
The The The The The mode is 5 average is 2.5 median is 2 variable is quantitative variable is quantitative continuous
13
Formula: =
= the value of variable at the MIDDLE of the frequency class = the value of the frequency
40Ages computer
14
SYMMETRICAL DISTRIBUTIONS
In perfectly symmetrical frequency distributions, the relative positions of MODE, MEDIAN and AVG coincide
15
ASYMMETRICAL DISTRIBUTIONS
In a asymmetrical frequency distribution the relative positions of these three parameters appear as shown. This distribution is skewed to the right. The mirror image of this situation is also possible.
AVG MEDIAN MODE
16
17
QUICK QUIZ
From the following frequency distribution, indicate whether each statement is true or false.
60 50 40 30 20 10 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
The distribution is left skewed The mode is smaller than the median and the average Mode = Median = Average The mode is between 50 and 60 The average is higher than 5 The median is between 4 and 5
18
EXERCISE 1
You are given burger sizes of the last 20 burgers sold in one fast food. Answer the following questions.
What is the type of the variable Burger Size? Compute the range. Calculate the mode, median and average. Classify the data into 4 classes and compute the frequency distribution. Represent graphically the relative frequency distribution and comment it.
19
QUICK QUIZ
You are reported in the table below the number of clients that came to your restaurant the last 50 days.
Compute the missing values
xi 25 26 27 28 29 30 > 30 ni 5 fi 10.00% 12.00% 18.00% 22.00% 10.00% Fi 10.00% 32.00% 50.00% 72.00% 100.00%
Indicate whether each statement is true or false. x3= 27 clients The sample size is 50 clients f4 = 18% of days 28 clients came to your restaurant The median is 28 clients The average cannot be calculated
9 11 5
20
EXERCISE 2
Using data from the customer satisfaction feedback of one service, answer the following questions:
What is the type of the variable? Compute the absolute and relative frequency distribution. Graph the relative frequency and comment your results.
21
GRAPHICAL TOOLS
Use of different graphical representations depends on the nature (qualitative or quantitative) of the variable being studied.
Qualitative Variable
Circle diagram Bar chart
Continous
Histogram Density Curve Box Plot
22
23
24
Procedure:
Separate each number into a stem and a leaf. Here, we choose the number of hundreds as the stem and the tens digit as the leaf Group the numbers with the same stems
Stem 5 6 7 8 9 10 11 12
Remarks:
Stem and leaf plots simultaneously show data repartition and data itself The leaves are sorted in increasing order The most difficult step is the scale choice: tens/hundreds; sometimes 5/50; 2/20, etc
25
QUICK QUIZ
As a marketing consultant you observed 50 consecutive shoppers at a grocery store, and recorded how much money each shopper spent in the store.
The following graph provides this information.
0 1 2 3 4 5 6 2 7 7 8 9 0 1 2 3 3 4 4 4 5 5 5 5 7 7 8 8 9 0 0 1 1 1 1 4 6 7 9 9 1 2 3 3 4 5 6 8 9 1 4 6 2 2 4 4 9
26
QUICK QUIZ
The scores of a team from the last Statistics quiz are given in the stem and leafs graph below. The quiz was graded on 70pts.
Reading scale : 1 | 5 represent 15 points
1 2 3 4 5 6
27
28
29
36
QUICK QUIZ
The Box Plot here under represents the Swiss Civil Aviation Airport traffic in 2009.
From the Box Plot above, indicate weather each statement is true or false. 75% of airports have an
annual traffic lower than 100'000 flights. Half of the airports have an annual traffic greater than 70'000 flights. The skew is positive. Two airports in particular have most traffic.
38
GRAPH EXAMPLES
39
GRAPH EXAMPLES
In October 2012, a well known newspaper published that the average salary in Switzerland is ranked 6th among 29 countries used for the study. Below is the reference graph published by the OFS (office ffral de la statistique). What can you conclude?
40
QUICK QUIZ
We would like to study the distribution of net monthly salary for Swiss employees in 2013. Relative frequencies per class are given in the table below:
Salary classification 0-3000 CHF 3000-4000 CHF 4000-5000 CHF 5000-6000 CHF 6000-7000 CHF 7000-8000 CHF 8000 and more CHF Total Relative frequency 2% 14% 24% 20% 13% 9% 19% 100%
41
EXERCISE 3
The life cycle of 20 bulbs from the company Superligth SA has been measured during a control. The results obtained are in the stem-and-leaf (see Excel file).
Find the quartiles of this distribution and compute the IQR. Find the average life cycle knowing that the sum of leafs are 18800 hours. Find the mode?
42
EXERCISE 4
Answer the following questions using the available exam grades distribution.
How many students attended the exam? Compute the 5-number summary of the exam results. What is the average grade? Draw the graph of the distribution and comment it.
43