
7/14/2015

Chapter 9 (Week 11)
Multiple Linear Regression Model
Dr. S. Rajalingam

L1 – Multiple Linear Regression (MLR) Model

Learning Outcomes

At the end of the lesson, the student should be able to:

- use the least squares method (LSM) to estimate a multiple linear regression model
- determine whether the model obtained is an adequate fit to the data
- perform tests for inferences on the regression parameters
- find confidence intervals (CIs) for the slopes


The General Idea

Simple regression considers the relation between a single independent variable and a dependent variable.

Source: Basic Biostat

The General Idea

Multiple regression simultaneously considers the influence of multiple independent variables on a dependent variable Y. The intent is to look at the independent effect of each variable while "adjusting out" the influence of potential confounders.


Regression Modeling

- A simple regression model (one independent variable) fits a regression line in 2-dimensional space.
- A multiple regression model with two independent variables fits a regression plane in 3-dimensional space.

Simple Regression Model

Regression coefficients are estimated by minimizing the sum of squared errors (SSE) to derive this model:

ŷ = α + β X


Multiple Regression Model

Again, estimates for the multiple slope coefficients are derived by minimizing SSE, to give this multiple regression model:

ŷ = α + β1 X1 + β2 X2
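As a minimal sketch of the idea (not part of the slides), the least squares fit for two predictors can be computed with NumPy on a small hypothetical data set:

```python
import numpy as np

# Hypothetical data: 6 observations of two predictors and a response.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
# y lies exactly on the plane with alpha = 1, beta1 = 2, beta2 = 0.5
y = 1.0 + 2.0 * x1 + 0.5 * x2

# Design matrix: a column of ones for the intercept, then x1 and x2.
X = np.column_stack([np.ones_like(x1), x1, x2])

# Least squares minimizes SSE = sum((y - X @ b)**2).
b, *_ = np.linalg.lstsq(X, y, rcond=None)
alpha, beta1, beta2 = b
```

Because the hypothetical y values lie exactly on a plane, the fit recovers α = 1, β1 = 2, β2 = 0.5.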

Multiple Regression Model

- The intercept α predicts where the regression plane crosses the Y axis.
- The slope for variable X1 (β1) predicts the change in Y per unit X1, holding X2 constant.
- The slope for variable X2 (β2) predicts the change in Y per unit X2, holding X1 constant.


Multiple Regression Model

A multiple regression model with k independent variables fits a regression "surface" in (k + 1)-dimensional space (which cannot be visualized).

MULTIPLE LINEAR REGRESSION

- An extension of the simple linear regression model.
- Allows the dependent variable y to be modeled as a linear function of more than one independent variable xi.
- Consider the following data, consisting of n sets of values:

  (y1, x11, x21, ..., xk1)
  (y2, x12, x22, ..., xk2)
  ...
  (yn, x1n, x2n, ..., xkn)

- The value of the dependent variable yi is modeled as

  yi = β0 + β1 x1i + β2 x2i + ... + βk xki + εi


- The dependent variable is related to k independent or regressor variables.
- The multiple linear regression model can provide a rich variety of functional relationships by allowing some of the input variables xi to be functions of other input variables.
- As in simple linear regression, the parameters β0, β1, ..., βk are estimated using the method of least squares.
- However, it would be tedious to find these values by hand, so we use the computer to handle the computations.
- ANOVA is used to test for significance of regression.
- The t-test is used for inference on an individual regression coefficient.

Example: MLR Models


Data for Multiple Linear Regression

Example 1: pg. 310


Multiple Linear Regression Models



Multiple Linear Regression Models

It is tedious to find these values by hand, so we use the computer to handle the computations.

Nowadays we have computers to do the calculations, so you need not worry about computing these values by hand.


Regression Analysis: y versus x1, x2

The regression equation is: Strength = ????

Predictor   Coef      SE Coef   T   P-value
Constant    2.264     1.060     ?
x1          2.74427   0.09      ?
x2          0.0125    0.002     ?

S = ?    R-Sq = ?

Analysis of Variance
Source          DF   SS       MS       F   P-value
Regression      ?    5990.4   2995.4   ?   0.000
Residual Error  22   115.2    5.2
Total           ?    6105.6


MULTIPLE LINEAR REGRESSION ANALYSIS

Testing for the significance of regression:

Hypotheses:
H0: β1 = β2 = β3 = ... = βk = 0
H1: at least one βj ≠ 0

Test statistic:
F0 = MSR / MSE

where MSR = SSR / k and MSE = SSE / (n − p), with p = k + 1.

Rejection criterion: reject H0 if F0 > f(α, k, n − p).

ANOVA Table for multiple linear regression

Source of    Sum of    Degrees of      Mean Square               Computed F
variation    Squares   freedom (df)    (Sum of squares / df)
Regression   SSR       k               MSR = SSR / k             F = MSR / MSE
Error        SSE       n − (k + 1)     MSE = SSE / (n − (k + 1))
Total        SST       n − 1
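The table's formulas are easy to check numerically. A small sketch using the rounded sums of squares from the Strength printout above (SSR = 5990.4, SSE = 115.2, k = 2, n = 25); the printed F of 572.17 differs slightly because the software carries unrounded sums of squares:

```python
def anova_f(ssr, sse, k, n):
    """Return (MSR, MSE, F0) for the overall significance test."""
    msr = ssr / k                # regression mean square: SSR / k
    mse = sse / (n - (k + 1))    # error mean square: SSE / (n - (k+1))
    return msr, mse, msr / mse

# Rounded sums of squares from the bond-strength printout (k = 2, n = 25).
msr, mse, f0 = anova_f(5990.4, 115.2, 2, 25)
```

This gives MSR = 2995.2, MSE ≈ 5.24, and F0 = 572.0, far above any reasonable F critical value, so the regression is significant.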


Inferences on the model parameters in multiple regression:

- The hypotheses are

  H0: βj = βj,0
  H1: βj ≠ βj,0

- Test statistic:

  T0 = (β̂j − βj,0) / se(β̂j)

- Reject H0 if |T0| > t(α/2, n − p).
- A 100(1 − α)% CI for an individual regression coefficient is

  β̂j − t(α/2, n − (k + 1)) se(β̂j) ≤ βj ≤ β̂j + t(α/2, n − (k + 1)) se(β̂j)
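As a sketch, the CI formula applied to the x1 slope in the Strength printout above (β̂1 = 2.74427, se = 0.09, n − (k + 1) = 22), taking the table value t(0.025, 22) ≈ 2.074:

```python
beta1_hat = 2.74427   # estimated slope for x1 from the printout
se = 0.09             # its standard error
t_crit = 2.074        # t(alpha/2, n-(k+1)) = t(0.025, 22) from the t table

# 95% CI: beta1_hat +/- t_crit * se
lo = beta1_hat - t_crit * se
hi = beta1_hat + t_crit * se
```

This gives approximately (2.558, 2.931); since 0 lies well outside the interval, β1 is significant at the 5% level, consistent with the large t statistic in the printout.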

Regression Analysis: y versus x1, x2

The regression equation is: Strength =

Predictor        Coef      SE Coef   T   P-value
Constant (β̂0)   2.264     1.060     ?
x1 (β̂1)         2.74427   0.09      ?
x2 (β̂2)         0.0125    0.002     ?

S = ?    R-Sq = ?

Reject H0 if |T0| > t(α/2, n − p)

Analysis of Variance
Source          DF   SS       MS       F        P-value
Regression      2    5990.4   2995.4   572.17   0.000
Residual Error  22   115.2    5.2
Total           24   6105.6

(The residual mean square, 5.2, is s², the estimate of σ².)

Regression Analysis: y versus x1, x2

The regression equation is: Strength = 2.26 + 2.74 Wire Ln + 0.0125 Die Ht

Predictor   Coef      SE Coef   T       P-value
Constant    2.264     1.060     2.14    0.044
x1          2.74427   0.09      30.49   0.000
x2          0.0125    0.002     6.25    0.000

S = 2.288    R-Sq = 98.1%

Analysis of Variance
Source          DF   SS       MS       F        P-value
Regression      2    5990.4   2995.4   572.17   0.000
Residual Error  22   115.2    5.2
Total           24   6105.6

Reject H0 if F0 > f(α, k, n − p)

Example 2: A set of experimental runs was made to determine a way of predicting cooking time y at various levels of oven width x1 and temperature x2. The data were recorded as follows:

y        x1      x2
6.4      1.32    1.15
15.05    2.69    3.4
18.75    3.56    4.1
30.25    4.41    8.75
44.86    5.35    14.82
48.94    6.3     15.15
51.55    7.12    15.32
61.5     8.87    18.18
100.44   9.8     35.19
111.42   10.65   40.4

a) Estimate the multiple linear regression equation from the data.


Solution:

Y        X1      X2      X1²        X2²        X1X2       X1Y        X2Y
6.4      1.32    1.15    1.7424     1.3225     1.518      8.448      7.36
15.05    2.69    3.4     7.2361     11.56      9.146      40.4845    51.17
18.75    3.56    4.1     12.6736    16.81      14.596     66.75      76.875
30.25    4.41    8.75    19.4481    76.5625    38.5875    133.4025   264.6875
44.86    5.35    14.82   28.6225    219.6324   79.287     240.001    664.8252
48.94    6.3     15.15   39.69      229.5225   95.445     308.322    741.441
51.55    7.12    15.32   50.6944    234.7024   109.0784   367.036    789.746
61.5     8.87    18.18   78.6769    330.5124   161.2566   545.505    1118.07
100.44   9.8     35.19   96.04      1238.336   344.862    984.312    3534.484
111.42   10.65   40.4    113.4225   1632.16    430.26     1186.623   4501.368

TOTAL    489.16  60.07   156.46     448.2465   3991.121   1284.037   3880.884   11750.03
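The column totals above can be reproduced in a few lines of Python, which is a useful check when building the table by hand:

```python
# Cooking-time data from Example 2
y  = [6.4, 15.05, 18.75, 30.25, 44.86, 48.94, 51.55, 61.5, 100.44, 111.42]
x1 = [1.32, 2.69, 3.56, 4.41, 5.35, 6.3, 7.12, 8.87, 9.8, 10.65]
x2 = [1.15, 3.4, 4.1, 8.75, 14.82, 15.15, 15.32, 18.18, 35.19, 40.4]

# Column sums needed for the least squares normal equations
sum_y    = sum(y)
sum_x1   = sum(x1)
sum_x2   = sum(x2)
sum_x1sq = sum(a * a for a in x1)
sum_x2sq = sum(b * b for b in x2)
sum_x1x2 = sum(a * b for a, b in zip(x1, x2))
sum_x1y  = sum(a * c for a, c in zip(x1, y))
sum_x2y  = sum(b * c for b, c in zip(x2, y))
```

The computed sums agree with the TOTAL row (up to the table's rounding).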


Least squares normal equations:

10 β̂0 + 60.07 β̂1 + 156.46 β̂2 = 489.16
60.07 β̂0 + 448.2465 β̂1 + 1284.037 β̂2 = 3880.884
156.46 β̂0 + 1284.037 β̂1 + 3991.121 β̂2 = 11750.03

Solving these normal equations gives:

β̂0 = 0.568,  β̂1 = 2.706,  β̂2 = 2.051

Then the estimated regression model is:

Ŷ = 0.568 + 2.706 X1 + 2.051 X2

Solution:
Using the computer for the computations, the following results were observed.

Regression Analysis: cooking time versus width, temperature
The regression equation is: ????

Predictor   Coef    SE Coef   T       P
Constant    0.568   0.585     0.970   0.364
Width       2.706   0.194     ???     0.000
Temp        2.051   0.046     ???     0.000

S = ?    R-Sq = ?    R-Sq(adj) = 100%

Analysis of Variance
Source          DF    SS          MS         F           P
Regression      ???   10953.334   5476.667   13647.872   0.000
Residual Error  7     2.809       0.401
Total           ???   10956.143


Solution:
Using the computer for the computations, the following results were observed.

Regression Analysis: cooking time versus width, temperature
The regression equation is:
Cooking time = 0.568 + 2.706 width + 2.051 temperature

Predictor   Coef    SE Coef   T        P
Constant    0.568   0.585     0.970    0.364
Width       2.706   0.194     13.935   0.000
Temp        2.051   0.046     44.380   0.000

S = 0.6334    R-Sq = 99.9%    R-Sq(adj) = 100%

Analysis of Variance
Source          DF   SS          MS         F           P
Regression      2    10953.334   5476.667   13647.872   0.000
Residual Error  7    2.809       0.401
Total           9    10956.143

Example 2 (continued):
b) Test whether the regression explained by the model obtained in part (a) is significant at the 0.01 level of significance.

Solution:

We use ANOVA to test for significance of regression.

The hypotheses are

H0: β1 = β2 = 0
H1: β1 and β2 are not both zero

The test statistic is

F0 = MSR / MSE


The following ANOVA table is obtained:

Analysis of Variance
Source          DF   SS          MS         F           P
Regression      2    10953.334   5476.667   13647.872   0.000
Residual Error  7    2.809       0.401
Total           9    10956.143

Taking the significance level α = 1% = 0.01:

f(α, k, n − (k + 1)) = f(0.01, 2, 7) = 9.55

F0 = 13647.87 > 9.55

Decision: Reject H0. The regression is significant, i.e., the fitted linear model explains the data.

END
