Professional Documents
Culture Documents
Executive team :Shridara Manisha Dsouza Shwetha Vedaprakash Keshav Murthy Basavaraj
Multiple Regression
Multiple regression is a statistical technique that allows us to predict
EXAMPLE:
Suppose we were interested in predicting how much an individual enjoys their job. Variables such as salary, extent of academic qualifications, age, sex, number of years in full-time employment and socioeconomic status might all contribute towards job satisfaction.
Purposes:
Prediction Explanation Theory building
exploring
linear
relationships
between
the
predictor and criterion variables that is, when the relationship follows a straight line. 2. The criterion variable that you are seeking to predict should be measured on a continuous scale
Contd.
3. Multiple regression requires a large number of observations. The number of cases (participants) must substantially exceed the number of predictor variables you are using in your regression. The absolute minimum is that you have five times as many participants as predictor variables. A more acceptable ratio is 10:1, but some people argue that
Assumptions
Independence: the scores of any particular subject are independent of the scores of all other subjects Normality: in the population, the scores on the dependent variable are normally distributed for each of the possible combinations of the level of the X variables; each of the variables is normally distributed Homoscedasticity: in the population, the variances of the dependent variable for each of the possible combinations of the levels of the X variables are equal. Linearity: In the population, the relation between the dependent variable and the independent variable is linear when all the other independent variables are held constant.
Methods
Multiple regression is an extension of bivariate linear regression. The generalized equation is Y= 0 + 1X1 + 2 X2 + .. + n Xn + where, 0 = a constant , the value of Y when all X values are zero. 1= the slope of regression surface = an error term, normally distributed about a mean of zero.
If D1=1 and D2=1, then Y 47,099 21,160 16,024 84,283 The average price of houses with a driveway and rec room is $84,283.
1 2
The average price of houses with a driveway but no rec room is $68,259.
The average price of houses with a rec room but no driveway is $63,123.
Y 47,099
The average price of houses with no driveway and no rec room is $47,099.
Dummy variables
A dummy variable is a variable which takes on the value 1 or 0 depending upon the answer to a yes or no question. For example, a dummy variable might
Example (1)
WAGE=1 + 2 ED + 3 MALE, Where MALE= 1 if the person is male, and 0 if the person is female. This generates two equations one for females and one for males. (FEMALE):WAGE=1 + 2 ED
(MALE):WAGE=1 + 2 ED + 3 = (1 + 3 ) + 2 ED
A test of the hypothesis Ho: 3=0 is a test of the hypothesis
that the wage equation is the same for men and women.
USES
Analysis of financial and economical data their fore correct interpretation of regression report is
possible.
Use to detect any systematic differences which
Conclusion
Hence the multiple regression and dummy variable analysis helps to find the dependent and independent value for some extent it could give an accurate interpretation but if the independent value is more in number it is quite complex to make interpretation
THANK YOU