Management Science Notes
Management Science Notes
√
t computed = r
n−2
1−r 2
The Population Regression Model
After calculating this value, we can find its p-value by
using Excel’s function = T . DIST .2 T (t , deg freedom )
Regression Analysis
The hypothesized relationship may be linear,
quadratic or some other form.
The next slide presents some of the possible patterns.
Simple Linear Regression Equation meaningful because no apartment can have 0 square
foot.
For regression:
Computing the slope and the intercept of the 1. Go to Data, choose Data Analysis.
regression equation, we have: 2. Choose Regression among the Data Analysis Tools.
3. Fill up necessary fields.
4. Click OK.
The estimated regression equation therefore is: The estimated regression equation is:
Y =2+ 1.25 X∨Sales=2+1.25(Payroll ) Sales=2+1.25∗(Payroll)
√
n
∑ (Y i−Y^ i )2
SYX =
√ SSE
n−2
= i=1
n−2
Where:
The estimated regression equation is:
SSE = error sum of squares
Sales=2+1.25∗(Payroll)
n = sample size
The slope of this model is 1.25. Is there a relationship
Example 1 Standard error of estimate between the payroll and the sales?
Example 1 Excel output
Multiple Regression
Multiple regression extends simple regression to
include several independent variables (called
Example 1 Excel output for assessing assumptions predictors or explanatory variables).
Sales=306.516−24.975 ( X 1 ) +74.131 ( X 2 )
The Estimated Regression Equation
b 1=−24.975
Y^ =b0 +b 1 X 1 +b 2 X 2+ …+b X k
Sales will decrease, on average by 24.975 pies per
Where: week for each $1 increase in selling price, net of the
effects of changes due to advertising.
Y^ is the predicted value of the response variable.
b 0 is the estimated intercept or constant. b 0=74.131
b 1 , b2 , bk are the estimated slope coefficients.
Sales will increase, on average, by 74.131 pies per
X 1 , X 2 , … , X k are the predictor variables.
week for each $100 increase in advertising cost, net of
the effects of changes dues to price.
Predict sales for a week if the selling price is 6.50 and
the advertising cost is $420:
Similar to simple regression, there is one residual for The coefficient of multiple determination reports the
every observation in a multiple regression: proportion total variation in Y that is explained by the
variation of all predictor variables taken together.
e i= y i− ^yi for i=1 ,2 , … , n
It is also called r-squared and is obtained by:
For a regression with k predictors, the hypotheses to We lose a degree of freedom when a new variable
be tested are: is added.
H 0: All the true coefficients are zero Did the new X variable add enough independent
power to offset the loss of one degree of freedom?
H 1: At least one of the coefficients is nonzero.
The adjusted R2 shows the proportion of variation in Y
If: explained by all X variables adjusted for the number of
Significance F/P-value>alpha (.05), accept null X variables used.
hypothesis.
Determining How Many Predictors If a predictor is proven not relevant you can omit it
when writing your equation. Therefore, in some cases,
One way to prevent overfitting the model is to limit
the simple regression is a special case of multiple
the number of predictors based on the sample size.
regression. E.g., if you have 3 predictors and two of
Evans’ Rule (conservative): n /k ≥ 10 (at least 10 them are irrelevant.
observations per predictor)
Regression Diagnostics