0% found this document useful (0 votes)

81 views

ECO 391-007 Lecture Handout For Chapter 15 SPRING 2003 Regression Analysis Sections 15.1, 15.2

Co ownership of contributed Assets All assets contributed into the partnership are owned by the partnership by virtue of its separate and distinct juridical personality (all partners jointly own contributed assets in a special sense).

Uploaded by

Noah H

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views

ECO 391-007 Lecture Handout For Chapter 15 SPRING 2003 Regression Analysis Sections 15.1, 15.2

Uploaded by

Noah H

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 22

ECO 391- 007 Lecture Handout for Chapter 15 SPRING 2003

REGRESSION ANALYSIS

Sections 15.1, 15.2

Brief outline:
I. What is Regression Analysis?
A. Define.
B. Independent and Dependent Variables.
C. MBA Admissions Example.
D. A formal definition of regression analysis and what you can use it for.
II. Linear Equations
A. One-variable case
B. Case with many variables
III. Deterministic and Stochastic Relationships
IV. The Simple Linear Regression Model

I. What is Regression Analysis?

Regression analysis is a statistical tool that allows us to look at the impact of one variable on
another while controlling for potential confounding effects. (It holds other things constant, or, in
Latin, "Ceteris Paribus")

Examples:

B. Independent and Dependent Variables

Examples: Which one of the following is a dependent variable and which are independent ones?
Rain , Agricultural Output

Education, Earnings, Experience

Alcohol Consumption, Potential for Heart Attack, Smoking

Advertising Expenditures, Sales, Prices of Substitute Goods

Independent Variables: (also called exogenous or explanatory variables) are the variables whose
value influences or determines the value of another variable (the dependent variable).

Dependent Variables: (also called endogenous variables) are the variables whose values are
influenced by the value of the independent variable.

Examples:
(1) n is the sample size and i represents the observation number.

Observation Independent Variable Dependent Variable

Number Make of Car Gasoline Mileage
(i), n = 3
1 Nissan 30
2 Cadillac 18
3 Yugo 50

(2)
Observation Dependent Variable Independent Variable Independent Variable
Number Yearly Income ($'s) Education (years) Years of Work
(i), n = 3 Experience
1 12,000 8 0
2 20,000 10 5
3 30,000 12 10

3) Dependent Variable - The number of votes a candidate receives during an election

List potential independent variables:

(4) Dependent Variable - The grade you will receive in this class:
List potential independent variables:
C. MBA Admissions Example

The Dean of B&E college needs help to determine which applicants to accept to our MBA program. He
hires you to predict how each applicant would do academically in our MBA program.

1) What factors (variables) on the applicants would you want data on?

2) How can we measure the impact of each of these variables on MBA academic performance?

D. Formal definition of regression analysis

Regression Analysis: A statistical technique that attempts to explain changes in the dependent
variable as a function of changes in independent (explanatory) variables, through the
quantification of an equation. (holding all else constant)

Econometrics is what we call regression analysis when we apply it to economic phenomena.

Reasons to use regression analysis:

1) To quantify theories.
(Describe economic reality.)

2) To test our theories

(test hypothesis)

3) To measure the strength of a relationship

4) To use for forecasting

II. Linear Relationships
A. One-Variable Case
Let X = number of minutes you talk on the phone (long-distance call to Europe)

Let Y = the size of the bill for the call.

(Y denotes the dependent variable.)
Observation Dependent Variable Independent Variable
Number Y or the bill in $. X or minutes
Call #
1 1.2 5
2 17.3 70
3 2.6 15
4 4.7 30
5 7.5 50

There is a mathematical relationship b/w X and Y: Y = f(X)

Plot these points to get a scatter plot diagram.

The points are (Xi, Yi) where i denotes the observation number.
Y

Points to plot:
(X1, Y1) = (5, 1.2) 10
(X2, Y2) = (70, 17.3)
(X3, Y3) = (15, 2.6) 8
(X4, Y4) = (30, 4.7)
(X5, Y5) = (50, 7.5) 6

Specific functional form for 4

a linear relationship:
Yi = o + 1Xi 2

0
0 10 20 30 40 50 60 70 X
0: constant term (or Y-intercept term).
0 tells us the value of Y when X is zero.
(Graphically, value of Y where the line hits the Y axis.)

1 is called the slope term. (rise over the run)

or Y/X or (Y1-Y0)/(X1-X0)
where (Xo,Yo) and (X1,Y1) are two points on the line.

If 1 < 0 the line slopes downward X and Y are inversely related.

If 1 > 0 the line slopes upward X and Y are positively related

Specific interpretation of 1:

As X increases by one unit, Y increases by the amount 1 .

if 0 > 0 and 1 > 0

Yi =0 + 1Xi

1
0

Looking ahead:

Regression analysis allows us to estimate the values of 0 and 1 that characterize the relationship between
X and Y.

B. Case of Several Variables:

Example: Expenditure on food (at constant prices) as a function of the quantities of goods.

Say, consumers choose among bread, cheese and beer.

Y= money spent on basket i, dollars

X’S = amounts of the goods in basket i

Yi = o + 1Xibread +2Xicheese + 3Xibeer ---- still a linear relationship.

Interpretation of the coefficients:

o

1

2

3
III. Deterministic and Stochastic Relationships

A deterministic relationship is one in which each value of X is paired with only one Y value. It’s
an
exact relationship, of the same nature as discussed in the previous section.

Additional example 1: Let’s assume I am selling apples at a constant price.

Y = my income from selling apples ($'s)
X = number of apples I sell

A deterministic linear relationship is represented by a straight line (one-variable case) or a three-

dimentional plane (2 variable case), etc.

Deterministic relationship: Yi = o + 1Xi

A stochastic relationship is one in which one value of X may be associated with several different
values of Y for different data points. In short, there is an underlying linear relation between X and
Y, but Y is subject to some external “noise”.

Example:
Y = yearly family expenditures on recreational activity.
X = yearly family income.

Example: Height (X) and Weight (Y) of people.

Stochastic relationship: Yi = o + 1Xi +ε i

The εi in the stochastic equation is called the random or stochastic error term.

The stochastic error term accounts for all of the other variables besides X that determine the value of
Y.
εi accounts for:

1) Independent explanatory variables besides X. (omitted from our equation.)

2) Measurement errors in data.

3) Incorrect functional form

4)Randomness-unpredictable occurrences

Note: Some dependent variables will have more inherent error than others.

Car prices VS. Divorce Rates

Regression analysis: A method of estimating stochastic relationships and analyzing the estimates.
One-variable Stochastic Relationships are best illustrated by a scatter plot diagram:
Example: height-weight stochastic relationship.

Aside: On Scatter Plot diagrams

We use scatter plot diagrams because they show us…

1) If a relationship exists between two variables.

Sample A Sample B
Y Y

X X

2) If two variables are positively(directly) or negatively(inversely) related.

Sample A Sample B
X=income, Y=consumption X=price of cars,Y=# of cars sold

Y Y

X X
3) If the relationship between two variables is linear or nonlinear.
Linear Nonlinear
Y Y

X X
4) Something about the strength of a relationship between two variables.
Sample A Sample B
Y Y

X X
IV. The Simple Linear Regression Model.

Recall that a stochastic relationship between two variables is one in which the explanatory, independent
variable explains some of the value of the dependent variable, but it is not the sole determinant of Y.
Since other variables and error in data collection might also be affecting the value of Y, we include a
random error term, , that accounts for everything that X does not.

Consider the general form of a stochastic equation below:

Yi = o + 1Xi + εi
where: o and 1 are coefficients
εi is the random or stochastic error term
and i denotes the observation number.

This equation shows the behavioral relationship between X and Y and if we estimate the specific values
of o and 1 then we have statistically quantified the relationship.

The knowledge of the  -parameters is extremely valuable in many practical applications.

However, the exact values of ’s can be known only if we have all population data in our possession,
(which we, unfortunately, do not)

The goal of linear regression analysis is to estimate the values of o and 1 using sample
data.
For example,

Let Xi be a family’s income and let Yi be the family’s spending on recreational activities.
Two families who both have an income of $60,000 per year, (X1 = $60,000 and X2 = $60,000), may have
different levels of recreational spending. (Y1 = $5,000 and Y2 = $10,000)

For any given value of X, Y is said to be a random variable meaning that Y can take on any one in a
distribution of possible values. We expect this distribution to have a mean or expected value. For
instance, ten different families who all earn $60,000 dollars may all spend different amounts on recreation,
but we may say that on average, families who earn $60,000 per year spend $7,000 on recreation.

E(YiX = Xi) or E(YiXi) is called the conditional expected value of the random variable Yi when X
takes on a specific value. Below is a distribution showing the different values the random variable Yi can
take on given that Xi takes a specific value. (here: Xi = $60,000)

E(YiXi) Yi
given Xi = $60,000
For a linear regression model

E(YiXi) = o + 1Xi

This is called the population regression equation.

The mean of the Y distribution at each value of X falls on the population regression line.

f(Y)
Y

X2
X1 X2 X3 X

The actual (observed) data points and the population regression line:

E(YiXi) = o + 1Xi
True Population Regression Line

Note that the actual data points from a sample do not all actually fall directly on the true population
regression line.

The difference between the data points and the line is represented by the random error term.
The random error term, εi = Yi - E(YiXi)
εi = Yi - o + 1Xi or
Yi = o + 1Xi + εi (The Stochastic Equation)

Thus,
1) The (o + 1Xi) portion of the above equation is the systematic or deterministic component of the
stochastic equation. If Y depended solely upon this part of the equation, then each value of X would
only be associated with one value of Y.

2) εi is the random error term. This accounts for any part of the Y value that is explained by factors
other than X. This is the part of the equation that allows one X value to be associated with more than
one Y value. (i.e. “the garbage collector”)

Again, we do not observe the entire population to get the values of β1 and β2. We need to estimate these
values using samples.

Sample Information:

1) Ŷi = bo + b1Xi is called the sample regression equation (estimated regression equation) that
shows the behavioral relationship between X and Y for the sample data. This equation serves as an
estimate of the true population regression line that we cannot actually measure.

This implies that bo is an estimate of o

and b1 is an estimate of 1

2) ei is the estimated value of εi and it represents the distance between

observed data points and the sample regression line. It is called the residual value.

Yi(hat) is called the predicted (or fitted) value of Yi given X = Xi.

The actual (observed) data points and the sample regression line:

Yi = βo + β1Xi
Population Regression Line

Yi = bo + b1Xi
Sample Regression Line
eI (the residual) is an estimate of εi and it represents the difference between the actual observed Yi value
and the Ŷi value that is predicted by plugging Xi into the estimated regression line formula.
There will be n residual values, one for each data point pair.

e1, e2, and e3 , etc.

ei = Yi - Ŷi or ei = Yi - bo - b1Xi

Example:

Y = consumption in dollars per day

X = income in dollars per day

ei, the estimated

Yi(hat) the estimated value
Observation # Xi Yi value of εi (the
of Yi (predicted value)
residual)
1 10 6

2 15 8

3 8 5

4 12 8

5 14 10

Suppose that we take these data points and estimate the sample regression equation.
(We would be using formulas and techniques that you will learn in 15.3.) We would estimate:

E(YiXi) = o + 1Xi (the population regression line)

using

Ŷi = bo + b1Xi (the sample regression line)

After using the method of least squares that we will learn, we find the bo = 2 and b1 = .5 or

Ŷi = 2 + .5Xi
IN CLASS EXERCISE:

1) Graph the sample regression line. Return to the previous table and for each value of X, calculate the predicted
value of Yi, or Ŷi. Plot each of these five predicted values on the graph below. Connect these points and you
have the sample regression line. You will be graphing the points (X i, Ŷi). As we plug each of the values of X into
the sample regression equation, we will calculate the predicted value Ŷ i. This is the value of Y if we fit it perfectly
into the behavioral relationship defined by the sample regression line. Complete the fourth column of the table.

2) Plot the five original, observed data points. Label the actual, observed data points 1, 2, 3, 4, and 5.

(X1,Y1), (X2,Y2), etc.

3) On the graph, mark the distance between the sample regression line and the
actual observed data points. These distances represent the residuals. In the table above, calculate the value of the
residuals to complete the last column.
Recall that the residual is calculated as ei = Yi - Ŷi.

Y
14

2 4 6 8 10 12 14 16 18 20 X
Next time we will study how to estimate ’s using the sample data above (actually, we will look for such bo, b1 that
minimize the sum of squared residuals. For now, let’s take for granted that the best estimates are bo = 2 and b1=0.5

Part 2: Write the intuitive interpretation of the estimated coefficients:

bo = 2: means that….

b1 = .5: means that…

An Overview of Regression Analysis
Questions for Practice

1) To test your understanding of linear relationships, try graphing the following linear equations:
a) Y = 4 + 2X
b) Y = 4 - 2X
c) Y = 2 + 2X
d) Y = 2 + 3X
Note that larger values of the slope make the graph of the line appear steeper.
e) Try to verbally interpret the coefficients.

2) Suppose that a company installs and repairs copying machines. The company studied the relationship
between repair costs for a sample of six machines and the number of pages copied by each machine. The
goal is to identify machines whose costs are too high relative to their copying volumes. The repair costs
in dollars and the pages copied in thousands for the six machines are as follows:

Machine 1 2 3 4 5 6
Repair Cost 85 120 70 165 125 90
Pages 900 1350 550 850 1500 800
Copied

a) Which variable is the dependent variable and which is the independent variable? Why?

b) Make a scatter diagram of these observations.

c) Does the maintenance cost of any machine seem to be out of line?

d) Does there appear to be any relationship between repair costs and the number of pages copied? (i.e.
direct or inverse, linear or nonlinear, weak or strong.)

e) Can you think of any other independent variables that might be influencing this dependent variable?

3) a) Based on lecture to this point, write your own definition of regression analysis that makes sense to
you and memorize it.

b) What are the three primary uses for regression analysis? Give one specific example of each that we
did not discuss in class.

4) If the points (3,18) and (6,9) are two points on a straight line,
a) What is the slope of that line?

b) Are the variables X and Y positively or negatively related?

c) Interpret the value of the slope.

d) Based on the information you have been given, can you find the value of the
Y-intercept term? If so, find it.

5) Consider the following related variable pairs. Which pairs show deterministic relationships and which
show stochastic relationships? Explain.

X Y
Number of hamgurgers Person i’s weight
consumed per week by
person i

Number of people who Ticket Revenues from

pay for a ticket to ball ball game i
game i

Number of hours Person i’s GPA

spent studying per
week by person i

6) Does regression analysis attempt to estimate deterministic or stochastic relationships? Explain.

7) Explain the four factors that contribute to the random error term.

8) Which dependent variable, people’s annual income or attendance at UK basketball games, would you
expect to exhibit more random (unexplainable) inherent variation and why?

9) Along with this practice sheet you were given a copy of UK’s MBA program admission application.
In an earlier class, we considered the variables that might determine an applicant’s academic success in
the program. Looking at the application, you will see that the class came up with most of the same
variables that the admissions office actually considers. If we wanted to estimate a student’s MBA GPA
as a function of these potential determinants, list the variables from the application form that we can
actually quantify (measure and use numerically) in our estimation. What unit of measure would we use
for each of these dependent variables? For each variable discuss how reliable you think the data are. (An
important bit of info for this class - the word data is plural.)
10) Each year top American cities are ranked according to their ability to provide high-quality and low-
cost labor to companies that are relocating. One important measure used to form the rankings is the labor
stress index, which indicates the availability of workers in the city. (The higher the index, the tighter the
job market - i.e. the more difficult for employers to find employees.) Note that one of the determinants
of this measure is the unemployment rate. The values of these two variables for each of the top 10 cities
are listed below in the table.
Obs. # 1 2 3 4 5 6 7 8 9 10
Labor Market 107 107 100 100 80 100 100 93 87 80
Stress Index(Y)
Unemployment 4.5% 3.8% 5.1% 4.9% 5.4% 4.8% 5.5% 4.3% 5.7% 4.6%
Rate(X)

(When calculating your statistics, treat the percentages as whole numbers, i.e. enter 4.5% as the number
4.5 rather than .045. The results should be comparable, but your calculations by hand will be less
tedious.)

a) What is the independent variable? Explain.

b) What is the dependent variable? Explain.

c) Is this a stochastic relationship? Explain.

d) Construct a scatter plot diagram.

e) Based on your scatter plot diagram, what is your initial conclusion about the relationship between the
labor market stress index and the unemployment rate? (Relationship positive or negative, linear or
nonlinear, strong or weak?)

11) a) Graph the true regression line and the estimated regression line assuming that o > o and
1 < 1, with each being positive. Clearly denote each line.

b) In the graph, plot one observation (data point) that is below both lines. Show for that observation the
residual, e, and the stochastic error term, . (2 points)

12) True/False and Explain:

a) One Drawback of conducting controlled experiments is the potential for confounding effects.

b) Regression analysis is used to test theories, quantify theories, and make forecasts.
Lecture I: An Overview of Regression Analysis KEY KEY KEY
Questions for Practice
1) When graphing a linear equation there are a few things to keep in mind. The most obvious place to
start is with the intercept term. The Y=intercept, or o, tells the value of Y when X is zero. This is the
number that appears as the constant term in the equation. So for a) we know that one point on the line is
the point (0,4). Find another point that satisfies the equation. For instance if X = 2, Y = 4 + 2(2) = 4 + 4
= 8. So another point on the line is the point (2,8). All you need to graph a linear function are two
points. Graph these two points a draw a line that runs through both.
d a
Y
c

8
7
6
5
4
3
2
1

1 2 3 4 5 6 7 8 9 10 X
b
Note that larger values of the slope make the graph of the line appear steeper.
e) A verbal interpretion of the coefficient would say as X increases by one unit Y increases by the value
of the slope. For instance, in part a), as X increases by one unit, Y increases by 2.
2) a) The repair cost is dependent because it depends upon the level of use. Pages copied would then be
the independent variable.
b) Scatter diagram:
Repair Costs
170
160
150
140
130
120
110
100
90
80
70

500 600 700 800 900 1,000 1,100 1,200 1,300 1,400 1,500 #of copies
c) the maintenance cost of machine 4 seems to be out of line. It stands out from the other points in the
diagram.

d) Given the appearance of the scatter diagram it would seem that the variables are positively and linearly
related. The relationship appears to be very strong.

e) Other independent variables that might be influencing the repair cost could be i) how often the user
cleans the machine and how well they maintain service for the machine; ii) Do they give the machine a
rest in between running big jobs; iii) do they use the appropriate type of paper; iv) do they use the
machine as the backboard in the office’s big Nerf basketball championship; etc.

3) a) This one I leave to you.

b) Regression analysis may be used to test theories, quantify relationships, and make predictions or
forecasts. I will let you work on the examples.

4) If the points (3,18) and (6,9) are two points on a straight line,

a) the slope would be (Y2 - Y1)/(X2 - X1) or (9-18)/(6-3) or -9/3 or -3.

b) Since the slope is negative, we can assume that the variables are negatively related.

c) A slope of -3 says that as X increases by one unit, Y decreases by 3 units.

d) Based on the information you have been given, you can find the value of the
Y-intercept term. Try a little simple algebra. We know that a linear equation can be written as Y = o +
1X. We know that 1 = -3. Plug in the X and Y values from one of the points. We know these points
“satisfy” the equation.
18 = o - 3(3) or 18 = o - 9 or 27 = o
Also, if you draw the graph, plot the two points, and draw the line going through them. You can usually
see where it hits the Y-axis. (although this is not always the most accurate approach.)

5) i) The relationship between hamburger consumption and human weight is stochastic. While
hamburger consumption certainly might have an impact on weight, other factors besides hamburger
consumption are also important in determining weight.
ii) The relationship between ticket sales and ticket revenues is deterministic because the number of
tickets sold (as long as we know the price) completely determines the revenue from selling the tickets.
iii) The relationship between study time and GPA is stochastic because other factors in addition to study
time are essential in determining the value of GPA.

6) Regression analysis attempts to estimate stochastic relationships. The whole point

of the analysis is to explain the factors that make one observation have a different value of the dependent
variable from some other observation. With a deterministic equation, we would already know why a
difference occurred. For instance, if girl scout cookies sell for $2.50 per box and Ingrid sells 10 boxes,
her cookie revenue will be $25.00. If Constance also sells 10 boxes, she too will have revenue of $25.00.
BORING. There is not really anything there to analyze. Now suppose that Ingrid and Constance, who
are both girl scouts, hit the streets selling cookies. Ingrid sells 100 boxes and Constance sells 20 boxes.
The interesting question is to figure out why. What is the difference between these two girl scouts that
might explain the wide difference in sales? This is something regression analysis might allow us to

20
consider. Is age a factor? Did each girl sell in their home neighborhood? How many doors did each girl
knock upon? Did they use the phone to try to make sales? Is Ingrid more pleasant looking or more
outgoing? Is Constance less motivated? Does Ingrid come from a very big family with LOTS of
relatives?

7) The random error term consists of four components:

i) Omitted explanatory variables
ii) measurement error in the data
iii) selection of the wrong functional form to represent the relationship
iv) purely random variation

8) Which dependent variable, people’s annual income or attendance at UK basketball

games, would you expect to exhibit more random (unexplainable) inherent variation and why? The
variable that has the most of this type of variation is the one that we feel we can explain the least. So for
each variable - try to think of what explains it. For basketball games, attendance might be determined by
how well the team is doing, weather, flu epidemics, school vacations, etc. We can do a pretty good job of
explaining it. Now let’s think about income. It is affected by our level of education, training, job
experience, personal connections, motivation, physical skills, etc. I wrote this question and I am not
exactly sure myself of what the answer is, but I would imagine that in KY we can probably explain and
predict attendance at basketball games better than we can predict someone’s annual income. This implies
that there are reasons two people might have different incomes that we cannot determine.

10) a) The independent variable is the unemployment rate. This variable is one of the determinants of the
stress index that tells us how tight the job market is in an area.
b) The dependent variable is the stress index. Its value is determined or a function of the unemployment
rate.
c) This is a stochastic relationship. The value of the stress index varies for other reasons besides just the
level of unemployment. (i.e. unemployment is not the sole determinant of the stress index.)

21
d) See Below: 110
Stress
Index
105

100

Unemployment Rate
3.8 4.0 4.2 4.4 4.6 4.8 5.0 5.2 5.4 5.6 5.8
e) Although it is somewhat difficult to see given this scatter plot, it would appear that there is some sort
of linear, negative relationship although it does not look very strong.
11) Estimated Regression Line

a) and b) Y

1 True Population
Regression Line
1

ei i

o

o

12) a) False: Controlled experiments allow you to avoid the problems related to
confounding affects by controlling for potential confounding factors.
b) True: These are the reasons we discussed for using regression analysis.

Joy at Work by Dennis Bakke
100% (7)
Joy at Work by Dennis Bakke
16 pages
Introduction To Scientific Management
50% (2)
Introduction To Scientific Management
30 pages
3.topic 1 - Intro HRM
No ratings yet
3.topic 1 - Intro HRM
22 pages
Regression Analysis - SSB
No ratings yet
Regression Analysis - SSB
2 pages
4 STAT-602 Regression & Correlation (Mid&Final)
No ratings yet
4 STAT-602 Regression & Correlation (Mid&Final)
22 pages
Unit Regression Analysis: Objectives
No ratings yet
Unit Regression Analysis: Objectives
18 pages
Econometrics I Handout
No ratings yet
Econometrics I Handout
41 pages
Econometrics Chapter Two
No ratings yet
Econometrics Chapter Two
36 pages
Lecture Notes
No ratings yet
Lecture Notes
141 pages
M1 Stat-701 SLR 2022
No ratings yet
M1 Stat-701 SLR 2022
17 pages
CH - 3 - Econometrics UG
No ratings yet
CH - 3 - Econometrics UG
38 pages
regression analysis
No ratings yet
regression analysis
8 pages
Regression Analysispdf
No ratings yet
Regression Analysispdf
20 pages
Chapter 0
No ratings yet
Chapter 0
10 pages
Chapter 10
No ratings yet
Chapter 10
3 pages
Chapter Two Metrics (I)
No ratings yet
Chapter Two Metrics (I)
35 pages
Chapter 3 - Linear Regression
No ratings yet
Chapter 3 - Linear Regression
43 pages
Lecture 2.2: Simple Regression Model-Linear Equation With One Independent Variable
No ratings yet
Lecture 2.2: Simple Regression Model-Linear Equation With One Independent Variable
14 pages
Student Notes Madule 2
No ratings yet
Student Notes Madule 2
12 pages
Regression Course For Second Year (Chap 1-3)
No ratings yet
Regression Course For Second Year (Chap 1-3)
59 pages
ch 14 .....
No ratings yet
ch 14 .....
36 pages
simple-regression
No ratings yet
simple-regression
14 pages
Regression and Correlation Analysis
No ratings yet
Regression and Correlation Analysis
16 pages
STAT Q4 Week 9 Enhanced.v1
No ratings yet
STAT Q4 Week 9 Enhanced.v1
11 pages
Note Simple Linear Regression
No ratings yet
Note Simple Linear Regression
17 pages
ECON3049 Lecture Notes 1
No ratings yet
ECON3049 Lecture Notes 1
32 pages
Simple and Multiple Linear Regression
No ratings yet
Simple and Multiple Linear Regression
91 pages
Brief Lecture Notes On Simple Linear Regression Regression Analysis
No ratings yet
Brief Lecture Notes On Simple Linear Regression Regression Analysis
8 pages
ECO - Chapter 2 SLRM
No ratings yet
ECO - Chapter 2 SLRM
40 pages
Correlation and Regression Analyses
No ratings yet
Correlation and Regression Analyses
8 pages
Econometrics Unit 3 Tedy Best
No ratings yet
Econometrics Unit 3 Tedy Best
147 pages
correlation
No ratings yet
correlation
13 pages
Econometrics Chapter Two-1
No ratings yet
Econometrics Chapter Two-1
41 pages
Management Science Notes
No ratings yet
Management Science Notes
13 pages
Aiml M3 C3
No ratings yet
Aiml M3 C3
37 pages
Chapter 1
No ratings yet
Chapter 1
24 pages
14 - Regresi dan Korelasi
No ratings yet
14 - Regresi dan Korelasi
34 pages
A Tutorial On How To Run A Simple Linear Regression in Excel
No ratings yet
A Tutorial On How To Run A Simple Linear Regression in Excel
19 pages
6.1 Basics-of-Statistical-Modeling
No ratings yet
6.1 Basics-of-Statistical-Modeling
17 pages
Syndicated Learning Program - II (SLP-II) Regression Analysis
No ratings yet
Syndicated Learning Program - II (SLP-II) Regression Analysis
26 pages
Correlation and Regression
No ratings yet
Correlation and Regression
7 pages
Chapter 6
No ratings yet
Chapter 6
58 pages
1486016038da-mod12-Q1-e-text
No ratings yet
1486016038da-mod12-Q1-e-text
11 pages
Regression Analysis
No ratings yet
Regression Analysis
65 pages
BS Ref17
No ratings yet
BS Ref17
32 pages
Session 15 Regression and Correlation
No ratings yet
Session 15 Regression and Correlation
66 pages
SIMPLE LINEAR REGRESSION ANALYSIS..
No ratings yet
SIMPLE LINEAR REGRESSION ANALYSIS..
51 pages
Business Analytics
No ratings yet
Business Analytics
19 pages
Ria Stats regression analysiss
No ratings yet
Ria Stats regression analysiss
2 pages
Chapter 14 (14.1 - 14.2)
No ratings yet
Chapter 14 (14.1 - 14.2)
22 pages
Econometrics Chapter _Two (1)
No ratings yet
Econometrics Chapter _Two (1)
71 pages
Regression Analysis (Simple)
100% (1)
Regression Analysis (Simple)
8 pages
Econometrics- chapter -chapter- II
No ratings yet
Econometrics- chapter -chapter- II
34 pages
Regression
No ratings yet
Regression
25 pages
Untitled 472
No ratings yet
Untitled 472
13 pages
Ra Web
No ratings yet
Ra Web
70 pages
Statistics LEC8 Correlation Ciefficient
No ratings yet
Statistics LEC8 Correlation Ciefficient
13 pages
Unit 5
No ratings yet
Unit 5
104 pages
Correlation & Regression Analysis
100% (1)
Correlation & Regression Analysis
39 pages
Data Analytics Unit 3
No ratings yet
Data Analytics Unit 3
104 pages
Econometrics Notes
No ratings yet
Econometrics Notes
6 pages
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
From Everand
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
Yue Jiang
4.5/5 (2)
Co-Clustering: Models, Algorithms and Applications
From Everand
Co-Clustering: Models, Algorithms and Applications
Gérard Govaert
No ratings yet
The Advantages and Disadvantages of A Partnership
No ratings yet
The Advantages and Disadvantages of A Partnership
4 pages
Forming A Partnership
No ratings yet
Forming A Partnership
2 pages
Accounting 101 - Cash and Cash Equivalents
No ratings yet
Accounting 101 - Cash and Cash Equivalents
2 pages
Accounting 101 - Basic Accounting Terms
No ratings yet
Accounting 101 - Basic Accounting Terms
10 pages
Accounting 3 - Cash and Cash Equivalents
No ratings yet
Accounting 3 - Cash and Cash Equivalents
6 pages
Accounting 2 Statement of Financial Position Balance Sheet
100% (1)
Accounting 2 Statement of Financial Position Balance Sheet
5 pages
Accounting 3 Cash Flow Statement Discussion
No ratings yet
Accounting 3 Cash Flow Statement Discussion
6 pages
IFR The Impact of Robots On Employment Positioning Paper Updated Version 2018
No ratings yet
IFR The Impact of Robots On Employment Positioning Paper Updated Version 2018
17 pages
Unit - II Labour: I. Fill in The With Appropriate Answers
No ratings yet
Unit - II Labour: I. Fill in The With Appropriate Answers
4 pages
Crop Profitability Calculator
No ratings yet
Crop Profitability Calculator
12 pages
Paul Baran - On The Political Economy of Backwardness
100% (1)
Paul Baran - On The Political Economy of Backwardness
19 pages
Eco 101
No ratings yet
Eco 101
12 pages
Chapter 12 Standard Costing
0% (1)
Chapter 12 Standard Costing
95 pages
Overheads notes 2024-2025
No ratings yet
Overheads notes 2024-2025
35 pages
An Introduction To Doing Business in Indonesia 2023
No ratings yet
An Introduction To Doing Business in Indonesia 2023
76 pages
Using The Above Explanation
No ratings yet
Using The Above Explanation
6 pages
Adecco Thailand SLG2024
No ratings yet
Adecco Thailand SLG2024
108 pages
Introduction To Sociology
100% (5)
Introduction To Sociology
74 pages
Costing
No ratings yet
Costing
77 pages
Public Economics Lectures
No ratings yet
Public Economics Lectures
884 pages
2 Ananta KR Nath
No ratings yet
2 Ananta KR Nath
5 pages
15 Guidance Notes On Standard Costing
No ratings yet
15 Guidance Notes On Standard Costing
49 pages
Theory of Market Socialism
No ratings yet
Theory of Market Socialism
13 pages
Case Study - On Working
No ratings yet
Case Study - On Working
60 pages
Catch Up Friday Jan 19 1
No ratings yet
Catch Up Friday Jan 19 1
4 pages
10-1108_jsit-08-2024-0297
No ratings yet
10-1108_jsit-08-2024-0297
30 pages
Chapter-Two: Competitiveness, Strategies, and Productivity in Operations
No ratings yet
Chapter-Two: Competitiveness, Strategies, and Productivity in Operations
63 pages
Child Labor Tyranny Speech
No ratings yet
Child Labor Tyranny Speech
4 pages
Labour Law Assignment..
No ratings yet
Labour Law Assignment..
17 pages
Posthuman Capitalism PDF
No ratings yet
Posthuman Capitalism PDF
5 pages
Competitiveness, Strategy, and Productivity
No ratings yet
Competitiveness, Strategy, and Productivity
45 pages
Limiting Factor
0% (2)
Limiting Factor
4 pages
Sapiens Nihil Affirmat Quod Non Probat
No ratings yet
Sapiens Nihil Affirmat Quod Non Probat
14 pages
SUMMER HOLILDAYS Homework
No ratings yet
SUMMER HOLILDAYS Homework
33 pages

ECO 391-007 Lecture Handout For Chapter 15 SPRING 2003 Regression Analysis Sections 15.1, 15.2

Uploaded by

ECO 391-007 Lecture Handout For Chapter 15 SPRING 2003 Regression Analysis Sections 15.1, 15.2

Uploaded by

ECO 391- 007 Lecture Handout for Chapter 15 SPRING 2003

Sections 15.1, 15.2

I. What is Regression Analysis?

B. Independent and Dependent Variables

Education, Earnings, Experience

Alcohol Consumption, Potential for Heart Attack, Smoking

Advertising Expenditures, Sales, Prices of Substitute Goods

Observation Independent Variable Dependent Variable

3) Dependent Variable - The number of votes a candidate receives during an election

D. Formal definition of regression analysis

Econometrics is what we call regression analysis when we apply it to economic phenomena.

Reasons to use regression analysis:

2) To test our theories

3) To measure the strength of a relationship

4) To use for forecasting

Let Y = the size of the bill for the call.

There is a mathematical relationship b/w X and Y: Y = f(X)

Plot these points to get a scatter plot diagram.

Specific functional form for 4

1 is called the slope term. (rise over the run)

If 1 < 0 the line slopes downward X and Y are inversely related.

Specific interpretation of 1:

if 0 > 0 and 1 > 0

B. Case of Several Variables:

Say, consumers choose among bread, cheese and beer.

Y= money spent on basket i, dollars

Yi = o + 1Xibread +2Xicheese + 3Xibeer ---- still a linear relationship.

Interpretation of the coefficients:

Additional example 1: Let’s assume I am selling apples at a constant price.

A deterministic linear relationship is represented by a straight line (one-variable case) or a three-

Deterministic relationship: Yi = o + 1Xi

Example: Height (X) and Weight (Y) of people.

Stochastic relationship: Yi = o + 1Xi +ε i

1) Independent explanatory variables besides X. (omitted from our equation.)

2) Measurement errors in data.

3) Incorrect functional form

Car prices VS. Divorce Rates

Aside: On Scatter Plot diagrams

1) If a relationship exists between two variables.

2) If two variables are positively(directly) or negatively(inversely) related.

Consider the general form of a stochastic equation below:

The knowledge of the  -parameters is extremely valuable in many practical applications.

This is called the population regression equation.

This implies that bo is an estimate of o

2) ei is the estimated value of εi and it represents the distance between

Yi(hat) is called the predicted (or fitted) value of Yi given X = Xi.

e1, e2, and e3 , etc.

Y = consumption in dollars per day

ei, the estimated

E(YiXi) = o + 1Xi (the population regression line)

Ŷi = bo + b1Xi (the sample regression line)

(X1,Y1), (X2,Y2), etc.

Part 2: Write the intuitive interpretation of the estimated coefficients:

b1 = .5: means that…

b) Make a scatter diagram of these observations.

c) Does the maintenance cost of any machine seem to be out of line?

b) Are the variables X and Y positively or negatively related?

c) Interpret the value of the slope.

Number of people who Ticket Revenues from

Number of hours Person i’s GPA

6) Does regression analysis attempt to estimate deterministic or stochastic relationships? Explain.

a) What is the independent variable? Explain.

b) What is the dependent variable? Explain.

c) Is this a stochastic relationship? Explain.

d) Construct a scatter plot diagram.

12) True/False and Explain:

3) a) This one I leave to you.

a) the slope would be (Y2 - Y1)/(X2 - X1) or (9-18)/(6-3) or -9/3 or -3.

c) A slope of -3 says that as X increases by one unit, Y decreases by 3 units.

6) Regression analysis attempts to estimate stochastic relationships. The whole point

7) The random error term consists of four components:

8) Which dependent variable, people’s annual income or attendance at UK basketball

You might also like