
Chapter 15

Qualitative Response Regression Models


Part 2
Before the recess…..
• In the models considered in Chapters 1 to 13 the dependent variable was quantitative, whereas the explanatory variables were quantitative, qualitative or a mixture
• In this chapter the dependent variable (or response variable) is qualitative
• The linear probability model (LPM) can be estimated by OLS
– But this poses problems
Problems of LPM
• The LPM poses several problems (we can therefore not simply extend OLS to binary dependent variable regression models)
– Non-normality of the disturbances
– Heteroscedastic variances of the disturbances
– Nonfulfillment of 0 ≤ E(Y | X) ≤ 1 (fitted probabilities can fall outside the 0–1 interval)
– Questionable value of R² as a measure of goodness of fit

• Most of these problems are surmountable

• But the fundamental problem is that the LPM assumes the marginal effect of X remains constant throughout, e.g. in the home ownership example the probability of owning a house increases by the same constant amount of 0.1 for every unit increase in income.
Alternatives to LPM
• What do we need?
– A probability model where, as X increases, the probability of the event occurring increases but
never steps outside the 0-1 interval and
– The relationship between P and X is nonlinear – “one which approaches zero at slower and
slower rates as X gets small and approaches one at slower and slower rates as X gets very large.”

• Such an S-shaped curve resembles the cumulative distribution function (CDF) of a random variable
• For each random variable, there is a unique CDF
• CDFs commonly chosen to represent the 0-1 response models are:
– Logistic (Logit) Model
– Normal (Probit/Normit) Model
Logit model
• Home ownership example, LPM: Pi = E(Y = 1 | Xi) = β1 + β2Xi

• Now consider a different representation:

• Pi = 1 / (1 + e^-(β1 + β2Xi)) = 1 / (1 + e^-Zi), where Zi = β1 + β2Xi (the logistic distribution function)
• Zi ranges from -∞ to +∞, Pi ranges between 0 and 1, and Pi is nonlinearly related to Zi (i.e. to Xi) – both requirements solved!
• But a new estimation problem is created – Pi is also nonlinear in the parameters, so OLS can’t be used – but the equation can be linearized
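A minimal sketch (not from the slides; it assumes Python with NumPy is available) showing that the logistic function keeps the probability inside the 0–1 interval no matter how large or small Z becomes:

```python
import numpy as np

def logistic(z):
    """Logistic CDF: P = 1 / (1 + exp(-Z))."""
    return 1.0 / (1.0 + np.exp(-z))

# Even for very large or very small Z, P never steps outside (0, 1)
for z in np.linspace(-10, 10, 9):
    print(f"Z = {z:6.2f}  ->  P = {logistic(z):.4f}")
```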
Linearizing the model (1)
• Probability of owning a house: Pi = 1 / (1 + e^-Zi)

• Probability of not owning a house: 1 - Pi = 1 / (1 + e^Zi)

• Odds ratio in favour of owning a house: Pi / (1 - Pi) = e^Zi (the ratio of the probability that a family will own a house to the probability that it will not own a house, e.g. if P = 0.8, the odds are 0.8/0.2 = 4 to 1 in favour of the family owning a house)
Linearizing the model (2)
• Take the natural log of the odds ratio: Li = ln(Pi / (1 - Pi)) = Zi = β1 + β2Xi

– L is called the logit, hence the name logit model

– The logit is the log of the odds ratio
– It is not only linear in X but also linear in the parameters
Features of the logit model
• As P goes from 0 to 1, the logit L goes from - ∞ to + ∞, thus although the
probabilities lie between 0 and 1, logits are not so bounded
• Although L is linear in X, the probabilities themselves are not (in contrast with
LPM, where probability increases linearly with X)
• Many regressors can be added to the model
• If L is positive, the odds that the regressand equals 1 (i.e. that the event of interest happens) increase as the value of the regressor(s) increases. If L is negative, the odds that the regressand equals 1 decrease as the value of X increases
• The slope measures the change in L for a unit change in X, i.e. how the log-odds in favour of owning a house change as income changes by a unit. The intercept measures the value of the log-odds in favour of owning a house if income is zero
Features of the logit model
• Given a certain level of income (e.g. X*), if we actually want to estimate the probability of owning a house (not the odds in favour of owning a house), it can be estimated directly from Pi = 1 / (1 + e^-(β1 + β2X*)) once the estimates of β1 and β2 are available
• The LPM assumes that Pi is linearly related to Xi; the logit model assumes that the log of the odds ratio is linearly related to Xi
Estimation of logit model
• Li = ln(Pi / (1 - Pi)) = β1 + β2Xi + ui ………(1)
• To estimate equation (1), we need values for X and L
• Estimation depends on the type of data we have for the analysis
– Data at the individual / micro level
– Grouped / replicated data (we will not be covering this type of data)
Data at individual level
• OLS estimation of equation (1) is infeasible
• For example, in the housing example
– P = 1 if a family owns a house and P = 0 if it does not own a house
– Substituting into the logit we obtain:
– Li = ln(1/0) if a family owns a house
– Li = ln(0/1) if a family does not own a house
– These expressions are meaningless – therefore we can’t use OLS
– We will have to use the maximum-likelihood (ML) method to estimate the parameters
Grouped / replicated data
• Leave out
• Second half of p 556 – 561
• Start again at 15.8
Logit model for ungrouped / individual data
- example
• Student’s final grade in an intermediate microeconomics course (Dependent
variable – Grade, Y=1 if final grade was an A, Y=0 if final grade was a B or a
C)
• Independent variables
– GPA – grade point average
– TUCE – score on exam given at beginning of term to test entering knowledge of
macroeconomics
– PSI – personalized system of instruction (1 if the new teaching method is used, 0
otherwise)
example
Example (continued)
• Logit model: Li = ln(Pi / (1 - Pi)) = β1 + β2GPAi + β3TUCEi + β4PSIi + ui

• Can’t use OLS / WLS – we have to use a nonlinear estimating procedure – ML

• Estimation is done in EViews
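The slides use EViews; as an illustrative alternative, here is a hedged sketch in Python using statsmodels, which ships the same Spector–Mazzeo grade data (GRADE, GPA, TUCE, PSI) used in this example. The dataset name and column layout are statsmodels conventions assumed by this sketch, not part of the slides:

```python
import statsmodels.api as sm

# Spector-Mazzeo grade data: GRADE (1 = A, 0 = B or C), GPA, TUCE, PSI
data = sm.datasets.spector.load_pandas()
y = data.endog                    # GRADE
X = sm.add_constant(data.exog)    # intercept + GPA, TUCE, PSI

# Maximum-likelihood estimation of the logit model
logit_res = sm.Logit(y, X).fit()
print(logit_res.summary())        # z statistics, McFadden R2, LR statistic
```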
Example (continued)
• Keep in mind:
• ML is a large-sample method, therefore the estimated standard errors are asymptotic
• Therefore, instead of using t statistics to evaluate statistical significance, we use the Z statistic (standard normal distribution); inferences are based on the normal table
Example (continued)
• The conventional R² is not meaningful in binary regressand models – we can make use of a pseudo R² (EViews reports the McFadden R²) or the count R²:

– Count R² = number of correct predictions / total number of observations
– Since the regressand in the logit model takes the value 1 or 0 – if the predicted probability is greater than 0.5 we classify it as 1, and if it is less than 0.5 we classify it as 0
– Then count the number of correct predictions and compute the count R² as above
– Keep in mind that goodness of fit is of secondary importance in these models – the expected signs and significance of the coefficients are more important

• To replicate the F-test used in linear regression models, we can use the likelihood ratio (LR) statistic. It follows the χ² distribution with df equal to the number of explanatory variables
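A sketch of how these measures could be computed from the statsmodels fit in the earlier sketch (attribute names such as prsquared, llr and llr_pvalue are statsmodels conventions, assumed here rather than taken from the slides):

```python
import statsmodels.api as sm
from scipy import stats

data = sm.datasets.spector.load_pandas()
y, X = data.endog, sm.add_constant(data.exog)
logit_res = sm.Logit(y, X).fit(disp=0)

# McFadden pseudo R2 and the LR statistic are reported by the results object
print("McFadden R2 :", round(logit_res.prsquared, 4))
print("LR statistic:", round(logit_res.llr, 4), "p-value:", logit_res.llr_pvalue)

# Count R2: classify predicted probabilities at the 0.5 cutoff,
# then take the share of correct predictions
y_hat = (logit_res.predict(X) > 0.5).astype(float)
print("Count R2    :", (y_hat == y).mean())

# The LR statistic is compared with the chi-square distribution, df = 3 regressors
print("chi2 critical value (5%, 3 df):", round(stats.chi2.ppf(0.95, 3), 3))
```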
Example (continued)
• Grade = -13.0213 + 2.8261GPA + 0.0951TUCE + 2.3786PSI
• If GPA increases by one unit, on average the estimated logit increases by about 2.83 units, ceteris paribus (a positive relationship – the same holds for the other explanatory variables)
• A more meaningful interpretation – the odds ratio
– Take the antilog of the slope coefficients, e.g. e^2.8261 ≈ 16.9 for GPA (see the sketch at the end of this slide)

– For a unit increase in GPA, students are more than 16 times (almost 17 times) as likely, in terms of odds, to get an A than students with a lower GPA, ceteris paribus

• TUCE is not statistically significant, but GPA and PSI are

• PSI: e^2.3786 ≈ 10.8 – students who are exposed to the new method of teaching are more than 10 times as likely, in terms of odds, to get an A than students who are not exposed to it, ceteris paribus
• The model is overall significant – the p-value of the LR statistic is < 0.05
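A small sketch of the odds-ratio interpretation: exponentiating the slope coefficients gives the factor by which the odds of an A change for a one-unit increase in each regressor (again assuming the statsmodels setup from the earlier sketches):

```python
import numpy as np
import statsmodels.api as sm

data = sm.datasets.spector.load_pandas()
y, X = data.endog, sm.add_constant(data.exog)
logit_res = sm.Logit(y, X).fit(disp=0)

# Antilog of each slope coefficient = multiplicative change in the odds of an A
# for a one-unit increase in that regressor, ceteris paribus
print(np.exp(logit_res.params))   # e.g. exp(2.8261) is roughly 16.9 for GPA
```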
Example (continued)
• Grade = -13.0213 + 2.8261GPA + 0.0951TUCE + 2.3786PSI
• If we are interested in the actual probability of student number 10 getting an A grade (a sketch computing this follows at the end of the slide):
– Logit(10) = -13.0213 + 2.8261(3.92) + 0.0951(29) + 2.3786(0) = 0.8166 (logit value)
– P(10) = 1 / (1 + e^-0.8166) ≈ 0.69 (probability)
– The student’s actual final grade was an A (a value of 1 for the regressand); the estimated probability of 0.69 is not exactly 1 but relatively close to it.

• To obtain the count R²:

– Obtain the actual, fitted, residual table
– Identify the incorrect predictions
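A sketch of both steps on this slide – computing the probability for the student-10 values quoted above (GPA = 3.92, TUCE = 29, PSI = 0) and building the actual-versus-fitted table used for the count R² (statsmodels setup assumed as before):

```python
import numpy as np
import statsmodels.api as sm

data = sm.datasets.spector.load_pandas()
y, X = data.endog, sm.add_constant(data.exog)
logit_res = sm.Logit(y, X).fit(disp=0)

# Probability of an A at GPA = 3.92, TUCE = 29, PSI = 0 (the student-10 values)
z = logit_res.params @ np.array([1.0, 3.92, 29.0, 0.0])   # const, GPA, TUCE, PSI
print("logit:", round(z, 4), " probability:", round(1.0 / (1.0 + np.exp(-z)), 4))

# Actual vs fitted table: flag the incorrect predictions at the 0.5 cutoff
for actual, p in zip(y, logit_res.predict(X)):
    flag = "" if (p > 0.5) == bool(actual) else "  <- incorrect prediction"
    print(f"actual = {int(actual)}  fitted P = {p:.3f}{flag}")
```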
Example (continued)
• Count R² = 26/32 = 0.8125
• McFadden R² = 0.374
• The two values are not directly comparable – but they give an indication of the orders of magnitude (and goodness of fit is not that important…..)
Probit model
• We know that we have to make use of a cumulative distribution function
(CDF) to explain the behavior of a binary dependent variable
• Logit model is not the only CDF that can be used
• Normal CDF has been found useful in some applications – known as the
probit model / normit model
• In principle, we can substitute the normal CDF in place of the logistic CDF, so that Pi = F(β1 + β2Xi), where F is the standard normal CDF
Probit model
• OR
• The probit model can be presented based on utility theory / the rational choice perspective on behaviour, developed by McFadden
• If we choose the normal distribution as the appropriate probability distribution, we can use the probit model
• The model is mathematically more difficult as it involves integrals
• For practical purposes, the logit and probit models give similar results
• The choice depends on ease of computation (computer packages)
Probit estimation with grouped data
• Leave out from bottom of p 567 to 570
• Start again with Probit model for ungrouped / individual data
Probit model for ungrouped / individual
data
• Repeat example on final grade in intermediate microeconomics course
Probit model example
• The results can be compared to the logit model estimated previously
• GPA and PSI are individually statistically significant and the model is overall significant
• We can’t compare the logit and probit regression coefficients directly
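A hedged sketch of the probit counterpart – the same specification estimated with the standard normal CDF (sm.Probit is the statsmodels class assumed here, mirroring the EViews estimation in the slides):

```python
import statsmodels.api as sm

data = sm.datasets.spector.load_pandas()
y, X = data.endog, sm.add_constant(data.exog)

# Same regressors, but the standard normal CDF replaces the logistic CDF
probit_res = sm.Probit(y, X).fit(disp=0)
print(probit_res.summary())
```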
Marginal effects of models
• In linear regression model – slope measures change in average value of Y for a
unit change in X, ceteris paribus
• In LPM – slope measures change in probability of event occurring as result of
unit change in X, ceteris paribus
• In the logit model – the slope measures the change in the log of the odds associated with a unit change in that variable, ceteris paribus. But the rate of change in the probability of the event happening is given by βk Pi(1 - Pi), where βk is the coefficient of the k-th regressor. In evaluating Pi, all the variables included in the analysis are involved
• In the probit model – the rate of change is more complicated and is given by βk φ(Zi), where φ is the density function of the standard normal variable and Zi is the regression model (index) used in the analysis
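A sketch showing how the marginal effects described above could be obtained in practice; get_margeff() is the statsmodels helper (an assumption of this sketch, not something named in the slides) that averages βk·Pi(1 − Pi) for the logit and βk·φ(Zi) for the probit over the sample observations:

```python
import statsmodels.api as sm

data = sm.datasets.spector.load_pandas()
y, X = data.endog, sm.add_constant(data.exog)

logit_res = sm.Logit(y, X).fit(disp=0)
probit_res = sm.Probit(y, X).fit(disp=0)

# Average marginal effects on the probability of an A:
# beta*P(1-P) for the logit and beta*phi(Z) for the probit, averaged over the sample
print(logit_res.get_margeff().summary())
print(probit_res.get_margeff().summary())
```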
Logit and probit models
• Which model is preferable between logit and probit models?
• The models are quite similar, but the logistic distribution has slightly fatter tails – therefore the conditional probability approaches 0 or 1 at a slower rate in the logit model than in the probit model
• There is no compelling reason to choose one over the other; many researchers choose the logit because of its comparative mathematical simplicity
• Remember that the coefficients of the two models are not directly comparable (they have to be adjusted before comparison because the underlying error distributions have different variances)
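To illustrate the variance adjustment: the logistic distribution has variance π²/3 while the standard normal has variance 1, so logit coefficients tend to be roughly π/√3 ≈ 1.8 times the probit coefficients (a rule of thumb, not an exact relationship). A sketch comparing the two sets of estimates under the statsmodels setup assumed earlier:

```python
import numpy as np
import statsmodels.api as sm

data = sm.datasets.spector.load_pandas()
y, X = data.endog, sm.add_constant(data.exog)

logit_b = sm.Logit(y, X).fit(disp=0).params
probit_b = sm.Probit(y, X).fit(disp=0).params

# Ratio of the estimates, compared with the theoretical scaling factor pi/sqrt(3)
print(logit_b / probit_b)          # ratios typically in the 1.6-1.8 range
print(np.pi / np.sqrt(3))          # about 1.81
```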
Rest of the chapter
• Leave out……

• Try to work through the practical before the tutorial sessions this week.
Next week…..
• No Lecture or tutorials
• Semester Test 2
– Tuesday 19 September, 18:00 in Eles 201 and 202
– Chapters 14 (only theory) and 15 (theory and practical)
– 50 Marks, 90 Minutes
Logit vs. Probit – Fatter tails (figure: logistic and standard normal CDFs compared)