0% found this document useful (0 votes)

46 views9 pages

Logistic Regression: 1 Applied Methods in Biostatistics - Week 2 2019

1. Logistic regression is used to model binary outcome variables and extends linear regression to non-normally distributed outcomes. It is applied to outcomes such as disease presence/absence. 2. The logistic regression model relates the log-odds of the outcome (logit) to the predictor variables. It allows estimation of odds ratios to quantify the effect of predictors on the outcome. 3. An example uses logistic regression to predict lymph node metastasis in prostate cancer patients based on age, serum acid level, x-ray results, tumor size, and grade. The odds ratios estimated from the model quantify the effect of each predictor on metastasis risk.

Uploaded by

IuliaOpris

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views9 pages

Logistic Regression: 1 Applied Methods in Biostatistics - Week 2 2019

Uploaded by

IuliaOpris

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

05‐11‐2020

Logistic regression

1 Applied Methods in Biostatistics - Week 2 2019

Generalization of the
Linear regression model

In many practical situation linear regression model is inadequate.

For example in case where: the outcome has two possible responses (binary data)
or the outcome represents count data (positive integers)
 it makes no sense to model the outcome as normally distributed
Generalized linear models (GLMs) are an extension of linear regression
Regression models to model non-normally distributed outcome variables.

2 Applied Methods in Biostatistics - Week 2

1
05‐11‐2020

Binary outcome variable

In many studies the outcome of interest is the presence or absence of some condition.
Examples:
 smoking status
 responding to a treatment
 presence or absence of cancer
 survival status of a subject after a surgery: dead or alive
 having myocardial infarction or CHD: yes/no
 success (’yes’/1) and failure (’no’/0) are often used as generic terms of the two possible
responses
the interest is in quantifying the risk or odds of success or occurrence of some event of
interest

3 Applied Methods in Biostatistics - Week 2

Example: Prostatic cancer

A study of 53 prostate cancer patients. Before surgery two continuous exposure variables (age,
serumacid, phosphatase) and three categorical (binary) exposure variables (X-ray, tumour size,
tumour grade) were measured. The patients then had surgery (laparotomy) to determine whether
there was nodal involvement, i. e. lymph node metastases (NI = 1) or not (NI = 0) in the cancer to
adopt the treatment regimen for the patient.

Pat NI Age Acid Xray Size Grade

(pos) (large) (serious)
1 0 66 0.48 0 0 0
2 0 68 0.56 0 0 0
…
52 1 64 0.89 1 1 0
53 1 68 1.26 1 1 1
Brown, B.W. (1980)
4 Applied Methods in Biostatistics - Week 2

2
05‐11‐2020

Risk outcome: odds

Studies
• Case-control / Cross sectional
• Cohort: cumulative incidence rate

Simple (exploratory) inference

• Confidence intervals & hypothesis tests
• comparing risks between exposed/unexposed groups
• Test for association (two or more groups)
• Chi-square-tests/ Fishers exact tests

5 Applied Methods in Biostatistics - Week 2

Logistic regression model

The model is based on:

• Relationship
• logit (p) = log (p/(1-p))
= log-odds (p) = β0 + β1 x1 + β2x2 + … + βkxk
• E.g: p not linear in βs, but logit(p) linear
• Data from binomial distribution

Inference similar to linear model

• Allows many categorical & numerical indep. variables

Estimation & inference: computer

6 Applied Methods in Biostatistics - Week 2

3
05‐11‐2020

Purposes of logstic regression

Effect estimation
• exp (β1) = OR1 = Effect of variable
• Stata: logistic calculates effect estimates exp (β1) directly!

Prediction:
• Best model for predicting risk p of disease for new cases
• Stata: logit calculates parameter estimates of β0, β1, β2, …
• Rule of thumb: at least 10 cases and 10 controls for each indep. var. in model

7 Applied Methods in Biostatistics - Week 2

Estimation:
Interpretation of the coefficients
Interpretations of coefficients is similar to linear regression. However since the logit is linear, the coefficients we
have an analogous interpretation on the logit or log odds scale.
Logit (πNI(Xray, Size, Age)) = β0 + β1Xray + β2 Size + β3 Age

Binary exposure (Comparing Xray examination (1 = positive finding, 0 = negative finding) for Size and Age
fixed)
| , , β0 + β1Xray + β2 Size + β3 Age
𝑂𝑅 xray 𝑒 β1
| , , β0 + β2 Size + β3 Age

Continous exposure variable

| , , β0 + β1Xray + β2 Size + β3 Age+1
𝑂𝑅age 𝑒 β3
| , , β0 + β2 Size + β3 Age

8 Applied Methods in Biostatistics - Week 2

4
05‐11‐2020

Estimation Example (1):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age
. logit NI Xray Size Age

Iteration 0: log likelihood = -35.126076

Iteration 1: log likelihood = -26.176433
Iteration 2: log likelihood = -26.042916
Iteration 3: log likelihood = -26.04263
Iteration 4: log likelihood = -26.04263
Logistic regression Number of obs = 53
LR chi2(3) = 18.17
Prob > chi2 = 0.0004
Log likelihood = -26.04263 Pseudo R2 = 0.2586
-------------------------------------------------------------------------
NI | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------+-----------------------------------------------------------------
Xray | 2.175658 .7644116 2.85 0.004 .6774385 3.673877
Size | 1.596897 .7079243 2.26 0.024 .2093913 2.984403
Age | -.0604558 .054447 -1.11 0.267 -.16717 .0462584
_cons | 1.518419 3.22939 0.47 0.638 -4.811069 7.847908
------------------------------------------------------------------------

9 Applied Methods in Biostatistics - Week 2

Estimation Example (2):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age

Logistic regression Number of obs = 53
LR chi2(3) = 18.17
Prob > chi2 = 0.0004
Log likelihood = -26.04263 Pseudo R2 = 0.2586
-------------------------------------------------------------------------
NI | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval]
------+------------------------------------------------------------------
Xray | 8.807976 6.732919 2.85 0.004 1.968828 39.40437
Size | 4.937689 3.49551 2.26 0.024 1.232927 19.7747
Age | .9413353 .0512529 -1.11 0.267 .8460557 1.047345
-------------------------------------------------------------------------

10 Applied Methods in Biostatistics - Week 2

5
05‐11‐2020

Inferences - Testing overall regression

Hypotheses: H0 : β1 = β2 = . . . = βn = 0
(e. g., Xray, Size and Age are not of predictable value for prostatic cancer

Likelihood ratio (LR) statistic compares two models

1. minimal model = logistic regression model under H0
2. full model = logistic regression model taking account for (all) the exposure variables of interest
 for each model the maximum likelihood function L is calculated:
1. Lm := L( 𝛽 0) for the minimal model
2. Lf := L(𝛽 0 , 𝛽 1 , 𝛽 2. … 𝛽 n) for the full model

Likelihood ratio statistic

LR = 2{log(Lf ) − log(Lm)} = 2 log~ chi square distributed

11 Applied Methods in Biostatistics - Week 2

Estimation Example (overall test):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age

12 Applied Methods in Biostatistics - Week 2

6
05‐11‐2020

Inferences - Wald-test
Which factors had a significant effect on the dependent variable adjusted for all the other
independent variables?

 Hypotheses: H 0 :  i  0 vs . H 1 :  i  0

ˆ i
 Test statistics:  N(0,1)-distributed
Z i
 ~
se ˆ i

with Z ~  -distributed,
2 2

degree of freedom=1
13 Applied Methods in Biostatistics - Week 2

Estimation Example (Wald test):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age

14 Applied Methods in Biostatistics - Week 2

7
05‐11‐2020

Maximum likelihood estimation

The idea behind: determine the parameters that maximize the probability
(likelihood) of the sample data.
From a statistical point of view, the method of maximum likelihood is considered
to be more robust and yields estimators with good statistical properties.
An efficient methods for quantifying uncertainty through confidence bounds.
Although the methodology for maximum likelihood estimation is simple, the
implementation is mathematically intense. Using today's computer power,
however, mathematical complexity is not a big obstacle.
Maximize the likelihood function L(ϑ) is equivalent to maximize the log-
Likelihood-function l(ϑ)

15 Applied Methods in Biostatistics - Week 2

Estimation Example (ML estimation):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age

16 Applied Methods in Biostatistics - Week 2

8
05‐11‐2020

Prediction
The logistic regression approach is suitable for predicting success probability or the outcome risk
for new cases in dependence of exposures
Example: Prostatic cancer

𝐿𝑜𝑔𝑖𝑡 𝜋𝑁𝐼 𝑋𝑟𝑎𝑦, 𝑆𝑖𝑧𝑒, 𝐴𝑔𝑒 𝛽0 𝛽1𝑋𝑟𝑎𝑦 𝛽2 𝑆𝑖𝑧𝑒 𝛽3 𝐴𝑔𝑒

1.52 2.18Xray 1.60Size-0.06Age

Xray Size Age logit 𝜋𝑁𝐼 π𝑁𝐼 𝑃(NI = 1)

0 0 68 -2.56 0.072
1 0 68 -0.38 0.515
0 1 51 0.06 0.406
1 1 57 1.88 0.868

17 Applied Methods in Biostatistics - Week 2

Regresi Logistik
No ratings yet
Regresi Logistik
34 pages
Home Lesson 15: Logistic, Poisson & Nonlinear Regression
No ratings yet
Home Lesson 15: Logistic, Poisson & Nonlinear Regression
32 pages
Laboratory 10
No ratings yet
Laboratory 10
8 pages
Bio2 Module 5 - Logistic Regression
No ratings yet
Bio2 Module 5 - Logistic Regression
19 pages
Logistic Regression-1
No ratings yet
Logistic Regression-1
27 pages
Logistic Regression & Practice
100% (1)
Logistic Regression & Practice
51 pages
Biostatistics An Applied Introduction for the Public Health Practitioner 1st Edition Bush Test Bankpdf download
100% (5)
Biostatistics An Applied Introduction for the Public Health Practitioner 1st Edition Bush Test Bankpdf download
39 pages
Introduction To Logistic Regression: Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein
No ratings yet
Introduction To Logistic Regression: Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein
36 pages
Lecture13 PDF
No ratings yet
Lecture13 PDF
48 pages
Hnu B215 Biostatistics For Health Sciences
No ratings yet
Hnu B215 Biostatistics For Health Sciences
13 pages
Skin
No ratings yet
Skin
9 pages
5.1) Binary logistic regression
No ratings yet
5.1) Binary logistic regression
32 pages
Applied Statistics II-2 and III
100% (1)
Applied Statistics II-2 and III
59 pages
Logistic Regression
No ratings yet
Logistic Regression
9 pages
Logistic Regression A Self Learning Text (Statistics for Biology and Health) 3rd ed. 2010 Edition Latest Edition Download
100% (12)
Logistic Regression A Self Learning Text (Statistics for Biology and Health) 3rd ed. 2010 Edition Latest Edition Download
16 pages
Logistic Regression Analysis
No ratings yet
Logistic Regression Analysis
48 pages
Repeated Measures, Part 2: Charles E. Mcculloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, Ucsf
No ratings yet
Repeated Measures, Part 2: Charles E. Mcculloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, Ucsf
29 pages
Minitab Tip Sheet 15
No ratings yet
Minitab Tip Sheet 15
5 pages
Logistic Regression-Advanced Biostat-PDF(1)
No ratings yet
Logistic Regression-Advanced Biostat-PDF(1)
86 pages
ppt4
No ratings yet
ppt4
54 pages
Modeling Ordinal Categorical Data (Agresti)
No ratings yet
Modeling Ordinal Categorical Data (Agresti)
71 pages
Psy 512 Logistic Regression
No ratings yet
Psy 512 Logistic Regression
12 pages
Logistic Regression
100% (1)
Logistic Regression
37 pages
Sas 11 Inferential Stat
No ratings yet
Sas 11 Inferential Stat
6 pages
HSU B301 BIOSTATISTICS FOR HEALTH SCIENCES Main Exam
No ratings yet
HSU B301 BIOSTATISTICS FOR HEALTH SCIENCES Main Exam
12 pages
ISYE6414 FA23 Practice Midterm Exam 2 Solutions
No ratings yet
ISYE6414 FA23 Practice Midterm Exam 2 Solutions
6 pages
Lecture 10
No ratings yet
Lecture 10
13 pages
CUHK STAT5102 Ch7
No ratings yet
CUHK STAT5102 Ch7
33 pages
Regression Logistic Regression
100% (1)
Regression Logistic Regression
37 pages
Biostats II 2013 Lecture 1
No ratings yet
Biostats II 2013 Lecture 1
19 pages
Regresion Logistica
No ratings yet
Regresion Logistica
71 pages
BIOSTAT Assignment
No ratings yet
BIOSTAT Assignment
6 pages
Biostat Practice 23 07 Categorical
No ratings yet
Biostat Practice 23 07 Categorical
18 pages
L5 Logistic Regression (2011)
100% (1)
L5 Logistic Regression (2011)
55 pages
Bioepi Finals Module 5
No ratings yet
Bioepi Finals Module 5
84 pages
Course_outline_PBH711(1)
No ratings yet
Course_outline_PBH711(1)
2 pages
Biostatistics1718 1 PDF
No ratings yet
Biostatistics1718 1 PDF
30 pages
Lecture 1 - Introduction to Statistics for Health Science
No ratings yet
Lecture 1 - Introduction to Statistics for Health Science
47 pages
Logistic regression_2021 ch-8
No ratings yet
Logistic regression_2021 ch-8
52 pages
Logistic Regression
No ratings yet
Logistic Regression
49 pages
Logistic Regression
No ratings yet
Logistic Regression
49 pages
13. Review of Logistic and Poisson Regression Models
No ratings yet
13. Review of Logistic and Poisson Regression Models
15 pages
Logistics Regression
No ratings yet
Logistics Regression
30 pages
Modeling Ordered Categorical Data: James J. Dignam
No ratings yet
Modeling Ordered Categorical Data: James J. Dignam
27 pages
Study Designs: Sample Bias
No ratings yet
Study Designs: Sample Bias
4 pages
Lec-4 Logistic Regression
No ratings yet
Lec-4 Logistic Regression
54 pages
Logistic Regression
No ratings yet
Logistic Regression
8 pages
Course - Outline - PBH 711.2 - Spring - 2024
No ratings yet
Course - Outline - PBH 711.2 - Spring - 2024
3 pages
604
No ratings yet
604
32 pages
Logistic Regression
0% (1)
Logistic Regression
4 pages
Biostatistics: ABSITE Review Series Sarah Abdulla
No ratings yet
Biostatistics: ABSITE Review Series Sarah Abdulla
30 pages
Logistic 6
No ratings yet
Logistic 6
17 pages
Lect7 Math231
No ratings yet
Lect7 Math231
29 pages
7.logistic.randomintercept
No ratings yet
7.logistic.randomintercept
48 pages
HFHFH
No ratings yet
HFHFH
37 pages
BMC Medical Research Methodology: Bias in Odds Ratios by Logistic Regression Modelling and Sample Size
No ratings yet
BMC Medical Research Methodology: Bias in Odds Ratios by Logistic Regression Modelling and Sample Size
5 pages
18Logistic regression yilma
No ratings yet
18Logistic regression yilma
88 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
High-Dimensional Covariance Estimation: With High-Dimensional Data
From Everand
High-Dimensional Covariance Estimation: With High-Dimensional Data
Mohsen Pourahmadi
No ratings yet
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet
1st Periodic Test Syllabus X
No ratings yet
1st Periodic Test Syllabus X
1 page
Vision Statement Mission Statement
No ratings yet
Vision Statement Mission Statement
8 pages
ISO 2631-1-1997 Amd1-2010 - Mechanical Vibration and Shock - Evaluation of Human Exposure - General
No ratings yet
ISO 2631-1-1997 Amd1-2010 - Mechanical Vibration and Shock - Evaluation of Human Exposure - General
8 pages
Family Ties and Peso Signs: Challenges For Career Counseling in The Philippines
No ratings yet
Family Ties and Peso Signs: Challenges For Career Counseling in The Philippines
11 pages
Heat / Temperature Control: Maintaining A Stable Fluid Film
100% (3)
Heat / Temperature Control: Maintaining A Stable Fluid Film
94 pages
Accounting Capital+Stock+Transactions
No ratings yet
Accounting Capital+Stock+Transactions
17 pages
SIST-ISO-2491-1996 Chavetas
No ratings yet
SIST-ISO-2491-1996 Chavetas
7 pages
REVISTA Europe11
No ratings yet
REVISTA Europe11
15 pages
Present Perfect Speaking Task
No ratings yet
Present Perfect Speaking Task
1 page
Research Study On Social Media
100% (1)
Research Study On Social Media
17 pages
Bài Thuyết Trình NLBĐS
No ratings yet
Bài Thuyết Trình NLBĐS
21 pages
Risk Ass Hazob Safe Design
No ratings yet
Risk Ass Hazob Safe Design
19 pages
Background of Ifrs
100% (1)
Background of Ifrs
2 pages
Production of Social Studiessse303
No ratings yet
Production of Social Studiessse303
63 pages
unit 3
No ratings yet
unit 3
8 pages
02 Profit Planning
100% (1)
02 Profit Planning
32 pages
Shrinkage-Swelling Potential of Soil
No ratings yet
Shrinkage-Swelling Potential of Soil
70 pages
Objectives:: Pin Diagram of Pic 16F84A
No ratings yet
Objectives:: Pin Diagram of Pic 16F84A
18 pages
(HHI-TEC-0513-R0) Speed Limit for Dual Fuel Engines
No ratings yet
(HHI-TEC-0513-R0) Speed Limit for Dual Fuel Engines
3 pages
Pipeline Deflection Inspection - Pipe Deflectometers
No ratings yet
Pipeline Deflection Inspection - Pipe Deflectometers
2 pages
Most Essential Learning Competencies in English 7 10 - Compress
100% (1)
Most Essential Learning Competencies in English 7 10 - Compress
7 pages
DIMALANTA, LOIS G. - 3B (MycoViro Individual Activity)
No ratings yet
DIMALANTA, LOIS G. - 3B (MycoViro Individual Activity)
11 pages
Bidder Certification of Compliance: 1. General Information About The Bidder
No ratings yet
Bidder Certification of Compliance: 1. General Information About The Bidder
10 pages
Structural Analysis Formula Sheet
No ratings yet
Structural Analysis Formula Sheet
3 pages
Session 1
No ratings yet
Session 1
18 pages
2.safety Standard Checklist
No ratings yet
2.safety Standard Checklist
1 page
DR René KABERA :this Situation Was Alarming
No ratings yet
DR René KABERA :this Situation Was Alarming
2 pages
Suspension Systems
100% (1)
Suspension Systems
118 pages
Process Management Handbook
No ratings yet
Process Management Handbook
46 pages
RAW Terbaru Update 29-08-2023
No ratings yet
RAW Terbaru Update 29-08-2023
4 pages

Logistic Regression: 1 Applied Methods in Biostatistics - Week 2 2019

Uploaded by

Logistic Regression: 1 Applied Methods in Biostatistics - Week 2 2019

Uploaded by

05‐11‐2020

1 Applied Methods in Biostatistics - Week 2 2019

In many practical situation linear regression model is inadequate.

2 Applied Methods in Biostatistics - Week 2

Binary outcome variable

3 Applied Methods in Biostatistics - Week 2

Example: Prostatic cancer

Pat NI Age Acid Xray Size Grade

Risk outcome: odds

Simple (exploratory) inference

5 Applied Methods in Biostatistics - Week 2

Logistic regression model

The model is based on:

Inference similar to linear model

Estimation & inference: computer

6 Applied Methods in Biostatistics - Week 2

Purposes of logstic regression

7 Applied Methods in Biostatistics - Week 2

Continous exposure variable

8 Applied Methods in Biostatistics - Week 2

Estimation Example (1):

Iteration 0: log likelihood = -35.126076

9 Applied Methods in Biostatistics - Week 2

Estimation Example (2):

. logistic NI Xray Size Age

10 Applied Methods in Biostatistics - Week 2

Inferences - Testing overall regression

Likelihood ratio (LR) statistic compares two models

Likelihood ratio statistic

11 Applied Methods in Biostatistics - Week 2

Estimation Example (overall test):

. logistic NI Xray Size Age

12 Applied Methods in Biostatistics - Week 2

Estimation Example (Wald test):

. logistic NI Xray Size Age

14 Applied Methods in Biostatistics - Week 2

Maximum likelihood estimation

15 Applied Methods in Biostatistics - Week 2

Estimation Example (ML estimation):

. logistic NI Xray Size Age

16 Applied Methods in Biostatistics - Week 2

𝐿𝑜𝑔𝑖𝑡 𝜋𝑁𝐼 𝑋𝑟𝑎𝑦, 𝑆𝑖𝑧𝑒, 𝐴𝑔𝑒 𝛽0 𝛽1𝑋𝑟𝑎𝑦 𝛽2 𝑆𝑖𝑧𝑒 𝛽3 𝐴𝑔𝑒

Xray Size Age logit 𝜋𝑁𝐼 π𝑁𝐼 𝑃(NI = 1)

17 Applied Methods in Biostatistics - Week 2

You might also like