11, 12. Predictive Analysis

The document provides an overview of linear and multiple regression analysis, including key concepts such as correlation, assumptions, and statistical measures. It discusses the importance of regression analysis in understanding relationships between variables, predicting outcomes, and controlling for other variables. Additionally, it introduces Structural Equation Modeling (SEM) as a multivariate technique used in marketing research to analyze relationships between observed and latent variables.

Uploaded by

2747-Sakshi Tanwar

Linear and Multiple Regression
Dr. Rajarshi Debnath
Marketing Area
FORE School of Management, New Delhi
Agenda

• Linear Regression
• Multiple Regression

Ŷ = a + bX
Product Moment Correlation
• The product moment correlation, r, summarizes the strength of association between two metric (interval or ratio scaled) variables, say X and Y.
• It is an index used to determine whether a linear or straight-line relationship exists between X and Y.
• As it was originally proposed by Karl Pearson, it is also known as the Pearson correlation coefficient. It is also referred to as simple correlation, bivariate correlation, or merely the correlation coefficient.
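As a minimal sketch of how r is computed, the function below divides the sum of cross-products of deviations by the square root of the product of the two sums of squared deviations. The function name and the five data points are hypothetical illustrations, not taken from the slides.

```python
import math


def pearson_r(x, y):
    """Product moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Cross-products and squared deviations from the means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical illustration data (not from the slides).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)
print(f"r = {r:.3f}")
```

A value near +1 or −1 points to a strong straight-line relationship; a value near 0 says only that no linear relationship is visible, not that the variables are unrelated.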
Nonmetric Correlation
• If the nonmetric variables are ordinal and numeric, Spearman's rho, ρs, and Kendall's tau, τ, are two measures of nonmetric correlation that can be used to examine the correlation between them.
• Both these measures use rankings rather than the absolute values of the variables, and the basic concepts underlying them are quite similar. Both vary from −1.0 to +1.0.
• In the absence of ties, Spearman's ρs yields a closer approximation to the Pearson product moment correlation coefficient, ρ, than Kendall's τ. In these cases, the absolute magnitude of τ tends to be smaller than Pearson's ρ.
• When the data contain a large number of tied ranks, Kendall's τ seems more appropriate.
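The two rank-based measures can be sketched as follows, assuming no tied values. Spearman's ρs uses the shortcut formula on rank differences, and Kendall's τ is computed as tau-a (concordant minus discordant pairs, divided by the number of pairs). The function names and data are hypothetical.

```python
def ranks(values):
    """Rank values from 1 (smallest) to n; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(x, y):
    """Spearman's rho via the no-ties shortcut 1 - 6*sum(d^2) / (n(n^2 - 1))."""
    n = len(x)
    d2 = sum((rx - ry) ** 2 for rx, ry in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n * n - 1))


def kendall_tau(x, y):
    """Kendall's tau-a: (concordant - discordant) / number of pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (x[i] - x[j]) * (y[i] - y[j])
            if product > 0:
                score += 1      # pair ordered the same way on both variables
            elif product < 0:
                score -= 1      # pair ordered in opposite ways
    return score / (n * (n - 1) / 2)


# Hypothetical scores from five respondents (no tied values).
x = [10, 20, 30, 40, 50]
y = [12, 25, 21, 48, 39]
rho = spearman_rho(x, y)
tau = kendall_tau(x, y)
print(f"rho = {rho:.2f}, tau = {tau:.2f}")
```

On this data τ is smaller in absolute value than ρs. Tied ranks would need the tie-corrected variants, which this sketch does not implement.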
Regression Analysis
Regression analysis examines associative relationships between a metric dependent variable and one or more independent variables in the following ways:
• Determine whether the independent variables explain a significant variation in the dependent variable (whether a relationship exists).
• Determine how much of the variation in the dependent variable can be explained by the independent variables (the strength of the relationship).
• Determine the structure or form of the relationship (the mathematical equation relating the independent and dependent variables).
• Predict the values of the dependent variable.
• Control for other independent variables when evaluating the contributions of a specific variable or set of variables.
Regression analysis is concerned with the nature and degree of association between variables and does not imply or assume any causality.
Conducting Bivariate Regression Analysis: Plot the Scatter Diagram
• A scatter diagram, or scattergram, is a plot of the values of two variables for all the cases or observations.
Conducting Bivariate Regression Analysis: Which Straight Line is Best?
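The usual answer to this question is the least-squares line: the line Ŷ = a + bX that minimizes the sum of squared vertical distances between the observed points and the line. The sketch below assumes that criterion; the function names, the data, and the competing line are hypothetical.

```python
def fit_line(x, y):
    """Return (a, b) for the least-squares line Y-hat = a + b*X."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b = sxy / sxx              # slope
    a = mean_y - b * mean_x    # intercept
    return a, b


def sum_squared_errors(x, y, a, b):
    """Sum of squared vertical distances from the points to the line."""
    return sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))


# Hypothetical illustration data (not from the slides).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

a, b = fit_line(x, y)
sse_best = sum_squared_errors(x, y, a, b)
sse_other = sum_squared_errors(x, y, 2.0, 0.7)  # an arbitrary competing line
print(f"least squares: a = {a:.2f}, b = {b:.2f}, SSE = {sse_best:.2f}")
print(f"competing line SSE = {sse_other:.2f}")
```

Any other choice of intercept and slope gives a larger sum of squared errors on the same data, which is what "best" means under this criterion.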
Assumptions
• The error term is normally distributed. For each fixed value of X, the
distribution of Y is normal.
• The means of all these normal distributions of Y, given X, lie on a
straight line with slope b.
• The mean of the error term is 0.
• The variance of the error term is constant. This variance does not
depend on the values assumed by X.
• The error terms are uncorrelated. In other words, the observations
have been drawn independently.
Important Tables in Linear Regression
• Model (fit) summary
• ANOVA (choice based)
• Model Coefficients
• Residual Statistics (only if Durbin-Watson is implemented)
• Assumption checks
Model Summary
• r: Correlation
• r²: Proportion (%) of variance in the dependent variable explained by the model
• Adjusted r²: Presents a better estimate of r² in the population
• Standard Error of Estimate: Standard deviation of the residuals, that is, of the observed values of the dependent variable around their predicted values
• Durbin-Watson: A test statistic used to detect the presence of autocorrelation in the residuals
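A minimal sketch of how the model summary quantities can be computed for a bivariate regression follows. It assumes the standard definitions: adjusted r² = 1 − (1 − r²)(n − 1)/(n − k − 1), standard error of estimate = √(SSresidual/(n − k − 1)), and Durbin-Watson = Σ(e_t − e_{t−1})² / Σe_t². The function name and data are hypothetical.

```python
import math


def model_summary(x, y):
    """Model summary statistics for a bivariate least-squares regression."""
    n, k = len(x), 1  # k = number of independent variables
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b = sxy / sxx
    a = mean_y - b * mean_x

    residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]
    ss_res = sum(e * e for e in residuals)
    ss_total = sum((yi - mean_y) ** 2 for yi in y)

    r2 = 1 - ss_res / ss_total
    # Squared differences between successive residuals (order matters).
    successive = sum((residuals[t] - residuals[t - 1]) ** 2 for t in range(1, n))
    return {
        "r": math.copysign(math.sqrt(r2), b),
        "r2": r2,
        "adjusted_r2": 1 - (1 - r2) * (n - 1) / (n - k - 1),
        "std_error_estimate": math.sqrt(ss_res / (n - k - 1)),
        "durbin_watson": successive / ss_res,
    }


# Hypothetical illustration data, assumed to be in time order.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
summary = model_summary(x, y)
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

Durbin-Watson ranges from 0 to 4, and values near 2 suggest little first-order autocorrelation. The statistic is only meaningful when the cases have a natural order, such as time.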
ANOVA
• Sum of Squares: For regression, it is the between-group sum of squares; for residuals, the within-group sum of squares.
• DF: For regression, the number of independent variables (1 in this case). For residuals, the number of subjects minus the number of independent variables minus 1.
• Mean Square: Sum of squares divided by degrees of freedom.
• F: Mean square regression divided by mean square residual.
• Sig: Likelihood that a result at least this extreme could occur by chance if there were no relationship.
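The ANOVA entries can be sketched for the bivariate case as below. The Sig value is left out because it requires the F distribution, which the Python standard library does not provide. The function name and data are hypothetical.

```python
def anova_table(x, y):
    """Regression ANOVA entries for a bivariate least-squares fit."""
    n, k = len(x), 1  # k = number of independent variables
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b = sxy / sxx
    a = mean_y - b * mean_x
    predicted = [a + b * xi for xi in x]

    ss_reg = sum((p - mean_y) ** 2 for p in predicted)           # explained
    ss_res = sum((yi - p) ** 2 for yi, p in zip(y, predicted))   # unexplained
    df_reg, df_res = k, n - k - 1
    ms_reg, ms_res = ss_reg / df_reg, ss_res / df_res
    return {
        "ss_regression": ss_reg, "ss_residual": ss_res,
        "df_regression": df_reg, "df_residual": df_res,
        "ms_regression": ms_reg, "ms_residual": ms_res,
        "F": ms_reg / ms_res,
    }


# Hypothetical illustration data (not from the slides).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
table = anova_table(x, y)
for name, value in table.items():
    print(f"{name}: {value:.4f}")
```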

Coefficients
• B: Coefficient and constant for the linear regression equation.
• Std. Error: Standard error of B, a measure of the stability or sampling error of the B values. It is the standard deviation of the B values across a large number of samples drawn from the same population.
• Beta: Standardized regression coefficients.
• t: B divided by the standard error of B.
• Sig: Likelihood that this result could occur by chance.
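The entries of the coefficients table can be sketched for a bivariate regression, assuming the standard formulas SE(b) = √(MSresidual/Sxx), SE(a) = √(MSresidual·(1/n + X̄²/Sxx)), and Beta = b·√(Sxx/Syy). The function name and data are hypothetical, and Sig is omitted because it needs the t distribution.

```python
import math


def coefficient_table(x, y):
    """B, Std. Error, Beta and t for a bivariate least-squares fit."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    b = sxy / sxx
    a = mean_y - b * mean_x

    ss_res = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    ms_res = ss_res / (n - 2)  # residual mean square, n - k - 1 with k = 1
    se_b = math.sqrt(ms_res / sxx)
    se_a = math.sqrt(ms_res * (1 / n + mean_x ** 2 / sxx))
    return {
        "constant": {"B": a, "std_error": se_a, "t": a / se_a},
        "slope": {"B": b, "std_error": se_b,
                  "beta": b * math.sqrt(sxx / syy), "t": b / se_b},
    }


# Hypothetical illustration data (not from the slides).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
coefficients = coefficient_table(x, y)
for row, entries in coefficients.items():
    print(row, {name: round(value, 4) for name, value in entries.items()})
```

In the bivariate case the standardized coefficient Beta equals the correlation r, and the squared t value of the slope equals the F value of the ANOVA table.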

Case Study: Medical Practitioner
Identify the relationship between:
• Job Satisfaction and Burnout

Multicollinearity
• Multicollinearity (also collinearity) is a phenomenon in which two or more predictor variables in a multiple regression model are highly correlated, meaning that one can be linearly predicted from the others with a substantial degree of accuracy.
• A variance inflation factor (VIF) > 5 is a common rule of thumb for flagging problematic multicollinearity.
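For predictor j, the VIF is 1/(1 − R²j), where R²j comes from regressing that predictor on all the other predictors. The sketch below covers only the simplest case of two predictors, where R²j is just their squared correlation. The function name and both data sets are hypothetical.

```python
def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model.

    With exactly two predictors, regressing one on the other gives an
    R-squared equal to their squared correlation, so VIF = 1 / (1 - r^2).
    """
    n = len(x1)
    mean_1 = sum(x1) / n
    mean_2 = sum(x2) / n
    s12 = sum((u - mean_1) * (v - mean_2) for u, v in zip(x1, x2))
    s11 = sum((u - mean_1) ** 2 for u in x1)
    s22 = sum((v - mean_2) ** 2 for v in x2)
    r_squared = s12 ** 2 / (s11 * s22)
    return 1 / (1 - r_squared)


# Hypothetical predictors: moderately related vs. nearly redundant.
x1 = [1, 2, 3, 4, 5]
x2_moderate = [2, 4, 5, 4, 5]
x2_redundant = [1.1, 2.0, 2.9, 4.2, 5.0]

vif_moderate = vif_two_predictors(x1, x2_moderate)
vif_redundant = vif_two_predictors(x1, x2_redundant)
print(f"moderate: {vif_moderate:.2f}, redundant: {vif_redundant:.1f}")
```

The first pair stays below the threshold of 5 while the nearly redundant pair exceeds it by a wide margin. With more than two predictors, each R²j requires its own auxiliary multiple regression.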
Linear regression makes several key assumptions:
• Assumption #0: Measurement of variable
• Assumption #1: Linear relationship
• Assumption #2: Multivariate normality
• Assumption #3: No or little multicollinearity
• Assumption #4: No auto-correlation
• Assumption #5: Homoscedasticity
Objectives of Regression Analysis
• Prediction
• Explanation: Magnitude, Sign, and Significance
Research Design
• Sample size: 1:10 (variable:sample) minimum
• Variables: Metric
Multiple Regression (1 of 2)
The general form of the multiple regression model is as follows:

Y = β₀ + β₁X₁ + β₂X₂ + β₃X₃ + … + βₖXₖ + e

which is estimated by the following equation:

Ŷ = a + b₁X₁ + b₂X₂ + b₃X₃ + … + bₖXₖ

As before, the coefficient a represents the intercept, but the b's are now the partial regression coefficients.
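A minimal sketch of estimating a and the b's by least squares follows, solving the normal equations (X'X)b = X'y with a small Gauss-Jordan routine so that no external library is needed. The function names are hypothetical, and the data are generated exactly from Y = 1 + 2X₁ + 3X₂ so the recovered coefficients are easy to check.

```python
def solve(matrix, rhs):
    """Solve a small linear system by Gauss-Jordan elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        # Bring the largest remaining entry of this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fit_multiple(rows, y):
    """Return [a, b1, ..., bk] from the normal equations (X'X) b = X'y."""
    design = [[1.0] + list(row) for row in rows]  # leading 1 for the intercept
    p = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(p)]
    return solve(xtx, xty)


# Hypothetical data built from Y = 1 + 2*X1 + 3*X2 with no error term.
rows = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6)]
y = [1 + 2 * x1 + 3 * x2 for x1, x2 in rows]

estimates = fit_multiple(rows, y)
print([round(value, 6) for value in estimates])
```

Statistical packages typically use numerically more stable decompositions than the normal equations; the sketch is meant only to show what is being estimated.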
Statistics Associated with Multiple Regression (1 of 2)
• Adjusted R². R², the coefficient of multiple determination, is adjusted for the number of independent variables and the sample size to account for diminishing returns. After the first few variables, the additional independent variables do not make much contribution.
• Coefficient of multiple determination. The strength of association in multiple regression is measured by the square of the multiple correlation coefficient, R², which is also called the coefficient of multiple determination.
• F test. The F test is used to test the null hypothesis that the coefficient of multiple determination in the population, R²pop, is zero. This is equivalent to testing the null hypothesis H₀: β₁ = β₂ = β₃ = … = βₖ = 0. The test statistic has an F distribution with k and (n − k − 1) degrees of freedom.
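The diminishing-returns adjustment and the overall F statistic can both be written directly in terms of R², n, and k. The sketch below assumes the standard formulas adjusted R² = 1 − (1 − R²)(n − 1)/(n − k − 1) and F = (R²/k) / ((1 − R²)/(n − k − 1)). The R² values, sample size, and function names are hypothetical.

```python
def adjusted_r2(r2, n, k):
    """R-squared adjusted for k independent variables and sample size n."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)


def overall_f(r2, n, k):
    """F statistic with k and (n - k - 1) degrees of freedom."""
    return (r2 / k) / ((1 - r2) / (n - k - 1))


# Hypothetical study: 30 respondents, three predictors, then a weak fourth.
n = 30
three_predictors = adjusted_r2(0.500, n, 3)
four_predictors = adjusted_r2(0.505, n, 4)   # R-squared rises only slightly
f_three = overall_f(0.500, n, 3)

print(f"adjusted R2 with 3 predictors: {three_predictors:.4f}")
print(f"adjusted R2 with 4 predictors: {four_predictors:.4f}")
print(f"overall F with 3 predictors:   {f_three:.4f}")
```

Adding the fourth variable raises R² but lowers adjusted R², which is the penalty for a predictor that contributes little.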
Statistics Associated with Multiple Regression (2 of 2)
• Partial F test. The significance of a partial regression coefficient, βᵢ, of Xᵢ may be tested using an incremental F statistic. The incremental F statistic is based on the increment in the explained sum of squares resulting from the addition of the independent variable Xᵢ to the regression equation after all the other independent variables have been included.
• Partial regression coefficient. The partial regression coefficient, b₁, denotes the change in the predicted value, Ŷ, per unit change in X₁ when the other independent variables, X₂ to Xₖ, are held constant.
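A sketch of the incremental F statistic follows, assuming the usual form: the increase in the explained sum of squares per added variable, divided by the residual mean square of the full model. The function name and the sums of squares are hypothetical.

```python
def incremental_f(ss_reg_full, ss_reg_reduced, ss_total, n, k_full, added=1):
    """Incremental (partial) F for variables added to a regression.

    ss_reg_full    : explained sum of squares with all k_full predictors
    ss_reg_reduced : explained sum of squares without the added predictor(s)
    added          : number of predictors added (1 for a single coefficient)
    """
    ss_res_full = ss_total - ss_reg_full
    numerator = (ss_reg_full - ss_reg_reduced) / added
    denominator = ss_res_full / (n - k_full - 1)
    return numerator / denominator


# Hypothetical sums of squares: a fourth predictor adds very little.
f_value = incremental_f(ss_reg_full=50.5, ss_reg_reduced=50.0,
                        ss_total=100.0, n=30, k_full=4)
print(f"incremental F = {f_value:.4f} with 1 and 25 degrees of freedom")
```

When a single variable is added, this F equals the square of the t statistic of its coefficient in the full model.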
SPSS Windows
The CORRELATE program computes Pearson product moment correlations and partial correlations with significance levels. Univariate statistics, covariance, and cross-product deviations may also be requested. To select these procedures using SPSS for Windows, click:
Analyze>Correlate>Bivariate …
Analyze>Correlate>Partial …
Scatterplots can be obtained by clicking:
Graphs>Scatter>Simple>Define …
REGRESSION calculates bivariate and multiple regression equations, associated statistics, and plots. It allows for an easy examination of residuals. This procedure can be run by clicking:
Analyze>Regression>Linear …
Structural Equation Modelling (SEM)
Introduction to SEM
• SEM is a multivariate statistical technique used to analyze relationships between observed and latent variables.
• Combines factor analysis and regression modeling.
• Widely used in marketing research for understanding consumer behavior, brand loyalty, etc.
Why SEM in Marketing Research?
• Helps in testing theoretical models.

• Simultaneously examines multiple relationships.

• Accounts for measurement errors.

• Provides deeper insights than traditional regression analysis.

Key Components of SEM
• Observed Variables: Measurable items (survey responses, sales data,
etc.).
• Latent Variables: Unobserved constructs (brand trust, satisfaction,
etc.).
• Path Diagrams: Visual representation of relationships.
• Structural Model vs. Measurement Model: How variables relate vs.
how constructs are measured.
Assumptions of SEM
• Sample Size: Minimum 200-300 recommended.
• Multivariate Normality: Data should be normally distributed.
• No Multicollinearity: High correlation between variables should be
avoided.
• Model Identification: Degrees of freedom should be positive.
• Linearity: Relationships between variables should be linear.
• Measurement Invariance: Measurement scales should be consistent
across groups.
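The degrees-of-freedom condition for model identification can be sketched as a simple count: with p observed variables there are p(p + 1)/2 unique variances and covariances, and the degrees of freedom are that number minus the number of freely estimated parameters. The function name and the one-factor examples are hypothetical, and the count is a necessary condition rather than a guarantee of identification.

```python
def sem_degrees_of_freedom(n_observed, n_free_parameters):
    """Unique variances/covariances minus freely estimated parameters."""
    unique_moments = n_observed * (n_observed + 1) // 2
    return unique_moments - n_free_parameters


# Hypothetical one-factor measurement model with four indicators:
# 3 free loadings (one fixed to 1 to set the scale),
# 4 error variances and 1 factor variance = 8 free parameters.
df_four_indicators = sem_degrees_of_freedom(4, 3 + 4 + 1)

# The same model with three indicators: 2 + 3 + 1 = 6 free parameters.
df_three_indicators = sem_degrees_of_freedom(3, 2 + 3 + 1)

print(df_four_indicators, df_three_indicators)
```

The three-indicator model has zero degrees of freedom: it can be estimated but its fit cannot be tested, while the four-indicator model leaves two degrees of freedom for testing.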
Steps in SEM
1. Model Specification: Define theoretical relationships.
2. Model Identification: Check whether the data provide enough information to estimate every free parameter.
3. Model Estimation: Use Maximum Likelihood (ML) estimation.
4. Model Evaluation: Assess model fit (CFI, RMSEA, Chi-square, etc.).
5. Model Modification: Improve the model by adjusting paths and adding or removing variables.
Model Fit Indices
• Chi-Square (χ²): Lower is better, but sensitive to sample size.
• CFI (Comparative Fit Index): > 0.90 is acceptable.
• RMSEA (Root Mean Square Error of Approximation): < 0.08 indicates a good fit.
• SRMR (Standardized Root Mean Square Residual): < 0.08 is recommended.
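Two of these indices can be computed from chi-square values alone. The sketch below assumes the common formulas RMSEA = √(max(χ² − df, 0) / (df·(N − 1))) and CFI = 1 − max(χ²M − dfM, 0) / max(χ²M − dfM, χ²B − dfB, 0), where B is the baseline (independence) model. Some programs use N instead of N − 1 in RMSEA. All input values and function names are hypothetical.

```python
import math


def rmsea(chi2, df, n):
    """Root Mean Square Error of Approximation for sample size n."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


def cfi(chi2_model, df_model, chi2_baseline, df_baseline):
    """Comparative Fit Index relative to the baseline model."""
    d_model = max(chi2_model - df_model, 0.0)
    d_baseline = max(chi2_baseline - df_baseline, d_model)
    if d_baseline == 0:
        return 1.0  # neither model shows any misfit beyond its df
    return 1 - d_model / d_baseline


# Hypothetical output: model chi-square 85 on 40 df, baseline 950 on 55 df,
# estimated from 301 respondents.
rmsea_value = rmsea(85.0, 40, 301)
cfi_value = cfi(85.0, 40, 950.0, 55)

print(f"RMSEA = {rmsea_value:.3f} (good fit if < 0.08)")
print(f"CFI   = {cfi_value:.3f} (acceptable if > 0.90)")
```

SRMR is not included because it needs the full observed and model-implied covariance matrices rather than the chi-square values.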
Introduction to Jamovi for SEM
• Open-source statistical software for SEM.
• User-friendly interface.
• No coding required.
Demonstration – Running SEM in Jamovi
1. Import dataset.
2. Define observed and latent variables.
3. Specify model using path diagrams.
4. Run analysis and interpret model fit indices.
5. Check significance of paths and modify model if necessary.
Interpreting SEM Results
• Look at path coefficients (significance & direction).
• Assess model fit indices.
• Modify the model if necessary.
• Report findings with theoretical and managerial implications.
Thank you!
