Intro To Factor Analysis
Intro To Factor Analysis
Rietveld, T. & Van Hout, R. (1993). Statistical Techniques for the Study of Language and
Language Behaviour. Berlin – New York: Mouton de Gruyter.
2
Thursday 4 February 2021 05:17 AM
Factor Analysis
The starting point of factor analysis is a correlation
matrix, in which the inter-correlations between the
studied variables are presented. The dimensionality
of this matrix can be reduced by “looking for
variables that correlate highly with a group of other
variables, but correlate very badly with variables
outside of that group” (Field 2000: 424). These
variables with high inter-correlations could well
measure one underlying variable, which is called a
‘factor’.
Field, A. (2000). Discovering Statistics using SPSS for Windows. London – Thousand Oaks
– New Delhi: Sage publications.
3
Thursday 4 February 2021 05:17 AM
Factor Analysis
Factor analysis is a method of dimension reduction.
4
Thursday 4 February 2021 05:17 AM
Factor Analysis
There are many different methods that can be used
to conduct a factor analysis
5
Factor Analysis
Given the number of factor analytic techniques and
options, it is not surprising that different analysts
could reach very different results analysing the
same data set.
6
Factor Analysis
However, all analysts are looking for a simple
structure.
7
Factor Analysis
Factor analysis is a technique that requires a large
sample size.
8
Factor Analysis
As a rule of thumb, a bare minimum of 10 observations
per variable is necessary to avoid computational
difficulties.
Number of Cases Prospects
50 very poor
100 poor
200 fair
300 good
1000 excellent
10
Factor Analysis
In this example we examine students assessment of
academic courses. We restrict attention to 12 variables.
Item 13 INSTRUCTOR WELL PREPARED
Item 14 INSTRUCTOR SCHOLARLY GRASP
Item 15 INSTRUCTOR CONFIDENCE
Item 16 INSTRUCTOR FOCUS LECTURES
Item 17 INSTRUCTOR USES CLEAR RELEVANT EXAMPLES
Item 18 INSTRUCTOR SENSITIVE TO STUDENTS
Item 19 INSTRUCTOR ALLOWS ME TO ASK QUESTIONS
Item 20 INSTRUCTOR IS ACCESSIBLE TO STUDENTS OUTSIDE CLASS
Item 21 INSTRUCTOR AWARE OF STUDENTS UNDERSTANDING
Item 22 I AM SATISFIED WITH STUDENT PERFORMANCE EVALUATION
Item 23 COMPARED TO OTHER INSTRUCTORS, THIS INSTRUCTOR IS
Item 24 COMPARED TO OTHER COURSES THIS COURSE WAS
12
Factor Analysis
Analyze > Dimension Reduction > Factor
13
Factor Analysis
Select variables 13-24 that is “instructor well
prepared” to “compared to other courses this course
was”. By using the arrow button.
Use the buttons at the side of the screen to set additional options.
14
Factor Analysis
Use the buttons at the side of the previous screen to set
the Descriptives. Employ the Continue button to return to
the main Factor Analysis screen.
17
Factor Analysis
Varimax rotation tries to maximize the variance of each of
the factors, so the total amount of variance accounted for
is redistributed over the three extracted factors.
18
Factor Analysis
Select the OK button to proceed with the analysis, or
Paste to preserve the syntax.
factor
/variables item13 item14 item15 item16 item17 item18 item19 item20
item21 item22 item23 item24
/print initial det kmo repr extraction rotation fscore univaratiate
/format blank(.30)
/plot eigen rotation
/criteria factors(3)
/extraction paf
/rotation varimax
/method = correlation. 19
Factor Analysis
Descriptive Statistics
Likert scale!
INSTRUCTOR AWARE OF
STUDENTS 3.77 .984 1365
UNDERSTANDING
I AM SATISFIED WITH
STUDENT
3.61 1.116 1365
Norman, G. (2010). Likert scales, levels of PERFORMANCE
EVALUATION
measurement and the “laws” of statistics. COMPARED TO OTHER
analysis.
INSTRUCTOR AWARE OF
STUDENTS 3.77 .984 1365
UNDERSTANDING
I AM SATISFIED WITH
STUDENT
3.61 1.116 1365
Are they meaningful for a
PERFORMANCE
EVALUATION
Likert scale!
COMPARED TO OTHER
INSTRUCTORS, THIS 3.81 .957 1365
INSTRUCTOR IS
COMPARED TO OTHER
COURSES THIS 3.67 .926 1365
COURSE WAS 21
Factor Analysis
Descriptive Statistics
Note N is 1365.
INSTRUCTOR AWARE OF
STUDENTS 3.77 .984 1365
UNDERSTANDING
I AM SATISFIED WITH
STUDENT
3.61 1.116 1365
PERFORMANCE
EVALUATION
COMPARED TO OTHER
INSTRUCTORS, THIS 3.81 .957 1365
INSTRUCTOR IS
COMPARED TO OTHER
COURSES THIS 3.67 .926 1365
COURSE WAS 22
Factor Analysis
Correlation Matrixa
a. Determinant = .002
23
Factor Analysis
24
Factor Analysis
26
Factor Analysis
Communalities
Initial Extraction
INSTRUC WELL
.564 .676
PREPARED
each variable's
INSTRUCTOR FOCUS
.447 .468
LECTURES
continua).
.326 .369
STUDENTS OUTSIDE
CLASS
INSTRUCTOR AWARE OF
STUDENTS .516 .549
UNDERSTANDING
I AM SATISFIED WITH
STUDENT
.397 .444
PERFORMANCE
EVALUATION
COMPARED TO OTHER
INSTRUCTORS, THIS .662 .791
INSTRUCTOR IS
COMPARED TO OTHER
COURSES THIS .526 .632
COURSE WAS
27
Extraction Method: Principal Axis Factoring.
Factor Analysis
Communalities
Initial Extraction
INSTRUC WELL
.564 .676
PREPARED
correlation coefficient
COMPARED TO OTHER
INSTRUCTORS, THIS .662 .791
INSTRUCTOR IS
Initial Extraction
INSTRUC WELL
.564 .676
PREPARED
INSTRUC SCHOLARLY
proportion of each
LECTURES
INSTRUCTOR USES
CLEAR RELEVANT .585 .623
variable's variance that can EXAMPLES
INSTRUCTOR SENSITIVE
29
Extraction Method: Principal Axis Factoring.
Factor Analysis
Factor - The initial number of factors is the same as
the number of variables used in the factor analysis.
However, not all 12 factors will be retained. In this
example, only the first three factors will be retained
(as we requested).
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000
Extraction Method: Principal Axis Factoring.
30
Factor Analysis
Initial Eigenvalues - Eigenvalues are the variances of the
factors. Because we conducted our factor analysis on the
correlation matrix, the variables are standardized, which
means that the each variable has a variance of 1, and the
total variance is equal to the number of variables used in
the analysis, in this case, 12.
Total Variance Explained
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000
Extraction Method: Principal Axis Factoring.
31
Factor Analysis
Initial Eigenvalues - Total - This column contains the
eigenvalues. The first factor will always account for the
most variance (and hence have the highest eigenvalue), and
the next factor will account for as much of the left over
variance as it can, and so on. Hence, each successive factor
will account for less and less variance.
Total Variance Explained
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000
Extraction Method: Principal Axis Factoring.
32
Factor Analysis
Initial Eigenvalues - % of Variance - This column contains
the percent of total variance accounted for by each factor
(6.249/12 = .52 or 52%).
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000
Extraction Method: Principal Axis Factoring.
33
Factor Analysis
Initial Eigenvalues - Cumulative % - This column contains
the cumulative percentage of variance accounted for by
the current and all preceding factors. For example, the
third row shows a value of 68.313. This means that the
first three factors together account for 68.313% of the
total variance.
Total Variance Explained
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000
Extraction Method: Principal Axis Factoring.
34
Factor Analysis
Extraction Sums of Squared Loadings - The number of
rows in this panel of the table correspond to the number
of factors retained. The values are based on the common
variance (of the retained factors). The values in this panel
of the table will always be lower than the values in the left
panel of the table, because they are based on the common
variance, which is always smaller than the total variance.
Total Variance Explained
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000
Extraction Method: Principal Axis Factoring. 35
Factor Analysis
Rotation Sums of Squared Loadings - The values in this
panel of the table represent the distribution of the
variance after the varimax rotation. Varimax rotation
tries to maximize the variance of each of the factors, so
the total amount of variance accounted for is
redistributed over the three extracted factors.
Note the more even split.
Total Variance Explained
Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings
Factor Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative %
1 6.249 52.076 52.076 5.851 48.759 48.759 2.950 24.583 24.583
2 1.229 10.246 62.322 .806 6.719 55.478 2.655 22.127 46.710
3 .719 5.992 68.313 .360 3.000 58.478 1.412 11.769 58.478
4 .613 5.109 73.423
5 .561 4.676 78.099
6 .503 4.192 82.291
7 .471 3.927 86.218
8 .389 3.240 89.458
9 .368 3.066 92.524
10 .328 2.735 95.259
11 .317 2.645 97.904
12 .252 2.096 100.000 36
Extraction Method: Principal Axis Factoring.
Factor Analysis
The scree plot graphs the eigenvalue (variance) against the
factor number. You can see these values in the first two
columns of the variance explained table.
37
Factor Analysis
From the third factor on, you can see that the line is almost
flat, meaning the each successive factor is accounting for
smaller and smaller amounts of the total variance.
38
Factor Analysis
Factor Matrixa
Factor
1 2 3
INSTRUC WELL
.713 -.398
PREPARED
INSTRUC SCHOLARLY
.703 -.339
INSTRUCTOR ALLOWS
Factor Analysis
Factor
1 2 3
INSTRUC WELL
.713 -.398
PREPARED
INSTRUC SCHOLARLY
.703 -.339
GRASP
40
Factor Analysis
The plot shows the
items (variables) in
the rotated factor
space.
While this picture
may not be
particularly
helpful, when you
get this graph in
the SPSS output,
you can
interactively
rotate it.
41
Factor Analysis
42
Factor Analysis
Another run of the factor analysis program is conducted
with a promax rotation. It is included to show how
different the rotated solutions can be, and to better
illustrate what is meant by simple structure.
As you will see with an oblique rotation, such as a promax
rotation, the factors are permitted to be correlated with
one another. With an orthogonal rotation, such as the
varimax shown above, the factors are not permitted to
be correlated (they are orthogonal to one another).
Oblique rotations, such as promax, produce both factor
pattern and factor structure matrices. For orthogonal
rotations, such as varimax and equimax, the factor
structure and the factor pattern matrices are the same.
43
Factor Analysis
Use the buttons at the bottom of the screen to set the
alternate Rotation, employ the Continue button to return to
the main Factor Analysis screen.
44
Factor Analysis
The resulting plot with a “simple” structure is shown.
45
Factor Analysis
For a recent review see Factor Analysis at 100. Historical
Developments and Future Directions. By Robert Cudeck,
and Robert C. MacCallum (Eds.). Lawrence Earlbaum
Associates, Mahwah, NJ, 2007, xiii+381 pp., ISBN:978-0-
8058-5347-6 (hardcover), and, ISBN 978-0-8058-6212-6
(paperback).
46
Factor Analysis
Summary
49
Factor Analysis
50
Factor Analysis
After having obtained the correlation matrix, it is time to decide which
type of analysis to use: factor analysis or principal component analysis. The
main difference between these types of analysis lies in the way the
communalities are used. In principal component analysis it is assumed that
the communalities are initially 1. In other words, principal component
analysis assumes that the total variance of the variables can be accounted
for by means of its components (or factors), and hence that there is no
error variance. On the other hand, factor analysis does assume error
variance. This is reflected in the fact that in factor analysis the
communalities have to estimated, which makes factor analysis more
complicated than principal component analysis, but also more conservative.
51
SPSS Tips
Now you should go and try for yourself.
52