Principal Component Analysis:
Principal Component Analysis (PCA) is a dimensionality reduction technique commonly used in
data analysis and machine learning. Its primary goal is to reduce the dimensionality of a dataset
while preserving as much of the variance, i.e. the information present in the data, as possible.
PCA achieves this by transforming the original variables into a new set of variables, called
principal components. These principal components are linear combinations of the original
variables and are orthogonal to each other, meaning they are uncorrelated. The first principal
component accounts for the largest possible variance in the data; the second accounts for the
largest remaining variance while being orthogonal to the first, and so on.
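As a quick illustration of this ordering (a minimal sketch assuming NumPy and scikit-learn are available; the data below is arbitrary), a fitted PCA model reports the explained variance of its components in decreasing order, and the components are mutually orthogonal:

import numpy as np
from sklearn.decomposition import PCA

# Arbitrary illustrative data: 200 samples, 3 features, two of them correlated
rng = np.random.default_rng(0)
x = rng.normal(size=(200, 1))
X = np.hstack([x, 0.5 * x + 0.1 * rng.normal(size=(200, 1)), rng.normal(size=(200, 1))])

pca = PCA(n_components=3).fit(X)
print(pca.explained_variance_ratio_)                      # decreasing: PC1 first, then PC2, PC3
print(np.round(pca.components_ @ pca.components_.T, 6))   # ~identity matrix: components are orthogonal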
In essence, PCA helps in simplifying the complexity of high-dimensional data by capturing the
most important patterns or directions of variation in the data, thereby enabling easier
visualization, exploration, and analysis of the dataset. It is widely used in various fields such as
image processing, signal processing, finance, and bioinformatics, among others.
The principal components (PCs) in PCA are derived through linear algebra techniques, primarily
eigenvalue decomposition of the covariance matrix of the data or, equivalently, singular value
decomposition (SVD) of the centered data matrix. Here's a brief overview of the mathematics
behind PCA:
1. Centering the data: First, the mean of each feature (variable) is subtracted from the dataset.
This step ensures that the data is centered around the origin.
2. Covariance matrix: The covariance matrix is calculated for the centered data. This matrix
represents the pairwise covariances between all pairs of features.
3. Eigenvalue decomposition (EVD): The covariance matrix is decomposed into its eigenvectors
and eigenvalues. The eigenvectors represent the directions (principal components) of maximum
variance in the data, and the corresponding eigenvalues represent the magnitude of variance along
those directions. The eigenvectors are usually sorted in descending order of their eigenvalues, so
the first principal component (PC1) captures the most variance, the second principal component
(PC2) captures the second most, and so on.
4. Selecting principal components: After obtaining the eigenvectors or singular vectors, the
desired number of principal components is selected based on the explained variance or the
application's requirements. Typically, one can select a subset of the principal components that
capture most of the variance in the data.
5. Projection: Finally, the original data is projected onto the selected principal components to
obtain the reduced-dimensional representation of the data. This is achieved by taking the dot
product of the centered data matrix with the matrix of selected principal components.
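Putting the five steps together, here is a minimal NumPy sketch (an illustration under the assumption that X is a samples-by-features array and k is the number of components to keep, not a definitive implementation):

import numpy as np

def pca(X, k):
    # 1. Center the data: subtract the mean of each feature
    X_centered = X - X.mean(axis=0)
    # 2. Covariance matrix of the centered data (features x features)
    cov = np.cov(X_centered, rowvar=False)
    # 3. Eigenvalue decomposition; eigh is used because the covariance matrix is symmetric
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]        # sort directions by variance, descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # 4. Select the top k principal components (columns of eigvecs)
    W = eigvecs[:, :k]
    # 5. Project the centered data onto the selected components
    return X_centered @ W, W, eigvals

# Example usage on arbitrary data
X = np.random.default_rng(1).normal(size=(100, 5))
X_reduced, W, eigvals = pca(X, k=2)
print(X_reduced.shape)    # (100, 2)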
Numerical example:
To compute PCA on a concrete dataset, apply the steps above in order: center the data, compute the
covariance matrix, find its eigenvalues and eigenvectors, select the leading principal components,
and project the centered data onto them.
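As a small illustration (the numbers here are arbitrary, chosen only so the arithmetic stays simple), consider four samples with two features: (2, 1), (3, 2), (4, 3), (5, 4).
1. Center the data: the feature means are 3.5 and 2.5, so the centered samples are (-1.5, -1.5), (-0.5, -0.5), (0.5, 0.5), (1.5, 1.5).
2. Covariance matrix: every entry equals (2.25 + 0.25 + 0.25 + 2.25) / 3 = 5/3, giving C = [[5/3, 5/3], [5/3, 5/3]].
3. Eigenvalue decomposition: the eigenvalues of C are 10/3 and 0. The eigenvector for 10/3 is (1/√2, 1/√2), so PC1 points along the direction (1, 1) and captures all of the variance.
4. Selecting principal components: keeping only PC1 reduces the data from two dimensions to one.
5. Projection: the dot product of each centered sample with (1/√2, 1/√2) gives the one-dimensional scores -3/√2, -1/√2, 1/√2, 3/√2 (about -2.12, -0.71, 0.71, 2.12); their variance is 10/3, which matches the leading eigenvalue.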
Linear Discriminant Analysis:
Linear Discriminant Analysis (LDA) is a dimensionality reduction technique and a classification
algorithm used in machine learning and statistics. It is primarily used in supervised learning
tasks, since it relies on class labels.
The main goal of LDA is to find a linear combination of features that characterizes or separates
two or more classes of objects or events. It's particularly useful when dealing with classification
problems where the classes are well-separated. LDA seeks to project the feature space onto a
lower-dimensional space while preserving the class discriminatory information as much as
possible.
Here's how LDA works:
1. Calculate the mean vectors: For each class in the dataset, calculate the mean vector, which
represents the mean values of each feature for that class.
2. Compute the scatter matrices: There are two scatter matrices used in LDA:
Within-class scatter matrix (Sw): It measures the spread of the data around their own class means, i.e. the scatter within individual classes.
Between-class scatter matrix (Sb): It measures the spread of the class means around the overall mean, i.e. the separation between classes.
3. Compute the eigenvectors and eigenvalues: Next, compute the eigenvectors and eigenvalues of
the matrix (Sw^(-1)) * Sb. The eigenvectors represent the directions (linear discriminants) that
maximize the separation between classes, while the eigenvalues indicate how much class-discriminatory
information each direction carries (the ratio of between-class to within-class scatter along it).
4. Select discriminants: Sort the eigenvectors by their corresponding eigenvalues in descending
order and choose the top k eigenvectors to form a matrix W. These eigenvectors serve as the axes
for the new feature subspace.
5. Project the data onto the new feature subspace: Multiply the original data matrix by the
matrix W to obtain the representation of the data in the new, lower-dimensional feature subspace.
6. Classification: Once the data is projected onto the new feature subspace, a classification
algorithm (e.g., nearest neighbor classifier, logistic regression) can be applied to classify the
data.
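The steps above can be sketched with NumPy as follows (a minimal illustration, assuming X is a samples-by-features array, y is a vector of class labels, and k is the number of discriminants to keep; it is not meant as a production implementation):

import numpy as np

def lda_projection(X, y, k):
    classes = np.unique(y)
    n_features = X.shape[1]
    overall_mean = X.mean(axis=0)
    # 1. Mean vector of each class
    class_means = {c: X[y == c].mean(axis=0) for c in classes}
    # 2. Within-class (Sw) and between-class (Sb) scatter matrices
    Sw = np.zeros((n_features, n_features))
    Sb = np.zeros((n_features, n_features))
    for c in classes:
        Xc = X[y == c]
        centered = Xc - class_means[c]
        Sw += centered.T @ centered
        mean_diff = (class_means[c] - overall_mean).reshape(-1, 1)
        Sb += Xc.shape[0] * (mean_diff @ mean_diff.T)
    # 3. Eigenvectors and eigenvalues of Sw^(-1) * Sb
    eigvals, eigvecs = np.linalg.eig(np.linalg.inv(Sw) @ Sb)
    eigvals, eigvecs = eigvals.real, eigvecs.real
    # 4. Sort by eigenvalue (descending) and keep the top k linear discriminants as W
    order = np.argsort(eigvals)[::-1]
    W = eigvecs[:, order[:k]]
    # 5. Project the data onto the new feature subspace
    return X @ W, W

# Example usage on arbitrary labelled data (two classes, so at most one discriminant)
rng = np.random.default_rng(2)
X = np.vstack([rng.normal(0, 1, size=(50, 4)), rng.normal(2, 1, size=(50, 4))])
y = np.array([0] * 50 + [1] * 50)
X_reduced, W = lda_projection(X, y, k=1)
print(X_reduced.shape)    # (100, 1)

After the projection, any classifier can be trained on the reduced data (step 6). In practice, library implementations such as scikit-learn's LinearDiscriminantAnalysis perform the projection and the classification together.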
LDA is widely used in various fields, including pattern recognition, face recognition,
bioinformatics, and finance, among others. It's especially effective when the classes are well-
separated and the assumptions of normality and equal covariance matrices hold true.