Dimensionality Reduction
Motivation
• Clustering
• One way to summarize a complex real-valued data point with a single
categorical variable
• Dimensionality reduction
• Another way to simplify complex high-dimensional data
• Summarize data with a lower dimensional real valued vector
• Given data points in d dimensions
• Convert them to data points in r < d dimensions
• With minimal loss of information
Data Compression
Reduce data from 2D to 1D
(figure: the same quantity measured in inches and in cm, projected onto one dimension)
Data Compression
Reduce data from 3D to 2D
Principal Component Analysis (PCA) problem formulation
Reduce from 2 dimensions to 1 dimension: find a direction (a vector u^(1))
onto which to project the data so as to minimize the projection error.
Reduce from n dimensions to k dimensions: find k vectors u^(1), u^(2), …, u^(k)
onto which to project the data, so as to minimize the projection error.
Covariance
• Variance and Covariance:
• Measure of the “spread” of a set of points around their center of mass (mean)
• Variance:
• Measure of the deviation from the mean for points in one dimension
• Covariance:
• Measure of how much each of the dimensions varies from the mean with
respect to the others
• Covariance is measured between two dimensions
• Covariance indicates whether there is a relationship between two dimensions
• The covariance of a dimension with itself is its variance
Positive covariance: both dimensions increase or decrease together. Negative covariance: as one increases, the other decreases.
Covariance
• Used to find relationships between dimensions in high dimensional
data sets
The sample mean: x̄ = (1/N) Σ_{i=1..N} x_i
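A minimal NumPy sketch (not from the slides; the toy data and variable names are illustrative) of how these quantities are computed:

```python
import numpy as np

# Toy data: N samples, 2 dimensions
X = np.array([[1.0, 2.1],
              [2.0, 3.9],
              [3.0, 6.2],
              [4.0, 8.1]])
N = X.shape[0]

mean = X.sum(axis=0) / N            # sample mean of each dimension
centered = X - mean                 # deviations from the mean

var_x = (centered[:, 0] ** 2).sum() / (N - 1)                 # variance of dimension 0
cov_xy = (centered[:, 0] * centered[:, 1]).sum() / (N - 1)    # covariance of dims 0 and 1

cov_matrix = centered.T @ centered / (N - 1)                  # full covariance matrix
assert np.allclose(cov_matrix, np.cov(X, rowvar=False))       # matches NumPy's built-in
print(cov_matrix)
```

The diagonal entries of `cov_matrix` are the variances of each dimension; the off-diagonal entries are the pairwise covariances, whose signs indicate whether two dimensions increase together or move oppositely.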
Eigenvector and Eigenvalue
Ax = λx
A: square matrix
λ: eigenvalue or characteristic value
x: eigenvector or characteristic vector
• The zero vector cannot be an eigenvector
• The value zero can be an eigenvalue
Eigenvector and Eigenvalue
Ax = λx
Ax − λx = 0
(A − λI)x = 0
If we define a new matrix B:
B = A – λI
Bx = 0
If B has an inverse, then x = B⁻¹0 = 0. BUT! An eigenvector cannot be zero!
So x will be an eigenvector of A if and only if B does
not have an inverse, or equivalently det(B) = 0:
det(A – λI) = 0
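A small NumPy sketch (illustrative only; the matrix here is made up) showing that the roots of det(λI − A) = 0 are exactly the eigenvalues:

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [2.0, 3.0]])

# Coefficients of the characteristic polynomial det(lambda*I - A),
# highest power first: lambda^2 - 7*lambda + 10
coeffs = np.poly(A)
print(coeffs)                  # [ 1. -7. 10.]

# The eigenvalues are the roots of that polynomial
print(np.roots(coeffs))        # [5. 2.]
print(np.linalg.eigvals(A))    # same values, computed directly
```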
Eigenvector and Eigenvalue
Example 1: Find the eigenvalues of
A = [ 2  −12 ]
    [ 1   −5 ]
|λI − A| = | λ − 2    12   |
           |  −1    λ + 5  |  = (λ − 2)(λ + 5) + 12 = λ² + 3λ + 2 = (λ + 1)(λ + 2)
Two eigenvalues: λ = −1, λ = −2
Note: The roots of the characteristic equation can be repeated. That is, λ_1 = λ_2 = … = λ_k. If that happens, the
eigenvalue is said to be of multiplicity k.
Example 2: Find the eigenvalues of
A = [ 2  1  0 ]
    [ 0  2  0 ]
    [ 0  0  2 ]
|λI − A| = (λ − 2)³ = 0
λ = 2 is an eigenvalue of multiplicity 3
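A quick numerical check of both examples (illustrative only; NumPy returns the same eigenvalues up to ordering and floating-point rounding):

```python
import numpy as np

# Example 1: expect eigenvalues -1 and -2
A1 = np.array([[2.0, -12.0],
               [1.0,  -5.0]])
vals1, vecs1 = np.linalg.eig(A1)
print(np.sort(vals1))            # [-2. -1.]

# Example 2: expect eigenvalue 2 with multiplicity 3
A2 = np.array([[2.0, 1.0, 0.0],
               [0.0, 2.0, 0.0],
               [0.0, 0.0, 2.0]])
vals2, vecs2 = np.linalg.eig(A2)
print(vals2)                     # [2. 2. 2.]

# Each eigenpair satisfies A x = lambda x
for lam, x in zip(vals1, vecs1.T):
    assert np.allclose(A1 @ x, lam * x)
```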
Principal Component Analysis
Input: data points x_1, …, x_N, each a D-dimensional vector
Set of K basis vectors: v_1, …, v_K (each also D-dimensional)
Summarize a D-dimensional vector x with a K-dimensional
feature vector h(x)
Principal Component Analysis
Basis vectors are orthonormal: v_i · v_j = 0 for i ≠ j, and v_i · v_i = 1
New data representation: h(x) = ( v_1 · (x − x̄), …, v_K · (x − x̄) )
Principal Component Analysis
New data representation: h(x) = ( v_1 · (x − x̄), …, v_K · (x − x̄) )
Empirical mean of the data: x̄ = (1/N) Σ_{i=1..N} x_i
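A compact sketch of this pipeline in NumPy (function and variable names are my own, not from the slides): center the data at the empirical mean, take the top-K eigenvectors of the covariance matrix as the orthonormal basis, then project.

```python
import numpy as np

def pca_fit(X, K):
    """X: (N, D) data matrix. Returns the empirical mean and the top-K basis vectors."""
    mean = X.mean(axis=0)
    centered = X - mean
    cov = centered.T @ centered / (X.shape[0] - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)          # eigh: covariance matrix is symmetric
    order = np.argsort(eigvals)[::-1]               # sort by decreasing eigenvalue
    V = eigvecs[:, order[:K]]                       # (D, K) orthonormal basis
    return mean, V

def pca_transform(X, mean, V):
    """h(x) = V^T (x - mean): the K-dimensional representation."""
    return (X - mean) @ V

def pca_reconstruct(H, mean, V):
    """Approximate reconstruction x ≈ mean + V h(x)."""
    return H @ V.T + mean

# Toy usage: 200 points in D = 5 dimensions, reduced to K = 2
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5)) @ rng.normal(size=(5, 5))
mean, V = pca_fit(X, K=2)
H = pca_transform(X, mean, V)          # (200, 2) compressed representation
X_hat = pca_reconstruct(H, mean, V)    # (200, 5) approximation of X
```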
SIFT feature visualization
• The top three principal components of SIFT descriptors from a set of images are computed
• Map these principal components to the principal components of the RGB space
• Pixels with similar colors share similar structures
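One hedged sketch of how such a visualization could be produced, assuming `descriptors` is a (num_points, 128) array of SIFT descriptors; the rescaling to [0, 255] is one reasonable choice, not necessarily the one used for the original figure:

```python
import numpy as np

def sift_to_rgb(descriptors):
    """Project 128-D SIFT descriptors onto their top 3 principal components
    and rescale each component to [0, 255] so it can be shown as an RGB color."""
    centered = descriptors - descriptors.mean(axis=0)
    cov = centered.T @ centered / (len(descriptors) - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)
    top3 = eigvecs[:, np.argsort(eigvals)[::-1][:3]]     # (128, 3) principal directions
    proj = centered @ top3                               # (num_points, 3) projections
    lo, hi = proj.min(axis=0), proj.max(axis=0)
    return np.uint8(255 * (proj - lo) / (hi - lo + 1e-12))
```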
Application: Image compression
Original Image
• Divide the original 372x492 image into patches:
• Each patch is an instance that contains 12x12 pixels on a grid
• View each as a 144-D vector (a compression sketch follows below)
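A hedged sketch of the patch-based compression described above, assuming `img` is a 2-D grayscale array (e.g. the 372x492 image) and K components are kept per patch:

```python
import numpy as np

def compress_patches(img, patch=12, K=16):
    """Cut a grayscale image into non-overlapping patch x patch blocks,
    run PCA on the flattened 144-D patches, and keep only K coefficients each."""
    H, W = img.shape
    blocks = (img[:H - H % patch, :W - W % patch]
              .reshape(H // patch, patch, W // patch, patch)
              .swapaxes(1, 2)
              .reshape(-1, patch * patch))               # (num_patches, 144)

    mean = blocks.mean(axis=0)
    centered = blocks - mean
    cov = centered.T @ centered / (len(blocks) - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)
    V = eigvecs[:, np.argsort(eigvals)[::-1][:K]]        # top-K eigenvectors, (144, K)

    codes = centered @ V                                 # K numbers stored per patch
    reconstructed = codes @ V.T + mean                   # 144-D approximation of each patch
    return codes, reconstructed
```

Storing K coefficients per patch instead of 144 pixel values is what gives the 144D → K D compression shown on the following slides.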
PCA compression: 144D → 60D
PCA compression: 144D → 16D
16 most important eigenvectors
PCA compression: 144D → 6D
6 most important eigenvectors
PCA compression: 144D → 3D
3 most important eigenvectors
PCA compression: 144D → 1D
60 most important eigenvectors
Looks like the discrete cosine bases of JPEG!...