AMBO UNIVERSITY INSTITUTE OF TECHNOLOGY
DEPARTMENT OF COMPUTER SCIENCE
MASTERS PROGRAM IN COMPUTER SCIENCE
Seminar on Dimensionality Reduction
PREPARED BY: LENCHO JAMBARE
May 14, 2019
Outline
• Machine Learning
• Predictive Modeling
• Dimensionality Reduction
• Why Dimensionality Reduction
• Feature Selection and Feature Reduction
Introduction to Dimensionality Reduction
• Machine Learning: a field of study that gives computers the ability to "learn" from data without being explicitly programmed.
• Predictive Modeling: a probabilistic process that allows us to forecast outcomes on the basis of some predictors.
• These predictors are the features that come into play when deciding the final result, i.e. the outcome of the model.
Dimensionality Reduction
• Dimensionality reduction is the process of reducing the number of random variables under consideration by obtaining a set of principal variables.
• Objective: find a low-dimensional representation of the data that retains as much information as possible.
• It can be divided into feature selection and feature extraction.
Why Dimensionality Reduction?
• Visualization: projection of high-dimensional
data onto 2D or 3D.
• Data compression: efficient storage and
retrieval.
• Noise removal: positive effect on query
accuracy.
• It reduces computation time.
• It also helps remove redundant features, if any.
• Dimensionality reduction is an effective approach to downsizing data.
Feature Selection and Feature Reduction
• Feature selection is the process of identifying and selecting the features that are relevant to your problem.
• In its simplest form, feature selection means examining the relationship between each feature and the target variable, scoring each feature against the target and keeping only the relevant ones (as sketched below).
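• A minimal sketch of this kind of filter-style selection in Python, assuming scikit-learn, its iris data set, and an ANOVA F-test as the relevance score (all illustrative choices, not prescribed by the slides):

from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_iris(return_X_y=True)

# Score each feature against the target and keep the 2 most relevant ones.
selector = SelectKBest(score_func=f_classif, k=2)
X_reduced = selector.fit_transform(X, y)

print("score per feature:", selector.scores_.round(1))
print("reduced shape:", X_reduced.shape)   # (150, 2)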
Feature selection:
Feature selection has the following advantages:
• Improved mining performance, in terms of:
– Speed of learning
– Predictive accuracy
– Simplicity and comprehensibility of mined results
Methods of attribute subset selection
1. Step-wise forward selection:
• The procedure starts with an empty set of attributes.
• The best of the original attributes is determined and added to the set.
• At each subsequent iteration or step, the best of the remaining original attributes is added to the set (a minimal sketch follows below).
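• A minimal sketch of step-wise forward selection in Python, assuming scikit-learn, the iris data set, and cross-validated accuracy of a logistic-regression model as the selection criterion (in practice one would stop once the score no longer improves):

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

selected = []                              # start with an empty attribute set
remaining = list(range(X.shape[1]))

while remaining:
    # Score every candidate attribute when added to the current set.
    scores = {f: cross_val_score(LogisticRegression(max_iter=1000),
                                 X[:, selected + [f]], y, cv=5).mean()
              for f in remaining}
    best = max(scores, key=scores.get)     # best remaining original attribute
    selected.append(best)
    remaining.remove(best)
    print("added attribute", best, "cv accuracy", round(scores[best], 3))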
Methods of attribute subset selection (cont'd)
2. Step-wise backward elimination:
• The procedure starts with the full set of attributes.
• At each step, it removes the worst attribute remaining in the set (see the sketch after this list).
3. Combination of forward selection and backward elimination:
• At each step, the procedure selects the best attribute and removes the worst from among the remaining attributes.
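• A minimal sketch of step-wise backward elimination in Python, assuming scikit-learn's SequentialFeatureSelector, the iris data set, and a target of two retained attributes (illustrative choices only):

from sklearn.datasets import load_iris
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)

# Start from the full attribute set and repeatedly drop the worst attribute.
selector = SequentialFeatureSelector(LogisticRegression(max_iter=1000),
                                     n_features_to_select=2,
                                     direction="backward",
                                     cv=5)
selector.fit(X, y)
print("kept attributes:", selector.get_support(indices=True))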
Methods of attribute subset selection (cont'd)
4. Decision tree induction:
• Decision tree induction constructs a flowchart-like structure where each internal (non-leaf) node denotes a test on an attribute.
• At each node, the algorithm chooses the "best" attribute to partition the data into individual classes.
• When decision tree induction is used for attribute subset selection, a tree is constructed from the given data.
• All attributes that do not appear in the tree are assumed to be irrelevant.
• The set of attributes appearing in the tree forms the reduced subset of attributes (as sketched below).
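• A minimal sketch of decision-tree-based attribute subset selection in Python, assuming scikit-learn's DecisionTreeClassifier and the iris data set:

import numpy as np
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Induce a tree from the given data.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

# Attributes that never appear in a split are assumed irrelevant.
used = np.flatnonzero(tree.feature_importances_ > 0)
X_reduced = X[:, used]
print("attributes appearing in the tree:", used)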
Feature Reduction
• We create new features as functions of the existing ones (instead of choosing a subset of the existing features).
• This can be achieved in an unsupervised manner (e.g., principal component analysis chooses a projection that is efficient for representation).
Principal Component Analysis
• The aim is to find a new feature space with minimum loss of information.
• It is assumed that the "most important" aspects of the data lie along the projections with the greatest variance.
• Principal component analysis (PCA) transforms the data to a new coordinate system such that:
• The greatest variance lies on the first coordinate (the first principal component), the second greatest variance lies on the second coordinate (the second principal component), and so on (a minimal sketch follows below).
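• A minimal sketch of PCA in Python, assuming scikit-learn and the 4-dimensional iris data set, projected onto its first two principal components:

from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, y = load_iris(return_X_y=True)

pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)   # coordinates of each sample in the new (principal) axes

print("explained variance ratio:", pca.explained_variance_ratio_.round(3))
print("projected shape:", X_2d.shape)   # (150, 2)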
Thank you for your attention!