
K-Nearest Neighbor

Carla P. Gomes
CS4700
1-Nearest Neighbor

One of the simplest of all machine learning classifiers


Simple idea: label a new point the same as the closest known point

Label it red (in the figure, the closest known point to the new query point is red).
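A minimal 1-NN sketch in Python (the function and variable names are illustrative, not from the slide):

import math

def nearest_neighbor_label(query, training_points, training_labels):
    # Index of the training point closest to the query (Euclidean distance)
    best = min(range(len(training_points)),
               key=lambda i: math.dist(query, training_points[i]))
    # 1-NN: predict the label of that single closest point
    return training_labels[best]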

Distance Metrics
Different metrics can change the decision surface

Dist(a,b) = (a1 – b1)^2 + (a2 – b2)^2        Dist(a,b) = (a1 – b1)^2 + (3a2 – 3b2)^2
(the second metric rescales the second dimension by 3, which produces a different decision surface)

Standard Euclidean distance metric:
– Two-dimensional: Dist(a,b) = sqrt((a1 – b1)^2 + (a2 – b2)^2)
– Multivariate: Dist(a,b) = sqrt(∑i (ai – bi)^2)

Adapted from “Instance-Based Learning” lecture slides by Andrew Moore, CMU.
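The multivariate form translates directly; a quick sketch assuming numeric attribute vectors (the function name is illustrative):

def euclidean_dist(a, b):
    # Square root of the sum of squared per-dimension differences
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b)) ** 0.5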

1-NN’s Aspects as an Instance-Based Learner:

A distance metric
– Euclidean
– When different units are used for each dimension,
  normalize each dimension by its standard deviation
– For discrete data, can use Hamming distance (see the sketch after this slide):
  D(x1,x2) = number of features on which x1 and x2 differ
– Others (e.g., normal, cosine)

How many nearby neighbors to look at?
– One

How to fit with the local points?
– Just predict the same output as the nearest neighbor.

Adapted from “Instance-Based Learning” lecture slides by Andrew Moore, CMU.
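Illustrating the normalization-by-standard-deviation and Hamming-distance points from the list above (a sketch; the function names and the use of Python's statistics module are my choices, not the slide's):

from statistics import stdev

def normalize_columns(points):
    # Rescale each dimension by its standard deviation so that units do not dominate the distance
    columns = list(zip(*points))
    scales = [stdev(col) for col in columns]
    return [tuple(x / s for x, s in zip(p, scales)) for p in points]

def hamming_dist(x1, x2):
    # D(x1, x2) = number of features on which x1 and x2 differ
    return sum(a != b for a, b in zip(x1, x2))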
k-Nearest Neighbor

Generalizes 1-NN to smooth away noise in the labels


A new point is now assigned the most frequent label of its k nearest
neighbors

Label it red when k = 3 (in the figure, the 3 nearest neighbors are mostly red)

Label it blue when k = 7 (the 7 nearest neighbors are mostly blue)
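A sketch of the majority-vote rule just described (names are illustrative; assumes numeric points):

import math
from collections import Counter

def knn_label(query, training_points, training_labels, k):
    # Indices of the k training points closest to the query
    nearest = sorted(range(len(training_points)),
                     key=lambda i: math.dist(query, training_points[i]))[:k]
    # Assign the most frequent label among those k neighbors
    votes = Counter(training_labels[i] for i in nearest)
    return votes.most_common(1)[0][0]

As in the red/blue figure, the same query can get different labels for k = 3 and k = 7, because the vote is taken over a different neighborhood.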


KNN Example
     Food (3)    Chat (2)   Fast (2)   Price (3)   Bar (2)   BigTip
1    great       yes        yes        normal      no        yes
2    great       no         yes        normal      no        yes
3    mediocre    yes        no         high        no        no
4    great       yes        yes        normal      yes       yes

Similarity metric: number of matching attributes (k = 2)

New examples:
– Example 1 (great, no, no, normal, no) → Yes
   Most similar: example 2 (1 mismatch, 4 matches) → yes
   Second most similar: example 1 (2 mismatches, 3 matches) → yes
– Example 2 (mediocre, yes, no, normal, no) → Yes/No (tie)
   Most similar: example 3 (1 mismatch, 4 matches) → no
   Second most similar: example 1 (2 mismatches, 3 matches) → yes

Selecting the Number of Neighbors

Increase k:
– Makes KNN less sensitive to noise

Decrease k:
– Allows capturing finer structure of space

Pick k not too large, but not too small (depends on data)
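One common way to settle on k in practice (not spelled out on the slide) is to compare a few candidate values with leave-one-out validation; a sketch reusing the knn_label function from the earlier snippet:

def choose_k(points, labels, candidates=(1, 3, 5, 7)):
    # Leave-one-out accuracy: classify each point using all the other points, for each candidate k
    def accuracy(k):
        hits = 0
        for i in range(len(points)):
            rest_points = points[:i] + points[i + 1:]
            rest_labels = labels[:i] + labels[i + 1:]
            hits += knn_label(points[i], rest_points, rest_labels, k) == labels[i]
        return hits / len(points)
    return max(candidates, key=accuracy)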

Curse-of-Dimensionality

Prediction accuracy can quickly degrade when the number of attributes grows.
– Irrelevant attributes easily “swamp” information from relevant attributes
– With many irrelevant attributes, the similarity/distance measure becomes less reliable

Remedy
– Try to remove irrelevant attributes in a pre-processing step
– Weight attributes differently
– Increase k (but not too much)
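A sketch of the “weight attributes differently” remedy (the weights are assumed to come from elsewhere, e.g. domain knowledge or validation):

def weighted_dist(a, b, weights):
    # Small weights on irrelevant attributes keep them from swamping the relevant ones
    return sum(w * (ai - bi) ** 2 for w, ai, bi in zip(weights, a, b)) ** 0.5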

Advantages and Disadvantages of KNN

Need a distance/similarity measure and attributes that “match” the target function.

For large training sets, must make a pass through the entire dataset for each classification; this can be prohibitive.

Prediction accuracy can quickly degrade when the number of attributes grows.

Simple to implement;
Requires little tuning;
Often performs quite well!
(Try it first on a new learning problem.)
