
Lecture 7: Naïve Bayes

Naïve Bayes classifier

• It is a classification technique based on Bayes' theorem with an
  independence assumption among features (predictors).

• The Naïve Bayes model is easy to build, with no complicated iterative
  parameter estimation, which makes it particularly useful for very large
  datasets.
Bayes Theorem
▪ Given a class C and a feature X which bears on the class:

      P(C|X) = P(X|C) P(C) / P(X)

▪ P(C): unconditional probability of C (the hypothesis): the prior probability
▪ P(X): unconditional probability of X (the data, i.e. the predictor): the evidence
▪ P(X|C): conditional probability of X given C: the likelihood
▪ P(C|X): conditional probability of C given X: the posterior probability
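As a quick numeric illustration (the prior and likelihood values below are made up, not from the lecture), here is a minimal Python sketch of applying Bayes' theorem:

```python
# Minimal sketch of Bayes' theorem with hypothetical numbers.
p_c = 0.3                      # prior P(C)
p_x_given_c = 0.8              # likelihood P(X|C)
p_x_given_not_c = 0.2          # likelihood P(X|not C)

# Evidence P(X) by total probability: P(X) = P(X|C)P(C) + P(X|not C)P(not C)
p_x = p_x_given_c * p_c + p_x_given_not_c * (1 - p_c)

# Posterior P(C|X) = P(X|C) P(C) / P(X)
p_c_given_x = p_x_given_c * p_c / p_x
print(p_c_given_x)             # 0.24 / 0.38 ≈ 0.632
```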
Maximum A Posteriori
▪ Based on Bayes' theorem, we can compute the Maximum A Posteriori (MAP)
  hypothesis for the data.
▪ We are interested in the best hypothesis from some space C given the
  observed training data X.

      c_MAP ≡ argmax_{c ∈ C} P(c|X)
            = argmax_{c ∈ C} P(X|c) P(c) / P(X)
            = argmax_{c ∈ C} P(X|c) P(c)

  C: the set of all hypotheses (classes).

  Note that we can drop P(X), since the probability of the data is constant
  (and independent of the hypothesis).
Bayes Classifiers
Assumption: the training set consists of instances of different classes c_j,
described as conjunctions of attribute values.
Task: classify a new instance d, given as a tuple of attribute values
(x_1, x_2, ..., x_n), into one of the classes c_j ∈ C.
Key idea: assign the most probable class c_MAP using Bayes' theorem.

      c_MAP = argmax_{c_j ∈ C} P(c_j | x_1, x_2, ..., x_n)
            = argmax_{c_j ∈ C} P(x_1, x_2, ..., x_n | c_j) P(c_j) / P(x_1, x_2, ..., x_n)
            = argmax_{c_j ∈ C} P(x_1, x_2, ..., x_n | c_j) P(c_j)
The Naïve Bayes Model

▪ The Naïve Bayes assumption: assume that the effect of the value of a
  predictor (X) on a given class (C) is independent of the values of the
  other predictors.

▪ This assumption is called class conditional independence:

      P(x_1, x_2, ..., x_n | C) = P(x_1|C) × P(x_2|C) × ... × P(x_n|C)
                                = ∏_{i=1}^{n} P(x_i|C)
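To make the factorization concrete, here is a tiny sketch (the per-feature likelihood values are hypothetical, not from the slides) of computing the class-conditional joint probability as a product of per-feature conditionals:

```python
import math

# Hypothetical per-feature conditional probabilities P(x_i | C) for one class.
per_feature_likelihoods = [0.2, 0.5, 0.9]   # P(x1|C), P(x2|C), P(x3|C)

# Under class conditional independence:
# P(x1, x2, x3 | C) = P(x1|C) * P(x2|C) * P(x3|C)
joint_likelihood = math.prod(per_feature_likelihoods)
print(joint_likelihood)   # ≈ 0.09
```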
Naïve Bayes Algorithm
• The Naïve Bayes algorithm (for discrete input attributes) has two phases
  (see the Python sketch below).
  – 1. Learning Phase: given a training set S, learning is easy; just build
       probability tables.
         For each target value c_i (c_i = c_1, ..., c_L):
             P̂(C = c_i) ← estimate P(C = c_i) from the examples in S
         For every attribute value x_jk of each attribute X_j
         (j = 1, ..., n; k = 1, ..., N_j):
             P̂(X_j = x_jk | C = c_i) ← estimate P(X_j = x_jk | C = c_i)
             from the examples in S
         Output: conditional probability tables; for X_j, N_j × L elements

  – 2. Test Phase: given an unknown instance X' = (a_1, ..., a_n), look up the
       tables and assign the label c* to X' if

         [P̂(a_1|c*) · ... · P̂(a_n|c*)] P̂(c*) > [P̂(a_1|c) · ... · P̂(a_n|c)] P̂(c)
         for all c ≠ c*, c = c_1, ..., c_L

       Classification is easy; just multiply probabilities.
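Below is a minimal Python sketch of both phases, assuming the training data is a list of (features, label) pairs where features is a dict of discrete attribute values; the names fit_naive_bayes and predict are illustrative, not from the lecture.

```python
from collections import Counter

def fit_naive_bayes(examples):
    """Learning phase: build the prior and the conditional probability tables."""
    class_counts = Counter(label for _, label in examples)
    priors = {c: class_counts[c] / len(examples) for c in class_counts}

    # cond_counts[(attribute, value, class)] = number of matching training examples
    cond_counts = Counter()
    for features, label in examples:
        for attr, value in features.items():
            cond_counts[(attr, value, label)] += 1

    def cond_prob(attr, value, label):
        # P̂(X_j = x_jk | C = c_i), estimated by relative frequency
        return cond_counts[(attr, value, label)] / class_counts[label]

    return priors, cond_prob

def predict(priors, cond_prob, instance):
    """Test phase: return the class maximizing P̂(a_1|c) ... P̂(a_n|c) P̂(c)."""
    def score(label):
        s = priors[label]
        for attr, value in instance.items():
            s *= cond_prob(attr, value, label)
        return s
    return max(priors, key=score)
```

With the Play Tennis data used in the following example, this relative-frequency counting reproduces the probability tables shown on the next slides.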
Example
• Example: Play Tennis
Example
• Learning Phase

Outlook    Play=Yes  Play=No      Temperature  Play=Yes  Play=No
Sunny      2/9       3/5          Hot          2/9       2/5
Overcast   4/9       0/5          Mild         4/9       2/5
Rain       3/9       2/5          Cool         3/9       1/5

Humidity   Play=Yes  Play=No      Wind         Play=Yes  Play=No
High       3/9       4/5          Strong       3/9       3/5
Normal     6/9       1/5          Weak         6/9       2/5

P(Play=Yes) = 9/14                P(Play=No) = 5/14


Example
• Test Phase

– Given a new instance, predict its label:

  x' = (Outlook=Sunny, Temperature=Cool, Humidity=High, Wind=Strong)

– Look up the tables built in the learning phase:

  P(Outlook=Sunny|Play=Yes) = 2/9        P(Outlook=Sunny|Play=No) = 3/5
  P(Temperature=Cool|Play=Yes) = 3/9     P(Temperature=Cool|Play=No) = 1/5
  P(Humidity=High|Play=Yes) = 3/9        P(Humidity=High|Play=No) = 4/5
  P(Wind=Strong|Play=Yes) = 3/9          P(Wind=Strong|Play=No) = 3/5
  P(Play=Yes) = 9/14                     P(Play=No) = 5/14

– Decision making with the MAP rule:

  P(Yes|x') ∝ [P(Sunny|Yes) P(Cool|Yes) P(High|Yes) P(Strong|Yes)] P(Play=Yes) = 0.0053
  P(No|x')  ∝ [P(Sunny|No) P(Cool|No) P(High|No) P(Strong|No)] P(Play=No) = 0.0206

  Since P(Yes|x') < P(No|x'), we label x' as "No".
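A quick numeric check of the two unnormalized scores (a sketch; the variable names are illustrative):

```python
# Unnormalized posterior scores for x' = (Sunny, Cool, High, Strong)
score_yes = (2/9) * (3/9) * (3/9) * (3/9) * (9/14)
score_no  = (3/5) * (1/5) * (4/5) * (3/5) * (5/14)

print(round(score_yes, 4))                        # 0.0053
print(round(score_no, 4))                         # 0.0206
print("No" if score_no > score_yes else "Yes")    # "No"
```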


Naïve Bayes
• Algorithm: Continuous-valued Features
– A continuous-valued feature can take uncountably many values, so we cannot
  tabulate counts for each value.
– The conditional probability is instead often modeled with the normal
  (Gaussian) distribution:

      P̂(x_j | c_i) = 1 / (√(2π) σ_ji) · exp( −(x_j − μ_ji)² / (2 σ_ji²) )

      μ_ji: mean (average) of the feature values x_j of the examples for which C = c_i
      σ_ji: standard deviation of the feature values x_j of the examples for which C = c_i

– Learning Phase: for X = (X_1, ..., X_n) and C = c_1, ..., c_L,
  Output: n × L normal distributions and P(C = c_i), i = 1, ..., L

– Test Phase: given an unknown instance X' = (a_1, ..., a_n),
  • instead of looking up tables, calculate the conditional probabilities with
    the normal distributions obtained in the learning phase (see the sketch below)
  • apply the MAP rule to assign a label (the same as in the discrete case)
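A minimal Python sketch of this continuous-valued variant, assuming each class's examples are given as a list of feature values; the helper names and the use of the sample standard deviation are assumptions, not prescribed by the slides.

```python
import math
from statistics import mean, stdev   # stdev uses the (N-1) sample estimate

def gaussian_pdf(x, mu, sigma):
    """Normal density used to model P̂(x_j | c_i) for a continuous feature."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

def fit_gaussian_per_class(values_by_class):
    """Learning phase: estimate (mean, std) of the feature for each class."""
    return {c: (mean(vals), stdev(vals)) for c, vals in values_by_class.items()}

def likelihood(params, x, label):
    """Test phase: P̂(x | label) under the fitted normal distribution."""
    mu, sigma = params[label]
    return gaussian_pdf(x, mu, sigma)
```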
Naïve Bayes
• Example: Continuous-valued Features
– Temperature is naturally a continuous-valued feature.
Yes: 25.2, 19.3, 18.5, 21.7, 20.1, 24.3, 22.8, 23.1, 19.8
No: 27.3, 30.1, 17.4, 29.5, 15.1
– Estimate the mean and variance for each class:

      μ = (1/N) Σ_{n=1}^{N} x_n,    σ² = (1/(N−1)) Σ_{n=1}^{N} (x_n − μ)²

      μ_Yes = 21.64, σ_Yes = 2.35
      μ_No  = 23.88, σ_No  = 7.09

– Learning Phase: output two Gaussian models for P(temperature|C):

      P̂(x|Yes) = 1/(2.35 √(2π)) · exp( −(x − 21.64)² / (2 · 2.35²) )
                ≈ 1/(2.35 √(2π)) · exp( −(x − 21.64)² / 11.09 )

      P̂(x|No)  = 1/(7.09 √(2π)) · exp( −(x − 23.88)² / (2 · 7.09²) )
                ≈ 1/(7.09 √(2π)) · exp( −(x − 23.88)² / 100.5 )
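A short check of these estimates with Python's statistics module (a sketch; the sample standard deviation with the N−1 denominator is assumed, consistent with the values quoted above):

```python
from statistics import mean, stdev

yes_temps = [25.2, 19.3, 18.5, 21.7, 20.1, 24.3, 22.8, 23.1, 19.8]
no_temps  = [27.3, 30.1, 17.4, 29.5, 15.1]

print(round(mean(yes_temps), 2), round(stdev(yes_temps), 2))  # 21.64 2.35
print(round(mean(no_temps), 2), round(stdev(no_temps), 2))    # 23.88 7.09
```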
Zero conditional probability
• If no training example contains a given feature value for some class
  – In this circumstance, we face a zero conditional probability problem
    during testing:

      P̂(x_1|c_i) · ... · P̂(a_jk|c_i) · ... · P̂(x_n|c_i) = 0
      whenever x_j = a_jk and P̂(a_jk|c_i) = 0

  – As a remedy, the class-conditional probabilities are re-estimated with
    the m-estimate (see the sketch below):

      P̂(a_jk | c_i) = (n_c + m·p) / (n + m)

      n_c: number of training examples for which x_j = a_jk and c = c_i
      n:   number of training examples for which c = c_i
      p:   prior estimate (usually p = 1/t for t possible values of x_j)
      m:   weight given to the prior (the number of "virtual" examples, m ≥ 1)
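A minimal sketch of the m-estimate as a Python helper (the function name is illustrative):

```python
def m_estimate(n_c, n, p, m):
    """Smoothed estimate of P(a_jk | c_i) = (n_c + m*p) / (n + m).

    n_c: count of class-c_i examples with x_j = a_jk
    n:   count of class-c_i examples
    p:   prior estimate, e.g. 1/t for t possible attribute values
    m:   equivalent number of "virtual" examples
    """
    return (n_c + m * p) / (n + m)

# Example from the next slide: P(Outlook=Overcast|No) with n_c=0, n=5, p=1/3, m=1
print(m_estimate(0, 5, 1/3, 1))   # 0.0555... = 1/18
```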
Zero conditional probability
• Example: P(Outlook=Overcast|No) = 0 in the Play Tennis dataset
  – Add m "virtual" examples (m: up to 1% of the number of training examples)
    • In this dataset, the number of training examples for the "No" class is 5.
    • So we add only m = 1 "virtual" example in the m-estimate remedy.
  – The "Outlook" feature can take only 3 values, so p = 1/3.
  – Re-estimate P(Outlook=Overcast|No) with the m-estimate:

      n_c = 0   (number of samples with Outlook=Overcast and Play=No)
      n   = 5   (number of samples with Play=No)
      p   = 1/3 (Outlook has 3 values: Sunny, Overcast, Rain)
      m   = 1

      P̂(Outlook=Overcast|No) = (0 + 1·(1/3)) / (5 + 1) = 1/18 ≈ 0.056
Conclusion
▪ Naïve Bayes is based on the independence assumption.
▪ Training is very easy and fast; it only requires considering each attribute
  in each class separately.
▪ Testing is straightforward; it only requires looking up tables or computing
  conditional probabilities with the fitted normal distributions.

▪ Naïve Bayes
  • The performance of Naïve Bayes is competitive with most state-of-the-art
    classifiers, even when the independence assumption is violated.
  • It has many successful applications, e.g., spam mail filtering.
