Main Learning Algorithms
Find-S
Candidate-Elimination
ID3
Gradient descent and Backpropagation
Genetic Algorithms
Bayesian Learning
Q Learning
EM and K-means
AdaBoost
Find-S Algorithm
1. Initialize h to the most specific hypothesis in H
2. For each positive training instance x
     For each attribute constraint a_i in h
       If the constraint a_i in h is satisfied by x
       Then do nothing
       Else replace a_i in h by the next more general constraint that is satisfied by x
3. Output hypothesis h
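The steps above can be sketched in Python. This is a minimal sketch assuming nominal attributes and the usual hypothesis representation ('?' for "any value", '0' for "no value accepted"); the EnjoySport-style training data is purely illustrative.

```python
def find_s(examples, n_attrs):
    """Find-S for conjunctive hypotheses over nominal attributes.

    Each hypothesis slot is a specific value, '?' (any value),
    or '0' (no value, i.e. maximally specific).
    """
    h = ['0'] * n_attrs                      # most specific hypothesis
    for x, label in examples:
        if not label:                        # Find-S ignores negative examples
            continue
        for i, a in enumerate(x):
            if h[i] == '0':                  # first positive: copy its values
                h[i] = a
            elif h[i] != a:                  # constraint violated: generalize
                h[i] = '?'
    return tuple(h)

# Illustrative EnjoySport-style data
train = [
    (('Sunny', 'Warm', 'Normal', 'Strong'), True),
    (('Sunny', 'Warm', 'High',   'Strong'), True),
    (('Rainy', 'Cold', 'High',   'Strong'), False),
    (('Sunny', 'Warm', 'High',   'Strong'), True),
]
print(find_s(train, 4))   # ('Sunny', 'Warm', '?', 'Strong')
```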
Candidate Elimination Algorithm
G ← maximally general hypotheses in H
S ← maximally specific hypotheses in H
For each training example d, do
  If d is a positive example
    Remove from G any hypothesis inconsistent with d
    For each hypothesis s in S that is not consistent with d
      Remove s from S
      Add to S all minimal generalizations h of s such that
        - h is consistent with d, and
        - some member of G is more general than h
      Remove from S any hypothesis that is more general than another hypothesis in S
  If d is a negative example
    Remove from S any hypothesis inconsistent with d
    For each hypothesis g in G that is not consistent with d
      Remove g from G
      Add to G all minimal specializations h of g such that
        - h is consistent with d, and
        - some member of S is more specific than h
      Remove from G any hypothesis that is less general than another hypothesis in G
ID3 Algorithm
Input: Examples, Target attribute, Attributes
1. Create a Root node for the tree
2. If all Examples are positive, then return the node Root with label +
3. If all Examples are negative, then return the node Root with label −
4. If Attributes is empty, then return the node Root with label = most common value of Target attribute in Examples
5. Otherwise
   A ← the best decision attribute for Examples
   Assign A as decision attribute for Root
   For each value v_i of A
     - add a new branch from Root corresponding to the test A = v_i
     - let Examples_{v_i} be the subset of Examples that have value v_i for A
     - if Examples_{v_i} is empty then add a leaf node with label = most common value of Target attribute in Examples
     - else add the subtree ID3(Examples_{v_i}, Target attribute, Attributes − {A})
Entropy and Information Gain
Entropy(S) ≡ −p⊕ log₂ p⊕ − p⊖ log₂ p⊖

Gain(S, A) ≡ Entropy(S) − Σ_{v ∈ Values(A)} (|S_v| / |S|) · Entropy(S_v)
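These two quantities can be checked with a short Python sketch for boolean-labelled samples; the tiny dataset is illustrative.

```python
from math import log2

def entropy(labels):
    """Entropy of a boolean-labelled sample (0·log 0 taken as 0)."""
    p = sum(labels) / len(labels)
    return sum(-q * log2(q) for q in (p, 1 - p) if q > 0)

def gain(examples, attr):
    """Information gain of splitting `examples` on attribute index `attr`."""
    total = entropy([y for _, y in examples])
    for v in {x[attr] for x, _ in examples}:
        sv = [y for x, y in examples if x[attr] == v]
        total -= len(sv) / len(examples) * entropy(sv)
    return total

# One attribute that separates the labels perfectly
data = [(('Sunny',), False), (('Sunny',), False),
        (('Rain',), True), (('Rain',), True)]
print(entropy([y for _, y in data]))   # 1.0 (evenly split labels)
print(gain(data, 0))                   # 1.0 (split removes all uncertainty)
```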
Definitions of Error

error_T(h) ≡ Pr_{x∈T}[f(x) ≠ h(x)]

error_S(h) ≡ (1/n) Σ_{x∈S} δ(f(x) ≠ h(x))

bias ≡ E[error_S(h)] − error_T(h)
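The sample error error_S(h) is just a disagreement frequency; a minimal sketch (the target f, hypothesis h, and sample S below are illustrative):

```python
def sample_error(h, f, S):
    """error_S(h): fraction of points in S where h disagrees with f."""
    return sum(f(x) != h(x) for x in S) / len(S)

f = lambda x: x >= 0          # true target concept
h = lambda x: x >= 1          # hypothesis, wrong on 0 <= x < 1
S = [-2, -1, 0, 1, 2]
print(sample_error(h, f, S))  # 0.2 (disagreement only at x = 0)
```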
PAC Learning
Definition: C is PAC-learnable by L using H if for all c ∈ C, distributions T over X, ε such that 0 < ε < 1/2, and δ such that 0 < δ < 1/2, learner L will with probability at least (1 − δ) output a hypothesis h ∈ H such that error_T(h) ≤ ε, in time that is polynomial in 1/ε, 1/δ, n and size(c).
Gradient Descent
Gradient-Descent(training examples, η)
Each training example is a pair ⟨x, t⟩, where x is the vector of input values, t is the target output, and η is the learning rate (e.g. 0.05)

Initialize each w_i to some small random value
Until the termination condition is met, Do
  1. Initialize each Δw_i to zero.
  2. For each ⟨x, t⟩ in training examples, Do
       Input the instance x to the unit and compute the output o
       For each linear unit weight w_i, Do
         Δw_i ← Δw_i + η(t − o)x_i
  3. For each linear unit weight w_i, Do
       w_i ← w_i + Δw_i
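The batch delta rule above can be sketched directly in Python for a linear unit o = w·x; the target function, learning rate, and epoch count below are illustrative choices.

```python
import random

def train_linear_unit(examples, eta=0.05, epochs=500):
    """Batch gradient descent (delta rule) for a linear unit o = w . x.

    `examples` is a list of (x, t) pairs; x carries a leading 1.0 so
    that w[0] acts as the bias weight.
    """
    n = len(examples[0][0])
    w = [random.uniform(-0.05, 0.05) for _ in range(n)]
    for _ in range(epochs):
        dw = [0.0] * n                        # accumulated weight updates
        for x, t in examples:
            o = sum(wi * xi for wi, xi in zip(w, x))
            for i in range(n):
                dw[i] += eta * (t - o) * x[i]
        for i in range(n):                    # apply updates after full pass
            w[i] += dw[i]
    return w

random.seed(0)
# Realizable target t = 2*x1 - 1, encoded as x = (1, x1)
data = [((1.0, x), 2.0 * x - 1.0) for x in (-1.0, 0.0, 1.0, 2.0)]
w = train_linear_unit(data)
print([round(wi, 2) for wi in w])    # ≈ [-1.0, 2.0]
```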
Backpropagation Algorithm
Initialize all weights to small random numbers.
Until satisfied, Do
  For each training example, Do
  1. Input the training example to the network and compute the network outputs
  2. For each output unit k, compute
       δ_k = o_k(1 − o_k)(t_k − o_k)
  3. For each hidden unit h, compute
       δ_h = o_h(1 − o_h) Σ_{k ∈ outputs} w_{kh} δ_k
  4. Update each network weight w_{ji}
       w_{ji} ← w_{ji} + Δw_{ji}, where Δw_{ji} = η δ_j x_{ji}
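A small self-contained sketch of stochastic backpropagation for one sigmoid hidden layer and a single output unit. The OR target, network size, learning rate, seed, and epoch count are all illustrative (OR is linearly separable, so training is reliable).

```python
import math, random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_net(examples, n_in, n_hidden, eta=0.5, epochs=10000):
    """Stochastic backprop for a 1-hidden-layer sigmoid network.

    Returns a forward function; each unit has a trailing bias weight.
    """
    rnd = random.Random(1)
    W = [[rnd.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
         for _ in range(n_hidden)]                          # hidden weights
    v = [rnd.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]  # output weights

    def forward(x):
        h = [sigmoid(sum(w * xi for w, xi in zip(row, x + [1.0])))
             for row in W]
        o = sigmoid(sum(w * hi for w, hi in zip(v, h + [1.0])))
        return h, o

    for _ in range(epochs):
        for x, t in examples:
            h, o = forward(x)
            delta_o = o * (1 - o) * (t - o)          # output error term
            delta_h = [hi * (1 - hi) * v[j] * delta_o
                       for j, hi in enumerate(h)]    # hidden error terms
            for j, hi in enumerate(h + [1.0]):       # update output weights
                v[j] += eta * delta_o * hi
            for j in range(n_hidden):                # update hidden weights
                for i, xi in enumerate(x + [1.0]):
                    W[j][i] += eta * delta_h[j] * xi
    return forward

# Learn boolean OR
data = [([0.0, 0.0], 0.0), ([0.0, 1.0], 1.0),
        ([1.0, 0.0], 1.0), ([1.0, 1.0], 1.0)]
net = train_net(data, n_in=2, n_hidden=2)
print([round(net(x)[1]) for x, _ in data])
```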
Genetic Algorithm
Input:
  Fitness: evaluation function of hypotheses
  Fitness_threshold: the threshold used as termination criterion
  p: size of the population
  r: fraction of population to be replaced by Crossover
  m: mutation rate
Initialize: P ← {h_1, ..., h_p}, p random hypotheses
Evaluate: for each h in P, compute Fitness(h)
While max_{h∈P} Fitness(h) < Fitness_threshold
  Create a new generation P_S
  P ← P_S
Return the hypothesis from P that has the highest fitness.

Create a new generation P_S:
1. Select: Probabilistically select (1 − r)·p members of P to add to P_S, using

     Pr(h_i) = Fitness(h_i) / Σ_{j=1}^{p} Fitness(h_j)

2. Crossover: Probabilistically select (r·p)/2 pairs of hypotheses from P. For each pair ⟨h_1, h_2⟩, produce two offspring by applying the Crossover operator. Add all offspring to P_S.
3. Mutate: Probabilistically select m·p members of P_S and invert a randomly selected bit.
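The generation loop above can be sketched in Python with fitness-proportional selection, single-point crossover, and bitwise mutation. The OneMax objective (count of 1 bits, offset by 1 so all fitnesses are positive for roulette selection), population size, and rates are illustrative choices.

```python
import random

def genetic_algorithm(fitness, n_bits, p=20, r=0.6, m=0.05,
                      threshold=None, max_gens=200, seed=0):
    """GA sketch: roulette selection, 1-point crossover, bit mutation.

    Maximizes `fitness` (assumed positive) over bit strings.
    """
    rnd = random.Random(seed)
    pop = [[rnd.randint(0, 1) for _ in range(n_bits)] for _ in range(p)]
    for _ in range(max_gens):
        scores = [fitness(h) for h in pop]
        if threshold is not None and max(scores) >= threshold:
            break
        total = sum(scores)

        def select():                       # fitness-proportional (roulette)
            x = rnd.uniform(0, total)
            for h, s in zip(pop, scores):
                x -= s
                if x <= 0:
                    return h
            return pop[-1]

        new = [select()[:] for _ in range(int((1 - r) * p))]   # survivors
        while len(new) < p:                                    # crossover
            a, b = select(), select()
            cut = rnd.randrange(1, n_bits)
            new += [a[:cut] + b[cut:], b[:cut] + a[cut:]]
        for h in new:                                          # mutation
            if rnd.random() < m:
                i = rnd.randrange(n_bits)
                h[i] = 1 - h[i]
        pop = new[:p]
    return max(pop, key=fitness)

# OneMax: fitness = number of 1 bits (+1 to keep it strictly positive)
best = genetic_algorithm(lambda h: sum(h) + 1, n_bits=12, threshold=13)
print(sum(best))
```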
Basic Formulas for Probabilities
Product Rule:
  P(A ∧ B) = P(A|B)P(B) = P(B|A)P(A)
Sum Rule:
  P(A ∨ B) = P(A) + P(B) − P(A ∧ B)
Theorem of total probability: if events A_1, ..., A_n are mutually exclusive with Σ_{i=1}^{n} P(A_i) = 1, then

  P(B) = Σ_{i=1}^{n} P(B|A_i)P(A_i)
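A quick numeric check of the total-probability formula; the probabilities below are made up for illustration.

```python
# Two mutually exclusive, exhaustive events A1, A2 (hypothetical numbers)
P_A = [0.3, 0.7]                    # P(A1), P(A2); they sum to 1
P_B_given_A = [0.9, 0.2]            # P(B|A1), P(B|A2)

# Theorem of total probability: P(B) = sum_i P(B|Ai) P(Ai)
P_B = sum(pb * pa for pb, pa in zip(P_B_given_A, P_A))
print(round(P_B, 2))   # 0.9*0.3 + 0.2*0.7 = 0.41
```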
Bayes Theorem
P(h|D) = P(D|h)P(h) / P(D)

Conditional independence: X is c.i. of Y given Z if
  P(X|Y, Z) = P(X|Z)

Bayes classifiers

Maximum a posteriori (MAP) hypothesis:
  h_MAP = argmax_{h∈H} P(h|D)

Bayes Optimal Classifier:
  v_OB = argmax_{v_j∈V} Σ_{h_i∈H} P(v_j|h_i) P(h_i|D)

Naive Bayes Classifier:
  v_NB = argmax_{v_j∈V} P(v_j) Π_i P(a_i|v_j)
Naive Bayes Algorithm
Naive_Bayes_Learn(examples)
  For each target value v_j
    P̂(v_j) ← estimate P(v_j)
    For each attribute value a_i of each attribute a
      P̂(a_i|v_j) ← estimate P(a_i|v_j)

Classify_New_Instance(x)
  v_NB = argmax_{v_j∈V} P̂(v_j) Π_{a_i∈x} P̂(a_i|v_j)
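A minimal Python sketch using relative-frequency estimates (no smoothing, so unseen attribute values get probability 0); the toy weather data is illustrative.

```python
from collections import Counter, defaultdict

def naive_bayes_learn(examples):
    """Estimate P(v_j) and P(a_i|v_j) by relative frequencies."""
    prior = Counter(y for _, y in examples)          # class counts
    cond = defaultdict(Counter)                      # per-class (attr, value) counts
    for x, y in examples:
        for i, a in enumerate(x):
            cond[y][(i, a)] += 1
    n = len(examples)

    def classify(x):
        def score(v):                                # P(v) * prod_i P(a_i|v)
            p = prior[v] / n
            for i, a in enumerate(x):
                p *= cond[v][(i, a)] / prior[v]
            return p
        return max(prior, key=score)
    return classify

# Toy (Outlook, Temperature) -> PlayTennis data, purely illustrative
train = [(('Sunny', 'Hot'),  'No'),  (('Sunny', 'Mild'), 'No'),
         (('Rain',  'Mild'), 'Yes'), (('Rain',  'Cool'), 'Yes'),
         (('Sunny', 'Cool'), 'Yes')]
classify = naive_bayes_learn(train)
print(classify(('Sunny', 'Hot')))   # No
```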
Q Learning for Deterministic Worlds
For each s, a initialize table entry Q̂(s, a) ← 0
Observe current state s
Do forever:
  Select an action a and execute it
  Receive immediate reward r
  Observe the new state s′
  Update the table entry for Q̂(s, a) as follows:
    Q̂(s, a) ← r + γ max_{a′} Q̂(s′, a′)
  s ← s′
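A deterministic-world sketch in Python on a tiny corridor environment. To keep the run deterministic it backs up every action at each visited state rather than using ε-greedy exploration; the environment, γ, and episode budget are illustrative.

```python
def q_learning(n_states, actions, step, gamma=0.9, episodes=50):
    """Tabular Q-learning for a deterministic world:
    Q(s, a) <- r + gamma * max_a' Q(s', a')."""
    Q = {(s, a): 0.0 for s in range(n_states) for a in actions}
    for _ in range(episodes):
        for s0 in range(n_states):          # sweep over start states
            s = s0
            for _ in range(2 * n_states):   # bounded episode length
                if s is None:               # absorbing state reached
                    break
                for a in actions:           # back up every action at s
                    r, s2 = step(s, a)
                    future = (0.0 if s2 is None
                              else max(Q[(s2, b)] for b in actions))
                    Q[(s, a)] = r + gamma * future
                best = max(actions, key=lambda a: Q[(s, a)])
                s = step(s, best)[1]        # follow the greedy action
    return Q

# Corridor of states 0-1-2; moving right from 2 reaches the goal,
# yields reward 100, and ends the episode.
def step(s, a):
    s2 = max(0, min(3, s + a))
    if s2 == 3:
        return 100.0, None
    return 0.0, s2

Q = q_learning(3, actions=(-1, 1), step=step)
print(round(Q[(0, 1)], 6))   # 81.0, i.e. 0.9^2 * 100 from state 0
```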
EM Algorithm for mixture of Gaussians
Pick random initial h = ⟨μ_1, ..., μ_k⟩
Repeat until termination condition:

E step: Calculate the expected value E[z_ij] of each hidden variable z_ij, assuming the current hypothesis h = ⟨μ_1, ..., μ_k⟩ holds.

  E[z_ij] = p(x = x_i | μ = μ_j) / Σ_{l=1}^{k} p(x = x_i | μ = μ_l)
          = e^{−(x_i − μ_j)² / 2σ²} / Σ_{l=1}^{k} e^{−(x_i − μ_l)² / 2σ²}

M step: Calculate a new maximum likelihood hypothesis h′ = ⟨μ′_1, ..., μ′_k⟩, assuming the value taken on by each hidden variable z_ij is its expected value E[z_ij] calculated above. Replace h = ⟨μ_1, ..., μ_k⟩ by h′ = ⟨μ′_1, ..., μ′_k⟩.

  μ_j ← Σ_{i=1}^{m} E[z_ij] x_i / Σ_{i=1}^{m} E[z_ij]
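A direct Python sketch of the two steps for 1-D data, estimating only the means with σ fixed (as in the algorithm above); the two-cluster data and initial guess are illustrative.

```python
import math

def em_means(xs, mu, sigma=1.0, iters=50):
    """EM for a mixture of k equal-variance 1-D Gaussians,
    estimating only the means. `mu` is the initial guess."""
    for _ in range(iters):
        # E step: expected membership E[z_ij] of point i in component j
        E = []
        for x in xs:
            w = [math.exp(-(x - m) ** 2 / (2 * sigma ** 2)) for m in mu]
            s = sum(w)
            E.append([wj / s for wj in w])
        # M step: each mean becomes an E[z]-weighted average of the data
        mu = [sum(E[i][j] * xs[i] for i in range(len(xs))) /
              sum(E[i][j] for i in range(len(xs)))
              for j in range(len(mu))]
    return mu

# Two well-separated clusters around 0 and 10 (illustrative data)
xs = [-0.5, 0.0, 0.5, 9.5, 10.0, 10.5]
mu = em_means(xs, mu=[1.0, 8.0])
print([round(m, 2) for m in mu])   # ≈ [0.0, 10.0]
```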
General EM Algorithm
Estimation (E) step: Calculate Q(h′|h) using the current hypothesis h and the observed data X to estimate the probability distribution over Y.

  Q(h′|h) ← E[ln P(Y|h′) | h, X]

Maximization (M) step: Replace hypothesis h by the hypothesis h′ that maximizes this Q function.

  h ← argmax_{h′} Q(h′|h)
K-means
A variant of EM computing only k means of data generated from k Gaussian
distributions.
Step 1. Begin with a decision on the value of k = number of clusters.
Step 2. Choose any initial partition that classifies the data into k clusters. You may assign the training samples randomly, or systematically as follows:
1. Take the first k training samples as single-element clusters
2. Assign each of the remaining (N − k) training samples to the cluster with the nearest centroid. After each assignment, recompute the centroid of the gaining cluster.
Step 3. Take each sample in sequence and compute its distance from the centroid of each of the clusters. If a sample is not currently in the cluster with the closest centroid, switch this sample to that cluster and update the centroid of the cluster gaining the new sample and the cluster losing the sample.
Step 4. Repeat step 3 until convergence is achieved, that is, until a pass through the training samples causes no new assignments.
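A compact 1-D Python sketch of this procedure. It seeds with the first k samples as in step 2 but recomputes centroids after a full pass (Lloyd-style) rather than after each individual reassignment as the text describes; the data is illustrative.

```python
def k_means(xs, k, iters=100):
    """K-means in 1-D: seed with the first k samples, then alternate
    nearest-centroid assignment and centroid updates until stable."""
    centroids = xs[:k]                       # steps 1-2: initial clusters
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for x in xs:                         # step 3: nearest-centroid assignment
            j = min(range(k), key=lambda j: abs(x - centroids[j]))
            clusters[j].append(x)
        new = [sum(c) / len(c) if c else centroids[j]
               for j, c in enumerate(clusters)]
        if new == centroids:                 # step 4: stop when nothing moves
            break
        centroids = new
    return centroids

xs = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]
print(k_means(xs, 2))   # [1.5, 8.5]
```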
AdaBoost
Given (x_1, y_1), ..., (x_m, y_m), where x_i ∈ X, y_i ∈ Y = {−1, +1}
Initialize D_1(i) = 1/m, i = 1, ..., m.
For t = 1, ..., T:
  Train base learner using distribution D_t
  Get base classifier h_t : X → ℝ
  Choose α_t ∈ ℝ
  Update:

    D_{t+1}(i) = (1/Z_t) · D_t(i) · e^{−α_t y_i h_t(x_i)}

  where Z_t is a normalization factor
Output the final classifier:

  H(x) = sign( Σ_{t=1}^{T} α_t h_t(x) )
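The loop above can be sketched in Python with 1-D decision stumps as the base learner and the standard choice α_t = ½ ln((1 − ε_t)/ε_t); the stump representation and the toy dataset (which no single threshold can separate) are illustrative.

```python
import math

def best_stump(xs, ys, weights):
    """Lowest weighted-error threshold stump h(x) = d if x > theta else -d."""
    best = None
    for theta in sorted(set(xs)):
        for d in (1, -1):
            h = lambda x, t=theta, d=d: d if x > t else -d
            err = sum(w for x, y, w in zip(xs, ys, weights) if h(x) != y)
            if best is None or err < best[0]:
                best = (err, h)
    return best

def adaboost(xs, ys, T=5):
    m = len(xs)
    D = [1.0 / m] * m                         # initial distribution D_1
    ensemble = []                             # list of (alpha_t, h_t)
    for _ in range(T):
        err, h = best_stump(xs, ys, D)
        err = max(err, 1e-10)                 # guard against a perfect stump
        alpha = 0.5 * math.log((1 - err) / err)
        D = [d * math.exp(-alpha * y * h(x))  # reweight the examples
             for d, x, y in zip(D, xs, ys)]
        Z = sum(D)                            # normalization factor Z_t
        D = [d / Z for d in D]
        ensemble.append((alpha, h))
    return lambda x: 1 if sum(a * h(x) for a, h in ensemble) > 0 else -1

# A labelling no single threshold separates: -, +, +, - along the line
xs = [1.0, 2.0, 3.0, 4.0]
ys = [-1, 1, 1, -1]
H = adaboost(xs, ys)
print([H(x) for x in xs])   # [-1, 1, 1, -1]
```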