1. Concept Learning

The document discusses concept learning, which is defined as determining a hypothesis that best fits a set of training examples by searching through potential hypotheses. It involves finding a concept, or target function, that classifies examples as positive or negative based on their attributes. The goal of concept learning is to find a hypothesis that correctly classifies all examples. The space of possible hypotheses can be organized from most general to most specific using relations like more-general-than. Common concept learning algorithms like Find-S search this hypothesis space from specific to general hypotheses.

What is concept learning?

“The problem of searching through a predefined space of potential hypotheses for the hypothesis that best fits the training examples.”

[Figure: concept learning determines a hypothesis that best fits the training examples by searching the space of all possible hypotheses H1(x), H2(x), H3(x), ….]

Introduction

• Assume a given domain, e.g. objects, animals, etc.
• A concept can be seen as a subset of the domain, e.g. birds ⊆ animals
• Task: acquire an intensional concept description from training examples
• Generally we cannot examine all objects in the domain

What is a concept?
The set of features that differentiates one object from another can be called a concept.

Concept Space

[Figure: the concept space, containing the target concept to be learned among other concepts.]

A Boolean-valued function is able to identify the target concept over the concept space.
Target Concept:

The set of items/objects over which the concept is defined is called the set of instances and is denoted by X.

The concept or function to be learned is called the target concept and is denoted by c. It can be seen as a Boolean-valued function defined over X and can be represented as:

c: X → {0, 1}

The goal of concept learning is to find a hypothesis h which can identify all the objects in X so that:

h(x) = c(x) for all x in X
A concept learning algorithm requires three things:
1. Training data (past experiences to train the model)
2. A target concept (the hypothesis to identify data objects)
3. Actual data objects (for testing the model)

Inductive learning is based on formulating a generalized concept after observing a number of instances (examples) of the concept.

We can make a machine learn from past data, so that it becomes able to identify whether an object falls into a specific category of interest or not.

Machines can learn such concepts by processing past/training data to find a hypothesis that best fits the training examples.
Assume the following:
Some attributes/features of the day can be:
Sky, Air Temperature, Humidity, Wind, Water, Forecast
X = set of instances
Many concepts can be defined over X.
For example, the concepts can be
- Days on which my friend Rama enjoys his favorite water sport
- Days on which my friend Rama will not go outside of his house.
- etc
Target concept —
-the concept or function to be learned
-denoted by c
-a boolean valued function defined over X
-represented as c: X → {0, 1}.
e.g., c = Days on which Rama will enjoy sports

To indicate whether Rama enjoys sport on a given day, one more attribute, EnjoySport, is included in the dataset of instances X.

Our target concept is EnjoySport.
It is defined as EnjoySport: X → {0, 1}

Our goal is to predict EnjoySport for an arbitrary given day, with new sample values for the attributes of the day, based on previous learning (from the training examples).

H denotes the set of all possible hypotheses that the computer can consider regarding the identity of the target concept.

Our goal is to determine a hypothesis h from H that identifies the target concept, such that h(x) = c(x) for all x in X.

Learning can be viewed as a task of searching this space

Different learning algorithms search this space in different ways


Representation of Hypothesis:
Each hypothesis is represented by a vector of constraints as follows:
<attribute-1, attribute-2, …, attribute-n>

e.g., for the task EnjoySport, a hypothesis is a vector of six constraints:

<Sky, AirTemp, Humidity, Wind, Water, Forecast>

Each attribute constraint has one of three possibilities:

– a specific value (e.g., Water = Warm)
– don't care (e.g., Water = ?)
– no value allowed (e.g., Water = ∅)

Examples of hypotheses:

Hypothesis                 Target Concept
<?, Cold, High, ?, ?, ?>   Rama will enjoy sports on cold days with high humidity
<?, ?, ?, ?, ?, ?>         Every day is a good day for enjoying sports
<∅, ∅, ∅, ∅, ∅, ∅>         No day is a good day for enjoying sports
Most General/Most Specific Hypothesis

• Most general hypothesis: <?, ?, ?, ?, ?, ?>

• Most specific hypothesis: <∅, ∅, ∅, ∅, ∅, ∅>
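
As a minimal sketch (not part of the original material), a hypothesis can be encoded in Python as a tuple of constraints, with "?" for don't care and None standing in for ∅; the function and variable names here are illustrative:

# A hypothesis is a tuple of constraints over the six day attributes:
# a specific value, "?" (don't care), or None (no value allowed, i.e. ∅).
def satisfies(x, h):
    """Return True (h(x) = 1) iff instance x meets every constraint in h."""
    for value, constraint in zip(x, h):
        if constraint is None:              # ∅ rejects every instance
            return False
        if constraint != "?" and constraint != value:
            return False
    return True

most_general = ("?",) * 6                   # <?, ?, ?, ?, ?, ?>
most_specific = (None,) * 6                 # <∅, ∅, ∅, ∅, ∅, ∅>
day = ("Sunny", "Warm", "Normal", "Strong", "Warm", "Same")
print(satisfies(day, most_general))         # True: every day satisfies it
print(satisfies(day, most_specific))        # False: no day satisfies it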


EnjoySport Concept Learning Task

X — The set of items over which the concept is defined is called the set of instances,
which we denote by X. In the current example, X is the set of all possible days, each
represented by the attributes Sky, AirTemp, Humidity, Wind, Water, and Forecast.

c — The concept or function to be learned is called the target concept, which we denote by c. In general, c can be any Boolean-valued function defined over the instances X; that is, c: X → {0, 1}. In the current example, the target concept corresponds to the value of the attribute EnjoySport (i.e., c(x) = 1 if EnjoySport = Yes, and c(x) = 0 if EnjoySport = No).
(x, c(x)) — When learning the target concept, the learner is presented with a set of training examples, each consisting of an instance x from X along with its target concept value c(x). Instances for which c(x) = 1 are called positive examples and instances for which c(x) = 0 are called negative examples. We will often write the ordered pair (x, c(x)) to describe the training example consisting of the instance x and its target concept value c(x).
D — We use the symbol D to denote the set of available training examples.

H — Given a set of training examples of the target concept c, the problem faced by the
learner is to hypothesize, or estimate, c. We use the symbol H to denote the set of all
possible hypotheses that the learner may consider regarding the identity of the target
concept.

h(x) — In general, each hypothesis h in H represents a Boolean-valued function defined over X; that is, h: X → {0, 1}. The goal of the learner is to find a hypothesis h such that h(x) = c(x) for all x in X.
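
Using these definitions, the EnjoySport training set can be encoded as (x, c(x)) pairs. A sketch using the standard four training examples for this task (the variable name D is illustrative):

# Training examples D: pairs (x, c(x)), with attribute order
# (Sky, AirTemp, Humidity, Wind, Water, Forecast) and c(x) = 1 for Yes.
D = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"),   1),
    (("Sunny", "Warm", "High",   "Strong", "Warm", "Same"),   1),
    (("Rainy", "Cold", "High",   "Strong", "Warm", "Change"), 0),
    (("Sunny", "Warm", "High",   "Strong", "Cool", "Change"), 1),
]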
The Inductive Learning Hypothesis

The inductive learning hypothesis: any hypothesis found to approximate the target function well over a sufficiently large set of training examples will also approximate the target function well over other unobserved examples.
Hypothesis Space
Sky has 3 possible values, and the other 5 attributes have 2 possible values each.

In the hypothesis representation for EnjoySport, each attribute can also take the value "?" or "∅" in addition to its defined values, so Sky admits 5 constraints and each other attribute admits 4. The number of combinations:

5 × 4 × 4 × 4 × 4 × 4 = 5120 syntactically distinct hypotheses.

They are syntactically distinct but not all semantically distinct.

For example, the two hypotheses below say the same thing but look different:

h1 = <Sky=∅, Temp=Warm, Humidity=?, Wind=Strong, Water=Warm, Forecast=Same>
h2 = <Sky=Sunny, Temp=Warm, Humidity=?, Wind=Strong, Water=∅, Forecast=Same>

Neither of these hypotheses accepts any day (a ∅ anywhere rejects every instance), so they are semantically the same.

All hypotheses containing one or more ∅ are counted as one. So the total number of combinations is:

1 (single hypothesis with one or more ∅)
+
4 × 3 × 3 × 3 × 3 × 3 (each attribute takes a defined value or ?)
=
973 semantically distinct hypotheses
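
A quick sanity check of both counts (a Python sketch, not from the original slides):

values = [3, 2, 2, 2, 2, 2]        # number of defined values per attribute

# Syntactic: each attribute may be a defined value, "?", or ∅.
syntactic = 1
for v in values:
    syntactic *= v + 2             # 5 * 4^5
print(syntactic)                   # 5120

# Semantic: a defined value or "?" per attribute, plus the one ∅ hypothesis.
semantic = 1
for v in values:
    semantic *= v + 1              # 4 * 3^5
print(semantic + 1)                # 973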
General-to-Specific Ordering of Hypotheses

Many algorithms for concept learning organize the search through the hypothesis
space by relying on a general-to-specific ordering of hypotheses.

Consider two hypotheses:

h1 = <Sunny, ?, ?, Strong, ?, ?>
h2 = <Sunny, ?, ?, ?, ?, ?>

Now consider the sets of instances that are classified positive by h1 and by h2.
– Because h2 imposes fewer constraints on the instance, it classifies more instances as positive.
– In fact, any instance classified positive by h1 will also be classified positive by h2.
– Therefore, we say that h2 is more general than h1.
– Conversely, we can also say that h1 is more specific than h2.
More-General-Than Relation

For any instance x in X and hypothesis h in H, we say that x satisfies h if and only if
h(x) = 1.

More-General-Than-Or-Equal Relation:

Let hj and hk be Boolean-valued functions defined over X. Then hj is more-general-than-or-equal-to hk (written hj ≥g hk) if and only if

(∀x ∈ X) [(hk(x) = 1) → (hj(x) = 1)]

• The relation ≥g does not depend on the concept to be learned
• It defines a partial order over the set of hypotheses
• strictly-more-general-than (>g): hj >g hk iff hj ≥g hk and hk ≱g hj
• more-specific-than (≤g) is the inverse of more-general-than-or-equal-to

In the above example, there are 2 instances, x1 and x2, and 3 hypotheses, h1, h2 and h3.
h1 classifies x1; h2 classifies x1 and x2; h3 classifies x1.
This indicates that h2 is more-general-than h1 and h3:
h2 >g h1 and h2 >g h3
But there is no more-general-than relation between h1 and h3.
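
For the constraint-vector representation, the relation can be tested attribute by attribute. A sketch, assuming the tuple conventions of the earlier satisfies example:

def more_general_or_equal(hj, hk):
    """True iff hj >=_g hk for two constraint vectors over the same attributes."""
    if None in hk:                     # hk accepts nothing, so hj is vacuously >=_g hk
        return True
    for cj, ck in zip(hj, hk):
        if cj != "?" and cj != ck:     # hj is strictly tighter on this attribute
            return False
    return True

h1 = ("Sunny", "?", "?", "Strong", "?", "?")
h2 = ("Sunny", "?", "?", "?", "?", "?")
print(more_general_or_equal(h2, h1))   # True:  h2 >=_g h1
print(more_general_or_equal(h1, h2))   # False: h1 is more specific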
Popular approaches to finding a suitable hypothesis include:
1. Find-S Algorithm
2. List-Then-Eliminate Algorithm
3. Candidate-Elimination Algorithm
Find-S: Finding a Maximally Specific Hypothesis

The FIND-S algorithm starts from the most specific hypothesis and generalizes it by considering only positive examples.
• The FIND-S algorithm ignores negative examples.

The FIND-S algorithm finds the most specific hypothesis within H that is consistent with the positive training examples.

The final hypothesis will also be consistent with the negative examples, provided the correct target concept is in H and the training examples are correct.
FIND-S Algorithm

Assume the learner is given the sequence of training examples D from the EnjoySport task. Starting from the most specific hypothesis <∅, ∅, ∅, ∅, ∅, ∅>, FIND-S replaces each ∅ with the value from the first positive example, and thereafter generalizes any attribute that disagrees with a later positive example to "?". The sketch below traces the algorithm on the EnjoySport data.
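
A minimal FIND-S sketch in Python, reusing the training set D encoded earlier (names are illustrative):

def find_s(examples):
    """Most specific hypothesis consistent with the positive examples."""
    h = [None] * 6                     # start from <∅, ∅, ∅, ∅, ∅, ∅>
    for x, label in examples:
        if label == 0:                 # FIND-S ignores negative examples
            continue
        for i, value in enumerate(x):
            if h[i] is None:           # first positive example: copy its values
                h[i] = value
            elif h[i] != value:        # minimally generalize to cover x
                h[i] = "?"
    return tuple(h)

print(find_s(D))   # ('Sunny', 'Warm', '?', 'Strong', '?', '?')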
The key properties of the FIND-S algorithm:

• FIND-S is guaranteed to output the most specific hypothesis within H that is consistent with the positive training examples.
• FIND-S's final hypothesis will also be consistent with the negative examples, provided the correct target concept is contained in H and the training examples are correct.
Unanswered questions by FIND-S

There are several questions still left unanswered, such as:

1. Has FIND-S converged to the correct target concept? Although FIND-S will find a hypothesis consistent with the training data, it has no way to determine whether it has found the only hypothesis in H consistent with the data (i.e., the correct target concept), or whether there are many other consistent hypotheses as well.
2. Why prefer the most specific hypothesis? In case there are multiple hypotheses consistent with the training examples, FIND-S will find the most specific. It is unclear whether we should prefer this hypothesis over, say, the most general, or some other hypothesis of intermediate generality.
3. Are the training examples consistent? In most practical learning problems there is some chance that the training examples will contain at least some errors or noise. Such inconsistent sets of training examples can severely mislead FIND-S, given that it ignores negative examples.
4. What if there are several maximally specific consistent hypotheses? There can be several maximally specific hypotheses consistent with the data; FIND-S finds only one.
Consistent Hypothesis
Definition
A hypothesis h is consistent with a set of training examples D if and only if h(x) =
c(x) for each example (x, c(x)) in D.

Difference between the definitions of consistent and satisfies:

An example x is said to satisfy hypothesis h when h(x) = 1, regardless of whether x is a positive or negative example of the target concept.

An example x is said to be consistent with hypothesis h iff h(x) = c(x).

Recall the set of training examples D given above.

Assume hypothesis h = <Sunny, Warm, ?, Strong, ?, ?>.
Now, for each example (x, c(x)) in D, we evaluate h(x):

1. (<Sunny, Warm, Normal, Strong, Warm, Same>, Yes) → h(x) = c(x)
2. (<Sunny, Warm, High, Strong, Warm, Same>, Yes) → h(x) = c(x)
3. (<Rainy, Cold, High, Strong, Warm, Change>, No) → h(x) = c(x)
4. (<Sunny, Warm, High, Strong, Cool, Change>, Yes) → h(x) = c(x)

Hence, hypothesis h is consistent with the set of training examples D.


Let's say we have a hypothesis h2 = <?, Warm, ?, Strong, ?, ?>.
Is this hypothesis consistent with the set of training examples D?

All the training examples give h(x) = c(x), so hypothesis h2 is consistent with D.

Now assume a hypothesis h1 = <?, ?, ?, Strong, ?, ?>.
Is this hypothesis consistent with the set of training examples D?

For training example (3), h(x) ≠ c(x): h1 classifies the negative example as positive. So hypothesis h1 is not consistent with D.
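
These checks can be automated with a small helper, building on the satisfies function and the training set D sketched earlier:

def consistent(h, examples):
    """True iff h(x) = c(x) for every training example (x, c(x))."""
    return all(satisfies(x, h) == bool(label) for x, label in examples)

print(consistent(("Sunny", "Warm", "?", "Strong", "?", "?"), D))   # True
print(consistent(("?", "Warm", "?", "Strong", "?", "?"), D))       # True
print(consistent(("?", "?", "?", "Strong", "?", "?"), D))          # False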
Version Spaces
The set of all hypotheses consistent with the observed training examples is called the version space. The Candidate-Elimination algorithm represents this set.

In the above example, we found two hypotheses from H that are consistent with D:
h1 = <Sunny, Warm, ?, Strong, ?, ?> and h2 = <?, Warm, ?, Strong, ?, ?>
This set of consistent hypotheses {h1, h2} belongs to the version space.
List-Then-Eliminate Algorithm
The List-Then-Eliminate algorithm initializes the version space to contain all hypotheses in H, then eliminates any hypothesis found inconsistent with any training example.

The algorithm is as follows: initialize VersionSpace to a list containing every hypothesis in H; for each training example <x, c(x)>, remove from VersionSpace any hypothesis h for which h(x) ≠ c(x); finally, output the list of hypotheses in VersionSpace. A sketch appears below.
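
A brute-force sketch over the 973 semantically distinct hypotheses, reusing satisfies and D from earlier; the third Sky value is assumed to be Cloudy, as in the standard EnjoySport task:

from itertools import product

# Enumerate H: each attribute is a defined value or "?", plus the one ∅ hypothesis.
domains = [
    ["Sunny", "Rainy", "Cloudy"], ["Warm", "Cold"], ["Normal", "High"],
    ["Strong", "Weak"], ["Warm", "Cool"], ["Same", "Change"],
]
H = [tuple(h) for h in product(*[d + ["?"] for d in domains])]
H.append((None,) * 6)                  # the single all-∅ hypothesis (973 total)

def list_then_eliminate(hypotheses, examples):
    version_space = list(hypotheses)   # start with every hypothesis in H
    for x, label in examples:
        version_space = [h for h in version_space
                         if satisfies(x, h) == bool(label)]
    return version_space

vs = list_then_eliminate(H, D)
print(len(vs))                         # 6 hypotheses remain for EnjoySport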


For the above EnjoySport training examples D, the algorithm outputs the six hypotheses that are consistent with D. In other words, this list of hypotheses is the version space.

In the list of hypotheses, there are two extremes, representing the most general (h1 and h2) and the most specific (h6) hypotheses.

Let's define these two extremes as the general boundary G and the specific boundary S.


Compact Representation of Version Spaces

A version space can be represented with its general and specific boundary sets.

Definition — G
The general boundary G, with respect to hypothesis space H and training data D, is
the set of maximally general members of H consistent with D.

Definition — S
The specific boundary S, with respect to hypothesis space H and training data D, is
the set of minimally general (i.e., maximally specific) members of H consistent with
D.
The Candidate-Elimination algorithm represents the version space by storing only its
most general members G and its most specific members S.

Given only these two sets S and G, it is possible to enumerate all members of the version space by generating the hypotheses that lie between these two sets in the general-to-specific partial ordering over hypotheses:

VS(H,D) = { h ∈ H | (∃s ∈ S)(∃g ∈ G) g ≥g h ≥g s }

Every member of the version space lies between these boundaries, where x ≥g y means x is more general than or equal to y.

Example Version Space
For the EnjoySport concept learning task, the version space contains six hypotheses, bounded by the specific boundary S = { <Sunny, Warm, ?, Strong, ?, ?> } and the general boundary G = { <Sunny, ?, ?, ?, ?, ?>, <?, Warm, ?, ?, ?, ?> }.
Candidate-Elimination algorithm

• The Candidate-Elimination algorithm computes the version space containing all hypotheses from H that are consistent with an observed sequence of training examples.
• It begins by initializing the version space to the set of all hypotheses in H; that is, by initializing the G boundary set to contain the most general hypothesis in H,
G0 ← { <?, ?, ?, ?, ?, ?> }
and initializing the S boundary set to contain the most specific hypothesis,
S0 ← { <∅, ∅, ∅, ∅, ∅, ∅> }
• These two boundary sets delimit the entire hypothesis space, because every other hypothesis in H is both more general than S0 and more specific than G0.
• As each training example is considered, the S and G boundary sets are generalized and specialized, respectively, to eliminate from the version space any hypotheses found inconsistent with the new training example.
• After all examples have been processed, the computed version space contains all the hypotheses consistent with these examples and only these hypotheses.
Candidate-Elimination Algorithm
The Candidate-Elimination algorithm, with an example:

• The training examples D are the same four EnjoySport examples used above; a sketch of the algorithm follows.
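
A compact Candidate-Elimination sketch in Python, reusing satisfies, more_general_or_equal, domains, and D from the earlier sketches; the structure is illustrative, not a canonical implementation:

def min_generalization(s, x):
    """The unique minimal generalization of s that covers instance x."""
    h = list(s)
    for i, value in enumerate(x):
        if h[i] is None:
            h[i] = value               # fill a ∅ slot with the observed value
        elif h[i] != value:
            h[i] = "?"                 # relax a mismatching constraint
    return tuple(h)

def min_specializations(g, x, domains):
    """All minimal specializations of g that exclude instance x."""
    out = []
    for i, constraint in enumerate(g):
        if constraint == "?":
            for value in domains[i]:
                if value != x[i]:      # pick any value the instance lacks
                    h = list(g)
                    h[i] = value
                    out.append(tuple(h))
    return out

def candidate_elimination(examples, domains):
    S = {(None,) * len(domains)}       # most specific boundary
    G = {("?",) * len(domains)}        # most general boundary
    for x, label in examples:
        if label:                      # positive example: generalize S
            G = {g for g in G if satisfies(x, g)}
            for s in [s for s in S if not satisfies(x, s)]:
                S.remove(s)
                h = min_generalization(s, x)
                if any(more_general_or_equal(g, h) for g in G):
                    S.add(h)
        else:                          # negative example: specialize G
            S = {s for s in S if not satisfies(x, s)}
            for g in [g for g in G if satisfies(x, g)]:
                G.remove(g)
                for h in min_specializations(g, x, domains):
                    if any(more_general_or_equal(h, s) for s in S):
                        G.add(h)
            G = {g for g in G
                 if not any(more_general_or_equal(g2, g) and g2 != g for g2 in G)}
    return S, G

S, G = candidate_elimination(D, domains)
print(S)   # {('Sunny', 'Warm', '?', 'Strong', '?', '?')}
print(G)   # {('Sunny', '?', '?', '?', '?', '?'), ('?', 'Warm', '?', '?', '?', '?')}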


Remarks on the Version Space and the Candidate-Elimination algorithm

Will the Candidate-Elimination algorithm converge to the correct hypothesis?

The version space learned by the Candidate-Elimination algorithm will converge toward the hypothesis that correctly describes the target concept, provided
(1) there are no errors in the training examples, and
(2) there is some hypothesis in H that correctly describes the target concept.

What will happen if the training data contains errors?

The algorithm removes the correct target concept from the version space.
– The S and G boundary sets eventually converge to an empty version space if sufficient additional training data is available.
– Such an empty version space indicates that there is no hypothesis in H consistent with all the observed training examples.

A similar symptom will appear when the training examples are correct, but the target concept cannot be described in the hypothesis representation.
What will happen if the training data contains errors?
Suppose, for example, that the second training example above is incorrectly presented as a negative example instead of a positive example.
Let's run the Candidate-Elimination algorithm on this data and see the result.
After processing all the training examples, the algorithm removes the correct target concept from the version space, as the check below illustrates.
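
Running the candidate_elimination sketch above on the corrupted data (a hypothetical check, assuming the helpers defined earlier):

# Flip the label of the second training example from positive to negative.
D_noisy = list(D)
x2, _ = D_noisy[1]
D_noisy[1] = (x2, 0)

S_noisy, G_noisy = candidate_elimination(D_noisy, domains)
print(S_noisy, G_noisy)   # set() set(): the boundaries collapse, leaving an
                          # empty version space; no hypothesis in H fits all examples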
Find-S vs Candidate Elimination

Find-S: finds a hypothesis consistent with the training data.
Candidate Elimination: finds a compact representation of all hypotheses consistent with the training data.

Find-S: considers only the most specific hypothesis.
Candidate Elimination: considers all possible consistent hypotheses.

Find-S: ignores No (negative) instances.
Candidate Elimination: considers both Yes and No instances.
