0% found this document useful (0 votes)

8 views56 pages

Chap 5 Learning

The document discusses learning agents and their components, data preparation for learning, classification tasks including the steps and performance measures, types of learning approaches, and an example of face recognition.

Uploaded by

asnake ketema

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views56 pages

Chap 5 Learning

Uploaded by

asnake ketema

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 56

LEARNING AGENTS

[email protected]

1
Contents to be
covered
• What is learning
- Components of learning Agent

• Data set preparation for learning

• Classification tasks
- Steps for classification
- Classifier performance measure
- Factors affecting classifier model performance

• Types of learning

• Learning approaches/methods
2
What is Learning?
• Learning is one of the keys to human intelligence. Do
you agree?
• what learning is? Learning is:
- Memorizing something
- Knowing facts through observation and exploration
- Improving motor and/or cognitive skills through practice
• The idea behind learning is that percepts should not
only be used for acting now, but also for improving
the agent’s ability to act in the future to achieve
the goal.
- Learning is essential for unknown environments,
i.e. when the agent lacks knowledge.
- It enables to organize new knowledge into
general,
- effective representations
Learning modifies the agent's decision making mechanisms
3
to improve performance
Learning Examples
– Learning is memorizing and remembering
• Telephone number
• Reading textbook
• Understanding Language (memorizing
grammar &
practicing)
• Recognize face, signature & fraudulent credit
card transactions
– Learning is improving motor skills
• Riding a bike
• Exercising and practicing the idea
– Learning is understanding the strategy & rule of the
game
• Playing chess and football
– Learning is abstraction and exploration
• Develop scientific theory
• Undertaking research
– Learning is nothing but
• Feature extraction 4
• Classification
Feature extraction

Male Hair Fema Hair

length le length y
y =0
=Task: to extract features which are=good for = 30 cm
classification.
Good features:
• Objects from the same class have similar feature values.
• Objects from different classes have different values.

“Good” “Bad” 5
Classification
• Applications
–Character recognition:
Different printing styles.
–Speech recognition:
• Use of a dictionary or
the syntax of the
language.
• Example: Credit scoring
–Differentiating between
low-risk & high-risk
customers from their
income & savings
Discriminant: IF income > θ1 AND savings > θ2 THEN
low-risk
ELSE
high-risk 6
Face Recognition
• Learning: it is training (adaptation) from data set
• Training examples of a person –
• Face recognition is
challenging because
of the effect of
facial expression,
• Test images lighting, occlusion,
make-up,
hair style, etc.

7
Learning Agents
Percepts

Actions 8
The Basic Learning Model
• A computer program is said to learn from experience
E with respect to some class of tasks T and
performance measure P,
– if its performance at tasks T, as measured by P, improves
with experience E.
• Learning agents consist of four main components:
– learning element -- the part of the agent responsible
for improving its performance
– performance element -- the part that chooses the
actions
to take
– critic – provides feedback for the learning element
how the agent is doing with respect to a performance
standard
training
- problem thegenerator -- suggests problems or actions that 9
system further.
Data sets preparation for learning
• Data Sets also k.as Samples, Examples or Instances
• Good data is a prerequisite for
producing effective models of any types of
problem.
• Usually, the given data set is divided into training
and test sets.
• Training set
– Used in supervised learning, a training set is a set
of problem instances (described as a set of
properties and their values), together with a
classification of the instance.
• Test set
– A set of instances and their classifications used
How to Split Data Sets into Training
 Holdout Method and Testing Sets?
– Given data is randomly partitioned into
two independent sets
o Training set (2/3) for model construction
o Test set (1/3) for accuracy estimation
- If many (thousands) of examples are available, including
several hundred examples from each class.

 Cross-Validation Method
-Randomly partition the data into k mutually
exclusive
subsets, each approximately equal size.
-Where k = 10 is most popular.
At i-th iteration, use Di as test set and others as
-set 11
- training
Use Cross-validation for small data
Cross Validation Examples

12
Example: 3-Fold Cross-Validation

 Data is partitioned into 3 sets

 3 experiments: Each with a different holdout set
for testing
Hold
Out Tra ining Performance=67%
(test)

Average Performance=
Hold
out Tra ining Performance=60%
(67+60+81)/3=69.3

Hold
Tra ining Performance=81%
out
Data sets preparation for learning…
- It is important that the test data is not used in any
way to create the classifier.

Generally,
– The larger the training data the better the
classifier

– The larger the test data the more accurate

the error estimate

14
Classification tasks- A two step process
 Model construction: (Learning step or training
step)
– Construct the classification model based on the training
data.
• Training Data
- A set of tuples
- Each tuple is assumed to belongs to a predefined class
- Labeled data(ground truth)
• How a classification model looks like?
-A classification model can be represented by one of the
ff forms:-
 classification rules,
 decision trees, or
 mathematical formulae
o Thus, model construction refers to describing a set15
Step 1: Model Construction

Classification
Algorithms
Trainin
g
Data

NAME RANK YEARS TENURED Classifier

Mike Assistant Prof 3 no (Model)
Mary Assistant Prof 7 yes
Bill Professor 2 yes
Jim Associate Prof 7 yes IF rank = ‘professor’
Dave Assistant Prof 6 no OR years > 6
THEN tenured = 1
Anne Associate Prof 3 no
‘yes’ 6
Classification tasks- A two step process…

 Model usage:
- Before using the model, we first need to test its
accuracy.
• Measuring model accuracy
- To measure the accuracy of a model we need
test
data.
- Test data is similar in its structure to training
data(labeled data).
• How to test?
• The known label of test samples is compared with
the classified result from the model.
17
Classification tasks- A two step process…
• Accuracy rate is the percentage of test set samples
that are correctly classified by the model

Number of correctclassifications
Accuracy  ,
Total number of test cases
• Important:- test data should be independent of training
set, otherwise over-fitting will occur.

• Using the model- if the accuracy is acceptable, use the

model to classify data tuples whose class labels are not
known

18
Step 2: Using the Model in
Prediction

Classifier
model

Testin
g Unseen Data
Data
(Jeff, Professor, 4)
NAM E RANK YEARS TENURED
Tom Assistant Prof 2 no Tenured?
M erlisa Associate Prof 7 no
George Professor 5 yes
Joseph Assistant Prof 7 yes 19
Classification:- Step1
Split data into train and test sets

20
Classification:- Step2
Build a model on a training set

21
Classification:- Step3
Evaluate on test set (Re-train?)

22
Confusion matrix
• A confusion matrix is useful tool for analyzing how well u’r classifier
can recognize tuples of different classes
• A confusion matrix displays the number of correct and incorrect
predictions made by the model compared with the actual
classifications in the test data.
Observe the following Confusion Matrix

Classified as Cancer Not Cancer

Actual Cancer 7(# of TP) 2(# of FN) To tal Dataset=14
Class Not Cancer 3(# of FP) 2(# of TN)

• The matrix is n-by-n, where n is the number of classes.

• The above confusion matrix can be used to calculate TP, FP,
Precision, Recall and F-Measure, and also Accuracy of the sy2s3
tem
Classifier performance Measure
Where:c1=“cancer”,c2=“not_cancer”

P=?????
N=????

24
Error Rate=1-accuracy
Performance Measure

Table-1 Confusion Matrix

for a 2-class problem

25
Example

2
6
Classifier performance Measure Examples

Compute the following performance measures from the

confusion matrix given below.
- Sensitivity, Specificity, Accuracy, Recall and Precision

Classes Yes NO
NO How many
Yes
90 210
210 data sets
Yes
NO 140 9560 are there in
140 9560
Sensitivity =
this
TP/P = 90/300 = 30%
Specificity = TN/N = 9560/9700 = 98.56% example?
Accuracy = (TP + TN)/(P+N) = 9650/10,000 =
96.50%
Error Rate= 1-Accuracy= 1- 0.965= 3.5%
Precision = TP/(TP+FP) = TP/N = 90/230 = 39.13% 27

Recall = TP/(TP+FN) = TP/P= 90/300 = 30.0%

Exercise
Compute the following performance measures from
the confusion matrix given below.

Sensitivity =TP/TP+FP
Specificity =
Accuracy =
Error Rate=
Precision =
Recall 28
Performance Measure
 Accuracy
- classifier accuracy: predicting class label
- predictor accuracy: guessing value of predicted
attributes
 Speed
- time to construct the model (training time)
- time to use the model (classification/prediction time)
 Robustness: handling noise and missing values

 Scalability: efficiency in disk-resident databases

 Interpretability
- understanding and insight provided by the model 29
Purpose of Evaluation
• The objective of learning classifications from
sample data is to classify and predict
successfully on new data

• The aim of evaluation is to estimate the true

error rate using a finite amount of data.

• The true error rate is defined as the error rate

of a classifier on an asymptotically large number
of new cases that converge in the limit to the
actual population distribution (i.e. it is an
inherently statistical measure). 30
Factors Affecting Performance of
Classifier
• There are several factors affecting the
performance:
- Types of training provided
- The learning algorithms used
- The type of feedback provided
- The form and extent of any initial background
knowledge

31
Sum-up
• What is learning?
• Discuss the learning agent Components?
• What is dataset? What is the need of splitting data
into training and test samples/instances?
• What are the methods to split dataset into training
and testing dataset?
• What are the two classification tasks and discus it.
• What is the purpose of confusion matrix?
• What are the commonly used measures for evaluating
the classification(Classifier model) performance
• Which factors affect the performance of classifier
model?
32
Types of learning

• Supervised learning
- classification, regression

• Unsupervised learning
- clustering
• Semi- Supervised learning
• Reinforcement learning

33
Types of learning
 Supervised learning – Classification
-Isthelearning process when the outcome variable
is
known.

-Data and corresponding labels are given.

-The outcome datasets are provided w/c are used to train

the machine and get the desired output.

-Occurs where a set of input/output pairs are explicitly

presented to the agent by a teacher
– The teacher provides a category label for each pattern in
a training set, then the learning algorithm finds a rule
that adoes
with newainput.
good job of predicting the output associated
34
Cont…
Classification:- Given a collection of records (training set ),
with each record containing a set of attributes, and one of
the attribute being the class.
-Goal: previously unseen records should be assigned a class
as accurately as possible.
- A test set is used to determine the accuracy of the model.

3
5
Supervised Learning

Example:-

36
Supervised learning
An example: data (loan application)

37
The learning Process
• Learn a classification model from the data
• Use the model to classify future loan applications
into
– Yes (approved) and
– No (not approved)
• What is the class for following case/instance?

38
Example

label
Classification: a finite set of
apple labels

apple

banana

Super
Example: Digit Recognition

40
Ranking example

Given a query
and a set of
web pages,rank
them
according
to relevance

41
Cont…

42
Applications
 Medical Diagnosis:- Predicting tumor
cells as benign or malignant

 Fraud Detection:- Classifying credit

card transactions
as legitimate or fraudulent

 Credit Approval
 Classifying secondary structures of
protein
as alpha-helix, beta-sheet, or
random
coil

 Categorizing news stories as finance, 43

Classification Examples
• In classification, we predict labels y (classes) for
inputs x
• Examples:
– OCR (input: images, classes: characters)
– Medical diagnosis (input: symptoms, classes: diseases)
– Automatic essay grader (input: document,
classes: grades)
– Fraud detection (input: account activity, classes:
fraud
/ no fraud)
– DNA and protein sequence identification
– Categorization and identification of astronomical
images
– Customer service email routing
– books
Recommended articles in a newspaper, 44

… recommended
many more
Types of learning…
• Unsupervised learning – Clustering
- The class labels of training data is unknown.
-Learning when there is no information about what the
correct outputs are.
– In unsupervised learning or clustering there is no explicit
teacher, the system forms clusters or natural groupings
of the input patterns.
- A form of learning by observation rather than learning by
examples
– Clustering is a technique for finding similarity
groups in data.
4
5
Cont…
• Thus Cluster Analysis
– Finding groups of objects such that the objects in a
group will be similar (or related) to one another and
different from (or unrelated to) the objects in
other groups
Inter-cluster
Intra-cluster distances
distances are are
minimized maximized
Example

Unsupervised learning: given data, i.e. examples, but

no labels 47
Examples
Example 1: Given a collection of text documents,
we want to organize them according to their
content similarities.

Example 2: groups people of similar sizes

together to make “small”, “medium” and “large”.

49
Unsupervised Learning

Example:-
Unsupervised learning 50
Types of learning
• Reinforcement learning (RL): an agent interacting with
the world makes observations, takes actions, & is
rewarded or punished; it should learn to choose actions
in order to obtain a lot of reward.

• Examples
– Game playing: player knows whether it win or lose, but
not know how to move at each step
– Control: a traffic system can measure the delay of cars,
but not know how to decrease it.

51
RL is learning from interaction

52
Example

Backgammon

WIN!

LOSE!

Given sequences of moves and whether or not

the player won at the end, learn to
moves
make good 53
Summary about types of Learning

• Clustering is a machine learning technique

that finds similarities between data
according to the characteristics found in the
data & groups similar data objects into one
cluster
Learning methods
• There are various learning methods.
Popular learning techniques include the following.
– Decision tree : divide decision space into
piecewise constant regions.
– K- Nearest Neighbour:- classify based on
similarity measurements
– Neural networks: partition by non-linear boundaries
– Bayesian network: a probabilistic model
– Regression: (linear or any other polynomial)
– Support vector machine: solves non-linearly
separable problems.
– Expectation maximization algorithm
55
The End

አመሰግናለሁ !!!
56

ActWin Getting Started - R2 Software PDF
No ratings yet
ActWin Getting Started - R2 Software PDF
7 pages
Ai CH4
No ratings yet
Ai CH4
27 pages
Xchapter 1
No ratings yet
Xchapter 1
31 pages
L7.1.AI
No ratings yet
L7.1.AI
127 pages
Chapter 01 Introduction To Machine Learning
No ratings yet
Chapter 01 Introduction To Machine Learning
59 pages
Classifiers (Support Vector Machines, Decision Trees, Nearest Neighbor Classification)
No ratings yet
Classifiers (Support Vector Machines, Decision Trees, Nearest Neighbor Classification)
16 pages
CH-5_ML
No ratings yet
CH-5_ML
36 pages
Unit 5 Classification PDF
No ratings yet
Unit 5 Classification PDF
131 pages
6.Data Mining - Classification Ppt
No ratings yet
6.Data Mining - Classification Ppt
37 pages
19_ML_intro
No ratings yet
19_ML_intro
33 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
56 pages
19-Introduction classification algorithm-18-09-2024
No ratings yet
19-Introduction classification algorithm-18-09-2024
102 pages
Unit 4 Classification
No ratings yet
Unit 4 Classification
87 pages
Classification & Prediction: - Shailesh Yadav Central University of Rajasthan
No ratings yet
Classification & Prediction: - Shailesh Yadav Central University of Rajasthan
28 pages
Unit Ii
No ratings yet
Unit Ii
118 pages
Unit 4 Learning
No ratings yet
Unit 4 Learning
100 pages
Outline: - Learning Agents - Inductive Learning - Decision Tree Learning
No ratings yet
Outline: - Learning Agents - Inductive Learning - Decision Tree Learning
30 pages
19 ML Intro
No ratings yet
19 ML Intro
31 pages
3ML.02.MainConcepts_Evaluation
No ratings yet
3ML.02.MainConcepts_Evaluation
35 pages
NLP Chapter 2
No ratings yet
NLP Chapter 2
79 pages
Pattern Recognition Application
No ratings yet
Pattern Recognition Application
43 pages
ClassificationandPrediction_Module3
No ratings yet
ClassificationandPrediction_Module3
88 pages
Evaluation of Predictive Models Final
No ratings yet
Evaluation of Predictive Models Final
6 pages
2 Supervised Learning
No ratings yet
2 Supervised Learning
48 pages
Chapter 19
No ratings yet
Chapter 19
30 pages
Machine Learning Cheatsheet
No ratings yet
Machine Learning Cheatsheet
12 pages
AI Chapter 6
No ratings yet
AI Chapter 6
28 pages
IntroClassificationDA-2024
No ratings yet
IntroClassificationDA-2024
129 pages
For Unit 4 Useful
100% (1)
For Unit 4 Useful
107 pages
Basics of ML and Evaluation
No ratings yet
Basics of ML and Evaluation
42 pages
Unit 3 (DWDM)
No ratings yet
Unit 3 (DWDM)
23 pages
Lecture 9
No ratings yet
Lecture 9
27 pages
Machine Learning Notes
100% (3)
Machine Learning Notes
134 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
Machine Learning INTRO
No ratings yet
Machine Learning INTRO
12 pages
06 Learning
No ratings yet
06 Learning
51 pages
Classification
No ratings yet
Classification
33 pages
Unit6 -1 Classification-and-Prediction-Basics_3a2ac6b1-316a-4e6b-b18f-efed2317596b
No ratings yet
Unit6 -1 Classification-and-Prediction-Basics_3a2ac6b1-316a-4e6b-b18f-efed2317596b
12 pages
Supervised Machine Learning Algorithm
100% (1)
Supervised Machine Learning Algorithm
111 pages
Data Mining 4th Is
No ratings yet
Data Mining 4th Is
24 pages
Machine Learning Models: by Mayuri Bhandari
No ratings yet
Machine Learning Models: by Mayuri Bhandari
48 pages
AAI Lecture 9 Sp 25
No ratings yet
AAI Lecture 9 Sp 25
26 pages
Chapter 7 - LAST
No ratings yet
Chapter 7 - LAST
29 pages
Introduction Class
No ratings yet
Introduction Class
134 pages
86 37 196 Mod 5
No ratings yet
86 37 196 Mod 5
52 pages
Machine
No ratings yet
Machine
61 pages
Chp8 Classification Basic Concepts - Lecture#8
No ratings yet
Chp8 Classification Basic Concepts - Lecture#8
40 pages
08 Class Basic
No ratings yet
08 Class Basic
141 pages
Big Data Analytics - Unit 3
No ratings yet
Big Data Analytics - Unit 3
55 pages
Chapter 3 Model Evaluation Final
No ratings yet
Chapter 3 Model Evaluation Final
30 pages
Chapter 4 Classification
No ratings yet
Chapter 4 Classification
78 pages
5 - Model For Predictions - ML
No ratings yet
5 - Model For Predictions - ML
52 pages
Unit 6-Feature Engineering and Sensitivity Analysis
No ratings yet
Unit 6-Feature Engineering and Sensitivity Analysis
63 pages
Classification
No ratings yet
Classification
53 pages
Module 6
No ratings yet
Module 6
24 pages
Machine Learning - Unit - 1
100% (1)
Machine Learning - Unit - 1
58 pages
AAM ans
No ratings yet
AAM ans
3 pages
0 Machine Learning Overview and Metrics LT
No ratings yet
0 Machine Learning Overview and Metrics LT
84 pages
Measurement - Task Sheets Gr. 3-5
From Everand
Measurement - Task Sheets Gr. 3-5
Chris Forest
No ratings yet
Data Analysis & Probability - Drill Sheets Gr. 6-8
From Everand
Data Analysis & Probability - Drill Sheets Gr. 6-8
Chris Forest
No ratings yet
Data Analysis & Probability - Task & Drill Sheets Gr. 6-8
From Everand
Data Analysis & Probability - Task & Drill Sheets Gr. 6-8
Tanya Cook and Chris Forest
No ratings yet
Chapter_3_descion_&_repeation_statements_&_Arrays_1 (1)
No ratings yet
Chapter_3_descion_&_repeation_statements_&_Arrays_1 (1)
48 pages
Chapter 4- Functions
No ratings yet
Chapter 4- Functions
18 pages
OOSE-2
No ratings yet
OOSE-2
47 pages
SoftwareQualityAssurance-1
No ratings yet
SoftwareQualityAssurance-1
41 pages
Object Oriented Design
No ratings yet
Object Oriented Design
82 pages
Lect 4
No ratings yet
Lect 4
34 pages
Ch2-Architecture
No ratings yet
Ch2-Architecture
29 pages
Ch3-Processes
No ratings yet
Ch3-Processes
32 pages
Introduction To Real Time and Embedded System
No ratings yet
Introduction To Real Time and Embedded System
36 pages
Solving Problems by Searching Final
No ratings yet
Solving Problems by Searching Final
69 pages
Solutions
No ratings yet
Solutions
27 pages
Abetic Apps BD
No ratings yet
Abetic Apps BD
2 pages
Getting Started With Python Programming
100% (11)
Getting Started With Python Programming
1,484 pages
2089 340065 Multimeter Mastech Ms8240c
No ratings yet
2089 340065 Multimeter Mastech Ms8240c
11 pages
Mejoras Asce 41 17
No ratings yet
Mejoras Asce 41 17
10 pages
Chemistry Test
No ratings yet
Chemistry Test
11 pages
Aws Developer Guide
No ratings yet
Aws Developer Guide
784 pages
07-01-2022 - EEE3100S 2022 November EXAM - 2
No ratings yet
07-01-2022 - EEE3100S 2022 November EXAM - 2
6 pages
Manual Combina Frigorifica Candy Alba
No ratings yet
Manual Combina Frigorifica Candy Alba
230 pages
Install Guide
No ratings yet
Install Guide
17 pages
Gr6 - MATH - Posttest
No ratings yet
Gr6 - MATH - Posttest
6 pages
Math Word Problem Homework Help
100% (1)
Math Word Problem Homework Help
7 pages
Tablero de Interconexión Smart-Box
No ratings yet
Tablero de Interconexión Smart-Box
12 pages
Assignment No. 01: Human Digestive System and Respiratory System
No ratings yet
Assignment No. 01: Human Digestive System and Respiratory System
3 pages
100 SAP ABAP Interview Questions
No ratings yet
100 SAP ABAP Interview Questions
10 pages
FCA LP.7M001 Tension test and Calculation of n and r ed. 02.2015
No ratings yet
FCA LP.7M001 Tension test and Calculation of n and r ed. 02.2015
2 pages
Google API Flow
No ratings yet
Google API Flow
7 pages
Top Tronic Hellerman Tylon TDDGT Digital Programmable Geyser Timer
No ratings yet
Top Tronic Hellerman Tylon TDDGT Digital Programmable Geyser Timer
1 page
Lesson 3 - Inertia: The Meaning of Inertia
100% (1)
Lesson 3 - Inertia: The Meaning of Inertia
7 pages
INDR 372 PS EXERCISES, May 13, 2022: - (Z) - (Z) - (Z) - (Z)
No ratings yet
INDR 372 PS EXERCISES, May 13, 2022: - (Z) - (Z) - (Z) - (Z)
5 pages
Hurricane Gust Factors
No ratings yet
Hurricane Gust Factors
2 pages
Dövrələr Nəzəriyyəsi
No ratings yet
Dövrələr Nəzəriyyəsi
10 pages
Deg 001
No ratings yet
Deg 001
16 pages
Change
No ratings yet
Change
31 pages
Ketones
No ratings yet
Ketones
17 pages
1288 Au24 M2 Practice
No ratings yet
1288 Au24 M2 Practice
2 pages
Buchanan 1969 1999 Cost and Choice An Inquiry in Economic Theory
No ratings yet
Buchanan 1969 1999 Cost and Choice An Inquiry in Economic Theory
120 pages
PCon - Planner 8.1 Features
No ratings yet
PCon - Planner 8.1 Features
11 pages
Measurement Size
No ratings yet
Measurement Size
2 pages

Chap 5 Learning

Uploaded by

Chap 5 Learning

Uploaded by

LEARNING AGENTS

• Data set preparation for learning

Male Hair Fema Hair

 Data is partitioned into 3 sets

– The larger the test data the more accurate

NAME RANK YEARS TENURED Classifier

• Using the model- if the accuracy is acceptable, use the

Classified as Cancer Not Cancer

• The matrix is n-by-n, where n is the number of classes.

Table-1 Confusion Matrix

Compute the following performance measures from the

Recall = TP/(TP+FN) = TP/P= 90/300 = 30.0%

 Scalability: efficiency in disk-resident databases

• The aim of evaluation is to estimate the true

• The true error rate is defined as the error rate

-Data and corresponding labels are given.

-The outcome datasets are provided w/c are used to train

-Occurs where a set of input/output pairs are explicitly

 Fraud Detection:- Classifying credit

 Categorizing news stories as finance, 43

Unsupervised learning: given data, i.e. examples, but

Example 2: groups people of similar sizes

Given sequences of moves and whether or not

• Clustering is a machine learning technique

You might also like