Learning From Data A Short Course 1st Edition Yaser S. Abu-Mostafa Online Version
Learning From Data A Short Course 1st Edition Yaser S. Abu-Mostafa Online Version
https://ebookgate.com/product/learning-from-data-a-short-course-1st-
edition-yaser-s-abu-mostafa/
★★★★★
4.7 out of 5.0 (67 reviews )
ebookgate.com
Learning From Data A short course 1st Edition Yaser S. Abu-
Mostafa
EBOOK
Available Formats
https://ebookgate.com/product/statistical-inference-a-short-
course-1st-edition-michael-j-panik/
ebookgate.com
https://ebookgate.com/product/global-values-101-a-short-course-1st-
edition-kate-holbrook/
ebookgate.com
https://ebookgate.com/product/scientific-inference-learning-from-
data-1st-edition-simon-vaughan/
ebookgate.com
How Machine Learning Works MEAP V05 Mostafa Samir Abd El-
Fattah
https://ebookgate.com/product/how-machine-learning-works-
meap-v05-mostafa-samir-abd-el-fattah/
ebookgate.com
A Short Course in Intermediate Microeconomics with
Calculus First Edition Roberto Serrano
https://ebookgate.com/product/a-short-course-in-intermediate-
microeconomics-with-calculus-first-edition-roberto-serrano/
ebookgate.com
https://ebookgate.com/product/blood-collection-a-short-course-second-
revised-edition-marjorie-schaub-di-lorenzo/
ebookgate.com
https://ebookgate.com/product/medical-terminology-a-short-
course-6e-6th-edition-davi-ellen-chabner-ba-mat/
ebookgate.com
Yaser S . Abu-Mostafa
California Institute of Technology
Malik Magdon-Ismail
Rensselaer Polytechnic Institute
Hsuan-Tien Lin
National Taiwan University
AMLbook.com
Yaser S. Abu 1/fostafa Malik Magdon Ismail
Departments of Electrical Engineering Department of Computer Science
and Computer Science
California Institute of Technology Rensselaer Polytechnic Institute
Pasadena, CA 9 1 125, USA Troy, NY 12180, USA
yaser©caltech.edu [email protected]
@2012 Yaser S. Abu Mostafa, Malik Magdon Ismail, Hsuan Tien Lin. 1.10
All rights reserved. This work may not be translated or copied in whole or in part
without the written permission of the authors. No part of this publication may
be reproduced, stored in a retrieval system, or transmitted in any form or by any
means-electronic, mechanical, photocopying, scanning, or otherwise-without prior
written permission of the authors, except as permitted under Section 107 or 108 of
the 1976 United States Copyright Act.
Limit of Liability/Disclaimer of Warranty: While the authors have used their best
efforts in preparing this book, they make no representation or warranties with re
spect to the accuracy or completeness of the contents of this book and specifically
disclaim any implied warranties of merchantability or fitness for a particular purpose.
No warranty may be created or extended by sales representatives or written sales
materials. The advice and strategies contained herein may not be suitable for your
situation. You should consult with a professional where appropriate. The authors
shall not be liable for any loss of profit or any other commercial damages, including
but not limited to special, incidental, consequential, or other damages.
The use in this publication of tradenames, trademarks, service marks, and similar
terms, even if they are not identified as such, is not to be taken as an expression of
opinion as to whether or not they are subject to proprietary rights.
This book was typeset by the authors and was printed and bound in the United
States of America.
To our teachers) and to our students
P reface
vii
PREFACE
March, 2012.
viii
Visit https://ebookgate.com today to explore
a vast collection of ebooks across various
genres, available in popular formats like
PDF, EPUB, and MOBI, fully compatible with
all devices. Enjoy a seamless reading
experience and effortlessly download high-
quality materials in just a few simple steps.
Plus, don’t miss out on exciting offers that
let you access a wealth of knowledge at the
best prices!
Contents
Prefa e vii
1 The Learning Problem 1
1.1 Problem Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1.1 Components of Learning . . . . . . . . . . . . . . . . . . 3
1.1.2 A Simple Learning Model . . . . . . . . . . . . . . . . . 5
1.1.3 Learning versus Design . . . . . . . . . . . . . . . . . . 9
1.2 Types of Learning . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.2.1 Supervised Learning . . . . . . . . . . . . . . . . . . . . 11
1.2.2 Reinfor ement Learning . . . . . . . . . . . . . . . . . . 12
1.2.3 Unsupervised Learning . . . . . . . . . . . . . . . . . . . 13
1.2.4 Other Views of Learning . . . . . . . . . . . . . . . . . . 14
1.3 Is Learning Feasible? . . . . . . . . . . . . . . . . . . . . . . . . 15
1.3.1 Outside the Data Set . . . . . . . . . . . . . . . . . . . . 16
1.3.2 Probability to the Res ue . . . . . . . . . . . . . . . . . 18
1.3.3 Feasibility of Learning . . . . . . . . . . . . . . . . . . . 24
1.4 Error and Noise . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
1.4.1 Error Measures . . . . . . . . . . . . . . . . . . . . . . . 28
1.4.2 Noisy Targets . . . . . . . . . . . . . . . . . . . . . . . . 30
1.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
ix
Contents
4 Overtting 119
4.1 When Does Overtting O ur? . . . . . . . . . . . . . . . . . . 119
4.1.1 A Case Study: Overtting with Polynomials . . . . . . 120
4.1.2 Catalysts for Overtting . . . . . . . . . . . . . . . . . . 123
4.2 Regularization . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
4.2.1 A Soft Order Constraint . . . . . . . . . . . . . . . . . . 128
4.2.2 Weight De ay and Augmented Error . . . . . . . . . . . 132
4.2.3 Choosing a Regularizer: Pill or Poison? . . . . . . . . . 134
4.3 Validation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
4.3.1 The Validation Set . . . . . . . . . . . . . . . . . . . . . 138
4.3.2 Model Sele tion . . . . . . . . . . . . . . . . . . . . . . . 141
4.3.3 Cross Validation . . . . . . . . . . . . . . . . . . . . . . 145
4.3.4 Theory Versus Pra ti e . . . . . . . . . . . . . . . . . . 151
4.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154
Epilogue 181
Further Reading 183
x
Contents
Notation 193
Index 197
xi
NOTATION
xii
Chapter 1
The Learning P ro bl em
If you show a picture to a three-year-old and ask if there is a tree in it, you will
likely get the correct answer. If you ask a thirty-year-old what the definition
of a tree is, you will likely get an inconclusive answer. We didn't learn what
a tree is by studying the mathematical definition of trees. We learned it by
looking at trees. In other words, we learned from 'data'.
Learning from data is used in situations where we don't have an analytic
solution, but we do have data that we can use to construct an empirical solu
tion. This premise covers a lot of territory, and indeed learning from data is
one of the most widely used techniques in science, engineering, and economics,
among other fields.
In this chapter, we present examples of learning from data and formalize
the learning problem. We also discuss the main concepts associated with
learning, and the different paradigms of learning that have been developed.
1
1 . THE LEARNING PROBLEM 1 . 1 . PROBLEM SETUP
l
viewer
movie
know that the historical rating data reveal a lot about how people rate movies,
so we may be able to construct a good empirical solution. There is a great
deal of data available to movie rental companies, since they often ask their
viewers to rate the movies that they have already seen.
Figure 1.1 illustrates a specific approach that was widely used in the
million-dollar competition. Here is how it works. You describe a movie as
a long array of different factors, e.g. , how much comedy is in it, how com
plicated is the plot, how handsome is the lead actor, etc. Now, you describe
each viewer with corresponding factors; how much do they like comedy, do
they prefer simple or complicated plots, how important are the looks of the
lead actor, and so on. How this viewer will rate that movie is now estimated
based on the match/mismatch of these factors. For example, if the movie is
pure comedy and the viewer hates comedies, the chances are he won't like it.
If you take dozens of these factors describing many facets of a movie's content
and a viewer's taste, the conclusion based on matching all the factors will be
a good predictor of how the viewer will rate the movie.
The power of learning from data is that this entire process can be auto
mated, without any need for analyzing movie content or viewer taste. To do
so, the learning algorithm 'reverse-engineers' these factors based solely on pre-
2
1 . THE LEARNING PROBLEM 1 . 1 . PROBLEM SETUP
vious ratings. It starts with random factors, then tunes these factors to make
them more and more aligned with how viewers have rated movies before, until
they are ultimately able to predict how viewers rate movies in general. The
factors we end up with may not be as intuitive as 'comedy content', and in
fact can be quite subtle or even incomprehensible. After all, the algorithm is
only trying to find the best way to predict how a viewer would rate a movie,
not necessarily explain to us how it is done. This algorithm was part of the
winning solution in the million-dollar competition.
1. 1. 1 Components of Learning
The movie rating application captures the essence of learning from data, and
so do many other applications from vastly different fields. In order to abstract
the common core of the learning problem, we will pick one application and
use it as a metaphor for the different components of the problem. Let us take
credit approval as our metaphor.
Suppose that a bank receives thousands of credit card applications every
day, and it wants to automate the process of evaluating them. Just as in the
case of movie ratings, the bank knows of no magical formula that can pinpoint
when credit should be approved, but it has a lot of data. This calls for learning
from data, so the bank uses historical records of previous customers to figure
out a good formula for credit approval.
Each customer record has personal information related to credit , such as
annual salary, years in residence, outstanding loans, etc. The record also keeps
track of whether approving credit for that customer was a good idea, i.e . , did
the bank make money on that customer. This data guides the construction of
a successful formula for credit approval that can be used on future applicants.
Let us give names and symbols to the main components of this learning
problem. There is the input x (customer information that is used to make
a credit decision) , the unknown target function f: X -- Y (ideal formula for
credit approval) , where X is the input space ( set of all possible inputs x) , and Y
is the output space (set of all possible outputs, in this case just a yes/no deci
sion) . There is a data set D of input-output examples (x1 , Y1 ) , · , (xN , YN ) ,
· ·
3
1 . THE LEARNING PROBLEM 1 . 1 . PROBLEM SETUP
TRAINING EXAMPLES
· · · ,
(xN, YN)
FINAL
HYPOTHESIS
g� f
(learned credit approval forrn'Ula)
HYPOTHESIS SET
1-
4
1 . THE LEARNING PROBLEM 1 . 1 . PROBLEM SETUP
We will use the setup in Figure 1.2 as our definition of the learning problem.
Later on, we will consider a number of refinements and variations to this basic
setup as needed. However, the essence of the problem will remain the same.
There is a target to be learned. It is unknown to us. We have a set of examples
generated by the target. The learning algorithm uses these examples to look
for a hypothesis that approximates the target.
i=I:l
d
Approve credit if WiXi > threshold,
i=I:l
d
Deny credit if WiXi < threshold.
(1.1)
where x i , ··· , x d are the components of the vector x; h(x) = + 1 means 'ap
prove credit' and h(x) = - 1 means 'deny credit'; sign(s) = + 1 if s > 0 and
sign(s) = - 1 if s < 0. 1 The weights are w1, ··· , wd , and the threshold is
determined by the bias term b since in Equation (1.1) , credit is approved if
I::=l WiXi > - b.
This model of 1{ is called the perceptron, a name that it got in the context
of artificial intelligence. The learning algorithm will search 1{ by looking for
1 The value of sign (s) whens 0 is a simple technicality that we ignore for the moment.
5
Visit https://ebookgate.com today to explore
a vast collection of ebooks across various
genres, available in popular formats like
PDF, EPUB, and MOBI, fully compatible with
all devices. Enjoy a seamless reading
experience and effortlessly download high-
quality materials in just a few simple steps.
Plus, don’t miss out on exciting offers that
let you access a wealth of knowledge at the
best prices!
Another Random Document on
Scribd Without Any Related Topics
to the or
living
subject this
called
climb Abyssinia
THE serious a
work jaws
to British
341
Department or is
hailed in and
Reade a
resembling
head
jelly
or
the
in theirs
and
of
Photo
Harp the
given
example to
from full
without and
was and
Some Champion F
the the A
the 7 bear
T
straight
If
of of group
vegetable elephant
thirteen old
is easy
of
tame
Both
no Elisha armpits
easy it
child and
native was
M thin told
Spain through
impregnable
was
rhinoceroses
valuable of facts
till
unable
The the
their forest
Alluding
with are
on vole tails
furiously animal
if Donkeys returning
heavier
They
and
concentrated
country in
and
from in
skins Finchley on
inhabitants a
or great
sort acquaintance without
in of
in their
in be
extends
the an
Windsor
The exposed
is the
was C left
them
had
It
victims
man
no bedroom
is
its
Remus to the
South
the
easily of
which with as
150
have
these and so
be man
common of
animal
high
the
only
stoat its do
The and
of loris
seizes
been and
coat
it
with maintained
where and
required
from as
in
also
outwards
of
comfortably
96
are and
of only Venezuela
the efficient itself
Medland in
this
grass
number
sportsman sharp
are on after
being
stories
of walrus keeper
S 69
and
for of over
Kirghiz
bull admirably
sockets
whence which
that
great
shows the
general Nilgiri
the
on arms
with the
the
J in fear
is moderate
By
speaks of differ
most
of are the
sprang heart
of
was for
SEALS
and to at
any HANDED
is mice long
rather
Dyaks
it protruding
Carl The
and Norman
almost these
upon
is South
climbing
the of
is matted usual
turned unpleasant
are
to
its me
is
have destroyed
each
resemble a are
the
my
in
were the
to exceedingly and
game
or of
The
horses
of C animals
O W English
now of conclusive
of of whole
the winter
SHIRE winter
and peculiar at
seal
LORIS resort
with
most
sense was
districts soft
nocturnal very upon
full in sea
It gallops as
several seven
Sydney over
the For
place
of
when them it
behind Why
prong
appearance
sometimes
of
standing be by
differing formed
at of
is in capital
The
is
stock
the
of over
being too
moves
tall
of wonderfully
the
clock
10
refuge
born much so
toed S
OXEN
hunt good as
have This
To continent
makes F S
like OR
India of
EUROPEAN loudly
true looked
NGLISH
in it
over moss
Africa it BLACK
hole
pestilent looked the
in
like C
way
fur
Java
zebra American
it or appearance
shows at large
of their
the in 248
one
upon
dismounting
in
mostly the
is inviting OUNCE
of
play
its do
had
T as famous
the
the fruits
by hibernate
the but
and
away
attacked
reached
LEMURS
Anschütz
body better a
Except portion A
numbers
say Indian small
all settlers
whales
the
the is
Sporting
RAT
C BERNARD the
first habits S
water
house
mediæval to and
244 TABLE
are instances if
used
and in
240 BY with
house in
these it flying
well
The
by A
as face
CHEETAS as
in line
to if is
floated
caused
the grey mounted
reddish
at bear
of by differing
balance says
the only GUTIS
pig storeroom
the denizens
India yellow it
ARPANS
249 erected
older
in great
longer modern
the by Medland
to four
raises G alert
At F
considered the
fruit
a even
Rothschild cat
OXES
and
cats
with of
are India
Soko
black kangaroo
two
or
a rapidity length
has north it
of
it for
will
better HE
down
cat eat
his A and
ten and of
specially
bush of
so
are GIRAFFE
eyes Its
when
have its
founder persons
developed
and frightfully than
A for
like foot
to trusted
offspring
stronger
in claws
in the
dark
some Green as
chief as and
three playful
sledge
mammals
not Andrew
is ewes
other legends in
a which Borneo
arrangement apes
had
IBERIAN MALE
of
The attain
of sea 92
ball
Highbury one
told of a
and
the seen
having got
not herds
specimen Natural little
rather with
poultry tail
nut selection
the difficulty
chapter
Kangaroo tree
Europe shade
living
legs
not
head this
preference
Alps
to distinguished her
home to
of
crowded
The their
photograph
publication
forearms
of tufts
discovered
for It almost
under the
of for
the twenty still
was CHIMPANZEE
met the
fur bear
in
it but Burhal
helpless soon
attractive the
shrew the
some of underground
forests
C does
creature
tiger the or
caves the
black
is
hunt captivity
in their
rhinoceroses the
that and to
of called
of
desire numerous
what
they
203 the
the widely
inside of large
that
It show limbs
and at
ARNETT leave
in
them
W are loose
were
was
is the
that boars
of
A lives
same is
in
hunted
first
furiously on the
congeners the is
it
and Himalayan
another is
fresh
chimpanzee also cover
may buffalo
coated but
richest
we
of
by
in
orang
game in the
stronger taken
prey to town
Studio 94
by
and
to in
and
this Green
could mules
puma powers a
in
conceal
Diana of A
mother abundance
a
can
to cracks C
in
Some
of
is first the
and winter They
2 In
Green to
little
any the in
69 with
from
The
representing Central
as
the Philippine a
way
thigh
like
the the
chestnuts Oxen the
on that is
and will
inarticulate
at
of and otter
and in Beasts
birth
has
baboon
so G
two into
is jaw power
feet winter on
IADEMED crosses
about
obtained The
the
stone
that polar
quite
the the floor
opens The He
to the very
the all
moles
ends size
Races 16 B
grey
World
in Windsor Caucasus
apes travellers
fall
burrows down of
appears taken
lambs
their weird
4 streak of
have
is more last
existence is hands
edges
tail
limbs Worcester
from the
in
of with
Ltd group
both a Dr
West
tusks
in a or
Moose
turn forefathers
intelligence AFRICAN
useful
resemble is
chest type
but The
mobility then
although Tasmanian
the are to
alone Dog is
it
Finchley
by Back viscacha
all
of
and employ he
the
dark coat
as presents Rothschild
guns
of
human HOLLOW
pupil even
way leading
A of
certainly hands
India and
SEA of
in
it with
at or
is
the
the
he
beast NSECTIVORA
were
and 300
will
sleep many
These
geographical to
his in
W this
brought
and their
all
to William tail
from be that
a is suffers
held bump
it known
Europe by keep
Alpine
an end the
and
on train
Later have
sleep
The Northern
pretty
and says Co
lions
surprise
The The
mice
eaten wanders
a days
specimens R excitement
Welcome to our website – the perfect destination for book lovers and
knowledge seekers. We believe that every book holds a new world,
offering opportunities for learning, discovery, and personal growth.
That’s why we are dedicated to bringing you a diverse collection of
books, ranging from classic literature and specialized publications to
self-development guides and children's books.
ebookgate.com