
Machine Learning and Data Analytics

Fundamentals – Part 2

Dr. Rossana Cavagnini

Deutsche Post Chair – Optimization of Distribution Networks (DPO)


RWTH Aachen University

[email protected]

Agenda

1 Machine learning categories

2 What learning means

3 Why and how to estimate f?


1. Supervised learning

Learn a model for predicting or estimating an output based on one or more inputs
For each observation of the predictors, there is an associated response measurement
(labeled training data)
Regression and classification tasks
Examples:
Linear regression
Logistic regression
Boosting
Support Vector Machines
...


1.a Regression problems

Problems with a quantitative response


Quantitative variables (numerical values)


Example: wage data


Predict the wage based on age, education level, and calendar year.

[Figure: three scatter plots of Wage against Age (20–80), Year (2003–2009), and Education Level (1–5)]
Lots of variability → combine the three features


Methodologies: linear regression, but non-linear relationship between wage and age!


1.b Classification problems

Problems with a qualitative (categorical) response


Qualitative variables (values of one of K different classes, categories)


Example: stock market data


Predict whether a market index will increase or decrease based on its performance over
the previous one, two, and three days.

[Figure: boxplots of the percentage change in the S&P over the previous one, two, and three days, grouped by Today's Direction (Down/Up)]

Will the answer fall into the Up or Down bucket?


There is no simple strategy for using yesterday’s movement to predict today’s returns.

Regression vs classification

Sometimes the difference between a regression and a classification task is not clear
from the beginning
Classification tasks can be interpreted as estimating the probability that an element
has a given label
However, theory and tools are very different!
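The probability view of classification can be sketched with a logistic model: a linear score is squashed into [0, 1] as an estimated class probability, then thresholded into a label. The coefficients below are invented for illustration, not learned from any data.

```python
import math

def predict_proba(x, beta0=-0.5, beta1=1.2):
    """Estimated probability that observation x has the label 'Up'."""
    return 1.0 / (1.0 + math.exp(-(beta0 + beta1 * x)))

def predict_label(x, threshold=0.5):
    """Turn the probability estimate into a categorical prediction."""
    return "Up" if predict_proba(x) >= threshold else "Down"
```

Changing `threshold` trades one kind of misclassification against the other without refitting anything, which is one way the probability view pays off in practice.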


2. Unsupervised learning
There are inputs but no supervising output
We can learn relationships and structure from data
We observe only features and have no measurements of the outcome
There is no response variable to predict


Example: Market segmentation study


Dataset with zip code and family income for customers. Determine whether there are
clusters of customers (goal: are there spending patterns?).

[Figure: two scatter plots of customers in the (X1, X2) plane, showing potential clusters]

If we had the spending patterns among the observed variables → supervised task
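A minimal clustering sketch in the spirit of this example: a hand-rolled k-means with k = 2 on made-up (X1, X2) points. The data and the naive deterministic initialization are assumptions for illustration, not the slide's dataset.

```python
import math

def kmeans(points, k=2, iters=20):
    # Naive deterministic initialization (assumes k = 2):
    # use the first and last point as starting centers.
    centers = [points[0], points[-1]]
    clusters = []
    for _ in range(iters):
        # Assign each point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda c: math.dist(p, centers[c]))
            clusters[i].append(p)
        # Move each center to the mean of its cluster.
        centers = [
            tuple(sum(v) / len(c) for v in zip(*c)) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters

# Two obvious groups of customers in (X1, X2) space.
pts = [(1, 1), (1.2, 0.8), (0.9, 1.1), (8, 8), (8.2, 7.9), (7.8, 8.1)]
centers, clusters = kmeans(pts)
```

A real k-means uses random restarts and a convergence check; scikit-learn's KMeans is the usual tool for this kind of segmentation study.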


What learning means


X : input variables (predictors, independent variables, features, ...)
Y : output variables (response variables, dependent variables, target, ...)
Hypothesis: there is a certain hidden (unknown) relation between the inputs and the
outputs. Let’s estimate it!
Learning refers to the set of approaches for estimating f

Y = f(X) + ε

f: fixed, unknown function of X1 , . . . , Xp (the systematic information that X provides about Y )

ε: error term, standing in for complex real-world relations vs. simplifications and assumptions
- independent of X : properties of Y which cannot be inferred from the features
- mean zero (E(ε) = 0): over (infinitely) many observations, the effects of the error average out

Example for the error term


Describe the grade obtained in a class as a function of some inputs: share of class
attendance, number of hours of study, student’s GPA, number of weekends spent
partying, availability of a quiet room to study
These features are not enough (e.g., a student gets a cold and performs poorly) →
uncertainty captured in ε
The error ε:
is independent of the other variables: we cannot guess whether a student will have a bad
exam day (e.g., a cold) from the other input features.
has zero mean: on average, over a very large number of observations, there will be as
many students with unlucky days as with lucky ones
We could reduce this error by increasing the number of features (ex. add the feature
“body temperature on the exam day”)
But we could still miss something...
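The two properties of ε can be checked numerically. Below is a small simulation under an invented f and noise scale; in a real dataset ε is unobserved, the point here is only to make the zero-mean assumption visible.

```python
import random

random.seed(42)
f = lambda x: 2.0 * x + 1.0                   # invented "true" systematic part
xs = [random.uniform(0, 10) for _ in range(100_000)]
eps = [random.gauss(0.0, 1.0) for _ in xs]    # drawn independently of X
ys = [f(x) + e for x, e in zip(xs, eps)]      # Y = f(X) + eps

mean_eps = sum(eps) / len(eps)                # close to 0 over many observations
```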


Example: advertising data


Sales of a product in 200 different markets, with advertising budgets for the product
in each of those markets for three different media: TV, radio, and newspaper.
Develop a model to predict sales based on the three media budgets.

[Figure: three scatter plots of Sales against TV, Radio, and Newspaper advertising budgets]

X : budget
X1 : TV budget
X2 : radio budget
X3 : newspaper budget
Y : sales

Example: income data


Develop a model to predict the income based on the years of education.

X : years of education
Y : income

[Figure: scatter plot of Income vs. Years of Education. Blue line: true relationship (unknown); black segments: errors (approx. mean zero)]

f may involve more than one input variable

[Figure: 3D surface of Income as a function of Years of Education and Seniority]


1. Prediction

Goal: find a good fˆ yielding good predictions Ŷ and keeping the error as low as
possible
fˆ: estimate for f
Ŷ : resulting prediction for Y
Since the error term averages to zero:
Ŷ = fˆ(X )
The accuracy of Ŷ as a prediction of Y depends on:
reducible error: improve the accuracy of fˆ by choosing the most appropriate learning
technique (if fˆ = f , this term disappears)
irreducible error: even if fˆ = f , this term remains, because Y is also a function of ε.
Examples: unmeasured variables, unmeasured variations...
→ We will study techniques for estimating f while minimizing the reducible error
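The split into reducible and irreducible error can be made concrete with a simulation. Here f, the noise level, and the two candidate estimates are all made up: a perfect estimate hits the irreducible floor of σ², while a biased one adds a reducible part on top.

```python
import random

random.seed(0)
f = lambda x: 3.0 * x              # "true" f, unknown in practice
f_hat_good = lambda x: 3.0 * x     # fhat = f: only irreducible error remains
f_hat_poor = lambda x: 2.5 * x     # biased fhat: adds reducible error

def mse(f_hat, n=50_000, sigma=1.0):
    total = 0.0
    for _ in range(n):
        x = random.uniform(0, 2)
        y = f(x) + random.gauss(0.0, sigma)   # Y = f(X) + eps
        total += (y - f_hat(x)) ** 2
    return total / n

mse_good = mse(f_hat_good)   # around sigma^2 = 1.0: the irreducible floor
mse_poor = mse(f_hat_poor)   # around 1.0 + E[(0.5 X)^2] = 4/3: floor + reducible part
```

No choice of learning technique can push the error below the first value; better techniques only shrink the gap between the two.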


2. Inference

Goal: explain the relationship between inputs and outputs


How does Y change as a function of X1 , . . . , Xp ?
Which predictors are associated with the response? Identify the few important predictors
What is the relationship between the response and each predictor? Positive vs. negative
relationship
Example: advertising data
Which media contribute to sales?
Which media generate the biggest boost in sales?
How much sales increase is associated with a given increase in TV advertising?


How to estimate f?

Training data: n different observations (rows), p different features (columns)


xij : value of the jth feature for observation i, i = 1, 2, . . . , n and j = 1, 2, . . . , p
yi : value of the response variable for the ith observation
Training data: {(x1 , y1 ), (x2 , y2 ), . . . , (xn , yn )}, where xi = (xi1 , xi2 , . . . , xip )T
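As a concrete (made-up) instance of this notation, with n = 3 observations and p = 2 features:

```python
# Rows are observations, columns are features; Python indices are 0-based,
# so x_ij from the slide corresponds to X[i - 1][j - 1].
X = [
    [5.1, 3.5],   # x_1 = (x_11, x_12)
    [4.9, 3.0],   # x_2
    [6.2, 2.9],   # x_3
]
y = [0.4, 0.2, 1.1]        # y_i: response for observation i
train = list(zip(X, y))    # {(x_1, y_1), ..., (x_n, y_n)}

n, p = len(X), len(X[0])
x_32 = X[3 - 1][2 - 1]     # value of feature j = 2 for observation i = 3
```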

Goal: apply a learning method to the training data to estimate the unknown function
f (i.e. find fˆ s.t. Y ≈ fˆ(X ) for any observation (X , Y )).
Learning methods
1 Parametric learning methods
2 Non-parametric learning methods


1. Parametric methods
A two-step model-based approach
1 Assumption on the function shape
Example: f is linear in X → linear model, i.e.:

f (X ) = β0 + β1 X1 + β2 X2 + · · · + βp Xp

2 After selecting a model, choose a procedure that uses the training data to fit the
model
Example: estimate the parameters β0 , β1 , . . . , βp , i.e.:

Y ≈ β0 + β1 X1 + β2 X2 + · · · + βp Xp
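For a single predictor (p = 1), step 2 has a closed form. A sketch with invented data roughly following y ≈ 2x:

```python
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

# Least-squares estimates for the model Y ≈ beta0 + beta1 * X
n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n
beta1 = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))
beta0 = y_bar - beta1 * x_bar
```

With p predictors the same idea is solved as a linear system (the normal equations), but the estimation target is still just the fixed set of parameters β0 , . . . , βp.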


The problem of estimating f is reduced to estimating a set of parameters which minimize
the error of the estimator (i.e. training)
The chosen model will usually not match the true unknown form of f (poor estimate)
Possible solution: flexible models that can fit many different functional forms for f
→ it requires estimating a greater number of parameters
→ risk of overfitting the data: they follow the errors (or noise) too closely
Parametric models are the most common type of model used for ML
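The overfitting risk can be made concrete. The sketch below uses an extremely flexible fit (1-nearest-neighbour, chosen for brevity rather than as a parametric model) on invented data from Y = f(X) + ε: training error is exactly zero because the noise is memorized, while error on fresh data from the same process is much worse.

```python
import random

random.seed(1)
f = lambda x: 2.0 * x
noise = lambda: random.gauss(0.0, 1.0)
train = [(i / 10, f(i / 10) + noise()) for i in range(50)]
test = [(i / 10 + 0.05, f(i / 10 + 0.05) + noise()) for i in range(50)]

def one_nn(x, data):
    """Predict with the single closest training point (maximal flexibility)."""
    return min(data, key=lambda p: abs(p[0] - x))[1]

mse = lambda data: sum((y - one_nn(x, train)) ** 2 for x, y in data) / len(data)

train_mse = mse(train)   # exactly 0: the fit follows the errors (noise) perfectly
test_mse = mse(test)     # roughly 2 * sigma^2: the memorized noise does not generalize
```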


Example: income data


income ≈ β0 + β1 education + β2 seniority
We assumed a linear relationship: only estimate β0 , β1 , β2 (least squares linear
regression)
[Figure: two 3D plots of Income as a function of Years of Education and Seniority. Left: a linear model fit by least squares to the income data. Right: a smooth thin-plate spline fit]


2. Non-parametric methods

No explicit assumptions about the function shape


An estimate of f that gets as close as possible to the data points
Advantage: accurately fit a wider range of possible shapes for f
Disadvantage: a very large number of observations is required
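A classic non-parametric method is k-nearest-neighbours regression: no functional form is assumed, and the prediction at x is simply the average response of the k closest training points. The data and the choice k = 3 below are invented for illustration.

```python
def knn_predict(x, train, k=3):
    """Average the responses of the k training points closest to x."""
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    return sum(y for _, y in nearest) / k

# Noisy observations of a relationship the method makes no assumption about.
train = [(1, 1.0), (2, 4.1), (3, 8.9), (4, 16.2), (5, 24.8)]
pred = knn_predict(3.1, train)   # mean of the responses at x = 2, 3, 4
```

The choice of k controls smoothness: a small k tracks the data points closely (flexible), a large k averages more heavily (smoother), which connects directly to the smoothness question in the next example.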


Example: income data


What is the correct amount of smoothness?

[Figure: two thin-plate spline fits of Income as a function of Years of Education and Seniority. Left: thin-plate spline for the income data. Right: thin-plate spline with a lower level of smoothness]


The trade-off between prediction accuracy and model interpretability

More restrictive models (ex. linear regression) offer a small range of shapes for f
More flexible models (ex. thin plate splines) offer a wider range of shapes for f
Why choose a more restrictive method?
Inference: restrictive models are more interpretable
Prediction: flexible models are less interpretable, but give more accurate predictions


[Figure: methods arranged by flexibility (low to high) and interpretability (high to low): Subset Selection and Lasso; Least Squares; Generalized Additive Models and Trees; Bagging and Boosting; Support Vector Machines]