0% found this document useful (0 votes)

10 views27 pages

3. Introduction to Machine Learning

The document provides an introduction to machine learning, covering its definition, key concepts, and various algorithms including supervised and unsupervised learning. It discusses data representation, feature extraction, and the importance of training and testing data in building machine learning models. Additionally, it highlights applications of machine learning in various fields such as recommendation systems, virtual assistants, and autonomous vehicles.

Uploaded by

mathurarushi4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views27 pages

3. Introduction to Machine Learning

Uploaded by

mathurarushi4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Introduction to Machine

Learning
What to expect from the course?
 What is Machine Learning ?

 Data Visualization/Analysis, Pandas, NumPy,……..

 Dimensionality Reduction
Hands-On session will be
 Different Machine Learning Algorithms (Supervised, conducted in parallel
Unsupervised, metrics)

 Deep Learning (Neural Networks, back propagation, loss

functions)

 CNN, RNN, LSTM

 ….. and more

Introduction
Any technique which enables
computers to mimic human behaviour
Artificial Intelligence

Machine Learning  Learning is much deeper than

memorization and information recall

 Learning is “a process that leads to

Deep Learning
change, which occurs as a result of
experience and increases the potential
for improved performance and future
learning” (Ambrose et al, 2010, p.3)
Machine Learning
 Machine learning is a “Field of study that gives
computers the ability to learn without being explicitly INPUT, DATA
programmed.” : Arthur Samuel

 The function of a machine learning system can be: Intelligent

System
 descriptive, meaning that the system uses the data
to explain what happened
Decisions,
 predictive, meaning the system uses the data to
Output,
predict what will happen
Actions
 prescriptive, meaning the system will use the data
to make suggestions about what action to take
Data Driven Problem Solving
Area (sq.ft) Price Area (sq.ft) Price
250 250000 250 145500
120 120000 120 212800
310 310000 310 194390
290 290000 290

Not a trivial solution. There

Simple well-known solution.
should be more parameters,
(Price = Area *1000)
(e.g., Age, Location)
The above relation obtained in a
Lot more data is needed to
trivial way, with one example.
solve the above.
Remarks
General Strategy: Given many examples of (X,Y), learn an automated solution to predict Y
Given a new X, Y = F(X)

 Main Challenge: The data is becoming complex

3.1
-2.6
 What is X is not a simple number? 0.41 3.9 m

 A N-dim vector? 1.89 ₹ 8.2 L

15.2 Blue
 Entities other than numbers?
Sedan
 A picture? …
 A sound bite? 9.23
How do we get the machine to do this?

General Strategy: Given many examples of (X,Y), learn an automated solution to predict Y
Given a new X, Y = F(X)

3.1
 There is too much information in raw data
-2.6
0.41
 Relevant information is hidden probably? 1.89
3.9 m
₹ 8.2 L
15.2
 Leads to Feature Extraction: Extracting Blue
useful information (X) from raw data
…
Sedan

9.23
Representation: From Raw data to Features
Area Bedrooms Bathrooms Age Parking Basement Price
240 3 2 10 No Yes 250000

 Convert all data into a vector of real numbers: Raw Data

 Points in a feature space

𝐢 𝐢

 Convert all predictions into an integer/real number:

 How do we deal with categorical data?

Categorical Data
 Ordinal Data – The categories have a meaningful order or ranking, but the intervals between the
categories are not necessarily equal. e.g.-Satisfaction Rating: Poor, Fair, Good, Excellent.

 Nominal Data - The categories are names or labels with no inherent order or ranking.
e.g.- Colors: Red, Green, Blue, Types of Pets: Dog, Cat, Bird, Fish.

 Use Integer Encoding for ordinal data where the order of categories is meaningful.

 Categories like "low," "medium," and "high" can be represented as 1, 2, and 3, respectively. The
numerical values reflect the order or ranking among the categories.

 One-Hot Encoding is used for nominal data where there is no natural order, or to prevent
algorithms from mistakenly interpreting ordinal relationships between categories.
One-Hot Encoding
 Most widespread approach used for categorical data, unless your categorical variable takes on a large
number of values.

Pets Cat Dog Fish

Cat 1 0 0
Cat 1 0 0
Dog One-Hot Encoding 0 1 0
Fish 0 0 1
Dog 0 1 0
Cat 1 0 0
Fish 0 0 1

 Can lead to a significant increase in the number of features, especially if the categorical feature has
many unique values.
Representation: From Raw data to Features
Area Bedrooms Bathrooms Age Parking Basement Price
240 3 2 10 No Yes 250000

 We are given a set of n examples: 𝐢

 Our goal is to learn a model: that captures the pattern of the training
samples
 We can assume a model and learn its parameters

 Once we learn the model, we can predict the output, corresponding to any new
input, X’ :
Usual Programming vs Machine Learning
Programming: Machine Learning:

New Data: X’
Data Program Data: X Output: Y
F(X, Y)

Testing phase
Training phase
Computer
Computer Computer

Output: Y’
Output Program: F(X, Y)
ML Based on Training-Testing Data

Labelled Data [ samples]

Test Data for

Training Data [ samples] remaining
samples

 Take care to not leak information from Test Data into the Model
Feature extraction, Goal: to predict f()
Training Data with the Building
Learn about f() from
representation of a model
training data
feature space

Test Data Model Design

and Validation

Feature extraction, with the

representation of feature
space Trained Model

Model
Evaluation and Compute Prediction
Deployment for the test data
Data Representation
Age
Area Age Property
230 15 A
120 6 B
202 2 B
398 11 A
274 8 ?
Area
Feature Space Representation
Finding the best
Property Type Equation of the line
Feature extraction, fit line
with the Goal: to predict f()
Training Data Building
representation of Learn about f() from
a model
feature space training data
Unknown Property Type Area, Age as points in
Test Data 2D space Model Design
and Validation

Feature extraction, with the

representation of feature
Point vs. Line
space Trained Model
Area, Age as points in
2D space Property Type – A/B
Learning is concerned with accurate Model
Evaluation and Compute Prediction
prediction of future data, not accurate
Deployment for the test data
prediction of training or available data
Summary – Machine Learning Framework

y = f(x)  Note: Training set and

testing set comes from
the same distribution
output prediction feature or
function representation

 Training: given a training set of labeled examples 𝟏 𝟏 𝟐 𝟐 𝑵 𝑵 ,

estimate the prediction function f by minimizing the prediction error

 Testing: apply f to the test example x’ and output the predicted value y = f(x’)
Summary – Machine Learning Framework

y = f(x)
output prediction feature or
function representation

 The input is converted to a vector x

 The output is a value indicated by y
 Depending on the nature of x and y, we define
1) Regression
2) Classification
3) …………………
Representations
 Representations in machine learning refer to the way data is transformed or encoded into
a format that is suitable for a learning algorithm to process

Sepal Length Sepal Width Petal Length Petal Width Species

5.1 3.5 1.4 0.2 A
5.4 3.7 1.1 0.1 A
5.2 2.7 3.9 1.0 B
6.6 2.9 3.5 1.2 B
5.8 2.8 5.1 2.4 C
7.7 3.7 6.7 2.2 C
-------- ------ ----- ----- -----
Feature Space
Representations
 Images: Raw Pixel Representation, Deep Learning Based Features

 The sum of all the pixels

 The number of boundary pixels
 Edge detection
Representations

 Sound: Waveform Representation,

Spectrogram Representation, Mel-
Frequency Cepstral Coefficients

Reference: Towards Low-Complexity Wireless

Technology Classification Across Multiple
Environments,
DOI:10.1016/j.adhoc.2019.101881
Representations – Textual Data
 Text Data: N-grams, Bag of Words, Term Frequency-Inverse Document Frequency, Word
Embeddings
Sentence: The weather is sunny today

N-gram N-gram Generated Number of N-gram

Sentence Features
Unigram (1-Gram) “The”, “weather”, “is”, 5
“sunny”, “today”
Bigram (2-Gram) “The weather”, “The is”, 10
“The sunny”, ……
Trigram (3-Gram) “The weather is”, 3
“weather is sunny”, ….
Representations – Textual Data
 Text Data: N-grams, Bag of Words, Term Frequency-Inverse Document Frequency, Word
Embeddings
 Sentence 1: The weather is sunny today
 Sentence 2: The weather was rainy yesterday

1 2 3 4 5 6 7 8 Length
The weather is sunny today was rainy yesterday
1 1 1 1 1 1 0 0 0 5
2 1 1 0 0 0 1 1 1 5

 Vector of Sentence 1: [1 1 1 1 1 0 0 0]

 Vector of Sentence 2: [1 1 0 0 0 1 1 1]
Why sudden interest in AI?

 Appearance of large, high-quality labeled datasets

 Massively parallel computing with GPUs

 Backprop-friendly activation functions, Improved

architectures

 Software platforms, Cloud Compute, APIs,

Libraries
More People, Papers, Results,  New regularization techniques, Robust optimizers
Funding, Positive Feedback.
Where is Machine Learning?

Recommendation
Systems Virtual Assistants
Facial Recognition

E-Commerce
Create Photographs, Paintings
Chess/ Go Champions

Autonomous Cars/Navigation

Speech Recognition
Segmentation
Image Courtesy: Google
Other Applications
• Surveillance
• Automated Assembly
• Mail Sorting
• Face detection (photography)
• Robot Navigation
• Content-Based Image Retrieval
• Entertainment
• And many more…

Image Courtesy: Google

Introduction To Machine Learning: Jaime S. Cardoso
100% (1)
Introduction To Machine Learning: Jaime S. Cardoso
52 pages
737manual UK
96% (25)
737manual UK
60 pages
Google UX Design Certificate - Portfolio Project 1 - Case Study Slide Deck (Template)
No ratings yet
Google UX Design Certificate - Portfolio Project 1 - Case Study Slide Deck (Template)
25 pages
Lecture 17&18 - Introduction To Machine Learning
No ratings yet
Lecture 17&18 - Introduction To Machine Learning
51 pages
CS464 Ch1 Intro Fall2020
No ratings yet
CS464 Ch1 Intro Fall2020
83 pages
Machine Learning Overview
No ratings yet
Machine Learning Overview
92 pages
Class10-Introduction_to_ML
No ratings yet
Class10-Introduction_to_ML
32 pages
Slides on DataI
No ratings yet
Slides on DataI
33 pages
Introduction To Machine Learning
100% (1)
Introduction To Machine Learning
119 pages
Lec-1 Introduction
No ratings yet
Lec-1 Introduction
65 pages
04_MLModelingBasics
No ratings yet
04_MLModelingBasics
61 pages
Module 2
No ratings yet
Module 2
73 pages
Lecture 06 Part A - Macine Learning
No ratings yet
Lecture 06 Part A - Macine Learning
77 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Introduction to machine learning
No ratings yet
Introduction to machine learning
33 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
58 pages
Pattern Recognition 14
No ratings yet
Pattern Recognition 14
46 pages
Machine Learning INTRO
No ratings yet
Machine Learning INTRO
12 pages
ML Notes -2025
No ratings yet
ML Notes -2025
145 pages
ML UNIT-II
No ratings yet
ML UNIT-II
37 pages
An Introduction To Machine Learning and How To Teach Machines To See
No ratings yet
An Introduction To Machine Learning and How To Teach Machines To See
50 pages
Machine Learning
No ratings yet
Machine Learning
17 pages
Steven Skiena-The Algorithm Design Manual-En
50% (2)
Steven Skiena-The Algorithm Design Manual-En
27 pages
ML -1_Sovan_Introduction to ML
No ratings yet
ML -1_Sovan_Introduction to ML
83 pages
ML1 17 Hepsi
No ratings yet
ML1 17 Hepsi
90 pages
The Implication of Statistical Analysis and Feature Engineering For Model Building Using Machine Learning Algorithms
No ratings yet
The Implication of Statistical Analysis and Feature Engineering For Model Building Using Machine Learning Algorithms
11 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
56 pages
Lecture 02
No ratings yet
Lecture 02
34 pages
Lecture 02
No ratings yet
Lecture 02
34 pages
Machine Learning The Basics
No ratings yet
Machine Learning The Basics
158 pages
Machine Learning
No ratings yet
Machine Learning
16 pages
Lecture Notes 2016
No ratings yet
Lecture Notes 2016
132 pages
AI-Lecture 8 (Machine Learning Overview)
No ratings yet
AI-Lecture 8 (Machine Learning Overview)
42 pages
Implementing Artificial Neural Network in Python From Scratch
No ratings yet
Implementing Artificial Neural Network in Python From Scratch
16 pages
Machine Learning Slides
No ratings yet
Machine Learning Slides
281 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
33 pages
Lecture 1
No ratings yet
Lecture 1
36 pages
algorithmeknn-121213175830-phpapp02
No ratings yet
algorithmeknn-121213175830-phpapp02
52 pages
18ai61-Model Question Paper Solutions
No ratings yet
18ai61-Model Question Paper Solutions
71 pages
Linear Algebra For Machine Learning
No ratings yet
Linear Algebra For Machine Learning
65 pages
ML Overview
No ratings yet
ML Overview
26 pages
ML Unit 1
No ratings yet
ML Unit 1
73 pages
ML week 8
No ratings yet
ML week 8
12 pages
Sep7 Classification
No ratings yet
Sep7 Classification
65 pages
Comp Vis Week 2
No ratings yet
Comp Vis Week 2
16 pages
Presentation on ML - Copy
No ratings yet
Presentation on ML - Copy
469 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
47 pages
ML Word To PDF
No ratings yet
ML Word To PDF
229 pages
2021 Machine Learning Intro
No ratings yet
2021 Machine Learning Intro
43 pages
DS Cheat Sheets
No ratings yet
DS Cheat Sheets
18 pages
Warming-Up To ML, and Some Simple Supervised Learners (Distance-Based "Local" Methods)
No ratings yet
Warming-Up To ML, and Some Simple Supervised Learners (Distance-Based "Local" Methods)
29 pages
Machine Learning Part: Domain Overview
No ratings yet
Machine Learning Part: Domain Overview
20 pages
1. U1 ML Intro and Applications
No ratings yet
1. U1 ML Intro and Applications
123 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Chapter-1 Ml Intro
No ratings yet
Chapter-1 Ml Intro
36 pages
01 - Introduction
No ratings yet
01 - Introduction
35 pages
AI-900 - Fundamental Principles of ML
No ratings yet
AI-900 - Fundamental Principles of ML
55 pages
ML Lectures Summary 2
No ratings yet
ML Lectures Summary 2
52 pages
Class10 14 PatternClassification - 13 24sept2019
No ratings yet
Class10 14 PatternClassification - 13 24sept2019
50 pages
Machine Learning Updated
No ratings yet
Machine Learning Updated
14 pages
Math for Deep Learning: What You Need to Know to Understand Neural Networks
From Everand
Math for Deep Learning: What You Need to Know to Understand Neural Networks
Ronald T. Kneusel
No ratings yet
Digital Image Processing: Fundamentals and Applications
From Everand
Digital Image Processing: Fundamentals and Applications
Fouad Sabry
No ratings yet
1. Introduction to Linear Algebra
No ratings yet
1. Introduction to Linear Algebra
33 pages
Devops QB Unit2 Question And Amswers_241012_213207[1]
No ratings yet
Devops QB Unit2 Question And Amswers_241012_213207[1]
22 pages
CSE 3-1 DevOps QB
No ratings yet
CSE 3-1 DevOps QB
21 pages
DO Unit-II Notes[1]
No ratings yet
DO Unit-II Notes[1]
26 pages
DO Unit-IV Notes[1]
No ratings yet
DO Unit-IV Notes[1]
26 pages
QR Code Generation
No ratings yet
QR Code Generation
4 pages
VANSH 2025 Sponsor Brochur
100% (1)
VANSH 2025 Sponsor Brochur
12 pages
Research Paper-1
No ratings yet
Research Paper-1
5 pages
1539 IT Support Interview Questions Answers Guide
No ratings yet
1539 IT Support Interview Questions Answers Guide
7 pages
Odisha State Datacenter Project
No ratings yet
Odisha State Datacenter Project
377 pages
Microsoft Dynamics NAV 50 Change
No ratings yet
Microsoft Dynamics NAV 50 Change
38 pages
Modeling the Data Warehouse and Data Mart
No ratings yet
Modeling the Data Warehouse and Data Mart
10 pages
Scheme of Work - Computer Studies (Form 1A) 2009
88% (16)
Scheme of Work - Computer Studies (Form 1A) 2009
18 pages
Albumin
No ratings yet
Albumin
1 page
Mohamed Meligy Senior Software Engineer Silverkey Tech
No ratings yet
Mohamed Meligy Senior Software Engineer Silverkey Tech
28 pages
6 Class Paper Mid Term
No ratings yet
6 Class Paper Mid Term
2 pages
3GPP TS 23.246
No ratings yet
3GPP TS 23.246
57 pages
Computer Science Worksheet STD 4,5,6
No ratings yet
Computer Science Worksheet STD 4,5,6
3 pages
NPPlog
No ratings yet
NPPlog
2 pages
10 Mini CNC Lathe and Milling Machine For Education PDF
100% (2)
10 Mini CNC Lathe and Milling Machine For Education PDF
11 pages
Unit-4.1 PPT Notes
No ratings yet
Unit-4.1 PPT Notes
31 pages
Composite Providers With BW On HANA For Efficient Data Modelling
No ratings yet
Composite Providers With BW On HANA For Efficient Data Modelling
5 pages
CA Secure Browser Install 2020-2021
No ratings yet
CA Secure Browser Install 2020-2021
3 pages
Grounded SAM- Assembling Open-World Models for Diverse Visual Tasks
No ratings yet
Grounded SAM- Assembling Open-World Models for Diverse Visual Tasks
11 pages
POWL Configuration EP
No ratings yet
POWL Configuration EP
8 pages
"TIGER" OCR Library: (For Microsoft Windows NT and Windows 98/95)
No ratings yet
"TIGER" OCR Library: (For Microsoft Windows NT and Windows 98/95)
42 pages
Class 6
No ratings yet
Class 6
15 pages
Detection oof cyber bullying in social media using machine learningppt
No ratings yet
Detection oof cyber bullying in social media using machine learningppt
19 pages
Nick Jang
No ratings yet
Nick Jang
3 pages
Prc 005 Application Guide
No ratings yet
Prc 005 Application Guide
9 pages
Learning Activity 2: Integrating Images and External Materials
No ratings yet
Learning Activity 2: Integrating Images and External Materials
6 pages
Microprocessor Systems and Interfacing (EEE342)
No ratings yet
Microprocessor Systems and Interfacing (EEE342)
21 pages
Uml Lab Manual
No ratings yet
Uml Lab Manual
38 pages
Speed Optimization Best Practises Ebook by Swift
No ratings yet
Speed Optimization Best Practises Ebook by Swift
27 pages
QualNet 4.0 InstallationGuide
0% (1)
QualNet 4.0 InstallationGuide
34 pages

3. Introduction to Machine Learning

Uploaded by

3. Introduction to Machine Learning

Uploaded by

Introduction to Machine

 Data Visualization/Analysis, Pandas, NumPy,……..

 Deep Learning (Neural Networks, back propagation, loss

 CNN, RNN, LSTM

 ….. and more

Machine Learning  Learning is much deeper than

 Learning is “a process that leads to

 The function of a machine learning system can be: Intelligent

Not a trivial solution. There

 Main Challenge: The data is becoming complex

 A N-dim vector? 1.89 ₹ 8.2 L

 Convert all data into a vector of real numbers: Raw Data

 Convert all predictions into an integer/real number:

 How do we deal with categorical data?

Pets Cat Dog Fish

 We are given a set of n examples: 𝐢

Labelled Data [ samples]

Test Data for

Test Data Model Design

Feature extraction, with the

Feature extraction, with the

y = f(x)  Note: Training set and

 Training: given a training set of labeled examples 𝟏 𝟏 𝟐 𝟐 𝑵 𝑵 ,

 The input is converted to a vector x

Sepal Length Sepal Width Petal Length Petal Width Species

 The sum of all the pixels

 Sound: Waveform Representation,

Reference: Towards Low-Complexity Wireless

N-gram N-gram Generated Number of N-gram

 Appearance of large, high-quality labeled datasets

 Massively parallel computing with GPUs

 Backprop-friendly activation functions, Improved

 Software platforms, Cloud Compute, APIs,

Image Courtesy: Google

You might also like