Machine Learning (Unit I)

The document provides an overview of machine learning (ML), emphasizing its reliance on data to optimize performance and make predictions without explicit programming. It outlines the machine learning process, including problem exploration, data engineering, model engineering, and ML operations, highlighting the importance of quality data and iterative model training. Additionally, it touches on statistical concepts and decision theory that support machine learning algorithms in making informed predictions.

MACHINE LEARNING
UNIT I

Dr S Devidhanshrii
Assistant Professor
Data Science & Analytics
Why “Learn”?
• Machine learning is programming computers to optimize a performance criterion using example data or past experience.
• There is no need to “learn” to calculate payroll. Learning is used when:
• Human expertise does not exist (navigating on Mars)
• Humans are unable to explain their expertise (speech recognition)
• The solution changes over time (routing on a computer network)
• The solution must be adapted to particular cases (user biometrics)
• Learning builds general models from data of particular examples.
• Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce.
• Example in retail, from customer transactions to consumer behavior: people who bought “Da Vinci Code” also bought “The Five People You Meet in Heaven” (www.amazon.com).
• Goal: build a model that is a good and useful approximation to the data.
Machine Learning
• Machine learning (ML) allows computers to learn and make
decisions without being explicitly programmed.
• It involves feeding data into algorithms to identify patterns
and make predictions on new data.
• It is used in various applications like image recognition,
speech processing, language translation, recommender
systems, etc.
• Machine Learning solves these problems by learning from
examples and making predictions without fixed rules.
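As a minimal sketch of “learning from examples,” the snippet below fits a classifier on a few labeled examples and then predicts a label for an input it has never seen. The toy features (height, weight) and labels are invented for illustration; scikit-learn is assumed since the slides mention it later.

```python
# A classifier learns a rule from labeled examples instead of being
# programmed with fixed rules. Toy data is purely illustrative.
from sklearn.neighbors import KNeighborsClassifier

# Training examples: [height_cm, weight_kg] -> label
X_train = [[150, 50], [160, 60], [180, 85], [190, 95]]
y_train = ["small", "small", "large", "large"]

model = KNeighborsClassifier(n_neighbors=1)
model.fit(X_train, y_train)

# Predict for a new example the model has never seen.
print(model.predict([[185, 90]])[0])  # prints "large"
```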
Importance of Data in Machine Learning
Data is the foundation of machine learning (ML); without quality data, ML models cannot learn, perform, or make accurate predictions.
• Data provides the examples from which models learn patterns and relationships.
• High-quality and diverse data improves how well models perform and generalize to new situations.
• It helps models understand real-world scenarios and adapt to practical uses.
• Features extracted from data are important for effective training.
• Separate datasets for validation and testing measure how well models generalize to unseen data.

MACHINE LEARNING PROCESS

• The machine learning process defines the flow of work that a data science team executes to create and deliver a machine learning model.
• In addition, the ML process defines how the team works and collaborates to create the most useful predictive model.
• A High Level Machine Learning Process
• At a high level, this workflow includes problem exploration, data engineering, model engineering, and ML Ops, as described in the machine learning life cycle.
Problem Exploration
First, focus on how the model will be used. In the process, assess the desired model accuracy and explore other details, such as whether false positives are worse than false negatives. This phase also includes understanding what data might be available.
• Define Success: Define the problem to be solved, for example, what should be predicted. This helps define what data will be needed. Also, make sure it is clear how success will be measured.
• Evaluate Data: Determine the relevant data sources. In other words, evaluate what data the team will need, how that data is collected, and where the data is stored.
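The trade-off between false positives and false negatives mentioned above can be made concrete with two standard metrics. The labels below are invented for illustration.

```python
# Precision penalizes false positives; recall penalizes false negatives.
# Which metric matters more depends on the problem being explored.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]  # made-up ground truth
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]  # made-up model predictions

tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

precision = tp / (tp + fp)  # of everything flagged, how much was right?
recall = tp / (tp + fn)     # of everything real, how much was found?
print(f"FP={fp}, FN={fn}, precision={precision:.2f}, recall={recall:.2f}")
```

For example, in medical screening a false negative (a missed disease) is usually worse than a false positive, so recall would be weighted more heavily when defining success.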
DATA ENGINEERING
Design and build data pipelines. These pipelines get, clean, and transform data into a format that is more easily used to build a predictive model. Note that this data might come from multiple data sources, so merging the data is also a key aspect of data engineering. This is often where the most time is spent in an ML project.
• Obtain Data: Assemble the data. This includes connecting to remote data stores and databases, which might be in different formats. For example, some data might be in CSV format, while other data could be available as JSON via web services.
• Scrub Data: Re-format particular attributes and correct errors in the data, such as imputing missing values. Datasets are often missing values, or they may contain values of the wrong type or range. Cleaning can include removing duplicates, correcting errors, dealing with missing values, normalization, and handling data type conversions.
• Explore / Validate Data: Get a basic understanding of the data. This exploratory analysis includes data profiling to obtain information about the content and structure of the data. The goal is to understand both the data attributes and the quality of the data.
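The scrubbing steps above can be sketched with pandas; the column names and values below are invented for illustration.

```python
# Minimal data-scrubbing sketch: remove duplicates, convert types,
# and impute missing values, as described in the Scrub Data step.
import pandas as pd

df = pd.DataFrame({
    "age": ["25", "30", None, "30"],   # wrong type (strings) + a missing value
    "city": ["Pune", "Pune", "Delhi", "Pune"],
})

df = df.drop_duplicates()                        # remove duplicate rows
df["age"] = pd.to_numeric(df["age"])             # data type conversion
df["age"] = df["age"].fillna(df["age"].mean())   # missing-value imputation

print(df)
```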
MODEL ENGINEERING

This is the phase that most people associate with building a machine learning model. During this phase, data is used to train and evaluate the model. This is often an iterative task, where different models are tried and the chosen model is tuned.
• Select & Train Model: The process of identifying an appropriate model, and then building / training
the model (on training data). The goal of training is to answer a question or make a prediction
correctly as often as possible.
• Test Model: Run the model on data that the model has not yet seen (such as testing data). In other
words, perform model testing by using data that was withheld from training (i.e., backtesting).
• Evaluate & Interpret Model: Objectively measure the performance of the model. Note that basic
evaluation explores metrics such as accuracy and precision, to determine if the model is useable,
and which model is best for the specific problem being explored. This evaluation also includes an
understanding of when the model makes mistakes. More generally, validating the trained model
helps to ensure the model meets original organizational objectives before the ML model is put into
production.
• Tune Model: This step refers to parameter tuning, which, depending on the model being used, can be more an art than a science. In short, models typically have parameters (i.e., dials for tuning the model) that allow the model to achieve improved performance via parameter refinement. Simple model parameters may include attributes such as the number of training steps and the initialization of certain values.
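The select/train/test/evaluate loop above can be sketched with scikit-learn. The bundled iris dataset is used here only as a stand-in; any labeled dataset works the same way.

```python
# Train on one portion of the data, then test on withheld data the
# model has never seen, and evaluate with an objective metric.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)

# Withhold 25% of the data for testing (backtesting).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# max_iter is one example of a tunable parameter ("a dial on the model").
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

acc = accuracy_score(y_test, model.predict(X_test))
print(f"held-out accuracy: {acc:.2f}")
```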
ML Ops

Broadly defined, machine learning operations (ML Ops) spans a wide set of practices,
systems, and responsibilities that data scientists, data engineers, cloud engineers, IT
operations, and business stakeholders use to deploy, scale, and maintain machine
learning solutions.
• Deploy Model: Package and put the model to use (i.e., into production). While this
varies from one group to another, the team needs to understand the expected
model performance, how the model will be monitored, and in general, key
performance indicators (KPIs) of the model.
• Monitor Model: Maintain the model in production. This includes monitoring the KPIs
and proactively working to ensure stable and robust predictions.
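One small piece of the deploy step can be sketched as serializing a trained model so a separate serving process can load it. `pickle` is used here purely for illustration; production systems often use joblib, ONNX, or a model registry instead.

```python
# "Package" a trained model for deployment, then restore it the way a
# serving process would before handling prediction requests.
import pickle
from sklearn.neighbors import KNeighborsClassifier

# A trivially small model trained on toy 1-D data.
model = KNeighborsClassifier(n_neighbors=1).fit([[0], [10]], [0, 1])

blob = pickle.dumps(model)     # serialize: what deployment ships
restored = pickle.loads(blob)  # deserialize: what the server loads

print(restored.predict([[9]])[0])  # prints 1
```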
Preliminaries for ML
• Understand domain and data.
• Prepare datasets (cleaning, transformation).
• Select evaluation metrics (accuracy, F1-score, etc.).

Testing Machine Learning Algorithms
• Train/Test split
• Cross-validation
• Hyperparameter tuning
• Bias-Variance tradeoff
• Tools: sklearn, TensorFlow, PyTorch

Turning Data into Probabilities
• ML often uses probabilities to make decisions.
• E.g., Logistic Regression predicts the probability of class membership.
• Probabilistic models deal with uncertainty in predictions.

Statistics for Machine Learning
• Statistics help in understanding and preparing data:
• Mean, Median, Mode
• Variance, Standard Deviation
• Correlation, Covariance

Probability Distributions
• Discrete distributions: Bernoulli, Binomial
• Continuous distributions: Normal (Gaussian)
• Important for modeling uncertainties in ML.

Decision Theory
• Helps machines make optimal decisions under uncertainty.
• Expected Value, Loss Function, Risk Minimization
• Foundation for models like decision trees and Bayesian classifiers.
THANK YOU