Week 3 - Machine Learning

This document outlines the fundamentals of machine learning, including its definitions, types, and workflows, emphasizing its integration with data science. It explains key concepts such as supervised, unsupervised, and reinforcement learning, as well as essential terminologies like features, labels, and model evaluation metrics. Additionally, it introduces Python libraries NumPy and Pandas for data manipulation and provides sample code for practical applications.

Uploaded by

UK Creation

Week 3: Machine Learning Fundamentals

Diploma in Computer Science & Engineering – Course Code: 20CS51I

Learning Objectives

This week introduces the core principles of machine learning and its integration with data
science. Students will explore how machines learn from data, understand the structure of ML
workflows, and become familiar with essential terminology used in academic and industry
settings.

What Is Machine Learning?

Machine Learning is a branch of Artificial Intelligence that enables systems to learn from
data and improve performance over time without being explicitly programmed. It powers
intelligent applications like recommendation engines, fraud detection systems, autonomous
vehicles, and chatbots.

In real-world scenarios, machine learning helps Netflix suggest movies, Google Maps predict
traffic, and platforms like CyberRaksha Grid detect cyber fraud patterns. These systems rely
on historical data and algorithms to make predictions or decisions.

Types of Machine Learning

Machine learning is broadly categorized into three types:

Supervised Learning involves training models on labeled data, where both inputs and
expected outputs are known. It’s commonly used for tasks like spam detection, disease
prediction, and price forecasting.
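
As a rough illustration of learning from labeled data, the sketch below implements a 1-nearest-neighbour classifier with NumPy. The data, the feature meanings, and the function name predict_1nn are invented for this example; a real project would more likely use a library such as Scikit-learn.

```python
import numpy as np

# Toy labeled data (made up): [hours_studied, classes_attended] -> 0 = fail, 1 = pass
X_train = np.array([[1.0, 2.0], [2.0, 1.0], [8.0, 9.0], [9.0, 8.0]])
y_train = np.array([0, 0, 1, 1])

def predict_1nn(X_train, y_train, x_new):
    """Return the label of the training point closest to x_new."""
    distances = np.linalg.norm(X_train - x_new, axis=1)  # Euclidean distance to each row
    return int(y_train[np.argmin(distances)])

print(predict_1nn(X_train, y_train, np.array([1.5, 1.5])))  # near the "fail" examples
print(predict_1nn(X_train, y_train, np.array([8.5, 8.5])))  # near the "pass" examples
```

The known outputs in y_train are what make this supervised: the model only copies labels it has already been shown.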

Unsupervised Learning works with unlabeled data to uncover hidden patterns or groupings.
It’s useful for customer segmentation, anomaly detection, and market basket analysis.
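
One common way to find groupings in unlabeled data is k-means clustering. The sketch below is a minimal NumPy version under simplifying assumptions: the points are made up, the starting centroids are chosen arbitrarily, and empty clusters are not handled.

```python
import numpy as np

# Unlabeled toy data (made up): two visibly separate groups of 2-D points
points = np.array([[1.0, 1.0], [1.5, 2.0], [1.0, 1.5],
                   [8.0, 8.0], [9.0, 8.5], [8.5, 9.0]])

def kmeans(points, centroids, n_iters=10):
    """Plain k-means: assign each point to its nearest centroid, then recompute centroids."""
    for _ in range(n_iters):
        # Distance from every point to every centroid, shape (n_points, k)
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        assignments = np.argmin(dists, axis=1)
        centroids = np.array([points[assignments == k].mean(axis=0)
                              for k in range(len(centroids))])
    return assignments, centroids

# Initial centroids are simply the first and last points (an arbitrary choice)
assignments, centroids = kmeans(points, points[[0, -1]].copy())
print(assignments)
print(centroids)
```

No labels are supplied anywhere; the two groups emerge only from the distances between points.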

Reinforcement Learning teaches agents to make decisions by interacting with an
environment and receiving feedback in the form of rewards or penalties. This approach is
popular in robotics, gaming, and autonomous navigation.
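
The reward-feedback loop can be sketched with a very small example: an epsilon-greedy agent choosing between two actions with fixed payoffs (a simplified "multi-armed bandit"). The function name run_bandit, the reward values, and the parameter settings are all assumptions made for illustration, and real reinforcement learning problems also involve states, which this sketch leaves out.

```python
import random

def run_bandit(rewards, steps=200, epsilon=0.2, seed=0):
    """Epsilon-greedy agent on a bandit whose actions pay fixed rewards."""
    rng = random.Random(seed)
    q = [0.0] * len(rewards)      # estimated value of each action
    counts = [0] * len(rewards)   # how often each action was tried
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(len(rewards))                    # explore
        else:
            action = max(range(len(rewards)), key=lambda a: q[a])  # exploit
        reward = rewards[action]                                    # feedback from the environment
        counts[action] += 1
        q[action] += (reward - q[action]) / counts[action]          # running average of rewards
    return q, counts

q, counts = run_bandit(rewards=[0.2, 1.0])
print(q, counts)
```

Once the agent has tried the better-paying action, its estimate for that action rises above the other one and the greedy choice switches to it.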

Machine Learning Workflow

Every machine learning project follows a structured workflow that transforms raw data into
actionable insights. It begins with defining the problem and collecting relevant data from
sources like sensors, APIs, or logs. The data is then cleaned and transformed to prepare it for
modeling.

Choosing the right algorithm is crucial. Models are trained to learn patterns from the data and
then evaluated using performance metrics. Once validated, the model is deployed into a
real-world application and monitored for accuracy and reliability. Retraining may be necessary as
new data becomes available.

Data Science and Its Role

Data science is the foundation that supports machine learning. It combines statistical analysis,
programming, and domain expertise to extract meaningful insights from data. A data
scientist’s workflow includes data wrangling, visualization, modeling, and storytelling.

Python is one of the most widely used languages in data science. Libraries like Pandas help
manipulate data, Matplotlib and Seaborn enable visualization, and Scikit-learn provides tools
for building and evaluating machine learning models.

Data science ensures that the data fed into ML models is clean, relevant, and structured for
optimal learning.

ML Pipeline: From Data to Deployment

A machine learning pipeline is a step-by-step process that automates the journey from raw
data to deployed model. It typically starts with data engineering, where missing values are
handled, categories are encoded, and features are scaled.
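
The three data engineering steps named above can be sketched with Pandas as follows. The column names and values are invented, and mean imputation, one-hot encoding, and min-max scaling are only one reasonable choice for each step.

```python
import pandas as pd

# Made-up raw data with a missing value and a categorical column
raw = pd.DataFrame({
    "age": [25.0, None, 35.0, 45.0],
    "city": ["Pune", "Delhi", "Pune", "Mumbai"],
})

clean = raw.copy()
# 1. Handle missing values: fill the gap with the column mean
clean["age"] = clean["age"].fillna(clean["age"].mean())
# 2. Encode categories: one indicator column per city
clean = pd.get_dummies(clean, columns=["city"])
# 3. Scale features: min-max scaling maps age into the range [0, 1]
age_min, age_max = clean["age"].min(), clean["age"].max()
clean["age"] = (clean["age"] - age_min) / (age_max - age_min)

print(clean)
```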

Modeling involves selecting and training algorithms on the processed data. Once the model
performs well, it’s deployed using tools like Flask or FastAPI. For scalability, Docker
containers or cloud platforms such as AWS or Azure are used.

This pipeline ensures reproducibility, efficiency, and scalability in real-world ML
applications.

Essential Terminologies

Understanding machine learning requires fluency in key terms:

• A feature is an input variable used for prediction, such as age or income.
• A label is the output variable, like “spam” or “not spam.”
• The training set is the portion of data used to teach the model.
• The test set is held-out data used to evaluate how well the model generalizes to new data.
• Overfitting occurs when a model memorizes training data but fails on unseen data.
• Underfitting happens when a model is too simple to capture patterns.
• Accuracy (the share of all predictions that are correct), precision (the share of predicted
  positives that are truly positive), and recall (the share of actual positives that the model
  finds) are metrics used to assess model performance.

These terms form the vocabulary of every data scientist and ML engineer.
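
To make the three metrics concrete, the short sketch below computes them by hand from a made-up set of true labels and predictions, using the standard counts of true/false positives and negatives.

```python
# Made-up true labels and model predictions (1 = spam, 0 = not spam)
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 1]

pairs = list(zip(y_true, y_pred))
tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # spam correctly flagged
tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-spam correctly passed
fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-spam wrongly flagged
fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # spam that was missed

accuracy = (tp + tn) / len(y_true)   # correct predictions / all predictions
precision = tp / (tp + fp)           # of everything flagged, how much was spam
recall = tp / (tp + fn)              # of all spam, how much was flagged

print(accuracy, precision, recall)   # 0.625 0.6 0.75
```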

Suggested Activities

Students can deepen their understanding through hands-on practice and reflection.
Comparing supervised and unsupervised learning with real-world examples helps clarify their
differences. Sketching a machine learning pipeline diagram reinforces the workflow stages.
Creating a glossary of ML terms with definitions and examples builds technical fluency.
Using Scikit-learn to train a basic classifier and visualizing data distributions with Seaborn
provides practical exposure. Splitting data into training and test sets and evaluating model
accuracy completes the learning loop.
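
The last activity might look like the sketch below, written with NumPy only so that it stays self-contained. The data are made up and deliberately well separated, and the "model" is a simple nearest-class-mean rule standing in for a Scikit-learn classifier, so the perfect accuracy here says nothing about real datasets.

```python
import numpy as np

rng = np.random.default_rng(42)

# Made-up, well-separated data: one feature, 10 examples per class
X = np.concatenate([np.arange(0, 10), np.arange(20, 30)]).astype(float)
y = np.array([0] * 10 + [1] * 10)

# Shuffle the indices, then hold out 25% of the rows as the test set
indices = rng.permutation(len(X))
n_test = len(X) // 4
test_idx, train_idx = indices[:n_test], indices[n_test:]

# "Training": store the mean feature value of each class
centroids = {label: X[train_idx][y[train_idx] == label].mean() for label in (0, 1)}

# Prediction: pick the class whose mean is closest to each test value
y_pred = np.array([min(centroids, key=lambda c: abs(x - centroids[c]))
                   for x in X[test_idx]])
accuracy = (y_pred == y[test_idx]).mean()
print(len(train_idx), len(test_idx), accuracy)
```

The key point is that accuracy is measured only on rows the model never saw during training.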

Python NumPy & Pandas – Notes for Beginners

What is NumPy?

NumPy (Numerical Python) is a Python library used for numerical computations. It
provides support for:

• Multi-dimensional arrays and matrices
• Mathematical functions like sum, mean, dot product, etc.
• Efficient operations on large datasets

Key Features

• Fast array operations
• Broadcasting (arithmetic between arrays of different but compatible shapes)
• Linear algebra support
• Random number generation
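
A brief sketch of three of these features follows. The arrays and the small system of equations are made up for illustration.

```python
import numpy as np

# Broadcasting: the 1-D row is applied to each row of the 2-D matrix
matrix = np.array([[1, 2, 3], [4, 5, 6]])
row = np.array([10, 20, 30])
print(matrix + row)

# Linear algebra: solve the system 2x + y = 5, x + 3y = 10
A = np.array([[2.0, 1.0], [1.0, 3.0]])
b = np.array([5.0, 10.0])
solution = np.linalg.solve(A, b)
print(solution)

# Random number generation: a seeded generator gives repeatable results
rng = np.random.default_rng(0)
samples = rng.integers(1, 7, size=5)  # five dice rolls, values 1 to 6
print(samples)
```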

What is Pandas?

Pandas is a Python library for data manipulation and analysis. It provides two main data
structures:

• Series: One-dimensional labeled array
• DataFrame: Two-dimensional labeled data table (like Excel)

Key Features

• Easy data loading from CSV, Excel, JSON
• Data filtering, grouping, and aggregation
• Handling missing values
• Powerful indexing and slicing
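
The sketch below touches on missing values, grouping, and indexing in one small example. The records and column names are invented, and filling a gap with the column mean is just one possible strategy.

```python
import pandas as pd

# Made-up records; None marks a missing mark
df = pd.DataFrame({
    "Name": ["Alice", "Bob", "Charlie", "Dina"],
    "Branch": ["CS", "CS", "EC", "EC"],
    "Marks": [85.0, None, 78.0, 90.0],
})

# Handling missing values: replace the gap with the column mean
df["Marks"] = df["Marks"].fillna(df["Marks"].mean())

# Grouping and aggregation: average marks per branch
branch_means = df.groupby("Branch")["Marks"].mean()
print(branch_means)

# Indexing and slicing: label-based (loc) and position-based (iloc) selection
top = df.loc[df["Marks"] > 80, ["Name", "Marks"]]
first_two = df.iloc[:2]
print(top)
print(first_two)
```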

Installation Steps

To install both libraries, use:

pip install numpy pandas

Or use Google Colab (no installation needed).

Sample Programs

NumPy Examples

Create and Manipulate Arrays

import numpy as np

arr = np.array([10, 20, 30])

print("Array:", arr)
print("Sum:", np.sum(arr))
print("Mean:", np.mean(arr))
print("Squared:", arr ** 2)   # element-wise square

Matrix Operations

A = np.array([[1, 2], [3, 4]])
B = np.array([[5, 6], [7, 8]])

print("Dot Product:\n", np.dot(A, B))   # matrix product of A and B
print("Transpose of A:\n", A.T)

Pandas Examples

Create a DataFrame

import pandas as pd

data = {
    'Name': ['Alice', 'Bob', 'Charlie'],
    'Marks': [85, 92, 78]
}

df = pd.DataFrame(data)
print(df)

Analyze and Filter Data

print(df.describe())          # Summary statistics
print(df[df['Marks'] > 80])   # Filter rows
df['Grade'] = ['B', 'A', 'C'] # Add column
print(df)

Read CSV File

df = pd.read_csv('[Link]')   # '[Link]' stands in for the path to a CSV file
print(df.head())             # First five rows
