0% found this document useful (0 votes)

8 views39 pages

Unit-3 Packaging ML Model

Uploaded by

Shubham Singh Rajput

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views39 pages

Unit-3 Packaging ML Model

Uploaded by

Shubham Singh Rajput

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

Packaging ML Model

Trainer: Ms. Nidhi Grover Raheja

Package an ML Model
• A package for an ML model refers to a structured bundle of
files and code that includes the trained machine learning
model, its dependencies, and necessary configurations.
• This package is designed to be easily distributable and
reusable across different environments or platforms.
• Packaging an ML model ensures that others can load the
model, process inputs, and make predictions without
needing to re-train the model from scratch.
Why Package an ML Model?
• Reusability: You can reuse the model across different
projects and environments.
• Consistency: Ensures that the same model and code are
used consistently across different platforms.
• Ease of Deployment: Makes it easier to deploy the model in
production, whether as an API or through Docker.
• Version Control: You can version different iterations of the
model and code.
Key Concepts of Packaging
• Packaging a machine learning (ML) model for production involves
wrapping the model and its dependencies into a reusable and
distributable form, allowing it to be deployed and used in various
environments.
• Below is a breakdown of the key concepts:
❑Model Serialization
❑Dependency Management
❑Project Structure
❑Configuration Files
❑Building the Package
❑Distribution and Installation
❑Prediction at Runtime
❑Deployment
1. Model Serialization: Train and save the model using joblib.
2. Dependency Management: Create a requirements.txt file listing
the dependencies.
3. Project Structure: Organize the code into separate directories for
model code, data, and configurations.
4. Configuration: Use YAML or JSON files to store environment-
specific settings.
5. Build the Package: Define the setup process using setup.py and
install the package.
6. Distribute and Install: Build the package using setuptools and install
it in any Python environment.
7. Prediction: Load the model at runtime and run inference.
8. Deployment: Optionally, package the entire project in Docker or
deploy it as a web service.
Step 1: Model Serialization
• After training your ML model, the first step is to serialize (or save) it
into a file so it can be loaded later for predictions without retraining.
• This allows you to "package" the model as a file.
• Serialization Tools are given below:
✓joblib and pickle: Commonly used for serializing ML models in
Python.
✓ONNX: An open standard format for exporting ML models across
different frameworks. Example: Saving a trained model using joblib
# Step 1: Train and serialize the model
import joblib
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier

# Load dataset and train a model

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
model = RandomForestClassifier().fit(X_train, y_train)

# Save the trained model as a .pkl file

joblib.dump(model, "iris_model.pkl")
print("Model saved!")

Outcome: The model is saved in iris_model.pkl, which can be loaded later for predictions.
Step 2: Dependency Management
• Every ML model relies on libraries like scikit-learn, numpy, or
pandas.
• Packaging requires managing these dependencies to ensure
the model works in any environment.
✓requirements.txt: This file lists the libraries required to
run the ML model.
✓environment.yml: If using Conda, this file captures the
Python version, dependencies, and system packages.
Example requirements.txt:

scikit-learn==1.0.2
joblib==1.1.0
numpy==1.21.0
Step 3: Project Structure
• Organizing your code, model, and resources is crucial for packaging.
• A well-structured project ensures code maintainability and makes it easy to build the package.
• Here’s an example project structure:
Step 4: Configuration Files
• Configuration files, typically in formats like .yaml, .json, or .ini, store environment-
specific settings (e.g., paths, thresholds, or model hyperparameters) so that the
package can be easily reconfigured in different environments.
Example config.yaml:

Usage: The config.yaml file defines where the model is stored and which features are used for
prediction
Step 5: Building the Package
• Once the model and scripts are in place, create a setup.py file to define how your
project can be installed as a Python package.
• The setup file specifies the package name, version, dependencies, and entry points
(if needed).
Example setup.py:
from setuptools import setup, find_packages

setup(
name="ml_model_package",
version="0.1",
packages=find_packages(),
install_requires=[
"scikit-learn",
"numpy",
"joblib"
],
entry_points={
'console_scripts': [
'predict=ml_model_package.predict:main',
]
}
)
Step 6: Distribution and Installation

Once the setup file is ready, you can build and install the package.

1. Install Locally:
pip install .
This command installs the package in your Python environment.

2. Build the Package: To create a distributable package, use:

python setup.py sdist bdist_wheel

This generates distribution archives (like .tar.gz or .whl files) in the dist/
directory. These can be uploaded to PyPI or shared directly.
Example: predict.py
Step 7: Prediction at Runtime
• The packaged model can now be
used to make real-time predictions
by loading the serialized model and
running inference.

We can now run predictions by executing:

python ml_model_package/predict.py
Example Dockerfile:
Build and run the container:
Step-by-Step Process to Build an ML Package:
Step 1: Create a Virtual Environment
• Before packaging, create a virtual environment to ensure that all
dependencies are isolated.
1. Open Command Prompt and create a virtual environment:

python -m venv ml_env

2. Activate the virtual environment:

ml_env\Scripts\activate
Step 2: Create a Project Directory

1. Create a project directory for your package.

2. Inside this directory, create the following subdirectories.
Project Directory Structure
Step 3: Write the ML Model Code

• Train the Model: Create a script train_model.py in the ml_model_package

directory to train and save the ML model using echo.> command
• We'll use the Iris dataset for this example.
# train_model.py
We'll use the Iris dataset for this example.
import joblib
train_model.py → from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
import os

# Load dataset
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

# Train model
model = RandomForestClassifier()
model.fit(X_train, y_train)

# Ensure the directory exists

model_dir = "data"
if not os.path.exists(model_dir):
os.makedirs(model_dir)

# Save the model

joblib.dump(model, os.path.join(model_dir, "iris_model.pkl"))
print("Model saved at 'data/iris_model.pkl'")
Prediction Code: Create a script predict.py to load the model and make predictions on new data.

Create predict.py using echo.> command

predict.py # predict.py

import joblib
import numpy as np

# Load the trained model

model = joblib.load("data/iris_model.pkl")

def predict_iris():
# Predict using the model
# Sample input for prediction
sample_input = [[5.1, 3.5, 1.4, 0.2]] # Example for Iris dataset

# Make prediction
prediction = model.predict(sample_input)

print(f"Prediction: {prediction}")

if __name__ == "__main__":
predict_iris()
Step 4: Define the Package with setup.py

The setup.py file defines how your project will be packaged and installed.

Create the setup.py file in the root directory (ml_package):

setup.py

import sys
sys.path.append('.')

from setuptools import setup, find_packages

setup(
name="ml_model_package",
version="0.1",
packages=find_packages(),
install_requires=[
"scikit-learn",
"joblib",
"numpy"
],
entry_points={
'console_scripts': [
'predict=ml_model_package.predict:predict_iris',
]
}
)
Step 5: Create __init__.py in ml_model_package

The __init__.py file is used to mark a directory as a Python package. It also allows you to
initialize or configure things when the package is imported. For your ml_model_package,
the __init__.py file can remain simple or can be used to import functions or classes to
make them accessible at the package level.
Example Contents for __init__.py

Suppose you want to make the predict function from the predict.py module directly accessible
when someone imports the package. If you have multiple useful functions across different
modules (e.g., train_model.py and predict.py), you can add them in __init__.py to provide
access to everything at the top level of your package

You can add an import statement like this in init.py:

Now, if someone imports your package, they can access the predict_iris function.
Step 6: Create config.yaml file
Create config.yaml file in ml_model_package

Check config.yaml FileEnsure the config.yaml file has the correct path to the model:
Add the following code to config.yaml

# config.yaml

model_path: "data/iris_model.pkl"
input_columns: ["sepal_length", "sepal_width", "petal_length", "petal_width"]
Step 6: Install the Package Locally

Now we can install the package locally for testing.

Run the following command to build and install the package:

pip install .

This will package the entire project, including the scripts and dependencies, and
install it in your virtual environment.
Step 7: Run the Prediction Script

Once the package is installed, you can use the predict command directly from
the terminal:

predict

MLR 3 Book
100% (1)
MLR 3 Book
291 pages
Machine Learning With Python Joseph T Handy instant download
No ratings yet
Machine Learning With Python Joseph T Handy instant download
46 pages
Hca Unit - 2 Answers
No ratings yet
Hca Unit - 2 Answers
22 pages
StatisticsMachineLearningPythonDraft
No ratings yet
StatisticsMachineLearningPythonDraft
329 pages
Tox Package for Deployment
No ratings yet
Tox Package for Deployment
30 pages
How to Easily Deploy Machine Learning Models Using Flask _ by Abhinav Sagar _ Towards Data Science
No ratings yet
How to Easily Deploy Machine Learning Models Using Flask _ by Abhinav Sagar _ Towards Data Science
10 pages
mlfile
No ratings yet
mlfile
33 pages
Probabilistic Programming in Python Using PyMC
No ratings yet
Probabilistic Programming in Python Using PyMC
19 pages
Python Ml Topics
No ratings yet
Python Ml Topics
2 pages
2-ML Principles
No ratings yet
2-ML Principles
34 pages
Chapter 2
No ratings yet
Chapter 2
42 pages
Week 9-Module 10 Build and Deploy ML Models
No ratings yet
Week 9-Module 10 Build and Deploy ML Models
27 pages
Python Mip
100% (1)
Python Mip
33 pages
phase 4hp (2)
No ratings yet
phase 4hp (2)
8 pages
Lecture Notes 9
No ratings yet
Lecture Notes 9
5 pages
Lec 03
No ratings yet
Lec 03
9 pages
ML 4 To 9 Keyur
No ratings yet
ML 4 To 9 Keyur
21 pages
ML Libraries
No ratings yet
ML Libraries
19 pages
Introduction To Scikit Learn
100% (1)
Introduction To Scikit Learn
108 pages
Lecture 6 - Model deployment
No ratings yet
Lecture 6 - Model deployment
22 pages
MLflow Présentation
No ratings yet
MLflow Présentation
51 pages
Unit-2
No ratings yet
Unit-2
9 pages
Advanced Python for ML
No ratings yet
Advanced Python for ML
2 pages
Applied ML
No ratings yet
Applied ML
74 pages
Machine Learning Model Deployment
No ratings yet
Machine Learning Model Deployment
88 pages
Machine Learning
No ratings yet
Machine Learning
7 pages
DesineDataStruectres
No ratings yet
DesineDataStruectres
3 pages
Lecture+Notes_Intro_to_MLOps_Session3
No ratings yet
Lecture+Notes_Intro_to_MLOps_Session3
8 pages
week_3
No ratings yet
week_3
10 pages
Best Python Libraries For Machine Learning - GeeksforGeeks
No ratings yet
Best Python Libraries For Machine Learning - GeeksforGeeks
18 pages
Learning Predictive Analytics With Python - Sample Chapter
100% (2)
Learning Predictive Analytics With Python - Sample Chapter
28 pages
Silver Oak College of Computer Application: Subject:Machine Learning
No ratings yet
Silver Oak College of Computer Application: Subject:Machine Learning
15 pages
index-1
No ratings yet
index-1
2 pages
Sagemaker-V1 18 0
No ratings yet
Sagemaker-V1 18 0
164 pages
MLOps
No ratings yet
MLOps
16 pages
About Scikit
No ratings yet
About Scikit
3 pages
PRACTICAL FILE DL
No ratings yet
PRACTICAL FILE DL
14 pages
Financial Course Certificate
No ratings yet
Financial Course Certificate
1 page
Linear and Circular Arrangements Questions
No ratings yet
Linear and Circular Arrangements Questions
1 page
ML LabManual (1)
No ratings yet
ML LabManual (1)
16 pages
Model Deployment GL
No ratings yet
Model Deployment GL
20 pages
Module 5.pptx_20250608_201231_0000
No ratings yet
Module 5.pptx_20250608_201231_0000
43 pages
ChatGPT_MyLearning on Coding for Machine Learning
No ratings yet
ChatGPT_MyLearning on Coding for Machine Learning
16 pages
Machine Learning - Python Libraries
No ratings yet
Machine Learning - Python Libraries
12 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
UNIT 1
No ratings yet
UNIT 1
28 pages
ML
No ratings yet
ML
8 pages
AIML 7 To 11
No ratings yet
AIML 7 To 11
7 pages
7 - From ML To Production
No ratings yet
7 - From ML To Production
23 pages
ML_notion_1
No ratings yet
ML_notion_1
18 pages
algorithmeknn-121213175830-phpapp02
No ratings yet
algorithmeknn-121213175830-phpapp02
52 pages
Packages in Python
No ratings yet
Packages in Python
54 pages
Statistics Machine Learning Python
No ratings yet
Statistics Machine Learning Python
399 pages
Statistics Machine Learning Python Draft
No ratings yet
Statistics Machine Learning Python Draft
319 pages
Python Predictive Modeling
No ratings yet
Python Predictive Modeling
24 pages
ML Notesv1
100% (1)
ML Notesv1
300 pages
Python - Follow Dr. AngShu (@drangshu) For More
100% (1)
Python - Follow Dr. AngShu (@drangshu) For More
300 pages
How To Deploy Machine Learning Model As Microservices
No ratings yet
How To Deploy Machine Learning Model As Microservices
7 pages
Python Micro Project
No ratings yet
Python Micro Project
10 pages
Statistics Machine Learning Python
100% (1)
Statistics Machine Learning Python
389 pages
Lecture 13 & 14
No ratings yet
Lecture 13 & 14
573 pages
Copy of Lecture 10 Questions(1)
No ratings yet
Copy of Lecture 10 Questions(1)
272 pages
Copy of Impact of Sleep on Daily Life Assignment(1)
No ratings yet
Copy of Impact of Sleep on Daily Life Assignment(1)
202 pages
Impact of Sleep on Daily Life - FOE Assignment (1)
No ratings yet
Impact of Sleep on Daily Life - FOE Assignment (1)
64 pages
Introduction To C-Sharp
No ratings yet
Introduction To C-Sharp
21 pages
25-ARM 7 Assembly Programming-22-03-2024
No ratings yet
25-ARM 7 Assembly Programming-22-03-2024
24 pages
Excel FILe(Nitin Yadav)
No ratings yet
Excel FILe(Nitin Yadav)
31 pages
Firebase Storage for Angular: A reliable file upload solution for your applications
From Everand
Firebase Storage for Angular: A reliable file upload solution for your applications
Abdelfattah Ragab
No ratings yet
Deploy A Machine Learning Model Using Flask - Towards Data Science
No ratings yet
Deploy A Machine Learning Model Using Flask - Towards Data Science
12 pages
Unit-4 Containers and Docker
No ratings yet
Unit-4 Containers and Docker
44 pages
DSA_practical_File[1] Sagar Kumar
No ratings yet
DSA_practical_File[1] Sagar Kumar
35 pages
Content Addressable Memory
No ratings yet
Content Addressable Memory
9 pages
Mdcm Sagar Assignment
No ratings yet
Mdcm Sagar Assignment
15 pages
Term Paper On Cloud Computing
100% (1)
Term Paper On Cloud Computing
7 pages
k
No ratings yet
k
11 pages
Name- Sameer Ali (Ppt)
No ratings yet
Name- Sameer Ali (Ppt)
11 pages
data_1690047573679
No ratings yet
data_1690047573679
13 pages
Data Preprocessing (Sagar)
No ratings yet
Data Preprocessing (Sagar)
31 pages
Name – Sameer Ali Ppt of Machine Learning
No ratings yet
Name – Sameer Ali Ppt of Machine Learning
9 pages
Zero (1) 1
No ratings yet
Zero (1) 1
12 pages
2048 00 HLTQ series
No ratings yet
2048 00 HLTQ series
7 pages
Practical Exam
No ratings yet
Practical Exam
7 pages
NLP
No ratings yet
NLP
9 pages
Compaq nx9420
No ratings yet
Compaq nx9420
41 pages
Project Report Minor Project (1)
No ratings yet
Project Report Minor Project (1)
15 pages
Exam1
No ratings yet
Exam1
6 pages
ehositalap-171017211724
No ratings yet
ehositalap-171017211724
15 pages
Language Translator
No ratings yet
Language Translator
5 pages
Kumar, Shubham
No ratings yet
Kumar, Shubham
5 pages
Unit 1-Chapter 1&2 - Basics of Programming
No ratings yet
Unit 1-Chapter 1&2 - Basics of Programming
61 pages
CS5002NI WK01 L IntroductiontoSoftwareEngineering 93444
No ratings yet
CS5002NI WK01 L IntroductiontoSoftwareEngineering 93444
35 pages
Edistrict.delhigovt.nic.in in en Print PrintOnlineApplication.html q=Mmn5euE4cXEKuUug8Xf2hY5biL89Ed8QFqYEAy3D
No ratings yet
Edistrict.delhigovt.nic.in in en Print PrintOnlineApplication.html q=Mmn5euE4cXEKuUug8Xf2hY5biL89Ed8QFqYEAy3D
4 pages
data_1690047616734
No ratings yet
data_1690047616734
3 pages
Dokumen - Tips Iec 60870 5pdf
No ratings yet
Dokumen - Tips Iec 60870 5pdf
42 pages
ERP Vs SAP & Odoo
No ratings yet
ERP Vs SAP & Odoo
15 pages
Barclays -Srm Shortlist 2025 Batch -Interview on 18-09-2024
No ratings yet
Barclays -Srm Shortlist 2025 Batch -Interview on 18-09-2024
2 pages
Data Structure and Algorithm CO
No ratings yet
Data Structure and Algorithm CO
4 pages
Output Boe
No ratings yet
Output Boe
2 pages
TAC Vista TAC Menta Technical Manual 2008
No ratings yet
TAC Vista TAC Menta Technical Manual 2008
416 pages
CORS: A Presentation
No ratings yet
CORS: A Presentation
16 pages
pps 15
No ratings yet
pps 15
5 pages
Design and Analysis of Algorithms
No ratings yet
Design and Analysis of Algorithms
6 pages
Launch Ástudio Á Álisting Ádetails - AC692x
0% (1)
Launch Ástudio Á Álisting Ádetails - AC692x
3 pages
Company Profile PT Aka Prima Komputindo - New
No ratings yet
Company Profile PT Aka Prima Komputindo - New
6 pages
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
From Everand
IGNOU PGDCA MCS 206 Object Oriented Programming using Java Previous Years solved Papers
Manish Soni
No ratings yet
PrintSelfDeclarationForm (1)
No ratings yet
PrintSelfDeclarationForm (1)
1 page
Course Outline EVS-II Sem3 (UG)
No ratings yet
Course Outline EVS-II Sem3 (UG)
3 pages
CUETScoreCard-233510577176
No ratings yet
CUETScoreCard-233510577176
1 page
CUETApplicationForm-233510577176
No ratings yet
CUETApplicationForm-233510577176
1 page
DSEU_admit_card_(1)[1]
No ratings yet
DSEU_admit_card_(1)[1]
1 page
CMR 900 Particular Operation For The 900-70 Supervisor Unit
No ratings yet
CMR 900 Particular Operation For The 900-70 Supervisor Unit
14 pages
Curriculum Vitae
No ratings yet
Curriculum Vitae
1 page
Cursuri Cisco CCNP - Structura Cursurilor
No ratings yet
Cursuri Cisco CCNP - Structura Cursurilor
31 pages
DICOM Conformance Statement Achieva R1.5
No ratings yet
DICOM Conformance Statement Achieva R1.5
92 pages
Integration Framework Comparison - Spring Integration, Mule ESB or Apache Camel
No ratings yet
Integration Framework Comparison - Spring Integration, Mule ESB or Apache Camel
3 pages
Lab 9.3.3 Designing An IP Subnetting Scheme For Growth: Objectives
No ratings yet
Lab 9.3.3 Designing An IP Subnetting Scheme For Growth: Objectives
4 pages
Datasheet E-Box Wifi en
No ratings yet
Datasheet E-Box Wifi en
2 pages
Siti Sarah Farhanah Binti Mazlen 2018236076 CS1104C
No ratings yet
Siti Sarah Farhanah Binti Mazlen 2018236076 CS1104C
2 pages
Chapter: 5 Normalization of Database Tables: in This Chapter, You Will Learn
No ratings yet
Chapter: 5 Normalization of Database Tables: in This Chapter, You Will Learn
43 pages
Introduction To Writing Network Tests With Pyats: Hank Preston Twitter: @hfpreston April 2020
No ratings yet
Introduction To Writing Network Tests With Pyats: Hank Preston Twitter: @hfpreston April 2020
28 pages

Unit-3 Packaging ML Model

Uploaded by

Unit-3 Packaging ML Model

Uploaded by

Packaging ML Model

Trainer: Ms. Nidhi Grover Raheja

# Load dataset and train a model

# Save the trained model as a .pkl file

2. Build the Package: To create a distributable package, use:

python setup.py sdist bdist_wheel

We can now run predictions by executing:

python -m venv ml_env

2. Activate the virtual environment:

1. Create a project directory for your package.

• Train the Model: Create a script train_model.py in the ml_model_package

# Ensure the directory exists

# Save the model

Create predict.py using echo.> command

# Load the trained model

Create the setup.py file in the root directory (ml_package):

from setuptools import setup, find_packages

You can add an import statement like this in __init__.py:

Now we can install the package locally for testing.

Run the following command to build and install the package:

You might also like

You can add an import statement like this in init.py: