SlideShare a Scribd company logo
Yury Gorbachev
1
Intel ConfidentialIntel Confidential
Brief OpenVINO™ Introduction
• OpenVINO ™ is
• set of tools and libraries for CV/DL application developers
• high performance, low footprint solution for deployment
• API for unified access to CV/DL capabilities of Intel platforms
• OpenVINO ™ is not
• tool for data scientists
• solution for training of deep learning models
Intel ConfidentialIntel Confidential 3
OpenVINO™ Benefits
• Powerful combination of highly optimized Classical CV and DL primitives
• Allows to run inference on Intel CPU, GPU, VPU and FPGA
• Best performing solution on all Intel architectures
• ~2x faster than fastest TensorFlow and MxNet on CPU
• Smallest execution footprint (lowest memory consumption)
• 2x smaller than MXNet, 4x smaller than TensorFlow
• Minimum # of dependencies (no dependencies on training frameworks)
• Multiple OS support (Linux and Windows)
Intel ConfidentialIntel Confidential 4
OpenVINO™ Specifics
• Highly optimized implementations of DL primitives
• Most efficient on each Intel platform
• Focus on inference only
• Aggressive layer fusion at the inference step
• Including HW accelerated steps tuned for inference
• Efficient activation memory reuse
• Often close to bare minimum
Intel ConfidentialIntel Confidential 5
DL Workflow
Caffe
MXNet
TensorFlow
Caffe2
PyTorch
Serialized
trained
DL model
ONNX
MKLDNN
Plugin
clDNN
Plugin
FPGA
Plugin
Myriad
Plugin
Inference
Engine
Deploy
Application
Model
Optimizer
IR
.xml
.bin
Step 1: Import model from Framework format
to Framework independent representation
Step 2: Update application
to use Inference Engine API
Eliminate unnecessary layers,
lossless fusion where possible
Remove framework dependency
Accuracy against original model ensured
Intel ConfidentialIntel Confidential 6
Customization Capabilities
• OpenVINO™ provides good coverage of DL primitives out of the box
• Constantly growing list of primitives to support new DL topologies
• Frequent releases, substantial additions
• Not a problem if something is missing!
• Good extension mechanism for adding new primitives
• Possible to add proprietary layers, more optimized layers, etc.
• Both in Model Optimizer and Inference Engine (import and run)
Intel ConfidentialIntel Confidential 7
Application Design Workflow
Desig
n
Verify
logic
Debug
Fix
Best done on CPU:
- Easier to verify
- Simpler debugging procedures
CHANGE
TARGET
Check
scalability
Fix
pipeline
System
testing
Best done on Actual target (e.g. VPU):
- Exact performance
- Correct timings
Accuracy and functionality
across targets
Intel ConfidentialIntel Confidential 8
Heterogeneous Execution
• When a certain primitive is not supported on a target
• Custom proprietary primitive or inefficient HW for a task
• Heterogeneous execution ensures full topology execution
• Automatic data transfer between targets whenever needed
• Work splitting and scheduling
• No need to do any manual network manipulations!
FPGA CPU
Intel ConfidentialIntel Confidential 9
Power of Parallel Execution
Asynchronous API provides capabilities for:
• Running main thread in parallel with ongoing inference (CPU/GPU)
• Hiding data transfer latency for accelerators (VPU, FPGA)
• Filling accelerators with work
Transfer 1
Inference 1
Transfer 2
Transfer 1 Transfer 2
Inference 2
Inference 1 Inference 2
Result 1
Result 1
Result 2
Result 2
Sync API
Async API
Intel ConfidentialIntel Confidential 10
Efficient Frame Preprocessing
DECODE
Layout
Transform
Resize to
network
Detection
network
Resize to
network
Object Analysis
Network
Crop
Preprocessing
DL Inference
• Preprocessing is typically done via manual coding or using libraries
• OpenCV is most popular
• Fastest OpenCV build is available in OpenVINO™
• Deep Learning Inference Engine encapsulates basic preprocessing capability
• Automatic frame resize based on input frame and network size
• Automatic layout conversion and cropping
• In a nutshell –> Just provide frame and it will be suitable for inference automatically
Intel ConfidentialIntel Confidential 11
INT8 Quantization for CPU
• INT8 provides additional acceleration using AVX-512
• Not all CPU targets support it
• Very minor quality/accuracy loss
• No retraining is required
• No code update is needed
IR
Dataset
Analysis
tool Updated IR
Intel ConfidentialIntel Confidential 12
OpenVINO™ Model Zoo
• OpenVINO provides pre-trained DL models for deployment
• Lightweight, low compute, real time on Intel platforms
• Cover popular CV use cases
• Face analysis, Security use-cases (person, vehicle, bicycle detection)
• Transportation analytics (road segmentation, vehicle/pedestrian detection)
Intel ConfidentialIntel Confidential 13
Extensive Set of Samples
• Actual examples of applications, not just API demonstration
• Switching between targets, work distribution
• Multiple models in pipeline with preprocessing
• Open Models Zoo demonstration
14

More Related Content

PPTX
OpenVINO introduction
PDF
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
PPTX
ViT.pptx
PPTX
Getting started with TensorFlow
PPTX
ONNX - The Lingua Franca of Deep Learning
PPTX
What is TensorFlow? | Introduction to TensorFlow | TensorFlow Tutorial For Be...
PDF
A la découverte de kubernetes
PDF
Introduction to Kubernetes and Google Container Engine (GKE)
OpenVINO introduction
"How to Get the Best Deep Learning Performance with the OpenVINO Toolkit," a ...
ViT.pptx
Getting started with TensorFlow
ONNX - The Lingua Franca of Deep Learning
What is TensorFlow? | Introduction to TensorFlow | TensorFlow Tutorial For Be...
A la découverte de kubernetes
Introduction to Kubernetes and Google Container Engine (GKE)

What's hot (20)

PPTX
Attention Is All You Need
PDF
GitOps with Amazon EKS Anywhere by Dan Budris
PPTX
TensorRT survey
PDF
Generative models (Geek hub 2021 lecture)
PDF
PR-366: A ConvNet for 2020s
PPTX
Introduction to k3s and k3sup
PPTX
Kubernetes 101 for Beginners
PDF
Introduction To TensorFlow
PDF
Intro To Convolutional Neural Networks
PPTX
[Paper Reading] Attention is All You Need
PPTX
Multi tasking learning
PPTX
Kubernetes 101
PPTX
Depth estimation using deep learning
PDF
Kubernetes - introduction
PPTX
Introduction to continual learning
PPTX
ConvNeXt: A ConvNet for the 2020s explained
PDF
DevJam 2019 - Introduction to Kubernetes
PPTX
Image Segmentation Using Deep Learning : A survey
PDF
Get started with gitops and flux
PDF
Emerging Properties in Self-Supervised Vision Transformers
Attention Is All You Need
GitOps with Amazon EKS Anywhere by Dan Budris
TensorRT survey
Generative models (Geek hub 2021 lecture)
PR-366: A ConvNet for 2020s
Introduction to k3s and k3sup
Kubernetes 101 for Beginners
Introduction To TensorFlow
Intro To Convolutional Neural Networks
[Paper Reading] Attention is All You Need
Multi tasking learning
Kubernetes 101
Depth estimation using deep learning
Kubernetes - introduction
Introduction to continual learning
ConvNeXt: A ConvNet for the 2020s explained
DevJam 2019 - Introduction to Kubernetes
Image Segmentation Using Deep Learning : A survey
Get started with gitops and flux
Emerging Properties in Self-Supervised Vision Transformers
Ad

Similar to Develop and optimize CV/DL applications with Intel OpenVINO toolkit (20)

PPTX
How to Get the Best Deep Learning performance with OpenVINO Toolkit
PDF
"OpenCV for Embedded: Lessons Learned," a Presentation from itseez
PPTX
OpenCV for Embedded: Lessons Learned
PPTX
Enabling Cross-platform Deep Learning Applications with Intel OpenVINO™
PDF
Faster deep learning solutions from training to inference - Michele Tameni - ...
PPTX
Top 10 dev ops tools (1)
PDF
Scale Up Performance with Intel® Development
PDF
Jython for embedded software validation
PDF
Intel Parallel Studio XE 2016 網路開發工具包新版本功能介紹(現已上市,歡迎詢價)
PDF
Distributed Tensorflow with Kubernetes - data2day - Jakob Karalus
PDF
Multicore 101: Migrating Embedded Apps to Multicore with Linux
PDF
Develop, Deploy, and Innovate with Intel® Cluster Ready
PPTX
Automation for cloud
PPTX
OS for AI: Elastic Microservices & the Next Gen of ML
PPTX
Hyper-Theading
PDF
The journey to Native Cloud Architecture & Microservices, tracing the footste...
PDF
Get the Facts: Oracle's Unbreakable Enterprise Kernel
PDF
"Efficient Implementation of Convolutional Neural Networks using OpenCL on FP...
PDF
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
PPTX
.NET microservices with Azure Service Fabric
How to Get the Best Deep Learning performance with OpenVINO Toolkit
"OpenCV for Embedded: Lessons Learned," a Presentation from itseez
OpenCV for Embedded: Lessons Learned
Enabling Cross-platform Deep Learning Applications with Intel OpenVINO™
Faster deep learning solutions from training to inference - Michele Tameni - ...
Top 10 dev ops tools (1)
Scale Up Performance with Intel® Development
Jython for embedded software validation
Intel Parallel Studio XE 2016 網路開發工具包新版本功能介紹(現已上市,歡迎詢價)
Distributed Tensorflow with Kubernetes - data2day - Jakob Karalus
Multicore 101: Migrating Embedded Apps to Multicore with Linux
Develop, Deploy, and Innovate with Intel® Cluster Ready
Automation for cloud
OS for AI: Elastic Microservices & the Next Gen of ML
Hyper-Theading
The journey to Native Cloud Architecture & Microservices, tracing the footste...
Get the Facts: Oracle's Unbreakable Enterprise Kernel
"Efficient Implementation of Convolutional Neural Networks using OpenCL on FP...
TDC2019 Intel Software Day - Tecnicas de Programacao Paralela em Machine Lear...
.NET microservices with Azure Service Fabric
Ad

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Approach and Philosophy of On baking technology
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Understanding_Digital_Forensics_Presentation.pptx
Modernizing your data center with Dell and AMD
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Digital-Transformation-Roadmap-for-Companies.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Spectral efficient network and resource selection model in 5G networks
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The AUB Centre for AI in Media Proposal.docx
Approach and Philosophy of On baking technology
The Rise and Fall of 3GPP – Time for a Sabbatical?
Electronic commerce courselecture one. Pdf
NewMind AI Monthly Chronicles - July 2025
20250228 LYD VKU AI Blended-Learning.pptx
Chapter 3 Spatial Domain Image Processing.pdf

Develop and optimize CV/DL applications with Intel OpenVINO toolkit

  • 2. Intel ConfidentialIntel Confidential Brief OpenVINO™ Introduction • OpenVINO ™ is • set of tools and libraries for CV/DL application developers • high performance, low footprint solution for deployment • API for unified access to CV/DL capabilities of Intel platforms • OpenVINO ™ is not • tool for data scientists • solution for training of deep learning models
  • 3. Intel ConfidentialIntel Confidential 3 OpenVINO™ Benefits • Powerful combination of highly optimized Classical CV and DL primitives • Allows to run inference on Intel CPU, GPU, VPU and FPGA • Best performing solution on all Intel architectures • ~2x faster than fastest TensorFlow and MxNet on CPU • Smallest execution footprint (lowest memory consumption) • 2x smaller than MXNet, 4x smaller than TensorFlow • Minimum # of dependencies (no dependencies on training frameworks) • Multiple OS support (Linux and Windows)
  • 4. Intel ConfidentialIntel Confidential 4 OpenVINO™ Specifics • Highly optimized implementations of DL primitives • Most efficient on each Intel platform • Focus on inference only • Aggressive layer fusion at the inference step • Including HW accelerated steps tuned for inference • Efficient activation memory reuse • Often close to bare minimum
  • 5. Intel ConfidentialIntel Confidential 5 DL Workflow Caffe MXNet TensorFlow Caffe2 PyTorch Serialized trained DL model ONNX MKLDNN Plugin clDNN Plugin FPGA Plugin Myriad Plugin Inference Engine Deploy Application Model Optimizer IR .xml .bin Step 1: Import model from Framework format to Framework independent representation Step 2: Update application to use Inference Engine API Eliminate unnecessary layers, lossless fusion where possible Remove framework dependency Accuracy against original model ensured
  • 6. Intel ConfidentialIntel Confidential 6 Customization Capabilities • OpenVINO™ provides good coverage of DL primitives out of the box • Constantly growing list of primitives to support new DL topologies • Frequent releases, substantial additions • Not a problem if something is missing! • Good extension mechanism for adding new primitives • Possible to add proprietary layers, more optimized layers, etc. • Both in Model Optimizer and Inference Engine (import and run)
  • 7. Intel ConfidentialIntel Confidential 7 Application Design Workflow Desig n Verify logic Debug Fix Best done on CPU: - Easier to verify - Simpler debugging procedures CHANGE TARGET Check scalability Fix pipeline System testing Best done on Actual target (e.g. VPU): - Exact performance - Correct timings Accuracy and functionality across targets
  • 8. Intel ConfidentialIntel Confidential 8 Heterogeneous Execution • When a certain primitive is not supported on a target • Custom proprietary primitive or inefficient HW for a task • Heterogeneous execution ensures full topology execution • Automatic data transfer between targets whenever needed • Work splitting and scheduling • No need to do any manual network manipulations! FPGA CPU
  • 9. Intel ConfidentialIntel Confidential 9 Power of Parallel Execution Asynchronous API provides capabilities for: • Running main thread in parallel with ongoing inference (CPU/GPU) • Hiding data transfer latency for accelerators (VPU, FPGA) • Filling accelerators with work Transfer 1 Inference 1 Transfer 2 Transfer 1 Transfer 2 Inference 2 Inference 1 Inference 2 Result 1 Result 1 Result 2 Result 2 Sync API Async API
  • 10. Intel ConfidentialIntel Confidential 10 Efficient Frame Preprocessing DECODE Layout Transform Resize to network Detection network Resize to network Object Analysis Network Crop Preprocessing DL Inference • Preprocessing is typically done via manual coding or using libraries • OpenCV is most popular • Fastest OpenCV build is available in OpenVINO™ • Deep Learning Inference Engine encapsulates basic preprocessing capability • Automatic frame resize based on input frame and network size • Automatic layout conversion and cropping • In a nutshell –> Just provide frame and it will be suitable for inference automatically
  • 11. Intel ConfidentialIntel Confidential 11 INT8 Quantization for CPU • INT8 provides additional acceleration using AVX-512 • Not all CPU targets support it • Very minor quality/accuracy loss • No retraining is required • No code update is needed IR Dataset Analysis tool Updated IR
  • 12. Intel ConfidentialIntel Confidential 12 OpenVINO™ Model Zoo • OpenVINO provides pre-trained DL models for deployment • Lightweight, low compute, real time on Intel platforms • Cover popular CV use cases • Face analysis, Security use-cases (person, vehicle, bicycle detection) • Transportation analytics (road segmentation, vehicle/pedestrian detection)
  • 13. Intel ConfidentialIntel Confidential 13 Extensive Set of Samples • Actual examples of applications, not just API demonstration • Switching between targets, work distribution • Multiple models in pipeline with preprocessing • Open Models Zoo demonstration
  • 14. 14