
Open and Efficient AI: Is it Possible?
Monica Livingston
Head of AI Center of Excellence
Intel

AIHUB-1017

#CiscoLive
The Challenge with AI: Energy Consumption

[Chart: energy consumed across AI workloads]
- Training: Large Language Model (LLM)
- Inference: ChatGPT inquiry
- Inference: AI Image Generation (Stable Diffusion)
Solutions: Making AI More Energy Efficient & Sustainable

- Optimize Models
- Optimize Software
- Optimize Hardware + Architecture

Intel Offerings
Developing and Deploying AI Models More Sustainably

- Model Optimization: Quantization, Pruning, Distillation
- Software Optimization: oneAPI, OpenVINO
- Carbon Aware Software: Intel Tiber Platform
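
As a concrete taste of the Quantization bullet above, here is a minimal post-training dynamic quantization sketch in plain PyTorch. It is a generic stand-in, not Intel Neural Compressor or OpenVINO tooling, and the toy model is an assumption:

```python
# Minimal sketch of post-training dynamic quantization in PyTorch.
# Generic illustration only; the model and layer choices are assumptions.
import torch
import torch.nn as nn

# A toy FP32 model standing in for a real network.
model_fp32 = nn.Sequential(
    nn.Linear(512, 512),
    nn.ReLU(),
    nn.Linear(512, 10),
)
model_fp32.eval()

# Convert the weights of Linear layers to INT8; activations are
# quantized dynamically at runtime. Smaller weights and integer
# matmuls reduce memory traffic and energy per inference.
model_int8 = torch.ao.quantization.quantize_dynamic(
    model_fp32, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 512)
with torch.no_grad():
    print(model_int8(x).shape)  # torch.Size([1, 10])
```

Pruning and distillation follow the same theme: shrink the model so each inference moves fewer bytes and burns fewer joules.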
Developing and Deploying AI Hardware More Sustainably
- Processors: General Purpose, Dedicated
- Liquid Cooling: Cold-plate, Immersion

1. Visit https://edc.intel.com/content/www/us/en/products/performance/benchmarks/sustainability/ for more information on how Intel calculates our embodied processor product carbon footprint.
Replace Aging Servers to Save Energy and Costs
Significantly reduce data center infrastructure space, power and costs.

Comparison: replacing 50 1st Generation servers with new 5th Gen Intel Xeon processor-based servers, across four workloads: NGINX TLS, RocksDB, NLP (BERT-Large), and Recommender (DLRM).

Metrics compared per workload: number of 5th Gen Intel Xeon processor-based servers required, lower fleet energy, and reduced CO2 emissions*.

TCO savings*: $254K (NGINX TLS), $192K (RocksDB), $541K (BERT-Large), $449K (DLRM)
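
The consolidation math behind a table like this is simple. A back-of-envelope sketch follows; only the 50-server baseline comes from the slide, and every other number is a hypothetical placeholder, not a measured value from Intel's cited benchmarks:

```python
# Back-of-envelope server-consolidation math. All inputs except the
# 50-server baseline are hypothetical placeholders for illustration.
old_servers = 50                      # 1st Generation servers retired
new_servers = 12                      # assumed 5th Gen Xeon replacement count
old_watts, new_watts = 450.0, 700.0   # assumed average power per server (W)
hours_per_year = 24 * 365

old_kwh = old_servers * old_watts * hours_per_year / 1000
new_kwh = new_servers * new_watts * hours_per_year / 1000
saving = 1 - new_kwh / old_kwh
print(f"Fleet energy down {saving:.0%}: "
      f"{old_kwh:,.0f} kWh -> {new_kwh:,.0f} kWh per year")
```

Fewer, denser servers can draw more power each yet still cut fleet energy, space, and CO2 substantially.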


Why Openness for AI?

1. Choice of HW from all vendors: the ability to pick the best perf/watt solution
2. Portable Models: from DC to Edge to PC, without vendor lock-in
3. AI to be Ubiquitous across the Enterprise

Open ecosystems can remove barriers to Enterprise AI production.
Enterprise AI: Data and Models, 2 distinct worlds today

Data                    | Models
Secure & Confidential   | Based on Public Data Today
Data Locality           | Open/Closed
Mature & Predictable    | Rapid Change
CPU-based               | Accelerator-based

Through the power of open ecosystems:
- By working with industry leaders to provide end-to-end AI enterprise solutions at scale
- By driving an open software ecosystem that bridges enterprise data & AI models
- By shaping the enterprise AI infrastructure through reference architectures, together with partners
- By building safe & AI-capable compute platforms from client to data center

Simplify enterprise generative AI adoption and reduce the time to production of hardened, trusted solutions.
OPEA Solution Requirements

Generative AI pipelines built from industry-leading, composable components for more secure, turnkey enterprise AI deployment.

Efficient | Seamless | Open | Ubiquitous | Trusted | Scalable


Enterprise AI Ecosystem (not exhaustive)
Enterprise companies (access to private data): Enterprise A, Enterprise B, Enterprise C

Enterprise ISVs (data services): Oracle, SAP, Microsoft, Workday, Salesforce, Atlassian

OPEA, built alongside Component ISVs / Open-Source Projects and GSIs (see the RAG sketch after this diagram):
- RAG API definition and reference code
- Secure across data, prompts, weights
- Telemetry and manageability services
- Extensible to vertical use case requirements
- Heterogeneous hardware and multi-vendor support

Enterprise OSVs (system services): VMware / ESXi, RedHat / RHEL, Microsoft / Windows

Private/Public Cloud IaaS

OEMs / ODMs (systems/appliances): Cisco, OEMs, ODMs
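
As a loose illustration of the retrieve-then-generate pattern that an OPEA-style RAG API standardizes (this is not OPEA's actual reference code; the overlap scorer, prompt format, and documents are all hypothetical stand-ins for an embedding model, a vector store, and an LLM serving endpoint):

```python
# Toy retrieval-augmented generation (RAG) loop. Hypothetical
# stand-in for a real pipeline built on embeddings and a vector store.
def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Rank documents by naive word overlap with the query."""
    q = set(query.lower().split())
    ranked = sorted(docs, key=lambda d: len(q & set(d.lower().split())),
                    reverse=True)
    return ranked[:k]

def build_prompt(query: str, context: list[str]) -> str:
    """Ground the model's answer in retrieved enterprise data."""
    ctx = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{ctx}\n\nQuestion: {query}"

docs = [
    "5th Gen Xeon servers reduce fleet energy versus older fleets.",
    "OpenVINO optimizes inference on CPUs.",
    "Cold-plate and immersion cooling cut data center power.",
]
query = "How do new Xeon servers save energy?"
prompt = build_prompt(query, retrieve(query, docs))
print(prompt)  # in a real pipeline, this prompt goes to an LLM endpoint
```

Standardizing this interface is what lets the retriever, data store, and model each come from a different vendor without rework.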
OPEA Offerings

Architecture Blueprints | Reference Implementations | Benchmarks | Certification | Open Governance | Developer Access
Bringing Enterprise AI EVERYWHERE

AI PC: Inference
Node: Light Fine-tuning, Inference
Node / Server Rack: Tuning, Peak Inference
Cluster: Light Training, Tuning, Peak Inference
Super Cluster: Training, Tuning, Peak Inference
Mega Cluster: Large Scale Training & Inference

AI PC: Broadest AI SW Ecosystem
Enterprise & Edge: Open Standard, "Ready to Use" AI
Data Center: Open, Scalable Systems & Reference Arch
AI on Cisco M7 UCS powered by Intel Xeon

IPEX (Intel Extension for PyTorch) running Llama-2-7B

TextGen Test          Precision          Latency Response
Text Continuation     INT8, BF16, FP32   < 100 ms
Text Translation     INT8, BF16, FP32   < 100 ms
Question Response     INT8, BF16, FP32   < 100 ms
- The latest M7 X-Series now runs efficient inferencing for LLMs; a minimal code sketch follows.
- Refer to the At-A-Glance document for details on the setup.
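
As a rough idea of what the BF16 inference path above looks like in code (the actual benchmark configuration is in the At-A-Glance document; the model checkpoint, prompt, and generation settings here are assumptions):

```python
# Minimal sketch of BF16 Llama-2-7B inference with Intel Extension
# for PyTorch (IPEX) on Xeon. Model ID and settings are assumptions.
import torch
import intel_extension_for_pytorch as ipex
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16
).eval()

# ipex.optimize applies Xeon-specific graph and kernel optimizations
# (e.g. AMX BF16 matmuls) to speed up CPU token generation.
model = ipex.optimize(model, dtype=torch.bfloat16)

prompt = "Translate to French: Hello, world."
inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The same script can be rerun at INT8 or FP32 to reproduce the precision sweep in the table above.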
Open and Efficient AI: Is it Possible?
Thank you

#CiscoLive
