W E B I N A R S E R I E S
I’m Building a Data Lake,
So I Don’t Need Data
Virtualization
W E B I N A R S E R I E S
I’m Building a Data
Lake, So I Don’t Need
Data Virtualization
Paul Moxon
SVP Data Architectures & Chief Evangelist
Denodo
20th January 2021
Paul Moxon
SVP Data Architectures & Chief
Evangelist, Denodo
Speakers
1. Today’s Myth
2. Origins of the Myth
3. Just the Facts, Ma’am
4. The Proof is in the Pudding
5. Conclusions
6. Q&A
7. Next Steps
Agenda
5
Myth #3:
I’m building a Data Lake. I
don’t need Data Virtualization.
Origins of the Myth
7
A Bit of History – Etymology of “Data Lake”
https://jamesdixon.wordpress.com/2010/10/14/pentaho-hadoop-and-data-lakes/ (with my emphasis)
Pentaho’s CTO James Dixon is credited with coining the
term "data lake". He described it in his blog in 2010:
"If you think of a data mart as a store of bottled
water – cleansed and packaged and structured
for easy consumption – the data lake is a large
body of water in a more natural state. The
contents of the data lake stream in from a source
to fill the lake, and various users of the lake can
come to examine, dive in, or take samples."
8
Concept of a Data Lake
9
Data Lakes Become Data Science Playgrounds…
The early data scientists saw Hadoop as
their personal supercomputer.
Hadoop-based Data Lakes helped
democratize access to state of the art
supercomputing with off-the-shelf HW
(and later cloud).
The industry push for BI made Hadoop–
based solutions the standard to bring
modern analytics to any corporation.
10
Gartner Hype Cycle – Analytics & Business Intelligence, 2020
11
Changing the Data Lake Goals
“The popular view is that a data
lake will be the one destination
for all the data in their enterprise
and the optimal platform for all
their analytics.”
Nick Heudecker, Gartner
12
…Data lakes lack semantic consistency and
governed metadata. Meeting the needs of
wider audiences require curated repositories
with governance, semantic consistency and
access controls.”
Just the Facts, Ma’am
14
Consumers
BI/Visualization
“Bring Your Own
Tool” reporting and
visualization
capabilities
integrated into the
Data Lake
Analytics
Workbench
Self-service analytics
workbench
Data Sources
Any internal or
external data
source that
should be copied
into the Data
Lake
Data Lakes Reference Architecture
Data Sources
Any internal or
external data
source that
should be copied
into the Data
Lake
Search and Browse
Search and browse data sets, explore relationships, sample queries and export results
Data Governance and Catalog
Governance and Cataloging of business and technical data assets (Stewardship, Curation, Profiling, Quality)
Data and Operations Management
Provides a broad set of services across the ecosystem to enable security, auditing, scheduling, version
management, policies, etc.
Data Ingestion
Physical or virtual
services to ingest
and integrate data
rapidly across a
variety of sources
and data types
through a common
‘ingest’ layer
Data Landing
Centralized location to land new
data entering ecosystem
separated via logical partitions,
based on source, data type,
characteristics, and governance
requirements
Raw Zone
Original data received from
the originating system plus
tagging and typing to aid in
understanding
Selection & Provisioning
Services to select and integrate
data objects, including
provisioning and prep of data
ingested in Raw Zone and/or
accessed via Trusted and
Consumption Zones
Trusted Zone
Data is enhanced with
business rules and identifiers
added to enable integration
Standardization
Services to consolidate, enrich,
profile and steward datasets
and metadata for on-going
consumption
Refined Zone
Data is conformed to specific
uses as ‘fit for purpose’ data
sets supporting common
models and standards
Exploratory Zone
Provides a flexible and intuitive way for consumers (data stewards, data engineers, and data scientists)
to research and manage data
Data
Delivery
Services
Services to
connect
deliver data,
metadata, and
insights to
consumers for
specific use
cases
Data Sources Technology Capabilities Delivery Capabilities
Consumers
Data Marketplace
User-friendly, SSO
enabled & multi
tenant front-end
surfacing the data
lifecycle services
supported by the
Data Lake
BI/Visualization
“Bring Your Own
Tool” reporting and
visualization
capabilities
integrated into the
Data Lake
Analytics
Workbench
Self-service analytics
workbench
System/App/
Device
Non-user consumers
of data assets
15
Real World Data Lake Example – Using AWS
Trusted Zone
Raw Data Zone Refined Zone
Transformation Transformation Data Consumers
Networking, Infrastructure & Security
Data Ingestion
Data
Sources
Data Catalog and Search – Asset Registry Workflow Orchestration, DevOps and CI/CD
16
Real World Data Lake Example – Using Azure
17
Data Virtualization as the Data Lake ‘Delivery Layer’
1. As the Data Delivery
Services layer
2. In the Refined Zone layer
3. As the self-service Data
Catalog
4. As part of the Exploratory
Zone
18
Data Virtualization as the ‘Data Delivery Services’ Layer
Data
Virtualization
• Delivery Services must support
multiple data delivery styles and
protocols
• Real-time and batch
• Request/response and reactive
(event-driven)
• Ad-hoc queries and APIs
• Data Lake needs a delivery layer
and Data Virtualization fits this
requirement
• Enables access to Data Lake and
non-Data Lake sources through
single, unified access layer
• Data Virtualization provides data
catalog for searching, finding,
and understanding data available
in Data Lake
• Provides security and governance
capabilities for Data Lake
19
Real World Data Lake Example – Using Azure (Redux)
20
Real World Data Lake Example – Using Azure (Revised)
The Proof is in the Pudding
22
Customer Example - FESTO
• Founded 1925
• Annual revenues (FY
2018) €3.2 B
• Over 21,000
employees
• Headquarters in
Germany
• World´s leading
supplier of
automation
technology and
technical education.
BUSINESS NEED
• Optimize operational efficiency, automate manufacturing processes, and deliver on-
demand services to business consumers
• Find smarter ways to aggregate and analyze data
• An agile solution that enables the monetization of customer-facing data products
• Free business users from IT reliance to become self-sufficient with reporting and
analysis
THE CHALLENGE:
Find an agile way to integrate data from existing silos, including an analytical data lake,
machine data in an IoT data lake, and traditional databases and data warehouse, that will
reduce dependencies from business users on IT and provides quick turnaround and
flexibility.
23
FESTO – Digital Transformation Journey
24
FESTO – Digital Transformation
25
Customer - FESTO
SOLUTION:
• Festo developed a Big Data
Analytics Framework to
provide a data marketplace to
better support the business
• Using the Denodo Platform to
integrate data from numerous
on-prem and cloud systems in
real-time, including Cloud-
based IoT Data Lake for
machine data
• A unified layer for consistent
data access and governance
across different data silos
26
Pilot Use Case – Energy Transparency System 2.0
Summary & Conclusions
1. Large data lake projects are complex environments
that will benefit from a virtual ‘consumption’ layer
2. In most cases, not all the data is going to be in the
data lake, so data lake data will need integrating
with non-lake data.
3. Data virtualization provides a data delivery layer
that simplifies and accelerates data lake access.
4. It provides a governance, management, and
security capability required for successful data lake
implementation
Key Takeaways
29
Myth #3:
I’m building a Data Lake. I
don’t need Data Virtualization
Q&A
31
Next Steps
Access Denodo Platform 8.0 in the Cloud.
Start your Free Trial today!
www.denodo.com/free-trials
GET STARTED TODAY
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm,
without prior the written authorization from Denodo Technologies.

More Related Content

PDF
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
PDF
Data Virtualization: From Zero to Hero
PDF
Performance Acceleration: Summaries, Recommendation, MPP and more
PDF
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
PDF
Advanced Analytics and Machine Learning with Data Virtualization
PPTX
Technical Demonstration - Denodo Platform 7.0
PDF
Introduction to Modern Data Virtualization (US)
PDF
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Data Virtualization: From Zero to Hero
Performance Acceleration: Summaries, Recommendation, MPP and more
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Advanced Analytics and Machine Learning with Data Virtualization
Technical Demonstration - Denodo Platform 7.0
Introduction to Modern Data Virtualization (US)
Building a Logical Data Fabric using Data Virtualization (ASEAN)

What's hot (20)

PDF
Best Practices: Data Virtualization Perspectives and Best Practices
PDF
In Memory Parallel Processing for Big Data Scenarios
PDF
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
PDF
GDPR Noncompliance: Avoid the Risk with Data Virtualization
PDF
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
PDF
Advanced Analytics and Machine Learning with Data Virtualization
PPTX
Fast Data Strategy Houston Roadshow Presentation
PDF
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
PDF
Data Virtualization: An Introduction
PDF
Best Practices in the Cloud for Data Management (US)
PDF
Why Data Virtualization? An Introduction
PDF
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
PDF
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
PDF
Can data virtualization uphold performance with complex queries?
PDF
SAP Analytics Cloud: Haben Sie schon alle Datenquellen im Live-Zugriff?
PPTX
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
PDF
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
PDF
An introduction to data virtualization in business intelligence
PDF
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Best Practices: Data Virtualization Perspectives and Best Practices
In Memory Parallel Processing for Big Data Scenarios
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Accelerate Self-Service Analytics with Data Virtualization and Visualization
GDPR Noncompliance: Avoid the Risk with Data Virtualization
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Advanced Analytics and Machine Learning with Data Virtualization
Fast Data Strategy Houston Roadshow Presentation
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Virtualization: An Introduction
Best Practices in the Cloud for Data Management (US)
Why Data Virtualization? An Introduction
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Can data virtualization uphold performance with complex queries?
SAP Analytics Cloud: Haben Sie schon alle Datenquellen im Live-Zugriff?
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
An introduction to data virtualization in business intelligence
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Ad

Similar to Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization (20)

PDF
Data Virtualization: An Essential Component of a Cloud Data Lake
PDF
Data Lakes: A Logical Approach for Faster Unified Insights
PDF
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
PDF
Unlock Your Data for ML & AI using Data Virtualization
PDF
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
PDF
What is the future of data strategy?
PDF
The Central Hub: Defining the Data Lake
PDF
Future of Data Strategy (ASEAN)
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
PDF
Are You Killing the Benefits of Your Data Lake?
PDF
Data lakes
PDF
Data Lakes: A Logical Approach for Faster Unified Insights (ASEAN)
PDF
Bridging the Last Mile: Getting Data to the People Who Need It
PDF
Self-Service Analytics with Guard Rails
PDF
A Key to Real-time Insights in a Post-COVID World (ASEAN)
PDF
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
PDF
Future of Data Strategy
PDF
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
 
PDF
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
PDF
Big data data lake and beyond
Data Virtualization: An Essential Component of a Cloud Data Lake
Data Lakes: A Logical Approach for Faster Unified Insights
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
What is the future of data strategy?
The Central Hub: Defining the Data Lake
Future of Data Strategy (ASEAN)
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Are You Killing the Benefits of Your Data Lake?
Data lakes
Data Lakes: A Logical Approach for Faster Unified Insights (ASEAN)
Bridging the Last Mile: Getting Data to the People Who Need It
Self-Service Analytics with Guard Rails
A Key to Real-time Insights in a Post-COVID World (ASEAN)
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Future of Data Strategy
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Big data data lake and beyond
Ad

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
What you need to know about Generative AI and Data Management?
PDF
Mastering Data Compliance in a Dynamic Business Landscape
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
PDF
Drive Data Privacy Regulatory Compliance
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
PDF
How to Build Your Data Marketplace with Data Virtualization?
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Enabling Data Catalog users with advanced usability
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Enterprise Monitoring and Auditing in Denodo
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Achieving Self-Service Analytics with a Governed Data Services Layer
What you need to know about Generative AI and Data Management?
Mastering Data Compliance in a Dynamic Business Landscape
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Drive Data Privacy Regulatory Compliance
Знакомство с виртуализацией данных для профессионалов в области данных
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Lunch and Learn ANZ: Key Takeaways for 2023!
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
How to Build Your Data Marketplace with Data Virtualization?
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Enabling Data Catalog users with advanced usability
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
GenAI y el futuro de la gestión de datos: mitos y realidades

Recently uploaded (20)

PDF
©️ 01_Algorithm for Microsoft New Product Launch - handling web site - by Ale...
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPT
expt-design-lecture-12 hghhgfggjhjd (1).ppt
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
PPTX
ai agent creaction with langgraph_presentation_
PPTX
indiraparyavaranbhavan-240418134200-31d840b3.pptx
PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PPTX
Crypto_Trading_Beginners.pptxxxxxxxxxxxxxx
PPTX
Fundementals of R Programming_Class_2.pptx
PPTX
IMPACT OF LANDSLIDE.....................
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
PDF
Loose-Leaf for Auditing & Assurance Services A Systematic Approach 11th ed. E...
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
AI AND ML PROPOSAL PRESENTATION MUST.pptx
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
Business_Capability_Map_Collection__pptx
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PDF
Navigating the Thai Supplements Landscape.pdf
PDF
Session 11 - Data Visualization Storytelling (2).pdf
©️ 01_Algorithm for Microsoft New Product Launch - handling web site - by Ale...
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
expt-design-lecture-12 hghhgfggjhjd (1).ppt
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
ai agent creaction with langgraph_presentation_
indiraparyavaranbhavan-240418134200-31d840b3.pptx
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
Crypto_Trading_Beginners.pptxxxxxxxxxxxxxx
Fundementals of R Programming_Class_2.pptx
IMPACT OF LANDSLIDE.....................
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
Loose-Leaf for Auditing & Assurance Services A Systematic Approach 11th ed. E...
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
AI AND ML PROPOSAL PRESENTATION MUST.pptx
1 hour to get there before the game is done so you don’t need a car seat for ...
Business_Capability_Map_Collection__pptx
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
Navigating the Thai Supplements Landscape.pdf
Session 11 - Data Visualization Storytelling (2).pdf

Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization

  • 1. W E B I N A R S E R I E S I’m Building a Data Lake, So I Don’t Need Data Virtualization
  • 2. W E B I N A R S E R I E S I’m Building a Data Lake, So I Don’t Need Data Virtualization Paul Moxon SVP Data Architectures & Chief Evangelist Denodo 20th January 2021
  • 3. Paul Moxon SVP Data Architectures & Chief Evangelist, Denodo Speakers
  • 4. 1. Today’s Myth 2. Origins of the Myth 3. Just the Facts, Ma’am 4. The Proof is in the Pudding 5. Conclusions 6. Q&A 7. Next Steps Agenda
  • 5. 5 Myth #3: I’m building a Data Lake. I don’t need Data Virtualization.
  • 7. 7 A Bit of History – Etymology of “Data Lake” https://jamesdixon.wordpress.com/2010/10/14/pentaho-hadoop-and-data-lakes/ (with my emphasis) Pentaho’s CTO James Dixon is credited with coining the term "data lake". He described it in his blog in 2010: "If you think of a data mart as a store of bottled water – cleansed and packaged and structured for easy consumption – the data lake is a large body of water in a more natural state. The contents of the data lake stream in from a source to fill the lake, and various users of the lake can come to examine, dive in, or take samples."
  • 8. 8 Concept of a Data Lake
  • 9. 9 Data Lakes Become Data Science Playgrounds… The early data scientists saw Hadoop as their personal supercomputer. Hadoop-based Data Lakes helped democratize access to state of the art supercomputing with off-the-shelf HW (and later cloud). The industry push for BI made Hadoop– based solutions the standard to bring modern analytics to any corporation.
  • 10. 10 Gartner Hype Cycle – Analytics & Business Intelligence, 2020
  • 11. 11 Changing the Data Lake Goals “The popular view is that a data lake will be the one destination for all the data in their enterprise and the optimal platform for all their analytics.” Nick Heudecker, Gartner
  • 12. 12 …Data lakes lack semantic consistency and governed metadata. Meeting the needs of wider audiences require curated repositories with governance, semantic consistency and access controls.”
  • 13. Just the Facts, Ma’am
  • 14. 14 Consumers BI/Visualization “Bring Your Own Tool” reporting and visualization capabilities integrated into the Data Lake Analytics Workbench Self-service analytics workbench Data Sources Any internal or external data source that should be copied into the Data Lake Data Lakes Reference Architecture Data Sources Any internal or external data source that should be copied into the Data Lake Search and Browse Search and browse data sets, explore relationships, sample queries and export results Data Governance and Catalog Governance and Cataloging of business and technical data assets (Stewardship, Curation, Profiling, Quality) Data and Operations Management Provides a broad set of services across the ecosystem to enable security, auditing, scheduling, version management, policies, etc. Data Ingestion Physical or virtual services to ingest and integrate data rapidly across a variety of sources and data types through a common ‘ingest’ layer Data Landing Centralized location to land new data entering ecosystem separated via logical partitions, based on source, data type, characteristics, and governance requirements Raw Zone Original data received from the originating system plus tagging and typing to aid in understanding Selection & Provisioning Services to select and integrate data objects, including provisioning and prep of data ingested in Raw Zone and/or accessed via Trusted and Consumption Zones Trusted Zone Data is enhanced with business rules and identifiers added to enable integration Standardization Services to consolidate, enrich, profile and steward datasets and metadata for on-going consumption Refined Zone Data is conformed to specific uses as ‘fit for purpose’ data sets supporting common models and standards Exploratory Zone Provides a flexible and intuitive way for consumers (data stewards, data engineers, and data scientists) to research and manage data Data Delivery Services Services to connect deliver data, metadata, and insights to consumers for specific use cases Data Sources Technology Capabilities Delivery Capabilities Consumers Data Marketplace User-friendly, SSO enabled & multi tenant front-end surfacing the data lifecycle services supported by the Data Lake BI/Visualization “Bring Your Own Tool” reporting and visualization capabilities integrated into the Data Lake Analytics Workbench Self-service analytics workbench System/App/ Device Non-user consumers of data assets
  • 15. 15 Real World Data Lake Example – Using AWS Trusted Zone Raw Data Zone Refined Zone Transformation Transformation Data Consumers Networking, Infrastructure & Security Data Ingestion Data Sources Data Catalog and Search – Asset Registry Workflow Orchestration, DevOps and CI/CD
  • 16. 16 Real World Data Lake Example – Using Azure
  • 17. 17 Data Virtualization as the Data Lake ‘Delivery Layer’ 1. As the Data Delivery Services layer 2. In the Refined Zone layer 3. As the self-service Data Catalog 4. As part of the Exploratory Zone
  • 18. 18 Data Virtualization as the ‘Data Delivery Services’ Layer Data Virtualization • Delivery Services must support multiple data delivery styles and protocols • Real-time and batch • Request/response and reactive (event-driven) • Ad-hoc queries and APIs • Data Lake needs a delivery layer and Data Virtualization fits this requirement • Enables access to Data Lake and non-Data Lake sources through single, unified access layer • Data Virtualization provides data catalog for searching, finding, and understanding data available in Data Lake • Provides security and governance capabilities for Data Lake
  • 19. 19 Real World Data Lake Example – Using Azure (Redux)
  • 20. 20 Real World Data Lake Example – Using Azure (Revised)
  • 21. The Proof is in the Pudding
  • 22. 22 Customer Example - FESTO • Founded 1925 • Annual revenues (FY 2018) €3.2 B • Over 21,000 employees • Headquarters in Germany • World´s leading supplier of automation technology and technical education. BUSINESS NEED • Optimize operational efficiency, automate manufacturing processes, and deliver on- demand services to business consumers • Find smarter ways to aggregate and analyze data • An agile solution that enables the monetization of customer-facing data products • Free business users from IT reliance to become self-sufficient with reporting and analysis THE CHALLENGE: Find an agile way to integrate data from existing silos, including an analytical data lake, machine data in an IoT data lake, and traditional databases and data warehouse, that will reduce dependencies from business users on IT and provides quick turnaround and flexibility.
  • 23. 23 FESTO – Digital Transformation Journey
  • 24. 24 FESTO – Digital Transformation
  • 25. 25 Customer - FESTO SOLUTION: • Festo developed a Big Data Analytics Framework to provide a data marketplace to better support the business • Using the Denodo Platform to integrate data from numerous on-prem and cloud systems in real-time, including Cloud- based IoT Data Lake for machine data • A unified layer for consistent data access and governance across different data silos
  • 26. 26 Pilot Use Case – Energy Transparency System 2.0
  • 28. 1. Large data lake projects are complex environments that will benefit from a virtual ‘consumption’ layer 2. In most cases, not all the data is going to be in the data lake, so data lake data will need integrating with non-lake data. 3. Data virtualization provides a data delivery layer that simplifies and accelerates data lake access. 4. It provides a governance, management, and security capability required for successful data lake implementation Key Takeaways
  • 29. 29 Myth #3: I’m building a Data Lake. I don’t need Data Virtualization
  • 30. Q&A
  • 31. 31 Next Steps Access Denodo Platform 8.0 in the Cloud. Start your Free Trial today! www.denodo.com/free-trials GET STARTED TODAY
  • 32. Thanks! www.denodo.com [email protected] © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.