openSAP bw4h2 Week 3 Unit 1 BIGDATAOVERVIEW Presentation

The document discusses modern trends in data management, including combining enterprise data with big data from sources like social media. It also covers challenges with enterprise data projects and how SAP Data Hub can help address these challenges by connecting different data sources and orchestrating data processes.

Uploaded by

sahoosunit

Week 3: Modern Trends in Data Management

Unit 1: Overview
Overview
Covered in this unit

Content
▪ Big Data meets enterprise data
▪ Real world examples and challenges
▪ Overcoming the challenges with SAP Data Hub

© 2018 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 2


The modern data warehouse landscape

Data Warehouse
▪ Predominantly structured data from enterprise systems (ERP, CRM, HR, etc.)
▪ Standardized data models, harmonized data
▪ Supports decision making

Data Lake
▪ Typically massive amounts of “raw” and unstructured/non-relational data
▪ “New” data types – sensor, web, social media, devices, etc.
▪ Active archive for historical data

Integration of data warehouse and data lake is key

[Diagram: Analytics (Business Intelligence, Predictive, Planning) on top of the Data Warehouse and the Data Lake (Hadoop), which are fed via Streaming, Virtual Access, Batch (ETL), and Real-time integration from Data Sources (SAP, non-SAP, relational, non-relational, on-premise, cloud)]



Example – Combining enterprise and social media data

Example

Data Warehouse
▪ Combine refined Big Data with enterprise data and corporate master data
▪ Load or federate data into data warehouse

Cloud Storage
▪ Ingest data into cloud storage as landing zone
▪ Orchestrate and schedule all related processes

Data Lake
▪ Implement transformations and data pipelines
▪ Harmonize data structures and lookup of reference data

[Diagram: social media and communication sources (Facebook, Skype, Twitter, Google, YouTube, E-mail, …) feed Cloud Storage and the Data Lake (Hadoop/HDFS); pipeline operations JOIN, FILTER, CLEANSE, LOOK-UP, SCRIPT, MASK, ANONYMIZE, PARSE; results reach the Data Warehouse and its Master Data via EXTRACT or FEDERATE]

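The pipeline operations named on this slide (PARSE, CLEANSE, FILTER, ANONYMIZE, LOOK-UP) can be illustrated with a minimal sketch in plain Python. This is not SAP Data Hub code: the sample posts, the field names, the `PRODUCT_MASTER` table, and every function name below are hypothetical and chosen only for illustration.

```python
import hashlib
import json

# Hypothetical raw social media posts, as they might land in cloud storage.
RAW_POSTS = [
    '{"user": "alice@example.com", "text": "  Love the new X100 bike!  ", "product": "X100"}',
    '{"user": "bob@example.com", "text": "", "product": "X100"}',
    '{"user": "carol@example.com", "text": "Z9 helmet arrived late", "product": "Z9"}',
]

# Hypothetical corporate master data, keyed by product ID.
PRODUCT_MASTER = {"X100": {"name": "X100 Road Bike", "category": "Bikes"}}


def parse(line):
    """PARSE: turn one raw JSON line into a dictionary."""
    return json.loads(line)


def cleanse(post):
    """CLEANSE: strip surrounding whitespace from the post text."""
    return {**post, "text": post["text"].strip()}


def has_text(post):
    """FILTER predicate: keep only posts that still contain text."""
    return bool(post["text"])


def anonymize(post):
    """ANONYMIZE: replace the user ID with a truncated SHA-256 digest.

    Hashing alone is pseudonymization at best; it stands in here for
    whatever masking rule a real project would require.
    """
    digest = hashlib.sha256(post["user"].encode("utf-8")).hexdigest()[:12]
    return {**post, "user": digest}


def look_up(post, master):
    """LOOK-UP: enrich the post with the product name from master data."""
    ref = master.get(post["product"])
    return {**post, "product_name": ref["name"] if ref else None}


def run_pipeline(lines, master):
    """Chain the steps lazily; only the final list is materialized."""
    posts = (parse(line) for line in lines)
    posts = (cleanse(post) for post in posts)
    posts = (post for post in posts if has_text(post))
    posts = (anonymize(post) for post in posts)
    return [look_up(post, master) for post in posts]


if __name__ == "__main__":
    for row in run_pipeline(RAW_POSTS, PRODUCT_MASTER):
        print(row)
```

Posts that reference a product missing from the master data are kept with `product_name` set to `None`, so the harmonization gap stays visible downstream instead of being silently dropped.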
Example – Enterprise data for the data scientist

Data scientist requires enterprise data
▪ How to provide consistent data?
▪ How to productize and standardize data provisioning to the data lake?

[Diagram: the Data Science Community (TensorFlow, R – statistical computing, Spark, OpenCV) works on the Data Lake (Hadoop, S3, GCP, Azure), which holds Kafka, Streams, Files, Videos, and Images; the Enterprise Data Warehouse provides data to the lake as .csv and .parquet files]

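One way to read "standardize data provisioning" is that every extract handed to the data lake follows a fixed, agreed schema. The sketch below writes warehouse rows to CSV text with a fixed column order and rejects rows that do not match. The schema, the column names, and the `export_csv` helper are assumptions made for illustration; a Parquet export would follow the same idea but needs a third-party library.

```python
import csv
import io

# Hypothetical agreed schema for a sales extract.
SALES_SCHEMA = ["order_id", "customer_id", "amount", "currency"]


def export_csv(rows, schema):
    """Serialize rows to CSV text, enforcing the agreed column set."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=schema, lineterminator="\n")
    writer.writeheader()
    for row in rows:
        if set(row) != set(schema):
            # Reject rows with missing or unexpected columns.
            raise ValueError(f"row does not match schema: {sorted(row)}")
        writer.writerow(row)
    return buffer.getvalue()


if __name__ == "__main__":
    rows = [
        {"order_id": 1, "customer_id": "C001", "amount": 99.5, "currency": "EUR"},
        {"order_id": 2, "customer_id": "C002", "amount": 12.0, "currency": "USD"},
    ]
    print(export_csv(rows, SALES_SCHEMA))
```

Failing fast on a schema mismatch keeps inconsistent extracts out of the lake, which speaks to the "consistent data" question on the slide.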
Challenges with data projects in an enterprise context

Data Lake
▪ Multitude of tools
▪ New tools emerging
▪ Handcrafting, scripting
▪ Lack of standards and automation

Data Warehouse
▪ Standardized tools
▪ Automated processes

How to combine both worlds without sacrificing the strengths in both areas?

[Diagram: "Programming & Scripting" contrasted with "Standards & Automation"; process steps Collect, Land, Transform, Present; component labels API Push, Cloud Storage, Spark, Python, Data Warehouse, Dashboard]

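The Collect, Land, Transform, Present sequence can be sketched as a list of named stages executed by a small runner that records the status of each stage. This is a generic illustration of orchestrating and monitoring handcrafted scripts, not a representation of any SAP product; the stage functions and sample values are hypothetical.

```python
from typing import Any, Callable, List, Tuple

Stage = Tuple[str, Callable[[Any], Any]]


def run_stages(stages: List[Stage], payload: Any):
    """Run the stages in order, logging each status; stop at the first failure."""
    log = []
    for name, step in stages:
        try:
            payload = step(payload)
            log.append((name, "ok"))
        except Exception as exc:
            log.append((name, f"failed: {exc}"))
            break
    return payload, log


def collect(_):
    """Collect: pretend an API pushed these raw string values."""
    return ["42", "17", "oops", "8"]


def land(records):
    """Land: store the raw records unchanged (here: copy the list)."""
    return list(records)


def transform(records):
    """Transform: keep numeric records only and convert them to integers."""
    return [int(r) for r in records if r.isdigit()]


def present(values):
    """Present: aggregate into the figures a dashboard would show."""
    return {"count": len(values), "total": sum(values)}


STAGES = [
    ("collect", collect),
    ("land", land),
    ("transform", transform),
    ("present", present),
]

if __name__ == "__main__":
    result, log = run_stages(STAGES, None)
    print(result)
    print(log)
```

Keeping the stage list as data separates what runs from how it is run and monitored, which is the kind of standardization the slide contrasts with one-off scripting.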


SAP Data Hub

SAP Data Hub is designed to
▪ Connect all types of data sources (enterprise systems, data lakes, etc.)
▪ Organize and manage all data assets
▪ Orchestrate and monitor data processes
▪ Integrate existing assets (e.g. Python scripts on data lake, process chains in SAP BW/4HANA, etc.)

[Diagram: SAP Data Hub (Data Discovery and Metadata Governance; Orchestration & Data Pipeline; Connectivity, Integration, Ingestion) sits between Existing Systems (SAP ERP, SAP S/4HANA, SAP BW/4HANA) and Distributed Data Systems (Hadoop, Cloud Storage, Machine Learning); further labels: HANA, Data Hub Runtime. Captions: Holistic data landscape; Flow-based data processing; Enterprise and 3rd party connectivity]

What you’ve learned in this unit

Key takeaways
▪ Combining Big Data and enterprise data is key to more and more organizations
▪ The lack of standards in the data lake world is a challenge to scalability
▪ SAP Data Hub bridges the gap between the flexibility of the data lake and the governance and standards of enterprise IT



Thank you.
Contact information:

[email protected]
Follow all of SAP

www.sap.com/contactsap

