© RED HAT, INC.
1
Red Hat Storage - Emerging Use Cases
Narendra N. Narang
Sr. Cloud Storage Solutions Architect
narnar@redhat.com
January 2016
© RED HAT, INC.
2
Agenda
Based on discussions, customer presentations and information, we now highlight some
emerging use cases for our software-defined-storage products:
● Use Case 1.: Historical Tick Data
● Use Case 2.: Analytics
● Use Case 3.: Storage for Network Function Virtualization (NFV)
● Use Case 4.: Storage for IoT, Edge Computing
● Future Use Cases
Use Case 1: Historical Tick Data
What is a Tick?
A “tick” is the minimum upward or downward movement (any change) in the price of a security
as measured over a period of time.
An "uptick" refers to a trade where the current transaction occurred at a price higher than the
Previous transaction and a "downtick" refers to a transaction that has occurred at a lower price
than the previous transaction. Consequently, a “zerotick” refers to a trade where the current
transaction occurred at a price higher than the previous transaction.
What is Tick Data?
Tick data is time series data containing price, volume and many other dimensions (bid/ask prices,
bid/ask sizes, quote time, trade time, exchange information) for each point of granularity.
Tick Data and Storage
The higher the resolution of tick data collected, the larger will be the dataset size and hence,
the amount of storage capacity required.
High-level Tick Data Workflow
Data Feed 1 Data Feed 2 Data Feed 3
Market Data Servers
(Aggregation of Feed Handlers)
Data Feed N
KDB+
In-memory
TickDB
(Real-time)
Tick
LogFile
Historical Tick Database
EndofDay(EOD)
Intraday
EOD data stored as a distinct Historical Database Partitioned Format “hdpf” file
for that day. This file is typically written as a large sequential stream of blocks.
News
Social
Media
Historical Tick Data on
Red Hat Gluster Storage
RHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
GLUSTERFSRHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
Data Feed 1 Data Feed 2 Data Feed 3
Market Data Servers
(Aggregation of Feed Handlers)
Data Feed N
Tick
LogFile
EOD
Intraday
EOD data stored as a distinct Historical Database Partitioned Format “hdpf” file
for that day. This file is typically written as a large sequential stream of blocks.
News
Social
Media
RHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
GLUSTERFSRHGS
NODE
RHGS
NODE
RHGS
NODE
RHGS
NODE
SITE A SITE B
Async
Geo-rep
Mathematical, Algorithmic
In-memory
KDB+
(Real-time)
Use Case 2.: Analytics
● Splunk on Red Hat Gluster Storage
● Hadoop typically employed to run batch analytics against data
residing in HDFS. Incidentally, Red Hat Gluster Storage
functions as an HCFS
● MR framework, clusters typically high throughput, many disks,
colocated data and compute
A better way...
● Employ the Spark core analytical processing engine
● Directly access data stored in Red Hat Gluster Storage.
Splunk on Red Hat Gluster Storage
Scale-out operational analytics built on affordable, industry-standard infrastructure
Splunk Storage Approaches
© RED HAT, INC.
9
Analytics on Red Hat Gluster Storage
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Spark
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Red Hat Gluster Storage Cloud
Kubernetes Orchestration for Docker Containers
Message Bus for Microservices
Figure1. Shared Storage
Spark
Spark
Spark
Spark
Spark
Spark
Spark
Spark
Spark
Spark
Spark
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Spark
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
SSD, HDD 15K, HDD, 7.2K (Self-healing,Tiering, Replicas, Geo-replication, Erasure Coding)
Kubernetes Orchestration for Docker Containers
Message Bus for Microservices
Figure2. Containerized Storage
Spark
Spark
Storage
Spark
Spark
Spark
Spark
Spark
Storage
Storage
Storage
© RED HAT, INC.
10
Key Benefits
● Containerize and orchestrate Spark computation instances
● Scale computation instances elastically and independently
● Affine key storage resources and elastically scale computation
microservices
● Run the same batch analytics with less MR shuffles
● Flexibility to run both batch and streaming analytics within
the same framework
● Ability to spill data over to disk or to an in-memory filesystem
e.g. Tachyon.
© RED HAT, INC.
11
Use Case 3.: Storage for Network Function Virtualization (NFV)
Moving beyond the business of Network Function Virtualization (NFV) by
implementing containers.
At a high-level NFV extends virtualization technology to network functions that
may subsequently be connected and organized via the concept of “service
chaining” to create an end-to-end network service.
Example: Instead of deploying physical load balancers or firewalls, employ the
use of virtual network functions, within containers, that may be orchestrated to
create a “chain of service” to deliver an end-to-end network service.
Leverage the ability to deploy network functions as microservices, that may be
orchestrated to scale elastically and on demand.
The infrastructure on which NFV functions and operates is the NFVi.
© RED HAT, INC.
12
Red Hat Storage for Containerized VNFs
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
App1
Storage
VNF1
Storage
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
Physical, Virtual, Cloud
Linux Kernel
RHEL Atomic
Container Engine
SSD, HDD 15K, HDD, 7.2K (Self-healing,Tiering, Replicas, Geo-replication, Erasure Coding)
App1
App1
App2App2
App2
Storage
VNF1
VNF2App2
App3
App3Storage
App3
VNF3
VNF2
VNF3App4
App4
StorageStorage
App5
VNF4
VNF4
StorageApp4
App5
App4Storage
Kubernetes Orchestration for Docker Containers
Message Bus for Microservices
© RED HAT, INC.
13
Key Benefits
● Orchestrate a true microservices architecture, where application, storage and network
services are delivered elastically and on-demand
● Architect containerized platform within the realm of OpenStack infrastructure or
independently
● Deliver end-to-end network functions to a multinenant environment at hyperscale
● Segregate network function from session “stateful” information. Store “state”
information on a scalable, performant distributed storage platform
● Virtual network functions are now rendered stateless and may be scaled independently
● Distributed storage delivers the performance, resiliency and enterprise features like
tiering, replicas, snapshots, quotas, geo-replication and self-healing
● Choice of block or file implementations based on Red Hat Ceph Storage or
Red Hat Gluster Storage
● Operate within more determenistic parameters and with increased cost and performance
efficiencies.
© RED HAT, INC.
14
Use Case 4.: Storage for IoT, Edge Computing
The Internet of Things (IoT) - is a network of physical objects embedded with electronics,
software, sensors, and network connectivity, which enables these objects to collect and
exchange data.
IDC Predictions:
● By 2018, 20% of all IoT intelligent
gateways will have “container
technology” for packaging IoT
application code in a thin
containerized environment thus
accelerating IoT microservices
● By 2019, 45% of IoT-created data will
be stored, processed, analyzed and
acted upon close to, or at the edge, of
the network
Sources: http://event.lvl3.on24.com/event/10/38/69/4/rt/1/documents/resourceList1444759978563/idc_iot_futurescape_wc.pdf
http://www.idc.com/getdoc.jsp?containerId=prUS25291514
Edge Computing – Computation and/or analysis of data is performed at the edge devices
of a network rather than at a centralized location.
© RED HAT, INC.
15
The Broader Implications
● Within three years, 50% of
IT networks will transition
from having excess
capacity to handle the
additional IoT devices to
being network constrained
with nearly 10% of sites
being overwhelmed
● By 2017, 90% of
datacenter and enterprise
systems management will
rapidly adopt new business
models to manage
non-traditional
infrastructure and BYOD
device categories
● Within five years all
industries will have rolled
out IoT initiatives
Source: http://www.idc.com/Predictions2015.
© RED HAT, INC.
16
Newer Architectures, Hardware & Software
Implementations, Containerization
NEW INSIGHTS
NEW VALUE (IoT)
DATA COLLECTION & STORAGE
Not just a capacity issue any more!
It’s about enabling the edge nodes to filter, index
and compute the data stream. Equipping the edge
with sufficient computation and the appropriate
storage combination for both latency sensitive and
cold/archival storage.
Questions that really need answering:
* How much data actually needs to be stored?
* How much of the data is transient?
* How much data needs to be stored, but can’t be stored owing to cost/capacity constraints?
* Within cold storage what are access patterns and appropriate mediums?
NEW
REVENUE
Value
© RED HAT, INC.
17
Key Benefits
● Improved QoS and reduced latency of data analytics
● Less data traversing networks
● Higher resiliency since it’s pushed to redundant components at edge of
network
● Employ better “swarm intelligence” for diffusion of data across networks
● Optimized data placement for higher energy efficiency
● Agility in infrastructure with containerization
● Higher cost efficiency based on commoditized and open source
implementations.
© RED HAT, INC.
18
Stay tuned...
● Database Workloads on Ceph
● Latency sensitive, high IOPS workloads on Ceph RBD
● CephFS workloads in production
● Ceph iSCSI target implementation with HA gateways
● Hyperconvergent architectures
● Containerization of storage services
● Support for ARM processors in newer architectures
© RED HAT, INC.
19
Driving Value in a Hyperscale Model
IMHO, some predictions for incremental gains at hyperscale:
* Increased prevalence and dominance of open-source software-defined-storage technology
* Proliferation of containerization for heavy densities, availability and elasticity of microservices
delivery in the cloud
* Implementation of all-flash technologies and tiering for iops intensive and mixed workloads
* Increased prominence of ARM processors in scale-out architectures
* Use of key-value drives.
It will become imperative to leverage a combination of these technologies to remain competitive
and to maintain cost and operational efficiency.

More Related Content

PDF
Scalable POSIX File Systems in the Cloud
PPTX
Implementation of Dense Storage Utilizing HDDs with SSDs and PCIe Flash Acc...
PDF
Red Hat Storage for Mere Mortals
PPT
Containerized Storage
PPTX
Why Software-Defined Storage Matters
PDF
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
PPTX
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
PDF
Seagate Implementation of Dense Storage Utilizing HDDs and SSDs
Scalable POSIX File Systems in the Cloud
Implementation of Dense Storage Utilizing HDDs with SSDs and PCIe Flash Acc...
Red Hat Storage for Mere Mortals
Containerized Storage
Why Software-Defined Storage Matters
Red Hat Storage Day New York - Red Hat Gluster Storage: Historical Tick Data ...
Red Hat Storage Day Atlanta - Red Hat Gluster Storage vs. Traditional Storage...
Seagate Implementation of Dense Storage Utilizing HDDs and SSDs

What's hot (20)

PDF
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
PPTX
Red Hat Storage Day Dallas - Defiance of the Appliance
PDF
Red Hat Storage Day Boston - Persistent Storage for Containers
PPTX
Red Hat Storage Day Dallas - Why Software-defined Storage Matters
PPTX
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
PPTX
Architecting Ceph Solutions
PPTX
Red Hat Storage Day LA - Performance and Sizing Software Defined Storage
PDF
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
PDF
Red Hat Storage Day Dallas - Storage for OpenShift Containers
PDF
Red Hat Ceph Storage Roadmap: January 2016
PPTX
Red Hat Storage Day Seattle: Why Software-Defined Storage Matters
PPTX
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
PDF
Red Hat Storage Day Boston - OpenStack + Ceph Storage
PPTX
Red Hat Storage Day Boston - Supermicro Super Storage
PDF
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
PPTX
Red Hat Storage Day Seattle: Persistent Storage for Containerized Applications
PPTX
Why Software-Defined Storage Matters
PPTX
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
PDF
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
PPTX
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Boston - Persistent Storage for Containers
Red Hat Storage Day Dallas - Why Software-defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Architecting Ceph Solutions
Red Hat Storage Day LA - Performance and Sizing Software Defined Storage
Red hat Storage Day LA - Designing Ceph Clusters Using Intel-Based Hardware
Red Hat Storage Day Dallas - Storage for OpenShift Containers
Red Hat Ceph Storage Roadmap: January 2016
Red Hat Storage Day Seattle: Why Software-Defined Storage Matters
Red Hat Storage Day Dallas - Gluster Storage in Containerized Application
Red Hat Storage Day Boston - OpenStack + Ceph Storage
Red Hat Storage Day Boston - Supermicro Super Storage
The Future of Cloud Software Defined Storage with Ceph: Andrew Hatfield, Red Hat
Red Hat Storage Day Seattle: Persistent Storage for Containerized Applications
Why Software-Defined Storage Matters
Red Hat Storage Day Seattle: Stretching A Gluster Cluster for Resilient Messa...
Red Hat Storage Day New York - QCT: Avoid the mess, deploy with a validated s...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Ad

Viewers also liked (12)

PDF
Red Hat Forum London 2014 - Delivering Innovation at Speed, A JBoss Perspective
PDF
Samsung and the Path to Open Source Leadership
PPTX
Developing OSS Leadership (LinuxCon NA - 2014)
PDF
Samsung & The Path to Open Source Leadership
PDF
Kubernetes Scaling SIG (K8Scale)
PDF
SAMSUNG SDI In Japan
PDF
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
PDF
MapR アーキテクチャ概要 - MapR CTO Meetup 2013/11/12
PDF
2016 Flash Storage-NVMe Brand Leader Mini-Report
PDF
Cloud Storage: The Next 40 Years
PPTX
Samsung Business Strategy
DOCX
samsung strategy
Red Hat Forum London 2014 - Delivering Innovation at Speed, A JBoss Perspective
Samsung and the Path to Open Source Leadership
Developing OSS Leadership (LinuxCon NA - 2014)
Samsung & The Path to Open Source Leadership
Kubernetes Scaling SIG (K8Scale)
SAMSUNG SDI In Japan
Red Hat Storage Day New York -Performance Intensive Workloads with Samsung NV...
MapR アーキテクチャ概要 - MapR CTO Meetup 2013/11/12
2016 Flash Storage-NVMe Brand Leader Mini-Report
Cloud Storage: The Next 40 Years
Samsung Business Strategy
samsung strategy
Ad

Similar to Red Hat Storage: Emerging Use Cases (20)

PDF
Red hat storage el almacenamiento disruptivo
PDF
HPE Solutions for Challenges in AI and Big Data
PDF
Saviak lviv ai-2019-e-mail (1)
PPTX
SDN, OpenFlow, NFV, and Virtual Network
PDF
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
PDF
[OpenStack Day in Korea 2015] Keynote 2 - Leveraging OpenStack to Realize the...
PDF
MinIO January 2020 Briefing
PDF
Red Hat Storage 2014 - Product(s) Overview
PPTX
How Open Source Will Change How You Think about Storage - LGI Tech Summit
PDF
Evolution from EDA to Data Mesh: Data in Motion
PPT
MongoDB Sharding Webinar 2014
PDF
optimizing_ceph_flash
PDF
HKG15-The Machine: A new kind of computer- Keynote by Dejan Milojicic
PPT
Systore07 V4
PDF
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
PDF
Machine Learning and Artificial Intelligence
PDF
OpenStack NFV Edge computing for IOT microservices
PPT
AWS Summit Berlin 2013 - Big Data Analytics
PDF
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
PPTX
TierraCloud HC2 Customer Presentation
Red hat storage el almacenamiento disruptivo
HPE Solutions for Challenges in AI and Big Data
Saviak lviv ai-2019-e-mail (1)
SDN, OpenFlow, NFV, and Virtual Network
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
[OpenStack Day in Korea 2015] Keynote 2 - Leveraging OpenStack to Realize the...
MinIO January 2020 Briefing
Red Hat Storage 2014 - Product(s) Overview
How Open Source Will Change How You Think about Storage - LGI Tech Summit
Evolution from EDA to Data Mesh: Data in Motion
MongoDB Sharding Webinar 2014
optimizing_ceph_flash
HKG15-The Machine: A new kind of computer- Keynote by Dejan Milojicic
Systore07 V4
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
Machine Learning and Artificial Intelligence
OpenStack NFV Edge computing for IOT microservices
AWS Summit Berlin 2013 - Big Data Analytics
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
TierraCloud HC2 Customer Presentation

More from Red_Hat_Storage (14)

PPTX
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
PPTX
Red Hat Storage Day Boston - Why Software-defined Storage Matters
PPTX
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
PDF
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
PDF
Red Hat Storage Day - When the Ceph Hits the Fan
PDF
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
PDF
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
PDF
Red Hat Storage Day New York - New Reference Architectures
PDF
Red Hat Storage Day New York - Persistent Storage for Containers
PDF
Red Hat Storage Day New York - Welcome Remarks
PDF
Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
PPTX
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
PPTX
Storage: Limitations, Frustrations, and Coping with Future Needs
PDF
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...
Red Hat Storage Day Dallas - Red Hat Ceph Storage Acceleration Utilizing Flas...
Red Hat Storage Day Boston - Why Software-defined Storage Matters
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Storage Day Boston - Red Hat Gluster Storage vs. Traditional Storage ...
Red Hat Storage Day - When the Ceph Hits the Fan
Red Hat Storage Day New York - Penguin Computing Spotlight: Delivering Open S...
Red Hat Storage Day New York - Intel Unlocking Big Data Infrastructure Effici...
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - Persistent Storage for Containers
Red Hat Storage Day New York - Welcome Remarks
Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day Seattle: Stabilizing Petabyte Ceph Cluster in OpenStack C...
Storage: Limitations, Frustrations, and Coping with Future Needs
Red Hat Storage Day Atlanta - Designing Ceph Clusters Using Intel-Based Hardw...

Recently uploaded (20)

PPT
Geologic Time for studying geology for geologist
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Architecture types and enterprise applications.pdf
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPT
What is a Computer? Input Devices /output devices
PPTX
Build Your First AI Agent with UiPath.pptx
PDF
CloudStack 4.21: First Look Webinar slides
PDF
The influence of sentiment analysis in enhancing early warning system model f...
PDF
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
PDF
Enhancing plagiarism detection using data pre-processing and machine learning...
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPT
Module 1.ppt Iot fundamentals and Architecture
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
Configure Apache Mutual Authentication
PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PPTX
Microsoft Excel 365/2024 Beginner's training
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PPTX
The various Industrial Revolutions .pptx
PDF
Five Habits of High-Impact Board Members
Geologic Time for studying geology for geologist
NewMind AI Weekly Chronicles – August ’25 Week III
Architecture types and enterprise applications.pdf
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
What is a Computer? Input Devices /output devices
Build Your First AI Agent with UiPath.pptx
CloudStack 4.21: First Look Webinar slides
The influence of sentiment analysis in enhancing early warning system model f...
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
Enhancing plagiarism detection using data pre-processing and machine learning...
A review of recent deep learning applications in wood surface defect identifi...
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Module 1.ppt Iot fundamentals and Architecture
Final SEM Unit 1 for mit wpu at pune .pptx
Configure Apache Mutual Authentication
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Microsoft Excel 365/2024 Beginner's training
sbt 2.0: go big (Scala Days 2025 edition)
The various Industrial Revolutions .pptx
Five Habits of High-Impact Board Members

Red Hat Storage: Emerging Use Cases

  • 1. © RED HAT, INC. 1 Red Hat Storage - Emerging Use Cases Narendra N. Narang Sr. Cloud Storage Solutions Architect [email protected] January 2016
  • 2. © RED HAT, INC. 2 Agenda Based on discussions, customer presentations and information, we now highlight some emerging use cases for our software-defined-storage products: ● Use Case 1.: Historical Tick Data ● Use Case 2.: Analytics ● Use Case 3.: Storage for Network Function Virtualization (NFV) ● Use Case 4.: Storage for IoT, Edge Computing ● Future Use Cases
  • 3. Use Case 1: Historical Tick Data What is a Tick? A “tick” is the minimum upward or downward movement (any change) in the price of a security as measured over a period of time. An "uptick" refers to a trade where the current transaction occurred at a price higher than the Previous transaction and a "downtick" refers to a transaction that has occurred at a lower price than the previous transaction. Consequently, a “zerotick” refers to a trade where the current transaction occurred at a price higher than the previous transaction. What is Tick Data? Tick data is time series data containing price, volume and many other dimensions (bid/ask prices, bid/ask sizes, quote time, trade time, exchange information) for each point of granularity. Tick Data and Storage The higher the resolution of tick data collected, the larger will be the dataset size and hence, the amount of storage capacity required.
  • 4. High-level Tick Data Workflow Data Feed 1 Data Feed 2 Data Feed 3 Market Data Servers (Aggregation of Feed Handlers) Data Feed N KDB+ In-memory TickDB (Real-time) Tick LogFile Historical Tick Database EndofDay(EOD) Intraday EOD data stored as a distinct Historical Database Partitioned Format “hdpf” file for that day. This file is typically written as a large sequential stream of blocks. News Social Media
  • 5. Historical Tick Data on Red Hat Gluster Storage RHGS NODE RHGS NODE RHGS NODE RHGS NODE RHGS NODE RHGS NODE GLUSTERFSRHGS NODE RHGS NODE RHGS NODE RHGS NODE Data Feed 1 Data Feed 2 Data Feed 3 Market Data Servers (Aggregation of Feed Handlers) Data Feed N Tick LogFile EOD Intraday EOD data stored as a distinct Historical Database Partitioned Format “hdpf” file for that day. This file is typically written as a large sequential stream of blocks. News Social Media RHGS NODE RHGS NODE RHGS NODE RHGS NODE RHGS NODE RHGS NODE GLUSTERFSRHGS NODE RHGS NODE RHGS NODE RHGS NODE SITE A SITE B Async Geo-rep Mathematical, Algorithmic In-memory KDB+ (Real-time)
  • 6. Use Case 2.: Analytics ● Splunk on Red Hat Gluster Storage ● Hadoop typically employed to run batch analytics against data residing in HDFS. Incidentally, Red Hat Gluster Storage functions as an HCFS ● MR framework, clusters typically high throughput, many disks, colocated data and compute A better way... ● Employ the Spark core analytical processing engine ● Directly access data stored in Red Hat Gluster Storage.
  • 7. Splunk on Red Hat Gluster Storage Scale-out operational analytics built on affordable, industry-standard infrastructure
  • 9. © RED HAT, INC. 9 Analytics on Red Hat Gluster Storage Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Spark Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Red Hat Gluster Storage Cloud Kubernetes Orchestration for Docker Containers Message Bus for Microservices Figure1. Shared Storage Spark Spark Spark Spark Spark Spark Spark Spark Spark Spark Spark Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Spark Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine SSD, HDD 15K, HDD, 7.2K (Self-healing,Tiering, Replicas, Geo-replication, Erasure Coding) Kubernetes Orchestration for Docker Containers Message Bus for Microservices Figure2. Containerized Storage Spark Spark Storage Spark Spark Spark Spark Spark Storage Storage Storage
  • 10. © RED HAT, INC. 10 Key Benefits ● Containerize and orchestrate Spark computation instances ● Scale computation instances elastically and independently ● Affine key storage resources and elastically scale computation microservices ● Run the same batch analytics with less MR shuffles ● Flexibility to run both batch and streaming analytics within the same framework ● Ability to spill data over to disk or to an in-memory filesystem e.g. Tachyon.
  • 11. © RED HAT, INC. 11 Use Case 3.: Storage for Network Function Virtualization (NFV) Moving beyond the business of Network Function Virtualization (NFV) by implementing containers. At a high-level NFV extends virtualization technology to network functions that may subsequently be connected and organized via the concept of “service chaining” to create an end-to-end network service. Example: Instead of deploying physical load balancers or firewalls, employ the use of virtual network functions, within containers, that may be orchestrated to create a “chain of service” to deliver an end-to-end network service. Leverage the ability to deploy network functions as microservices, that may be orchestrated to scale elastically and on demand. The infrastructure on which NFV functions and operates is the NFVi.
  • 12. © RED HAT, INC. 12 Red Hat Storage for Containerized VNFs Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine App1 Storage VNF1 Storage Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine Physical, Virtual, Cloud Linux Kernel RHEL Atomic Container Engine SSD, HDD 15K, HDD, 7.2K (Self-healing,Tiering, Replicas, Geo-replication, Erasure Coding) App1 App1 App2App2 App2 Storage VNF1 VNF2App2 App3 App3Storage App3 VNF3 VNF2 VNF3App4 App4 StorageStorage App5 VNF4 VNF4 StorageApp4 App5 App4Storage Kubernetes Orchestration for Docker Containers Message Bus for Microservices
  • 13. © RED HAT, INC. 13 Key Benefits ● Orchestrate a true microservices architecture, where application, storage and network services are delivered elastically and on-demand ● Architect containerized platform within the realm of OpenStack infrastructure or independently ● Deliver end-to-end network functions to a multinenant environment at hyperscale ● Segregate network function from session “stateful” information. Store “state” information on a scalable, performant distributed storage platform ● Virtual network functions are now rendered stateless and may be scaled independently ● Distributed storage delivers the performance, resiliency and enterprise features like tiering, replicas, snapshots, quotas, geo-replication and self-healing ● Choice of block or file implementations based on Red Hat Ceph Storage or Red Hat Gluster Storage ● Operate within more determenistic parameters and with increased cost and performance efficiencies.
  • 14. © RED HAT, INC. 14 Use Case 4.: Storage for IoT, Edge Computing The Internet of Things (IoT) - is a network of physical objects embedded with electronics, software, sensors, and network connectivity, which enables these objects to collect and exchange data. IDC Predictions: ● By 2018, 20% of all IoT intelligent gateways will have “container technology” for packaging IoT application code in a thin containerized environment thus accelerating IoT microservices ● By 2019, 45% of IoT-created data will be stored, processed, analyzed and acted upon close to, or at the edge, of the network Sources: http://event.lvl3.on24.com/event/10/38/69/4/rt/1/documents/resourceList1444759978563/idc_iot_futurescape_wc.pdf http://www.idc.com/getdoc.jsp?containerId=prUS25291514 Edge Computing – Computation and/or analysis of data is performed at the edge devices of a network rather than at a centralized location.
  • 15. © RED HAT, INC. 15 The Broader Implications ● Within three years, 50% of IT networks will transition from having excess capacity to handle the additional IoT devices to being network constrained with nearly 10% of sites being overwhelmed ● By 2017, 90% of datacenter and enterprise systems management will rapidly adopt new business models to manage non-traditional infrastructure and BYOD device categories ● Within five years all industries will have rolled out IoT initiatives Source: http://www.idc.com/Predictions2015.
  • 16. © RED HAT, INC. 16 Newer Architectures, Hardware & Software Implementations, Containerization NEW INSIGHTS NEW VALUE (IoT) DATA COLLECTION & STORAGE Not just a capacity issue any more! It’s about enabling the edge nodes to filter, index and compute the data stream. Equipping the edge with sufficient computation and the appropriate storage combination for both latency sensitive and cold/archival storage. Questions that really need answering: * How much data actually needs to be stored? * How much of the data is transient? * How much data needs to be stored, but can’t be stored owing to cost/capacity constraints? * Within cold storage what are access patterns and appropriate mediums? NEW REVENUE Value
  • 17. © RED HAT, INC. 17 Key Benefits ● Improved QoS and reduced latency of data analytics ● Less data traversing networks ● Higher resiliency since it’s pushed to redundant components at edge of network ● Employ better “swarm intelligence” for diffusion of data across networks ● Optimized data placement for higher energy efficiency ● Agility in infrastructure with containerization ● Higher cost efficiency based on commoditized and open source implementations.
  • 18. © RED HAT, INC. 18 Stay tuned... ● Database Workloads on Ceph ● Latency sensitive, high IOPS workloads on Ceph RBD ● CephFS workloads in production ● Ceph iSCSI target implementation with HA gateways ● Hyperconvergent architectures ● Containerization of storage services ● Support for ARM processors in newer architectures
  • 19. © RED HAT, INC. 19 Driving Value in a Hyperscale Model IMHO, some predictions for incremental gains at hyperscale: * Increased prevalence and dominance of open-source software-defined-storage technology * Proliferation of containerization for heavy densities, availability and elasticity of microservices delivery in the cloud * Implementation of all-flash technologies and tiering for iops intensive and mixed workloads * Increased prominence of ARM processors in scale-out architectures * Use of key-value drives. It will become imperative to leverage a combination of these technologies to remain competitive and to maintain cost and operational efficiency.