0% found this document useful (0 votes)
92 views74 pages

Databases On AWS: Raul Hugo, Solutions Architect

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
92 views74 pages

Databases On AWS: Raul Hugo, Solutions Architect

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 74

Databases on AWS

Raul Hugo, Solutions Architect


Nov 2019
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Agenda
• AWS Database Services
• Traditional vs AWS Data services model
• Amazon RDS
• Amazon Redshift
• Amazon DynamoDB
• Amazon ElastiCache
• Amazon Neptune
• Amazon DocumentDB
• Amazon QLDB
• Amazon Timestream

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Database Services
Managed Relational Petabyte-scale Data In-Memory Key
Database Service Warehouse Value Store
Amazon Amazon
Amazon RDS Redshift Elasticache

Fully Managed Key- Fully Managed Cloud-Native


Value and Graph Database Relational Database
Amazon Document Database Amazon Amazon
DynamoDB Neptune Aurora

)
e w] e w] GA
e vi e vi ow
r r (N
[P [P MongoDB
Fully Managed Time Fully Managed Compatible
Series Database Ledger Database Document Database
Amazon Amazon Amazon
Timestream QLDB DocumentDB

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Traditional Database Architecture

Client Tier

one database
for all App/Web Tier
workloads
RDBMS

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Traditional Database Architecture

Client Tier
Key-value access
Complex queries
App/Web Tier
OLAP transactions
Analytics
RDBMS
All forced into the
relational database

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Data Tier Architecture
Client Tier
On AWS choose best
database service for
each workload App/Web Tier

Data Tier

Cache Data Warehouse Time Series Blob Store

NoSQL RDBMS Quantum Ledger Search

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Workload Driven Data Store Selection
hot reads analytics
logging
NoSQL complex queries Periodic rich search
simple query & transactions data
Graph / Key Value / Document Untampered
data

Data Tier

Cache Data Warehouse Time Series Blob Store

NoSQL RDBMS Quantum Ledger Search

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Database Services for the Data Tier
hot reads analytics
logging
NoSQL complex queries Periodic rich search
simple query & transactions data
Graph / Key Value / Document Untampered
data

Data Tier
Amazon Amazon Amazon
Amazon S3
ElastiCache Redshift Timestream

Amazon DynamoDB Amazon Amazon Amazon


Neptune / DocumentDB RDS QLDB ElasticSearch

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon RDS
Managed relational database service with a choice of popular database engines

Easy to administer Performant & scalable Available & durable Secure and compliant

Easily deploy and maintain Scale compute Automatic Multi-AZ data Data encryption at rest and in
hardware, OS and DB and storage with a few clicks; replication; automated transit; industry compliance
software; built-in monitoring minimal downtime for your backup, snapshots, and and assurance programs
application failover

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
If you host your databases on-premises…
App optimization
Scaling
High availability
Database backups
DB s/w patches
DB s/w installs
OS patches
OS installation
Server maintenance
Rack & stack
Power, HVAC, net

you

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
If you host your databases in Amazon EC2…
App optimization
Scaling
High availability
Database backups
DB s/w patches
DB s/w installs
OS patches
OS installation OS installation
Server maintenance Server maintenance
Rack & stack Rack & stack
Power, HVAC, net Power, HVAC, net

you

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
If you choose Amazon RDS…
App optimization
Scaling Scaling
High availability High availability
Database backups Database backups
DB s/w patches DB s/w patches
DB s/w installs DB s/w installs
OS patches OS patches
OS installation OS installation
Server maintenance Server maintenance
Rack & stack Rack & stack
Power, HVAC, net Power, HVAC, net

you

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Key Amazon RDS Features
Multi-AZ
Amazon RDS Improve Increase Reduce
Configuration Availability Throughput Latency
Push-Button Scaling
Multi AZ
Read Replicas availability availability
zone zone
Provisioned IOPS Region

Push-Button Scaling Read Replicas Provisioned IOPS

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Aurora
MySQL and PostgreSQL compatible relational database built for the cloud
Performance and availability of commercial-grade databases at 1/10th the cost

Performance Availability Highly Fully


& scalability & durability secure managed
5x throughput of standard Fault-tolerant, self-healing Network isolation, Managed by RDS: no
MySQL and 3x of standard storage; six copies of encryption at hardware provisioning,
PostgreSQL; scale-out up data across three AZs; rest/transit software patching, setup,
to15 read replicas continuous backup to S3 configuration, or backups

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Scale-out, distributed, multi-tenant architecture
Master Replica Replica
• Purpose-built log-structured
distributed storage system SQL SQL SQL
designed for databases Transactions
Transactions Transactions

Caching Caching Caching


• Storage volume is striped across
hundreds of storage nodes
distributed over 3 different
Availability Zones Shared storage volume

• Six copies of data, two copies in


each Availability Zone to protect
against AZ+1 failures
Storage nodes with SSDs
• Master and replicas all point to the
same storage Availability Availability Availability
Zone 1 Zone 2 Zone 3

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Aurora MySQL performance
WRITE PERFORMANCE READ PERFORMANCE

250000 700000

600000
200000
500000
150000
400000

100000 300000

200000
50000
100000
0
0

MySQL SysBench results; R4.16XL: 64cores / 488 GB RAM Aurora MySQL 5.6

Aurora read write throughput compared to MySQL 5.6


based on industry standard benchmarks.

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Aurora PostgreSQL performance
While running pgbench at load, throughput is 3x more consistent than
PostgreSQL
pgbench throughput over time, 150 GiB, 1024 clients
45000

40000

35000

30000
Throughput, tps

25000

20000

15000

10000

5000

0
10 15 20 25 30 35 40 45 50 55 60
Minutes
PostgreSQL (Single AZ) Amazon Aurora (Three AZs)

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Everything you get from Amazon RDS…
App optimization App optimization App optimization
Scaling Scaling Scaling
High availability High availability High availability
Database backups Database backups Database backups
DB software patches DB software patches DB software patches
Managed DB software installs DB software installs DB software installs Managed
by you OS patches OS patches OS patches by AWS
OS installation OS installation OS installation
Server maintenance Server maintenance Server maintenance
Rack and stack Rack and stack Rack and stack
Power, HVAC, net Power, HVAC, net Power, HVAC, net

Database on-premises Database on EC2 Amazon RDS

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
…and more

up to 64 TB

Up to 64TB of storage – auto-incremented in 10GB units

§ Automatic storage scaling up to 64 TB—no performance impact

§ Continuous, incremental backups to Amazon S3

§ Instantly create user snapshots—no performance impact

§ Automatic restriping, mirror repair, hot spot management, encryption

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Database backtrack
Invisible
t4
Invisible
t2 t3
Rewind to t3
t0 t1
Rewind to t1

t0 t1 t2 t3 t4

Backtrack brings the database to a point in time without requiring restore from backups
• Backtracking from an unintentional DML or DDL operation
• Backtrack is not destructive. You can backtrack multiple times to find the right point in time

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How does backtrack work?
SEGMENT LOG
SNAPSHOT RECORDS

SEGMENT 1

SEGMENT 2

SEGMENT 3

RECOVERY TIME
POINT

We keep periodic snapshot of each segment; we also preserve the redo logs
For backtrack, we identify the appropriate segment snapshots
Apply log streams to segment snapshots in parallel and asynchronously
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Zero downtime patching
Storage Service

Net App
Old DB
Before ZDP

state state
Engine

Net App
state state New DB
User sessions terminate Engine
during patching

Storage Service
With ZDP

Old DB
Engine

Application
Networking

state
state

New DB
User sessions remain Engine
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
active through patching
Fast database cloning BENCHMARKS

Clone database without copying data


DEV/TEST
§ Creation of a clone is nearly APPLICATIONS CLONE
instantaneous
§ Data copy happens only on write –
when original and cloned volume data
differ
CLONE CLONE

Example use cases PRODUCTION PRODUCTION


APPLICATIONS APPLICATIONS
§ Clone a production DB to run tests
§ Reorganize a database
§ Save a point in time snapshot for
analysis without impacting production PRODUCTION DATABASE

system.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
(Preview)
Aurora Multi-Master
First relational database service with scale-out reads and writes
across multiple data centers

Scale out both reads and writes

Zero application downtime from ANY instance failure

Zero application downtime from ANY AZ failure

Faster write performance and higher scale

Shared distributed storage volume


Sign up for single-region multi-master preview today;
Availability Availability Availability multi-region multi-master coming in 2019
Zone 1 Zone 2 Zone 3

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Global database
Faster disaster recovery and enhanced data locality

Promote read-replica to a master


for faster recovery in the event
of disaster

Bring data close to your


customer’s applications in
different regions

Promote to a master for easy


migration
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Global database – physical replication
Region 1: Primary Aurora Cluster Region 2: Read Replica
AZ 1 AZ 2 AZ 3 AZ 1

Aurora
Primary Aurora Aurora

Replication Server

Replication Agent
Replica
Instance Replica Replica
(optional)

Async.

Consistently fast, low-lag, high-performance replication for global relational databases


• Global-scale replication in seconds or less
• Dedicated replication infrastructure ensures unconstrained performance
• Local reads, faster recovery, tighter DR objectives, and seamless cross-region migration

TYPE OF WRITE
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
REDO LOG FRM FILES
Aurora Serverless
On-demand, auto-scaling database for applications with variable workloads

Application

Starts up on demand, shuts down when not in use


Database Endpoint

Automatically scales with no instances to manage


Scalable Database Capacity
(Compute + Memory) Warm Capacity
Pool
Pay per second for the database capacity you use

Shared Distributed Storage

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance Insights for Aurora
Analyze and troubleshoot your database performance

• Supports PostgreSQL and MySQL

• Expands on existing Amazon RDS


monitoring features to analyze
issues and performance

• Easy bottleneck identification –


keep track of performance metrics
such as high CPU consumption,
lock waits, I/O latency, and SQL
statements

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
for as low as
$934/TB per year

Petabyte scale
Massively parallel
Amazon Columnar Store
Redshift

Relational data warehouse


Fully managed = no admin

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Redshift – Data Warehousing
Fast, powerful, and simple data warehousing at 1/10 the cost
Massively parallel, petabyte scale

Fast Inexpensive Scalable Secure

$
Columnar storage As low as $1000 per Resize your cluster up Data encrypted at rest
technology to improve I/O terabyte per year, and down as your and transit. Isolate
efficiency and parallelize 1/10th the cost of performance and clusters with VPC.
queries. Data load scales traditional data capacity needs Manage your own keys
linearly. warehouse solutions change with KMS

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Redshift cluster architecture
Massively parallel, shared nothing architecture
Streaming Backup/Restore from S3 JDBC/ODBC

Redshift Cluster
Leader node
Leader Node
• SQL endpoint
• Stores metadata
• Coordinates parallel SQL processing
Compute Nodes
Compute nodes
• Local, columnar storage
• Executes queries in parallel Efficient Data Loads
• Load, backup, restore Streaming Backup/Restore

• 2, 16, or 32 slices

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Redshift Spectrum
Run SQL queries directly against data in S3 using thousands of nodes

High concurrency: Multiple No ETL: Query data in-place Full Amazon Redshift SQL
clusters access same data using open file formats support

S3 SQL

Fast at exabyte scale Elastic and highly available On-demand, pay-per-query

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
NoSQL database
Seamless scalability
Zero admin
Amazon
DynamoDB
Single-digit millisecond latency
Multi-Master
Multi-Region
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon DynamoDB

Highly available
Fully managed Consistently fast at any scale and durable

Integrates with AWS Lambda,


Secure Cost-effective
Amazon Redshift, and more
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Highly available and durable

Designed to support
Built for high durability 99.99%
of availability

WRITES READS

3-way replication Strongly or eventually consistent


Persisted to disk No latency trade-off
(Custom SSD) Data is always replicated to
three Availability Zones

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Highly available and durable
3-way replication
OrderId: 1
CustomerId: 1 Data is always replicated to
ASIN: [B00X4WHP5E] three Availability Zones

Hash(1) = 7B

Availability Zone A Availability Zone B Availability Zone C

Partition A Partition B Partition C Partition A Partition B Partition C Partition


PartitionAA Partition B Partition C

Host 1 Host 2 Host 3 Host 4 Host 5 Host 6 Host 7 Host 8 Host 9

CustomerOrdersTable

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Backup and restore
The only cloud database to provide on-demand and continuous backups

On-demand Point in time restore Point in time recovery with


backups for long- for short term restore times in a few hours
term data archival retention and data depending on table size
and compliance corruption protection
(35 days)

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Global Tables
The first fully-managed, multi-master, multi-region database

Globally dispersed users

Build high performance, globally distributed applications

Low latency reads & writes to locally available tables

Disaster proof with multi-region redundancy

Easy to setup and no application re-writes required

Global Table

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DynamoDB On-Demand
Features
• No capacity planning, provisioning, or
reservations–simply make API calls

• Pay only for the reads and writes you


perform

Key benefits
• Eliminates tradeoffs of over- or under-
provisioning

• Instantly accommodates your workload


as traffic ramps up or down

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Capacity managed for you

Govern max consumption

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DynamoDB Accelerator (DAX)
High performance

Your applications Fully managed, highly


available cache for DynamoDB

Even faster—
DAX microsecond latency

Scales to millions of
requests per second

DynamoDB API compatible

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fully managed auto scaling

Automated
scaling policies
$$$ Savings Scales up when
you need it
Scales down when
you don’t
Scheduled
auto scaling

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
NoSQL vs. SQL for a new app: how to choose?
Want simplest possible DB Need joins, transactions, frequent
management? table scans?
Want app to manage DB integrity? Want DB engine to manage DB
integrity?
Team has SQL skills?

Amazon Amazon
DynamoDB RDS

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Introducing Amazon ElastiCache
Fully-managed, Redis or Memcached compatible, low-latency, in-memory data
store

Extreme Fully Easily


Performance Managed Scalable
In-memory data store and AWS manages all Read scaling with
cache for sub-millisecond hardware and software replicas. Write and memory
response times setup, configuration, scaling with sharding.
monitoring Non disruptive scaling

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
µs is the new ms

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Internet-scale apps need low latency and high
concurrency
Users 1M+

Data volume TB-PB-EB

Locality Global

Performance Milliseconds
to microseconds
Request Rate Millions

Access Mobile, IoT,


Devices
Gaming Financial Social Ride
leaderboards trading media hailing Scale Up-Out-In

Economics Pay as you go

Dating Media Session Developer access Instant API access


streaming stores
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Developers use various approaches to reduce
latency

In-memory databases Specialized hardware Data reduction


and data grids such approaches
as multi-core processors, such as sampling,
GPUs, accelerators aggregation

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon ElastiCache
• In-memory cache in the cloud
• Improve latency and throughput for read-heavy
workloads
• Supports open-source caching engines
• Memcached
• Redis
• Fully managed
• Multi-AZ

Examples
• Caching of MySQL database query results
• Caching of post-processing results
• Caching of user session and frequently accessed data
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ElastiCache Redis
#1 Key-Value Store* Highly Available & Reliable
Fast in-memory data store in the cloud. Use as a database, cache, Read replicas, multiple primaries, multi-AZ with
message broker, queue automatic failover

Fully Managed & Hardened Easily Scalable


AWS manages hardware, software, setup, configuration, Cluster with up to 6.1 TiB of in-memory data
monitoring, failure recovery, and backups
Read scaling with replicas
Write and memory scaling with sharding
Scale out or in
Secure & Compliant
VPC for cluster isolation, encryption at rest/transit,
HIPAA compliance

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

*: https://db-engines.com/en/ranking
ElastiCache Memcached

Fully Managed Secure & Easily


Memcached Hardened Scalable
Fast in-memory data store VPC for cluster Sharding to scale in-memory
in the cloud. Use as a cache isolation cache with up to 20 nodes and
to reduce latency and 8.14 TiB per cluster
improve throughput

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fully managed graph database
Supports open graph APIs
Scalable
Amazon
Neptune
ACID compliant
Multi-AZ

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Neptune
Fully managed graph database for highly connected data

Open Fast & Scalable Reliable Easy

Gremlin
SPARQL

Supports Apache Store billions of relationships; 6 replicas of your data Build powerful queries
TinkerPopTM & W3C RDF query with millisecond latency across 3 AZs with full easily with Gremlin and
graph models backup and restore SPARQL
+
GRAPHQL with AppSync

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Use cases for highly connected data
• Social networking

• Recommendations

• Knowledge graphs

• Fraud detection

• Life sciences

• Network and IT operations


© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fully managed service
• Easily configurable via the console

• Multi-AZ high availability

• Up to 15 read replicas

• Supports encryption at rest

• Supports encryption in transit (TLS)

• Backup and restore, point-in-time recovery

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
(Preview)

Fully managed ledger database


Immutable and transparent
Cryptographically verifiable
Amazon
QLDB Scalable
Serverless

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Quantum Ledger Database (QLDB) (Preview)

Fully managed ledger database


Track and verify history of all changes made to your application’s data

Cryptographically
Immutable verifiable Highly scalable Easy to use

Maintains a sequenced Uses cryptography to Executes 2–3X as many Easy to use, letting you
record of all changes to your generate a secure output transactions as ledgers use familiar database
data, which cannot be file of your data’s history in common blockchain capabilities like SQL APIs
deleted or modified; you have frameworks for querying the data
the ability to query and
analyze the full history

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
How Amazon QLDB works

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Common customer use cases

Banking & Finance E-Commerce Transport & Logistics


Keeping track of transactions, Where’s my stuff? Tracking transportation
trades and accounts of goods

HR & Payroll Manufacturing Government


Tracking changes to an Recording components used Tracking vehicle
individual’s profile in manufacturing title history

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
(Now GA)

Fully managed document database


MongoDB-compatible
Amazon Multi-AZ
DocumentDB

Performance at scale

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Why use a document database?

The JSON document model maps naturally to


application data

Each document can have a different data


structure and is independent of other documents

Index on any key in a document, and run ad hoc and


aggregation queries across your data set

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Use cases for document databases

Content Mobile Personalization


Management

Catalog Retail and User profiles


Marketing
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Use case: Profile Management
users table tankfight_users table
id usernam first_nam last_name id hi_scor global_ra
e e e nk
18127 sue1942 Susan Benoit 18127 3185400 5139
6 6

{
id: 181276,
{ username: 'sue1942',
id: 181276, name: {first: 'Susan',
username: 'sue1942', last: 'Benoit'},
name: {first: 'Susan', tankfight: {
last: 'Benoit'}, hi_score: 3185400,
} global_rank: 5139
}
}

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
MongoDB – #1 NoSQL database engine

Over 60 million downloads


Easy for developers to get started
MongoDB API is rich and powerful
Easy to programmatically query

Source: http://db-engines.com
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
MongoDB Architecture
Sharded cluster scaling dramatically increases operational complexity

Availability Zone 1 Availability Zone 2 Availability Zone 3

Shard Shard Shard


Router Router Router

reads / writes

Shard
Shard Shard
Primary Primary Primary
replication

Secondaries Secondaries Secondaries

Shard Balancing

Config Config Config


Server Server Server

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Running MongoDB is difficult……

TCO

What if you could create and


Experts only scale MongoDB compatible
database clusters in minutes?

Time to scale

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon DocumentDB
Fast, scalable, highly available, fully managed MongoDB-compatible database
service

Millions of requests per Same code, drivers, and Simple


second, millisecond latency tools you use with MongoDB and fully managed

2x throughput of Deeply integrated


Secure and compliant
managed MongoDB services with AWS services

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DocumentDB Architecture
Separate compute and storage provide 2x throughput of current MongoDB
managed services
AWS Region

Availability Zone 1 Availability Zone 2 Availability Zone 3

Instance Instance Instance


(primary)
writes (replica) (replica)
reads

reads
wri

reads
tes writes

Data copies Data copies Data copies


Cluster volume
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fast
Fast, reliable, and fully-managed MongoDB-compatible database service

Automatic
Fast More throughput Analytics
storage scaling

Millions of requests per Separation of storage and DocumentDB will Launch instances in minutes
second with millisecond compute offloads replication, automatically grow the size for analytical queries and
latency; scale-out up to providing 2x the throughput of of your storage volume as shut them down at the end
15 read replicas current MongoDB managed your cluster storage needs of the day
services grow.

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Reliable
Fast, reliable, and fully-managed MongoDB-compatible database service

Automatic failure Automatic Simplified Durable


recovery failover backups

Failing instances are Replicas are Continuous backups with Data is replicated six-
automatically detected and automatically promoted to point in time recovery. ways across three AZs
recovered; no cache warm- primary Scheduled snapshots.
up needed No performance impact.

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fully-managed
Fast, reliable, and fully-managed MongoDB-compatible database service

Automatic patching Quick start Monitoring Integrated

Up-to-date with the latest Provision production- Over 20 key operational Deeply integrated with AWS
patches ready clusters in minutes metrics for your clusters services such as
at no extra charge CloudFormation, CloudTrail,
CloudWatch, DMS, IAM, VPC,
and more.

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
MongoDB-compatible
Fast, reliable, fully-managed MongoDB-compatible database

Same drivers, Migration with


MongoDB 3.6 Replica sets
tools DMS

Compatible with MongoDB Use the same MongoDB Live migrations with Read scaling is easy with
Community Edition 3.6 drivers and tools with DMS; free for 6-months automatic replica set
DocumentDB; as simple as configurations
changing an application
connection string

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
It’s all about
choice Performance-oriented
Cost-oriented

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Any questions?

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you!

© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

You might also like