APP306: Using AWS CloudFormation for Deployment and Management at Scale

Tom Cartwright and Yavor Atanasov, BBC
November 12, 2014 Las Vegas, Nevada

© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Who are we?
Fifth largest site in UK, 55th Globally
• Top 20 in News, Sport, Arts, Children's

Juggling depth of audience and breadth of services is a key challenge

Source: Alexa
What do our services do?
Deploy at scale
• > 300 deployments per day
• 60,000 deployments in first 18 months

Deploy robustly
• All key video transcoding and packaging for BBC iPlayer
• Pipeline delivering election results to BBC News
• Live text for all BBC Sport events
How are Yavor and I involved?
We build tools for the full development lifecycle

Develop Build Deploy Run

And what are we going to talk about?
• Part One – Where did we come from and how did we get
where we are?
• Part Two – What have we built and how do we use AWS
CloudFormation to keep it running?
The beginning
The beginning — 2012
• Olympics dominating our planning and capacity
• On-premises platforms running key BBC Online
properties
• Hard to get focus on other projects
Ops are a constrained resource
Devs can touch test, but Ops own live:
• “Jira-powered deployment”
• 40,000 change tickets since October ’09

Leading to:
• Greater delta between releases
• Longer feedback loops
• High stress around emergency changes
Infrastructure is a constrained resource

Physical infrastructure needs to be bought, racked, configured:

• Weeks of lead time on new hardware
• Limited supplies of existing hardware

Leading to:
• Inflexibility to changing requirements
• Shared tenancy of hardware, weak software isolation
Three emerging trends
Continuous delivery
– Can we build better quality things, faster?
Cloud
– Can we reduce our costs or increase our agility?
DevOps
– Can we strike a better balance of freedom and
responsibility for engineers?
The grappling hook
The grappling hook
• Take two teams: one product, one platform
• Product team takes advantage of features as they become available from platform and feeds requirements in
• Platform team builds features based on need but looks to make them scale to many users
• Get the learning in software, not slideware

Continuous delivery
• Automate everything
• Keep everything in source control
• Build your binaries once
• Use the same mechanism to deploy to
all environments
• If anything fails, stop the line

Think continuous improvement — direction not position

DevOps
The people that wrote it:
– Will fix problems fastest
– Know when it is sensible to deploy

So give them the access to do it and ask them to take responsibility for their actions
November 2012
• Spoke to others solving the same problems
• Began to focus on the underlying principles rather than
immediate problems
• Came home and mustered the Simian Army
Grappling hook — reflections
The good
• Infrastructure costs exactly as predicted
• Numerous platform features ready for further use
• Had a developing set of principles around good practice

The not-so-good
• We learned many lessons about how to build, fewer about why…
Storming the tower
The platform pendulum
Restriction ↔ Freedom
Predictability ↔ Chaos
Slow ↔ Fast

Tools
Establishing principles
• Establish strong defaults for the way things get
built and create tooling for that
• Assume that there will be use cases where the
defaults don’t fit
Managing infrastructure at scale
• Repeatability
– Never “spin it up in the console and hope”
• Flexibility
– Teams are going to need that obscure service
• StackOverflow-ability
– If there is a well-known way of expressing it in the
world, use it
Managing deployment at scale
• Repeatability
– All instances should be identical
• Robustness
– Look for fail-safe mechanisms
• Resilience
– Minimize dependencies at instance startup
Handling support at scale
• Access
– Engineers should have access to the services they run
• Patterns
– Create patterns and templates for core infrastructure pieces
• Support
– Ask developers to take “the phone”
The rest is just software…
Inside the machine
How we deploy
Build binaries in a reproducible way; build them once; automate everything

[Pipeline diagram. Labels: Version Control (Pull, Commit, Push), Build, Jenkins, Repos, Register, Deploy, Cosmos, Test, Promote, Bake, Bakery, Live]
Infrastructure provisioning
Hardware is now software, embrace it and treat it that way!

• Build infrastructure in a reliable and reproducible way, just like you build software
Infrastructure as code and AWS CloudFormation

• Managed infrastructure dependencies
• AWS API interactions taken care of for you
• Reproducibility
• Versioning
What does that mean for my application?

• I can build identical copies of my app in different environments
• I can version my infrastructure templates with my code and reproduce the full stack at any point in time
So my application is not just software, it is software and infrastructure combined

v1
v2

v3
Application infrastructure
Let’s look at what an application might look like and how we can define it with AWS CloudFormation

Application =
• Auto Scaling Group
• Security Groups
• IAM Roles and Policies
• Elastic Load Balancer
• Route 53 Record
+
• RDS database
• S3 bucket
• SQS Queue
• SNS Topic
…defined in CloudFormation stacks
Separate stateful and stateless resources into separate templates

service-0.1.0.json:
• Auto Scaling Group
• Security Groups
• IAM Roles and Policies
• Elastic Load Balancer
• Route 53 Record

resources-0.1.0.json:
• RDS database
• S3 bucket
• SQS Queue
• SNS Topic
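The split between the two templates can be sketched in a few lines of Python. This is only an illustration, assuming a hypothetical `split_template` helper and a hand-picked set of resource types treated as stateful; the logical names such as `AppDatabase` are invented, and resource properties are left out.

```python
import json

# Assumption: these resource types are treated as stateful because they hold
# data that must survive a redeploy. The set is illustrative, not exhaustive.
STATEFUL_TYPES = {
    "AWS::RDS::DBInstance",
    "AWS::S3::Bucket",
    "AWS::SQS::Queue",
    "AWS::SNS::Topic",
}


def split_template(resources):
    """Split a mapping of logical name -> resource into two templates."""
    service = {"AWSTemplateFormatVersion": "2010-09-09", "Resources": {}}
    stateful = {"AWSTemplateFormatVersion": "2010-09-09", "Resources": {}}
    for name, resource in resources.items():
        target = stateful if resource["Type"] in STATEFUL_TYPES else service
        target["Resources"][name] = resource
    return service, stateful


# Hypothetical application; Properties are omitted to keep the sketch short.
app = {
    "AppGroup": {"Type": "AWS::AutoScaling::AutoScalingGroup"},
    "AppSecurityGroup": {"Type": "AWS::EC2::SecurityGroup"},
    "AppRole": {"Type": "AWS::IAM::Role"},
    "AppLoadBalancer": {"Type": "AWS::ElasticLoadBalancing::LoadBalancer"},
    "AppDns": {"Type": "AWS::Route53::RecordSet"},
    "AppDatabase": {"Type": "AWS::RDS::DBInstance"},
    "AppBucket": {"Type": "AWS::S3::Bucket"},
    "AppQueue": {"Type": "AWS::SQS::Queue"},
    "AppTopic": {"Type": "AWS::SNS::Topic"},
}

service_template, resources_template = split_template(app)
print(json.dumps(resources_template, indent=2))
```

Keeping the stateful stack separate means the service stack can be updated or torn down on every deployment without touching the database, bucket, queue, or topic.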
The best way to form clouds

• JSON is great for defining infrastructure
• But if you find yourself repeating the same template over and over, consider abstracting it in code
• E.g., https://github.com/cloudtools/troposphere for Python
JSON vs code
Abstracting AWS CloudFormation allows us to create default service templates and provide them to teams in a concise way.

530 lines of JSON vs 5 lines of Python
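As a rough illustration of the idea, the sketch below generates a template from a function with defaults, using only the Python standard library. It is not the troposphere API and not the BBC's tooling: the `service_template` helper, its parameters, and every default value are assumptions made for the example.

```python
import json


def service_template(name, instance_type="m3.medium", min_size=2, max_size=4):
    """Return a CloudFormation template (as a dict) for a default service.

    All defaults here are hypothetical; a real default template would also
    carry security groups, IAM roles, a load balancer, and DNS records.
    """
    return {
        "AWSTemplateFormatVersion": "2010-09-09",
        "Description": "Default service stack for " + name,
        "Parameters": {"ImageId": {"Type": "String"}},
        "Resources": {
            "LaunchConfig": {
                "Type": "AWS::AutoScaling::LaunchConfiguration",
                "Properties": {
                    "ImageId": {"Ref": "ImageId"},
                    "InstanceType": instance_type,
                },
            },
            "Group": {
                "Type": "AWS::AutoScaling::AutoScalingGroup",
                "Properties": {
                    "LaunchConfigurationName": {"Ref": "LaunchConfig"},
                    "MinSize": str(min_size),
                    "MaxSize": str(max_size),
                    "AvailabilityZones": {"Fn::GetAZs": ""},
                },
            },
        },
    }


# A team only states what differs from the defaults.
template = service_template("example-service", max_size=6)
print(json.dumps(template, indent=2))
```

The calling code stays short while the generated JSON grows with every default the platform adds, which is the trade the slide describes.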

AWS CloudFormation and deployments
How we deploy
Cosmos bakes an AMI and then updates the service stack…

[Pipeline diagram. Labels: Version Control (Pull, Commit, Push), Build, Jenkins, Repos, Register, Cosmos, Bake, Bakery, Test, Live; two arrows are labelled "Updates service stack"]
The Bakery

• Takes repository information, packages to install and environment-specific configuration
• Bakes AMIs using a two-step snapshot process: one snapshot just for the software and one for the software with the configuration
Building machines is like building software

• Build binaries once
• Build them in a reproducible way
What’s in a machine?

Environment
configuration

Service

Software binary

Base OS
2 step snapshotting

snap-432jrse
snap-w3r153r
Re-baking for different environments

snap-456qwf
snap-w3r153r

snap-w3r153r
snap-w3r153r
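One way to read the two-step process is that the software snapshot is built once and reused, while only the configured snapshot changes per environment. The sketch below models that reading without calling AWS: `fake_snapshot_id`, `bake`, the package name, and the configuration strings are all hypothetical, and the ids are derived from hashes purely to show which inputs each snapshot depends on.

```python
import hashlib


def fake_snapshot_id(*parts):
    """Derive a stand-in snapshot id from the inputs (no AWS call is made)."""
    digest = hashlib.sha256("|".join(parts).encode("utf-8")).hexdigest()
    return "snap-" + digest[:8]


def bake(packages, environment_config):
    """Return (configured_snapshot, software_snapshot) for one environment.

    The software snapshot depends only on the packages, so every environment
    gets the same one; the configured snapshot adds the environment settings.
    """
    software = fake_snapshot_id("software", *sorted(packages))
    configured = fake_snapshot_id("configured", software, environment_config)
    return configured, software


packages = ["example-service-1.2.3"]  # hypothetical package name
test_image = bake(packages, "environment=test")
live_image = bake(packages, "environment=live")

print(test_image)
print(live_image)
```

Under this model, re-baking for another environment changes only the first id of the pair, which mirrors the "build binaries once" principle.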
How we deploy
Cosmos bakes an AMI and then updates the service stack…

[Pipeline diagram. Labels: Version Control (Pull, Commit, Push), Build, Jenkins, Repos, Register, Cosmos, Bake, Bakery, Test, Live; two arrows are labelled "Updates service stack"]
…what actually happens

• Cosmos updates the ImageId property of the Auto Scaling Group’s LaunchConfiguration
• Based on the specified UpdatePolicy, the ASG starts refreshing the instances with new ones using the new AMI
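The template change itself is small. The following sketch shows it on a trimmed, hypothetical template; `with_new_image` and the AMI ids are invented for illustration, and submitting the result to CloudFormation as a stack update is deliberately left out.

```python
import copy


def with_new_image(template, image_id):
    """Return a copy of the template whose launch configurations use image_id."""
    updated = copy.deepcopy(template)
    for resource in updated["Resources"].values():
        if resource["Type"] == "AWS::AutoScaling::LaunchConfiguration":
            resource["Properties"]["ImageId"] = image_id
    return updated


# Hypothetical, heavily trimmed service template.
current = {
    "Resources": {
        "LaunchConfig": {
            "Type": "AWS::AutoScaling::LaunchConfiguration",
            "Properties": {"ImageId": "ami-11111111", "InstanceType": "m3.medium"},
        },
        "Group": {
            "Type": "AWS::AutoScaling::AutoScalingGroup",
            "Properties": {"LaunchConfigurationName": {"Ref": "LaunchConfig"}},
        },
    }
}

new_template = with_new_image(current, "ami-22222222")
print(new_template["Resources"]["LaunchConfig"]["Properties"]["ImageId"])
```

Because only one property differs between the old and new templates, the stack update is limited to the launch configuration and the instances behind it.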
Optimizing the ASG UpdatePolicy

• On test environments you can optimize for speed and replace all instances at once
• Once live, you should update the ASG in batches making sure you don’t have downtime
…for example
For a service with an ASG with 5 instances…

TEST
"UpdatePolicy": {
  "AutoScalingRollingUpdate": {
    "PauseTime": "PT0S",
    "MaxBatchSize": "5",
    "MinInstancesInService": "0"
  }
}

LIVE
"UpdatePolicy": {
  "AutoScalingRollingUpdate": {
    "PauseTime": "PT15S",
    "MaxBatchSize": "2",
    "MinInstancesInService": "2"
  }
}
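To see what the two policies mean for a five-instance group, the sketch below estimates the batch sizes of a rolling replacement. It is a simplified model written for this example, not CloudFormation's actual algorithm: the hypothetical `rolling_batches` function ignores PauseTime, health checks, and scaling activity.

```python
def rolling_batches(instances, max_batch_size, min_in_service):
    """Return the sizes of the batches a rolling replacement would use.

    Simplified model: each batch is as large as allowed while keeping at
    least min_in_service instances running.
    """
    batch = min(max_batch_size, instances - min_in_service)
    if batch <= 0:
        raise ValueError("min_in_service must be lower than the instance count")
    sizes = []
    remaining = instances
    while remaining > 0:
        step = min(batch, remaining)
        sizes.append(step)
        remaining -= step
    return sizes


print(rolling_batches(5, max_batch_size=5, min_in_service=0))  # TEST: [5]
print(rolling_batches(5, max_batch_size=2, min_in_service=2))  # LIVE: [2, 2, 1]
```

In this model the test policy replaces everything in one batch, while the live policy takes three batches and never drops below two running instances.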
Let’s see it in action!

[Pipeline diagram. Labels: Version Control (Pull, Commit, Push), Build, Jenkins, Repos, Register, Cosmos, Bake, Bakery, Test, Live; two arrows are labelled "Updates service stack"]
Demo time
Let’s deploy one of our services and
see what happens…
AWS CloudFormation beyond the app
Defining our core infrastructure

• Provides the frame upon which services’ infrastructure is built
• Provides security and resilience through levels of isolation
Levels of isolation

• Network and instance access: be isolated by default
• Resource isolation: find all API limits and resource limits and avoid sharing those among your critical services; use different AWS accounts
Core infrastructure
Each AWS account is set up with an Amazon Virtual Private Cloud spanning three Availability Zones; the VPC contains three private and three public subnets

Services’ ASGs are positioned in the private subnets and their load balancers go in the public ones

[Diagram: eu-west-1a, eu-west-1b, and eu-west-1c, each with a Private and a Public subnet]
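A layout like this could be generated rather than written by hand. The sketch below builds a template with one VPC and a private and public subnet in each of the three zones. The CIDR ranges and logical names are assumptions, and the route tables and internet gateway that actually make a subnet public or private are omitted.

```python
import json

ZONES = ["eu-west-1a", "eu-west-1b", "eu-west-1c"]


def vpc_template():
    """Build a template with one VPC and a private and public subnet per zone.

    The CIDR ranges are assumptions chosen for the sketch. Routing, which is
    what really distinguishes public from private subnets, is not modelled.
    """
    resources = {
        "Vpc": {"Type": "AWS::EC2::VPC", "Properties": {"CidrBlock": "10.0.0.0/16"}}
    }
    for index, zone in enumerate(ZONES):
        for offset, kind in enumerate(["Private", "Public"]):
            third_octet = index * 2 + offset
            resources["{}Subnet{}".format(kind, index + 1)] = {
                "Type": "AWS::EC2::Subnet",
                "Properties": {
                    "VpcId": {"Ref": "Vpc"},
                    "AvailabilityZone": zone,
                    "CidrBlock": "10.0.{}.0/24".format(third_octet),
                },
            }
    return {"AWSTemplateFormatVersion": "2010-09-09", "Resources": resources}


template = vpc_template()
print(json.dumps(template, indent=2))
```

Generating the subnets in a loop keeps the three zones symmetrical, so every account built from the template gets the same frame.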
Environments
Development and production environments are built in separate accounts to bring full isolation from API and resource limits

All managed via AWS CloudFormation stacks

[Diagram: Production and Development accounts]
SSH access
SSH access is granted via Bastion machines positioned in a dedicated VPC, which is peered with the VPCs that should be accessed

[Diagram: Production, Development, and Bastions]
In Closing…
Recapping
Scale
• > 300 deployments per day
• 50,000 deployments in first 18 months

Speed
• Time from laptop to live reduced from 2 days to 10 minutes

Commitment
• All key video transcoding and packaging for BBC iPlayer
• Pipeline delivering election results to BBC News
• Live text for all BBC Sport events
Want to know more?
• We’re starting to share our work: https://github.com/bbc
• Come and talk to us or our colleagues this week
• We’re hiring, in London and Salford, UK: http://www.bbc.co.uk/careers
Please give us your feedback on this session.
Complete session evaluations and earn re:Invent swag.

APP306 http://bit.ly/awsevals

Join the conversation on Twitter with #reinvent
