State of Cassandra 2011
Jonathan Ellis
Apache Chair
CTO
DataStax
Job Trends from Indeed.com




Customer and Verticals
•    Financial
•    Social Media
•    Advertising
•    Entertainment
•    Energy
•    E-tail
•    Health care
•    Government



Why?
Why Cassandra?
Better technology
•    Multi-master, multi-DC
•    Linearly scalable
•    Larger-than-memory datasets
•    Best-in-class performance (not just writes!)
•    Fully durable
•    Integrated caching
•    Tunable consistency




Tunable Consistency
 Write level          Read level

 ANY                  -
 ONE                  ONE
 LOCAL_QUORUM         LOCAL_QUORUM
 QUORUM               QUORUM
 ALL                  ALL
Generalizes Easily to Multi-DC
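
A minimal sketch of how the levels trade off, assuming the usual replica-overlap rule: a read is guaranteed to see the latest acknowledged write when write acks plus read acks exceed the replication factor. The function names are hypothetical and are not Cassandra APIs. ANY is modeled as zero guaranteed replica acks because a stored hint alone can satisfy it, and LOCAL_QUORUM is modeled as a quorum of the replicas in the coordinator's own datacenter.

```python
def acks_required(level, rf, local_rf=None):
    """Replica acknowledgements a request waits for (illustrative model).

    rf is the total replication factor across all datacenters; local_rf is
    the replication factor of the coordinator's datacenter and is used only
    by LOCAL_QUORUM.
    """
    if level == "ANY":
        return 0  # write-only: a hint is enough, no replica is guaranteed
    if level == "ONE":
        return 1
    if level == "LOCAL_QUORUM":
        if local_rf is None:
            raise ValueError("LOCAL_QUORUM needs local_rf")
        return local_rf // 2 + 1
    if level == "QUORUM":
        return rf // 2 + 1
    if level == "ALL":
        return rf
    raise ValueError(f"unknown consistency level: {level}")


def read_sees_latest_write(write_level, read_level, rf):
    """True when write and read replica sets must intersect (W + R > N)."""
    if read_level == "ANY":
        raise ValueError("ANY is a write-only level")
    return acks_required(write_level, rf) + acks_required(read_level, rf) > rf
```

With three replicas, QUORUM writes plus QUORUM reads overlap (2 + 2 > 3), while ONE plus ONE does not. With two datacenters of three replicas each, QUORUM waits for 4 acks across both sites and LOCAL_QUORUM for 2 in the local one.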




0.7
•    CREATE COLUMN FAMILY
•    Expiring columns (TTL)
•    Secondary (column) indexes
•    Efficient streaming




0.8
•    CQL
•    Counters
•    Automatic memtable tuning
•    New bulk load interface




A performance retrospective




October 8, 2011

Road to 1.0

Theme: polish
•    Repair
•    Compaction
•    Optimize reads for update-heavy workloads
•    CQL 1.1




Repair
•  Consistency is checked per-ColumnFamily but data
   is transferred per-Keyspace
•  Merkle tree requests are sent en masse, but may
   not start executing at the same time
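
For readers unfamiliar with the Merkle trees mentioned above, here is a small sketch of the general idea: each replica hashes its rows into leaves, builds a hash tree, and two trees are compared top-down so that only the differing leaves need to be exchanged. This is an illustration under simplifying assumptions (a power-of-two number of hash buckets standing in for token ranges, hypothetical function names), not Cassandra's repair code.

```python
import hashlib


def bucket_of(key, buckets):
    """Map a row key to a leaf index (a stand-in for a token range)."""
    return int(hashlib.sha256(key.encode()).hexdigest(), 16) % buckets


def build_tree(rows, buckets=4):
    """Return the tree as a list of levels, leaves first and root last.

    `buckets` must be a power of two so every level pairs up evenly.
    """
    leaves = [hashlib.sha256() for _ in range(buckets)]
    for key in sorted(rows):
        leaves[bucket_of(key, buckets)].update(f"{key}={rows[key]};".encode())
    levels = [[h.digest() for h in leaves]]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append([hashlib.sha256(prev[i] + prev[i + 1]).digest()
                       for i in range(0, len(prev), 2)])
    return levels


def differing_leaves(tree_a, tree_b):
    """Descend from the root, following only subtrees whose hashes differ."""
    top = len(tree_a) - 1
    if tree_a[top][0] == tree_b[top][0]:
        return []  # identical roots: the replicas agree
    suspects = [0]
    for level in range(top - 1, -1, -1):
        suspects = [child
                    for parent in suspects
                    for child in (2 * parent, 2 * parent + 1)
                    if tree_a[level][child] != tree_b[level][child]]
    return suspects
```

Comparing `build_tree(replica_a)` with `build_tree(replica_b)` returns the leaf indexes whose rows would need to be streamed between the two replicas.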




Compression
•  Rows-per-block or blocks-per-row




Read Performance: Compaction




Level-based Compaction
•  SSTables are non-overlapping within a level
•  Bounds the number that can contain a given row




                                  L2: 1000 MB

                                  L1: 100 MB


                                  L0: newly flushed
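
A rough sketch of why non-overlapping levels bound the read cost. It assumes each SSTable can be summarized as a hypothetical (first_key, last_key) pair, that L0 tables may overlap each other, and that every higher level is a sorted list of disjoint ranges. Under those assumptions a key can sit in at most one table per level, so a lookup touches at most len(L0) plus one table per higher level.

```python
from bisect import bisect_right


def candidate_sstables(l0, levels, key):
    """Return the (first_key, last_key) ranges that may contain `key`.

    l0     -- newly flushed tables; ranges may overlap, so all are checked
    levels -- list of levels (L1, L2, ...); each level is sorted by first_key
              and its ranges do not overlap
    """
    hits = [rng for rng in l0 if rng[0] <= key <= rng[1]]
    for level in levels:
        starts = [rng[0] for rng in level]
        # Rightmost table whose first_key <= key; no other table in this
        # level can contain the key because the ranges are disjoint.
        i = bisect_right(starts, key) - 1
        if i >= 0 and key <= level[i][1]:
            hits.append(level[i])
    return hits
```

For example, with two overlapping L0 tables and two higher levels, a lookup never considers more than four tables no matter how many tables the higher levels hold.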
  
Read performance: maxtimestamp
•  Sort sstables by maximum (client-provided)
   timestamp
•  Only merge sstables until we have the columns
   requested
•  Allows pre-merging highly fragmented rows without
   waiting for compaction
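
The idea above can be sketched as follows. This is an illustration only: the dictionary layout and function name are assumptions, and tombstones and slice queries are ignored. SSTables are visited from newest to oldest maximum timestamp, and the read stops once every requested column already has a value newer than anything the remaining tables could hold.

```python
def read_columns(sstables, wanted):
    """Merge only as many sstables as needed for the `wanted` columns.

    Each sstable is a dict with:
      "max_timestamp" -- largest client-provided timestamp in the table
      "columns"       -- {column_name: (timestamp, value)}
    Returns ({column_name: (timestamp, value)}, number_of_tables_read).
    """
    result = {}
    tables_read = 0
    newest_first = sorted(sstables, key=lambda t: t["max_timestamp"],
                          reverse=True)
    for table in newest_first:
        # Stop when no remaining table can override what we already have.
        if all(col in result and result[col][0] > table["max_timestamp"]
               for col in wanted):
            break
        tables_read += 1
        for col in wanted:
            if col in table["columns"]:
                ts, value = table["columns"][col]
                if col not in result or ts > result[col][0]:
                    result[col] = (ts, value)
    return result, tables_read
```

If a requested column does not exist in any table, the early exit never triggers and every table is still consulted, so the shortcut helps most for rows whose columns are overwritten frequently.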




CQL

cqlsh> SELECT * FROM users WHERE state='UT' AND birth_date > 1970;

        KEY | birth_date |         full_name | state |
 bsanderson |       1975 | Brandon Sanderson |    UT |




CQL 1.1
•    ALTER
•    Counter support
•    TTL support
•    Compound columns
•    Prepared statements




Post-1.0
•  Ease of use

• Ease of use

• Ease of use




Post-1.0 features
•    “Native” CQL transport
•    Triggers
•    Entity groups
•    Smarter range queries




Brisk
•  Analytics for your realtime data without ETL
•  Widens scope of Cassandra’s applicability
•  Also: Solandra




Questions?