SlideShare a Scribd company logo
Highly available, scalable and secure data with
Cassandra and DataStax Enterprise
GOTO Berlin
27th February 2014
DataStax
•  Founded in April 2010
•  We drive Apache Cassandra™
•  400+ customers (24 of the Fortune 100)
•  220+ employees
•  Contribute approximately 80% of the code to Cassandra
•  Home to Apache Cassandra Chair & most committers
•  Headquartered in San Francisco Bay area
•  European headquarters established in London
Our Goal
To be the first and best database choice for online applications
©2014 DataStax Confidential. Do not distribute without consent. 2
Training
•  Checkout the DataStax academy for free online virtual training!
•  http://www.datastax.com/virtual-training
•  Public courses
•  http://www.datastax.com/course-catalog
•  On-site training
http://www.datastax.com/training
©2014 DataStax Confidential. Do not distribute without consent. 3
DataStax Enterprise for start-ups
•  DataStax gives qualifying start-ups access to DataStax Enterprise for free!
•  For more information:
•  http://www.datastax.com/startup
©2014 DataStax Confidential. Do not distribute without consent. 4
DataStax
•  DataStax supports both the open source community and enterprises.
©2014 DataStax Confidential. Do not distribute without consent. 5
Open Source/Community Enterprise Software
•  Apache Cassandra (employ
Cassandra chair and 80+% of
the committers)
•  DataStax Community Edition
•  DataStax OpsCenter
•  DataStax DevCenter
•  DataStax Drivers/Connectors
•  Online Documentation
•  Online Training
•  Mailing lists and forums
•  DataStax Enterprise Edition
•  Certified Cassandra
•  Built-in Analytics
•  Built-in Enterprise Search
•  Enterprise Security
•  DataStax OpsCenter
•  Expert Support
•  Consultative Help
•  Professional Training
DataStax Enterprise
•  DataStax Enterprise: LOB* applications, with analytics and search for online/real-time
application data.
©2014 DataStax Confidential. Do not distribute without consent. 6
LOB
App
NoSQL
LOB
App
NoSQL
LOB
App
NoSQL
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
Transactions:
•  LOB Style
•  Tunable
consistency
Analytics:
•  MapReduce
•  Hive
•  Pig
•  Mahout
Search
•  Solr
Data Warehouse
Hadoop
Transactions:
•  None
Analytics:
•  MapReduce
•  Hive
•  Pig
•  Mahout
Search
•  Solr
*Line of business
•  Hadoop: data warehouse applications with analytics and search for data warehouse.
Availability and Speed Matters for online apps!
•  UK retailers lost 8.5 billion last year to slow web sites, which is 1 million
for every 10 million in online sales
•  Over half of all web users expect a response time of 2 seconds or less
•  A 1 second delay causes a nearly 10% reduction in customer interactions
•  A 1 second decrease in Amazon page load time costs the company $1.6
billion in sales
©2014 DataStax Confidential. Do not distribute without consent. 7
Apache Cassandra™
•  Apache Cassandra™ is a massively scalable, open source, NoSQL, distributed
database built for modern, mission-critical online applications.
•  Written in Java and is a hybrid of Amazon Dynamo and Google BigTable
•  Masterless with no single point of failure
•  Distributed and data centre aware
•  100% uptime
•  Predictable scaling
©2014 DataStax Confidential. Do not distribute without consent. 8
Dynamo
BigTable
BigTable: http://research.google.com/archive/bigtable-osdi06.pdf
Dynamo: http://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf
CASSANDRA"
Cassandra – Core Values
•  Ease of use
•  Massive scalability
•  High performance
•  Always Available
©2014 DataStax Confidential. Do not distribute without consent. 9
Cassandra – Performance and Scale
©2014 DataStax Confidential. Do not distribute without consent. 10
“In terms of scalability, there is a clear winner throughout our
experiments. Cassandra achieves the highest throughput for the
maximum number of nodes in all experiments with a linear increasing
throughput.”
Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et
al., August 2012. Benchmark paper presented at the Very Large Database Conference, 2012.
http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2012.pdf
End Point Independent NoSQL Benchmark
Lowest in latency…
http://techblog.netflix.com/2011/11/benchmarking-
cassandra-scalability-on.html
http://www.datastax.com/wp-content/uploads/2013/02/WP-
Benchmarking-Top-NoSQL-Databases.pdf
Netflix Cloud Benchmark…
Highest in throughput…
Cassandra - Performance and Scale
•  Cassandra works for small to huge deployments.
©2014 DataStax Confidential. Do not distribute without consent. 11
•  Cassandra Footprint @ Netflix
•  80+ Clusters
•  2500+ nodes
•  4 Data Centres (Amazon Regions)
•  > 1 Trillion transactions per day
http://planetcassandra.org/functional-use-cases/
Cassandra –Overview
•  Cassandra was designed with the understanding that system/hardware failures can and do
occur
•  Peer-to-peer, distributed system
•  All nodes the same
•  Data partitioned among all nodes in the cluster
•  Custom data replication to ensure fault tolerance
•  Read/Write-anywhere and across data centres
©2014 DataStax Confidential. Do not distribute without consent. 12
Node 1
Node 2
Node 3Node 4
Node 5
Cassandra – More Than One Server
•  All nodes participate in a cluster
•  Add or remove as needed
•  All nodes the same – masterless with no single point of failure
•  Each node communicates with each other through the Gossip
protocol, which exchanges information across the cluster every
second
•  Data partitioned among all nodes in the cluster
•  A commit log is used on each node to capture write activity.
Data durability is assured
•  Data also written to an in-memory structure (memtable) and
then to disk once the memory structure is full (an SStable)
•  More capacity? Add a server!
•  More throughput? Add a server!
©2014 DataStax Confidential. Do not distribute without consent. 13
Node 1
Node 2
Node 3Node 4
Node 5
•  Replication factor (RF): How many copies of your
data?
•  RF = 3 in this example
•  Each node is storing 60% of the clusters total data
i.e. 3/5
Handy Calculator: http://www.ecyrd.com/cassandracalculator/
Cassandra – Locally Distributed
©2014 DataStax Confidential. Do not distribute without consent. 14
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
•  Client reads or writes to any node
•  Node coordinates with others
•  Data read or replicated in parallel
Cassandra – Rack Aware
•  Cassandra is aware of which rack (or
availability zone) each node resides in.
•  It will attempt to place each data copy in a
different rack.
•  RF = 3 in this example
©2014 DataStax Confidential. Do not distribute without consent. 15
Node 1
1st copy
Node 4
Node 2
Node 3
2nd copy
Rack 1
Rack 2Rack 2
Rack 3
Rack 1
Node 5
3rd copy
Cassandra – Data Centre Aware
©2014 DataStax Confidential. Do not distribute without consent. 16
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
DC: EUROPEDC: USA•  Active Everywhere – reads/writes in
multiple data centres
•  Client writes local
•  Data syncs across WAN
•  Replication Factor per DC
•  Different number of nodes per
data centre
Cassandra – Tunable Consistency
•  Consistency Level (CL)
•  Client specifies per read or write
•  Handles multi-data center operations
•  ALL = All replicas ack
•  QUORUM = > 51% of replicas ack
•  LOCAL_QUORUM = > 51% in local DC ack
•  ONE = Only one replica acks
•  Plus more…. (see docs)
•  Blog: Eventual Consistency != Hopeful Consistency
http://planetcassandra.org/blog/post/a-netflix-experiment-eventual-consistency-
hopeful-consistency-by-christos-kalantzis/
©2014 DataStax Confidential. Do not distribute without consent. 17
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Parallel
Write
Write
CL=QUORUM
5 μs ack
12 μs ack
500 μs ack
12 μs ack
Node Failure
•  A single node failure shouldn’t bring failure.
•  Replication Factor + Consistency Level = Success
•  This example:
•  RF = 3
•  CL = QUORUM
©2014 DataStax Confidential. Do not distribute without consent. 18
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Parallel
Write
Write
CL=QUORUM
5 μs ack
12 μs ack
12 μs ack
>51% ack – so request is a success
Node Recovery
•  When a write is performed and a replica node for the row is
unavailable the coordinator will store a hint locally.
•  When the node recovers, the coordinator replays the missed
writes.
•  Note: a hinted write does not count towards the consistency
level
•  Note: you should still run repairs across your cluster
http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/
dml_about_hh_c.html
©2014 DataStax Confidential. Do not distribute without consent. 19
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Stores Hints while Node 3 is offline
Rack Failure
•  Cassandra will place the data in as many different
racks or availability zones as it can.
•  This example:
•  RF = 3
•  CL = QUORUM
•  Rack 2 fails
•  Data copies still available in Node 1 and Node 5
•  Quorum can be honored i.e. > 51% ack
©2014 DataStax Confidential. Do not distribute without consent. 20
Node 1
1st copy
Node 4
Node 2
Node 3
2nd copy
Rack 1
Rack 2Rack 2
Rack 3
Rack 1
Node 5
3rd copy
request is a success
Don’t be afraid of Weak Consistency
•  More tolerant to failure
•  Consistency Level of 1 is the most popular (I think)
•  If you want stronger consistency go for LOCAL_QUORUM i.e. quorum is
honored in the local data centre.
•  If you go stronger than LOCAL_QUORUM – understand what this means
and why you are doing it.
•  Remember – you can have different consistency levels for reads and
writes e.g. write with CL:1, read with CL:LOCAL_QUORUM
©2014 DataStax Confidential. Do not distribute without consent. 21
Example Application
©2014 DataStax Confidential. Do not distribute without consent. 22
Cassandra
Replication
Cassandra
Replication
•  Active-Active-Active
•  Service based DNS routing
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Example Application - Uptime
©2014 DataStax Confidential. Do not distribute without consent. 23
Cassandra
Replication
Cassandra
Replication
•  Normal service maintenance
•  Application is unaware
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Example Application – DC Failure
©2014 DataStax Confidential. Do not distribute without consent. 24
Cassandra
Replication
Cassandra
Replication
•  Data is safe.
•  Route Traffic
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Tier Failure
©2014 DataStax Confidential. Do not distribute without consent. 25
Cassandra
Replication
•  App Tier is aware of the other DC
•  Switches to access remote DC automatically
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
WAN Failure
©2014 DataStax Confidential. Do not distribute without consent. 26
Cassandra
Replication
Cassandra
Replication
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Consistency level?
Cassandra Clients - Native Driver
•  Clients that use the native driver also have access to various policies that enable the client to
intelligently route requests as required.
•  This includes:
•  Load Balancing
•  Data Centre Aware
•  Latency Aware
•  Token Aware
•  Reconnection policies
•  Retry policies
•  Downgrading Consistency
•  Plus others..
•  http://www.datastax.com/download/clientdrivers
©2014 DataStax Confidential. Do not distribute without consent. 27
Quotes
•  “Cassandra, our distributed cloud persistence store which is
distributed across all zones and regions, dealt with the loss of one
third of its regional nodes without any loss of data or availability”.
http://techblog.netflix.com/2012/07/lessons-netflix-learned-from-aws-storm.html
•  “During Hurricane Sandy, we lost an entire data center. Completely.
Lost. It. Our application fail-over resulted in us losing just a few
moments of serving requests for a particular region of the country, but
our data in Cassandra never went offline.”
http://planetcassandra.org/blog/post/outbrain-touches-over-80-of-all-us-online-users-with-help-from-cassandra/
©2014 DataStax Confidential. Do not distribute without consent. 28
Ring-fenced resources
•  If you need to isolate resources for different uses, Cassandra is a great fit.
•  You can create separate virtual data centres optimised as required – different workloads,
hardware, availability etc..
•  Cassandra will replicate the data for you – no ETL is necessary
©2014 DataStax Confidential. Do not distribute without consent. 29
Cassandra
Replication
Customer
Facing
Analytics
Hybrid Cloud
•  DataStax Enterprise and Cassandra are multi-data centre and cloud capable
•  Data written to any node is automatically and transparently written to all other nodes in
multiple data centres i.e. no etl
©2014 DataStax Confidential. Do not distribute without consent. 30
Data Centre 1 Data Centre2
Public Cloud
Security in Cassandra
FEATURESBENEFITS
Internal Authentication Manages
login IDs and passwords inside
the database
+  Ensures only authorized users
can access a database system
using internal validation
+  Simple to implement and easy
to understand
+  No learning curve from the
relational world
Object Permission Management
controls who has access to what
and who can do what in the
database
+  Provides granular based control
over who can add/change/
delete/read data
+  Uses familiar GRANT/REVOKE
from relational systems
+  No learning curve
Client to Node Encryption
protects data in flight to and from
a database cluster
+  Ensures data cannot be
captured/stolen in route to a
server
+  Data is safe both in flight from/
to a database and on the
database; complete coverage is
ensured
Advanced Security in DataStax Enterprise
FEATURESBENEFITS
External Authentication uses
external security software
packages to control security
+  Only authorized users have
access to a database system
using external validation
+  Uses most trusted external
security packages (Kerberos),
mainstays in government and
finance
+  Single sign on to all data
domains
Transparent Data Encryption
encrypts data at rest
+  Protects sensitive data at rest
from theft and from being read
at the file system level
+  No changes needed at
application level
Data Auditing
provides trail of who did and
looked at what/when
+  Supplies admins with an audit
trail of all accesses and
changes
+  Granular control to audit only
what’s needed
+  Uses log4j interface to ensure
performance and efficient audit
operations
Data Replication Security in Cassandra
•  A popular feature from a data security perspective is the ability to control at a keyspace/schema level which data centres
data should be replicated to.
•  What this means is that in a multi-data centre (both physical and virtual) cluster you can ensure that data is not shipped
anywhere it shouldn’t be and access to that data can be controlled.
•  This is very simple to set-up and is extremely useful when you need to share some of your data, but not all of you data or if
you have requirements around where your data is permitted to reside.
DC 1 DC 2
Shared Data
DataStax Enterprise 4.0, OpsCentre 4.1
•  DataStax Enterprise 4.0
•  New in-memory option. Brings all of the goodness of Cassandra to an in-memory
database
•  Production-certified version of Apache Cassandra (2.0)
•  Enterprise search enhancements
•  OpsCenter 4.1
•  Capacity planning updates
•  Better insight into node performance
•  More information:
http://www.datastax.com/wp-content/uploads/2014/02/WP-WhatsNewDSE40.pdf
http://www.datastax.com/download
©2014 DataStax Confidential. Do not distribute without consent. 34
How does in-memory work?
•  Developers can create new tables to be in-memory or alter existing tables
to be in-memory
•  Writes are durable
•  10-100x improvement
©2014 DataStax Confidential. Do not distribute without consent. 35
Find Out More
•  DataStax: http://www.datastax.com
•  Getting Started: http://www.datastax.com/documentation/gettingstarted/index.html
•  Training: http://www.datatstax.com/training
•  Downloads: http://www.datastax.com/download
•  Documentation: http://www.datastax.com/docs
•  Developer Blog: http://www.datastax.com/dev/blog
•  Community Site: http://planetcassandra.org
•  Webinars: http://planetcassandra.org/Learn/CassandraCommunityWebinars
•  Summit Talks: http://planetcassandra.org/Learn/CassandraSummit
©2014 DataStax Confidential. Do not distribute without consent. 36
Thank You
We power the big data
apps that transform business.
©2014 DataStax Confidential. Do not distribute without consent. 37

More Related Content

What's hot (20)

PPTX
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
DataStax
 
PDF
Multi-Region Cassandra Clusters
Instaclustr
 
PPTX
Hindsight is 20/20: MySQL to Cassandra
Michael Kjellman
 
PPTX
BigData Developers MeetUp
Christian Johannsen
 
PDF
Advanced Operations
DataStax Academy
 
PPTX
Webinar - VPS New Range
OVHcloud
 
PDF
Introduction to Cassandra and CQL for Java developers
Julien Anguenot
 
PDF
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
DataStax
 
PDF
Cassandra and security
Ben Bromhead
 
PDF
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
DataStax Academy
 
PDF
Cassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
DataStax Academy
 
PDF
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
DataStax
 
PDF
Mesosphere and Contentteam: A New Way to Run Cassandra
DataStax Academy
 
PPTX
Storage for VDI
Howard Marks
 
PDF
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
DataStax
 
PDF
Run Cloud Native MySQL NDB Cluster in Kubernetes
Bernd Ocklin
 
PDF
Client Drivers and Cassandra, the Right Way
DataStax Academy
 
PDF
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
DataStax
 
PDF
Advanced Cassandra
DataStax Academy
 
PDF
Building Scalable, Real Time Applications for Financial Services with DataStax
DataStax
 
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
DataStax
 
Multi-Region Cassandra Clusters
Instaclustr
 
Hindsight is 20/20: MySQL to Cassandra
Michael Kjellman
 
BigData Developers MeetUp
Christian Johannsen
 
Advanced Operations
DataStax Academy
 
Webinar - VPS New Range
OVHcloud
 
Introduction to Cassandra and CQL for Java developers
Julien Anguenot
 
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
DataStax
 
Cassandra and security
Ben Bromhead
 
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
DataStax Academy
 
Cassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
DataStax Academy
 
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
DataStax
 
Mesosphere and Contentteam: A New Way to Run Cassandra
DataStax Academy
 
Storage for VDI
Howard Marks
 
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
DataStax
 
Run Cloud Native MySQL NDB Cluster in Kubernetes
Bernd Ocklin
 
Client Drivers and Cassandra, the Right Way
DataStax Academy
 
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
DataStax
 
Advanced Cassandra
DataStax Academy
 
Building Scalable, Real Time Applications for Financial Services with DataStax
DataStax
 

Viewers also liked (20)

PPTX
How to size up an Apache Cassandra cluster (Training)
DataStax Academy
 
PDF
Understanding Data Partitioning and Replication in Apache Cassandra
DataStax
 
PPTX
An Overview of Apache Cassandra
DataStax
 
PDF
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
DataStax
 
PDF
CrowdStrike: Real World DTCS For Operators
DataStax Academy
 
PDF
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
DataStax Academy
 
PDF
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
COIICV
 
PDF
NGCC 2016 - Support large partitions
Robert Stupp
 
PDF
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
DataStax Academy
 
PDF
Cassandra at scale
Patrick McFadin
 
PDF
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
DataStax
 
PDF
Cassandra: Two data centers and great performance
DATAVERSITY
 
PPTX
3800 die-bonder overview
fastbr
 
PPTX
Cassandra Summit 2015: Real World DTCS For Operators
Jeff Jirsa
 
PDF
Securing Cassandra
Instaclustr
 
PDF
Cassandra multi-datacenter operations essentials
Julien Anguenot
 
PDF
Rethinking Topology In Cassandra (ApacheCon NA)
Eric Evans
 
PDF
Time Series Data with Apache Cassandra
Eric Evans
 
PPT
Cassandra architecture
T Jake Luciani
 
PDF
Time Series Data with Apache Cassandra
Eric Evans
 
How to size up an Apache Cassandra cluster (Training)
DataStax Academy
 
Understanding Data Partitioning and Replication in Apache Cassandra
DataStax
 
An Overview of Apache Cassandra
DataStax
 
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
DataStax
 
CrowdStrike: Real World DTCS For Operators
DataStax Academy
 
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
DataStax Academy
 
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
COIICV
 
NGCC 2016 - Support large partitions
Robert Stupp
 
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
DataStax Academy
 
Cassandra at scale
Patrick McFadin
 
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
DataStax
 
Cassandra: Two data centers and great performance
DATAVERSITY
 
3800 die-bonder overview
fastbr
 
Cassandra Summit 2015: Real World DTCS For Operators
Jeff Jirsa
 
Securing Cassandra
Instaclustr
 
Cassandra multi-datacenter operations essentials
Julien Anguenot
 
Rethinking Topology In Cassandra (ApacheCon NA)
Eric Evans
 
Time Series Data with Apache Cassandra
Eric Evans
 
Cassandra architecture
T Jake Luciani
 
Time Series Data with Apache Cassandra
Eric Evans
 
Ad

Similar to Highly available, scalable and secure data with Cassandra and DataStax Enterprise (20)

PPTX
Cassandra - A Basic Introduction Guide
Mohammed Fazuluddin
 
PDF
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Johnny Miller
 
PPTX
cybersecurity notes for mca students for learning
VitsRangannavar
 
PDF
Apache Cassandra and The Multi-Cloud by Amanda Moran
Data Con LA
 
PPTX
Using Apache Cassandra and Apache Kafka to Scale Next Gen Applications
Data Con LA
 
PDF
Apache Cassandra in the Real World
Jeremy Hanna
 
PDF
Apache Cassandra in the Real World
Jeremy Hanna
 
PPTX
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
Data Con LA
 
PPTX
Spark Streaming & Kafka-The Future of Stream Processing
Jack Gudenkauf
 
PDF
The MySQL High Availability Landscape and where Galera Cluster fits in
Sakari Keskitalo
 
PPTX
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
DataStax Academy
 
PPTX
Unit -3 _Cassandra-CRUD Operations_Practice Examples
chayapathiar1
 
PPTX
Unit -3 -Features of Cassandra, CQL Data types, CQLSH, Keyspaces
ssuser9d6aac
 
PDF
Hacking apache cloud stack
Nitin Mehta
 
PDF
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
Big Data Joe™ Rossi
 
PPTX
Real Time Data Processing Using Spark Streaming
Hari Shreedharan
 
PPTX
Devops kc
Philip Thompson
 
PPTX
Apache kafka
Kumar Shivam
 
PDF
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Helena Edelson
 
PDF
Cassandra 2.0 to 2.1
Johnny Miller
 
Cassandra - A Basic Introduction Guide
Mohammed Fazuluddin
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Johnny Miller
 
cybersecurity notes for mca students for learning
VitsRangannavar
 
Apache Cassandra and The Multi-Cloud by Amanda Moran
Data Con LA
 
Using Apache Cassandra and Apache Kafka to Scale Next Gen Applications
Data Con LA
 
Apache Cassandra in the Real World
Jeremy Hanna
 
Apache Cassandra in the Real World
Jeremy Hanna
 
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
Data Con LA
 
Spark Streaming & Kafka-The Future of Stream Processing
Jack Gudenkauf
 
The MySQL High Availability Landscape and where Galera Cluster fits in
Sakari Keskitalo
 
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
DataStax Academy
 
Unit -3 _Cassandra-CRUD Operations_Practice Examples
chayapathiar1
 
Unit -3 -Features of Cassandra, CQL Data types, CQLSH, Keyspaces
ssuser9d6aac
 
Hacking apache cloud stack
Nitin Mehta
 
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
Big Data Joe™ Rossi
 
Real Time Data Processing Using Spark Streaming
Hari Shreedharan
 
Devops kc
Philip Thompson
 
Apache kafka
Kumar Shivam
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Helena Edelson
 
Cassandra 2.0 to 2.1
Johnny Miller
 
Ad

Recently uploaded (20)

PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
DOCX
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PDF
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 

Highly available, scalable and secure data with Cassandra and DataStax Enterprise

  • 1. Highly available, scalable and secure data with Cassandra and DataStax Enterprise GOTO Berlin 27th February 2014
  • 2. DataStax •  Founded in April 2010 •  We drive Apache Cassandra™ •  400+ customers (24 of the Fortune 100) •  220+ employees •  Contribute approximately 80% of the code to Cassandra •  Home to Apache Cassandra Chair & most committers •  Headquartered in San Francisco Bay area •  European headquarters established in London Our Goal To be the first and best database choice for online applications ©2014 DataStax Confidential. Do not distribute without consent. 2
  • 3. Training •  Checkout the DataStax academy for free online virtual training! •  http://www.datastax.com/virtual-training •  Public courses •  http://www.datastax.com/course-catalog •  On-site training http://www.datastax.com/training ©2014 DataStax Confidential. Do not distribute without consent. 3
  • 4. DataStax Enterprise for start-ups •  DataStax gives qualifying start-ups access to DataStax Enterprise for free! •  For more information: •  http://www.datastax.com/startup ©2014 DataStax Confidential. Do not distribute without consent. 4
  • 5. DataStax •  DataStax supports both the open source community and enterprises. ©2014 DataStax Confidential. Do not distribute without consent. 5 Open Source/Community Enterprise Software •  Apache Cassandra (employ Cassandra chair and 80+% of the committers) •  DataStax Community Edition •  DataStax OpsCenter •  DataStax DevCenter •  DataStax Drivers/Connectors •  Online Documentation •  Online Training •  Mailing lists and forums •  DataStax Enterprise Edition •  Certified Cassandra •  Built-in Analytics •  Built-in Enterprise Search •  Enterprise Security •  DataStax OpsCenter •  Expert Support •  Consultative Help •  Professional Training
  • 6. DataStax Enterprise •  DataStax Enterprise: LOB* applications, with analytics and search for online/real-time application data. ©2014 DataStax Confidential. Do not distribute without consent. 6 LOB App NoSQL LOB App NoSQL LOB App NoSQL C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* Transactions: •  LOB Style •  Tunable consistency Analytics: •  MapReduce •  Hive •  Pig •  Mahout Search •  Solr Data Warehouse Hadoop Transactions: •  None Analytics: •  MapReduce •  Hive •  Pig •  Mahout Search •  Solr *Line of business •  Hadoop: data warehouse applications with analytics and search for data warehouse.
  • 7. Availability and Speed Matters for online apps! •  UK retailers lost 8.5 billion last year to slow web sites, which is 1 million for every 10 million in online sales •  Over half of all web users expect a response time of 2 seconds or less •  A 1 second delay causes a nearly 10% reduction in customer interactions •  A 1 second decrease in Amazon page load time costs the company $1.6 billion in sales ©2014 DataStax Confidential. Do not distribute without consent. 7
  • 8. Apache Cassandra™ •  Apache Cassandra™ is a massively scalable, open source, NoSQL, distributed database built for modern, mission-critical online applications. •  Written in Java and is a hybrid of Amazon Dynamo and Google BigTable •  Masterless with no single point of failure •  Distributed and data centre aware •  100% uptime •  Predictable scaling ©2014 DataStax Confidential. Do not distribute without consent. 8 Dynamo BigTable BigTable: http://research.google.com/archive/bigtable-osdi06.pdf Dynamo: http://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf CASSANDRA"
  • 9. Cassandra – Core Values •  Ease of use •  Massive scalability •  High performance •  Always Available ©2014 DataStax Confidential. Do not distribute without consent. 9
  • 10. Cassandra – Performance and Scale ©2014 DataStax Confidential. Do not distribute without consent. 10 “In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.” Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August 2012. Benchmark paper presented at the Very Large Database Conference, 2012. http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2012.pdf End Point Independent NoSQL Benchmark Lowest in latency… http://techblog.netflix.com/2011/11/benchmarking- cassandra-scalability-on.html http://www.datastax.com/wp-content/uploads/2013/02/WP- Benchmarking-Top-NoSQL-Databases.pdf Netflix Cloud Benchmark… Highest in throughput…
  • 11. Cassandra - Performance and Scale •  Cassandra works for small to huge deployments. ©2014 DataStax Confidential. Do not distribute without consent. 11 •  Cassandra Footprint @ Netflix •  80+ Clusters •  2500+ nodes •  4 Data Centres (Amazon Regions) •  > 1 Trillion transactions per day http://planetcassandra.org/functional-use-cases/
  • 12. Cassandra –Overview •  Cassandra was designed with the understanding that system/hardware failures can and do occur •  Peer-to-peer, distributed system •  All nodes the same •  Data partitioned among all nodes in the cluster •  Custom data replication to ensure fault tolerance •  Read/Write-anywhere and across data centres ©2014 DataStax Confidential. Do not distribute without consent. 12 Node 1 Node 2 Node 3Node 4 Node 5
  • 13. Cassandra – More Than One Server •  All nodes participate in a cluster •  Add or remove as needed •  All nodes the same – masterless with no single point of failure •  Each node communicates with each other through the Gossip protocol, which exchanges information across the cluster every second •  Data partitioned among all nodes in the cluster •  A commit log is used on each node to capture write activity. Data durability is assured •  Data also written to an in-memory structure (memtable) and then to disk once the memory structure is full (an SStable) •  More capacity? Add a server! •  More throughput? Add a server! ©2014 DataStax Confidential. Do not distribute without consent. 13 Node 1 Node 2 Node 3Node 4 Node 5
  • 14. •  Replication factor (RF): How many copies of your data? •  RF = 3 in this example •  Each node is storing 60% of the clusters total data i.e. 3/5 Handy Calculator: http://www.ecyrd.com/cassandracalculator/ Cassandra – Locally Distributed ©2014 DataStax Confidential. Do not distribute without consent. 14 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy •  Client reads or writes to any node •  Node coordinates with others •  Data read or replicated in parallel
  • 15. Cassandra – Rack Aware •  Cassandra is aware of which rack (or availability zone) each node resides in. •  It will attempt to place each data copy in a different rack. •  RF = 3 in this example ©2014 DataStax Confidential. Do not distribute without consent. 15 Node 1 1st copy Node 4 Node 2 Node 3 2nd copy Rack 1 Rack 2Rack 2 Rack 3 Rack 1 Node 5 3rd copy
  • 16. Cassandra – Data Centre Aware ©2014 DataStax Confidential. Do not distribute without consent. 16 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy DC: EUROPEDC: USA•  Active Everywhere – reads/writes in multiple data centres •  Client writes local •  Data syncs across WAN •  Replication Factor per DC •  Different number of nodes per data centre
  • 17. Cassandra – Tunable Consistency •  Consistency Level (CL) •  Client specifies per read or write •  Handles multi-data center operations •  ALL = All replicas ack •  QUORUM = > 51% of replicas ack •  LOCAL_QUORUM = > 51% in local DC ack •  ONE = Only one replica acks •  Plus more…. (see docs) •  Blog: Eventual Consistency != Hopeful Consistency http://planetcassandra.org/blog/post/a-netflix-experiment-eventual-consistency- hopeful-consistency-by-christos-kalantzis/ ©2014 DataStax Confidential. Do not distribute without consent. 17 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Parallel Write Write CL=QUORUM 5 μs ack 12 μs ack 500 μs ack 12 μs ack
  • 18. Node Failure •  A single node failure shouldn’t bring failure. •  Replication Factor + Consistency Level = Success •  This example: •  RF = 3 •  CL = QUORUM ©2014 DataStax Confidential. Do not distribute without consent. 18 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Parallel Write Write CL=QUORUM 5 μs ack 12 μs ack 12 μs ack >51% ack – so request is a success
  • 19. Node Recovery •  When a write is performed and a replica node for the row is unavailable the coordinator will store a hint locally. •  When the node recovers, the coordinator replays the missed writes. •  Note: a hinted write does not count towards the consistency level •  Note: you should still run repairs across your cluster http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/ dml_about_hh_c.html ©2014 DataStax Confidential. Do not distribute without consent. 19 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Stores Hints while Node 3 is offline
  • 20. Rack Failure •  Cassandra will place the data in as many different racks or availability zones as it can. •  This example: •  RF = 3 •  CL = QUORUM •  Rack 2 fails •  Data copies still available in Node 1 and Node 5 •  Quorum can be honored i.e. > 51% ack ©2014 DataStax Confidential. Do not distribute without consent. 20 Node 1 1st copy Node 4 Node 2 Node 3 2nd copy Rack 1 Rack 2Rack 2 Rack 3 Rack 1 Node 5 3rd copy request is a success
  • 21. Don’t be afraid of Weak Consistency •  More tolerant to failure •  Consistency Level of 1 is the most popular (I think) •  If you want stronger consistency go for LOCAL_QUORUM i.e. quorum is honored in the local data centre. •  If you go stronger than LOCAL_QUORUM – understand what this means and why you are doing it. •  Remember – you can have different consistency levels for reads and writes e.g. write with CL:1, read with CL:LOCAL_QUORUM ©2014 DataStax Confidential. Do not distribute without consent. 21
  • 22. Example Application ©2014 DataStax Confidential. Do not distribute without consent. 22 Cassandra Replication Cassandra Replication •  Active-Active-Active •  Service based DNS routing DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 23. Example Application - Uptime ©2014 DataStax Confidential. Do not distribute without consent. 23 Cassandra Replication Cassandra Replication •  Normal service maintenance •  Application is unaware DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 24. Example Application – DC Failure ©2014 DataStax Confidential. Do not distribute without consent. 24 Cassandra Replication Cassandra Replication •  Data is safe. •  Route Traffic DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 25. Tier Failure ©2014 DataStax Confidential. Do not distribute without consent. 25 Cassandra Replication •  App Tier is aware of the other DC •  Switches to access remote DC automatically DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 26. WAN Failure ©2014 DataStax Confidential. Do not distribute without consent. 26 Cassandra Replication Cassandra Replication DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier Consistency level?
  • 27. Cassandra Clients - Native Driver •  Clients that use the native driver also have access to various policies that enable the client to intelligently route requests as required. •  This includes: •  Load Balancing •  Data Centre Aware •  Latency Aware •  Token Aware •  Reconnection policies •  Retry policies •  Downgrading Consistency •  Plus others.. •  http://www.datastax.com/download/clientdrivers ©2014 DataStax Confidential. Do not distribute without consent. 27
  • 28. Quotes •  “Cassandra, our distributed cloud persistence store which is distributed across all zones and regions, dealt with the loss of one third of its regional nodes without any loss of data or availability”. http://techblog.netflix.com/2012/07/lessons-netflix-learned-from-aws-storm.html •  “During Hurricane Sandy, we lost an entire data center. Completely. Lost. It. Our application fail-over resulted in us losing just a few moments of serving requests for a particular region of the country, but our data in Cassandra never went offline.” http://planetcassandra.org/blog/post/outbrain-touches-over-80-of-all-us-online-users-with-help-from-cassandra/ ©2014 DataStax Confidential. Do not distribute without consent. 28
  • 29. Ring-fenced resources •  If you need to isolate resources for different uses, Cassandra is a great fit. •  You can create separate virtual data centres optimised as required – different workloads, hardware, availability etc.. •  Cassandra will replicate the data for you – no ETL is necessary ©2014 DataStax Confidential. Do not distribute without consent. 29 Cassandra Replication Customer Facing Analytics
  • 30. Hybrid Cloud •  DataStax Enterprise and Cassandra are multi-data centre and cloud capable •  Data written to any node is automatically and transparently written to all other nodes in multiple data centres i.e. no etl ©2014 DataStax Confidential. Do not distribute without consent. 30 Data Centre 1 Data Centre2 Public Cloud
  • 31. Security in Cassandra FEATURESBENEFITS Internal Authentication Manages login IDs and passwords inside the database +  Ensures only authorized users can access a database system using internal validation +  Simple to implement and easy to understand +  No learning curve from the relational world Object Permission Management controls who has access to what and who can do what in the database +  Provides granular based control over who can add/change/ delete/read data +  Uses familiar GRANT/REVOKE from relational systems +  No learning curve Client to Node Encryption protects data in flight to and from a database cluster +  Ensures data cannot be captured/stolen in route to a server +  Data is safe both in flight from/ to a database and on the database; complete coverage is ensured
  • 32. Advanced Security in DataStax Enterprise FEATURESBENEFITS External Authentication uses external security software packages to control security +  Only authorized users have access to a database system using external validation +  Uses most trusted external security packages (Kerberos), mainstays in government and finance +  Single sign on to all data domains Transparent Data Encryption encrypts data at rest +  Protects sensitive data at rest from theft and from being read at the file system level +  No changes needed at application level Data Auditing provides trail of who did and looked at what/when +  Supplies admins with an audit trail of all accesses and changes +  Granular control to audit only what’s needed +  Uses log4j interface to ensure performance and efficient audit operations
  • 33. Data Replication Security in Cassandra •  A popular feature from a data security perspective is the ability to control at a keyspace/schema level which data centres data should be replicated to. •  What this means is that in a multi-data centre (both physical and virtual) cluster you can ensure that data is not shipped anywhere it shouldn’t be and access to that data can be controlled. •  This is very simple to set-up and is extremely useful when you need to share some of your data, but not all of you data or if you have requirements around where your data is permitted to reside. DC 1 DC 2 Shared Data
  • 34. DataStax Enterprise 4.0, OpsCentre 4.1 •  DataStax Enterprise 4.0 •  New in-memory option. Brings all of the goodness of Cassandra to an in-memory database •  Production-certified version of Apache Cassandra (2.0) •  Enterprise search enhancements •  OpsCenter 4.1 •  Capacity planning updates •  Better insight into node performance •  More information: http://www.datastax.com/wp-content/uploads/2014/02/WP-WhatsNewDSE40.pdf http://www.datastax.com/download ©2014 DataStax Confidential. Do not distribute without consent. 34
  • 35. How does in-memory work? •  Developers can create new tables to be in-memory or alter existing tables to be in-memory •  Writes are durable •  10-100x improvement ©2014 DataStax Confidential. Do not distribute without consent. 35
  • 36. Find Out More •  DataStax: http://www.datastax.com •  Getting Started: http://www.datastax.com/documentation/gettingstarted/index.html •  Training: http://www.datatstax.com/training •  Downloads: http://www.datastax.com/download •  Documentation: http://www.datastax.com/docs •  Developer Blog: http://www.datastax.com/dev/blog •  Community Site: http://planetcassandra.org •  Webinars: http://planetcassandra.org/Learn/CassandraCommunityWebinars •  Summit Talks: http://planetcassandra.org/Learn/CassandraSummit ©2014 DataStax Confidential. Do not distribute without consent. 36
  • 37. Thank You We power the big data apps that transform business. ©2014 DataStax Confidential. Do not distribute without consent. 37