DataStax Enterprise 5.1:
3X the operational analytics speed, help for
multi-tenant SaaS apps, & other shiny things
Gehrig Kunz, Product Marketing
David Gilardi, Technical Evangelist
• What is a DataStax Enterprise?
• Explore what’s new in:
• DataStax Enterprise 5.1
• Core
• Analytics
• Search
• Graph
• *Live demo*
• OpsCenter 6.1
• Studio 2.0
Hey.
Thanks for
joining us.
David
@sonicdmg
Gehrig
@gehrigkunz
(#BearDown)
Powering cloud applications
Personalization
Customer 360
Recommendation
Fraud Detection
Inventory Management
Identity Management
Security
Supply Chain
Cloud application characteristics
Real-Time DistributedAlways-OnContextual Scalable
Easy to build, effortless to scale
• DataStax Enterprise 5.1
• Core
• Analytics
• Search
• Graph
• DataStax OpsCenter 6.1
• DataStax Studio 2.0
• DSE Drivers
Easy to build, effortless to scale
• DataStax Enterprise 5.1
• Core
• Analytics
• Search
• Graph
• DataStax OpsCenter 6.1
• DataStax Studio 2.0
• DSE Drivers
Something for everyone.
Let’s dive in
DSE Core
Powered by the best distribution of Apache Cassandra™
• Multi-tenancy made easy
• Improvements to Advanced Replication
• Continuous paging (DSE Analytics)
• Production certified Cassandra 3.11.1
Apache Cassandra, Spark and Solr are trademarks of the
Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.
Best distro of Apache Cassandra™
A look back at 5.0 introductions to DSE Core:
• Introduced advanced replication
• Tiered storage
• Multi-instance
Row Level Access Control (RLAC)
• Part of DSE Advanced Security.
• Secures data in tables at the row
level.
• Handled via CQL.
• Enables multi-tenancy capabilities
on Cassandra tables.
RLAC FTW
CREATE TABLE microservices.shoppingcart_by_tx (
timestamp UUID,
cart UUID,
amount decimal,
store text,
tx_id UUID
PRIMARY_KEY((store, tx_id))
);
RESTRICT ROWS on microservices.shoppingcart_by_tx USING store;
CREATE ROLE bobcoapp;
GRANT SELECT ON ‘bobco’ IN microservices.shoppingcart_by_tx TO bobcoapp;
Advanced replication advanced-er
• Lower overhead and improved performance.
• Hub to Spoke replication; multi-directional replication between clusters now
supported.
• Support for multi-datacenter edge clusters.
Advanced replication for retail
Advanced replication for IoT
DSE Analytics
● 3x operational analytics performance improvements
● DataStax Enterprise File System (DSEFS)
• HDFS minus the bad
• Example use: PDFs and their metadata
● Separate Spark-only cluster
• Adding/removing Spark nodes is cheap
• Good for scaling up/down for specific needs
• Example use: weekly analytics reporting
● Spark UI reachable from any node
● Production certified Apache Spark™ 2.0
• Support for SparkR.
What is DSEFS?
• A new file system for DSE that is...
• HDFS compatible
• Distributed
• Fault tolerant with no single point of failure
• Scale-out
Benefits of DSEFS over HDFS Benefits of DSEFS over Cassandra
No namenode No overhead from compactions or commit log writes
No secondary namenode No data density ramifications
No zookeeper No overhead from delete operations; provides immediate
deletes
No single point of failure No JVM overhead for file system data
To the core
Analytics
Table
Scan
1
––– Prior to continuous paging –––
To the core
Analytics
Table
Scan
1
Scan
2
––– Without continuous paging –––
To the core
Analytics
Table
Scan
1
Scan
2
––– With continuous paging –––
The result – 3xResponseTime
(Seconds)
0
50
100
150
200
250
300
Test 1 Test 2 Test 3 Test 4
Read performance
(Lower is better)
OSS C*/Spark DSE 5.1
DSE Search
• Increased performance for Indexing; new data
now made available for search much faster.
• Enhanced CQL support for search index
management; makes using search easier than
ever.
• Production certified Apache Solr 6.0
CREATE SEARCH INDEX IF NOT EXISTS ON songs WITH COLUMNS id,val1;
DROP SEARCH INDEX ON songs WITH OPTIONS { deleteResources : true };
DSE Graph
The first scale-out, real-time graph database
• First on the market to enable out-of-the-box graph search.
• Improved graph analytics:
• With Spark GraphFrames, known as ‘DSEGraphFrames’
• Introducing Fluent API with 2 flavors:
• Explicit = if you’re coming from DSE drivers
• Implicit = if you’re coming from Tinkerpop world
Graph-based Search
Type ahead, fuzzy, and spell-check searches now available on graph data
Easy to build, effortless to scale
• DataStax Enterprise 5.1
• Core
• Analytics
• Search
• Graph
• DataStax OpsCenter 6.1
• DataStax Studio 2.0
• DSE Drivers
OpsCenter 6.1 improvements
Live demo! Oooooooooooo
OpsCenter 6.1 improvements
• Full support for DSE 5.1
• Updated repair section
• Improved resiliency
• Granular control to ignore keyspaces and/or tables
• Improved UI to track repair progress
• Enhanced metrics and alerts
• Graph
• Datacenter latency
• Datacenter level backup and improved support for Amazon S3
• Backup and restore times decreased
• Backup/restore SASI indexes
• SASI index definitions automatically backed up on tables backed up, index definitions are
rebuilt on restore
• Backward compatibility of agents
• OpsCenter can be upgraded ahead of agents from 6.1 moving forward
Updated repairs section!
Improved UI
Enhanced metrics for DSE Graph and
Datacenter Latency
Datacenter
level backup
and improved
support for
Amazon S3
Backward compatibility of agents
from 6.1 moving forward
Show you stuff
Studio 2.0
More demos!
Some developer love - Studio
● Studio 2.0
○ New graph visuals
■Vertex size by property value
■Color by label or property value
■Shapes & Icons
■Improved cluster segregation and reduced overlap
○ Gremlin scan steps not using indexes highlighted in profiler
○ CQL Support
■Traditionally “Devcenter” functionality
■CQL
■Schema views
■Tracing
■Consistency level settings
■Schema aware content assist “intellisense magic”
Vertex size by
property value
Color by label or
property value
Shapes & Icons
Improved cluster segregation and reduced overlap
Improved cluster segregation and reduced overlap
Show you more
stuff
• OpsCenter 6.1
• Timestamps in human readable format instead
of epoch
• Repair
• Better logging
• Added “pause” between repairs to avoid
aggressive cyclic repairs
• Lifecycle Manager
• Concurrent inter-cluster jobs
• More hardening
• Schema viewer support for SASI indexes,
MVs, UDTs, UDFs, UDAs
Honorable
mentions
(1/3)
• Graph
• API support for edit distance queries
• Type ahead
• Spell check
• Support for multiple vertices & edges
and commit as a single
operation/transaction
• Support for units in Geo.distance
• Time and Date types added
Honorable
mentions
(2/3)
• Studio 2.0
• Syntax highlighting
• Code completion
• Schema-aware completion
• Validations
• Snippets
• Dockable schema viewer
• Detailed view
• CQL execution configurations
• Support for charts
• DSE Drivers
• Unified authentication
• Ability to run queries as another user
• DateRangeField support in search
Honorable
mentions
(3/3)
April 4th!
Release notes –
http://docs.datastax.com/en/dse/5.1/dse-
dev/datastax_enterprise/RNdse.html
How to upgrade –
http://docs.datastax.com/en/dse/5.1/upgrade/upgrade/dat
astax_enterprise/upgdDSE51.html
Graph example data sets –
Things to
check out
All downloads
available April 4th
academy.datastax.com/slack
https://github.com/datastax/graph-examples
Thank you

More Related Content

PPTX
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
PPTX
Webinar: Become PSD2 ready with DataStax
PPTX
Webinar: Transforming Customer Experience Through an Always-On Data Platform
PDF
Building Killr Applications with DataStax Enterprise
PPTX
Getting Big Value from Big Data
PPTX
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
PDF
Webinar - Bringing Game Changing Insights with Graph Databases
PPTX
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Partner Webinar: Mesosphere and DSE: Production-Proven Infrastructure for Fas...
Webinar: Become PSD2 ready with DataStax
Webinar: Transforming Customer Experience Through an Always-On Data Platform
Building Killr Applications with DataStax Enterprise
Getting Big Value from Big Data
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Bringing Game Changing Insights with Graph Databases
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra

What's hot (20)

PPTX
How to Successfully Visualize DSE Graph data
PPTX
DataStax on Azure: Deploying an industry-leading data platform for cloud apps...
PPTX
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
PPTX
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
PPTX
Webinar - Bringing connected graph data to Cassandra with DSE Graph
PPT
Webinar: 2 Billion Data Points Each Day
PPTX
Building and Maintaining Bulletproof Systems with DataStax
PPTX
How much money do you lose every time your ecommerce site goes down?
PPTX
Webinar: DataStax Managed Cloud: focus on innovation, not administration
PPTX
How To Tell if Your Business Needs NoSQL
PDF
Data Modeling a Scheduling App (Adam Hutson, DataScale) | Cassandra Summit 2016
PDF
Building Killr Applications with DSE
PPTX
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax
PPTX
Bloor Research & DataStax: How graph databases solve previously unsolvable bu...
PPTX
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
PPTX
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
PDF
DataStax Training – Everything you need to become a Cassandra Rockstar
PPT
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
PPTX
Webinar: Don't Leave Your Data in the Dark
PPTX
C*ollege Credit: Keep the DB, Lose the A
How to Successfully Visualize DSE Graph data
DataStax on Azure: Deploying an industry-leading data platform for cloud apps...
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
Webinar - Bringing connected graph data to Cassandra with DSE Graph
Webinar: 2 Billion Data Points Each Day
Building and Maintaining Bulletproof Systems with DataStax
How much money do you lose every time your ecommerce site goes down?
Webinar: DataStax Managed Cloud: focus on innovation, not administration
How To Tell if Your Business Needs NoSQL
Data Modeling a Scheduling App (Adam Hutson, DataScale) | Cassandra Summit 2016
Building Killr Applications with DSE
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax
Bloor Research & DataStax: How graph databases solve previously unsolvable bu...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
DataStax Training – Everything you need to become a Cassandra Rockstar
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
Webinar: Don't Leave Your Data in the Dark
C*ollege Credit: Keep the DB, Lose the A
Ad

Viewers also liked (20)

PPTX
Give sense to your Big Data w/ Apache TinkerPop™ & property graph databases
PPTX
Webinar: Fighting Fraud with Graph Databases
PDF
Using Approximate Data for Small, Insightful Analytics (Ben Kornmeier, Protec...
PPTX
Using Spark to Load Oracle Data into Cassandra (Jim Hatcher, IHS Markit) | C*...
PDF
Troubleshooting Cassandra (J.B. Langston, DataStax) | C* Summit 2016
PPTX
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
PDF
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
PDF
Can My Inventory Survive Eventual Consistency?
PDF
Tuning Speculative Retries to Fight Latency (Michael Figuiere, Minh Do, Netfl...
PPTX
There are More Clouds! Azure and Cassandra (Carlos Rolo, Pythian) | C* Summit...
PPTX
Introduction to DataStax Enterprise Graph Database
PPTX
An Overview of Apache Cassandra
PDF
Cassandra Architecture Certificate-RameshKumar
PDF
Cassandra 101
PDF
DataStax | DataStax Tools for Developers (Alex Popescu) | Cassandra Summit 2016
PDF
C*ollege Credit: Data Modeling for Apache Cassandra
PPTX
Webinar - Security and Manageability: Key Criteria in Selecting Enterprise-Gr...
PPTX
Introducing DataStax Enterprise 4.7
PDF
Lessons from Running Large Scale Spark Workloads
PDF
Cassandra is great but how do I test my application?
Give sense to your Big Data w/ Apache TinkerPop™ & property graph databases
Webinar: Fighting Fraud with Graph Databases
Using Approximate Data for Small, Insightful Analytics (Ben Kornmeier, Protec...
Using Spark to Load Oracle Data into Cassandra (Jim Hatcher, IHS Markit) | C*...
Troubleshooting Cassandra (J.B. Langston, DataStax) | C* Summit 2016
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
Can My Inventory Survive Eventual Consistency?
Tuning Speculative Retries to Fight Latency (Michael Figuiere, Minh Do, Netfl...
There are More Clouds! Azure and Cassandra (Carlos Rolo, Pythian) | C* Summit...
Introduction to DataStax Enterprise Graph Database
An Overview of Apache Cassandra
Cassandra Architecture Certificate-RameshKumar
Cassandra 101
DataStax | DataStax Tools for Developers (Alex Popescu) | Cassandra Summit 2016
C*ollege Credit: Data Modeling for Apache Cassandra
Webinar - Security and Manageability: Key Criteria in Selecting Enterprise-Gr...
Introducing DataStax Enterprise 4.7
Lessons from Running Large Scale Spark Workloads
Cassandra is great but how do I test my application?
Ad

Similar to Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help for multi-tenant SaaS apps, & other shiny things (20)

PDF
Datastax enterprise presentation
PPTX
Azure satpn19 time series analytics with azure adx
PDF
USQL Trivadis Azure Data Lake Event
PPTX
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
PPTX
Accelerating Business Intelligence Solutions with Microsoft Azure pass
PPTX
Webinar - QuerySurge and Azure DevOps in the Azure Cloud
PPTX
CC -Unit4.pptx
PPTX
Azure Lowlands: An intro to Azure Data Lake
PPTX
Gs08 modernize your data platform with sql technologies wash dc
PPTX
2014.10.22 Building Azure Solutions with Office 365
PPTX
Time Series Analytics Azure ADX
PPTX
Postgres for Digital Transformation: NoSQL Features, Replication, FDW & More
PDF
J1 T1 3 - Azure Data Lake store & analytics 101 - Kenneth M. Nielsen
PPTX
Azure Synapse Analytics Overview (r2)
PPTX
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
PPTX
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
PDF
Exploring sql server 2016
PPTX
Azure Databricks - An Introduction 2019 Roadshow.pptx
PDF
Ibm integrated analytics system
PPTX
BI, Reporting and Analytics on Apache Cassandra
Datastax enterprise presentation
Azure satpn19 time series analytics with azure adx
USQL Trivadis Azure Data Lake Event
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Webinar - QuerySurge and Azure DevOps in the Azure Cloud
CC -Unit4.pptx
Azure Lowlands: An intro to Azure Data Lake
Gs08 modernize your data platform with sql technologies wash dc
2014.10.22 Building Azure Solutions with Office 365
Time Series Analytics Azure ADX
Postgres for Digital Transformation: NoSQL Features, Replication, FDW & More
J1 T1 3 - Azure Data Lake store & analytics 101 - Kenneth M. Nielsen
Azure Synapse Analytics Overview (r2)
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
Exploring sql server 2016
Azure Databricks - An Introduction 2019 Roadshow.pptx
Ibm integrated analytics system
BI, Reporting and Analytics on Apache Cassandra

More from DataStax (20)

PPTX
Is Your Enterprise Ready to Shine This Holiday Season?
PPTX
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
PPTX
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
PPTX
Best Practices for Getting to Production with DataStax Enterprise Graph
PPTX
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
PPTX
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
PDF
Webinar | Better Together: Apache Cassandra and Apache Kafka
PDF
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
PDF
Introduction to Apache Cassandra™ + What’s New in 4.0
PPTX
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
PPTX
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
PDF
Designing a Distributed Cloud Database for Dummies
PDF
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
PDF
How to Evaluate Cloud Databases for eCommerce
PPTX
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
PPTX
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
PPTX
Datastax - The Architect's guide to customer experience (CX)
PPTX
An Operational Data Layer is Critical for Transformative Banking Applications
PPTX
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
PPTX
Innovation Around Data and AI for Fraud Detection
Is Your Enterprise Ready to Shine This Holiday Season?
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
Best Practices for Getting to Production with DataStax Enterprise Graph
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar | Better Together: Apache Cassandra and Apache Kafka
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
Introduction to Apache Cassandra™ + What’s New in 4.0
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Designing a Distributed Cloud Database for Dummies
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Evaluate Cloud Databases for eCommerce
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Datastax - The Architect's guide to customer experience (CX)
An Operational Data Layer is Critical for Transformative Banking Applications
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Innovation Around Data and AI for Fraud Detection

Recently uploaded (20)

PDF
INTERSPEECH 2025 「Recent Advances and Future Directions in Voice Conversion」
PDF
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PDF
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
PPTX
Microsoft User Copilot Training Slide Deck
PDF
Comparative analysis of machine learning models for fake news detection in so...
PPTX
future_of_ai_comprehensive_20250822032121.pptx
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
PDF
Advancing precision in air quality forecasting through machine learning integ...
PDF
Transform-Quality-Engineering-with-AI-A-60-Day-Blueprint-for-Digital-Success.pdf
PDF
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
PDF
Rapid Prototyping: A lecture on prototyping techniques for interface design
PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PDF
Lung cancer patients survival prediction using outlier detection and optimize...
PDF
Data Virtualization in Action: Scaling APIs and Apps with FME
PDF
4 layer Arch & Reference Arch of IoT.pdf
PDF
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
PDF
sbt 2.0: go big (Scala Days 2025 edition)
INTERSPEECH 2025 「Recent Advances and Future Directions in Voice Conversion」
CXOs-Are-you-still-doing-manual-DevOps-in-the-age-of-AI.pdf
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
Transform-Your-Supply-Chain-with-AI-Driven-Quality-Engineering.pdf
Microsoft User Copilot Training Slide Deck
Comparative analysis of machine learning models for fake news detection in so...
future_of_ai_comprehensive_20250822032121.pptx
Taming the Chaos: How to Turn Unstructured Data into Decisions
sustainability-14-14877-v2.pddhzftheheeeee
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
Advancing precision in air quality forecasting through machine learning integ...
Transform-Quality-Engineering-with-AI-A-60-Day-Blueprint-for-Digital-Success.pdf
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
Rapid Prototyping: A lecture on prototyping techniques for interface design
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Lung cancer patients survival prediction using outlier detection and optimize...
Data Virtualization in Action: Scaling APIs and Apps with FME
4 layer Arch & Reference Arch of IoT.pdf
The-Future-of-Automotive-Quality-is-Here-AI-Driven-Engineering.pdf
sbt 2.0: go big (Scala Days 2025 edition)

Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help for multi-tenant SaaS apps, & other shiny things

  • 1. DataStax Enterprise 5.1: 3X the operational analytics speed, help for multi-tenant SaaS apps, & other shiny things Gehrig Kunz, Product Marketing David Gilardi, Technical Evangelist
  • 2. • What is a DataStax Enterprise? • Explore what’s new in: • DataStax Enterprise 5.1 • Core • Analytics • Search • Graph • *Live demo* • OpsCenter 6.1 • Studio 2.0 Hey. Thanks for joining us. David @sonicdmg Gehrig @gehrigkunz (#BearDown)
  • 3. Powering cloud applications Personalization Customer 360 Recommendation Fraud Detection Inventory Management Identity Management Security Supply Chain
  • 4. Cloud application characteristics Real-Time DistributedAlways-OnContextual Scalable
  • 5. Easy to build, effortless to scale • DataStax Enterprise 5.1 • Core • Analytics • Search • Graph • DataStax OpsCenter 6.1 • DataStax Studio 2.0 • DSE Drivers
  • 6. Easy to build, effortless to scale • DataStax Enterprise 5.1 • Core • Analytics • Search • Graph • DataStax OpsCenter 6.1 • DataStax Studio 2.0 • DSE Drivers
  • 9. DSE Core Powered by the best distribution of Apache Cassandra™ • Multi-tenancy made easy • Improvements to Advanced Replication • Continuous paging (DSE Analytics) • Production certified Cassandra 3.11.1 Apache Cassandra, Spark and Solr are trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.
  • 10. Best distro of Apache Cassandra™ A look back at 5.0 introductions to DSE Core: • Introduced advanced replication • Tiered storage • Multi-instance
  • 11. Row Level Access Control (RLAC) • Part of DSE Advanced Security. • Secures data in tables at the row level. • Handled via CQL. • Enables multi-tenancy capabilities on Cassandra tables.
  • 12. RLAC FTW CREATE TABLE microservices.shoppingcart_by_tx ( timestamp UUID, cart UUID, amount decimal, store text, tx_id UUID PRIMARY_KEY((store, tx_id)) ); RESTRICT ROWS on microservices.shoppingcart_by_tx USING store; CREATE ROLE bobcoapp; GRANT SELECT ON ‘bobco’ IN microservices.shoppingcart_by_tx TO bobcoapp;
  • 13. Advanced replication advanced-er • Lower overhead and improved performance. • Hub to Spoke replication; multi-directional replication between clusters now supported. • Support for multi-datacenter edge clusters.
  • 16. DSE Analytics ● 3x operational analytics performance improvements ● DataStax Enterprise File System (DSEFS) • HDFS minus the bad • Example use: PDFs and their metadata ● Separate Spark-only cluster • Adding/removing Spark nodes is cheap • Good for scaling up/down for specific needs • Example use: weekly analytics reporting ● Spark UI reachable from any node ● Production certified Apache Spark™ 2.0 • Support for SparkR.
  • 17. What is DSEFS? • A new file system for DSE that is... • HDFS compatible • Distributed • Fault tolerant with no single point of failure • Scale-out Benefits of DSEFS over HDFS Benefits of DSEFS over Cassandra No namenode No overhead from compactions or commit log writes No secondary namenode No data density ramifications No zookeeper No overhead from delete operations; provides immediate deletes No single point of failure No JVM overhead for file system data
  • 18. To the core Analytics Table Scan 1 ––– Prior to continuous paging –––
  • 19. To the core Analytics Table Scan 1 Scan 2 ––– Without continuous paging –––
  • 20. To the core Analytics Table Scan 1 Scan 2 ––– With continuous paging –––
  • 21. The result – 3xResponseTime (Seconds) 0 50 100 150 200 250 300 Test 1 Test 2 Test 3 Test 4 Read performance (Lower is better) OSS C*/Spark DSE 5.1
  • 22. DSE Search • Increased performance for Indexing; new data now made available for search much faster. • Enhanced CQL support for search index management; makes using search easier than ever. • Production certified Apache Solr 6.0 CREATE SEARCH INDEX IF NOT EXISTS ON songs WITH COLUMNS id,val1; DROP SEARCH INDEX ON songs WITH OPTIONS { deleteResources : true };
  • 23. DSE Graph The first scale-out, real-time graph database • First on the market to enable out-of-the-box graph search. • Improved graph analytics: • With Spark GraphFrames, known as ‘DSEGraphFrames’ • Introducing Fluent API with 2 flavors: • Explicit = if you’re coming from DSE drivers • Implicit = if you’re coming from Tinkerpop world
  • 24. Graph-based Search Type ahead, fuzzy, and spell-check searches now available on graph data
  • 25. Easy to build, effortless to scale • DataStax Enterprise 5.1 • Core • Analytics • Search • Graph • DataStax OpsCenter 6.1 • DataStax Studio 2.0 • DSE Drivers
  • 26. OpsCenter 6.1 improvements Live demo! Oooooooooooo
  • 27. OpsCenter 6.1 improvements • Full support for DSE 5.1 • Updated repair section • Improved resiliency • Granular control to ignore keyspaces and/or tables • Improved UI to track repair progress • Enhanced metrics and alerts • Graph • Datacenter latency • Datacenter level backup and improved support for Amazon S3 • Backup and restore times decreased • Backup/restore SASI indexes • SASI index definitions automatically backed up on tables backed up, index definitions are rebuilt on restore • Backward compatibility of agents • OpsCenter can be upgraded ahead of agents from 6.1 moving forward
  • 30. Enhanced metrics for DSE Graph and Datacenter Latency
  • 32. Backward compatibility of agents from 6.1 moving forward
  • 35. Some developer love - Studio ● Studio 2.0 ○ New graph visuals ■Vertex size by property value ■Color by label or property value ■Shapes & Icons ■Improved cluster segregation and reduced overlap ○ Gremlin scan steps not using indexes highlighted in profiler ○ CQL Support ■Traditionally “Devcenter” functionality ■CQL ■Schema views ■Tracing ■Consistency level settings ■Schema aware content assist “intellisense magic”
  • 37. Color by label or property value
  • 39. Improved cluster segregation and reduced overlap
  • 40. Improved cluster segregation and reduced overlap
  • 42. • OpsCenter 6.1 • Timestamps in human readable format instead of epoch • Repair • Better logging • Added “pause” between repairs to avoid aggressive cyclic repairs • Lifecycle Manager • Concurrent inter-cluster jobs • More hardening • Schema viewer support for SASI indexes, MVs, UDTs, UDFs, UDAs Honorable mentions (1/3)
  • 43. • Graph • API support for edit distance queries • Type ahead • Spell check • Support for multiple vertices & edges and commit as a single operation/transaction • Support for units in Geo.distance • Time and Date types added Honorable mentions (2/3)
  • 44. • Studio 2.0 • Syntax highlighting • Code completion • Schema-aware completion • Validations • Snippets • Dockable schema viewer • Detailed view • CQL execution configurations • Support for charts • DSE Drivers • Unified authentication • Ability to run queries as another user • DateRangeField support in search Honorable mentions (3/3)
  • 46. Release notes – http://docs.datastax.com/en/dse/5.1/dse- dev/datastax_enterprise/RNdse.html How to upgrade – http://docs.datastax.com/en/dse/5.1/upgrade/upgrade/dat astax_enterprise/upgdDSE51.html Graph example data sets – Things to check out All downloads available April 4th academy.datastax.com/slack https://github.com/datastax/graph-examples

Editor's Notes

  • #5: Now what we mean by cloud applications, is these few characteristics. Apps today have evolved to be contextual, personalized for us, always-on can never go down with real-time responsiveness. We’re serving up a global application that has numerous endpoints or users around the world and it needs to be scalable to support all of this. So where do we begin with that data layer?
  • #14: To enable replication, edit the dse.yaml file. And enable Change Data Capture in cassandra.yaml. DSE Advanced Replication stores all of its settings within an automatically generated CQL tables. To configure replication, load values directly to the tables, or use the dse advrep command line tool. Multi-dc edge = cross microservice hub
  • #15: Example use: Microservices analytics but the other major use case we see is replicating individual microservices into a consolidated cluster for cross-microservices view/analysis. It fits in with Customer 360 (in the case where the microservices are customer-related/focused).
  • #16: Example use: Internet of Things The benefits have nothing to do with intermittent connections, in this case.
  • #17: The "Separate Spark-only cluster" is all about options. A customer can deploy a separate cluster for Spark - good because segregated resources (lower contention), and ease of scaling up/down (like you say). I would shy away from saying this is a preferred mode or not, just another deployment mode that a number of folks have wanted.
  • #19: The point of continuous paging is that it pipelines the pages expecting that you will ask for the next page.  In "normal" paging, a single page is returned to the client application, and the server frees up all the resources.  This allows the server to not have to hold on to expensive memory, etc, and is more efficient - unless you want more pages.  When the client asks for the second page, the server has to spin up a bunch of data structures and find where it had left off from the last page, get the data, and then free those resources again.  If you are going to fetch a bunch of pages, this statelessness is expensive. But with Continuous Paging the server keeps all those data structures in memory for a while anticipating that you will ask for the next page.  It does eat up RAM, but at the benefit of more efficiently getting the next page.  There is a timeout that if you haven't fetched the next page in a while (5 seconds default), then it will free the resources. If you picture and talk track is something like that, then :thumbsup:.
  • #22: Lower is better.
  • #28: Resiliency - previous, if encountered > 100 errors repair service would shutdown, in 6.1 it continues on and warns users, no more repair service shutdown “Generally speaking, when encountering any obstacles 6.1 RS will try to pause and resume or pause and restart, but it should never just stop”
  • #30: Improved resiliency Resiliency - previous, if encountered > 100 errors repair service would shutdown, in 6.1 it continues on and warns users, no more repair service shutdown “Generally speaking, when encountering any obstacles 6.1 RS will try to pause and resume or pause and restart, but it should never just stop” Granular control to ignore keyspaces and/or tables Improved UI to track repair progress
  • #32: S3 Backup and restore speeds decreased
  • #33: OpsCenter can be upgraded ahead of agents from 6.1 moving forward
  • #34: Repair screen Graph and datacenter metrics Datacenter level backup Backward compatibility of agents
  • #36: Neat example with consistency level ONE compared to ALL for “select * from users” Tracing really illustrates the difference Easily demonstrate schema view for both graph and CQL Schema aware will constrain results to object looking at...makes you think SO MUCH LESS than before
  • #38: Community detection Label Duration property w/linear option
  • #42: Vertex size - Shapes and icons Color by label and property Graph profiler not using index CQL schema aware Tracing and replication factor Schema views