SlideShare a Scribd company logo
How to Tell if Your Business
Needs NoSQL
Robin Schumacher
VP Products
• Founded in April 2010
• The Apache Cassandra™ company
• Home to Apache Cassandra Chair & most committers
• Cassandra is a massively scalable NoSQL database
• Provide enterprise-class big data platform based on
Cassandra
• 270+ customers
• Headquartered in San Francisco Bay area
• Funded by prominent venture firms
Overview of DataStax
Serving Every Industry
Leading in Performance
http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-
on.html
Netflix Cloud Benchmark…
“In terms of scalability, there is a clear winner throughout
our experiments. Cassandra achieves the highest
throughput for the maximum number of nodes in all
experiments with a linear increasing throughput.”
Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August
2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013.
http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2013.pdf
End Point Independent NoSQL Benchmark
Highest in throughput…
Lowest in latency…
NoSQL Momentum
“According to analysis by
Wikibon‟s David Floyer (and
highlighted in the Wall
Street Journal), the NoSQL
database market is
expected to grow at a
compound annual growth
rate of nearly 60% between
2011 and 2017. The SQL
slice of the Big Data market,
in contrast, will grow at just
a 26% CAGR during that
same time period.”
NoSQL Momentum
“NoSQL is the stuff of the Internet
Age.”
- Andrew
Oliver, InfoWorld
But Does My Business Need NoSQL…?
Just because a technology
appears to be having
strong adoption in the
market, that doesn‟t mean
it‟s right for your
business…
What is NoSQL…?
• Progressive data management
engines
• Go beyond legacy relational
databases
• Flexible data model
• Horizontal scalability
• Distributed architectures
• Use of languages and
interfaces that are “not only”
SQL
NoSQL Example – Apache Cassandra
Apache Cassandra is a massively scalable NoSQL database that
offers continuous availability and easy data distribution.
NoSQL Example – Apache Cassandra
“Cassandra stands at the front of the NoSQL pack when it
comes to supporting real-time, big data applications.”
– Wikibon
How Can I Tell if NoSQL Can Help Me Run My
Business and Reduce Costs?
©2013 DataStax Confidential. Do not distribute without consent.
NoSQL Business Considerations
• Need scale-out (vs. scale-up)?
• Manage different types of data like social media?
• Lots of data coming in (and fast)?
• Have non-RDBMS, non-ACID transactions?
• Must keep large data volumes online?
• Continuous uptime necessary?
• Wide-scale data distribution needed?
• Need to integrate different systems?
• Cost a factor?
Need Scale-Out (vs. Scale-Up)?
No
• Application does not require multiple machines
• Can scale-up and meet the application’s current and future needs
Yes
• Application demands divide-and-conquer
• Capacity expansion is best/can only be handled via new machines
Key takeaway: If your applications can easily run on one machine, fit all your
data in RAM or can easily expand via new cores/more drives to fulfill current
and future requirements, you may not need NoSQL…
NoSQL Case Study
Ooyala distributes and analyzes video content for companies like
ESPN, Rolling Stone and others. They track about one quarter of all
online video viewers each day and generate 1-2 billion events that are
streaming in real-time through their system.
Manage Different Types of Data?
No
• No non-structured data (all or mostly rigid formats)
• E.G. No social media data
Yes
• All types of data (structured, semi, and unstructured)
• Social media data
Key takeaway: If all your data systems deal with standard RDBMS structured
data and that won‟t be changing, then you may not need NoSQL…
NoSQL Case Study
HealthCare Anytime needs to analyze doctor’s notes and other types
of difficult data to properly bill back Medicare / Medicaid.
NoSQL Case Study
“Cassandra‟s NoSQL data model allows us to insert and query data much more
naturally than what we had previously. The analysts who routinely use this data were
impressed with the flexibility and speed at which the queries came back.”
– CSC/NASA
Lots of Data Coming In (and Fast)?
No
• No high velocity data (e.g. device, sensors, web streaming, etc.)
• No multiple locations
• Little/no concern about write speed
Yes
• High velocity, write intensive
• Multiple locations sending data
• Must consume data as quickly as possible
Key takeaway: Business applications involving rapid time series data, device
„exhaust‟, web or financial streaming data make good use cases for
NoSQL…
NoSQL Case Study
Gnip takes in huge volumes of social media data at high rates of
speed (e.g. 20,000 Tweets per second).
Non-RDBMS, Non-ACID transactions?
No
• Standard RDBMS, Nested, ACID transactions required
• Complex, requiring rollbacks, savepoints, etc., needed
Yes
• “Big Data” transactions OK or are necessary
• Atomic, Isolated, Durable (AID), but eventual or tunable consistency
allowed
Key takeaway: NoSQL databases do transactions, but since they don‟t
support joins or foreign keys, consistency conforms to the CAP theorem vs.
RDBMS ACID styled consistency…
NoSQL Case Study
eBay does transactions, but does not want overhead of RDBMS
ACID-type transactions.
Cassandra and Transactions
Individual or batch transactions with AID and tunable consistency.
Must Keep Large Data Volumes Online?
No
• No application requirement to keep large volumes of data
• System typically purges data older than certain time period
Yes
• Must keep large volumes of data online and available to customers
• Retain both hot and cold data
Key takeaway: Some NoSQL databases like Cassandra can excel over
typical RDBMS‟s when it comes to maintaining large volumes of data online
and meeting stringent performance SLA‟s …
NoSQL Case Study
Easou is the #1 mobile search firm in China. One of their Cassandra
applications stores online video images for retrieval / viewing and is
300TB in size.
Continuous Uptime Necessary?
No
• Applications have no need for constant uptime
• Unplanned downtime can be handled via traditional failover
Yes
• Applications cannot tolerate any downtime
• Standard log shipping, failover, hot backups, won’t do
Key takeaway: Some NoSQL databases like Cassandra are able to
guarantee no downtime because of their architectures…
NoSQL Case Study
Netflix systems are run in the cloud across multiple availability zones
with Cassandra and sport constant uptime.
NoSQL Case Study
Commenting on Amazon outage in Oct 2012: “We configure all our clusters
to use a replication factor of three, with each replica located in a different
Availability Zone. This allowed Cassandra to handle the outage remarkably
well. When a single zone became unavailable, we didn't need to do
anything. Cassandra routed requests around the unavailable zone and when
it recovered, the ring was repaired.”
- Netflix Tech Blog
Wide-Scale Data Distribution Needed?
No
• Application’s data needs are single site only
• No need to distribute data in other locales for any reason
Yes
• Application serves customers in multiple locations
• Data is distributed across multiple data centers / cloud zones for
latency/performance or disaster recover reasons
Key takeaway: Cassandra is the gold standard among NoSQL databases for
multi-data center, data distribution use cases…
NoSQL Case Study
Rightscale keeps its customers in contact with each other all over the
world via Cassandra clusters in 5+ global data centers.
Need to Integrate Different Systems?
No
• Applications use siloed databases
• No need for different data systems to interact with each other
Yes
• Application has different database workloads
• Multiple data domains serve single application
Key takeaway: ETL and simple connectors oftentimes do not do the job.
Instead, what‟s needed is something like DataStax Enterprise, which
provides one database that serves multiple database workloads…
NoSQL Case Study
Datafiniti, which is a search engine for data, needs to consume lots
of data in real time and provide fast search on top of the same data.
Cost a Factor?
No
• Application is small and not cost intensive to operate
• Software license costs not a factor
Yes
• Large scale business applications
• Traditional RDBMS software costs a significant concern
Key takeaway: NoSQL databases costs can oftentimes be 70-80% less than
legacy RDBMS software. Further large operations staff are not required to
manage NoSQL systems.
NoSQL Case Study
Constant Contact found that scaling out with NoSQL vs. an RDBMS
saved them 90% in software costs, and was implemented in 1/3 the
time...
What Strategies Can I Use To Implement
NoSQL in my Business?
©2013 DataStax Confidential. Do not distribute without consent.
NoSQL Implementation Strategies
New Hybrid Replacement
• New big data
applications
• Legacy systems keep
old databases
• NoSQL database
used for heavy lifting /
big data management
• Legacy RDBMS
maintains smaller
parts of database
• Legacy RDBMS
cannot meet
demands of new or
evolving big data
system
• Data models and data
are migrated
DataStax Enterprise – NoSQL for the Enterprise
DataStax Enterprise is a complete big data platform, built on Cassandra, that
is architected to manage real-time, analytic, and enterprise search data all
in the same database cluster.
What You Get With DataStax Enterprise
1. DataStax Enterprise
Database Server
1. OpsCenter Enterprise
Management solution
1. Expert 24x7 support
Use Cases Handled By DataStax Enterprise
Managed by Cassandra Managed by Hadoop Managed by Solr
• Time series data
• Device/Sensor/Data
“exhaust” systems
• Distributed applications
• Media streaming
• Online Web retail
(transactional, shopping
carts, etc.)
• Real-time data analytics
• Social media capture and
analysis
• Web click-stream analysis
• Write-intensive transactional
systems
• Buyer behavior analytics
• Compliance/regulatory
analysis
• Customer
recommendation output
• Fraud detection
• Risk analysis
• Sales program
campaign analysis
• Supply chain analytics
• Batch Web clickstream
analysis
• General Web search
• Web retail faceted
(categorization) search
• Search/hit prioritization
and highlighting
• Application log search and
analysis
• Document (PDF, MS
Word, etc.) search and
analysis
• Geospatial search
• Real estate location and
property search
• Social media match ups
Next Steps
Download DataStax Enterprise and try it in your own
environment.
• Go to www.datastax.com/download
• Download a copy of DataStax
Enterprise
• Installs and configures in minutes
• Completely free for development use;
subscription required for production
deployments
For More Information
Thank You
We power the big data applications
that transform business.
©2013 DataStax Confidential. Do not distribute without consent.

More Related Content

What's hot (20)

PPTX
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
DataStax
 
PPTX
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax
DataStax
 
PDF
DataStax Training – Everything you need to become a Cassandra Rockstar
DataStax
 
PPTX
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
DataStax
 
PPTX
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
DataStax
 
PPTX
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
DataStax
 
PDF
Building a Digital Bank
DataStax
 
PPTX
Transforms Document Management at Scale with Distributed Database Solution wi...
DataStax Academy
 
PPTX
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
DataStax
 
PPTX
How to Successfully Visualize DSE Graph data
DataStax
 
PPTX
Webinar: Transforming Customer Experience Through an Always-On Data Platform
DataStax
 
PDF
Top 5 Considerations for a Big Data Solution
DataStax
 
PPTX
Webinar: Become PSD2 ready with DataStax
DataStax
 
PPTX
Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help f...
DataStax
 
PPTX
Introduction: Architecting for Scale
DataStax
 
PPTX
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
DataStax
 
PDF
Building a scalable analytics environment to support diverse workloads
Alluxio, Inc.
 
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
PPTX
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
DataStax
 
PPTX
Announcing Spark Driver for Cassandra
DataStax
 
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
DataStax
 
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax
DataStax
 
DataStax Training – Everything you need to become a Cassandra Rockstar
DataStax
 
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
DataStax
 
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
DataStax
 
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
DataStax
 
Building a Digital Bank
DataStax
 
Transforms Document Management at Scale with Distributed Database Solution wi...
DataStax Academy
 
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
DataStax
 
How to Successfully Visualize DSE Graph data
DataStax
 
Webinar: Transforming Customer Experience Through an Always-On Data Platform
DataStax
 
Top 5 Considerations for a Big Data Solution
DataStax
 
Webinar: Become PSD2 ready with DataStax
DataStax
 
Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help f...
DataStax
 
Introduction: Architecting for Scale
DataStax
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
DataStax
 
Building a scalable analytics environment to support diverse workloads
Alluxio, Inc.
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
DataStax
 
Announcing Spark Driver for Cassandra
DataStax
 

Viewers also liked (18)

PDF
Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
DataStax
 
PPTX
Webinar: Eventual Consistency != Hopeful Consistency
DataStax
 
PPTX
Cassandra Community Webinar: Back to Basics with CQL3
DataStax
 
PPTX
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
DataStax
 
PDF
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
DataStax
 
PDF
Cassandra Community Webinar | In Case of Emergency Break Glass
DataStax
 
PPTX
Webinar | Introducing DataStax Enterprise 4.6
DataStax
 
PPT
Webinar: Getting Started with Apache Cassandra
DataStax
 
PDF
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
DataStax
 
PDF
Cassandra TK 2014 - Large Nodes
aaronmorton
 
PPTX
Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...
DataStax
 
PPTX
Webinar: Building Blocks for the Future of Television
DataStax
 
PPTX
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
DataStax
 
PDF
Webinar: Diagnosing Apache Cassandra Problems in Production
DataStax Academy
 
PDF
Cassandra Community Webinar | Become a Super Modeler
DataStax
 
PDF
Cassandra Community Webinar: Apache Cassandra Internals
DataStax
 
PPT
Community Webinar: 15 Commandments of Cassandra DBAs
DataStax
 
PDF
Cassandra Community Webinar | The World's Next Top Data Model
DataStax
 
Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
DataStax
 
Webinar: Eventual Consistency != Hopeful Consistency
DataStax
 
Cassandra Community Webinar: Back to Basics with CQL3
DataStax
 
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
DataStax
 
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
DataStax
 
Cassandra Community Webinar | In Case of Emergency Break Glass
DataStax
 
Webinar | Introducing DataStax Enterprise 4.6
DataStax
 
Webinar: Getting Started with Apache Cassandra
DataStax
 
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
DataStax
 
Cassandra TK 2014 - Large Nodes
aaronmorton
 
Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...
DataStax
 
Webinar: Building Blocks for the Future of Television
DataStax
 
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
DataStax
 
Webinar: Diagnosing Apache Cassandra Problems in Production
DataStax Academy
 
Cassandra Community Webinar | Become a Super Modeler
DataStax
 
Cassandra Community Webinar: Apache Cassandra Internals
DataStax
 
Community Webinar: 15 Commandments of Cassandra DBAs
DataStax
 
Cassandra Community Webinar | The World's Next Top Data Model
DataStax
 
Ad

Similar to How To Tell if Your Business Needs NoSQL (20)

PDF
Considerations for using NoSQL technology on your next IT project - Akmal Cha...
BCS Data Management Specialist Group
 
PDF
Apache Cassandra: NoSQL in the enterprise
jbellis
 
PDF
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax
 
PPT
Big data
HR Krutika Meheta
 
PDF
Big data
HR Krutika Meheta
 
PPTX
NoSQL Architecture Overview
Christopher Foot
 
PDF
Slides: Relational to NoSQL Migration
DATAVERSITY
 
PPTX
DataStax C*ollege Credit: What and Why NoSQL?
DataStax
 
PDF
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
PDF
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
PDF
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
PDF
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
PPTX
UNIT II Evaluating NoSQL for various .pptx
Rahul Borate
 
PDF
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
 
PPTX
Minnebar 2013 - Scaling with Cassandra
Jeff Bollinger
 
PPTX
NoSQLDatabases
Adi Challa
 
PPT
No sql databases explained
Salil Mehendale
 
PPTX
John Glendenning - Real time data driven services in the Cloud
WeAreEsynergy
 
PPTX
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
Storage Switzerland
 
PDF
NoSQL – Back to the Future or Yet Another DB Feature?
Martin Scholl
 
Considerations for using NoSQL technology on your next IT project - Akmal Cha...
BCS Data Management Specialist Group
 
Apache Cassandra: NoSQL in the enterprise
jbellis
 
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax
 
NoSQL Architecture Overview
Christopher Foot
 
Slides: Relational to NoSQL Migration
DATAVERSITY
 
DataStax C*ollege Credit: What and Why NoSQL?
DataStax
 
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
Considerations for using NoSQL technology on your next IT project
Akmal Chaudhri
 
UNIT II Evaluating NoSQL for various .pptx
Rahul Borate
 
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
 
Minnebar 2013 - Scaling with Cassandra
Jeff Bollinger
 
NoSQLDatabases
Adi Challa
 
No sql databases explained
Salil Mehendale
 
John Glendenning - Real time data driven services in the Cloud
WeAreEsynergy
 
Webinar: Overcoming the Storage Challenges Cassandra and Couchbase Create
Storage Switzerland
 
NoSQL – Back to the Future or Yet Another DB Feature?
Martin Scholl
 
Ad

More from DataStax (20)

PPTX
Is Your Enterprise Ready to Shine This Holiday Season?
DataStax
 
PPTX
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
DataStax
 
PPTX
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
DataStax
 
PPTX
Best Practices for Getting to Production with DataStax Enterprise Graph
DataStax
 
PPTX
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
DataStax
 
PPTX
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
DataStax
 
PDF
Webinar | Better Together: Apache Cassandra and Apache Kafka
DataStax
 
PDF
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
DataStax
 
PDF
Introduction to Apache Cassandra™ + What’s New in 4.0
DataStax
 
PPTX
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
DataStax
 
PPTX
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
DataStax
 
PDF
Designing a Distributed Cloud Database for Dummies
DataStax
 
PDF
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
DataStax
 
PDF
How to Evaluate Cloud Databases for eCommerce
DataStax
 
PPTX
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
DataStax
 
PPTX
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
DataStax
 
PPTX
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
DataStax
 
PPTX
Datastax - The Architect's guide to customer experience (CX)
DataStax
 
PPTX
An Operational Data Layer is Critical for Transformative Banking Applications
DataStax
 
PPTX
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
DataStax
 
Is Your Enterprise Ready to Shine This Holiday Season?
DataStax
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
DataStax
 
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
DataStax
 
Best Practices for Getting to Production with DataStax Enterprise Graph
DataStax
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
DataStax
 
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
DataStax
 
Webinar | Better Together: Apache Cassandra and Apache Kafka
DataStax
 
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
DataStax
 
Introduction to Apache Cassandra™ + What’s New in 4.0
DataStax
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
DataStax
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
DataStax
 
Designing a Distributed Cloud Database for Dummies
DataStax
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
DataStax
 
How to Evaluate Cloud Databases for eCommerce
DataStax
 
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
DataStax
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
DataStax
 
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
DataStax
 
Datastax - The Architect's guide to customer experience (CX)
DataStax
 
An Operational Data Layer is Critical for Transformative Banking Applications
DataStax
 
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
DataStax
 

Recently uploaded (20)

PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
Digital Circuits, important subject in CS
contactparinay1
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 

How To Tell if Your Business Needs NoSQL

  • 1. How to Tell if Your Business Needs NoSQL Robin Schumacher VP Products
  • 2. • Founded in April 2010 • The Apache Cassandra™ company • Home to Apache Cassandra Chair & most committers • Cassandra is a massively scalable NoSQL database • Provide enterprise-class big data platform based on Cassandra • 270+ customers • Headquartered in San Francisco Bay area • Funded by prominent venture firms Overview of DataStax
  • 4. Leading in Performance http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability- on.html Netflix Cloud Benchmark… “In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.” Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August 2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013. http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2013.pdf End Point Independent NoSQL Benchmark Highest in throughput… Lowest in latency…
  • 5. NoSQL Momentum “According to analysis by Wikibon‟s David Floyer (and highlighted in the Wall Street Journal), the NoSQL database market is expected to grow at a compound annual growth rate of nearly 60% between 2011 and 2017. The SQL slice of the Big Data market, in contrast, will grow at just a 26% CAGR during that same time period.”
  • 6. NoSQL Momentum “NoSQL is the stuff of the Internet Age.” - Andrew Oliver, InfoWorld
  • 7. But Does My Business Need NoSQL…? Just because a technology appears to be having strong adoption in the market, that doesn‟t mean it‟s right for your business…
  • 8. What is NoSQL…? • Progressive data management engines • Go beyond legacy relational databases • Flexible data model • Horizontal scalability • Distributed architectures • Use of languages and interfaces that are “not only” SQL
  • 9. NoSQL Example – Apache Cassandra Apache Cassandra is a massively scalable NoSQL database that offers continuous availability and easy data distribution.
  • 10. NoSQL Example – Apache Cassandra “Cassandra stands at the front of the NoSQL pack when it comes to supporting real-time, big data applications.” – Wikibon
  • 11. How Can I Tell if NoSQL Can Help Me Run My Business and Reduce Costs? ©2013 DataStax Confidential. Do not distribute without consent.
  • 12. NoSQL Business Considerations • Need scale-out (vs. scale-up)? • Manage different types of data like social media? • Lots of data coming in (and fast)? • Have non-RDBMS, non-ACID transactions? • Must keep large data volumes online? • Continuous uptime necessary? • Wide-scale data distribution needed? • Need to integrate different systems? • Cost a factor?
  • 13. Need Scale-Out (vs. Scale-Up)? No • Application does not require multiple machines • Can scale-up and meet the application’s current and future needs Yes • Application demands divide-and-conquer • Capacity expansion is best/can only be handled via new machines Key takeaway: If your applications can easily run on one machine, fit all your data in RAM or can easily expand via new cores/more drives to fulfill current and future requirements, you may not need NoSQL…
  • 14. NoSQL Case Study Ooyala distributes and analyzes video content for companies like ESPN, Rolling Stone and others. They track about one quarter of all online video viewers each day and generate 1-2 billion events that are streaming in real-time through their system.
  • 15. Manage Different Types of Data? No • No non-structured data (all or mostly rigid formats) • E.G. No social media data Yes • All types of data (structured, semi, and unstructured) • Social media data Key takeaway: If all your data systems deal with standard RDBMS structured data and that won‟t be changing, then you may not need NoSQL…
  • 16. NoSQL Case Study HealthCare Anytime needs to analyze doctor’s notes and other types of difficult data to properly bill back Medicare / Medicaid.
  • 17. NoSQL Case Study “Cassandra‟s NoSQL data model allows us to insert and query data much more naturally than what we had previously. The analysts who routinely use this data were impressed with the flexibility and speed at which the queries came back.” – CSC/NASA
  • 18. Lots of Data Coming In (and Fast)? No • No high velocity data (e.g. device, sensors, web streaming, etc.) • No multiple locations • Little/no concern about write speed Yes • High velocity, write intensive • Multiple locations sending data • Must consume data as quickly as possible Key takeaway: Business applications involving rapid time series data, device „exhaust‟, web or financial streaming data make good use cases for NoSQL…
  • 19. NoSQL Case Study Gnip takes in huge volumes of social media data at high rates of speed (e.g. 20,000 Tweets per second).
  • 20. Non-RDBMS, Non-ACID transactions? No • Standard RDBMS, Nested, ACID transactions required • Complex, requiring rollbacks, savepoints, etc., needed Yes • “Big Data” transactions OK or are necessary • Atomic, Isolated, Durable (AID), but eventual or tunable consistency allowed Key takeaway: NoSQL databases do transactions, but since they don‟t support joins or foreign keys, consistency conforms to the CAP theorem vs. RDBMS ACID styled consistency…
  • 21. NoSQL Case Study eBay does transactions, but does not want overhead of RDBMS ACID-type transactions.
  • 22. Cassandra and Transactions Individual or batch transactions with AID and tunable consistency.
  • 23. Must Keep Large Data Volumes Online? No • No application requirement to keep large volumes of data • System typically purges data older than certain time period Yes • Must keep large volumes of data online and available to customers • Retain both hot and cold data Key takeaway: Some NoSQL databases like Cassandra can excel over typical RDBMS‟s when it comes to maintaining large volumes of data online and meeting stringent performance SLA‟s …
  • 24. NoSQL Case Study Easou is the #1 mobile search firm in China. One of their Cassandra applications stores online video images for retrieval / viewing and is 300TB in size.
  • 25. Continuous Uptime Necessary? No • Applications have no need for constant uptime • Unplanned downtime can be handled via traditional failover Yes • Applications cannot tolerate any downtime • Standard log shipping, failover, hot backups, won’t do Key takeaway: Some NoSQL databases like Cassandra are able to guarantee no downtime because of their architectures…
  • 26. NoSQL Case Study Netflix systems are run in the cloud across multiple availability zones with Cassandra and sport constant uptime.
  • 27. NoSQL Case Study Commenting on Amazon outage in Oct 2012: “We configure all our clusters to use a replication factor of three, with each replica located in a different Availability Zone. This allowed Cassandra to handle the outage remarkably well. When a single zone became unavailable, we didn't need to do anything. Cassandra routed requests around the unavailable zone and when it recovered, the ring was repaired.” - Netflix Tech Blog
  • 28. Wide-Scale Data Distribution Needed? No • Application’s data needs are single site only • No need to distribute data in other locales for any reason Yes • Application serves customers in multiple locations • Data is distributed across multiple data centers / cloud zones for latency/performance or disaster recover reasons Key takeaway: Cassandra is the gold standard among NoSQL databases for multi-data center, data distribution use cases…
  • 29. NoSQL Case Study Rightscale keeps its customers in contact with each other all over the world via Cassandra clusters in 5+ global data centers.
  • 30. Need to Integrate Different Systems? No • Applications use siloed databases • No need for different data systems to interact with each other Yes • Application has different database workloads • Multiple data domains serve single application Key takeaway: ETL and simple connectors oftentimes do not do the job. Instead, what‟s needed is something like DataStax Enterprise, which provides one database that serves multiple database workloads…
  • 31. NoSQL Case Study Datafiniti, which is a search engine for data, needs to consume lots of data in real time and provide fast search on top of the same data.
  • 32. Cost a Factor? No • Application is small and not cost intensive to operate • Software license costs not a factor Yes • Large scale business applications • Traditional RDBMS software costs a significant concern Key takeaway: NoSQL databases costs can oftentimes be 70-80% less than legacy RDBMS software. Further large operations staff are not required to manage NoSQL systems.
  • 33. NoSQL Case Study Constant Contact found that scaling out with NoSQL vs. an RDBMS saved them 90% in software costs, and was implemented in 1/3 the time...
  • 34. What Strategies Can I Use To Implement NoSQL in my Business? ©2013 DataStax Confidential. Do not distribute without consent.
  • 35. NoSQL Implementation Strategies New Hybrid Replacement • New big data applications • Legacy systems keep old databases • NoSQL database used for heavy lifting / big data management • Legacy RDBMS maintains smaller parts of database • Legacy RDBMS cannot meet demands of new or evolving big data system • Data models and data are migrated
  • 36. DataStax Enterprise – NoSQL for the Enterprise DataStax Enterprise is a complete big data platform, built on Cassandra, that is architected to manage real-time, analytic, and enterprise search data all in the same database cluster.
  • 37. What You Get With DataStax Enterprise 1. DataStax Enterprise Database Server 1. OpsCenter Enterprise Management solution 1. Expert 24x7 support
  • 38. Use Cases Handled By DataStax Enterprise Managed by Cassandra Managed by Hadoop Managed by Solr • Time series data • Device/Sensor/Data “exhaust” systems • Distributed applications • Media streaming • Online Web retail (transactional, shopping carts, etc.) • Real-time data analytics • Social media capture and analysis • Web click-stream analysis • Write-intensive transactional systems • Buyer behavior analytics • Compliance/regulatory analysis • Customer recommendation output • Fraud detection • Risk analysis • Sales program campaign analysis • Supply chain analytics • Batch Web clickstream analysis • General Web search • Web retail faceted (categorization) search • Search/hit prioritization and highlighting • Application log search and analysis • Document (PDF, MS Word, etc.) search and analysis • Geospatial search • Real estate location and property search • Social media match ups
  • 39. Next Steps Download DataStax Enterprise and try it in your own environment. • Go to www.datastax.com/download • Download a copy of DataStax Enterprise • Installs and configures in minutes • Completely free for development use; subscription required for production deployments
  • 41. Thank You We power the big data applications that transform business. ©2013 DataStax Confidential. Do not distribute without consent.

Editor's Notes

  • #28: http://techblog.netflix.com/2012/10/post-mortem-of-october-222012-aws.html