SlideShare a Scribd company logo
7 Deadly Sins for Cassandra Ops

Rachel Pedreschi

DSE Evangelist, Datastax
2© 2015. All Rights Reserved.
Lust
3© 2015. All Rights Reserved.
#1 USE THE SIMPLE SNITCH
So many snitches…
4© 2015. All Rights Reserved.
• SimpleSnitch
• RackInferringSnitch
• PropertyFileSnitch
• GossipingPropertyFileSnitch
• Ec2Snitch
• Ec2MultiRegionSnitch
• GoogleCloudSnitch
• CloudstackSnitch
Switching snitches
5© 2015. All Rights Reserved.
If the topology of the network has changed:
-Shut down all the nodes, then restart them.
-Run a sequential repair and nodetool cleanup on each node.
DOWNTIME ALERT!!!!
Greed
6© 2015. All Rights Reserved.
#3 NOT UNDERSTANDING
REPAIR
Repair options
7© 2015. All Rights Reserved.
repair (default, check your version!)
repair -pr (only repair the primary range)
repair -inc (only new data that has not previously been
repaired)
sequential repair (creates snapshots)
parallel repair uses replica not being repaired
Envy
8© 2015. All Rights Reserved.
#3 CHOOSE THE WRONG
COMPACTION STRATEGY FOR
YOUR WORKLOAD
Sized Tiered
9© 2015. All Rights Reserved.
SST1 SST1 SST2 SST1 SST2 SST3 SST1 SST2 SST3 SST4 SST5
FLUSH
FLUSH
FLUSH FLUSH
COMPACT
SST5 SST6 SST5 SST6 SST7 SST5 SST6 SST7 SST8 SST5 SST10
FLUSH
FLUSH FLUSH
SST5 SST6 SST7 SST8
FLUSH
SST9
COMPACT
Compacts a set number of SSTables into a single, larger SSTable
Leveled
10© 2015. All Rights Reserved.
Level 0
Level 1
10
Level 2
100
Level 3
1,000
Level 4
10,000
Level 5
100,000
… Level 6, 7, etc.
Date Tiered
11© 2015. All Rights Reserved. https://labs.spotify.com/2014/12/18/date-tiered-compaction/
Gluttony
12© 2015. All Rights Reserved.
#4 CHOOSING THE WRONG
HARDWARE
13© 2015. All Rights Reserved.
CPU
RAM
DISK
14© 2015. All Rights Reserved.
• 2 socket, ECC memory
• 16GiB minimum, prefer 32-64GiB, over 128GiB and Linux will need
serious tuning
• SSD where possible, Samsung 840 Pro is a good choice, any
Intel is fine
• NO SAN/NAS, 20ms latency tops
• if you MUST (and please, don’t) dedicate spindles to C* nodes, use
separate network
• Avoid disk configurations targeted at Hadoop, disks are too
slow
Sloth
15© 2015. All Rights Reserved.
#5 NOT TUNING YOUR OS
/etc/rc.local
16© 2015. All Rights Reserved.
ra=$((2**14))# 16k
ss=$(blockdev --getss /dev/sda)
blockdev --setra $(($ra / $ss)) /dev/sda
echo 128 > /sys/block/sda/queue/nr_requests
echo deadline > /sys/block/sda/queue/scheduler
echo 16384 > /sys/block/md7/md/stripe_cache_size
/etc/sysctl.conf
17© 2015. All Rights Reserved.
fs.file-max = 1048576
vm.max_map_count = 1048576
net.core.rmem_max = 16777216
net.core.wmem_max = 16777216
net.ipv4.tcp_rmem = 4096 65536 16777216
net.ipv4.tcp_wmem = 4096 65536 16777216
vm.swappiness = 1
Wrath
18© 2015. All Rights Reserved.
#6 OVER OR UNDERSIZING
YOUR JVM
G1 or CMS?
19© 2015. All Rights Reserved.
• Cassandra 8150 vs 7486
• Larger heap? Look into using G1
• Read the Docs
• Test, test and did I mention, test?
Pride
20© 2015. All Rights Reserved.
#7 NOT STRESS TESTING
cassandra-stress .yaml (>= 2.1)
21© 2015. All Rights Reserved.
1. DDL – for defining your schema
2. Column Distributions – for defining the shape and size of each
column globally and within each partition
3. Insert Distributions – for defining how the data is written during the
stress test
4. DML – for defining how the data is queried during the stress test
Recommended Sessions
22© 2015. All Rights Reserved.
DataStax
Making Cassandra Fail (for effective testing)
3:30 Thursday Ballroom H
Pythian
Manage your compactions before they manage you
4:20 Ballroom H
The Last Pickle
Steady State Data Size with Compaction, Tombstones, and TTL
4:20 Great America #2
Wednesday
Crowdstrike, Inc.
Real World DTCS For Operators
Thank you
@RachelPedreschi

More Related Content

What's hot (20)

PDF
Live traffic capture and replay in cassandra 4.0
Vinay Kumar Chella
 
PDF
Webinar: Diagnosing Apache Cassandra Problems in Production
DataStax Academy
 
PDF
Cassandra Summit 2014: Active-Active Cassandra Behind the Scenes
DataStax Academy
 
PPTX
How to size up an Apache Cassandra cluster (Training)
DataStax Academy
 
PDF
Terror & Hysteria: Cost Effective Scaling of Time Series Data with Cassandra ...
DataStax
 
PDF
Cassandra serving netflix @ scale
Vinay Kumar Chella
 
PDF
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
PDF
Cassandra Exports as a Trivially Parallelizable Problem (Emilio Del Tessandor...
DataStax
 
PPTX
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
DataStax
 
PDF
Tsinghua University: Two Exemplary Applications in China
DataStax Academy
 
PDF
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
DataStax
 
PPTX
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
DataStax
 
PDF
Mesosphere and Contentteam: A New Way to Run Cassandra
DataStax Academy
 
PDF
Scylla Summit 2016: Analytics Show Time - Spark and Presto Powered by Scylla
ScyllaDB
 
PDF
A glimpse of cassandra 4.0 features netflix
Vinay Kumar Chella
 
PPTX
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
DataStax
 
PPTX
Maintaining Consistency Across Data Centers (Randy Fradin, BlackRock) | Cassa...
DataStax
 
PDF
Safer restarts, faster streaming, and better repair, just a glimpse of cassan...
Vinay Kumar Chella
 
PDF
Managing Cassandra at Scale by Al Tobey
DataStax Academy
 
PDF
Distribute Key Value Store
Santal Li
 
Live traffic capture and replay in cassandra 4.0
Vinay Kumar Chella
 
Webinar: Diagnosing Apache Cassandra Problems in Production
DataStax Academy
 
Cassandra Summit 2014: Active-Active Cassandra Behind the Scenes
DataStax Academy
 
How to size up an Apache Cassandra cluster (Training)
DataStax Academy
 
Terror & Hysteria: Cost Effective Scaling of Time Series Data with Cassandra ...
DataStax
 
Cassandra serving netflix @ scale
Vinay Kumar Chella
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
Cassandra Exports as a Trivially Parallelizable Problem (Emilio Del Tessandor...
DataStax
 
Building a Multi-Region Cluster at Target (Aaron Ploetz, Target) | Cassandra ...
DataStax
 
Tsinghua University: Two Exemplary Applications in China
DataStax Academy
 
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
DataStax
 
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
DataStax
 
Mesosphere and Contentteam: A New Way to Run Cassandra
DataStax Academy
 
Scylla Summit 2016: Analytics Show Time - Spark and Presto Powered by Scylla
ScyllaDB
 
A glimpse of cassandra 4.0 features netflix
Vinay Kumar Chella
 
Lessons Learned on Java Tuning for Our Cassandra Clusters (Carlos Monroy, Kne...
DataStax
 
Maintaining Consistency Across Data Centers (Randy Fradin, BlackRock) | Cassa...
DataStax
 
Safer restarts, faster streaming, and better repair, just a glimpse of cassan...
Vinay Kumar Chella
 
Managing Cassandra at Scale by Al Tobey
DataStax Academy
 
Distribute Key Value Store
Santal Li
 

Viewers also liked (20)

PDF
Cassandra Community Webinar | Introduction to Apache Cassandra 1.2
DataStax
 
PPTX
Despliegue de Cassandra en la nube de Amazon
Jose Felix Hernandez Barrio
 
PDF
Cassandra Day Atlanta 2015: Diagnosing Problems in Production
DataStax Academy
 
PDF
Feeding Cassandra with Spark-Streaming and Kafka
DataStax Academy
 
PDF
Silicon Valley Data Science: From Oracle to Cassandra with Spark
DataStax Academy
 
PPTX
Cassandra internals
narsiman
 
PPTX
3800 die-bonder overview
fastbr
 
PDF
Coursera's Adoption of Cassandra
DataStax Academy
 
PDF
Production Ready Cassandra (Beginner)
DataStax Academy
 
PDF
New features in 3.0
DataStax Academy
 
PDF
Introduction to .Net Driver
DataStax Academy
 
PPTX
Spark Cassandra Connector: Past, Present and Furure
DataStax Academy
 
PDF
Playlists at Spotify
DataStax Academy
 
PPTX
Lessons Learned with Cassandra and Spark at the US Patent and Trademark Office
DataStax Academy
 
PPTX
Using Event-Driven Architectures with Cassandra
DataStax Academy
 
PDF
Cassandra: One (is the loneliest number)
DataStax Academy
 
PDF
Getting Started with Graph Databases
DataStax Academy
 
PDF
Successful Software Development with Apache Cassandra
DataStax Academy
 
PDF
Traveler's Guide to Cassandra
DataStax Academy
 
PDF
Analytics with Spark and Cassandra
DataStax Academy
 
Cassandra Community Webinar | Introduction to Apache Cassandra 1.2
DataStax
 
Despliegue de Cassandra en la nube de Amazon
Jose Felix Hernandez Barrio
 
Cassandra Day Atlanta 2015: Diagnosing Problems in Production
DataStax Academy
 
Feeding Cassandra with Spark-Streaming and Kafka
DataStax Academy
 
Silicon Valley Data Science: From Oracle to Cassandra with Spark
DataStax Academy
 
Cassandra internals
narsiman
 
3800 die-bonder overview
fastbr
 
Coursera's Adoption of Cassandra
DataStax Academy
 
Production Ready Cassandra (Beginner)
DataStax Academy
 
New features in 3.0
DataStax Academy
 
Introduction to .Net Driver
DataStax Academy
 
Spark Cassandra Connector: Past, Present and Furure
DataStax Academy
 
Playlists at Spotify
DataStax Academy
 
Lessons Learned with Cassandra and Spark at the US Patent and Trademark Office
DataStax Academy
 
Using Event-Driven Architectures with Cassandra
DataStax Academy
 
Cassandra: One (is the loneliest number)
DataStax Academy
 
Getting Started with Graph Databases
DataStax Academy
 
Successful Software Development with Apache Cassandra
DataStax Academy
 
Traveler's Guide to Cassandra
DataStax Academy
 
Analytics with Spark and Cassandra
DataStax Academy
 
Ad

Similar to Beginning Operations: 7 Deadly Sins for Apache Cassandra Ops (20)

PPTX
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
DataStax
 
PDF
Diagnosing Problems in Production (Nov 2015)
Jon Haddad
 
PDF
Cassandra Day Chicago 2015: Diagnosing Problems in Production
DataStax Academy
 
PDF
Cassandra Day London 2015: Diagnosing Problems in Production
DataStax Academy
 
PDF
Cassandra from tarball to production
Ron Kuris
 
PDF
Diagnosing Problems in Production - Cassandra
Jon Haddad
 
PDF
Joel Jacobson (Datastax) - Diagnosing Cassandra Problems in Production
Outlyer
 
PPTX
Cassandra Tuning - above and beyond
Matija Gobec
 
PPTX
Cassandra Tuning - Above and Beyond (Matija Gobec, SmartCat) | Cassandra Summ...
DataStax
 
PDF
Standing Up Your First Cluster
DataStax Academy
 
PDF
DataStax: Extreme Cassandra Optimization: The Sequel
DataStax Academy
 
PPTX
Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016
DataStax
 
PPTX
Everyday I’m scaling... Cassandra
Instaclustr
 
PDF
Cassandra CLuster Management by Japan Cassandra Community
Hiromitsu Komatsu
 
PPTX
Cassandra in Operation
niallmilton
 
PDF
1 Million Writes per second on 60 nodes with Cassandra and EBS
Jim Plush
 
PDF
Instaclustr introduction to managing cassandra
Instaclustr
 
PDF
GumGum: Multi-Region Cassandra in AWS
DataStax Academy
 
PDF
Building Apache Cassandra clusters for massive scale
Alex Thompson
 
Lessons Learned From Running 1800 Clusters (Brooke Jensen, Instaclustr) | Cas...
DataStax
 
Diagnosing Problems in Production (Nov 2015)
Jon Haddad
 
Cassandra Day Chicago 2015: Diagnosing Problems in Production
DataStax Academy
 
Cassandra Day London 2015: Diagnosing Problems in Production
DataStax Academy
 
Cassandra from tarball to production
Ron Kuris
 
Diagnosing Problems in Production - Cassandra
Jon Haddad
 
Joel Jacobson (Datastax) - Diagnosing Cassandra Problems in Production
Outlyer
 
Cassandra Tuning - above and beyond
Matija Gobec
 
Cassandra Tuning - Above and Beyond (Matija Gobec, SmartCat) | Cassandra Summ...
DataStax
 
Standing Up Your First Cluster
DataStax Academy
 
DataStax: Extreme Cassandra Optimization: The Sequel
DataStax Academy
 
Everyday I'm Scaling... Cassandra (Ben Bromhead, Instaclustr) | C* Summit 2016
DataStax
 
Everyday I’m scaling... Cassandra
Instaclustr
 
Cassandra CLuster Management by Japan Cassandra Community
Hiromitsu Komatsu
 
Cassandra in Operation
niallmilton
 
1 Million Writes per second on 60 nodes with Cassandra and EBS
Jim Plush
 
Instaclustr introduction to managing cassandra
Instaclustr
 
GumGum: Multi-Region Cassandra in AWS
DataStax Academy
 
Building Apache Cassandra clusters for massive scale
Alex Thompson
 
Ad

More from DataStax Academy (20)

PDF
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
PPTX
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
PPTX
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
PDF
Cassandra 3.0 Data Modeling
DataStax Academy
 
PPTX
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
PDF
Data Modeling for Apache Cassandra
DataStax Academy
 
PDF
Coursera Cassandra Driver
DataStax Academy
 
PDF
Production Ready Cassandra
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
PDF
Real Time Analytics with Dse
DataStax Academy
 
PDF
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Cassandra Core Concepts
DataStax Academy
 
PPTX
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
PPTX
Bad Habits Die Hard
DataStax Academy
 
PDF
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
PDF
Advanced Cassandra
DataStax Academy
 
PDF
Apache Cassandra and Drivers
DataStax Academy
 
PDF
Cassandra Data Maintenance with Spark
DataStax Academy
 
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
DataStax Academy
 
Introduction to DataStax Enterprise Graph Database
DataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
DataStax Academy
 
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
Cassandra 3.0 Data Modeling
DataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
Data Modeling for Apache Cassandra
DataStax Academy
 
Coursera Cassandra Driver
DataStax Academy
 
Production Ready Cassandra
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
 
Real Time Analytics with Dse
DataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
Cassandra Core Concepts
DataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
Bad Habits Die Hard
DataStax Academy
 
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 
Advanced Cassandra
DataStax Academy
 
Apache Cassandra and Drivers
DataStax Academy
 
Cassandra Data Maintenance with Spark
DataStax Academy
 

Recently uploaded (20)

PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PPTX
Agentforce World Tour Toronto '25 - Supercharge MuleSoft Development with Mod...
Alexandra N. Martinez
 
PDF
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
Agentforce World Tour Toronto '25 - Supercharge MuleSoft Development with Mod...
Alexandra N. Martinez
 
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 

Beginning Operations: 7 Deadly Sins for Apache Cassandra Ops

  • 1. 7 Deadly Sins for Cassandra Ops
 Rachel Pedreschi
 DSE Evangelist, Datastax
  • 2. 2© 2015. All Rights Reserved.
  • 3. Lust 3© 2015. All Rights Reserved. #1 USE THE SIMPLE SNITCH
  • 4. So many snitches… 4© 2015. All Rights Reserved. • SimpleSnitch • RackInferringSnitch • PropertyFileSnitch • GossipingPropertyFileSnitch • Ec2Snitch • Ec2MultiRegionSnitch • GoogleCloudSnitch • CloudstackSnitch
  • 5. Switching snitches 5© 2015. All Rights Reserved. If the topology of the network has changed: -Shut down all the nodes, then restart them. -Run a sequential repair and nodetool cleanup on each node. DOWNTIME ALERT!!!!
  • 6. Greed 6© 2015. All Rights Reserved. #3 NOT UNDERSTANDING REPAIR
  • 7. Repair options 7© 2015. All Rights Reserved. repair (default, check your version!) repair -pr (only repair the primary range) repair -inc (only new data that has not previously been repaired) sequential repair (creates snapshots) parallel repair uses replica not being repaired
  • 8. Envy 8© 2015. All Rights Reserved. #3 CHOOSE THE WRONG COMPACTION STRATEGY FOR YOUR WORKLOAD
  • 9. Sized Tiered 9© 2015. All Rights Reserved. SST1 SST1 SST2 SST1 SST2 SST3 SST1 SST2 SST3 SST4 SST5 FLUSH FLUSH FLUSH FLUSH COMPACT SST5 SST6 SST5 SST6 SST7 SST5 SST6 SST7 SST8 SST5 SST10 FLUSH FLUSH FLUSH SST5 SST6 SST7 SST8 FLUSH SST9 COMPACT Compacts a set number of SSTables into a single, larger SSTable
  • 10. Leveled 10© 2015. All Rights Reserved. Level 0 Level 1 10 Level 2 100 Level 3 1,000 Level 4 10,000 Level 5 100,000 … Level 6, 7, etc.
  • 11. Date Tiered 11© 2015. All Rights Reserved. https://labs.spotify.com/2014/12/18/date-tiered-compaction/
  • 12. Gluttony 12© 2015. All Rights Reserved. #4 CHOOSING THE WRONG HARDWARE
  • 13. 13© 2015. All Rights Reserved. CPU RAM DISK
  • 14. 14© 2015. All Rights Reserved. • 2 socket, ECC memory • 16GiB minimum, prefer 32-64GiB, over 128GiB and Linux will need serious tuning • SSD where possible, Samsung 840 Pro is a good choice, any Intel is fine • NO SAN/NAS, 20ms latency tops • if you MUST (and please, don’t) dedicate spindles to C* nodes, use separate network • Avoid disk configurations targeted at Hadoop, disks are too slow
  • 15. Sloth 15© 2015. All Rights Reserved. #5 NOT TUNING YOUR OS
  • 16. /etc/rc.local 16© 2015. All Rights Reserved. ra=$((2**14))# 16k ss=$(blockdev --getss /dev/sda) blockdev --setra $(($ra / $ss)) /dev/sda echo 128 > /sys/block/sda/queue/nr_requests echo deadline > /sys/block/sda/queue/scheduler echo 16384 > /sys/block/md7/md/stripe_cache_size
  • 17. /etc/sysctl.conf 17© 2015. All Rights Reserved. fs.file-max = 1048576 vm.max_map_count = 1048576 net.core.rmem_max = 16777216 net.core.wmem_max = 16777216 net.ipv4.tcp_rmem = 4096 65536 16777216 net.ipv4.tcp_wmem = 4096 65536 16777216 vm.swappiness = 1
  • 18. Wrath 18© 2015. All Rights Reserved. #6 OVER OR UNDERSIZING YOUR JVM
  • 19. G1 or CMS? 19© 2015. All Rights Reserved. • Cassandra 8150 vs 7486 • Larger heap? Look into using G1 • Read the Docs • Test, test and did I mention, test?
  • 20. Pride 20© 2015. All Rights Reserved. #7 NOT STRESS TESTING
  • 21. cassandra-stress .yaml (>= 2.1) 21© 2015. All Rights Reserved. 1. DDL – for defining your schema 2. Column Distributions – for defining the shape and size of each column globally and within each partition 3. Insert Distributions – for defining how the data is written during the stress test 4. DML – for defining how the data is queried during the stress test
  • 22. Recommended Sessions 22© 2015. All Rights Reserved. DataStax Making Cassandra Fail (for effective testing) 3:30 Thursday Ballroom H Pythian Manage your compactions before they manage you 4:20 Ballroom H The Last Pickle Steady State Data Size with Compaction, Tombstones, and TTL 4:20 Great America #2 Wednesday Crowdstrike, Inc. Real World DTCS For Operators