SlideShare a Scribd company logo
Elassandra
Diego Pacheco
@diego_pacheco
❏ Cat's Father
❏ Principal Software Architect
❏ Agile Coach
❏ SOA Expert
❏ DevOps Practitioner
❏ Speaker
❏ Author
diegopacheco
http://diego-pacheco.blogspot.com.br/
https://goo.gl/eEqvzl
About me...
Cassandra: Great! But there is no real search capability.
ES: Loose DATA! Can't be used as Source of Truth.
Elassandra
+
Elassandra = ES Search in Cassandra
Elassandra Concepts mapping
Elasticsearch Cassandra
Mapping Schema
Cluster Virtual DC
Index Keyspace
Type Table
Document Row
Field Column
Object Nested Field User Defined Type
Elassandra Chaos
Elassandra Cluster
Node 1 Node 2
NDbench
Elassandra Chaos - Strategy
❏ Via Cassandra API (Reads/Writes)
❏ Test 1. Tear Down node 2 -> Keep node 1 up
❏ Test 2. Tear Down node 1 -> Them put it up after 10s
❏ Via ES API (Reads/Writes)
❏ Test 1. Tear Down node 2 -> Keep node 1 up
❏ Test 2. Tear Down node 1 -> Them put it up after 10s
Elassandra Chaos - Results - Reads
Elassandra Chaos - Results - Reads
Elassandra Chaos - Results - Reads
Elassandra Chaos - Results - Writes
Elassandra Performance - versions
2.2.8
2.1.1
2.1.1-18 on CASS 2.2.8
NDBench
2.2.2
Master
(github)
ORACLE JDK 8_u45
2.4.2 on CASS 2.2.8
Elassandra Performance - ELA and ES
CLUSTER
CLUSTER
Elassandra Performance - ELA and CASS
CLUSTER
CLUSTER
NDBench
NDBench
Elassandra Performance - Results - the surprise!
Hardware Memoria CPU
m4.4xlarge 64 GB 16 vCPU
2.2.8
2.1.1
2.1.1-18 on CASS 2.2.8
Reads 28k
Writes 121k
Reads 20k
Writes 2k
Reads 24k
Writes 97k
Reads 3k
Writes 5k
Elassandra Performance - Results - Details!
2.1.1 2.1.1-18 on CASS 2.2.8
Read ALL 21k
Read Get ID 59k
Read Search ID 15k
Read ALL 3k
Read Get ID 17k
Read Search ID 3k
Elassandra Performance - Results - tuning!
2.1.1-18 on CASS 2.2.8
Read ALL OK 3k
Read ALL KO 4k
Threadpool.index.queue_size = 10k
Threadpool.index.queue_size = 1k (default)
Read ALL OK 5k
Read ALL KO 6k
Elassandra Performance - Results - 2.4.2!
2.4.2 on CASS 2.2.8
Read ALL OK 12k
Read ALL KO 13k
Elassandra ES Reads - 12k OK
Hardware Memoria CPU
m4.4xlarge 64 GB 16 vCPU
ELA ES READ 12k Search CASS JAVASTATS
ELA ES READ 12k Search CASS Nodes
ELA ES READ 12k Search ES
ELA ES READ 12k Search ES NODE
ELA ES READ 12k Search GATLING OS
Conclusions
❏ Performance has some minor degradation with CASS but is acceptable.
❏ ES performance still 50% less than original elasticsearch but you can manage to workaround.
❏ Tests was run on 4th JAN 2017 from 12:00 to 17:00 BRT
❏ CPU CASS Improvement from 1500% to 500%
❏ Up to 50% Throughput improvement from 2.1.1 to 2.4.2 (5k to 12k)
❏ OS Used was Amazon Linux (CentOS Based)
❏ All Boxes was Optimized in sense on Open Files (65k) and Gatling TCP Settings
https://gist.github.com/diegopacheco/31091f9c507b9c78128e73b48f2ce08a
Elassandra
Diego Pacheco

More Related Content

PDF
Using ZFS file system with MySQL
Mydbops
 
PDF
Scaling PHP apps
Matteo Moretti
 
PDF
Case Study: Troubleshooting Cassandra performance issues as a developer
Carlos Alonso Pérez
 
PDF
Orchestrating Cassandra with Kubernetes: Challenges and Opportunities
Raghavendra Prabhu
 
PDF
AWS Summit Milan - AWS RDS for your data (and your sleep)
Matteo Moretti
 
PDF
Redis
Diego Pacheco
 
PDF
Mixing Batch and Real-time: Cassandra with Shark (Cassandra Europe 2013)
Richard Low
 
PDF
AWS Cost Control: Cloud Custodian
OlinData
 
Using ZFS file system with MySQL
Mydbops
 
Scaling PHP apps
Matteo Moretti
 
Case Study: Troubleshooting Cassandra performance issues as a developer
Carlos Alonso Pérez
 
Orchestrating Cassandra with Kubernetes: Challenges and Opportunities
Raghavendra Prabhu
 
AWS Summit Milan - AWS RDS for your data (and your sleep)
Matteo Moretti
 
Mixing Batch and Real-time: Cassandra with Shark (Cassandra Europe 2013)
Richard Low
 
AWS Cost Control: Cloud Custodian
OlinData
 

What's hot (20)

PPTX
Scylla Summit 2018: Introducing ValuStor, A Memcached Alternative Made to Run...
ScyllaDB
 
PPTX
Scylla Summit 2018: What's New in Scylla Manager?
ScyllaDB
 
PDF
Node.js and Cassandra
Stratio
 
PPTX
Using Cassandra with your Web Application
supertom
 
PDF
Real-time Data Pipeline: Kafka Streams / Kafka Connect versus Spark Streaming
Abdelhamide EL ARIB
 
PPTX
Dmp hadoop getting_start
Gim GyungJin
 
PDF
Postgres connections at scale
Mydbops
 
ODP
MongoDB San Francisco DrupalCon 2010
Karoly Negyesi
 
PDF
Introduction to cassandra 2014
Patrick McFadin
 
PDF
RedisConf17 - Observability and the Glorious Future
Redis Labs
 
PPTX
PHP and Cassandra
Dave Gardner
 
PDF
WordPress Need For Speed
pdeschen
 
PDF
Insight QUBE concept
Insight Technology, Inc.
 
PDF
High Performance Solr and JVM Tuning Strategies used for MapQuest’s Search Ah...
Lucidworks
 
PDF
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
DataStax
 
PDF
Lessons Learned at Wakoopa
Menno van der Sman
 
PDF
Using cassandra as a distributed logging to store pb data
Ramesh Veeramani
 
PDF
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
PDF
Introduction to performance tuning perl web applications
Perrin Harkins
 
PDF
15th Athens Big Data Meetup - 1st Talk - Running Spark On Mesos
Athens Big Data
 
Scylla Summit 2018: Introducing ValuStor, A Memcached Alternative Made to Run...
ScyllaDB
 
Scylla Summit 2018: What's New in Scylla Manager?
ScyllaDB
 
Node.js and Cassandra
Stratio
 
Using Cassandra with your Web Application
supertom
 
Real-time Data Pipeline: Kafka Streams / Kafka Connect versus Spark Streaming
Abdelhamide EL ARIB
 
Dmp hadoop getting_start
Gim GyungJin
 
Postgres connections at scale
Mydbops
 
MongoDB San Francisco DrupalCon 2010
Karoly Negyesi
 
Introduction to cassandra 2014
Patrick McFadin
 
RedisConf17 - Observability and the Glorious Future
Redis Labs
 
PHP and Cassandra
Dave Gardner
 
WordPress Need For Speed
pdeschen
 
Insight QUBE concept
Insight Technology, Inc.
 
High Performance Solr and JVM Tuning Strategies used for MapQuest’s Search Ah...
Lucidworks
 
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
DataStax
 
Lessons Learned at Wakoopa
Menno van der Sman
 
Using cassandra as a distributed logging to store pb data
Ramesh Veeramani
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
Introduction to performance tuning perl web applications
Perrin Harkins
 
15th Athens Big Data Meetup - 1st Talk - Running Spark On Mesos
Athens Big Data
 
Ad

Viewers also liked (20)

PDF
Apache Cassandra - part 2
Diego Pacheco
 
PDF
Lean/Agile/DevOps 2016 part 3
Diego Pacheco
 
PDF
Elassandra: Elasticsearch as a Cassandra Secondary Index (Rémi Trouville, Vin...
DataStax
 
PDF
Lean/Agile/DevOps 2016 part 1
Diego Pacheco
 
PDF
DevOps: Plain English Business Benefits
Diego Pacheco
 
PDF
TI na ERA DEVOPS
ilegra
 
PDF
Spring framework 2.0 pt_BR
Diego Pacheco
 
PDF
Spring framework 2.5
Diego Pacheco
 
PDF
Cassandra
Diego Pacheco
 
PDF
Stream Processing with Kafka and Samza
Diego Pacheco
 
PDF
Polyglot Persistence
Diego Pacheco
 
PDF
From Design Thinking to DevOps and Back Again: Unifying Design and Operations
Jeff Sussna
 
PDF
Dev opsdaykeynote
Diego Pacheco
 
PDF
FleetDB
Diego Pacheco
 
PDF
Lean/Agile/DevOps 2016 part 2
Diego Pacheco
 
PDF
Microservices reativos usando a stack do Netflix na AWS
Diego Pacheco
 
PDF
DevOps - Estado da Arte
ilegra
 
PDF
Cloud Native, Microservices and SRE/Chaos Engineering: The new Rules of The G...
Diego Pacheco
 
PDF
Wayne State University & DataStax: World's best data modeling tool for Apache...
DataStax Academy
 
PDF
DataStax: Rigorous Cassandra Data Modeling for the Relational Data Architect
DataStax Academy
 
Apache Cassandra - part 2
Diego Pacheco
 
Lean/Agile/DevOps 2016 part 3
Diego Pacheco
 
Elassandra: Elasticsearch as a Cassandra Secondary Index (Rémi Trouville, Vin...
DataStax
 
Lean/Agile/DevOps 2016 part 1
Diego Pacheco
 
DevOps: Plain English Business Benefits
Diego Pacheco
 
TI na ERA DEVOPS
ilegra
 
Spring framework 2.0 pt_BR
Diego Pacheco
 
Spring framework 2.5
Diego Pacheco
 
Cassandra
Diego Pacheco
 
Stream Processing with Kafka and Samza
Diego Pacheco
 
Polyglot Persistence
Diego Pacheco
 
From Design Thinking to DevOps and Back Again: Unifying Design and Operations
Jeff Sussna
 
Dev opsdaykeynote
Diego Pacheco
 
FleetDB
Diego Pacheco
 
Lean/Agile/DevOps 2016 part 2
Diego Pacheco
 
Microservices reativos usando a stack do Netflix na AWS
Diego Pacheco
 
DevOps - Estado da Arte
ilegra
 
Cloud Native, Microservices and SRE/Chaos Engineering: The new Rules of The G...
Diego Pacheco
 
Wayne State University & DataStax: World's best data modeling tool for Apache...
DataStax Academy
 
DataStax: Rigorous Cassandra Data Modeling for the Relational Data Architect
DataStax Academy
 
Ad

Similar to Elassandra (17)

PPTX
Perl and Elasticsearch
Dean Hamstead
 
PPTX
Elasticsearch Arcihtecture & What's New in Version 5
Burak TUNGUT
 
PDF
Elasticsearch: An Overview
Ruby Shrestha
 
PDF
SUE 2018 - Migrating a 130TB Cluster from Elasticsearch 2 to 5 in 20 Hours Wi...
Fred de Villamil
 
PPTX
Search and analyze your data with elasticsearch
Anton Udovychenko
 
ODP
Elasticsearch for beginners
Neil Baker
 
PDF
Elasticsearch, a distributed search engine with real-time analytics
Tiziano Fagni
 
PPTX
Dev nexus 2017
Roy Russo
 
PDF
Elasticsearch quick Intro (English)
Federico Panini
 
PDF
Is your Elastic Cluster Stable and Production Ready?
DoiT International
 
PPTX
Episerver and search engines
Mikko Huilaja
 
PDF
Scalability and Real-time Queries with Elasticsearch
Ivo Andreev
 
PDF
An Introduction to Elasticsearch for Beginners
Amir Sedighi
 
PPTX
ElasticSearch Basic Introduction
Mayur Rathod
 
PDF
Cassandra Workshop - Cassandra from scratch in one day
Carlos Alonso Pérez
 
PDF
Elasticsearch
Oto Brglez
 
PDF
Elasticsearch
Shagun Rathore
 
Perl and Elasticsearch
Dean Hamstead
 
Elasticsearch Arcihtecture & What's New in Version 5
Burak TUNGUT
 
Elasticsearch: An Overview
Ruby Shrestha
 
SUE 2018 - Migrating a 130TB Cluster from Elasticsearch 2 to 5 in 20 Hours Wi...
Fred de Villamil
 
Search and analyze your data with elasticsearch
Anton Udovychenko
 
Elasticsearch for beginners
Neil Baker
 
Elasticsearch, a distributed search engine with real-time analytics
Tiziano Fagni
 
Dev nexus 2017
Roy Russo
 
Elasticsearch quick Intro (English)
Federico Panini
 
Is your Elastic Cluster Stable and Production Ready?
DoiT International
 
Episerver and search engines
Mikko Huilaja
 
Scalability and Real-time Queries with Elasticsearch
Ivo Andreev
 
An Introduction to Elasticsearch for Beginners
Amir Sedighi
 
ElasticSearch Basic Introduction
Mayur Rathod
 
Cassandra Workshop - Cassandra from scratch in one day
Carlos Alonso Pérez
 
Elasticsearch
Oto Brglez
 
Elasticsearch
Shagun Rathore
 

More from Diego Pacheco (20)

PDF
Naming Things Book : Simple Book Review!
Diego Pacheco
 
PDF
Continuous Discovery Habits Book Review.pdf
Diego Pacheco
 
PDF
Thoughts about Shape Up
Diego Pacheco
 
PDF
Holacracy
Diego Pacheco
 
PDF
AWS IAM
Diego Pacheco
 
PDF
CDKs
Diego Pacheco
 
PDF
Encryption Deep Dive
Diego Pacheco
 
PDF
Sec 101
Diego Pacheco
 
PDF
Reflections on SCM
Diego Pacheco
 
PDF
Management: Doing the non-obvious! III
Diego Pacheco
 
PDF
Design is not Subjective
Diego Pacheco
 
PDF
Architecture & Engineering : Doing the non-obvious!
Diego Pacheco
 
PDF
Management doing the non-obvious II
Diego Pacheco
 
PDF
Testing in production
Diego Pacheco
 
PDF
Nine lies about work
Diego Pacheco
 
PDF
Management: doing the nonobvious!
Diego Pacheco
 
PDF
AI and the Future
Diego Pacheco
 
PDF
Dealing with dependencies
Diego Pacheco
 
PDF
Dealing with dependencies in tests
Diego Pacheco
 
PDF
Kanban 2020
Diego Pacheco
 
Naming Things Book : Simple Book Review!
Diego Pacheco
 
Continuous Discovery Habits Book Review.pdf
Diego Pacheco
 
Thoughts about Shape Up
Diego Pacheco
 
Holacracy
Diego Pacheco
 
AWS IAM
Diego Pacheco
 
Encryption Deep Dive
Diego Pacheco
 
Sec 101
Diego Pacheco
 
Reflections on SCM
Diego Pacheco
 
Management: Doing the non-obvious! III
Diego Pacheco
 
Design is not Subjective
Diego Pacheco
 
Architecture & Engineering : Doing the non-obvious!
Diego Pacheco
 
Management doing the non-obvious II
Diego Pacheco
 
Testing in production
Diego Pacheco
 
Nine lies about work
Diego Pacheco
 
Management: doing the nonobvious!
Diego Pacheco
 
AI and the Future
Diego Pacheco
 
Dealing with dependencies
Diego Pacheco
 
Dealing with dependencies in tests
Diego Pacheco
 
Kanban 2020
Diego Pacheco
 

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Doc9.....................................
SofiaCollazos
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Software Development Methodologies in 2025
KodekX
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Doc9.....................................
SofiaCollazos
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 

Elassandra