SlideShare a Scribd company logo
Supercharge Your RDBMS with
Elasticsearch
Arthur Gimpel, Director of DataZone
Name: Arthur Gimpel
Position: Technology Evangelist, Solutions Architect,
Trainer
Tech Stack: Elastic Stack, SQL Server, MongoDB,
Couchbase, Redis, Kafka, StreamSets,
Python, .NET…
Free Time: Motorcycles, Skydiving…
Click to edit Master
title styleAbout Me
• First RDBMS was introduced in late 1970s
• Exist in all possible flavors but share one thing - ACID
• Still dominate the database market
Click to edit Master
title styleRelational Database Management Systems
• Atomicity: All or nothing approach, transactions
• Consistency: Hard state, every transaction changes the whole DBMS
• Isolation: Transactions cannot interfere with each other
• Durability: Every transaction is persisted
Click to edit Master
title styleRDBMS in Theory - ACID
• Everything is persisted, synchronously. Limited by IO
performance
• All data is bound to a tabular schema, hard to make changes in
big databases
• ACID makes horizontal scaling nearly* impossible
• Complex schema slows down aggregations and queries
drastically
Click to edit Master
title styleACID Is Not Perfect
• Distributed / Horizontal Scalability
• Mostly Open Source
• Mostly schema less:
• Key - Value
• Document
• Graph
• Serves specific purposes
Click to edit Master
title styleNoSQL - New Kid in Town
• Every data store has its purpose. There is no single solution to
all database needs
• NoSQL does not implement all of RDBMS’s abilities (CDC,
Jobs, Stored Procedures, Triggers)
• Every data store has its own languages, and APIs. There is no
ANSI SQL
Click to edit Master
title styleNoSQL - Challenges
Click to edit Master
title style
NoSQL = Not Only SQL | Polyglot Persistence
• Search platform, data store based on Apache Lucene
• Supports various search types: Filtered, Full-text, Geography,
Aggregation (Facet, Nested, Pipeline), Graph
• Distributed - every index is split to shards relying on (potentially) a node
• Document store - JSON
• “Optimistic” Schema-less architecture
• Supports Replication by nature
• Supports Unsupervised Machine Learning by nature (Prelert, in beta)
Click to edit Master
title style
Click to edit Master
title styleSearch != SQL Querying
Click to edit Master
title styleReference Architecture #1
Click to edit Master
title styleReference Architecture #2
Click to edit Master
title styleArchitecture Comparison
Architecture #1 Architecture #2
Data distribution strategy Data store based Application based
Data distribution component Data Pipeline ( StreamSets ) Message Queue ( Kafka )
Implementation Team Data Engineers / DevOps DevOps / Developers
Implementation Complexity Low: Data pipeline development High: data access layer refactor
Potential additional licensing Elasticsearch, StreamSets None
Scalability Limited to RDBMS Scale
Fully scalable regardless of
RDBMS
Thank You!

More Related Content

PDF
Optimize MySQL performance for developers
jackbillow
 
PDF
Oracle Week 2016 - Modern Data Architecture
Arthur Gimpel
 
PPTX
Distributed SQL databases
Anurag S
 
PDF
REDIS (Remote Dictionary Server)
Ameya Vijay Gokhale
 
PPTX
Cosmosdb graph
Mohit Chhabra
 
KEY
SortaSQL
Cloudflare
 
PPTX
Meta cloud architecture for the mobile agile enterprise
Evarist Lobo
 
PPTX
Relational Database and mysql insight
mentallog
 
Optimize MySQL performance for developers
jackbillow
 
Oracle Week 2016 - Modern Data Architecture
Arthur Gimpel
 
Distributed SQL databases
Anurag S
 
REDIS (Remote Dictionary Server)
Ameya Vijay Gokhale
 
Cosmosdb graph
Mohit Chhabra
 
SortaSQL
Cloudflare
 
Meta cloud architecture for the mobile agile enterprise
Evarist Lobo
 
Relational Database and mysql insight
mentallog
 

What's hot (20)

ODP
Nosql
mentallog
 
PDF
Nosql primer
Seshu Kumar Loka
 
PDF
Application Development with Apache Cassandra as a Service
WSO2
 
PPTX
NoSQL - Not Only SQL
EasyData
 
PDF
Transitioning From SQL Server to MySQL - Presentation from Percona Live 2016
Dylan Butler
 
PPTX
AWS Database Services
Mackenzie LeJeune
 
PPTX
SQL Azure for ISUG(SQL Server Israeli User Group)
Pini Krisher
 
ZIP
Ext JS Meetup Presentation
Patrick Sheridan
 
PPT
JCR Content Management
elliando dias
 
PPTX
No SQL - Intro
Boris Bucha
 
PPTX
Deven s presentation
dshastri001
 
PDF
Java scalability considerations yogesh deshpande
IndicThreads
 
PPT
Architecture Of Large Scale Websites
Feng Yu
 
PPTX
Spider进化论
cjhacker
 
PPTX
Redux: server side rendering and hot code reload for single-page applications
Alex Bumbu
 
PPTX
iMobileMagic Teck Talk Scale Up
Pedro Machado
 
PDF
MySQL At University Of Nottingham - 2018 MySQL Days
Mark Swarbrick
 
PPTX
Bootstrap SaaS startup using Open Source Tools
botsplash.com
 
PPTX
Moving to the Cloud: AWS, Zend, RightScale
mmoline
 
PDF
Getting started in the cloud for developers
MariaDB plc
 
Nosql
mentallog
 
Nosql primer
Seshu Kumar Loka
 
Application Development with Apache Cassandra as a Service
WSO2
 
NoSQL - Not Only SQL
EasyData
 
Transitioning From SQL Server to MySQL - Presentation from Percona Live 2016
Dylan Butler
 
AWS Database Services
Mackenzie LeJeune
 
SQL Azure for ISUG(SQL Server Israeli User Group)
Pini Krisher
 
Ext JS Meetup Presentation
Patrick Sheridan
 
JCR Content Management
elliando dias
 
No SQL - Intro
Boris Bucha
 
Deven s presentation
dshastri001
 
Java scalability considerations yogesh deshpande
IndicThreads
 
Architecture Of Large Scale Websites
Feng Yu
 
Spider进化论
cjhacker
 
Redux: server side rendering and hot code reload for single-page applications
Alex Bumbu
 
iMobileMagic Teck Talk Scale Up
Pedro Machado
 
MySQL At University Of Nottingham - 2018 MySQL Days
Mark Swarbrick
 
Bootstrap SaaS startup using Open Source Tools
botsplash.com
 
Moving to the Cloud: AWS, Zend, RightScale
mmoline
 
Getting started in the cloud for developers
MariaDB plc
 
Ad

Viewers also liked (20)

PDF
Spark Tuning for Enterprise System Administrators
Alpine Data
 
PDF
Bending Spark towards enterprise needs
b0ris_1
 
PPTX
Developing and deploying big data machine learning models
Narayana Swamy
 
PDF
ElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
MapR Technologies
 
PDF
Elasticsearch for SQL Users
All Things Open
 
PPTX
Building an ETL pipeline for Elasticsearch using Spark
Itai Yaffe
 
PDF
Kappa Architecture, IoT of the cars - LibreCon 2016
LibreCon
 
PDF
Spark Summit EU talk by Debasish Das and Pramod Narasimha
Spark Summit
 
PDF
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Hortonworks
 
PDF
Step one
oviedojesse
 
PDF
Classificação petizes
caxana
 
DOC
Producto 2 sesion uno
marialuisa_10
 
PDF
B2T BeYourself
Joaquin Mateos
 
PDF
de Inrichting
de Inrichting
 
PPT
Prueba internet
Berthitalinda
 
PPT
Aula - Teoria Cognitiva
paula
 
PPTX
áLbum De FotografíAs
Liinitha Velandhia
 
PPTX
Presentation1
محمد مرسي مرسي
 
PDF
Programa derecho administrativo iii UCR
Adriana Zamora López
 
PPTX
Aspiraciones profecionales
jessi4235656
 
Spark Tuning for Enterprise System Administrators
Alpine Data
 
Bending Spark towards enterprise needs
b0ris_1
 
Developing and deploying big data machine learning models
Narayana Swamy
 
ElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
MapR Technologies
 
Elasticsearch for SQL Users
All Things Open
 
Building an ETL pipeline for Elasticsearch using Spark
Itai Yaffe
 
Kappa Architecture, IoT of the cars - LibreCon 2016
LibreCon
 
Spark Summit EU talk by Debasish Das and Pramod Narasimha
Spark Summit
 
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Hortonworks
 
Step one
oviedojesse
 
Classificação petizes
caxana
 
Producto 2 sesion uno
marialuisa_10
 
B2T BeYourself
Joaquin Mateos
 
de Inrichting
de Inrichting
 
Prueba internet
Berthitalinda
 
Aula - Teoria Cognitiva
paula
 
áLbum De FotografíAs
Liinitha Velandhia
 
Programa derecho administrativo iii UCR
Adriana Zamora López
 
Aspiraciones profecionales
jessi4235656
 
Ad

Similar to Supercharge your RDBMS with Elasticsearch (20)

PDF
Solr cloud the 'search first' nosql database extended deep dive
lucenerevolution
 
PPTX
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
PDF
NoSQL Engines Meetup Opening Slides
Arthur Gimpel
 
PPTX
BigData, NoSQL & ElasticSearch
Sanura Hettiarachchi
 
PPTX
NoSQL
kirandanduprolu
 
PPTX
NoSql - mayank singh
Mayank Singh
 
PPTX
cours database pour etudiant NoSQL (1).pptx
ssuser1fde9c
 
PPTX
Sql vs NoSQL
RTigger
 
PDF
NOsql Presentation.pdf
AkshayDwivedi31
 
PPTX
NoSQL.pptx
RithikRaj25
 
PPTX
No sql database
vishal gupta
 
PPTX
NoSQL
Radu Vunvulea
 
PPTX
Comparative study of modern databases
Anirban Konar
 
PDF
Database Systems - A Historical Perspective
Karoly K
 
PDF
Deep dive to ElasticSearch - معرفی ابزار جستجوی الاستیکی
Ehsan Asgarian
 
PPTX
Relational databases vs Non-relational databases
James Serra
 
PDF
Why nosql also_why_somany
Prashanth Panduranga
 
PDF
If NoSQL is your answer, you are probably asking the wrong question.
Lukas Smith
 
PDF
the rising no sql technology
INFOGAIN PUBLICATION
 
PDF
NOSQL Overview
Tobias Lindaaker
 
Solr cloud the 'search first' nosql database extended deep dive
lucenerevolution
 
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
NoSQL Engines Meetup Opening Slides
Arthur Gimpel
 
BigData, NoSQL & ElasticSearch
Sanura Hettiarachchi
 
NoSql - mayank singh
Mayank Singh
 
cours database pour etudiant NoSQL (1).pptx
ssuser1fde9c
 
Sql vs NoSQL
RTigger
 
NOsql Presentation.pdf
AkshayDwivedi31
 
NoSQL.pptx
RithikRaj25
 
No sql database
vishal gupta
 
Comparative study of modern databases
Anirban Konar
 
Database Systems - A Historical Perspective
Karoly K
 
Deep dive to ElasticSearch - معرفی ابزار جستجوی الاستیکی
Ehsan Asgarian
 
Relational databases vs Non-relational databases
James Serra
 
Why nosql also_why_somany
Prashanth Panduranga
 
If NoSQL is your answer, you are probably asking the wrong question.
Lukas Smith
 
the rising no sql technology
INFOGAIN PUBLICATION
 
NOSQL Overview
Tobias Lindaaker
 

Recently uploaded (20)

PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
Software Development Methodologies in 2025
KodekX
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Doc9.....................................
SofiaCollazos
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 

Supercharge your RDBMS with Elasticsearch

  • 1. Supercharge Your RDBMS with Elasticsearch Arthur Gimpel, Director of DataZone
  • 2. Name: Arthur Gimpel Position: Technology Evangelist, Solutions Architect, Trainer Tech Stack: Elastic Stack, SQL Server, MongoDB, Couchbase, Redis, Kafka, StreamSets, Python, .NET… Free Time: Motorcycles, Skydiving… Click to edit Master title styleAbout Me
  • 3. • First RDBMS was introduced in late 1970s • Exist in all possible flavors but share one thing - ACID • Still dominate the database market Click to edit Master title styleRelational Database Management Systems
  • 4. • Atomicity: All or nothing approach, transactions • Consistency: Hard state, every transaction changes the whole DBMS • Isolation: Transactions cannot interfere with each other • Durability: Every transaction is persisted Click to edit Master title styleRDBMS in Theory - ACID
  • 5. • Everything is persisted, synchronously. Limited by IO performance • All data is bound to a tabular schema, hard to make changes in big databases • ACID makes horizontal scaling nearly* impossible • Complex schema slows down aggregations and queries drastically Click to edit Master title styleACID Is Not Perfect
  • 6. • Distributed / Horizontal Scalability • Mostly Open Source • Mostly schema less: • Key - Value • Document • Graph • Serves specific purposes Click to edit Master title styleNoSQL - New Kid in Town
  • 7. • Every data store has its purpose. There is no single solution to all database needs • NoSQL does not implement all of RDBMS’s abilities (CDC, Jobs, Stored Procedures, Triggers) • Every data store has its own languages, and APIs. There is no ANSI SQL Click to edit Master title styleNoSQL - Challenges
  • 8. Click to edit Master title style NoSQL = Not Only SQL | Polyglot Persistence
  • 9. • Search platform, data store based on Apache Lucene • Supports various search types: Filtered, Full-text, Geography, Aggregation (Facet, Nested, Pipeline), Graph • Distributed - every index is split to shards relying on (potentially) a node • Document store - JSON • “Optimistic” Schema-less architecture • Supports Replication by nature • Supports Unsupervised Machine Learning by nature (Prelert, in beta) Click to edit Master title style
  • 10. Click to edit Master title styleSearch != SQL Querying
  • 11. Click to edit Master title styleReference Architecture #1
  • 12. Click to edit Master title styleReference Architecture #2
  • 13. Click to edit Master title styleArchitecture Comparison Architecture #1 Architecture #2 Data distribution strategy Data store based Application based Data distribution component Data Pipeline ( StreamSets ) Message Queue ( Kafka ) Implementation Team Data Engineers / DevOps DevOps / Developers Implementation Complexity Low: Data pipeline development High: data access layer refactor Potential additional licensing Elasticsearch, StreamSets None Scalability Limited to RDBMS Scale Fully scalable regardless of RDBMS