SlideShare a Scribd company logo
Big Data
NoSQL Database Types: episode I
Content
▪ Setup
▪ Introduction
▪ Key/Value
▪ Column Store
Setup
1. Go to https://github.com/tomvdbulck/cassandrainitiationsearchworkshop
And https://github.com/tomvdbulck/redisinitiationsearchworkshop
2. Make sure the following items have been installed on your machine:
o Java 7 or higher
o Git (if you like a pretty interface to deal with git, try SourceTree)
o Maven
3. Install VirtualBox https://www.virtualbox.org/wiki/Downloads
4. Install Vagrant https://www.vagrantup.com/downloads.html
5. Clone the repository into your workspace
Introduction
▪ 4 Types of NoSQL
▪ CAP Theorem
Types of NoSQL data stores
Following 4 types exist
▪ Key/Value Store
▪ Column Store
▪ Document Store
▪ Graph Database
Types of NoSQL data stores
Key/Value
- key/value
- are often “in-memory”
- Strength
▪simple to implement
▪fast lookup
- Weakness
▪querying
▪stored data has no schema
- Use Case:
▪Caching
▪Top 10 list of facebook games
Types of NoSQL data stores
Column Store:
- Stores everything in columns
- Strength
▪fast lookup
▪distributed storage of data
▪better querying then key/value
- Weakness
▪low-level api
▪cumbersome to do more complex queryies
- Use Case:
▪Distributed file system
▪(twitter, netflix)
Types of NoSQL data stores
Document Store:
- collections of key/value collections (documents)
- Strength
▪Tolerant of incomplete data
▪Easier to do more complex queries
- Weakness
▪Query performance
- Use Case
▪standard web applications
Types of NoSQL data stores
Graph Database
- store everything in a graph - use of nodes
- nodes have relations to adjacent nodes - no index lookup required
- Strength
▪graph algorithms
▪visualize relations
- Weakness
▪has to traverse entire graph to get answer
▪not easy to cluster
- Use Case:
▪Social Networking
▪Recommendations
Types of NoSQL data stores
Types of NoSQL data stores
Graph Database:
Types of NoSQL data stores
Graph Database: playing around
Visualize your own linkedin network:
http://neo4j.com/blog/exploring-linkedin-in-neo4j/
Types of NoSQL data stores
Which to use?
▪ Often you will be using more then one, based on which one is the
best fit for specific requirements
▪ You could also use 1 for development - schemaless, pretty feature
complete (document store) and when feature-complete choose more
appropriate databases.
=> a modular architecture will be important when you develop like this
CAP Theorem
Impossible for a distributed file system to simultaneously provide the
following guarantees:
▪ Consistency: all nodes see the same data at the same time
▪ Availability: guarantee that every request receives a response about
whether it succeeded or failed
▪ Partition Tolerance: the system continues to operate despite
arbitrary message loss or failure of part of the system
CAP Theorem
Consistency:
When I ask the same question to any part of the system I should get the same answer.
CAP Theorem
Consistency:
When I ask the same question to any part of the system I should get the same answer.
CAP Theorem
Consistency:
When I ask the same question to any part of the system I should get the same answer.
CAP Theorem
Availability:
When I ask a question I will get an answer.
CAP Theorem
Availability:
When I ask a question I will get an answer.
CAP Theorem
Partition Tolerance:
I can ask questions even if the system is having intra-system communication problems
CAP Theorem
Partition Tolerance:
I can ask questions even if the system is having intra-system communication problems
CAP Theorem
CAP Theorem
▪ Consistent Available (CA):
- have trouble with partitions and deal with it via replications
- Examples: RDBMs
▪ Consistent, Partition-Tolerant (CP):
- have trouble with availability while keeping data consistent across
partitioned nodes
- Examples: MongoDB, HBase,BigTable, HyperTable, Redis
▪ Available, Partition-Tolerant (AP)
- achieve “eventual consistency” through replication and verification
- Examples: CouchDB, Cassandra, Voldemort, Riak
Content
▪ Key/Value
▪ Column Store
Key/Value
Column Store
Questions or Suggestions?

More Related Content

Viewers also liked (15)

PPTX
Column Stores and Google BigQuery
Csaba Toth
 
PPTX
Developer's QA Toolkit - 34th National IT Conference - Sri Lanka
Chamil Jeewantha
 
PDF
SQL for Elasticsearch
Jodok Batlogg
 
PPTX
Fibrosis pulmonar
manelbiologia
 
PDF
Key-Value-Stores -- The Key to Scaling?
Tim Lossen
 
PPTX
Energia Luminica
Sandra Balderrama
 
PDF
VLDB 2009 Tutorial on Column-Stores
Daniel Abadi
 
PDF
Gossip & Key Value Store
Sajeev P
 
PPTX
Fracture neck of femur
drsram
 
PDF
Non-Relational Databases & Key/Value Stores
Joël Perras
 
PDF
Key-Value Stores: a practical overview
Marc Seeger
 
PPTX
MongoDB Introduction - Document Oriented Nosql Database
Sudhir Patil
 
PDF
Intro to column stores
Justin Swanhart
 
PPTX
Fármacos para la insuficiencia cardíaca y antianginosos
fernando_torres52
 
PPTX
What is NoSQL and CAP Theorem
Rahul Jain
 
Column Stores and Google BigQuery
Csaba Toth
 
Developer's QA Toolkit - 34th National IT Conference - Sri Lanka
Chamil Jeewantha
 
SQL for Elasticsearch
Jodok Batlogg
 
Fibrosis pulmonar
manelbiologia
 
Key-Value-Stores -- The Key to Scaling?
Tim Lossen
 
Energia Luminica
Sandra Balderrama
 
VLDB 2009 Tutorial on Column-Stores
Daniel Abadi
 
Gossip & Key Value Store
Sajeev P
 
Fracture neck of femur
drsram
 
Non-Relational Databases & Key/Value Stores
Joël Perras
 
Key-Value Stores: a practical overview
Marc Seeger
 
MongoDB Introduction - Document Oriented Nosql Database
Sudhir Patil
 
Intro to column stores
Justin Swanhart
 
Fármacos para la insuficiencia cardíaca y antianginosos
fernando_torres52
 
What is NoSQL and CAP Theorem
Rahul Jain
 

Similar to Big data key-value and column stores redis - cassandra (20)

PPTX
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
PPTX
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
PPT
No sql
Shruti_gtbit
 
PPTX
Master.pptx
KarthikR780430
 
PDF
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
IJCERT JOURNAL
 
PPTX
NoSql
Girish Khanzode
 
PPTX
NOSQL PRESENTATION ON INTRRODUCTION Intro.pptx
plvdravikumarit
 
PDF
NoSql and it's introduction features-Unit-1.pdf
ajajkhan16
 
PPT
6269441.ppt
Swapna Jk
 
PPTX
NoSql Database
Suresh Parmar
 
PPTX
No sql databases
Ankit Dubey
 
PPT
No sql
Murat Çakal
 
PPT
No SQL Databases as modern database concepts
debasisdas225831
 
PPTX
Relational and non relational database 7
abdulrahmanhelan
 
PPTX
No sq lv2
Nusrat Sharmin
 
PPTX
NoSQL Intro with cassandra
Brian Enochson
 
PPTX
To SQL or NoSQL, that is the question
Krishnakumar S
 
PPTX
NoSQL and Couchbase
Sangharsh agarwal
 
PPTX
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
PPT
No sql landscape_nosqltips
imarcticblue
 
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
No sql
Shruti_gtbit
 
Master.pptx
KarthikR780430
 
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
IJCERT JOURNAL
 
NOSQL PRESENTATION ON INTRRODUCTION Intro.pptx
plvdravikumarit
 
NoSql and it's introduction features-Unit-1.pdf
ajajkhan16
 
6269441.ppt
Swapna Jk
 
NoSql Database
Suresh Parmar
 
No sql databases
Ankit Dubey
 
No sql
Murat Çakal
 
No SQL Databases as modern database concepts
debasisdas225831
 
Relational and non relational database 7
abdulrahmanhelan
 
No sq lv2
Nusrat Sharmin
 
NoSQL Intro with cassandra
Brian Enochson
 
To SQL or NoSQL, that is the question
Krishnakumar S
 
NoSQL and Couchbase
Sangharsh agarwal
 
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
No sql landscape_nosqltips
imarcticblue
 
Ad

More from JWORKS powered by Ordina (20)

PDF
Introduction to Webpack - Ordina JWorks - CC JS & Web
JWORKS powered by Ordina
 
PDF
Lagom in Practice
JWORKS powered by Ordina
 
PDF
Netflix OSS and HATEOAS deployed on production - JavaLand
JWORKS powered by Ordina
 
PDF
Introduction to Docker
JWORKS powered by Ordina
 
PDF
An introduction to Cloud Foundry
JWORKS powered by Ordina
 
PDF
Cc internet of things LoRa and IoT - Innovation Enablers
JWORKS powered by Ordina
 
PPTX
Mongodb @ vrt
JWORKS powered by Ordina
 
PPTX
Mongo db intro.pptx
JWORKS powered by Ordina
 
PPTX
Big data document and graph d bs - couch-db and orientdb
JWORKS powered by Ordina
 
PPTX
Hadoop bootcamp getting started
JWORKS powered by Ordina
 
PPTX
Big data elasticsearch practical
JWORKS powered by Ordina
 
PDF
Intro to cassandra
JWORKS powered by Ordina
 
PPTX
Android wear - CC Mobile
JWORKS powered by Ordina
 
PPTX
Clean Code - A&BP CC
JWORKS powered by Ordina
 
PDF
Unit testing - A&BP CC
JWORKS powered by Ordina
 
PDF
Integration testing - A&BP CC
JWORKS powered by Ordina
 
PDF
Documenting your REST API with Swagger - JOIN 2014
JWORKS powered by Ordina
 
PDF
Spring 4 - A&BP CC
JWORKS powered by Ordina
 
PPTX
Android secure offline storage - CC Mobile
JWORKS powered by Ordina
 
PPTX
Meteor - JOIN 2015
JWORKS powered by Ordina
 
Introduction to Webpack - Ordina JWorks - CC JS & Web
JWORKS powered by Ordina
 
Lagom in Practice
JWORKS powered by Ordina
 
Netflix OSS and HATEOAS deployed on production - JavaLand
JWORKS powered by Ordina
 
Introduction to Docker
JWORKS powered by Ordina
 
An introduction to Cloud Foundry
JWORKS powered by Ordina
 
Cc internet of things LoRa and IoT - Innovation Enablers
JWORKS powered by Ordina
 
Mongo db intro.pptx
JWORKS powered by Ordina
 
Big data document and graph d bs - couch-db and orientdb
JWORKS powered by Ordina
 
Hadoop bootcamp getting started
JWORKS powered by Ordina
 
Big data elasticsearch practical
JWORKS powered by Ordina
 
Intro to cassandra
JWORKS powered by Ordina
 
Android wear - CC Mobile
JWORKS powered by Ordina
 
Clean Code - A&BP CC
JWORKS powered by Ordina
 
Unit testing - A&BP CC
JWORKS powered by Ordina
 
Integration testing - A&BP CC
JWORKS powered by Ordina
 
Documenting your REST API with Swagger - JOIN 2014
JWORKS powered by Ordina
 
Spring 4 - A&BP CC
JWORKS powered by Ordina
 
Android secure offline storage - CC Mobile
JWORKS powered by Ordina
 
Meteor - JOIN 2015
JWORKS powered by Ordina
 
Ad

Recently uploaded (20)

PPTX
Customise Your Correlation Table in IBM SPSS Statistics.pptx
Version 1 Analytics
 
PPTX
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
PDF
Wondershare PDFelement Pro Crack for MacOS New Version Latest 2025
bashirkhan333g
 
PPTX
Finding Your License Details in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
PPTX
Home Care Tools: Benefits, features and more
Third Rock Techkno
 
PDF
MiniTool Partition Wizard Free Crack + Full Free Download 2025
bashirkhan333g
 
PDF
AI + DevOps = Smart Automation with devseccops.ai.pdf
Devseccops.ai
 
PDF
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
PDF
Driver Easy Pro 6.1.1 Crack Licensce key 2025 FREE
utfefguu
 
PPTX
Comprehensive Risk Assessment Module for Smarter Risk Management
EHA Soft Solutions
 
PDF
MiniTool Power Data Recovery 8.8 With Crack New Latest 2025
bashirkhan333g
 
PDF
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
PPTX
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
PDF
Top Agile Project Management Tools for Teams in 2025
Orangescrum
 
PDF
The 5 Reasons for IT Maintenance - Arna Softech
Arna Softech
 
PDF
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
PDF
Digger Solo: Semantic search and maps for your local files
seanpedersen96
 
PDF
SAP Firmaya İade ABAB Kodları - ABAB ile yazılmıl hazır kod örneği
Salih Küçük
 
PDF
How to Hire AI Developers_ Step-by-Step Guide in 2025.pdf
DianApps Technologies
 
PPTX
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
Customise Your Correlation Table in IBM SPSS Statistics.pptx
Version 1 Analytics
 
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
Wondershare PDFelement Pro Crack for MacOS New Version Latest 2025
bashirkhan333g
 
Finding Your License Details in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
Home Care Tools: Benefits, features and more
Third Rock Techkno
 
MiniTool Partition Wizard Free Crack + Full Free Download 2025
bashirkhan333g
 
AI + DevOps = Smart Automation with devseccops.ai.pdf
Devseccops.ai
 
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
Driver Easy Pro 6.1.1 Crack Licensce key 2025 FREE
utfefguu
 
Comprehensive Risk Assessment Module for Smarter Risk Management
EHA Soft Solutions
 
MiniTool Power Data Recovery 8.8 With Crack New Latest 2025
bashirkhan333g
 
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
Top Agile Project Management Tools for Teams in 2025
Orangescrum
 
The 5 Reasons for IT Maintenance - Arna Softech
Arna Softech
 
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
Digger Solo: Semantic search and maps for your local files
seanpedersen96
 
SAP Firmaya İade ABAB Kodları - ABAB ile yazılmıl hazır kod örneği
Salih Küçük
 
How to Hire AI Developers_ Step-by-Step Guide in 2025.pdf
DianApps Technologies
 
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 

Big data key-value and column stores redis - cassandra

  • 1. Big Data NoSQL Database Types: episode I
  • 2. Content ▪ Setup ▪ Introduction ▪ Key/Value ▪ Column Store
  • 3. Setup 1. Go to https://github.com/tomvdbulck/cassandrainitiationsearchworkshop And https://github.com/tomvdbulck/redisinitiationsearchworkshop 2. Make sure the following items have been installed on your machine: o Java 7 or higher o Git (if you like a pretty interface to deal with git, try SourceTree) o Maven 3. Install VirtualBox https://www.virtualbox.org/wiki/Downloads 4. Install Vagrant https://www.vagrantup.com/downloads.html 5. Clone the repository into your workspace
  • 4. Introduction ▪ 4 Types of NoSQL ▪ CAP Theorem
  • 5. Types of NoSQL data stores Following 4 types exist ▪ Key/Value Store ▪ Column Store ▪ Document Store ▪ Graph Database
  • 6. Types of NoSQL data stores Key/Value - key/value - are often “in-memory” - Strength ▪simple to implement ▪fast lookup - Weakness ▪querying ▪stored data has no schema - Use Case: ▪Caching ▪Top 10 list of facebook games
  • 7. Types of NoSQL data stores Column Store: - Stores everything in columns - Strength ▪fast lookup ▪distributed storage of data ▪better querying then key/value - Weakness ▪low-level api ▪cumbersome to do more complex queryies - Use Case: ▪Distributed file system ▪(twitter, netflix)
  • 8. Types of NoSQL data stores Document Store: - collections of key/value collections (documents) - Strength ▪Tolerant of incomplete data ▪Easier to do more complex queries - Weakness ▪Query performance - Use Case ▪standard web applications
  • 9. Types of NoSQL data stores Graph Database - store everything in a graph - use of nodes - nodes have relations to adjacent nodes - no index lookup required - Strength ▪graph algorithms ▪visualize relations - Weakness ▪has to traverse entire graph to get answer ▪not easy to cluster - Use Case: ▪Social Networking ▪Recommendations
  • 10. Types of NoSQL data stores
  • 11. Types of NoSQL data stores Graph Database:
  • 12. Types of NoSQL data stores Graph Database: playing around Visualize your own linkedin network: http://neo4j.com/blog/exploring-linkedin-in-neo4j/
  • 13. Types of NoSQL data stores Which to use? ▪ Often you will be using more then one, based on which one is the best fit for specific requirements ▪ You could also use 1 for development - schemaless, pretty feature complete (document store) and when feature-complete choose more appropriate databases. => a modular architecture will be important when you develop like this
  • 14. CAP Theorem Impossible for a distributed file system to simultaneously provide the following guarantees: ▪ Consistency: all nodes see the same data at the same time ▪ Availability: guarantee that every request receives a response about whether it succeeded or failed ▪ Partition Tolerance: the system continues to operate despite arbitrary message loss or failure of part of the system
  • 15. CAP Theorem Consistency: When I ask the same question to any part of the system I should get the same answer.
  • 16. CAP Theorem Consistency: When I ask the same question to any part of the system I should get the same answer.
  • 17. CAP Theorem Consistency: When I ask the same question to any part of the system I should get the same answer.
  • 18. CAP Theorem Availability: When I ask a question I will get an answer.
  • 19. CAP Theorem Availability: When I ask a question I will get an answer.
  • 20. CAP Theorem Partition Tolerance: I can ask questions even if the system is having intra-system communication problems
  • 21. CAP Theorem Partition Tolerance: I can ask questions even if the system is having intra-system communication problems
  • 23. CAP Theorem ▪ Consistent Available (CA): - have trouble with partitions and deal with it via replications - Examples: RDBMs ▪ Consistent, Partition-Tolerant (CP): - have trouble with availability while keeping data consistent across partitioned nodes - Examples: MongoDB, HBase,BigTable, HyperTable, Redis ▪ Available, Partition-Tolerant (AP) - achieve “eventual consistency” through replication and verification - Examples: CouchDB, Cassandra, Voldemort, Riak