SlideShare a Scribd company logo
5
Most read
8
Most read
9
Most read
Presentation
on
NoSQL
“Towards the end of RDBMS ?”
By: Ramakant Soni
Asst. Professor, Dept. of Computer
Science, BKBIET, Pilani
What is RDBMS
 RDBMS: the relational database
management system.
 Relation: a relation is a 2D table
which has the following features:
 Name
 Attributes
 Tuples
Name
2
Issues with RDBMS- Scalability
 Issues with scaling up when the dataset is
just too big e.g. Big Data.
 Not designed to be distributed.
 Looking at multi-node database solutions.
Known as ‘horizontal scaling’.
 Different approaches include:
 Master-slave
 Sharding
3
Scaling RDBMS
Master-Slave
 All writes are written to the master.
All reads are performed against
the replicated slave databases.
 Critical reads may be incorrect as
writes may not have been
propagated down.
 Large data sets can pose problems
as master needs to duplicate data
to slaves.
Sharding
 Scales well for both reads and
writes.
 Not transparent, application needs
to be partition-aware.
 Can no longer have relationships or
joins across partitions.
 Loss of referential integrity across
shards.
4
What is NoSQL
 Stands for Not Only SQL. Term was redefined by Eric Evans after Carlo
Strozzi.
 Class of non-relational data storage systems.
 Do not require a fixed table schema nor do they use the concept of joins.
 Relaxation for one or more of the ACID properties (Atomicity, Consistency,
Isolation, Durability) using CAP theorem.
5
Need of NoSQL
 Explosion of social media sites (Facebook, Twitter, Google etc.) with large
data needs. (Sharding is a problem)
 Rise of cloud-based solutions such as Amazon S3 (simple storage solution).
 Just as moving to dynamically-typed languages (Ruby/Groovy), a shift to
dynamically-typed data with frequent schema changes.
 Expansion of Open-source community.
 NoSQL solution is more acceptable to a client now than a year ago.
6
NoSQL Types
NoSQL database are classified into four types:
• Key Value pair based
• Column based
• Document based
• Graph based
7
Key Value Pair Based
• Designed for processing dictionary. Dictionaries contain a
collection of records having fields containing data.
• Records are stored and retrieved using a key that uniquely
identifies the record, and is used to quickly find the data
within the database.
Example: CouchDB, Oracle NoSQL Database, Riak etc.
We use it for storing session information, user profiles, preferences,
shopping cart data.
We would avoid it when we need to query data having relationships
between entities.
8
Column based
It store data as Column families containing rows that have
many columns associated with a row key. Each row can have
different columns.
Column families are groups of related data that is accessed
together.
Example: Cassandra, HBase, Hypertable, and Amazon
DynamoDB.
We use it for content management systems, blogging platforms, log aggregation.
We would avoid it for systems that are in early development, changing query patterns.
9
Document Based
The database stores and retrieves documents. It stores documents in
the value part of the key-value store.
Self- describing, hierarchical tree data structures consisting of maps,
collections, and scalar values.
Example: Lotus Notes, MongoDB, Couch DB, Orient DB, Raven DB.
We use it for content management systems, blogging platforms, web analytics, real-time analytics,
e- commerce applications.
We would avoid it for systems that need complex transactions spanning multiple operations or
queries against varying aggregate structures.
10
Graph Based
Store entities and relationships between these entities as nodes
and edges of a graph respectively. Entities have properties.
Traversing the relationships is very fast as relationship between
nodes is not calculated at query time but is actually persisted
as a relationship.
Example: Neo4J, Infinite Graph, OrientDB, FlockDB.
It is well suited for connected data, such as social networks,
spatial data, routing information for goods and supply.
11
CAP Theorem
 According to Eric Brewer a distributed system has 3 properties :
 Consistency
 Availability
 Partitions
 We can have at most two of these three properties for any shared-data system
 To scale out, we have to partition. It leaves a choice between consistency and
availability. ( In almost all cases, we would choose availability over consistency)
 Everyone who builds big applications builds them on CAP : Google, Yahoo,
Facebook, Amazon, eBay, etc.
12
Advantages of NoSQL
 Cheap and easy to implement (open source)
 Data are replicated to multiple nodes (therefore identical and fault-
tolerant) and can be partitioned
 When data is written, the latest version is on at least one node and then
replicated to other nodes
 No single point of failure
 Easy to distribute
 Don't require a schema
13
What is not provided by NoSQL
 Joins
 Group by
 ACID transactions
 SQL
 Integration with applications that are based on SQL
14
Where to use NoSQL
 NoSQL Data storage systems makes sense for applications that process very large
semi-structured data –like Log Analysis, Social Networking Feeds, Time-based
data.
 To improve programmer productivity by using a database that better matches an
application's needs.
 To improve data access performance via some combination of handling larger data
volumes, reducing latency, and improving throughput.
15
Conclusion
 All the choices provided by the rise of NoSQL databases does not mean the demise
of RDBMS databases as Relational databases are a powerful tool.
 We are entering an era of Polyglot persistence, a technique that uses different data
storage technologies to handle varying data storage needs. It can apply across an
enterprise or within an individual application.
16
References
1. “NoSQL Databases: An Overview”. Pramod Sadalage, thoughtworks.com(2014)
2. “Data management in cloud environments: NoSQL and NewSQL data stores”.
Grolinger, K.; Higashino, W. A.; Tiwari, A.; Capretz, M. A. M. (2013). JoCCASA,
Springer.
3. “Making the Shift from Relational to NoSQL”. Couchbase.com(2014).
4. “NoSQL - Death to Relational Databases”. Scofield, Ben (2010).
17
Thank
You
Thank
You

More Related Content

What's hot (20)

PPT
Schemaless Databases
Dan Gunter
 
PPTX
Distributed database
ReachLocal Services India
 
PPT
Hive(ppt)
Abhinav Tyagi
 
PPTX
Hadoop File system (HDFS)
Prashant Gupta
 
PPTX
Map Reduce
Prashant Gupta
 
PPTX
Database security
Software Engineering
 
PPTX
Non relational databases-no sql
Ram kumar
 
PPTX
Introduction to MongoDB.pptx
Surya937648
 
PPTX
Distributed dbms architectures
Pooja Dixit
 
PDF
NoSQL databases
Marin Dimitrov
 
PPTX
Relational Database Design
Archit Saxena
 
PDF
Lecture4 big data technology foundations
hktripathy
 
PPTX
Introduction to NoSQL
PolarSeven Pty Ltd
 
PPT
Hadoop Map Reduce
VNIT-ACM Student Chapter
 
PPTX
Introduction to HDFS
Bhavesh Padharia
 
PPTX
Data cubes
Mohammed
 
PPTX
HADOOP TECHNOLOGY ppt
sravya raju
 
PPTX
Transaction management DBMS
Megha Patel
 
PPTX
OLAP operations
kunj desai
 
Schemaless Databases
Dan Gunter
 
Distributed database
ReachLocal Services India
 
Hive(ppt)
Abhinav Tyagi
 
Hadoop File system (HDFS)
Prashant Gupta
 
Map Reduce
Prashant Gupta
 
Database security
Software Engineering
 
Non relational databases-no sql
Ram kumar
 
Introduction to MongoDB.pptx
Surya937648
 
Distributed dbms architectures
Pooja Dixit
 
NoSQL databases
Marin Dimitrov
 
Relational Database Design
Archit Saxena
 
Lecture4 big data technology foundations
hktripathy
 
Introduction to NoSQL
PolarSeven Pty Ltd
 
Hadoop Map Reduce
VNIT-ACM Student Chapter
 
Introduction to HDFS
Bhavesh Padharia
 
Data cubes
Mohammed
 
HADOOP TECHNOLOGY ppt
sravya raju
 
Transaction management DBMS
Megha Patel
 
OLAP operations
kunj desai
 

Similar to NOSQL- Presentation on NoSQL (20)

PDF
Nosql Presentation.pdf for DBMS understanding
HUSNAINAHMAD39
 
PPTX
NoSQL powerpoint presentation difference with rdbms
AtulKabbur
 
PPSX
A Seminar on NoSQL Databases.
Navdeep Charan
 
PPTX
Presentation on NOSQL and mongodb .pptx
jokisil929
 
PPTX
No sq lv2
Nusrat Sharmin
 
PPTX
NoSQL(NOT ONLY SQL)
Rahul P
 
PPTX
Relational and non relational database 7
abdulrahmanhelan
 
PPTX
No sqlpresentation
Salma Gouia
 
PPT
NoSQL - 05March2014 Seminar
Jainul Musani
 
PPTX
NoSQL Basics and MongDB
Shamima Yeasmin Mukta
 
PDF
the rising no sql technology
INFOGAIN PUBLICATION
 
DOCX
Know what is NOSQL
Prasoon Sharma
 
PPTX
unit2-ppt1.pptx
revathigollu23
 
PPTX
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
PPTX
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
PDF
NoSql and it's introduction features-Unit-1.pdf
ajajkhan16
 
PPTX
NOSQL
akbarashaikh
 
PPTX
NOSQL PRESENTATION ON INTRRODUCTION Intro.pptx
plvdravikumarit
 
PPT
NoSql Databases
Nimat Khattak
 
PPT
6269441.ppt
Swapna Jk
 
Nosql Presentation.pdf for DBMS understanding
HUSNAINAHMAD39
 
NoSQL powerpoint presentation difference with rdbms
AtulKabbur
 
A Seminar on NoSQL Databases.
Navdeep Charan
 
Presentation on NOSQL and mongodb .pptx
jokisil929
 
No sq lv2
Nusrat Sharmin
 
NoSQL(NOT ONLY SQL)
Rahul P
 
Relational and non relational database 7
abdulrahmanhelan
 
No sqlpresentation
Salma Gouia
 
NoSQL - 05March2014 Seminar
Jainul Musani
 
NoSQL Basics and MongDB
Shamima Yeasmin Mukta
 
the rising no sql technology
INFOGAIN PUBLICATION
 
Know what is NOSQL
Prasoon Sharma
 
unit2-ppt1.pptx
revathigollu23
 
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
NoSql and it's introduction features-Unit-1.pdf
ajajkhan16
 
NOSQL PRESENTATION ON INTRRODUCTION Intro.pptx
plvdravikumarit
 
NoSql Databases
Nimat Khattak
 
6269441.ppt
Swapna Jk
 
Ad

More from Ramakant Soni (13)

PDF
GATE 2021 Exam Information
Ramakant Soni
 
PDF
What is Algorithm - An Overview
Ramakant Soni
 
PDF
Role of Data Cleaning in Data Warehouse
Ramakant Soni
 
PDF
Internet of things
Ramakant Soni
 
PDF
Huffman and Arithmetic coding - Performance analysis
Ramakant Soni
 
PDF
UML daigrams for Bank ATM system
Ramakant Soni
 
PDF
Collaboration diagram- UML diagram
Ramakant Soni
 
PDF
Activity diagram-UML diagram
Ramakant Soni
 
PDF
Sequence diagram- UML diagram
Ramakant Soni
 
PDF
Class diagram- UML diagram
Ramakant Soni
 
PDF
Use Case diagram-UML diagram-2
Ramakant Soni
 
PDF
Use Case diagram-UML diagram-1
Ramakant Soni
 
PDF
UML Diagrams- Unified Modeling Language Introduction
Ramakant Soni
 
GATE 2021 Exam Information
Ramakant Soni
 
What is Algorithm - An Overview
Ramakant Soni
 
Role of Data Cleaning in Data Warehouse
Ramakant Soni
 
Internet of things
Ramakant Soni
 
Huffman and Arithmetic coding - Performance analysis
Ramakant Soni
 
UML daigrams for Bank ATM system
Ramakant Soni
 
Collaboration diagram- UML diagram
Ramakant Soni
 
Activity diagram-UML diagram
Ramakant Soni
 
Sequence diagram- UML diagram
Ramakant Soni
 
Class diagram- UML diagram
Ramakant Soni
 
Use Case diagram-UML diagram-2
Ramakant Soni
 
Use Case diagram-UML diagram-1
Ramakant Soni
 
UML Diagrams- Unified Modeling Language Introduction
Ramakant Soni
 
Ad

Recently uploaded (20)

PDF
NTPC PATRATU Summer internship report.pdf
hemant03701
 
PPTX
Biosensors, BioDevices, Biomediccal.pptx
AsimovRiyaz
 
PPTX
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
PPTX
How Industrial Project Management Differs From Construction.pptx
jamespit799
 
PPT
Testing and final inspection of a solar PV system
MuhammadSanni2
 
PPTX
fatigue in aircraft structures-221113192308-0ad6dc8c.pptx
aviatecofficial
 
PPTX
darshai cross section and river section analysis
muk7971
 
PDF
Digital water marking system project report
Kamal Acharya
 
PPTX
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
PPTX
仿制LethbridgeOffer加拿大莱斯桥大学毕业证范本,Lethbridge成绩单
Taqyea
 
PPTX
澳洲电子毕业证澳大利亚圣母大学水印成绩单UNDA学生证网上可查学历
Taqyea
 
PDF
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
PDF
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
PPT
Footbinding.pptmnmkjkjkknmnnjkkkkkkkkkkkkkk
mamadoundiaye42742
 
PDF
AN EMPIRICAL STUDY ON THE USAGE OF SOCIAL MEDIA IN GERMAN B2C-ONLINE STORES
ijait
 
PDF
3rd International Conference on Machine Learning and IoT (MLIoT 2025)
ClaraZara1
 
PPTX
原版一样(EC Lille毕业证书)法国里尔中央理工学院毕业证补办
Taqyea
 
PDF
Halide Perovskites’ Multifunctional Properties: Coordination Engineering, Coo...
TaameBerhe2
 
PPTX
Introduction to Internal Combustion Engines - Types, Working and Camparison.pptx
UtkarshPatil98
 
PDF
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
NTPC PATRATU Summer internship report.pdf
hemant03701
 
Biosensors, BioDevices, Biomediccal.pptx
AsimovRiyaz
 
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
How Industrial Project Management Differs From Construction.pptx
jamespit799
 
Testing and final inspection of a solar PV system
MuhammadSanni2
 
fatigue in aircraft structures-221113192308-0ad6dc8c.pptx
aviatecofficial
 
darshai cross section and river section analysis
muk7971
 
Digital water marking system project report
Kamal Acharya
 
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
仿制LethbridgeOffer加拿大莱斯桥大学毕业证范本,Lethbridge成绩单
Taqyea
 
澳洲电子毕业证澳大利亚圣母大学水印成绩单UNDA学生证网上可查学历
Taqyea
 
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
Footbinding.pptmnmkjkjkknmnnjkkkkkkkkkkkkkk
mamadoundiaye42742
 
AN EMPIRICAL STUDY ON THE USAGE OF SOCIAL MEDIA IN GERMAN B2C-ONLINE STORES
ijait
 
3rd International Conference on Machine Learning and IoT (MLIoT 2025)
ClaraZara1
 
原版一样(EC Lille毕业证书)法国里尔中央理工学院毕业证补办
Taqyea
 
Halide Perovskites’ Multifunctional Properties: Coordination Engineering, Coo...
TaameBerhe2
 
Introduction to Internal Combustion Engines - Types, Working and Camparison.pptx
UtkarshPatil98
 
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 

NOSQL- Presentation on NoSQL

  • 1. Presentation on NoSQL “Towards the end of RDBMS ?” By: Ramakant Soni Asst. Professor, Dept. of Computer Science, BKBIET, Pilani
  • 2. What is RDBMS  RDBMS: the relational database management system.  Relation: a relation is a 2D table which has the following features:  Name  Attributes  Tuples Name 2
  • 3. Issues with RDBMS- Scalability  Issues with scaling up when the dataset is just too big e.g. Big Data.  Not designed to be distributed.  Looking at multi-node database solutions. Known as ‘horizontal scaling’.  Different approaches include:  Master-slave  Sharding 3
  • 4. Scaling RDBMS Master-Slave  All writes are written to the master. All reads are performed against the replicated slave databases.  Critical reads may be incorrect as writes may not have been propagated down.  Large data sets can pose problems as master needs to duplicate data to slaves. Sharding  Scales well for both reads and writes.  Not transparent, application needs to be partition-aware.  Can no longer have relationships or joins across partitions.  Loss of referential integrity across shards. 4
  • 5. What is NoSQL  Stands for Not Only SQL. Term was redefined by Eric Evans after Carlo Strozzi.  Class of non-relational data storage systems.  Do not require a fixed table schema nor do they use the concept of joins.  Relaxation for one or more of the ACID properties (Atomicity, Consistency, Isolation, Durability) using CAP theorem. 5
  • 6. Need of NoSQL  Explosion of social media sites (Facebook, Twitter, Google etc.) with large data needs. (Sharding is a problem)  Rise of cloud-based solutions such as Amazon S3 (simple storage solution).  Just as moving to dynamically-typed languages (Ruby/Groovy), a shift to dynamically-typed data with frequent schema changes.  Expansion of Open-source community.  NoSQL solution is more acceptable to a client now than a year ago. 6
  • 7. NoSQL Types NoSQL database are classified into four types: • Key Value pair based • Column based • Document based • Graph based 7
  • 8. Key Value Pair Based • Designed for processing dictionary. Dictionaries contain a collection of records having fields containing data. • Records are stored and retrieved using a key that uniquely identifies the record, and is used to quickly find the data within the database. Example: CouchDB, Oracle NoSQL Database, Riak etc. We use it for storing session information, user profiles, preferences, shopping cart data. We would avoid it when we need to query data having relationships between entities. 8
  • 9. Column based It store data as Column families containing rows that have many columns associated with a row key. Each row can have different columns. Column families are groups of related data that is accessed together. Example: Cassandra, HBase, Hypertable, and Amazon DynamoDB. We use it for content management systems, blogging platforms, log aggregation. We would avoid it for systems that are in early development, changing query patterns. 9
  • 10. Document Based The database stores and retrieves documents. It stores documents in the value part of the key-value store. Self- describing, hierarchical tree data structures consisting of maps, collections, and scalar values. Example: Lotus Notes, MongoDB, Couch DB, Orient DB, Raven DB. We use it for content management systems, blogging platforms, web analytics, real-time analytics, e- commerce applications. We would avoid it for systems that need complex transactions spanning multiple operations or queries against varying aggregate structures. 10
  • 11. Graph Based Store entities and relationships between these entities as nodes and edges of a graph respectively. Entities have properties. Traversing the relationships is very fast as relationship between nodes is not calculated at query time but is actually persisted as a relationship. Example: Neo4J, Infinite Graph, OrientDB, FlockDB. It is well suited for connected data, such as social networks, spatial data, routing information for goods and supply. 11
  • 12. CAP Theorem  According to Eric Brewer a distributed system has 3 properties :  Consistency  Availability  Partitions  We can have at most two of these three properties for any shared-data system  To scale out, we have to partition. It leaves a choice between consistency and availability. ( In almost all cases, we would choose availability over consistency)  Everyone who builds big applications builds them on CAP : Google, Yahoo, Facebook, Amazon, eBay, etc. 12
  • 13. Advantages of NoSQL  Cheap and easy to implement (open source)  Data are replicated to multiple nodes (therefore identical and fault- tolerant) and can be partitioned  When data is written, the latest version is on at least one node and then replicated to other nodes  No single point of failure  Easy to distribute  Don't require a schema 13
  • 14. What is not provided by NoSQL  Joins  Group by  ACID transactions  SQL  Integration with applications that are based on SQL 14
  • 15. Where to use NoSQL  NoSQL Data storage systems makes sense for applications that process very large semi-structured data –like Log Analysis, Social Networking Feeds, Time-based data.  To improve programmer productivity by using a database that better matches an application's needs.  To improve data access performance via some combination of handling larger data volumes, reducing latency, and improving throughput. 15
  • 16. Conclusion  All the choices provided by the rise of NoSQL databases does not mean the demise of RDBMS databases as Relational databases are a powerful tool.  We are entering an era of Polyglot persistence, a technique that uses different data storage technologies to handle varying data storage needs. It can apply across an enterprise or within an individual application. 16
  • 17. References 1. “NoSQL Databases: An Overview”. Pramod Sadalage, thoughtworks.com(2014) 2. “Data management in cloud environments: NoSQL and NewSQL data stores”. Grolinger, K.; Higashino, W. A.; Tiwari, A.; Capretz, M. A. M. (2013). JoCCASA, Springer. 3. “Making the Shift from Relational to NoSQL”. Couchbase.com(2014). 4. “NoSQL - Death to Relational Databases”. Scofield, Ben (2010). 17