SlideShare a Scribd company logo
TThhee NNooSSQQLL PPrriinncciipplleess 
&& 
BBaassiicc AApppplliiccaattiioonn ooff 
CCaassssaannddrraa MMooddeell 
RReesshhmmii RRaaddhhaakkrriisshhnnaann 
SS77 CCSS BB 
RRoollll NNOO::7711 
GGuuiiddeedd bbyy,, 
DDrr..SSuuddhheeeepp EEllaayyiiddoomm 
SSOOEE CCUUSSAATT
AAGGEENNDDAA 
• INTRODUCTION 
• RDBMS 
• CHALLENGE 
• NoSQL 
• COMMON CONCEPTS 
• CLASSIFICATION 
• CASSANDRA 
• CASSANDRA CHARACTERISTICS 
• CONCLUSION 
• REFERENCES
INTRODUCTION 
* RDBMS is the predominant technology for storing structured 
data in web and business applications. 
* The relational database systems have little capability to 
horizontally scale. 
* NoSQL approach includes simplicity of design, horizontal 
scaling and finer control over availability 
* Cassandra is the right NoSQL database when you need 
scalability and high availability without compromising 
performance
RDBMS 
MERITS: 
• Rich language 
• Easy to use and integrate 
• Rich toolset 
• Vertical scaling 
• The promise: ACID 
o Atomicity 
o Consistency 
o Isolation 
o Durability
RDBMS 
DEMERTS: 
• Vertical scaling is highly expensive 
• Fails to handle large amount of data 
• It is not much efficient in cloud concept 
The read-write rates of data in RDBMS are very poor 
So it is hard to face the challenges from the modern web 
applications using RDBMS
The Challenge: Modern web apps 
• Internet-scale data size 
• High read-write rates 
• Frequent schema changes 
• "social" apps - not banks 
o They don't need the same 
level of ACID 
SCALING
NoSQL 
• Uses horizontal scaling 
• Distribute data over many servers 
• It give up ACID property 
• Based on CAP -theorem
Brewer's CAP Theorem: 
You can only choose two
CAP 
Consistency: 
A distributed system is considered to be consistent if after an update 
operation of some writer, all readers see his updates in some 
shared data sources 
Availability: 
System is designed in a way that continue operation even if nodes in 
a cluster crash 
Partition Tolerance: 
Ability of a system to continue operation in the presence of network 
partition
COMMON CONCEPTS 
•Sharding 
•Consistent hashing 
•Map reduce
SHARDING 
• it's a partitioning mechanism 
• records are stored in different servers according to some 
key 
• records that are accesses/updated together reside on same 
node 
• load is almost evenly distributed among servers 
• vertical partitioning: parts of single records are stored on 
different servers
Constant Hashing 
• A,B,C:- NODES 
• 1,2,3,4:- OBJECTS 
-> both are placed in ring 
->movement is clockwise 
->nodes can leave the system 
->nodes can enter into the system
MAP REDUCE 
•Used in 
distributed 
computig 
• map function 
• reduce function 
• process on 
key/value
Existing NOSQL Solutions
Classification of NOSQL data stores 
• Document Oriented 
o CouchDB, MongoDB, Lotus Notes, SimpleDB 
• Key-Value oriented 
o Voldemort, Dynamo, Riak (sort of), Redis, Tokyo 
• Column oriented 
o Cassandra, HBase, BigTable 
• Graph Databases oriented 
o Neo4J, FlockDB, DEX, AlegroGraph
• Developed at facebook 
• Follows the BigTable Data Model - column 
oriented 
• Follows the Dynamo Eventual Consistency 
model 
• Opensourced at Apache 
• Implemented in Java
• Distributed Storage System 
• Manages structured data and scale to large 
size
Data Model 
• Instance of Cassandra – Consists of one table 
represent multidimensional map indexed by a key 
• Query for information: 
get(keyspace, column family, row key) 
• Keyspace:- top level unit of information, Column 
families are subordinate of one key space 
• Column :- atomic unit of information, expressed 
as: name:value 
• Super Column :- groups together like columns 
with a common name, 
Used for modeling complex data types (address)
Data Model 
ROW FAMILY 
• uniquely identifiable data 
• groups column and super column 
• Every row are identified by row key 
COLUMN FAMILY 
• It have to be define in advance before a cluster of servers in 
Cassandra instance is launched 
• It consists of keyed rows which groups columns and super 
columns 
• Column and super column added dynamically to column 
families and they are not restricted in numbers
Write Path
MEMTABLES 
• In-memory representation of recently written data 
• When the table is full, it's sorted and then flushed to disk -> sstable 
SS TABLES 
Sorted Strings Tables 
• Unchangeable 
• On-disk 
• Sorted by a string key 
• In-memory index of elements 
• Binary search (in memory) to find element location 
• Bloom filter to reduce number of unneeded binary searches. 
WRITE PROPERTIES 
• No Locks in the critical path 
• Always available to writes, even if there are failures. 
No seeks 
• Fast 
• Atomic within a Row
Read Path
Read Properteis 
• Read multiple SSTables 
• Slower than writes (but still fast) 
• Seeks can be mitigated with more RAM 
• Uses probabilistic bloom filters to reduce lookups. 
• Extensive optional caching 
o Key Cache 
o Row Cache
Bloom Filters 
• Space efficient probabilistic data structure 
• Test whether an element is a member of a set 
• Union and intersection are implemented as bitwise OR, AND
QUERIES EXAMPLES 
CREATE INDEX CREATE CUSTOM INDEX IF NOT 
EXISTS index_name 
ON keyspace_name.table_name ( KEYS (column_name) ) 
( USING class_name ) ( WITH OPTIONS = map ) 
Restrictions: 
USING class_name is allowed only if CUSTOM is used 
and 
class_name is a string literal containing a java class name. 
index_name is an identifier, enclosed or not enclosed in 
double 
quotation marks, excluding reserved words. 
map is described in ALTER KEYSPACE. 
CREATE KEYSPACE CREATE ( KEYSPACE | 
SCHEMA ) IF NOT EXISTS keyspace_name 
WITH REPLICATION = map 
AND DURABLE_WRITES = ( true | false )
MySQL Comparison 
• MySQL : for 50 GB Data 
Writes Average : ~300 ms 
Reads Average : ~350 ms 
• Cassandra: for 50 GB Data 
Writes Average : 0.12 ms 
Reads Average : 15 ms
CONCLUSION 
•NoSQL is highly efficient concept for dealing 
large amount of data. 
•It can be used to solve big data problem. 
•Cassandra model can provide fast reading and 
writing operations 
•So this database model is used by all the latest 
social networking medias
REFERENCES 
**The NoSQL Principles and Basic Application of Cassandra Model 
Guoxi Wang ; Jianfeng Tang 
Computer Science & Service System (CSSS), 2012 International Conference on 
Digital Object Identifier: 10.1109/CSSS.2012.336 
Publication Year: 2012 , Page(s): 1332 - 1335 
IEEE CONFERENCE PUBLICATIONS 
**Survey on NoSQL database 
Jing Han ; Haihong, E. ; Guan Le ; Jian Du 
Pervasive Computing and Applications (ICPCA), 2011 6th InternationalConferenceon 
Digital Object Identifier: 10.1109/ICPCA.2011.6106531 
Publication Year: 2011 , Page(s): 363 - 366 
Cited by: Papers (2) 
IEEE CONFERENCE PUBLICATIONS
QQuueessttiioonnss??
The No SQL Principles and Basic Application Of Casandra Model

More Related Content

What's hot (20)

PDF
Apache Cassandra overview
ElifTech
 
PDF
Cassandra Database
YounesCharfaoui
 
PPTX
An Overview of Apache Cassandra
DataStax
 
PDF
Cassandra - A Decentralized Structured Storage System
Varad Meru
 
PDF
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
DataStax Academy
 
PPTX
Cassandra training
András Fehér
 
PDF
Cassandra Explained
Eric Evans
 
PPTX
Apache Cassandra at the Geek2Geek Berlin
Christian Johannsen
 
PPTX
Cassandra - Research Paper Overview
sameiralk
 
PDF
Cassandra overview
Sean Murphy
 
PDF
Cassandra for Sysadmins
Nathan Milford
 
PPT
Introduction to cassandra
Nguyen Quang
 
PDF
Cassandra 101
Nader Ganayem
 
PPTX
Managing Objects and Data in Apache Cassandra
DataStax
 
PPT
Cassandra architecture
T Jake Luciani
 
PDF
What is Apache Cassandra? | Apache Cassandra Tutorial | Apache Cassandra Intr...
Edureka!
 
PDF
On Rails with Apache Cassandra
Stu Hood
 
PPTX
Cassandra ppt 2
Skillwise Group
 
PDF
Intro to Cassandra
DataStax Academy
 
PPTX
Apache Spark II (SparkSQL)
Datio Big Data
 
Apache Cassandra overview
ElifTech
 
Cassandra Database
YounesCharfaoui
 
An Overview of Apache Cassandra
DataStax
 
Cassandra - A Decentralized Structured Storage System
Varad Meru
 
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
DataStax Academy
 
Cassandra training
András Fehér
 
Cassandra Explained
Eric Evans
 
Apache Cassandra at the Geek2Geek Berlin
Christian Johannsen
 
Cassandra - Research Paper Overview
sameiralk
 
Cassandra overview
Sean Murphy
 
Cassandra for Sysadmins
Nathan Milford
 
Introduction to cassandra
Nguyen Quang
 
Cassandra 101
Nader Ganayem
 
Managing Objects and Data in Apache Cassandra
DataStax
 
Cassandra architecture
T Jake Luciani
 
What is Apache Cassandra? | Apache Cassandra Tutorial | Apache Cassandra Intr...
Edureka!
 
On Rails with Apache Cassandra
Stu Hood
 
Cassandra ppt 2
Skillwise Group
 
Intro to Cassandra
DataStax Academy
 
Apache Spark II (SparkSQL)
Datio Big Data
 

Viewers also liked (15)

PDF
Common Service Definition
David Messineo
 
DOCX
Characteristics of a service marketing
iipmff2
 
PDF
The nature & classification of services
Anupam Kumar
 
PPSX
The importance of customer service
Rasila Abdulla
 
PPTX
Characteristics of services
Dr. Chandra Shekhar Singh
 
PPT
the nature of services
school
 
PPTX
Nature, characteristics of services
Pranav Kumar Ojha
 
PPT
Service Characteristics of Hospitality and Tourism Marketing
ConflagratioNal Jahid
 
PPTX
Classification of services
Dr. Chandra Shekhar Singh
 
PPSX
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
Apache Apex
 
PPTX
Customer Services Ppresentation
Mirza Muhammad Yousuf
 
PDF
75 Customer Service Facts, Quotes & Statistics
Help Scout
 
PPTX
Classification and Characteristics of a service
Sanchit
 
PPT
Services Marketing
Ch Usman Waheed
 
Common Service Definition
David Messineo
 
Characteristics of a service marketing
iipmff2
 
The nature & classification of services
Anupam Kumar
 
The importance of customer service
Rasila Abdulla
 
Characteristics of services
Dr. Chandra Shekhar Singh
 
the nature of services
school
 
Nature, characteristics of services
Pranav Kumar Ojha
 
Service Characteristics of Hospitality and Tourism Marketing
ConflagratioNal Jahid
 
Classification of services
Dr. Chandra Shekhar Singh
 
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
Apache Apex
 
Customer Services Ppresentation
Mirza Muhammad Yousuf
 
75 Customer Service Facts, Quotes & Statistics
Help Scout
 
Classification and Characteristics of a service
Sanchit
 
Services Marketing
Ch Usman Waheed
 
Ad

Similar to The No SQL Principles and Basic Application Of Casandra Model (20)

PPTX
Cassandra an overview
PritamKathar
 
PPTX
Appache Cassandra
nehabsairam
 
PDF
04-Introduction-to-CassandraDB-.pdf
hothyfa
 
PPTX
Cassandra tutorial
Ramakrishna kapa
 
PPTX
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
PPTX
BigData, NoSQL & ElasticSearch
Sanura Hettiarachchi
 
PDF
NOsql Presentation.pdf
AkshayDwivedi31
 
PPTX
No SQL
The lazy hoplite
 
PPTX
Cassandra Learning
Ehsan Javanmard
 
PPTX
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
PPTX
cours database pour etudiant NoSQL (1).pptx
ssuser1fde9c
 
PPTX
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
PPTX
NoSQL.pptx
RithikRaj25
 
PPTX
No sql databases
swathika rajan
 
PPTX
Big data stores
Kumaran Ramanujam
 
PPTX
NoSql
AnitaSenthilkumar
 
PPTX
Introduction to nosql | NoSQL databases
ShilpaKrishna6
 
PPTX
CASSANDRA apache cassandra apacheee.pptx
jerrygame3121
 
PPTX
cassandra_presentation_final
SergioBruno21
 
PPTX
No SQL DATABASE Description about 4 no sql database.pptx
KavithaMs10
 
Cassandra an overview
PritamKathar
 
Appache Cassandra
nehabsairam
 
04-Introduction-to-CassandraDB-.pdf
hothyfa
 
Cassandra tutorial
Ramakrishna kapa
 
Introduction to Data Science NoSQL.pptx
tarakesh7199
 
BigData, NoSQL & ElasticSearch
Sanura Hettiarachchi
 
NOsql Presentation.pdf
AkshayDwivedi31
 
Cassandra Learning
Ehsan Javanmard
 
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
cours database pour etudiant NoSQL (1).pptx
ssuser1fde9c
 
UNIT I Introduction to NoSQL.pptx
Rahul Borate
 
NoSQL.pptx
RithikRaj25
 
No sql databases
swathika rajan
 
Big data stores
Kumaran Ramanujam
 
Introduction to nosql | NoSQL databases
ShilpaKrishna6
 
CASSANDRA apache cassandra apacheee.pptx
jerrygame3121
 
cassandra_presentation_final
SergioBruno21
 
No SQL DATABASE Description about 4 no sql database.pptx
KavithaMs10
 
Ad

More from Rishikese MR (19)

PPTX
1 2 3 4 5 g
Rishikese MR
 
PPTX
Natural Language Processing
Rishikese MR
 
PPTX
Fuzzy Logic
Rishikese MR
 
PPTX
Crowd Sourcing With Smart Phone
Rishikese MR
 
PPT
BLUE BRAIN
Rishikese MR
 
PPTX
CYBORG
Rishikese MR
 
PPTX
DATA WAREHOUSING
Rishikese MR
 
PPTX
Automatic 2D to 3D Video Conversion For 3DTV's
Rishikese MR
 
PDF
Middleware and Middleware in distributed application
Rishikese MR
 
PPTX
TOR NETWORK
Rishikese MR
 
PPTX
EMOTION BASED COMPUTING
Rishikese MR
 
PPTX
BITCOIN TECHNOLOGY AND ITS USES
Rishikese MR
 
PPTX
3D OPTICAL DATA STORAGE
Rishikese MR
 
PPTX
OUTERNET
Rishikese MR
 
PPTX
OVERVIEW OF FACEBOOK SCALABLE ARCHITECTURE.
Rishikese MR
 
PDF
Google Glass and its Features
Rishikese MR
 
PDF
Virtualization and cloud Computing
Rishikese MR
 
PDF
Artificial intelligence in gaming.
Rishikese MR
 
PDF
A seminar on neo4 j
Rishikese MR
 
1 2 3 4 5 g
Rishikese MR
 
Natural Language Processing
Rishikese MR
 
Fuzzy Logic
Rishikese MR
 
Crowd Sourcing With Smart Phone
Rishikese MR
 
BLUE BRAIN
Rishikese MR
 
CYBORG
Rishikese MR
 
DATA WAREHOUSING
Rishikese MR
 
Automatic 2D to 3D Video Conversion For 3DTV's
Rishikese MR
 
Middleware and Middleware in distributed application
Rishikese MR
 
TOR NETWORK
Rishikese MR
 
EMOTION BASED COMPUTING
Rishikese MR
 
BITCOIN TECHNOLOGY AND ITS USES
Rishikese MR
 
3D OPTICAL DATA STORAGE
Rishikese MR
 
OUTERNET
Rishikese MR
 
OVERVIEW OF FACEBOOK SCALABLE ARCHITECTURE.
Rishikese MR
 
Google Glass and its Features
Rishikese MR
 
Virtualization and cloud Computing
Rishikese MR
 
Artificial intelligence in gaming.
Rishikese MR
 
A seminar on neo4 j
Rishikese MR
 

Recently uploaded (20)

PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PPTX
AVL ( audio, visuals or led ), technology.
Rajeshwri Panchal
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
NewMind AI Weekly Chronicles – July’25, Week III
NewMind AI
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PPTX
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
AVL ( audio, visuals or led ), technology.
Rajeshwri Panchal
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
NewMind AI Weekly Chronicles – July’25, Week III
NewMind AI
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 

The No SQL Principles and Basic Application Of Casandra Model

  • 1. TThhee NNooSSQQLL PPrriinncciipplleess && BBaassiicc AApppplliiccaattiioonn ooff CCaassssaannddrraa MMooddeell RReesshhmmii RRaaddhhaakkrriisshhnnaann SS77 CCSS BB RRoollll NNOO::7711 GGuuiiddeedd bbyy,, DDrr..SSuuddhheeeepp EEllaayyiiddoomm SSOOEE CCUUSSAATT
  • 2. AAGGEENNDDAA • INTRODUCTION • RDBMS • CHALLENGE • NoSQL • COMMON CONCEPTS • CLASSIFICATION • CASSANDRA • CASSANDRA CHARACTERISTICS • CONCLUSION • REFERENCES
  • 3. INTRODUCTION * RDBMS is the predominant technology for storing structured data in web and business applications. * The relational database systems have little capability to horizontally scale. * NoSQL approach includes simplicity of design, horizontal scaling and finer control over availability * Cassandra is the right NoSQL database when you need scalability and high availability without compromising performance
  • 4. RDBMS MERITS: • Rich language • Easy to use and integrate • Rich toolset • Vertical scaling • The promise: ACID o Atomicity o Consistency o Isolation o Durability
  • 5. RDBMS DEMERTS: • Vertical scaling is highly expensive • Fails to handle large amount of data • It is not much efficient in cloud concept The read-write rates of data in RDBMS are very poor So it is hard to face the challenges from the modern web applications using RDBMS
  • 6. The Challenge: Modern web apps • Internet-scale data size • High read-write rates • Frequent schema changes • "social" apps - not banks o They don't need the same level of ACID SCALING
  • 7. NoSQL • Uses horizontal scaling • Distribute data over many servers • It give up ACID property • Based on CAP -theorem
  • 8. Brewer's CAP Theorem: You can only choose two
  • 9. CAP Consistency: A distributed system is considered to be consistent if after an update operation of some writer, all readers see his updates in some shared data sources Availability: System is designed in a way that continue operation even if nodes in a cluster crash Partition Tolerance: Ability of a system to continue operation in the presence of network partition
  • 10. COMMON CONCEPTS •Sharding •Consistent hashing •Map reduce
  • 11. SHARDING • it's a partitioning mechanism • records are stored in different servers according to some key • records that are accesses/updated together reside on same node • load is almost evenly distributed among servers • vertical partitioning: parts of single records are stored on different servers
  • 12. Constant Hashing • A,B,C:- NODES • 1,2,3,4:- OBJECTS -> both are placed in ring ->movement is clockwise ->nodes can leave the system ->nodes can enter into the system
  • 13. MAP REDUCE •Used in distributed computig • map function • reduce function • process on key/value
  • 15. Classification of NOSQL data stores • Document Oriented o CouchDB, MongoDB, Lotus Notes, SimpleDB • Key-Value oriented o Voldemort, Dynamo, Riak (sort of), Redis, Tokyo • Column oriented o Cassandra, HBase, BigTable • Graph Databases oriented o Neo4J, FlockDB, DEX, AlegroGraph
  • 16. • Developed at facebook • Follows the BigTable Data Model - column oriented • Follows the Dynamo Eventual Consistency model • Opensourced at Apache • Implemented in Java
  • 17. • Distributed Storage System • Manages structured data and scale to large size
  • 18. Data Model • Instance of Cassandra – Consists of one table represent multidimensional map indexed by a key • Query for information: get(keyspace, column family, row key) • Keyspace:- top level unit of information, Column families are subordinate of one key space • Column :- atomic unit of information, expressed as: name:value • Super Column :- groups together like columns with a common name, Used for modeling complex data types (address)
  • 19. Data Model ROW FAMILY • uniquely identifiable data • groups column and super column • Every row are identified by row key COLUMN FAMILY • It have to be define in advance before a cluster of servers in Cassandra instance is launched • It consists of keyed rows which groups columns and super columns • Column and super column added dynamically to column families and they are not restricted in numbers
  • 21. MEMTABLES • In-memory representation of recently written data • When the table is full, it's sorted and then flushed to disk -> sstable SS TABLES Sorted Strings Tables • Unchangeable • On-disk • Sorted by a string key • In-memory index of elements • Binary search (in memory) to find element location • Bloom filter to reduce number of unneeded binary searches. WRITE PROPERTIES • No Locks in the critical path • Always available to writes, even if there are failures. No seeks • Fast • Atomic within a Row
  • 23. Read Properteis • Read multiple SSTables • Slower than writes (but still fast) • Seeks can be mitigated with more RAM • Uses probabilistic bloom filters to reduce lookups. • Extensive optional caching o Key Cache o Row Cache
  • 24. Bloom Filters • Space efficient probabilistic data structure • Test whether an element is a member of a set • Union and intersection are implemented as bitwise OR, AND
  • 25. QUERIES EXAMPLES CREATE INDEX CREATE CUSTOM INDEX IF NOT EXISTS index_name ON keyspace_name.table_name ( KEYS (column_name) ) ( USING class_name ) ( WITH OPTIONS = map ) Restrictions: USING class_name is allowed only if CUSTOM is used and class_name is a string literal containing a java class name. index_name is an identifier, enclosed or not enclosed in double quotation marks, excluding reserved words. map is described in ALTER KEYSPACE. CREATE KEYSPACE CREATE ( KEYSPACE | SCHEMA ) IF NOT EXISTS keyspace_name WITH REPLICATION = map AND DURABLE_WRITES = ( true | false )
  • 26. MySQL Comparison • MySQL : for 50 GB Data Writes Average : ~300 ms Reads Average : ~350 ms • Cassandra: for 50 GB Data Writes Average : 0.12 ms Reads Average : 15 ms
  • 27. CONCLUSION •NoSQL is highly efficient concept for dealing large amount of data. •It can be used to solve big data problem. •Cassandra model can provide fast reading and writing operations •So this database model is used by all the latest social networking medias
  • 28. REFERENCES **The NoSQL Principles and Basic Application of Cassandra Model Guoxi Wang ; Jianfeng Tang Computer Science & Service System (CSSS), 2012 International Conference on Digital Object Identifier: 10.1109/CSSS.2012.336 Publication Year: 2012 , Page(s): 1332 - 1335 IEEE CONFERENCE PUBLICATIONS **Survey on NoSQL database Jing Han ; Haihong, E. ; Guan Le ; Jian Du Pervasive Computing and Applications (ICPCA), 2011 6th InternationalConferenceon Digital Object Identifier: 10.1109/ICPCA.2011.6106531 Publication Year: 2011 , Page(s): 363 - 366 Cited by: Papers (2) IEEE CONFERENCE PUBLICATIONS