SlideShare a Scribd company logo
IJSRD - International Journal for Scientific Research & Development| Vol. 3, Issue 10, 2015 | ISSN (online): 2321-0613
All rights reserved by www.ijsrd.com 832
A Study on Mongodb Database
Kavya. S
M Tech. Student
Department of Computer Science
Mount Zion College of Engineering, A. P. J. Abdul Kalam Technological University, Kadammanitta
P.O, Pathanamthitta, Kerala, India
Abstract— This paper trying to focus on main features,
advantages and applications of non-relational database
namely Mongo DB and thus justifying why MongoDB is
more suitable than relational databases in big data
applications. The database used here for comparison with
MongoDB is MySQL. The main features of MongoDB are
flexibility, scalability, auto sharding and replication.
MongoDB is used in big data and real time web applications
since it is a leading database technology.
Key words: NoSQL, MongoDB, auto sharding, aggregation
I. INTRODUCTION
Relational database management systems came into
existence since 1980’s.They are a common choice of storage
of information in new databases used for financial records,
manufacturing and logistical information personnel data and
other applications. They work efficiently when they handle a
limited amount of data. Due to the emergence of
applications that support millions of users simultaneously an
appropriate database is required. To handle huge volume of
data traditional relational database is inefficient. To
overcome the difficulty in handling huge volume of data ,the
term NoSQL was introduced by Crlo Strozzi in 1998.It
refers to non-relational databases. More recently,the term
has received another meaning namely Not Only SQL.The
main advantage of NoSQL database is that it can handle
both unstructured(e-mail,multimedia,social media) and semi
structured data very efficiently. Mainly there are four
categories of databases namely Key –value store, document
store, column oriented and graph database. MongoDB is a
cross platform document oriented database, first developed
by the software company MongoDB Inc., in October 2007
as a component of planned platform as a service product.
The company shifted to an open source development model
in 2009.Since, then MongoDB has been adopted as a
backend software by a number of major websites and
services. These include Craigslist, eBay, Foursquare and
Newyork Times. MongoDB is written in c++ and provides
high availability, easy scalability and better performance.
MongoDB works on the concept of collection and
document. A database is a physical container for collections.
Collection is a group of MongoDB documents. It is
equivalent to RDBMS table. MongoDB database contain
multiple collections. A document is a set of key-value pairs.
Documents have dynamic schema. That means, documents
in the same collection do not need to have the same structure
and common fields. MongoDB supports dynamic queries on
documents using a document based query language that is
nearly as powerful as SQL.It stores the data in the form of
JSON documents. Auto sharding, replication and high
availability are the main features of MongoDB.It is
commonly used in big data, content management and
delivery, mobile and social infrastructure, user data
management and data hub. MongoDB supports different
data types such as String, Integer, Boolean,
Double,Min/Max keys, Arrays, Timestamp, Oject, Null,
Symbol, date, Object ID, Binary data, code and regular
expressions.
II. MONGODB DATA MODEL
MongoDB stores data as documents which are in the BSON
format. BSON is binary representation of JSON document.
Documents having similar structure are organized as
collections. Collection is analogous to a table in relational
database. Documents and fields in MongoDB are
represented using the terms row and columns respectively in
MySQL. The difference between the relational database,
MySQL and non-relational database ,MongoDB is that in
relational database information for a given record is usually
spread across many tables, whereas in MongoDB the
documents tend to have all data for a given record in a
single document.
MySQL MongoDB
Table Collection
Row BSON document
Column BSON field
JOIN Embedded documents and Linking
GROUP BY Aggregation
Primary key Primary key
Table 1:
III. FEATURES OF MONGODB
MongoDB has a flexible data model. That means data can
be stored in any structure. This feature also allows
modification of data in an easy way. Another main feature is
elastic scalability. All NoSQL databases contain some form
of sharding or partitioning.This allows the database to scale
out on hardware. Thereby allowing almost unlimited
growth. MongoDB provides high performance than
traditional relational databases. The performance of
MongoDB is measured in terms of both throughput and
latency at any scale. MongoDB does not use join operation,
instead they use embedding of documents and linking.
Because the data in MongoDB is more localized.This
localization dramatically reduces the need to join separate
tables. Each document structure in MongoDB database can
vary from one another. If there is a need to create a new
field in any one of the document, then the field can be
created without affecting a central system catalog and
without taking the system offline. In MongoDB, field
updates can be done easily. It provides rich data model. Data
locality and dynamic schema are other main features of
MongoDB. The main feature of MongoDB includes
querying, aggregation, indexing and auto sharding.
Indexes play a major role in providing efficient
access to data,for both read and write operations,which are
A Study on Mongodb Database
(IJSRD/Vol. 3/Issue 10/2015/182)
All rights reserved by www.ijsrd.com 833
supported natively by the database rather than maintained in
application code. MongoDB supports many queries, mainly
for highly scalable operational applications. The result of
query execution can be a document or subset of specific
fields within the document.
Different types of query provided by MongoDB
include key value queries, range queries, geo spatial queries,
search queries, text search queries, aggregation framework
queries and map reduce queries. Replica sets are another
feature of MongoDB which is a fail over mechanism. Only
the primary database allows write operation. Multiple
secondary servers are used for read operation. For a replica
set, minimum three servers is required. Of the three servers,
one is primary server, other is secondary server and the
remaining one is arbiter server. Arbiter server is not used for
storing data. They are used only during failover time to
determine which server will be the next primary server.
Another feature is auto sharding. This feature is used to
overcome the hardware limitations. Hardware limitations
means bottleneck in RAM/disk I/O. This feature of
MongoDB helps to distribute data across physical partitions.
These physical partitions are called as shards. Thus data is
automatically balanced in the clusters as the data grows. In
relational database sharding is not built into the database.
An aggregate is a group of related entities and value object.
Maximum document size in MongoDB is 16 MB and large
documents are handled with Grid FS.MongoDB runs on OSs
such as Windows, Linux, Mac and Solaris.
IV. COMPARISON OF MONGODB VS MYSQL
MongoDB Commands
SELECT * FROM table db.collection.find()
SELECT * FROM table
WHERE user=’Akshay’
db.collection.find({user=”Akshay”})
SELECT * FROM table
ORDER BY Age
Db.collection.find.
DISTINCT .distinct()
GROUP .group()
Table 2: Fig (a) Retrieval of data in MySQL and MongoDB
Modeling of data in MongoDB database differs from
relational database. Different modeling styles can be applied
depending on the requirement of the application. Most
common modeling styles are embedding of documents and
normalization on collections. The embedding feature has a
disadvantage. That is, it may cause the situation that
documents grow in size after creation which may degrade
the performance of database.
Col 1
Col 2
Col 3
Table 3: Fig(b)Data modeling by embedding of documents
Example of Embedded documents having one to
one relationship is shown below.
{_id:1,Name:”Akshay Anand”,Address :{
City”:”Kochi”,Country:”India”}}
Example of Embedded documents having one to many
relationship is shown below
{_id:1,Name: “Akshay Anand”,Children
:[{Name:”Aravind”,Age:2},{Name:”Anupama”,Age:4}]
This shows that array of values can be stored easily in
MongoDB.
MongoDB supports denormalization.It is a process
of reducing number of physical tables which are accessed
more frequently to reduce the query processing time.This
process reduces number of joins required to design the query
to get desired output.
Col-1 Col-2
Table 4: Norm.
Col 1 Col 3
Table 5: Fig(c) Normalization.
Col 1 Col 2 Col 3
Table 6: Fig (d)D normalization.
In MySQL,the concept of normalization is used.
This concept was first introduced by E.F.Codd. The
objectives of normalization process include well
organization of data, minimizing update anomalies and
maximizing data accessibility. In ©,a common key is used
to refer the tables Table1_Norm and Table2_Norm.In the
next figure, the tables are merged together. Embedding is
similar to denormalizationbut still little variation is there.
Embedding of documents give better performance than
normalization on collections.
V. ADVANTAGES OF MONGODB
It is schema less. MongoDB database belongs to document
store category in which one collection holds different
different documents. Number of fields, content and size of
the document can be different in each document. The main
advantage of MongoDB database is that structure of a single
object is clear. It does not contain complex joins.It has deep
query ability. Easy of scale out is another major advantage.
In this type of database, conversion/mapping of application
objects to database objects not needed. MongoDB uses the
internal memory for storing the work set there by enabling
faster access of data. It provides index on any attribute. The
secondary indexes supported by the MongoDB database
make them transparent to developers.
VI. APPLICATIONS OF MONGODB
They are widely used in big data and real time web
applications such as Facebook, Yahoo, Google and Amazon.
It is also used in content management and delivery. It can be
used in mobile and social infrastructure. For user data
A Study on Mongodb Database
(IJSRD/Vol. 3/Issue 10/2015/182)
All rights reserved by www.ijsrd.com 834
management the best choice among NoSQL database is
MongoDB. It finds application in data hub also.It is the best
choice for a small or medium sized non –critical sensor
applications, especially when write performance is
important.
VII. CONCLUSION
As NoSQL trend is relatively new, many researchers are
attracted to this category of databases. NoSQL databases
such as MongoDB and its key-value stores provide an
efficient framework to aggregate large volumes of data.
MongoDB can store complex data like array,object or
reference into one field. Mapping of objects is very easy in
this type of database. The features of MongoDB like auto
sharding and replication of data make the development
faster than MySQL. MongoDB provides flexibility,
horizontal scalability, auto sharding and replication.
MongoDB is a better choice for big data applications than
MySQL database. It gives better performance than relational
database. Depending on the requirements of application, we
can choose the suitable NoSQL database.
REFERENCES
[1] Mrs.Anuradha Kanade, Dr.Arpita Gopal,Mr.Shanthanu
Kanade“A Study of Normalization and Embedding in
MongoDB”Advance Computing Conference (IACC),
2014 IEEE International
[2] Cornelia GYORODI,Robert GYORODI,George
PECHERLE,Andrada OLAH “A comparative
study:MongoDB vs MySQL”Engineering of Modern
Electric Systems (EMES), 2015 13th International
Conference on11-12 June 2015.
[3] K.Sanobar,M.Vanita,”SQL Support over MongoDB
using Metadata”

More Related Content

PDF
ISWC2023-McGuinnessTWC16x9FinalShort.pdf
Deborah McGuinness
 
PPTX
Soutenance Business Plan
Clément Lay
 
PDF
Application Prospect of XRT Intelligent X-ray Transmission Ore Sorter for Dry...
Serena Fu
 
PPTX
Silvana vega
30402626
 
PDF
Campus Symposium 2010
Campus Symposium GmbH
 
DOCX
Diseño de carta de correspondencia PsicoMax
cristian567
 
PPTX
Caso de Amanda Todd
Diego Leucel Navarro Iglesias
 
PPTX
Medicina roma grecia
doris_31_paola
 
ISWC2023-McGuinnessTWC16x9FinalShort.pdf
Deborah McGuinness
 
Soutenance Business Plan
Clément Lay
 
Application Prospect of XRT Intelligent X-ray Transmission Ore Sorter for Dry...
Serena Fu
 
Silvana vega
30402626
 
Campus Symposium 2010
Campus Symposium GmbH
 
Diseño de carta de correspondencia PsicoMax
cristian567
 
Caso de Amanda Todd
Diego Leucel Navarro Iglesias
 
Medicina roma grecia
doris_31_paola
 

Viewers also liked (18)

PPTX
Tecnologia educativa taller_2
SAUL ROMERO
 
PPTX
Mitos tdah
alma1111
 
PPTX
Investigacion por mayra zurita
mayraely
 
PPTX
Estabilidad laboral en personas discapacitadas (1)
estalaboral
 
PPT
Fashion Europe Net German
Fashion europe.Net unabh. Partnerin
 
PDF
Indien ist anders Infografik
Florian Blümm
 
DOCX
Trabajo de psicofisiologia
kmbgg
 
PDF
Ernährungsziel sport
Myfoodconcept
 
PDF
A Doce Conquista do Amendoim
Agricultura Sao Paulo
 
PPTX
Redes inalámbricas
Micaela Ñacato
 
PDF
Eeg umlage-hintergrundpapier-hjfell
metropolsolar
 
PPTX
La revolución industrial
Team Moonlight®
 
PPTX
Examen de computación1
Richard Silva
 
PDF
Resdes sociales
3125360641
 
PDF
16.02.17 Андрій Бобровицький - Енергоефективне житло своїми руками - 5 кейсів...
Зелена Школа
 
PPTX
19.02.17 Олена Березовська "Органічна Україна"
Зелена Школа
 
PPTX
Evaluación al maestro!
cristian567
 
Tecnologia educativa taller_2
SAUL ROMERO
 
Mitos tdah
alma1111
 
Investigacion por mayra zurita
mayraely
 
Estabilidad laboral en personas discapacitadas (1)
estalaboral
 
Fashion Europe Net German
Fashion europe.Net unabh. Partnerin
 
Indien ist anders Infografik
Florian Blümm
 
Trabajo de psicofisiologia
kmbgg
 
Ernährungsziel sport
Myfoodconcept
 
A Doce Conquista do Amendoim
Agricultura Sao Paulo
 
Redes inalámbricas
Micaela Ñacato
 
Eeg umlage-hintergrundpapier-hjfell
metropolsolar
 
La revolución industrial
Team Moonlight®
 
Examen de computación1
Richard Silva
 
Resdes sociales
3125360641
 
16.02.17 Андрій Бобровицький - Енергоефективне житло своїми руками - 5 кейсів...
Зелена Школа
 
19.02.17 Олена Березовська "Органічна Україна"
Зелена Школа
 
Evaluación al maestro!
cristian567
 
Ad

Similar to A Study on Mongodb Database (20)

PPTX
Mongo db
Gyanendra Yadav
 
PPTX
Mongodb
ASEEMSRIVASTAVA22
 
PPTX
05201349_Unit_7_FSWD_ advanced learning.pptx
ozakamal8
 
PPTX
05201349_Unit_7_FSWD_II(1) with advance.pptx
ozakamal8
 
PPTX
Nosql
ROXTAD71
 
PPTX
Nosql
Roxana Tadayon
 
PPTX
Mongo db
AbhiKhurana8
 
PDF
Mongo db dhruba
Dhrubaji Mandal ♛
 
PDF
Streaming Analytics Unit 5 notes for engineers
ManjuAppukuttan2
 
PPTX
UNIT-1 MongoDB.pptx
DharaDarji5
 
PDF
Analysis on NoSQL: MongoDB Tool
ijtsrd
 
PPT
Mongo Bb - NoSQL tutorial
Mohan Rathour
 
PPTX
Introduction to NoSQL and MongoDB
Ahmed Farag
 
DOCX
MongoDB DOC v1.5
Tharun Srinivasa
 
PPTX
nosql [Autosaved].pptx
Indrani Sen
 
PDF
MongoDB Lab Manual (1).pdf used in data science
bitragowthamkumar1
 
PPTX
Mongodb - NoSql Database
Prashant Gupta
 
PDF
Mongodb
Apurva Vyas
 
PPTX
MongoDB
Tharun Srinivasa
 
Mongo db
Gyanendra Yadav
 
05201349_Unit_7_FSWD_ advanced learning.pptx
ozakamal8
 
05201349_Unit_7_FSWD_II(1) with advance.pptx
ozakamal8
 
Nosql
ROXTAD71
 
Mongo db
AbhiKhurana8
 
Mongo db dhruba
Dhrubaji Mandal ♛
 
Streaming Analytics Unit 5 notes for engineers
ManjuAppukuttan2
 
UNIT-1 MongoDB.pptx
DharaDarji5
 
Analysis on NoSQL: MongoDB Tool
ijtsrd
 
Mongo Bb - NoSQL tutorial
Mohan Rathour
 
Introduction to NoSQL and MongoDB
Ahmed Farag
 
MongoDB DOC v1.5
Tharun Srinivasa
 
nosql [Autosaved].pptx
Indrani Sen
 
MongoDB Lab Manual (1).pdf used in data science
bitragowthamkumar1
 
Mongodb - NoSql Database
Prashant Gupta
 
Mongodb
Apurva Vyas
 
Ad

More from IJSRD (20)

PPTX
#IJSRD #Research Paper Publication
IJSRD
 
PDF
Maintaining Data Confidentiality in Association Rule Mining in Distributed En...
IJSRD
 
PDF
Performance and Emission characteristics of a Single Cylinder Four Stroke Die...
IJSRD
 
PDF
Preclusion of High and Low Pressure In Boiler by Using LABVIEW
IJSRD
 
PDF
Prevention and Detection of Man in the Middle Attack on AODV Protocol
IJSRD
 
PDF
Comparative Analysis of PAPR Reduction Techniques in OFDM Using Precoding Tec...
IJSRD
 
PDF
Evaluation the Effect of Machining Parameters on MRR of Mild Steel
IJSRD
 
PDF
Filter unwanted messages from walls and blocking nonlegitimate user in osn
IJSRD
 
PDF
Keystroke Dynamics Authentication with Project Management System
IJSRD
 
PDF
Diagnosing lungs cancer Using Neural Networks
IJSRD
 
PDF
A Survey on Sentiment Analysis and Opinion Mining
IJSRD
 
PDF
A Defect Prediction Model for Software Product based on ANFIS
IJSRD
 
PDF
Experimental Investigation of Granulated Blast Furnace Slag ond Quarry Dust a...
IJSRD
 
PDF
Product Quality Analysis based on online Reviews
IJSRD
 
PDF
Solving Fuzzy Matrix Games Defuzzificated by Trapezoidal Parabolic Fuzzy Numbers
IJSRD
 
PDF
Study of Clustering of Data Base in Education Sector Using Data Mining
IJSRD
 
PDF
Fault Tolerance in Big Data Processing Using Heartbeat Messages and Data Repl...
IJSRD
 
PDF
Investigation of Effect of Process Parameters on Maximum Temperature during F...
IJSRD
 
PDF
Review Paper on Computer Aided Design & Analysis of Rotor Shaft of a Rotavator
IJSRD
 
PDF
A Survey on Data Mining Techniques for Crime Hotspots Prediction
IJSRD
 
#IJSRD #Research Paper Publication
IJSRD
 
Maintaining Data Confidentiality in Association Rule Mining in Distributed En...
IJSRD
 
Performance and Emission characteristics of a Single Cylinder Four Stroke Die...
IJSRD
 
Preclusion of High and Low Pressure In Boiler by Using LABVIEW
IJSRD
 
Prevention and Detection of Man in the Middle Attack on AODV Protocol
IJSRD
 
Comparative Analysis of PAPR Reduction Techniques in OFDM Using Precoding Tec...
IJSRD
 
Evaluation the Effect of Machining Parameters on MRR of Mild Steel
IJSRD
 
Filter unwanted messages from walls and blocking nonlegitimate user in osn
IJSRD
 
Keystroke Dynamics Authentication with Project Management System
IJSRD
 
Diagnosing lungs cancer Using Neural Networks
IJSRD
 
A Survey on Sentiment Analysis and Opinion Mining
IJSRD
 
A Defect Prediction Model for Software Product based on ANFIS
IJSRD
 
Experimental Investigation of Granulated Blast Furnace Slag ond Quarry Dust a...
IJSRD
 
Product Quality Analysis based on online Reviews
IJSRD
 
Solving Fuzzy Matrix Games Defuzzificated by Trapezoidal Parabolic Fuzzy Numbers
IJSRD
 
Study of Clustering of Data Base in Education Sector Using Data Mining
IJSRD
 
Fault Tolerance in Big Data Processing Using Heartbeat Messages and Data Repl...
IJSRD
 
Investigation of Effect of Process Parameters on Maximum Temperature during F...
IJSRD
 
Review Paper on Computer Aided Design & Analysis of Rotor Shaft of a Rotavator
IJSRD
 
A Survey on Data Mining Techniques for Crime Hotspots Prediction
IJSRD
 

Recently uploaded (20)

PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
DOCX
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
PPTX
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
PDF
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
PPTX
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
Sonnet 130_ My Mistress’ Eyes Are Nothing Like the Sun By William Shakespear...
DhatriParmar
 
PDF
The Minister of Tourism, Culture and Creative Arts, Abla Dzifa Gomashie has e...
nservice241
 
PDF
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
PPTX
Kanban Cards _ Mass Action in Odoo 18.2 - Odoo Slides
Celine George
 
PDF
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
PPTX
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
PPTX
How to Apply for a Job From Odoo 18 Website
Celine George
 
PPTX
CDH. pptx
AneetaSharma15
 
PPTX
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
PPTX
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
PPTX
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
PPTX
Virus sequence retrieval from NCBI database
yamunaK13
 
PPTX
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
PPTX
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
Modul Ajar Deep Learning Bahasa Inggris Kelas 11 Terbaru 2025
wahyurestu63
 
INTESTINALPARASITES OR WORM INFESTATIONS.pptx
PRADEEP ABOTHU
 
Biological Classification Class 11th NCERT CBSE NEET.pdf
NehaRohtagi1
 
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
Sonnet 130_ My Mistress’ Eyes Are Nothing Like the Sun By William Shakespear...
DhatriParmar
 
The Minister of Tourism, Culture and Creative Arts, Abla Dzifa Gomashie has e...
nservice241
 
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
Kanban Cards _ Mass Action in Odoo 18.2 - Odoo Slides
Celine George
 
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
How to Apply for a Job From Odoo 18 Website
Celine George
 
CDH. pptx
AneetaSharma15
 
Tips Management in Odoo 18 POS - Odoo Slides
Celine George
 
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
20250924 Navigating the Future: How to tell the difference between an emergen...
McGuinness Institute
 
Virus sequence retrieval from NCBI database
yamunaK13
 
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 

A Study on Mongodb Database

  • 1. IJSRD - International Journal for Scientific Research & Development| Vol. 3, Issue 10, 2015 | ISSN (online): 2321-0613 All rights reserved by www.ijsrd.com 832 A Study on Mongodb Database Kavya. S M Tech. Student Department of Computer Science Mount Zion College of Engineering, A. P. J. Abdul Kalam Technological University, Kadammanitta P.O, Pathanamthitta, Kerala, India Abstract— This paper trying to focus on main features, advantages and applications of non-relational database namely Mongo DB and thus justifying why MongoDB is more suitable than relational databases in big data applications. The database used here for comparison with MongoDB is MySQL. The main features of MongoDB are flexibility, scalability, auto sharding and replication. MongoDB is used in big data and real time web applications since it is a leading database technology. Key words: NoSQL, MongoDB, auto sharding, aggregation I. INTRODUCTION Relational database management systems came into existence since 1980’s.They are a common choice of storage of information in new databases used for financial records, manufacturing and logistical information personnel data and other applications. They work efficiently when they handle a limited amount of data. Due to the emergence of applications that support millions of users simultaneously an appropriate database is required. To handle huge volume of data traditional relational database is inefficient. To overcome the difficulty in handling huge volume of data ,the term NoSQL was introduced by Crlo Strozzi in 1998.It refers to non-relational databases. More recently,the term has received another meaning namely Not Only SQL.The main advantage of NoSQL database is that it can handle both unstructured(e-mail,multimedia,social media) and semi structured data very efficiently. Mainly there are four categories of databases namely Key –value store, document store, column oriented and graph database. MongoDB is a cross platform document oriented database, first developed by the software company MongoDB Inc., in October 2007 as a component of planned platform as a service product. The company shifted to an open source development model in 2009.Since, then MongoDB has been adopted as a backend software by a number of major websites and services. These include Craigslist, eBay, Foursquare and Newyork Times. MongoDB is written in c++ and provides high availability, easy scalability and better performance. MongoDB works on the concept of collection and document. A database is a physical container for collections. Collection is a group of MongoDB documents. It is equivalent to RDBMS table. MongoDB database contain multiple collections. A document is a set of key-value pairs. Documents have dynamic schema. That means, documents in the same collection do not need to have the same structure and common fields. MongoDB supports dynamic queries on documents using a document based query language that is nearly as powerful as SQL.It stores the data in the form of JSON documents. Auto sharding, replication and high availability are the main features of MongoDB.It is commonly used in big data, content management and delivery, mobile and social infrastructure, user data management and data hub. MongoDB supports different data types such as String, Integer, Boolean, Double,Min/Max keys, Arrays, Timestamp, Oject, Null, Symbol, date, Object ID, Binary data, code and regular expressions. II. MONGODB DATA MODEL MongoDB stores data as documents which are in the BSON format. BSON is binary representation of JSON document. Documents having similar structure are organized as collections. Collection is analogous to a table in relational database. Documents and fields in MongoDB are represented using the terms row and columns respectively in MySQL. The difference between the relational database, MySQL and non-relational database ,MongoDB is that in relational database information for a given record is usually spread across many tables, whereas in MongoDB the documents tend to have all data for a given record in a single document. MySQL MongoDB Table Collection Row BSON document Column BSON field JOIN Embedded documents and Linking GROUP BY Aggregation Primary key Primary key Table 1: III. FEATURES OF MONGODB MongoDB has a flexible data model. That means data can be stored in any structure. This feature also allows modification of data in an easy way. Another main feature is elastic scalability. All NoSQL databases contain some form of sharding or partitioning.This allows the database to scale out on hardware. Thereby allowing almost unlimited growth. MongoDB provides high performance than traditional relational databases. The performance of MongoDB is measured in terms of both throughput and latency at any scale. MongoDB does not use join operation, instead they use embedding of documents and linking. Because the data in MongoDB is more localized.This localization dramatically reduces the need to join separate tables. Each document structure in MongoDB database can vary from one another. If there is a need to create a new field in any one of the document, then the field can be created without affecting a central system catalog and without taking the system offline. In MongoDB, field updates can be done easily. It provides rich data model. Data locality and dynamic schema are other main features of MongoDB. The main feature of MongoDB includes querying, aggregation, indexing and auto sharding. Indexes play a major role in providing efficient access to data,for both read and write operations,which are
  • 2. A Study on Mongodb Database (IJSRD/Vol. 3/Issue 10/2015/182) All rights reserved by www.ijsrd.com 833 supported natively by the database rather than maintained in application code. MongoDB supports many queries, mainly for highly scalable operational applications. The result of query execution can be a document or subset of specific fields within the document. Different types of query provided by MongoDB include key value queries, range queries, geo spatial queries, search queries, text search queries, aggregation framework queries and map reduce queries. Replica sets are another feature of MongoDB which is a fail over mechanism. Only the primary database allows write operation. Multiple secondary servers are used for read operation. For a replica set, minimum three servers is required. Of the three servers, one is primary server, other is secondary server and the remaining one is arbiter server. Arbiter server is not used for storing data. They are used only during failover time to determine which server will be the next primary server. Another feature is auto sharding. This feature is used to overcome the hardware limitations. Hardware limitations means bottleneck in RAM/disk I/O. This feature of MongoDB helps to distribute data across physical partitions. These physical partitions are called as shards. Thus data is automatically balanced in the clusters as the data grows. In relational database sharding is not built into the database. An aggregate is a group of related entities and value object. Maximum document size in MongoDB is 16 MB and large documents are handled with Grid FS.MongoDB runs on OSs such as Windows, Linux, Mac and Solaris. IV. COMPARISON OF MONGODB VS MYSQL MongoDB Commands SELECT * FROM table db.collection.find() SELECT * FROM table WHERE user=’Akshay’ db.collection.find({user=”Akshay”}) SELECT * FROM table ORDER BY Age Db.collection.find. DISTINCT .distinct() GROUP .group() Table 2: Fig (a) Retrieval of data in MySQL and MongoDB Modeling of data in MongoDB database differs from relational database. Different modeling styles can be applied depending on the requirement of the application. Most common modeling styles are embedding of documents and normalization on collections. The embedding feature has a disadvantage. That is, it may cause the situation that documents grow in size after creation which may degrade the performance of database. Col 1 Col 2 Col 3 Table 3: Fig(b)Data modeling by embedding of documents Example of Embedded documents having one to one relationship is shown below. {_id:1,Name:”Akshay Anand”,Address :{ City”:”Kochi”,Country:”India”}} Example of Embedded documents having one to many relationship is shown below {_id:1,Name: “Akshay Anand”,Children :[{Name:”Aravind”,Age:2},{Name:”Anupama”,Age:4}] This shows that array of values can be stored easily in MongoDB. MongoDB supports denormalization.It is a process of reducing number of physical tables which are accessed more frequently to reduce the query processing time.This process reduces number of joins required to design the query to get desired output. Col-1 Col-2 Table 4: Norm. Col 1 Col 3 Table 5: Fig(c) Normalization. Col 1 Col 2 Col 3 Table 6: Fig (d)D normalization. In MySQL,the concept of normalization is used. This concept was first introduced by E.F.Codd. The objectives of normalization process include well organization of data, minimizing update anomalies and maximizing data accessibility. In ©,a common key is used to refer the tables Table1_Norm and Table2_Norm.In the next figure, the tables are merged together. Embedding is similar to denormalizationbut still little variation is there. Embedding of documents give better performance than normalization on collections. V. ADVANTAGES OF MONGODB It is schema less. MongoDB database belongs to document store category in which one collection holds different different documents. Number of fields, content and size of the document can be different in each document. The main advantage of MongoDB database is that structure of a single object is clear. It does not contain complex joins.It has deep query ability. Easy of scale out is another major advantage. In this type of database, conversion/mapping of application objects to database objects not needed. MongoDB uses the internal memory for storing the work set there by enabling faster access of data. It provides index on any attribute. The secondary indexes supported by the MongoDB database make them transparent to developers. VI. APPLICATIONS OF MONGODB They are widely used in big data and real time web applications such as Facebook, Yahoo, Google and Amazon. It is also used in content management and delivery. It can be used in mobile and social infrastructure. For user data
  • 3. A Study on Mongodb Database (IJSRD/Vol. 3/Issue 10/2015/182) All rights reserved by www.ijsrd.com 834 management the best choice among NoSQL database is MongoDB. It finds application in data hub also.It is the best choice for a small or medium sized non –critical sensor applications, especially when write performance is important. VII. CONCLUSION As NoSQL trend is relatively new, many researchers are attracted to this category of databases. NoSQL databases such as MongoDB and its key-value stores provide an efficient framework to aggregate large volumes of data. MongoDB can store complex data like array,object or reference into one field. Mapping of objects is very easy in this type of database. The features of MongoDB like auto sharding and replication of data make the development faster than MySQL. MongoDB provides flexibility, horizontal scalability, auto sharding and replication. MongoDB is a better choice for big data applications than MySQL database. It gives better performance than relational database. Depending on the requirements of application, we can choose the suitable NoSQL database. REFERENCES [1] Mrs.Anuradha Kanade, Dr.Arpita Gopal,Mr.Shanthanu Kanade“A Study of Normalization and Embedding in MongoDB”Advance Computing Conference (IACC), 2014 IEEE International [2] Cornelia GYORODI,Robert GYORODI,George PECHERLE,Andrada OLAH “A comparative study:MongoDB vs MySQL”Engineering of Modern Electric Systems (EMES), 2015 13th International Conference on11-12 June 2015. [3] K.Sanobar,M.Vanita,”SQL Support over MongoDB using Metadata”