SlideShare a Scribd company logo
IJSRD - International Journal for Scientific Research & Development| Vol. 3, Issue 10, 2015 | ISSN (online): 2321-0613
All rights reserved by www.ijsrd.com 832
A Study on Mongodb Database
Kavya. S
M Tech. Student
Department of Computer Science
Mount Zion College of Engineering, A. P. J. Abdul Kalam Technological University, Kadammanitta
P.O, Pathanamthitta, Kerala, India
Abstract— This paper trying to focus on main features,
advantages and applications of non-relational database
namely Mongo DB and thus justifying why MongoDB is
more suitable than relational databases in big data
applications. The database used here for comparison with
MongoDB is MySQL. The main features of MongoDB are
flexibility, scalability, auto sharding and replication.
MongoDB is used in big data and real time web applications
since it is a leading database technology.
Key words: NoSQL, MongoDB, auto sharding, aggregation
I. INTRODUCTION
Relational database management systems came into
existence since 1980’s.They are a common choice of storage
of information in new databases used for financial records,
manufacturing and logistical information personnel data and
other applications. They work efficiently when they handle a
limited amount of data. Due to the emergence of
applications that support millions of users simultaneously an
appropriate database is required. To handle huge volume of
data traditional relational database is inefficient. To
overcome the difficulty in handling huge volume of data ,the
term NoSQL was introduced by Crlo Strozzi in 1998.It
refers to non-relational databases. More recently,the term
has received another meaning namely Not Only SQL.The
main advantage of NoSQL database is that it can handle
both unstructured(e-mail,multimedia,social media) and semi
structured data very efficiently. Mainly there are four
categories of databases namely Key –value store, document
store, column oriented and graph database. MongoDB is a
cross platform document oriented database, first developed
by the software company MongoDB Inc., in October 2007
as a component of planned platform as a service product.
The company shifted to an open source development model
in 2009.Since, then MongoDB has been adopted as a
backend software by a number of major websites and
services. These include Craigslist, eBay, Foursquare and
Newyork Times. MongoDB is written in c++ and provides
high availability, easy scalability and better performance.
MongoDB works on the concept of collection and
document. A database is a physical container for collections.
Collection is a group of MongoDB documents. It is
equivalent to RDBMS table. MongoDB database contain
multiple collections. A document is a set of key-value pairs.
Documents have dynamic schema. That means, documents
in the same collection do not need to have the same structure
and common fields. MongoDB supports dynamic queries on
documents using a document based query language that is
nearly as powerful as SQL.It stores the data in the form of
JSON documents. Auto sharding, replication and high
availability are the main features of MongoDB.It is
commonly used in big data, content management and
delivery, mobile and social infrastructure, user data
management and data hub. MongoDB supports different
data types such as String, Integer, Boolean,
Double,Min/Max keys, Arrays, Timestamp, Oject, Null,
Symbol, date, Object ID, Binary data, code and regular
expressions.
II. MONGODB DATA MODEL
MongoDB stores data as documents which are in the BSON
format. BSON is binary representation of JSON document.
Documents having similar structure are organized as
collections. Collection is analogous to a table in relational
database. Documents and fields in MongoDB are
represented using the terms row and columns respectively in
MySQL. The difference between the relational database,
MySQL and non-relational database ,MongoDB is that in
relational database information for a given record is usually
spread across many tables, whereas in MongoDB the
documents tend to have all data for a given record in a
single document.
MySQL MongoDB
Table Collection
Row BSON document
Column BSON field
JOIN Embedded documents and Linking
GROUP BY Aggregation
Primary key Primary key
Table 1:
III. FEATURES OF MONGODB
MongoDB has a flexible data model. That means data can
be stored in any structure. This feature also allows
modification of data in an easy way. Another main feature is
elastic scalability. All NoSQL databases contain some form
of sharding or partitioning.This allows the database to scale
out on hardware. Thereby allowing almost unlimited
growth. MongoDB provides high performance than
traditional relational databases. The performance of
MongoDB is measured in terms of both throughput and
latency at any scale. MongoDB does not use join operation,
instead they use embedding of documents and linking.
Because the data in MongoDB is more localized.This
localization dramatically reduces the need to join separate
tables. Each document structure in MongoDB database can
vary from one another. If there is a need to create a new
field in any one of the document, then the field can be
created without affecting a central system catalog and
without taking the system offline. In MongoDB, field
updates can be done easily. It provides rich data model. Data
locality and dynamic schema are other main features of
MongoDB. The main feature of MongoDB includes
querying, aggregation, indexing and auto sharding.
Indexes play a major role in providing efficient
access to data,for both read and write operations,which are
A Study on Mongodb Database
(IJSRD/Vol. 3/Issue 10/2015/182)
All rights reserved by www.ijsrd.com 833
supported natively by the database rather than maintained in
application code. MongoDB supports many queries, mainly
for highly scalable operational applications. The result of
query execution can be a document or subset of specific
fields within the document.
Different types of query provided by MongoDB
include key value queries, range queries, geo spatial queries,
search queries, text search queries, aggregation framework
queries and map reduce queries. Replica sets are another
feature of MongoDB which is a fail over mechanism. Only
the primary database allows write operation. Multiple
secondary servers are used for read operation. For a replica
set, minimum three servers is required. Of the three servers,
one is primary server, other is secondary server and the
remaining one is arbiter server. Arbiter server is not used for
storing data. They are used only during failover time to
determine which server will be the next primary server.
Another feature is auto sharding. This feature is used to
overcome the hardware limitations. Hardware limitations
means bottleneck in RAM/disk I/O. This feature of
MongoDB helps to distribute data across physical partitions.
These physical partitions are called as shards. Thus data is
automatically balanced in the clusters as the data grows. In
relational database sharding is not built into the database.
An aggregate is a group of related entities and value object.
Maximum document size in MongoDB is 16 MB and large
documents are handled with Grid FS.MongoDB runs on OSs
such as Windows, Linux, Mac and Solaris.
IV. COMPARISON OF MONGODB VS MYSQL
MongoDB Commands
SELECT * FROM table db.collection.find()
SELECT * FROM table
WHERE user=’Akshay’
db.collection.find({user=”Akshay”})
SELECT * FROM table
ORDER BY Age
Db.collection.find.
DISTINCT .distinct()
GROUP .group()
Table 2: Fig (a) Retrieval of data in MySQL and MongoDB
Modeling of data in MongoDB database differs from
relational database. Different modeling styles can be applied
depending on the requirement of the application. Most
common modeling styles are embedding of documents and
normalization on collections. The embedding feature has a
disadvantage. That is, it may cause the situation that
documents grow in size after creation which may degrade
the performance of database.
Col 1
Col 2
Col 3
Table 3: Fig(b)Data modeling by embedding of documents
Example of Embedded documents having one to
one relationship is shown below.
{_id:1,Name:”Akshay Anand”,Address :{
City”:”Kochi”,Country:”India”}}
Example of Embedded documents having one to many
relationship is shown below
{_id:1,Name: “Akshay Anand”,Children
:[{Name:”Aravind”,Age:2},{Name:”Anupama”,Age:4}]
This shows that array of values can be stored easily in
MongoDB.
MongoDB supports denormalization.It is a process
of reducing number of physical tables which are accessed
more frequently to reduce the query processing time.This
process reduces number of joins required to design the query
to get desired output.
Col-1 Col-2
Table 4: Norm.
Col 1 Col 3
Table 5: Fig(c) Normalization.
Col 1 Col 2 Col 3
Table 6: Fig (d)D normalization.
In MySQL,the concept of normalization is used.
This concept was first introduced by E.F.Codd. The
objectives of normalization process include well
organization of data, minimizing update anomalies and
maximizing data accessibility. In ©,a common key is used
to refer the tables Table1_Norm and Table2_Norm.In the
next figure, the tables are merged together. Embedding is
similar to denormalizationbut still little variation is there.
Embedding of documents give better performance than
normalization on collections.
V. ADVANTAGES OF MONGODB
It is schema less. MongoDB database belongs to document
store category in which one collection holds different
different documents. Number of fields, content and size of
the document can be different in each document. The main
advantage of MongoDB database is that structure of a single
object is clear. It does not contain complex joins.It has deep
query ability. Easy of scale out is another major advantage.
In this type of database, conversion/mapping of application
objects to database objects not needed. MongoDB uses the
internal memory for storing the work set there by enabling
faster access of data. It provides index on any attribute. The
secondary indexes supported by the MongoDB database
make them transparent to developers.
VI. APPLICATIONS OF MONGODB
They are widely used in big data and real time web
applications such as Facebook, Yahoo, Google and Amazon.
It is also used in content management and delivery. It can be
used in mobile and social infrastructure. For user data
A Study on Mongodb Database
(IJSRD/Vol. 3/Issue 10/2015/182)
All rights reserved by www.ijsrd.com 834
management the best choice among NoSQL database is
MongoDB. It finds application in data hub also.It is the best
choice for a small or medium sized non –critical sensor
applications, especially when write performance is
important.
VII. CONCLUSION
As NoSQL trend is relatively new, many researchers are
attracted to this category of databases. NoSQL databases
such as MongoDB and its key-value stores provide an
efficient framework to aggregate large volumes of data.
MongoDB can store complex data like array,object or
reference into one field. Mapping of objects is very easy in
this type of database. The features of MongoDB like auto
sharding and replication of data make the development
faster than MySQL. MongoDB provides flexibility,
horizontal scalability, auto sharding and replication.
MongoDB is a better choice for big data applications than
MySQL database. It gives better performance than relational
database. Depending on the requirements of application, we
can choose the suitable NoSQL database.
REFERENCES
[1] Mrs.Anuradha Kanade, Dr.Arpita Gopal,Mr.Shanthanu
Kanade“A Study of Normalization and Embedding in
MongoDB”Advance Computing Conference (IACC),
2014 IEEE International
[2] Cornelia GYORODI,Robert GYORODI,George
PECHERLE,Andrada OLAH “A comparative
study:MongoDB vs MySQL”Engineering of Modern
Electric Systems (EMES), 2015 13th International
Conference on11-12 June 2015.
[3] K.Sanobar,M.Vanita,”SQL Support over MongoDB
using Metadata”

More Related Content

PPTX
SQL Server Integration Services
Robert MacLean
 
PPTX
Sequelize
Tarek Raihan
 
PDF
Reporting with Oracle Application Express (APEX)
Dimitri Gielis
 
PPTX
MS SQL SERVER: SSIS and data mining
DataminingTools Inc
 
PPTX
Introduction of ssis
deepakk073
 
PPT
Ssis 2008
maha2886
 
PPTX
Software Development
Juan Victor Minaya León
 
PPTX
Entity relationship diagram - Concept on normalization
Satya Pal
 
SQL Server Integration Services
Robert MacLean
 
Sequelize
Tarek Raihan
 
Reporting with Oracle Application Express (APEX)
Dimitri Gielis
 
MS SQL SERVER: SSIS and data mining
DataminingTools Inc
 
Introduction of ssis
deepakk073
 
Ssis 2008
maha2886
 
Software Development
Juan Victor Minaya León
 
Entity relationship diagram - Concept on normalization
Satya Pal
 

What's hot (20)

PDF
RDF 해설서
Hansung University
 
PPT
JAD - Joint Applications Development
John Crosby
 
PPTX
SSIS control flow
Slava Kokaev
 
PDF
Talend Open Studio Data Integration
Roberto Marchetto
 
PPT
Enhanced E-R diagram
Mayank Jain
 
PPTX
SQL Server vs Postgres
chandra sekhar pathivada , PMP,ITIL,MCTS
 
PPTX
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
Juan Sequeda
 
PPT
Introducción a UML y Diagrama de Casos de Uso
Yaskelly Yedra
 
PPTX
Component object model and
Saransh Garg
 
PDF
IBM File Net P8
Mohammed El Rafie Tarabay
 
PPTX
Chapter 1 introduction to sql server
baabtra.com - No. 1 supplier of quality freshers
 
PPT
Dbms models
devgocool
 
PPTX
Oracle
Guissellacriss
 
PPT
Introduction to 3 tier architecture of DBMS basics and details
ShrutiArora343479
 
PDF
Mapping ER and EER Model
Mary Brinda
 
PDF
Tryb przypuszczający w języku niemieckim (Konjunktiv) - Marcin Perliński
Asgard71
 
PPTX
Xml schema
Akshaya Akshaya
 
PPTX
Oracle apex training
Vasudha India
 
DOCX
Diseño con uml, caso
cams21
 
PPT
Soft Eng - Software Process
Jomel Penalba
 
RDF 해설서
Hansung University
 
JAD - Joint Applications Development
John Crosby
 
SSIS control flow
Slava Kokaev
 
Talend Open Studio Data Integration
Roberto Marchetto
 
Enhanced E-R diagram
Mayank Jain
 
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
Juan Sequeda
 
Introducción a UML y Diagrama de Casos de Uso
Yaskelly Yedra
 
Component object model and
Saransh Garg
 
IBM File Net P8
Mohammed El Rafie Tarabay
 
Chapter 1 introduction to sql server
baabtra.com - No. 1 supplier of quality freshers
 
Dbms models
devgocool
 
Introduction to 3 tier architecture of DBMS basics and details
ShrutiArora343479
 
Mapping ER and EER Model
Mary Brinda
 
Tryb przypuszczający w języku niemieckim (Konjunktiv) - Marcin Perliński
Asgard71
 
Xml schema
Akshaya Akshaya
 
Oracle apex training
Vasudha India
 
Diseño con uml, caso
cams21
 
Soft Eng - Software Process
Jomel Penalba
 
Ad

Viewers also liked (20)

PPTX
Silvana vega
30402626
 
PDF
Campus Symposium 2010
Campus Symposium GmbH
 
DOCX
Diseño de carta de correspondencia PsicoMax
cristian567
 
PPTX
Caso de Amanda Todd
Diego Leucel Navarro Iglesias
 
PPTX
Medicina roma grecia
doris_31_paola
 
PPTX
Tecnologia educativa taller_2
SAUL ROMERO
 
PPTX
Mitos tdah
alma1111
 
PPTX
Investigacion por mayra zurita
mayraely
 
PPTX
Estabilidad laboral en personas discapacitadas (1)
estalaboral
 
PPT
Fashion Europe Net German
Fashion europe.Net unabh. Partnerin
 
PDF
Indien ist anders Infografik
Florian Blümm
 
DOCX
Trabajo de psicofisiologia
kmbgg
 
PDF
Ernährungsziel sport
Myfoodconcept
 
PDF
A Doce Conquista do Amendoim
Agricultura Sao Paulo
 
PPTX
Redes inalámbricas
Micaela Ñacato
 
PDF
Eeg umlage-hintergrundpapier-hjfell
metropolsolar
 
PPTX
La revolución industrial
Team Moonlight®
 
PPTX
Examen de computación1
Richard Silva
 
PDF
Resdes sociales
3125360641
 
Silvana vega
30402626
 
Campus Symposium 2010
Campus Symposium GmbH
 
Diseño de carta de correspondencia PsicoMax
cristian567
 
Caso de Amanda Todd
Diego Leucel Navarro Iglesias
 
Medicina roma grecia
doris_31_paola
 
Tecnologia educativa taller_2
SAUL ROMERO
 
Mitos tdah
alma1111
 
Investigacion por mayra zurita
mayraely
 
Estabilidad laboral en personas discapacitadas (1)
estalaboral
 
Fashion Europe Net German
Fashion europe.Net unabh. Partnerin
 
Indien ist anders Infografik
Florian Blümm
 
Trabajo de psicofisiologia
kmbgg
 
Ernährungsziel sport
Myfoodconcept
 
A Doce Conquista do Amendoim
Agricultura Sao Paulo
 
Redes inalámbricas
Micaela Ñacato
 
Eeg umlage-hintergrundpapier-hjfell
metropolsolar
 
La revolución industrial
Team Moonlight®
 
Examen de computación1
Richard Silva
 
Resdes sociales
3125360641
 
Ad

Similar to A Study on Mongodb Database (20)

PPTX
Mongo db
Gyanendra Yadav
 
PPTX
Mongodb
ASEEMSRIVASTAVA22
 
PPTX
05201349_Unit_7_FSWD_ advanced learning.pptx
ozakamal8
 
PPTX
05201349_Unit_7_FSWD_II(1) with advance.pptx
ozakamal8
 
PPTX
Nosql
ROXTAD71
 
PPTX
Nosql
Roxana Tadayon
 
PPTX
Mongo db
AbhiKhurana8
 
PDF
Mongo db dhruba
Dhrubaji Mandal ♛
 
PDF
Streaming Analytics Unit 5 notes for engineers
ManjuAppukuttan2
 
PPTX
UNIT-1 MongoDB.pptx
DharaDarji5
 
PDF
Analysis on NoSQL: MongoDB Tool
ijtsrd
 
PPT
Mongo Bb - NoSQL tutorial
Mohan Rathour
 
PPTX
Introduction to NoSQL and MongoDB
Ahmed Farag
 
DOCX
MongoDB DOC v1.5
Tharun Srinivasa
 
PPTX
nosql [Autosaved].pptx
Indrani Sen
 
PDF
MongoDB Lab Manual (1).pdf used in data science
bitragowthamkumar1
 
PPTX
Mongodb - NoSql Database
Prashant Gupta
 
PDF
Mongodb
Apurva Vyas
 
PPTX
MongoDB
Tharun Srinivasa
 
Mongo db
Gyanendra Yadav
 
05201349_Unit_7_FSWD_ advanced learning.pptx
ozakamal8
 
05201349_Unit_7_FSWD_II(1) with advance.pptx
ozakamal8
 
Nosql
ROXTAD71
 
Mongo db
AbhiKhurana8
 
Mongo db dhruba
Dhrubaji Mandal ♛
 
Streaming Analytics Unit 5 notes for engineers
ManjuAppukuttan2
 
UNIT-1 MongoDB.pptx
DharaDarji5
 
Analysis on NoSQL: MongoDB Tool
ijtsrd
 
Mongo Bb - NoSQL tutorial
Mohan Rathour
 
Introduction to NoSQL and MongoDB
Ahmed Farag
 
MongoDB DOC v1.5
Tharun Srinivasa
 
nosql [Autosaved].pptx
Indrani Sen
 
MongoDB Lab Manual (1).pdf used in data science
bitragowthamkumar1
 
Mongodb - NoSql Database
Prashant Gupta
 
Mongodb
Apurva Vyas
 

More from IJSRD (20)

PPTX
#IJSRD #Research Paper Publication
IJSRD
 
PDF
Maintaining Data Confidentiality in Association Rule Mining in Distributed En...
IJSRD
 
PDF
Performance and Emission characteristics of a Single Cylinder Four Stroke Die...
IJSRD
 
PDF
Preclusion of High and Low Pressure In Boiler by Using LABVIEW
IJSRD
 
PDF
Prevention and Detection of Man in the Middle Attack on AODV Protocol
IJSRD
 
PDF
Comparative Analysis of PAPR Reduction Techniques in OFDM Using Precoding Tec...
IJSRD
 
PDF
Evaluation the Effect of Machining Parameters on MRR of Mild Steel
IJSRD
 
PDF
Filter unwanted messages from walls and blocking nonlegitimate user in osn
IJSRD
 
PDF
Keystroke Dynamics Authentication with Project Management System
IJSRD
 
PDF
Diagnosing lungs cancer Using Neural Networks
IJSRD
 
PDF
A Survey on Sentiment Analysis and Opinion Mining
IJSRD
 
PDF
A Defect Prediction Model for Software Product based on ANFIS
IJSRD
 
PDF
Experimental Investigation of Granulated Blast Furnace Slag ond Quarry Dust a...
IJSRD
 
PDF
Product Quality Analysis based on online Reviews
IJSRD
 
PDF
Solving Fuzzy Matrix Games Defuzzificated by Trapezoidal Parabolic Fuzzy Numbers
IJSRD
 
PDF
Study of Clustering of Data Base in Education Sector Using Data Mining
IJSRD
 
PDF
Fault Tolerance in Big Data Processing Using Heartbeat Messages and Data Repl...
IJSRD
 
PDF
Investigation of Effect of Process Parameters on Maximum Temperature during F...
IJSRD
 
PDF
Review Paper on Computer Aided Design & Analysis of Rotor Shaft of a Rotavator
IJSRD
 
PDF
A Survey on Data Mining Techniques for Crime Hotspots Prediction
IJSRD
 
#IJSRD #Research Paper Publication
IJSRD
 
Maintaining Data Confidentiality in Association Rule Mining in Distributed En...
IJSRD
 
Performance and Emission characteristics of a Single Cylinder Four Stroke Die...
IJSRD
 
Preclusion of High and Low Pressure In Boiler by Using LABVIEW
IJSRD
 
Prevention and Detection of Man in the Middle Attack on AODV Protocol
IJSRD
 
Comparative Analysis of PAPR Reduction Techniques in OFDM Using Precoding Tec...
IJSRD
 
Evaluation the Effect of Machining Parameters on MRR of Mild Steel
IJSRD
 
Filter unwanted messages from walls and blocking nonlegitimate user in osn
IJSRD
 
Keystroke Dynamics Authentication with Project Management System
IJSRD
 
Diagnosing lungs cancer Using Neural Networks
IJSRD
 
A Survey on Sentiment Analysis and Opinion Mining
IJSRD
 
A Defect Prediction Model for Software Product based on ANFIS
IJSRD
 
Experimental Investigation of Granulated Blast Furnace Slag ond Quarry Dust a...
IJSRD
 
Product Quality Analysis based on online Reviews
IJSRD
 
Solving Fuzzy Matrix Games Defuzzificated by Trapezoidal Parabolic Fuzzy Numbers
IJSRD
 
Study of Clustering of Data Base in Education Sector Using Data Mining
IJSRD
 
Fault Tolerance in Big Data Processing Using Heartbeat Messages and Data Repl...
IJSRD
 
Investigation of Effect of Process Parameters on Maximum Temperature during F...
IJSRD
 
Review Paper on Computer Aided Design & Analysis of Rotor Shaft of a Rotavator
IJSRD
 
A Survey on Data Mining Techniques for Crime Hotspots Prediction
IJSRD
 

Recently uploaded (20)

PPTX
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTX
PPTs-The Rise of Empiresghhhhhhhh (1).pptx
academysrusti114
 
PDF
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
PDF
1.Natural-Resources-and-Their-Use.ppt pdf /8th class social science Exploring...
Sandeep Swamy
 
PDF
UTS Health Student Promotional Representative_Position Description.pdf
Faculty of Health, University of Technology Sydney
 
PPTX
Odoo 18 Sales_ Managing Quotation Validity
Celine George
 
PPTX
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
Software Engineering BSC DS UNIT 1 .pptx
Dr. Pallawi Bulakh
 
PPTX
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
PDF
Sunset Boulevard Student Revision Booklet
jpinnuck
 
DOCX
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
PPTX
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
PPTX
CDH. pptx
AneetaSharma15
 
PDF
The Picture of Dorian Gray summary and depiction
opaliyahemel
 
PDF
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
Nguyen Thanh Tu Collection
 
PPTX
Autodock-for-Beginners by Rahul D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
Kanban Cards _ Mass Action in Odoo 18.2 - Odoo Slides
Celine George
 
PDF
Study Material and notes for Women Empowerment
ComputerScienceSACWC
 
PDF
PG-BPSDMP 2 TAHUN 2025PG-BPSDMP 2 TAHUN 2025.pdf
AshifaRamadhani
 
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTs-The Rise of Empiresghhhhhhhh (1).pptx
academysrusti114
 
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
1.Natural-Resources-and-Their-Use.ppt pdf /8th class social science Exploring...
Sandeep Swamy
 
UTS Health Student Promotional Representative_Position Description.pdf
Faculty of Health, University of Technology Sydney
 
Odoo 18 Sales_ Managing Quotation Validity
Celine George
 
Artificial-Intelligence-in-Drug-Discovery by R D Jawarkar.pptx
Rahul Jawarkar
 
Software Engineering BSC DS UNIT 1 .pptx
Dr. Pallawi Bulakh
 
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
Sunset Boulevard Student Revision Booklet
jpinnuck
 
SAROCES Action-Plan FOR ARAL PROGRAM IN DEPED
Levenmartlacuna1
 
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
CDH. pptx
AneetaSharma15
 
The Picture of Dorian Gray summary and depiction
opaliyahemel
 
BÀI TẬP TEST BỔ TRỢ THEO TỪNG CHỦ ĐỀ CỦA TỪNG UNIT KÈM BÀI TẬP NGHE - TIẾNG A...
Nguyen Thanh Tu Collection
 
Autodock-for-Beginners by Rahul D Jawarkar.pptx
Rahul Jawarkar
 
Kanban Cards _ Mass Action in Odoo 18.2 - Odoo Slides
Celine George
 
Study Material and notes for Women Empowerment
ComputerScienceSACWC
 
PG-BPSDMP 2 TAHUN 2025PG-BPSDMP 2 TAHUN 2025.pdf
AshifaRamadhani
 

A Study on Mongodb Database

  • 1. IJSRD - International Journal for Scientific Research & Development| Vol. 3, Issue 10, 2015 | ISSN (online): 2321-0613 All rights reserved by www.ijsrd.com 832 A Study on Mongodb Database Kavya. S M Tech. Student Department of Computer Science Mount Zion College of Engineering, A. P. J. Abdul Kalam Technological University, Kadammanitta P.O, Pathanamthitta, Kerala, India Abstract— This paper trying to focus on main features, advantages and applications of non-relational database namely Mongo DB and thus justifying why MongoDB is more suitable than relational databases in big data applications. The database used here for comparison with MongoDB is MySQL. The main features of MongoDB are flexibility, scalability, auto sharding and replication. MongoDB is used in big data and real time web applications since it is a leading database technology. Key words: NoSQL, MongoDB, auto sharding, aggregation I. INTRODUCTION Relational database management systems came into existence since 1980’s.They are a common choice of storage of information in new databases used for financial records, manufacturing and logistical information personnel data and other applications. They work efficiently when they handle a limited amount of data. Due to the emergence of applications that support millions of users simultaneously an appropriate database is required. To handle huge volume of data traditional relational database is inefficient. To overcome the difficulty in handling huge volume of data ,the term NoSQL was introduced by Crlo Strozzi in 1998.It refers to non-relational databases. More recently,the term has received another meaning namely Not Only SQL.The main advantage of NoSQL database is that it can handle both unstructured(e-mail,multimedia,social media) and semi structured data very efficiently. Mainly there are four categories of databases namely Key –value store, document store, column oriented and graph database. MongoDB is a cross platform document oriented database, first developed by the software company MongoDB Inc., in October 2007 as a component of planned platform as a service product. The company shifted to an open source development model in 2009.Since, then MongoDB has been adopted as a backend software by a number of major websites and services. These include Craigslist, eBay, Foursquare and Newyork Times. MongoDB is written in c++ and provides high availability, easy scalability and better performance. MongoDB works on the concept of collection and document. A database is a physical container for collections. Collection is a group of MongoDB documents. It is equivalent to RDBMS table. MongoDB database contain multiple collections. A document is a set of key-value pairs. Documents have dynamic schema. That means, documents in the same collection do not need to have the same structure and common fields. MongoDB supports dynamic queries on documents using a document based query language that is nearly as powerful as SQL.It stores the data in the form of JSON documents. Auto sharding, replication and high availability are the main features of MongoDB.It is commonly used in big data, content management and delivery, mobile and social infrastructure, user data management and data hub. MongoDB supports different data types such as String, Integer, Boolean, Double,Min/Max keys, Arrays, Timestamp, Oject, Null, Symbol, date, Object ID, Binary data, code and regular expressions. II. MONGODB DATA MODEL MongoDB stores data as documents which are in the BSON format. BSON is binary representation of JSON document. Documents having similar structure are organized as collections. Collection is analogous to a table in relational database. Documents and fields in MongoDB are represented using the terms row and columns respectively in MySQL. The difference between the relational database, MySQL and non-relational database ,MongoDB is that in relational database information for a given record is usually spread across many tables, whereas in MongoDB the documents tend to have all data for a given record in a single document. MySQL MongoDB Table Collection Row BSON document Column BSON field JOIN Embedded documents and Linking GROUP BY Aggregation Primary key Primary key Table 1: III. FEATURES OF MONGODB MongoDB has a flexible data model. That means data can be stored in any structure. This feature also allows modification of data in an easy way. Another main feature is elastic scalability. All NoSQL databases contain some form of sharding or partitioning.This allows the database to scale out on hardware. Thereby allowing almost unlimited growth. MongoDB provides high performance than traditional relational databases. The performance of MongoDB is measured in terms of both throughput and latency at any scale. MongoDB does not use join operation, instead they use embedding of documents and linking. Because the data in MongoDB is more localized.This localization dramatically reduces the need to join separate tables. Each document structure in MongoDB database can vary from one another. If there is a need to create a new field in any one of the document, then the field can be created without affecting a central system catalog and without taking the system offline. In MongoDB, field updates can be done easily. It provides rich data model. Data locality and dynamic schema are other main features of MongoDB. The main feature of MongoDB includes querying, aggregation, indexing and auto sharding. Indexes play a major role in providing efficient access to data,for both read and write operations,which are
  • 2. A Study on Mongodb Database (IJSRD/Vol. 3/Issue 10/2015/182) All rights reserved by www.ijsrd.com 833 supported natively by the database rather than maintained in application code. MongoDB supports many queries, mainly for highly scalable operational applications. The result of query execution can be a document or subset of specific fields within the document. Different types of query provided by MongoDB include key value queries, range queries, geo spatial queries, search queries, text search queries, aggregation framework queries and map reduce queries. Replica sets are another feature of MongoDB which is a fail over mechanism. Only the primary database allows write operation. Multiple secondary servers are used for read operation. For a replica set, minimum three servers is required. Of the three servers, one is primary server, other is secondary server and the remaining one is arbiter server. Arbiter server is not used for storing data. They are used only during failover time to determine which server will be the next primary server. Another feature is auto sharding. This feature is used to overcome the hardware limitations. Hardware limitations means bottleneck in RAM/disk I/O. This feature of MongoDB helps to distribute data across physical partitions. These physical partitions are called as shards. Thus data is automatically balanced in the clusters as the data grows. In relational database sharding is not built into the database. An aggregate is a group of related entities and value object. Maximum document size in MongoDB is 16 MB and large documents are handled with Grid FS.MongoDB runs on OSs such as Windows, Linux, Mac and Solaris. IV. COMPARISON OF MONGODB VS MYSQL MongoDB Commands SELECT * FROM table db.collection.find() SELECT * FROM table WHERE user=’Akshay’ db.collection.find({user=”Akshay”}) SELECT * FROM table ORDER BY Age Db.collection.find. DISTINCT .distinct() GROUP .group() Table 2: Fig (a) Retrieval of data in MySQL and MongoDB Modeling of data in MongoDB database differs from relational database. Different modeling styles can be applied depending on the requirement of the application. Most common modeling styles are embedding of documents and normalization on collections. The embedding feature has a disadvantage. That is, it may cause the situation that documents grow in size after creation which may degrade the performance of database. Col 1 Col 2 Col 3 Table 3: Fig(b)Data modeling by embedding of documents Example of Embedded documents having one to one relationship is shown below. {_id:1,Name:”Akshay Anand”,Address :{ City”:”Kochi”,Country:”India”}} Example of Embedded documents having one to many relationship is shown below {_id:1,Name: “Akshay Anand”,Children :[{Name:”Aravind”,Age:2},{Name:”Anupama”,Age:4}] This shows that array of values can be stored easily in MongoDB. MongoDB supports denormalization.It is a process of reducing number of physical tables which are accessed more frequently to reduce the query processing time.This process reduces number of joins required to design the query to get desired output. Col-1 Col-2 Table 4: Norm. Col 1 Col 3 Table 5: Fig(c) Normalization. Col 1 Col 2 Col 3 Table 6: Fig (d)D normalization. In MySQL,the concept of normalization is used. This concept was first introduced by E.F.Codd. The objectives of normalization process include well organization of data, minimizing update anomalies and maximizing data accessibility. In ©,a common key is used to refer the tables Table1_Norm and Table2_Norm.In the next figure, the tables are merged together. Embedding is similar to denormalizationbut still little variation is there. Embedding of documents give better performance than normalization on collections. V. ADVANTAGES OF MONGODB It is schema less. MongoDB database belongs to document store category in which one collection holds different different documents. Number of fields, content and size of the document can be different in each document. The main advantage of MongoDB database is that structure of a single object is clear. It does not contain complex joins.It has deep query ability. Easy of scale out is another major advantage. In this type of database, conversion/mapping of application objects to database objects not needed. MongoDB uses the internal memory for storing the work set there by enabling faster access of data. It provides index on any attribute. The secondary indexes supported by the MongoDB database make them transparent to developers. VI. APPLICATIONS OF MONGODB They are widely used in big data and real time web applications such as Facebook, Yahoo, Google and Amazon. It is also used in content management and delivery. It can be used in mobile and social infrastructure. For user data
  • 3. A Study on Mongodb Database (IJSRD/Vol. 3/Issue 10/2015/182) All rights reserved by www.ijsrd.com 834 management the best choice among NoSQL database is MongoDB. It finds application in data hub also.It is the best choice for a small or medium sized non –critical sensor applications, especially when write performance is important. VII. CONCLUSION As NoSQL trend is relatively new, many researchers are attracted to this category of databases. NoSQL databases such as MongoDB and its key-value stores provide an efficient framework to aggregate large volumes of data. MongoDB can store complex data like array,object or reference into one field. Mapping of objects is very easy in this type of database. The features of MongoDB like auto sharding and replication of data make the development faster than MySQL. MongoDB provides flexibility, horizontal scalability, auto sharding and replication. MongoDB is a better choice for big data applications than MySQL database. It gives better performance than relational database. Depending on the requirements of application, we can choose the suitable NoSQL database. REFERENCES [1] Mrs.Anuradha Kanade, Dr.Arpita Gopal,Mr.Shanthanu Kanade“A Study of Normalization and Embedding in MongoDB”Advance Computing Conference (IACC), 2014 IEEE International [2] Cornelia GYORODI,Robert GYORODI,George PECHERLE,Andrada OLAH “A comparative study:MongoDB vs MySQL”Engineering of Modern Electric Systems (EMES), 2015 13th International Conference on11-12 June 2015. [3] K.Sanobar,M.Vanita,”SQL Support over MongoDB using Metadata”