0% found this document useful (0 votes)
78 views

Big Data Analysis

https://irjet.net/archives/V4/i8/IRJET-V4I8308.pdf
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
78 views

Big Data Analysis

https://irjet.net/archives/V4/i8/IRJET-V4I8308.pdf
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056

Volume: 04 Issue: 08 | Aug -2017 www.irjet.net p-ISSN: 2395-0072

Big Data Analysis

Narayanan.V1, NikitaSri.G2, Suchitra.B3

1,2 Student,Dept.of Information Technology, Sri Krishna Arts & Science College, Coimbatore, TamilNadu, India
3Assistant Professor, Dept.of Information Technology, Sri Krishna Arts & Science College, Coimbatore,
TamilNadu, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract- As technology grows , the need for data is also is on unstructured data big data size is a constantly moving
necessary . In today era world contains very huge amount target as of terabytes to many petabytes of data. Big Data is
of data that are scattered / distributed everywhere . The a convergence of new hardware and algorithms that allow
need for exact data is also necessary ,some of the data are us to discover new patterns in large data sets patterns
hidden which cannot be used by users . The datas in the web we can apply to making better predictions and, ultimately,
are classified as structured data ( having proper structure), better decisions. Big Data has the potential to improve
unstructured data(having improper structure data of various lives with better services and products.
file format) and semi-structured data (partial organized
data).In order to handle unstructured data and to support 3. 5v characteristics
large volume of data , the big data is useful. Big data supports
all types of data and large volume of data . In this Paper , Big data can be described by the following 5vs
the need for big data and its advantages and how it is useful characteristics
in IT fields are discussed.

1. Introduction

The most recent development in this type of data is in


attitudes and behaviours and this is where Big Data comes
in. while examing everyones activities on the internet (i.e)
their Facebook posts , Google searches, tweets,emails, and
more, we have now more varities of data on every profiles .
This has led to very large databases, which need to be
tracked for some measures . The evolution of data is not
ending anytime soon. analysis. After the birth of big data,
new technologies and processes were developed at warp
speed to help companies to manuplate their data into
profitable way . Big data required advanced processing
frameworks such as Hadoop and new databases such as a. Volume it refers to the vast amount of data generated
NoSQL to store and manipulate it. The basic idea behind the and stored every second . The size of the data
word BIG DATA is that everything we do is increasingly determines the value and potential insight- and whether
leaving a digital trace (data) which we use and examine. it can actually be considered big data or not.big data
tools use distributed systems so that we can store an
2. What actually BIG DATA is analyse data across databases that are dotted around
anywhere in the world
Data has been spread everywhere whether we want it or
not There are some things that are so big that they have b. Variety it refers to the type and nature of the data
implications for everyone, BIG DATA is one of those things .variety of data categories into structured data (
an is completely transforming the way we do business relational database (.i.e.) having proper structure) ,semi-
and is impacting other parts of our lives. BIG DATA refers to structured data ( partial organize data )and
the large collection of data sets that are so larger or unstructured data (text , images , video , voice , etc.)
complex so that traditional data processing application This helps people who analyze it to effectively use the
software is inadequate to deal with them. big data challenges resulting insight.
include search, capturing data, data storage, data analysis,
sharing , transfer, querying , updating visualization and c. Velocity it refers to the speed at which the data is
information privacy. BIG DATA usually includes data with generated and processed to meet the demands and
data philosophy encompasses unstructured, semi- challenges that lie in the path of growth and
structured and unstructured data ,however the main focus development. Just the data goes viral in seconds.

2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1731
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 04 Issue: 08 | Aug -2017 www.irjet.net p-ISSN: 2395-0072

5. Problems with big data


d. Variability Inconsistency of the data set can hamper
processes to handle and manage it. Variability is The most challenging task in big data is
different from variety. Its meaning is constantly
changing it can have a huge impact on your data a. Storing data even the data which are smaller in size is
homogenization. difficult to store and retrieve .therefore it is a complex
task to store the data and in analyzing it
e. Veracity The quality and quantity of captured data can
vary greatly, affecting the accurate analysis and that to b. Processing data faster The data store are in diferent
in the inconvenient form .here Veracity refers to make format structured data(relational database (ie) having
sure the data is truthful, which requires processes to proper structure) ,semi-structured data (partial
keep the bad data from heaping in your systems. organize data )and unstructured data(text, images
,video, voice,etc.) so it not easy to process the data & it
4. Big data as big deal takes plenty of time.

FOUR things make big data significant: 6. Tools for big data

a) The data is massive. The data are huge so that It cant Apache Hadoop
fit on a single hard drive. The volume of data far exceeds Lumify
than what the human mind can think .(for example just Apache Storm
think of a Million billion Terabytes, and then multiply HPCC Systems Big Data
that by more millions ). Apache Samoa
b) The data is messy and unstructured. Most important Elasticsearch
work of the big data is cleaning and converting the MongoDB
information so that it would be easy to search an sort . Talend Open Studio for Big Data
Only a few thousand experts on our planet fully know Rapid Miner
how to do this data cleanup. But in 10 years, the work R-ProgrammingThese are some tools for handling big data
for the big data will increases because data generated
will also increase day by day and will become tedious 7. CONCLUSION
one.
c) Data has become a commodity: now data has become Big data deals with knowledge discovery and data can be
a necessary commodity that can be sold and bought. extracted such a way that it is useful by millon of users .
companies and individuals can buy terabytes of social When data are increased, the need for database is also
media and other data on Data marketplaces . data are important . The data about data is also become a important
huge that it wont be fit into any hard disk and Most of criteria . and as years go , the need & use big data is also
the data is cloud-based. data Buying commonly involves necessary. but big data is just the starting stage of these
a subscription fee where you plug into a cloud server problems. As the technology develop there is a huge chance
farm. that the data which has been collected during that period can
d) The possibilities of big data are endless. Data are exceed the amount of data created till humen birth. The big
very useful in our day to day life because Perhap doctors data plays a vital role in todays world. In this paper the
will one day predict cancer, heart attack s ,strokes and advantage ,characteristics and how the database for the big
some more deadly diseases for individuals weeks before data supports are seen.
they happen it wont be helful for us therefore we should
analyse the data . Airplane and automobile crashes 8. References
might be reduced by predictive analyses of their
mechanical data and traffic and weather patterns. Online 1) JEFF desjards ,on The evolution of data
traing might be improved by having big data experts 2) paul gil ,on What Exactly is BIG data
with us .musician can find out the tune and rhythm 3)Wikipedia.org/wiki/big_data
relating to peoples taste and they can be make the tune 4) neelamani samal,nilamashob myshra ,on Big data
of the current trend by analyzing data. likewise not only process:big challenges and opportunity
in this fields big data has its own scope in every field 5) Ashley devan on The 7VS of big data
world wide these are some of examples for our
understanding .In the big data only a piece of cake has
been eaten .there is more and more in it. the discoveries
in the big data are updating day by day

2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1732
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 04 Issue: 08 | Aug -2017 www.irjet.net p-ISSN: 2395-0072

BIOGRAPHIES

Narayanan.V
Student
Dept .of Information technology
Sri Krishna College of Arts &
Science
Coimbatore

NikitaSri.G
Student
Dept .of Information technology
Sri Krishna College of Arts &
Science Coimbatore

Suchitra.B
Assistant Professor
Dept .of Information technology
Sri Krishna College of Arts &
Science Coimbatore

2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1733

You might also like