Big DATA
By- Yash Bheda (1524008)
Janhavi Jaltare(1524011)
Krisha Udani()
Binal Savla (1524003)
Table of Contents
Topics
History of Big Data
Big Data
Architecture for Network
Network Analysis Algorithm
Big Data network analysing
Network Application
Summary
1.0: History of Big Data
● Big data is a relative term: it describes the point at which an organization's data must be stored and managed in a way that supports timely decision making.
Time | Data generation | Processing
Initially | Employee-generated data | Single processor
Modern times | User-generated data | Parallel processing (multiple processors using servers)
Recently | System-generated data | Parallel processing (multiple processors using servers)
Contents
● Big data generated by users and systems is mostly unstructured.
Traditional Data: Documents, Finances, Stock Recording, Personnel Files
Big Data: Photographs, Audio and Videos, 3D Models, Simulation, Location Data
BIG data
● Big Data represents the way this information is analysed to open up opportunities.
● There is a strong need for structure to parse the data, separate out what is unwanted, and find the useful threads that uncover opportunities.
Input information -> New processing techniques -> Better results
Management approach
● Traditionally: Data input -> Storing -> Analysing
● Modern: Data input -> Analysing -> Storing
4 V’s of BIG data
● Volume: the vast amounts of data generated every second.
● Velocity: the speed at which new data is generated and moves around.
● Variability: the messiness or trustworthiness of the data (often called veracity), together with inconsistent data flow that has periodic peaks.
● Variety: the different types of data we can now use.
Variety of data
Big Data Classification
Why classify?
● Complex situations
● 4 Vs
● Results
From classifying big data to choosing a big data solution
● Defining a logical architecture
● Understanding atomic patterns for big data solutions
● Understanding composite patterns to use for big data solutions
● Choosing a solution pattern for a big data solution
● Determining the viability of a business problem for a big data solution
● Selecting the right products to implement a big data solution
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #analysis #data #dataanalysis #Mapreduction
Parallel processing
Mappers and Reducers
● Map-Reduce job =
- Map function (input -> key-value pairs) +
- Reduce function (key and list of values -> output).
● Map(): a procedure (method) that performs filtering and sorting.
● Reduce(): a method that performs a summary operation.
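The job structure above can be sketched in a few lines of plain Python. This is a minimal single-process illustration, not a distributed framework: the names run_mapreduce, word_count_map and word_count_reduce are hypothetical, and the word-count task is assumed only for the sake of the example.

```python
from collections import defaultdict

def run_mapreduce(inputs, map_fn, reduce_fn):
    """Run a map-reduce job in one process (illustrative only)."""
    groups = defaultdict(list)
    # Map phase: each input record yields zero or more (key, value) pairs.
    for record in inputs:
        for key, value in map_fn(record):
            # Grouping: collect every value under its key.
            groups[key].append(value)
    # Reduce phase: one call per key, given the key and its list of values.
    return {key: reduce_fn(key, values) for key, values in groups.items()}

def word_count_map(line):
    # Emit (word, 1) for every word in the line.
    for word in line.split():
        yield (word.lower(), 1)

def word_count_reduce(word, counts):
    # Summary operation: total occurrences of the word.
    return sum(counts)

counts = run_mapreduce(["big data", "Big networks"],
                       word_count_map, word_count_reduce)
print(counts)
```

A real framework would run the map and reduce calls on different machines; only the data flow is the same here.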
NATURAL JOIN- MAPPING
● The join of R(A,B) with S(B,C) is the set of tuples (a,b,c) such that (a,b) is in R and (b,c) is in S.
● The mappers need to send R(a,b) and S(b,c) to the same reducer, so that they can be joined there.
● Mapper output: key = B-value, value = relation name and the other component (A or C).
- Example: R(1,2) -> (2,(R,1))
  S(2,3) -> (2,(S,3))
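The mapping rule above could be written as a small Python function. This is a sketch under assumptions: the function name join_map and the use of the strings "R" and "S" as relation tags are illustrative choices, not part of the slides.

```python
def join_map(relation, tup):
    """Map a tuple of R(A,B) or S(B,C) to (B-value, (relation, other component))."""
    if relation == "R":
        a, b = tup          # R(a, b): B is the second component
        return (b, ("R", a))
    if relation == "S":
        b, c = tup          # S(b, c): B is the first component
        return (b, ("S", c))
    raise ValueError(f"unknown relation: {relation}")

# The four tuples used in the slides.
mapped = [
    join_map("R", (1, 2)),
    join_map("R", (4, 2)),
    join_map("S", (2, 3)),
    join_map("S", (5, 6)),
]
print(mapped)
```

Keying on the B-value is what guarantees that matching tuples from both relations reach the same reducer.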
Mapping Tuples
R(1,2) -> Mapper for R(1,2) -> (2,(R,1))
R(4,2) -> Mapper for R(4,2) -> (2,(R,4))
S(2,3) -> Mapper for S(2,3) -> (2,(S,3))
S(5,6) -> Mapper for S(5,6) -> (5,(S,6))
Grouping Phase
● There is a reducer for each key.
● Every key-value pair generated by any mapper is sent to the reducer for its key.
Mapping Tuples
Mapper for R(1,2) -> (2,(R,1)) -> Reducer for B=2
Mapper for R(4,2) -> (2,(R,4)) -> Reducer for B=2
Mapper for S(2,3) -> (2,(S,3)) -> Reducer for B=2
Mapper for S(5,6) -> (5,(S,6)) -> Reducer for B=5
Constructing Value-list
● The input to each reducer is organized by the system into a pair:
- The key.
- The list of values associated with that key.
THE VALUE-LIST FORMAT
(2,[(R,1), (R,4), (S,3)]) -> Reducer for B=2
(5,[(S,6)]) -> Reducer for B=5
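The value-list construction can be illustrated with a short Python sketch. In a real system the framework does this grouping itself; the helper name group_by_key is hypothetical and the code only mimics the result on the example data.

```python
from collections import defaultdict

def group_by_key(pairs):
    """Collect mapper output into {key: [values...]}, one entry per reducer."""
    value_lists = defaultdict(list)
    for key, value in pairs:
        value_lists[key].append(value)
    return dict(value_lists)

# Mapper output for R(1,2), R(4,2), S(2,3) and S(5,6).
mapper_output = [(2, ("R", 1)), (2, ("R", 4)), (2, ("S", 3)), (5, ("S", 6))]
reducer_inputs = group_by_key(mapper_output)
print(reducer_inputs)
```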
The reduce Function For Join
Given key b and a list of values that are either (R, a_i) or (S, c_j), output each triple (a_i, b, c_j).
- Thus, the number of outputs made by a reducer is the product of the number of R’s on the list and the number of S’s on the list.
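A possible Python rendering of this reduce function is shown below. The name join_reduce and the ("R", a) / ("S", c) value encoding are assumptions made for illustration; the logic simply pairs every R-value with every S-value under the same key.

```python
def join_reduce(b, values):
    """Return every triple (a, b, c) for the R- and S-values grouped under key b."""
    a_values = [x for rel, x in values if rel == "R"]
    c_values = [x for rel, x in values if rel == "S"]
    # One output per (R-value, S-value) combination, so the output size
    # is len(a_values) * len(c_values).
    return [(a, b, c) for a in a_values for c in c_values]

out_2 = join_reduce(2, [("R", 1), ("R", 4), ("S", 3)])
out_5 = join_reduce(5, [("S", 6)])
print(out_2, out_5)
```

For key 5 the list holds one S-value and no R-values, so the product is zero and the reducer emits nothing.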
OUTPUT OF THE REDUCERS
(2,[(R,1), (R,4), (S,3)]) -> Reducer for B=2 -> (1,2,3), (4,2,3)
(5,[(S,6)]) -> Reducer for B=5 -> no output, since the list contains no R values
Network Resources Related to Big Data
The network's capability to absorb and transfer big
data traffic is made up of six elements:
1. Bandwidth
2. Network delay
3. Security
4. Data delivery accuracy
5. Availability
6. Resiliency
Network Monitoring of Big Data
● Most monitoring systems deal with major changes,
failures, configuration data, and traffic reporting.
● The monitoring function itself is a producer of big
data. Therefore, the network data needs to be analyzed
with big data applications.
● Traffic trends, where applications are located, what
caused the traffic, and what network resources are
available to effectively carry the traffic are all part of
the network big data information.
Network Monitoring Strategies
● Ensure that your monitoring tools collect the network information with
enough granularity to produce detailed statistical representations.
● You will need a dashboard that continuously provides alerts and alarms
when traffic changes occur that are outside acceptable limits.
● Create short-term reports rapidly so that traffic changes that could impair
the network operation can be discovered as soon as possible.
● If a cloud service is employed, do you have the traffic data from the cloud
delivered in real time so you can make decisions before a problem worsens?
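The alerting strategy above could be prototyped as follows. This is only a sketch under stated assumptions: the function name traffic_alerts, the five-sample window, and the three-standard-deviation tolerance are hypothetical choices, and the traffic figures are made up for illustration.

```python
from statistics import mean, stdev

def traffic_alerts(samples_mbps, window=5, tolerance=3.0):
    """Flag samples that deviate from the recent baseline.

    A sample is flagged when it lies more than `tolerance` standard
    deviations from the mean of the previous `window` samples.
    """
    alerts = []
    for i in range(window, len(samples_mbps)):
        baseline = samples_mbps[i - window:i]
        mu, sigma = mean(baseline), stdev(baseline)
        # Guard against a perfectly flat baseline (sigma == 0).
        limit = tolerance * max(sigma, 1e-9)
        if abs(samples_mbps[i] - mu) > limit:
            alerts.append((i, samples_mbps[i]))
    return alerts

# Illustrative per-interval traffic readings with one spike.
samples = [100, 102, 98, 101, 99, 100, 250, 101]
alerts = traffic_alerts(samples)
print(alerts)
```

Finer-grained samples give a tighter baseline, which is why the first strategy asks for enough granularity in the collected data.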
Benefits of Big Data Network Monitoring
1. Load balancing
2. Data Filtering
3. Real-time data analysis
4. Managing Virtual resources
Big Data Impact
Network Applications
● Big data for network design
● Big data for network management
● Big data for network resource optimization
● Big data for network security and privacy
● Big data for network economics and pricing
● Big data for network performance evaluation
● Parallel and distributed algorithms for Big Data
Online services
● Netflix compares different banners for its shows and presents each customer with the one that appeals to them.
Targeted marketing and advertising
● Using 'tracking cookies', Facebook can collect information about each website you visit.
● It is possible to accurately predict a range of highly sensitive personal attributes simply by analysing a user's ‘Likes’.
Network Security & Bigdata
● Software-Defined Networking (SDN)-based controllers and Big Data analytics operate within and about the data network.
● They analyze network security attacks and potential risks immediately, which helps prevent security breaches.
● E.g.: behavior analysis software to prevent the misuse of crucial data.
Implementation
● Network partitioning is crucial in setting up big data environments.
● Partitioning ensures that heavy demands from applications do not impact other mission-critical workloads.
● Prepare now for big data scalability later.
● Yahoo runs more than 42,000 nodes in its big data environment; in 2013, the average number of nodes in a big data cluster was just over 100.
Summary
● Big data enables better analysis and market prediction.
● It helps develop better logistics and accuracy in systems, and reduces redundancy.
● The four characteristic V’s support the management and utilization of massive data.

Editor's Notes

  • #6: It's the information owned by a company, obtained and processed through new techniques to produce value in the best way possible.
  • #14: A problem is broken down into parts that can be solved concurrently. Each part is further broken down into instructions. Instructions execute simultaneously over multiple processors.
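
The decomposition described in note #14 can be sketched in Python. The names chunked and parallel_sum are hypothetical, and summing numbers is an assumed stand-in for a real workload. A thread pool is used only to keep the sketch short; CPU-bound work in CPython would typically use separate processes to run on multiple processors.

```python
from concurrent.futures import ThreadPoolExecutor

def chunked(items, n_parts):
    """Split items into at most n_parts contiguous parts."""
    size = max(1, -(-len(items) // n_parts))  # ceiling division, at least 1
    return [items[i:i + size] for i in range(0, len(items), size)]

def parallel_sum(numbers, n_parts=4):
    """Sum numbers by solving independent parts concurrently."""
    parts = chunked(numbers, n_parts)
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        # Each part is summed independently of the others.
        partial_sums = list(pool.map(sum, parts))
    # Combine the partial results into the final answer.
    return sum(partial_sums)

total = parallel_sum(list(range(1, 101)))
print(total)
```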