
FUNDAMENTALS OF BIG DATA

AND BUSINESS INTELLIGENCE

CHAPTER ONE - INTRODUCTION TO BIG DATA


CONTENTS

 What is Big Data?
 Business Intelligence
 Different Types of Data
 Characteristics of Big Data
 Handling and Processing Big Data
 Challenges of Big Data
WHAT IS BIG DATA?

 Big Data is a field dedicated to the analysis, processing, and
storage of large collections of data that frequently originate from
disparate sources.
 Big Data solutions and practices are typically required when
traditional data analysis, processing and storage technologies and
techniques are insufficient.
 The management and analysis of large datasets has been a long-
standing problem, from the labor-intensive approaches of early
census efforts to the actuarial science behind the calculation of
insurance premiums.
 Big Data science has evolved from these roots.

 The analysis of Big Data datasets is an interdisciplinary endeavor
that blends mathematics, statistics, computer science and subject
matter expertise.
 Data within Big Data environments generally accumulates within
the enterprise via applications, sensors and external sources.
 Data processed by a Big Data solution can be used by enterprise
applications directly or can be fed into a data warehouse to enrich
existing data there.

 The results obtained through the processing of Big Data can lead
to a wide range of insights and benefits, such as:
 operational optimization
 identification of new markets
 accurate predictions
 fault and fraud detection
 more detailed records
 improved decision-making
 scientific discoveries

 As a starting point, several fundamental concepts and terms need
to be defined and understood.
 Datasets : collections or groups of related data are generally
referred to as datasets.
 Data Analysis : data analysis is the process of examining data to
find facts, relationships, patterns, insights and/or trends.
 The overall goal of data analysis is to support better decision
making.
 A simple data analysis example is the analysis of ice cream sales
data in order to determine how the number of ice cream cones
sold is related to the daily temperature.
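The ice cream example can be made concrete in a few lines of Python. The figures below are invented for illustration, and the correlation is computed with the standard library only:

```python
import statistics

# Hypothetical daily observations: temperature (°C) and cones sold
temperatures = [18, 21, 25, 28, 30, 33, 35]
cones_sold = [120, 135, 160, 210, 240, 290, 310]

# Pearson correlation: values near +1 mean sales rise with temperature
n = len(temperatures)
mean_t = statistics.mean(temperatures)
mean_c = statistics.mean(cones_sold)
cov = sum((t - mean_t) * (c - mean_c)
          for t, c in zip(temperatures, cones_sold)) / n
r = cov / (statistics.pstdev(temperatures) * statistics.pstdev(cones_sold))
print(f"correlation between temperature and sales: {r:.2f}")
```

A value close to +1 would support the intuition that hotter days drive higher cone sales; a real analysis would of course use far more data.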

 Data Analytics : Data analytics is a broader term that
encompasses data analysis.
 Data analytics is a discipline that includes the management of the
complete data lifecycle, which encompasses collecting, cleansing,
organizing, storing, analyzing and governing data.
 The term includes the development of analysis methods, scientific
techniques and automated tools.

 There are four general categories of analytics that are
distinguished by the results they produce:
 descriptive analytics
 diagnostic analytics
 predictive analytics
 prescriptive analytics
 The different analytics types leverage different techniques and
analysis algorithms.

 This implies that there may be varying data, storage and
processing requirements to facilitate the delivery of multiple types
of analytic results.
 The generation of high value analytic results increases the
complexity and cost of the analytic environment.

 Descriptive Analytics
 Descriptive analytics are carried out to answer questions about
events that have already occurred.
 This form of analytics contextualizes data to generate information.
 Sample questions can include:
 What was the sales volume over the past 12 months?
 What is the monthly commission earned by each sales agent?
 It is estimated that 80% of generated analytics results are
descriptive in nature.
 The reports are generally static in nature and display historical
data that is presented in the form of data grids or charts.
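A descriptive report such as "sales volume over the past 12 months" is typically a simple aggregation over historical records. A minimal sketch in Python, with the transaction data and the 5% commission rate invented for illustration:

```python
from collections import defaultdict

# Hypothetical transaction log: (month, sales agent, amount)
transactions = [
    ("Jan", "Alice", 1200.0), ("Jan", "Bob", 800.0),
    ("Feb", "Alice", 950.0),  ("Feb", "Bob", 1100.0),
]

# Sales volume per month: a descriptive, backward-looking summary
monthly_volume = defaultdict(float)
for month, agent, amount in transactions:
    monthly_volume[month] += amount

# Commission earned by each agent, assuming a flat 5% rate
commission = defaultdict(float)
for month, agent, amount in transactions:
    commission[agent] += amount * 0.05

print(dict(monthly_volume))  # {'Jan': 2000.0, 'Feb': 2050.0}
print(dict(commission))      # {'Alice': 107.5, 'Bob': 95.0}
```

In practice these results would be rendered as the static grids or charts described above.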

 Diagnostic Analytics
 Diagnostic analytics aim to determine the cause of a phenomenon
that occurred in the past using questions that focus on the reason
behind the event.
 The goal of this type of analytics is to determine what information
is related to the phenomenon in order to enable answering
questions that seek to determine why something has occurred.
 Such questions include:
 Why were Q2 sales less than Q1 sales?
 Why was there an increase in patient re-admission rates over the past
three months?

 Predictive Analytics
 Predictive analytics are carried out in an attempt to determine the
outcome of an event that might occur in the future.
 With predictive analytics, information is enhanced with meaning to
generate knowledge that conveys how that information is related.
 Questions are usually formulated using a what-if rationale, such as
the following:
 What are the chances that a customer will default on a loan if
they have missed a monthly payment?
 What will be the patient survival rate if Drug B is administered
instead of Drug A?
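A predictive question such as loan-default likelihood is commonly answered with a classification model. The sketch below scores one "what-if" scenario with a toy logistic model; the feature names and coefficients are invented for illustration, not learned from any real dataset:

```python
import math

def default_probability(missed_payments: int, debt_to_income: float) -> float:
    """Toy logistic model: P(default) from two hypothetical features."""
    # Invented coefficients; a real model would estimate these from data
    z = -3.0 + 1.2 * missed_payments + 2.5 * debt_to_income
    return 1.0 / (1.0 + math.exp(-z))

# What are the chances of default if the customer missed one payment?
p = default_probability(missed_payments=1, debt_to_income=0.4)
print(f"estimated default probability: {p:.2f}")
```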

 Prescriptive Analytics
 Prescriptive analytics build upon the results of predictive analytics
by prescribing actions that should be taken.
 The focus is not only on which prescribed option is best to follow,
but why.
 In other words, prescriptive analytics provide results that can be
reasoned about because they embed elements of situational
understanding.
 Sample questions may include:
 Among three drugs, which one provides the best results?
 When is the best time to trade a particular stock?
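A prescriptive question like "which of three drugs provides the best results?" can be framed as choosing the option with the best predicted outcome, together with the reason for the choice. A minimal sketch; the outcome figures and the utility weights are assumptions made for illustration:

```python
# Hypothetical predicted outcomes per option
options = {
    "Drug A": {"recovery_rate": 0.72, "side_effect_risk": 0.10},
    "Drug B": {"recovery_rate": 0.81, "side_effect_risk": 0.15},
    "Drug C": {"recovery_rate": 0.78, "side_effect_risk": 0.05},
}

def score(stats):
    # Simple utility: reward recovery, penalize side effects (weight assumed)
    return stats["recovery_rate"] - 0.5 * stats["side_effect_risk"]

best = max(options, key=lambda name: score(options[name]))
# Prescriptive output: not just the choice, but the reasoning behind it
print(f"recommended: {best} (score {score(options[best]):.3f})")
```

Embedding the scoring rule is what makes the result "reasonable about": the recommendation can be traced back to the situational trade-off it encodes.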
BUSINESS INTELLIGENCE

 BI enables an organization to gain insight into the performance of
an enterprise by analyzing data generated by its business
processes and information systems.
 The results of the analysis can be used by management to steer
the business in an effort to correct detected issues or otherwise
enhance organizational performance.
 BI applies analytics to large amounts of data across the enterprise,
which has typically been consolidated into an enterprise data
warehouse to run analytical queries.
DIFFERENT TYPES OF DATA

 The data processed by Big Data solutions can be human-generated
or machine-generated, although it is ultimately the responsibility
of machines to generate the analytic results.
 Human-generated data is the result of human interaction with
systems, such as online services and digital devices.
 Machine-generated data is generated by software programs and
hardware devices in response to real-world events.
 An example of machine-generated data would be information
conveyed from the numerous sensors in a cellphone that may be
reporting information, including position and cell tower signal
strength.

 Structured data
 Structured data conforms to a data model or schema and is often
stored in tabular form.
 It is used to capture relationships between different entities and is
therefore most often stored in a relational database.
 Structured data is frequently generated by enterprise applications.
 Due to the abundance of tools and databases that natively support
structured data, it rarely requires special consideration in regards
to processing or storage.
 Examples of this type of data include banking transactions,
invoices, and customer records.

 Unstructured Data
 Data that does not conform to a data model or data schema is known
as unstructured data.
 It is estimated that unstructured data makes up 80% of the data
within any given enterprise.
 Unstructured data has a faster growth rate than structured data.
 This form of data is either textual or binary and often conveyed via
files that are self-contained and non-relational.
 A text file may contain the contents of various tweets or blog
postings.
 Binary files are often media files that contain image, audio or video
data.

 Semi-structured
 Semi-structured data has a defined level of structure and
consistency, but is not relational in nature.
 Instead, semi-structured data is hierarchical or graph-based.
 This kind of data is commonly stored in files that contain text.
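JSON and XML are common carriers of semi-structured data: the hierarchy is explicit and self-describing, but records need not share a rigid schema. In the example below, two customer records coexist even though they carry different fields:

```python
import json

# Semi-structured: hierarchical, self-describing, but no fixed
# relational schema - records may differ in which fields they carry
raw = """
[
  {"id": 1, "name": "Ada", "orders": [{"sku": "X1", "qty": 2}]},
  {"id": 2, "name": "Lin", "email": "lin@example.com"}
]
"""
customers = json.loads(raw)
for c in customers:
    # Fields that may be absent are accessed defensively
    print(c["name"], c.get("email", "<no email>"))
```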
CHARACTERISTICS OF BIG DATA

 For a dataset to be considered Big Data, it must possess one or
more characteristics that require accommodation in the solution
design and architecture of the analytic environment.

 Volume
 The anticipated volume of data that is processed by Big Data
solutions is substantial and ever-growing.
 High data volumes impose distinct data storage and processing
demands, as well as additional data preparation, curation and
management processes.
 Typical data sources that are responsible for generating high data
volumes can include:
 online transactions, such as point-of-sale and banking
 sensors, such as GPS sensors, smart meters and telematics
 social media, such as Facebook and Twitter

 Velocity
 In Big Data environments, data can arrive at fast speeds, and
enormous datasets can accumulate within very short periods of
time.
 From an enterprise’s point of view, the velocity of data translates
into the amount of time it takes for the data to be processed once
it enters the enterprise’s perimeter.
 Coping with the fast inflow of data requires the enterprise to
design highly elastic and available data processing solutions and
corresponding data storage capabilities.

 Variety
 Data variety refers to the multiple formats and types of data that
need to be supported by Big Data solutions.
 Data variety brings challenges for enterprises in terms of data
integration, transformation, processing, and storage.

 Veracity
 Veracity refers to the quality or fidelity of data.
 Data that enters Big Data environments needs to be assessed for
quality, which can lead to data processing activities to resolve
invalid data and remove noise.
 In relation to veracity, data can be part of the signal or noise of a
dataset.
 Data with a high signal-to-noise ratio has more veracity than data
with a lower ratio.
 Data that is acquired in a controlled manner, for example via online
customer registrations, usually contains less noise than data
acquired via uncontrolled sources, such as blog postings.
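In practice, assessing veracity often means validating incoming records and discarding or flagging noise before analysis. A minimal sketch of such a cleansing pass; the field names and validity rules are assumptions for illustration:

```python
# Hypothetical incoming records; some carry noise (missing/invalid fields)
records = [
    {"customer_id": 101, "age": 34, "email": "a@example.com"},
    {"customer_id": None, "age": -5, "email": ""},          # noise
    {"customer_id": 103, "age": 52, "email": "c@example.com"},
]

def is_valid(rec):
    """Simple veracity check: required fields present and plausible."""
    return (
        rec["customer_id"] is not None
        and 0 <= rec["age"] <= 120
        and "@" in rec["email"]
    )

signal = [r for r in records if is_valid(r)]
noise_ratio = 1 - len(signal) / len(records)
print(f"kept {len(signal)} of {len(records)} records (noise: {noise_ratio:.0%})")
```

Data from controlled sources would pass such checks at a higher rate than data from uncontrolled sources, which is exactly the signal-to-noise distinction made above.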

 Value
 Value is defined as the usefulness of data for an enterprise.
 The value characteristic is intuitively related to the veracity
characteristic in that the higher the data fidelity, the more value it
holds for the business.
 The longer it takes for data to be turned into meaningful
information, the less value it has for a business.
HANDLING AND PROCESSING BIG DATA

 Big data management is the systematic organization,
administration, and governance of massive amounts of data.
 The process includes management of both structured and
unstructured data.
 The primary objective is to ensure the data is of high quality and
accessible for business intelligence and big data analytics
applications.
 Such data may span several terabytes or even petabytes, saved in
a broad range of file formats.
 Effective data management enables the organization to find
valuable information with ease irrespective of how large or
unstructured the data is.

 Here are some ways to handle big data effectively:
 Outline your goal
 The first tick on the checklist when it comes to handling Big Data
is knowing which data to gather and which data actually needs to
be collected.
 To do this, one has to set clearly defined goals.
 Failing to do so will lead to gathering large amounts of data that
are not aligned with the business's ongoing requirements.

 Secure the data
 The next step in managing Big Data is to ensure the relevant data
collected is secured with a broad range of measures.
 To ensure the data is both accessible and secure, it must be
protected by firewalls, spam filtering, malware scanning and
removal, and, most importantly, team permission controls.
 It is wise not to take data management lightly, since securing
organizational data is the highest priority in Big Data
management.

 Keep the data protected
 A database is susceptible not only to threats from human actions
and synthetic anomalies, but also to damage from natural
elements such as heat, humidity, and extreme cold, all of which
can easily corrupt data.
 Organizations have to safeguard databases against adverse
environmental conditions that could corrupt data.
 It is essential to create and maintain an up-to-date backup of the
database elsewhere, in addition to implementing safety
features.

 Data has to be interlinked
 Since organizational databases are bound to be accessed through
a number of channels, it is not advisable to use disconnected
software tools for each required solution.
 In essence, all organizational data must be able to talk to each
other.
 A cloud storage solution is often the best answer to the data
interlinking problem.

 Know the data you need to capture
 Organizations are required to know which data has to be collected
and when.
 Adapt to new challenges
 One of the most important aspects of Big Data management is
keeping up with the latest trends in the field.
 Being flexible and open to new trends and technologies will go a
long way in giving you an edge over the competition.
CHALLENGES OF BIG DATA

 The volume of data is already enormous and increasing every
day.
 The velocity of its generation and growth is increasing.
 The variety of data being generated is also expanding, and
organizations' capability to capture and process this data is
limited.
 Current technology, architecture, management and analysis
approaches are unable to cope with the flood of data, and
organizations will need to change the way they think about, plan,
govern, manage, process and report on data to realize the
potential of big data.

 Data storage and analysis
 The size of data is increasing rapidly through various means such
as mobile devices, aerial sensing technologies, and remote
sensing.
 Some useful data may be deleted because there is no free space to
store such huge volumes.
 Therefore, the first challenge of big data analysis is storage
media and higher input/output speeds.
 In such cases, data accessibility must be the top priority for
knowledge discovery and representation.

 As datasets keep growing, the scale of data mining tasks has
increased significantly.
 This is another big challenge for big data.
 When dealing with large datasets, techniques such as data
reduction, data selection, and feature selection are used.
 Hadoop and MapReduce make it possible to collect large amounts
of semi-structured and unstructured data in a reasonable amount
of time.
 The key engineering challenge is how to effectively analyze this
data to obtain better knowledge.

 Computational complexities and knowledge discovery
 Knowledge discovery and representation is a prime issue in big
data.
 There are several tools for knowledge discovery and
representation, such as fuzzy sets, rough sets, and soft sets.
 Since the size of big data keeps increasing exponentially, the
available tools may not be efficient enough to process this data
to obtain meaningful information.

 Information security
 In big data analysis, massive amounts of data are correlated,
analyzed, and mined for meaningful patterns.
 All organizations have different policies to safeguard their
sensitive information.
 Preserving sensitive information is a major issue in big data
analysis.
 There is a huge security risk associated with big data.

• Data Volume: Managing and Storing Massive Amounts of Data
 Challenge: The most apparent challenge with Big Data is the
sheer volume of data being generated.
 This vast amount of data requires advanced storage infrastructure,
which can be costly and complex to maintain.
 Solution: Adopting scalable cloud storage solutions, such
as Amazon S3, Google Cloud Storage, or Microsoft
Azure, can help manage large volumes of data.

• Data Variety: Handling Diverse Data Types
• Challenge: Big Data encompasses a wide variety of data types,
including structured data (e.g., databases), semi-structured data
(e.g., XML, JSON), and unstructured data (e.g., text, images,
videos).
• The diversity of data types can make it difficult to integrate,
analyze, and extract meaningful insights.
• Solution: To address the challenge of data variety, organizations
can employ data integration platforms and tools like Apache Nifi,
Talend, or Informatica.

• Data Velocity: Processing Data in Real-Time
• Challenge: The speed at which data is generated and needs to be
processed is another significant challenge.
• For instance, IoT devices, social media platforms, and financial
markets produce data streams that require real-time or near-real-
time processing.
• Delays in processing can lead to missed opportunities and
inefficiencies.
• Solution: To handle high-velocity data, organizations can
implement real-time data processing frameworks such as Apache
Kafka, Apache Flink, or Apache Storm.
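Frameworks like Kafka, Flink, and Storm provide these capabilities at scale; the core idea of near-real-time processing can be illustrated with a sliding-window aggregate over a simulated stream. The snippet below is pure Python and stands in for a real stream consumer; the sensor readings are invented:

```python
from collections import deque

def rolling_average(stream, window_size=3):
    """Emit the average of the last `window_size` readings as each arrives."""
    window = deque(maxlen=window_size)  # old readings drop off automatically
    for reading in stream:
        window.append(reading)
        yield sum(window) / len(window)

# Simulated sensor stream; a real deployment would consume from a broker
sensor_stream = [10.0, 12.0, 11.0, 50.0, 13.0]
averages = list(rolling_average(sensor_stream))
print(averages)  # the spike at 50.0 shows up, but smoothed by the window
```

Because each reading is processed as it arrives rather than after the whole batch lands, results are available with bounded delay, which is the property high-velocity workloads need.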

• Data Veracity: Ensuring Data Quality and Accuracy
• Challenge: With Big Data, ensuring the quality, accuracy, and
reliability of data (referred to as data veracity) becomes
increasingly difficult. Inaccurate or low-quality data can lead to
misleading insights and poor decision-making.
• Data veracity issues can arise from various sources, including data
entry errors, inconsistencies, and incomplete data.
• Solution: Implementing robust data governance frameworks is
crucial for maintaining data veracity.
• Tools like Trifacta, Talend Data Quality, and Apache Griffin can help
automate and streamline data quality management processes.

• Data Security and Privacy: Protecting Sensitive Information
• Challenge: As organizations collect and store more data, they
face increasing risks related to data security and privacy.
 Solution: To mitigate security and privacy risks, organizations
must adopt comprehensive data protection strategies.
 This includes implementing encryption, access controls, and
regular security audits.
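One small, concrete piece of such a strategy is never storing credentials in plain text. The sketch below uses only Python's standard library to salt-and-hash a password with PBKDF2; it is a fragment of a protection strategy, not a complete one:

```python
import hashlib
import secrets

def hash_password(password: str):
    """Return (salt, derived key); store these, never the raw password."""
    salt = secrets.token_bytes(16)  # random salt defeats precomputed tables
    key = hashlib.pbkdf2_hmac("sha256", password.encode(), salt, 100_000)
    return salt, key

def verify_password(password: str, salt: bytes, key: bytes) -> bool:
    candidate = hashlib.pbkdf2_hmac("sha256", password.encode(), salt, 100_000)
    # Constant-time comparison avoids timing side channels
    return secrets.compare_digest(candidate, key)

salt, key = hash_password("s3cret!")
print(verify_password("s3cret!", salt, key))  # True
print(verify_password("wrong", salt, key))    # False
```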

• Data Integration: Combining Data from Multiple Sources
• Challenge: Integrating data from various sources, especially
when dealing with legacy systems, can be a daunting task.
• Data silos, where data is stored in separate systems without easy
access, further complicate the integration process, leading to
inefficiencies and incomplete analysis.
• Solution: Data integration platforms like Apache Camel, MuleSoft,
and IBM DataStage can help streamline the process of integrating
data from multiple sources.
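Platforms like these automate the mechanics, but the core operation is joining records from separate silos on a shared key. A minimal sketch in pure Python; the source names and fields are invented for illustration:

```python
# Records from two hypothetical silos, keyed by customer id
crm = {101: {"name": "Ada"}, 102: {"name": "Lin"}}
billing = {101: {"balance": 250.0}, 103: {"balance": 40.0}}

# Full outer join: keep every customer seen in either system
merged = {}
for cid in crm.keys() | billing.keys():
    merged[cid] = {**crm.get(cid, {}), **billing.get(cid, {})}

print(merged[101])  # fields from both systems combined
print(merged[103])  # present in billing only
```

Customers known to only one system still appear in the result, which is precisely what breaking down a data silo means: no record is lost because of where it happened to be stored.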
