0% found this document useful (0 votes)
54 views30 pages

P3 - Data, Preprocessing, Informasi, & Analisis

1. The document discusses data, preprocessing, information, and analysis. It defines key concepts like data, information, the different types and sources of data. 2. It explains how data is raw material that needs to be processed and transformed into information to provide value. The amount of data is increasing rapidly creating both opportunities and challenges for organizations. 3. For organizations to gain insights from big data, they need to implement business intelligence solutions that can integrate, clean, and analyze large volumes of data from various sources to provide actionable information.

Uploaded by

ulfatmi hanifa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views30 pages

P3 - Data, Preprocessing, Informasi, & Analisis

1. The document discusses data, preprocessing, information, and analysis. It defines key concepts like data, information, the different types and sources of data. 2. It explains how data is raw material that needs to be processed and transformed into information to provide value. The amount of data is increasing rapidly creating both opportunities and challenges for organizations. 3. For organizations to gain insights from big data, they need to implement business intelligence solutions that can integrate, clean, and analyze large volumes of data from various sources to provide actionable information.

Uploaded by

ulfatmi hanifa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 30

Data, Preprocessing,

Informasi & Analisis

Pertemuan 3
Just One Word: Data and Big Data
“I just want to say one word to you. Just one word… Are you listening? … Plastics.
There’s a great future in plastics.”
Mr. McGuire in the 1967 movie The Graduate

• No longer plastic
• But, data
• Data is the key, the ticket, and the Holy Grail all rolled into one

Adi Arga Arifnur, M.Kom


Data and Information

• You want to eat a cup of a flour?


• You have to bake it into a cake with butter, eggs, and sugar
• What is this?
• Ingredients?
• Dough?
• Process into dough, then bake it with the right amount of time
and temperature
• Then transform the dough into something delicious

Adi Arga Arifnur, M.Kom


Data and Information

• Ingredients = Raw Material = Data


• Dough = Processes Ingredients = Step = Process
• Cake = Result = Have the taste = Information

• You see the different?


• Data is Raw Materials
• Information is the result of raw material that have been proceed

Adi Arga Arifnur, M.Kom


Data adalah…

• Sekumpulan fakta
• Angka
• Teks
• Gambar
• Suara

Adi Arga Arifnur, M.Kom


Jenis Data

• Berdasarkan Sifat:
• Kualitatif
• Kuantitatif
• Skala Pengukuran
• Nominal
• Ordinal
• Rasio
• Interval
• Sumber
• Primer
• Sekunder

Adi Arga Arifnur, M.Kom


Welcome to The Data Deluge

• In the business world, knowledge is not just power


• Knowledge is a lifeblood (sumber kehidupan) of a thriving
enterprise
• Knowledge helps business people to make informed decisions
• Knowledge comes from information
• Information comes from the data

Data Information Knoweldge

Adi Arga Arifnur, M.Kom


Welcome to The Data Deluge

• Enterprise need information,


• To understand their operations
• To understand their customers
• To understand their competitors
• To understand their suppliers
• To understand their partners
• To understand their employees
• To understand their stockholders
• Enterprise need to learn,
• What is happening in the business
• Analyze their operations
• React to internal and external pressures
• And make decisions that will help them manage costs, grow revenues, and
increase sales and profits

Adi Arga Arifnur, M.Kom


Too Much Data, Too Little Information

• As the sailor in The Rime of the Ancient


Mariner said, “Water, water, everywhere,
nor any drop to drink.”
• How about “Data, data everywhere, but
not any of the right information I need to
do my job?”
• “What’s the use of Big Data if it’s just a lot
of data? We need information we can
analyze, not just a big pile of data.”

Adi Arga Arifnur, M.Kom


Welcome to The Data Deluge

• It can be a problem when there is more data


than an enterprise can handle (overload)
• They collect massive amounts of data every
day internally and externally as they interact
with customers, partners, and suppliers.
• They research and track information on their
competitors and the marketplace.
• They put tracking codes on their websites so
they can learn exactly how many visitors they
get and where they came from.
• They store and track information required by
government regulations and industry
initiatives.
• It is a deluge of data

Adi Arga Arifnur, M.Kom


Data Volume, Variety, and Velocity

• Not only volume is increasing, but also variety and the velocity
of data are increasing
• Volume
• Velocity
• Variety
• Structured
• Semi-structured
• Unstructured

Adi Arga Arifnur, M.Kom


Data Deluge and Analytics

• With this flood of data comes a flood of analytics.


• Decision making still using intuition
• But data needed to validate it
• Analytics use to discover the pattern
• Analytics is not just about numbers; it is about brainpower

Adi Arga Arifnur, M.Kom


The Role Of BI

• BI turns data into “actionable” information


• BI different with operational systems
• Operational Systems
• used to capture the data
• Perform day by day and limited reporting capabilities
• BI
• Used for reporting, querying, and analytics
• Using DW

Adi Arga Arifnur, M.Kom


Where Data Warehousing Fits In
Operational Data Data Warehousing
Operational data is structured for efficiently Data in data warehouses is structured for business
processing and managing business transactions and people to understand and analyze
interactions
Operational systems live in the here and now Data warehousing must support the past,
present, and future
Operational systems record the business event as is Data warehousing tracks changes in dimensions—
products, customers, businesses, geopolitical, account
structures, and organizational hierarchies.

Operational data typically contains a relatively short Analytical data is historical. A business needs to
time span perform period-over-period analysis or examine
trending using
historical data.

Adi Arga Arifnur, M.Kom


The Five Cs Of Data

• Before a BI/DW program can deliver actionable information to business people, it


must whip the enterprise’s data into shape.
• Data that has been whipped into shape will be clean, consistent, conformed,
current, and comprehensive—the five Cs of data.
• Clean—Dirty data has missing items, invalid entries, and other problems that wreak
havoc (malapetaka)
with automated data integration and data analysis.
• Consistent—there should be no arguments about whose version of the data is the
correct one. Example: perhitungan yang berbeda
• Conformed(Selaras)—the business needs to analyze the data across common,
shareable dimensions if business people across the enterprise are to use the same
information for their decision-making.
• Current—the business needs to base decisions on whatever currency is necessary for
that type of
decision. In some cases, such as detecting credit card fraud, the data needs to be up to
the minute (up to date).
• Comprehensive—business people should have all the data they need to do their jobs—
regardless of where the data came from and its level of granularity (lengkap).

Adi Arga Arifnur, M.Kom


Preprocessing

• Data Cleaning
• Missing values can do this: Ignore the tuple, fill the
missing value manually, use global constant to fill,
using mean to fill, use mean for category, use most
probable value (regression, Bayesian, decision tree)
• Noisy Data, using smoothing techniques such as
binning (mean, median, boundaries), regression,
clustering.
• Data Integration
• Data Transformation
• Smoothing, Aggregation, Generalization,
Normalization, Attribute Construction
• Data Reduction
• Data Cube Aggregation, Attribute subset selection,
Dimensionality Reduction, Numerosity reduction,
discretization and hierarchy generation

Adi Arga Arifnur, M.Kom


Adi Arga Arifnur, M.Kom
Big Picture of BI
• The three core building blocks of a DW/BI program
• Data integration—combining data from different sources and bringing it together to ultimately
provide a unifed view.
• If an enterprise has inconsistent data, it is highly likely that it has a data integration problem.
• The components of data integration include the data sources; the processes to gather, consolidate,
transform, cleanse, and aggregate data and metadata; standards; tools; and resources and skills.
• Data warehousing—the process of storing and staging information, separate from an
enterprise’s day-to-day transaction processing operations, and optimizing it for access and
analysis in an enterprise.
• In this process, data flows from data producers to the data warehouse, where it is transformed into
information for business consumers.
• It encompasses all the data transformations, cleansing, flterring, and aggregations
• A centralized database.
• In the classic defnition from Bill Inmon’s book Building the Data Warehouse it is:
• Integrated—data gathered and made consistent from one or more source systems
• Subject oriented—organized by data subject rather than by application
• Time variant—historical data is stored (Note: in the beginning, enterprise applications often only stored a limited amount of
current and historical data online.)
• Nonvolatile—data did not get modifed in the DW, it was read-only.

Adi Arga Arifnur, M.Kom


Big Picture of BI

• BI—to present data to business people so they can use it to gain


knowledge.
• BI enables access and delivery of information to business users.
• BI is what business people see via tools and dashboards.
• The data comes from relational data sources or enterprise applications such as
enterprise resource planning (ERP), customer resource management (CRM), and
SCM.

Adi Arga Arifnur, M.Kom


Adi Arga Arifnur, M.Kom
BI JUSTIFICATIONS

Adi Arga Arifnur, M.Kom


WHY JUSTIFICATION IS NEEDED

• Needs to make both the business and technical case to :


• determine the need, identify the benefits, and, most importantly, set
expectations.
• BI team needs to estimate scope, costs, schedule, and a return
on investment (ROI), identifying risks and an organization’s
readiness.
• Justification can help prevent your project from being labeled a
failure.
• Justification type:
• Business Case
• Technical Case

Adi Arga Arifnur, M.Kom


BUILDING THE BUSINESS CASE

• The business case needs to answer the following questions:


• What is the business process
• What business problems or opportunities are being addressed?
• Who will use it?
• What are the anticipated business benefits?

Adi Arga Arifnur, M.Kom


BUILDING THE TECHNICAL CASE

• Technology And Product Short Lists

Adi Arga Arifnur, M.Kom


BUILDING THE TECHNICAL CASE

• Convincing Business People


• Convincing Technologies
• Determine the technologies required
• Create a short list of product candidates
• Assessing Readiness
• Data
• Expertise and experience
• Analytical commitment
• Organizational and cultural change
• Financial commitment
• Creating A BI Road Map
• Developing Scope, Preliminary Plan, And Budget

Adi Arga Arifnur, M.Kom


DEFINING REQUIREMENTS -
BUSINESS, DATA & QUALITY

Adi Arga Arifnur, M.Kom


THE PURPOSE OF DEFINING
REQUIREMENTS
• Can’t take shortcuts and skip the justification process
• Defining requirements creates the foundation of a successful
business intelligence (BI) solution by documenting what you are
planning to build.
• The development team then uses these requirements to design,
develop, and deploy BI systems.
• Include:
• Goals
• Deliverable: The primary deliverable is a set of requirements that business
sponsors, stakeholders, and IT management have agreed upon.
• Roles

Adi Arga Arifnur, M.Kom


Deliverable

Adi Arga Arifnur, M.Kom


Tugas Individu

1. Data dan Informasi


a. Data dan Jenisnya
b. Perbedaanya dengan informasi
c. Informasi Berkualitas dan Perlunya Informasi
2. Big Data dan Karakteristiknya. Bagaimana hubungannya Business Intelligence?
3. Apa itu Data Warehouse dan apa perbedaannya dengan Data Operasional?
4. Jelaskan 5 karakteristik data berikut:
a. Clean
b. Consistent
c. Conformed
d. Current
e. Comprehensive

Adi Arga Arifnur, M.Kom


TERIMA KASIH

You might also like