Khan E Noor University
Vice Chancellor office
Computer science faculty
IS Department
Introduction to Data WH and Data Mining (Introduction)
Week #2nd
Semester: 7th DB
Lecture: 2nd
Objective(s)
The main objective of this chapter is to introduce the
The basic concept data warehouse/ ing and data mining.
Data warehouse applications
Data warehouse applications
• As discussed before, a data warehouse helps business executives
to organize, analyze, and use their data for decision making.
• A data warehouse serves as a sole part of a plan-execute-assess
"closed-loop" feedback system for the enterprise management.
Data warehouses are widely used in the following fields:
− Financial services
− Banking services
− Consumer goods
− Retail sectors
− Controlled manufacturing
Types of data warehouse’s
applications
• Information Processing – A data warehouse allows to process the
data stored in it. The data can be processed by means of
querying, basic statistical analysis, reporting using crosstabs,
tables, charts, or graphs.
• Analytical Processing – A data warehouse supports
analytical processing of the information stored in it. The data can
be analyzed by means of basic OLAP operations, including slice-
and-dice, drill down, drill up, and pivoting.
Types of data warehouse’s
applications
• Data Mining - Data mining supports knowledge discovery by
finding hidden patterns and associations, constructing analytical
models, performing classification and prediction. These mining
results can be presented using visualization tools.
OLAL(Online analytical processing) OLTP (Online transactions
processing)
OLAL(Online analytical processing) OLTP (Online transactions
processing)
Data mining
• Data mining functions such as association, clustering, classification,
prediction can be integrated with OLAP operations to enhance the
interactive mining of knowledge at multiple level of abstraction. That's why
data warehouse has now become an important platform for data analysis
and online analytical processing.
Understanding a data warehouse
• A data warehouse is a database, which is kept separate from the
organization's operational database.
• There is no frequent مکررupdating done in a data warehouse.
• It possesses بدست آوردنconsolidated historical data, which helps the
organization to analyze its business.
Understanding a data warehouse
• A data warehouse helps executives to organize, understand, and use their
data to take strategic decisions.
• Data warehouse systems help in the integration of diversity of application
systems.
• A data warehouse system helps in consolidated historical data analysis.
What a data warehouse is separated from operational
database
A data warehouses is kept separate from operational databases due to the
following reasons:
• An operational database is constructed for well-known tasks and
workloads such as searching particular records, indexing, etc. In contrast,
data warehouse queries are often complex and they present a general form
of data.
• Operational databases support concurrent processing of multiple
transactions. Concurrency control and recovery mechanisms are required for
operational databases to ensure robustness and consistency of the database.
What a data warehouse is separated from operational
database
• An operational database query allows to read and modify operations, while
an OLAP query needs only read only access of stored data.
• An operational database maintains current data. On the other hand, a data
warehouse maintains historical data.
Features of data warehouse
The key features of a data warehouse are discussed below:
• Subject Oriented - A data warehouse is subject oriented because it provides
information around a subject rather than the organization's ongoing
operations. These subjects can be product, customers, suppliers, sales,
revenue, etc. A data warehouse does not focus on the ongoing operations,
rather it focuses on modelling and analysis of data for decision making.
• Integrated – A data warehouse is constructed by integrating data from
heterogeneous sources such as relational databases, flat files, etc. This
integration enhances the effective analysis of data.
Features of data warehouse
Time Variant - The data collected in a data warehouse is identified with a
particular time period. The data in a data warehouse provides information from
the historical point of view.
Non-volatile - Non-volatile means the previous data is not erased when new
data is added to it. A data warehouse is kept separate from the operational
database and therefore frequent changes in operational database is not
reflected in the data warehouse.
Note: A data warehouse does not require transaction processing, recovery, and
concurrency controls, because it is physically stored and separate from the
operational database.
Features of data warehouse
Time Variant - The data collected in a data warehouse is identified with a
particular time period. The data in a data warehouse provides information from
the historical point of view.
Non-volatile - Non-volatile means the previous data is not erased when new
data is added to it. A data warehouse is kept separate from the operational
database and therefore frequent changes in operational database is not
reflected in the data warehouse.
Note: A data warehouse does not require transaction processing, recovery, and
concurrency controls, because it is physically stored and separate from the
operational database.