0% found this document useful (0 votes)
7 views20 pages

FD Unit 2

Uploaded by

kishankumarms51
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views20 pages

FD Unit 2

Uploaded by

kishankumarms51
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

Unit-II

DATA WAREHOUSE
Introduction to Data Warehouse
Data Warehouse is a relational database management system (RDBMS) construct to meet the requirements of
transaction processing systems.

It can be described as any centralized data repository.

It is a database that stores information oriented to satisfy decision-making requests.

It is a group of decision-support technologies, to make superior and higher decisions.

The Data Warehouse environment contains an extraction, transportation, and loading (ETL) solution, an
online analytical processing (OLAP) engine, customer analysis tools, and other applications that handle the
process of gathering information and delivering it to business users.
Data Warehouse

A data warehouse is a subject-oriented, integrated,

time-variant and non-volatile collection of data in

support of management's decision making process.


Definition
A Data Warehouse (DW) is a relational database that is designed for query and analysis rather than
transaction processing. It includes historical data derived from transaction data from single and
multiple sources.
A Data Warehouse can be viewed as a data system with the following attributes:

o It is a database designed for investigative tasks, using data from various applications.
o It supports a relatively small number of clients with relatively long interactions.
o It includes current and historical data to provide a historical perspective of information.
o Its usage is read-intensive.
o It contains a few large tables.
Characteristics of Data Warehouse
Subject-Oriented
Subject-Oriented: A data warehouse can be used to analyze a particular subject area. For example,
"sales" can be a particular subject.
Integrated
A data warehouse integrates various heterogeneous data sources like RDBMS, flat files, and online
transaction records. It requires performing data cleaning and integration during data warehousing to
ensure consistency in naming conventions, attributes types, etc., among different data sources.
Time-Variant

Historical information is kept in a data warehouse. For example, one can retrieve files from 3
months, 6 months, 12 months, or even previous data from a data warehouse. These variations
with a transactions system, where often only the most current file is kept.
Non-Volatile
Non-volatile: Once data is in the data warehouse, it will not change. So, historical data in a data
warehouse should never be altered.
Data Warehousing Objectives

• To support both analysis and reporting


• Preserve the historical data of the organisation.
• serve as the basis for choices.
Data Warehouse Requirement

Data Warehouse is needed for the following reasons:

1) Business User: To view historical data that has been summarised, business users need access to a data
warehouse. Given that these individuals lack technological expertise, the information could be conveyed to
them in a simple format.

2) Archive historical data: Historical time-variable data must be stored in a data warehouse. There are
multiple uses intended for this input.

3) Make strategic choices: Depending on the information in the data warehouse, some tactics may be
implemented. Thus, data warehouses aid in the process of making strategic choices.

4) For data quality and consistency: By combining data from several sources into one location, the user
can efficiently work to improve the uniformity and consistency of data.

5) Quick response time: The data warehouse must be prepared for occasionally unforeseen loads.
Data Warehouse Requirement
Advantages of Data Warehouse

• Gain insight into company trends to improve your forecasts.


• Massive volumes of data are handled efficiently by data warehouses by design.
• End users can more easily traverse, comprehend, and query data warehouse structures.
• In data warehouses, it may be simpler to create and maintain queries that would be difficult in many
normalised databases.
• One effective way to handle the need for large amounts of information from numerous consumers is
through data warehousing.
• An enormous amount of historical data may be analysed thanks to data warehousing.
Foundational Pieces of a Data Warehouse or Component of
Data Warehouse
Data Warehouse Models:
There are three data warehouse models.

1. Enterprise warehouse
2.Data mart
3.Virtual warehouse
Enterprise warehouse:

• An enterprise warehouse collects all of the information about subjects spanning the entire
organization.
• It provides corporate-wide data integration, usually from one or more operational systems or
external information providers, and is cross-functional in scope.
• It typically contains detailed data as well as summarized data, and can range in size from a few
gigabytes to hundreds of gigabytes, terabytes, or beyond.
• An enterprise data warehouse may be implemented on traditional mainframes, computer super
servers, or parallel architecture platforms. It requires extensive business modelling and may
take years to design and build.
Data mart:

• A d ata mar t contains a subs e t of corpora te - wide da ta tha t is of va lue to a sp e cific


group of user s. the scop e is conf ine d to spe c ific se le cted subje cts . for exa mp le , a
ma rk eting data mart may conf ine its subje cts to cu sto me r, ite m, and sa les . the da ta
c o n ta ined i n d ata ma r ts te n d to b e s u mma r iz e d .
• D a ta mar ts ar e u s u a lly imp le me n te d o n lo w - c ost d e p a rtme ntal s e r v ers th a t a r e
• Un ix/L inux - or windo ws -b as ed . th e imp le me nta tion c ycle of a d ata mar t is more
lik ely to be measured in w eek s ra th er than mon ths or ye ars. how ev er, it ma y
invo lv e comp lex in tegra tion in the long run if its des ign and p lanning wer e not
en ter prise -wide.
• D ep ending on the sourc e of da ta , d ata mar ts can be c ate gorized as indep end ent
mo re d epend ent . indepe nden t da ta ma r ts ar e sourc ed fro m da ta c apture d fro m on e
or more op era tion al s ys te ms or externa l infor ma tion provid ers , or fro m d ata
g ener ated lo cally w ithin a p articular d ep artme n t or geograph ic are a . d epend ent
d ata mar ts ar e s o u rce d ir ectly f r om en terprise d ata w ar ehous es .
Virtual warehouse:
A virtual warehouse is a set of views over operational databases. For efficient query
processing, only some of the possible summary views may be materialized.
A virtual warehouse is easy to build but requires excess capacity on operational database
servers.

You might also like