0% found this document useful (0 votes)
87 views17 pages

01 DataStage Overview

This document provides an overview of DataStage, describing it as an ETL tool used to extract, transform, integrate and load data. It defines key DataStage concepts like data warehouses, data marts, projects and jobs. It describes the server components of DataStage including the Administrator, Director, Designer and Manager and explains how they are used to develop ETL processes in DataStage.

Uploaded by

Bhaskar Reddy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
87 views17 pages

01 DataStage Overview

This document provides an overview of DataStage, describing it as an ETL tool used to extract, transform, integrate and load data. It defines key DataStage concepts like data warehouses, data marts, projects and jobs. It describes the server components of DataStage including the Administrator, Director, Designer and Manager and explains how they are used to develop ETL processes in DataStage.

Uploaded by

Bhaskar Reddy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 17

Data Stage Overview

Objectives

 Describe a data warehouse or data mart


 Enterprise Data Integration
 Describe DataStage
 History of Datastage
 Identify the server and client components of
DataStage
 Describe DataStage projects
 Describe DataStage jobs
 Identify the steps for designing a DataStage job
What is a Data Warehouse?

 Repository of data
 Optimized for report generation
 Supports business analysis
 Projections
 Comparisons
 Assessments
 Extracted from operational sources
Integrated Summarized Filtered
Cleansed De-normalized Historical
Data Marts

 Like data warehouses but smaller in scope


 Organize data from a single subject area or
department
 Solve a small set of business requirements
 Cheaper and faster to build
Enterprise Data-Integration
DataStage

 With DataStage you can:


 Design jobs that extract, integrate, aggregate, transform
data
 Create, manage, and reuse metadata
 Run, monitor, and schedule jobs
 Manage your development environment
History Of Datastage

 Datastage was started in 1997 by company called


V-Mark.
 Later was taken over by Ardent , which in turn was
taken over by Informix.
 Current release is Datastage 6 (Viper) from
Ascential Software.
DataStage Application Components
DataStage Administrator

User
privileges

Connection
License timeout
info
DataStage Administrator

Permissions

Job
scheduling
User
privileges
DataStage Director

 Validate jobs
 Run jobs
 Monitor jobs
 Schedule jobs
 Gather statistics
DataStage Designer

 Specify extraction, transformation


 Denormalize (decode) data
 Aggregate data
 Split data
DataStage Manager

 Store metadata
 Reuse metadata
 Define routines
Development in DataStage

 Define project properties: Administrator


 Open project
 Design jobs: Designer
 Import metadata: Manager
 Define extractions, data flows, integrations
 Define transformations, constraints, aggregations
 Define loads
 Compile and debug jobs: Designer
 Run and monitor jobs: Director
DataStage Projects

 Created during installation


 Associated with a directory
 Must attach to
 Self-contained
 Multiple users can be working at the same time
DataStage Jobs
Compile
Debug
Passive
stage Active
stage

Lookup

Link

You might also like