0% found this document useful (0 votes)
97 views13 pages

B101 Overview

B101 Overview

Uploaded by

ranusofi
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
97 views13 pages

B101 Overview

B101 Overview

Uploaded by

ranusofi
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 13

Module 1: Teradata Product Overview

After completing this module, you will be able to: Describe the purpose of the Teradata product Give a brief history of the product List major architectural features of the product

What is Teradata?
Teradata is a Relational Database Management System (RDBMS). Designed to run the worlds largest commercial databases.

Preferred solution for enterprise data warehousing Executes on UNIX MP-RAS and Windows 2000 operating systems Compliant with ANSI industry standards Runs on a single or multiple nodes Acts as a database server to client applications throughout the enterprise Uses parallelism to manage terabytes of data Capable of supporting many concurrent users from various client platforms (over a TCP/IP or IBM channel connection).
Win XP

Win 2000

Teradata DATABASE

UNIX Client

Mainframe Client

Teradata A Brief History


1979 Teradata Corp founded in Los Angeles, California Development begins on a massively parallel computer 1982 YNET technology is patented 1984 Teradata markets the first database computer DBC/1012 First system purchased by Wells Fargo Bank of Cal. Total revenue for year - $3 million 1987 First public offering of stock 1989 Teradata and NCR partner on next generation of DBC

1991 NCR Corporation is acquired by AT&T Teradata revenues at $280 million


1992 Teradata is merged into NCR 1996 AT&T spins off NCR Corp. with Teradata product

1997 Teradata database becomes industry leader in data warehousing


2000 100+ Terabyte system in production 2002 Teradata V2R5 released 12/2002; major release including features such as PPI, roles and profiles, multi-value compression, and more. 2003 Teradata V2R5.1 released 12/2003; includes UDFs, BLOBs, CLOBs, and more.

How Large is a Trillion?


1 Kilobyte 1 Megabyte 1 Gigabyte 1 Terabyte 1 Petabyte = 103 = 106 = 109 = 1012 = 1015 = 1000 bytes = 1,000,000 bytes = 1,000,000,000 bytes = 1,000,000,000,000 bytes = 1,000,000,000,000,000 bytes = 11.57 days = 31.6 years = 31,688 years = 15.7 miles = 15,700,000 miles

1 million seconds 1 billion seconds 1 trillion seconds 1 million inches 1 trillion inches

(30 roundtrips to the moon)

1 million square inches = .16 acres = .0002 square miles 1 trillion square inches = 249 square miles (larger than Singapore)
$1 million $1 billion $1 trillion = < $ .01 for every person in U.S. = $ 3.64 for every person is U.S. = $ 3,636 for every person in U.S.

Designed for Todays Business


Teradatas Charter meets the business needs of today and tomorrow with:

Relational database standard for database design

Enormous capacity billions of rows, terabytes of


data

High performance parallel processing Single database server for multiple clients Single
Version of the Truth

Network and mainframe connectivity Industry standard access language Structured


Query Language (SQL)

Manageable growth via modularity Fault tolerance at all levels of hardware and
software

Data integrity and reliability

Evolution of Data Processing


Type
T R A D I T I O N A L

Example

Number of Rows Accessed

Response Time

OLTP

Update a checking account to reflect a deposit


How many child size blue jeans were sold across all of the our Eastern stores in the month of March?

Small

Seconds

DSS

Large

Seconds or minutes

OLCP
T o d a y

Instant credit How much credit can be extended to this person? Show the top ten selling items across all stores for 2003.

Small to moderate; possibly across multiple databases Large number of detail rows or moderate number of summary rows

Minutes

OLAP

Seconds or minutes

The need to process DSS, OLCP, and OLAP type requests across an enterprise and its data leads to the concept of a Data Warehouse.

What is a Data Warehouse?


A Data Warehouse is a central, enterprise-wide database that contains information extracted from Operational Data Stores (ODS).

Based on enterprise-wide model Can begin small but may grow large rapidly Populated by extraction/loading data from operational systems Responds to end-user what if queries Can store detailed as well as summary data

ATM

PeopleSoft

Point of Service (POS)

Operational Data

Data Warehouse

Teradata Database
Teradata Warehouse Miner

Cognos

MicroStrategies

Examples of Access Tools End Users

Data Warehouse Usage Evolution


STAGE 1
REPORTING WHAT happened?

STAGE 2
ANALYZING WHY did it happen?

STAGE 3
PREDICTING WHY will it happen?

STAGE 4
OPERATIONALIZING WHAT IS Happening?

STAGE 5
ACTIVE WAREHOUSING MAKING it happen!

Primarily Batch

Increase in Ad Hoc Queries


Ad Hoc

Analytical Modeling Grows


Analytics

Continuous Update & Time Sensitive Queries Become Important


Continuous Update Short Queries

Event Based Triggering Takes Hold


Event-Based Triggering

Batch

What is Active Data Warehousing?


Data Warehousing is the timely, integrated, logically consistent store of detailed data available for analytic business decision making.

Primarily batch feeds and updates Ad hoc queries to support strategic decisions that return in minutes and maybe
hours

Active Data Warehousing is the timely, integrated, logically consistent store of detailed data available for strategic, tactical driven business decisions.

Timely updates close to real time Short, tactical queries that return in seconds Event driven activity plus strategic queries
Business requirements for an ADW (Active Data Warehouse)?

Performance response within seconds Scalability support for large data volumes, mixed workloads, and concurrent
users Availability 7 x 24 x 365 Data Freshness Accurate, up to the minute, data

Teradatas Competitive Advantages


Unlimited, Proven Scalability amount of data and number of users; allows
for an enterprise wide model of the data.

Unlimited Parallelism parallel access, sorts, and aggregations.


Mature Optimizer handles complex queries, up to 64 joins per query, ad-hoc
processing.

Models the Business 3NF, robust view processing, & provides star schema
capabilities.

Provides a single version of the truth. Low TCO (Total Cost of Ownership) ease of setup, maintenance, &
administration; no re-orgs, lowest disk to data ratio, and robust expansion utility (reconfig).

High Availability no single point of failure. Parallel Load and Unload utilities robust, parallel, and scalable load and
unload utilities such as FastLoad, MultiLoad, TPump, and FastExport.

Teradata Manageability
Things a Teradata DBA never has to do!

Reorganize data or index space

Pre-allocate table/index space, format partitions


Pre-prepare data for loading (convert, sort, split, etc.) Ensure that queries run in parallel Unload/reload data spaces due to expansion Design, implement and support partition schemes. Write or run programs to split the input source files into partitions for loading

A DBA knows that if the data doubles, the system can expand easily to accommodate it. The command and workload for creating a table that will have 100,000 rows is the same as creating a table that will have 1,000,000,000 rows!

Review Questions
1. Name the two primary operating systems that the Teradata RDBMS executes on. _______________________ _______________________ 2. Which of the following represents a trillion bytes or a TB of data? ____ a. b. c. d. 106 109 1012 1015

3. Which feature allows Teradata to process enormous volumes of data quickly? ____ a. b. c. d. High availability software and hardware components Parallelism Proven Scalability High performance servers from Intel

4. The Teradata RBDMS is primary a ____ . a. Server b. Client

Module 1: Review Question Answers


1. Name the two primary operating systems that the Teradata RDBMS executes on. UNIX MP-RAS Windows 2000 2. Which of the following represents a trillion bytes or a TB of data? ____ a. b. c. d. 106 109 1012 1015

3. Which feature allows Teradata to process enormous volumes of data quickly? ____ a. b. c. d. High availability software and hardware components Parallelism Proven Scalability High performance servers from Intel

4. The Teradata RBDMS is primary a ____ . a. Server b. Client

You might also like