B101 Overview
B101 Overview
After completing this module, you will be able to: Describe the purpose of the Teradata product Give a brief history of the product List major architectural features of the product
What is Teradata?
Teradata is a Relational Database Management System (RDBMS). Designed to run the worlds largest commercial databases.
Preferred solution for enterprise data warehousing Executes on UNIX MP-RAS and Windows 2000 operating systems Compliant with ANSI industry standards Runs on a single or multiple nodes Acts as a database server to client applications throughout the enterprise Uses parallelism to manage terabytes of data Capable of supporting many concurrent users from various client platforms (over a TCP/IP or IBM channel connection).
Win XP
Win 2000
Teradata DATABASE
UNIX Client
Mainframe Client
1 million seconds 1 billion seconds 1 trillion seconds 1 million inches 1 trillion inches
1 million square inches = .16 acres = .0002 square miles 1 trillion square inches = 249 square miles (larger than Singapore)
$1 million $1 billion $1 trillion = < $ .01 for every person in U.S. = $ 3.64 for every person is U.S. = $ 3,636 for every person in U.S.
High performance parallel processing Single database server for multiple clients Single
Version of the Truth
Manageable growth via modularity Fault tolerance at all levels of hardware and
software
Example
Response Time
OLTP
Small
Seconds
DSS
Large
Seconds or minutes
OLCP
T o d a y
Instant credit How much credit can be extended to this person? Show the top ten selling items across all stores for 2003.
Small to moderate; possibly across multiple databases Large number of detail rows or moderate number of summary rows
Minutes
OLAP
Seconds or minutes
The need to process DSS, OLCP, and OLAP type requests across an enterprise and its data leads to the concept of a Data Warehouse.
Based on enterprise-wide model Can begin small but may grow large rapidly Populated by extraction/loading data from operational systems Responds to end-user what if queries Can store detailed as well as summary data
ATM
PeopleSoft
Operational Data
Data Warehouse
Teradata Database
Teradata Warehouse Miner
Cognos
MicroStrategies
STAGE 2
ANALYZING WHY did it happen?
STAGE 3
PREDICTING WHY will it happen?
STAGE 4
OPERATIONALIZING WHAT IS Happening?
STAGE 5
ACTIVE WAREHOUSING MAKING it happen!
Primarily Batch
Batch
Primarily batch feeds and updates Ad hoc queries to support strategic decisions that return in minutes and maybe
hours
Active Data Warehousing is the timely, integrated, logically consistent store of detailed data available for strategic, tactical driven business decisions.
Timely updates close to real time Short, tactical queries that return in seconds Event driven activity plus strategic queries
Business requirements for an ADW (Active Data Warehouse)?
Performance response within seconds Scalability support for large data volumes, mixed workloads, and concurrent
users Availability 7 x 24 x 365 Data Freshness Accurate, up to the minute, data
Models the Business 3NF, robust view processing, & provides star schema
capabilities.
Provides a single version of the truth. Low TCO (Total Cost of Ownership) ease of setup, maintenance, &
administration; no re-orgs, lowest disk to data ratio, and robust expansion utility (reconfig).
High Availability no single point of failure. Parallel Load and Unload utilities robust, parallel, and scalable load and
unload utilities such as FastLoad, MultiLoad, TPump, and FastExport.
Teradata Manageability
Things a Teradata DBA never has to do!
A DBA knows that if the data doubles, the system can expand easily to accommodate it. The command and workload for creating a table that will have 100,000 rows is the same as creating a table that will have 1,000,000,000 rows!
Review Questions
1. Name the two primary operating systems that the Teradata RDBMS executes on. _______________________ _______________________ 2. Which of the following represents a trillion bytes or a TB of data? ____ a. b. c. d. 106 109 1012 1015
3. Which feature allows Teradata to process enormous volumes of data quickly? ____ a. b. c. d. High availability software and hardware components Parallelism Proven Scalability High performance servers from Intel
3. Which feature allows Teradata to process enormous volumes of data quickly? ____ a. b. c. d. High availability software and hardware components Parallelism Proven Scalability High performance servers from Intel