unit 4 part 4

A distributed database system consists of multiple interrelated databases spread across a network, managed by a distributed database management system. It can be homogeneous, with identical software across sites, or heterogeneous, where different schemas and software are used. Key concepts include local and global transactions, data replication, fragmentation, and transparency, which are essential for efficient data management and user interaction in distributed environments.

Uploaded by

bhavyagu12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views20 pages

unit 4 part 4

Uploaded by

bhavyagu12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

DISTRIBUTED DATABASES

Dr. Avdhesh Gupta

Professor
Department of Information Technology
AKGEC, Ghaziabad
Distributed Database System
• A distributed database is a collection of
multiple, logically inter related databases
distributed over a computer network

• A distributed database management

system is a software system that permits
the management of distributed database
• Data spread over multiple computers (also
referred to as sites or nodes).
• Network interconnects the computers
• Data shared by users on multiple computers
Homogeneous Distributed Databases
• In a homogeneous distributed database
– All sites have identical software
– Are aware of each other and agree to
cooperate in processing user requests.
– Appears to user as a single system
– Goal: provide a view of a single database,
hiding details of distribution
• In a heterogeneous distributed database
– Different sites may use different schemas and
software
• Difference in schema is a major problem for query
processing
• Difference in software is a major problem for
transaction processing
– Sites may not be aware of each other and may
provide only limited facilities for cooperation in
transaction processing
• Goal: integrate existing databases to provide useful functionality
Local and Global Transactions

– A local transaction accesses data in the single site at which the transaction
was initiated.
– A global transaction either accesses data in a site different from the one at
which the transaction was initiated or accesses data in several different sites.
Distributed Data Storage
• Assume relational data model
• Replication
– System maintains several identical copies or
replicas of data, stored in different sites
• Fragmentation
– Relation is partitioned into several fragments
stored in distinct sites
• Replication and fragmentation can be
combined
– Relation is partitioned into several fragments:
system maintains several identical replicas of
each such fragment.
Data Replication
• A relation or fragment of a relation is said
to be replicated if it is stored redundantly
in two or more sites.
• Full replication of a relation is the case
where the relation is stored at all sites.
• Partial Replication is where fragments are
replicated at different sites, but each site
does not contain all the fragments
Data Fragmentation
• Division of relation r into fragments r1,
r2, …, rn which contain sufficient
information to reconstruct relation r.
• Two types
– Horizontal Fragmentation
– Vertical Fragmentation
– Mixed Fragmentation
• Horizontal fragmentation: each tuple of r is
assigned to one or more fragments
– Each tuple of the global relation must be present
in atleast one fragment
– Usually tuples are kept at sites where they may be
used most to minimize data transfer
– The fragments may be
• Disjoint : A tuple appears in only one fragment
• Overlapping : A tuple appears in more than one
fragment
Horizontal Fragmentation of account Relation

branch_name account_number balance

Hillside A-305 500

Hillside A-226 336
Hillside A-155 62

account1 = σbranch_name=“Hillside” (account )

branch_name account_number balance

Valleyview A-177 205

Valleyview A-402 10000
Valleyview A-408 1123
Valleyview A-639 750

account2 = σbranch_name=“Valleyview” (account )

• Vertical fragmentation: the schema for
relation r is split into several smaller schemas
(fragmentation on the basis of attributes)
– Each attribute to be present in atleast one
fragment
– All schemas must contain a common primary key
to ensure lossless join property.
– A special attribute, the tuple-id attribute may be
added to each schema to serve as a candidate key.
Vertical Fragmentation of employee_info Relation
branch_name customer_name tuple_id

Hillside Lowman 1
Hillside Camp 2
Valleyview Camp 3
Valleyview Kahn 4
Hillside Kahn 5
Valleyview Kahn 6
Valleyview Green 7
deposit1 = Πbranch_name, customer_name, tuple_id (employee_info )
account_number balance tuple_id

A-305 500 1
A-226 336 2
A-177 205 3
A-402 10000 4
A-155 62 5
A-408 1123 6
A-639 750 7
deposit2 = Πaccount_number, balance, tuple_id (employee_info )
Mixed Fragmentation
Horizontal fragmentation followed by vertical
fragmentation
Vertical fragmentation followed by horizontal
fragmentation
Data Transparency
• Data transparency: Degree to which
system user may remain unaware of the
details of how and where the data items
are stored in a distributed system
• Consider transparency issues in relation
to:
– Fragmentation transparency
– Replication transparency
– Location transparency
• Fragmentation Transparency
– Users are unaware of how a relation has been
fragmented
• Replication Transparency
– Users are unaware of what data objects have been
replicated and where the replicas have been places
• Location Transparency
– Users are unaware of the physical location of the data
Naming of Data Items - Criteria
1. Every data item must have a system-wide
unique name.
2. It should be possible to find the location of
data items efficiently.
3. It should be possible to change the location
of data items transparently.
4. Each site should be able to create new data
items autonomously.
Centralized Scheme - Name Server
• Structure:
– name server assigns all names
– each site maintains a record of local data items
– sites ask name server to locate non-local data items
• Advantages:
– satisfies naming criteria 1-3
• Disadvantages:
– does not satisfy naming criterion 4
– name server is a potential performance bottleneck
resulting in poor performance
– name server is a single point of failure, if it crashes then
the sites will not run
Use of Aliases
• Alternative to centralized scheme: each site prefixes its
own site identifier to any name that it generates i.e.,
site 17.account.
– Fulfills having a unique identifier, and avoids problems
associated with central control.
– However, fails to achieve network transparency.

• Solution: Create a set of aliases for data items; Store

the mapping of aliases to the real names at each site.
• Users use the alias names
• The user can be unaware of the physical location of a
data item, and is unaffected if the data item is moved
from one site to another.
Distributed Transactions
• Transaction may access data at several sites.
• Each site has a local transaction manager responsible for:
– Maintaining a log for recovery purposes
– Participating in coordinating the concurrent execution of the
transactions executing at that site.
• Each site has a transaction coordinator, which is
responsible for:
– Starting the execution of transactions that originate at the site.
– Distributing subtransactions at appropriate sites for execution.
– Coordinating the termination of each transaction that originates
at the site, which may result in the transaction being committed
at all sites or aborted at all sites.

Rohini College of Engineering & Technology: Cs3492-Database Management Systems
No ratings yet
Rohini College of Engineering & Technology: Cs3492-Database Management Systems
4 pages
Chapter 7
No ratings yet
Chapter 7
26 pages
CH 22
No ratings yet
CH 22
93 pages
5 Chapter Five
No ratings yet
5 Chapter Five
29 pages
50 Excel Interview Questions and Answers
No ratings yet
50 Excel Interview Questions and Answers
4 pages
DDB Slides
No ratings yet
DDB Slides
67 pages
Distributed Database Management Systems: Week-4
No ratings yet
Distributed Database Management Systems: Week-4
24 pages
Distributed Databases
No ratings yet
Distributed Databases
53 pages
DBMS
No ratings yet
DBMS
17 pages
CH 19
No ratings yet
CH 19
27 pages
Manual Controlador Governor Eng
No ratings yet
Manual Controlador Governor Eng
194 pages
Chapter 22: Distributed Databases
No ratings yet
Chapter 22: Distributed Databases
91 pages
Data Communication Basics CH 7
No ratings yet
Data Communication Basics CH 7
27 pages
Week10DatabaseTerminology 38c594f2 f34d 431e 82f5 074ebff1acad 170579
No ratings yet
Week10DatabaseTerminology 38c594f2 f34d 431e 82f5 074ebff1acad 170579
30 pages
Distributed Data Management: Distributed Systems Department of Computer Science UC Irvine
No ratings yet
Distributed Data Management: Distributed Systems Department of Computer Science UC Irvine
67 pages
Distributed Databases , NOSQL Systems and BIGDATA
No ratings yet
Distributed Databases , NOSQL Systems and BIGDATA
62 pages
Unit 5
No ratings yet
Unit 5
17 pages
4.1 Lecture 4 Distributed Databases
No ratings yet
4.1 Lecture 4 Distributed Databases
42 pages
17 DatabaseArchitectures
No ratings yet
17 DatabaseArchitectures
41 pages
Unit i Distributed Databases
No ratings yet
Unit i Distributed Databases
15 pages
Distributed Databases and Client-Server Architectures
No ratings yet
Distributed Databases and Client-Server Architectures
41 pages
Chapter 19: Distributed Databases
No ratings yet
Chapter 19: Distributed Databases
95 pages
Chapter 6
No ratings yet
Chapter 6
45 pages
Unit 5 Notes
No ratings yet
Unit 5 Notes
30 pages
Chapter -7 Distributed Database System
No ratings yet
Chapter -7 Distributed Database System
29 pages
Concurrency Control in Distributed Datab
No ratings yet
Concurrency Control in Distributed Datab
5 pages
Chapter 4 - Distributed Database System
No ratings yet
Chapter 4 - Distributed Database System
52 pages
Distributed Database Frank Chinembiri and Florence-2
No ratings yet
Distributed Database Frank Chinembiri and Florence-2
42 pages
Distributed Database Transparency Features
No ratings yet
Distributed Database Transparency Features
6 pages
DistributedDatabases 3
No ratings yet
DistributedDatabases 3
14 pages
DBMS Lecture 10
No ratings yet
DBMS Lecture 10
12 pages
DDIS U1-3
No ratings yet
DDIS U1-3
40 pages
Distributed Databases: Centralized Database System Distributed Database System Advantages and Disadvantages of DDBMS
No ratings yet
Distributed Databases: Centralized Database System Distributed Database System Advantages and Disadvantages of DDBMS
26 pages
Distributed Databases: CMP-3440 - Database Systems
No ratings yet
Distributed Databases: CMP-3440 - Database Systems
12 pages
Unit V
No ratings yet
Unit V
22 pages
Distributed Databases and Client-Server Architectures
No ratings yet
Distributed Databases and Client-Server Architectures
41 pages
Chapter 22: Distributed Databases
No ratings yet
Chapter 22: Distributed Databases
10 pages
Distributed Databases
No ratings yet
Distributed Databases
12 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
ADS Chapter 7 Distributed Database
No ratings yet
ADS Chapter 7 Distributed Database
16 pages
Chapter 5 - Distributed Databases Roobera
No ratings yet
Chapter 5 - Distributed Databases Roobera
58 pages
Distributed Database
No ratings yet
Distributed Database
23 pages
Final
No ratings yet
Final
46 pages
8 Distributed Databases
No ratings yet
8 Distributed Databases
13 pages
Distributed Database Concepts
No ratings yet
Distributed Database Concepts
52 pages
Lecture 2 Distriburted Databases
No ratings yet
Lecture 2 Distriburted Databases
45 pages
DD Design
No ratings yet
DD Design
17 pages
Chapter 7 - Distributed Database System
No ratings yet
Chapter 7 - Distributed Database System
27 pages
Distributed Databases
No ratings yet
Distributed Databases
46 pages
Adb CH 4
No ratings yet
Adb CH 4
14 pages
Unit-1 Transparency in DDBMS
No ratings yet
Unit-1 Transparency in DDBMS
15 pages
Q # 1: What Are The Components of Distributed Database System? Explain With The Help of A Diagram. Answer
No ratings yet
Q # 1: What Are The Components of Distributed Database System? Explain With The Help of A Diagram. Answer
12 pages
DDB Slides
No ratings yet
DDB Slides
30 pages
Distributed Databases: by Chien-Pin Hsu CS157B Section 1 Nov 11, 2004
No ratings yet
Distributed Databases: by Chien-Pin Hsu CS157B Section 1 Nov 11, 2004
24 pages
Distributed DB
No ratings yet
Distributed DB
16 pages
Distributed Database: Database Storage Devices CPU Database Management System Computers Network
No ratings yet
Distributed Database: Database Storage Devices CPU Database Management System Computers Network
9 pages
DISTRIBUTED DATABASES Presentation
No ratings yet
DISTRIBUTED DATABASES Presentation
13 pages
DBMS-Unit 5
No ratings yet
DBMS-Unit 5
27 pages
Distributed Database Systems: January 2002
No ratings yet
Distributed Database Systems: January 2002
25 pages
A Distributed Database Management System ('DDBMS') Is A Software System
No ratings yet
A Distributed Database Management System ('DDBMS') Is A Software System
5 pages
ROUTER 7705 - SAR - HM
100% (1)
ROUTER 7705 - SAR - HM
21 pages
M.E-CSE Anna University
No ratings yet
M.E-CSE Anna University
25 pages
Karnataka Listofcolleges
No ratings yet
Karnataka Listofcolleges
60 pages
COD 103 Creating Software Security Requirements
No ratings yet
COD 103 Creating Software Security Requirements
20 pages
Qdoc - Tips - Manual Fallas Terex
No ratings yet
Qdoc - Tips - Manual Fallas Terex
184 pages
Database Management System
No ratings yet
Database Management System
51 pages
Francois Fleuret - C++ Lecture Notes
No ratings yet
Francois Fleuret - C++ Lecture Notes
146 pages
Chapter3 1
No ratings yet
Chapter3 1
13 pages
1 - Theory and Problems of Digital Principles
No ratings yet
1 - Theory and Problems of Digital Principles
13 pages
Library Automation IIMT JAVA BCA
No ratings yet
Library Automation IIMT JAVA BCA
40 pages
serverless-etl-aws-glue
No ratings yet
serverless-etl-aws-glue
17 pages
ALE Quick Reference
No ratings yet
ALE Quick Reference
9 pages
AWS Project Report
No ratings yet
AWS Project Report
6 pages
Network Models
No ratings yet
Network Models
11 pages
Internship Report Ii
No ratings yet
Internship Report Ii
9 pages
Startup Shutdown HPE Simplivity
No ratings yet
Startup Shutdown HPE Simplivity
6 pages
Assembly Language Lecture 5
No ratings yet
Assembly Language Lecture 5
20 pages
Lab No.09 Title: Register File
No ratings yet
Lab No.09 Title: Register File
5 pages
Panini Scanner
No ratings yet
Panini Scanner
2 pages
Ds Complete Data Protection
No ratings yet
Ds Complete Data Protection
4 pages
User Guide - ENG
No ratings yet
User Guide - ENG
2 pages
JD for NW Engineer
No ratings yet
JD for NW Engineer
2 pages
Question Paper Winter 2019
No ratings yet
Question Paper Winter 2019
3 pages
More BYO Modem Setup FTTN
No ratings yet
More BYO Modem Setup FTTN
2 pages
Pyautogui: Keyboard and Mouse Control
No ratings yet
Pyautogui: Keyboard and Mouse Control
1 page
BT0065
No ratings yet
BT0065
1 page
640 802 Exam Topics
No ratings yet
640 802 Exam Topics
3 pages
AP SBTET VI Sem Data Communication & Computer Networks (C09) June 2019 QP
No ratings yet
AP SBTET VI Sem Data Communication & Computer Networks (C09) June 2019 QP
2 pages
Active Directory Disaster Recovery
From Everand
Active Directory Disaster Recovery
Florian Rommel
No ratings yet
Learn SAP Basis in 24 Hours
From Everand
Learn SAP Basis in 24 Hours
Alex Nordeen
4.5/5 (2)

unit 4 part 4

Uploaded by

unit 4 part 4

Uploaded by

DISTRIBUTED DATABASES

Dr. Avdhesh Gupta

• A distributed database management

branch_name account_number balance

Hillside A-305 500

account1 = σbranch_name=“Hillside” (account )

branch_name account_number balance

Valleyview A-177 205

account2 = σbranch_name=“Valleyview” (account )

• Solution: Create a set of aliases for data items; Store

You might also like