QUERY PROCESSING IN
DISTRIBUTED DATABASE SYSTEMS
Presented by:
Muskaan
MCA/25020/18
OUTLINE
• What is a Query?
• What is a Query Processor?
• Main Problems of Query Processing
• Characteristics of a Query Processor
• Main layers of Query Processing
What is a Query?
• A query is a statement requesting the retrieval of information.
• A database query can be either a select query or an action query.
• A select query is a data retrieval query, while an action query asks for additional operations on the data, such as insertion, updating, or deletion.
What is a Query Processor?
• The query processor in a DBMS receives a query as input, parses it, generates an execution plan, and completes the processing by executing the plan and returning the results to the client.
• In a relational database, users perform data processing and data manipulation with the help of a high-level non-procedural language (e.g. SQL).
• The main function of a query processor is to transform a high-level query (also called a calculus query) into an equivalent lower-level query (also called an algebraic query).
• The high-level query hides the low-level details of the physical organization of the data from the user, so that even complex queries can be expressed in an easy, concise, and simple fashion.
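To illustrate the step from a calculus query to an algebraic query, here is a minimal Python sketch. It is not how any particular DBMS is implemented. The relations EMP and ASG, their attributes, and the helper functions select, project, and join are all assumptions made for this example.

```python
# Hypothetical relations, each a list of dict rows (names are illustrative only).
EMP = [
    {"eno": 1, "ename": "Asha"},
    {"eno": 2, "ename": "Ravi"},
    {"eno": 3, "ename": "Meena"},
]
ASG = [
    {"eno": 1, "pno": "P1", "dur": 40},
    {"eno": 2, "pno": "P2", "dur": 12},
    {"eno": 3, "pno": "P1", "dur": 6},
]

def select(rows, predicate):
    """Relational selection: keep the rows that satisfy the predicate."""
    return [r for r in rows if predicate(r)]

def project(rows, attrs):
    """Relational projection: keep only the named attributes, dropping duplicates."""
    seen, out = set(), []
    for r in rows:
        key = tuple(r[a] for a in attrs)
        if key not in seen:
            seen.add(key)
            out.append(dict(zip(attrs, key)))
    return out

def join(left, right, attr):
    """Natural join on a single shared attribute."""
    return [{**l, **r} for l in left for r in right if l[attr] == r[attr]]

# High-level (calculus-style) query, stating only WHAT is wanted:
#   SELECT ename FROM EMP, ASG WHERE EMP.eno = ASG.eno AND dur > 10
# One equivalent lower-level (algebraic) query, stating HOW to compute it:
#   project[ename]( EMP join[eno] select[dur > 10](ASG) )
result = project(join(EMP, select(ASG, lambda r: r["dur"] > 10), "eno"), ["ename"])
print(result)
```

The selection is applied to ASG before the join, which is one of several equivalent algebraic forms; choosing among such forms is the optimization problem discussed next.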
Main Problems of Query Processing
• The main problem of query processing is query optimization.
• Optimization is a time-consuming task, because many candidate execution strategies must be considered to find one that minimizes computer resource consumption.
• The time and space required to process the query are also important factors in the performance of query processing.
Important Characteristics of a Query Processor
• Language
• Types of Optimization
• Optimization Timing
• Statistics
Language:
• The input language of the query processor can be based on relational calculus or relational algebra.
Types of Optimization:
• Among all possible strategies for executing a query, the one that requires the least time and space is the best solution.
Optimization Timing:
• The time spent optimizing a query is itself an important factor. The less time optimization takes, the better for query processing as a whole.
Statistics:
• The effectiveness of query optimization relies on statistical information about the database, for example to estimate how many fragments a query will need and which operation should be done first.
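The role of statistics in choosing a strategy can be sketched as follows. This is a simplified illustration, not a real optimizer: the cardinalities in CARD, the join selectivity factors in SELECTIVITY, and the cost measure (the sum of estimated intermediate result sizes) are all assumptions chosen for the example.

```python
from itertools import permutations

# Hypothetical statistics: relation cardinalities and join selectivity factors.
CARD = {"EMP": 400, "ASG": 1000, "PROJ": 50}
SELECTIVITY = {
    frozenset(["EMP", "ASG"]): 1 / 400,
    frozenset(["ASG", "PROJ"]): 1 / 50,
}

def plan_cost(order):
    """Sum of estimated intermediate result sizes for a left-deep join order."""
    joined = [order[0]]
    size = CARD[order[0]]
    cost = 0.0
    for rel in order[1:]:
        # Combined selectivity against everything joined so far;
        # a missing entry means no join predicate, i.e. a Cartesian product.
        sel = 1.0
        for j in joined:
            sel *= SELECTIVITY.get(frozenset([j, rel]), 1.0)
        size = size * CARD[rel] * sel
        cost += size
        joined.append(rel)
    return cost

# Exhaustive search: enumerate every join order and keep the cheapest one.
best = min(permutations(CARD), key=plan_cost)
print(best, plan_cost(best))
```

Orders that start by combining EMP and PROJ, which share no join predicate here, produce a large Cartesian product and are rejected. Exhaustive enumeration is only practical for a handful of relations, which is one reason optimization time matters.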
Main layers of Query Processing
Query processing involves four main layers:
• Query Decomposition
• Data Localization
• Global Query Optimization
• Distributed Execution
Fig. Generic Layering Scheme for Distributed Query Processing
Calculus Query on Global Relations
→ Query Decomposition (uses the Global Schema)
→ Algebraic Query on Global Relations
→ Data Localization (uses the Fragment Schema)
→ Algebraic Query on Fragments
→ Global Optimization (uses the Allocation Schema)
→ Distributed Query Execution Plan
→ Distributed Execution
The first three layers run at the control site; distributed execution runs at the local sites.
Query Decomposition
• The first layer decomposes the calculus query into an algebraic query on global relations.
• Query decomposition can be viewed as four successive steps: 1) Normalization, 2) Analysis, 3) Elimination of redundancy, and 4) Rewriting.
Normalization:
• First, the calculus query is rewritten in a normalized form that is suitable for manipulation.
• Its main objective is to transform the query qualification (the WHERE clause predicate) into a normal form, either conjunctive or disjunctive, using the equivalence rules of the logical operators. This is distinct from schema normalization, which organizes tables to avoid update anomalies.
Analysis:
• Second, the normalized query is analysed so that incorrect queries are detected and rejected as early as possible.
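As a sketch of what rewriting a qualification into a normalized form can look like, the following Python function converts a predicate into conjunctive normal form. It assumes the usual textbook reading of query normalization. The tuple encoding of predicates and the function name to_cnf are hypothetical choices made for this illustration.

```python
# Predicates are nested tuples (an encoding assumed for this example):
#   ("atom", "dur > 10"), ("not", p), ("and", p, q), ("or", p, q)

def to_cnf(p):
    """Rewrite a predicate into conjunctive normal form (an AND of ORs)."""
    op = p[0]
    if op == "atom":
        return p
    if op == "not":
        q = p[1]
        if q[0] == "atom":
            return p
        if q[0] == "not":
            # Double negation: NOT NOT x is x.
            return to_cnf(q[1])
        # De Morgan: push the negation inside and swap AND with OR.
        dual = "or" if q[0] == "and" else "and"
        return to_cnf((dual, ("not", q[1]), ("not", q[2])))
    a, b = to_cnf(p[1]), to_cnf(p[2])
    if op == "and":
        return ("and", a, b)
    # op == "or": distribute OR over AND on either side.
    if a[0] == "and":
        return ("and", to_cnf(("or", a[1], b)), to_cnf(("or", a[2], b)))
    if b[0] == "and":
        return ("and", to_cnf(("or", a, b[1])), to_cnf(("or", a, b[2])))
    return ("or", a, b)

A = ("atom", "title = 'Engineer'")
B = ("atom", "dur > 10")
C = ("atom", "pno = 'P1'")

# (A AND B) OR C  becomes  (A OR C) AND (B OR C)
normalized = to_cnf(("or", ("and", A, B), C))
print(normalized)
```

Conjunctive normal form can grow much larger than the original predicate when ORs are distributed over many ANDs, so this sketch is only meant for small qualifications.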
Elimination of Redundancy:
• Third, the correct query is simplified. One way to simplify a query is to eliminate redundancy.
Rewriting:
• Fourth, the calculus query is restructured as an algebraic query. Several algebraic queries can be derived from the same calculus query, and some algebraic queries are "better" than others.
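Redundancy elimination can be illustrated with the idempotency rules of Boolean logic (for example p AND p = p, and p OR NOT p = true). The sketch below is a simplified assumption-laden example: it uses a hypothetical tuple encoding of predicates, compares sub-predicates only syntactically, and assumes two-valued logic, so it ignores the effect of SQL NULLs.

```python
# Predicates are nested tuples (an encoding assumed for this example):
#   ("atom", text), ("const", bool), ("not", p), ("and", p, q), ("or", p, q)
TRUE, FALSE = ("const", True), ("const", False)

def is_negation(p, q):
    """True if q is syntactically NOT p."""
    return q == ("not", p)

def simplify(p):
    """Apply idempotency rules bottom-up to remove redundant sub-predicates."""
    op = p[0]
    if op in ("atom", "const"):
        return p
    if op == "not":
        inner = simplify(p[1])
        if inner == TRUE:
            return FALSE
        if inner == FALSE:
            return TRUE
        return ("not", inner)
    a, b = simplify(p[1]), simplify(p[2])
    # For AND the identity is true and the absorbing value is false; OR is the reverse.
    identity, absorbing = (TRUE, FALSE) if op == "and" else (FALSE, TRUE)
    if a == b:
        return a                      # p AND p = p ; p OR p = p
    if a == absorbing or b == absorbing:
        return absorbing              # p AND false = false ; p OR true = true
    if a == identity:
        return b                      # true AND p = p ; false OR p = p
    if b == identity:
        return a
    if is_negation(a, b) or is_negation(b, a):
        return absorbing              # p AND NOT p = false ; p OR NOT p = true
    return (op, a, b)

A = ("atom", "title = 'Engineer'")
B = ("atom", "dur > 10")

# (A OR A) AND (B OR NOT B) simplifies to A
simplified = simplify(("and", ("or", A, A), ("or", B, ("not", B))))
print(simplified)
```

A real query processor would also use semantic knowledge such as integrity constraints, which this sketch does not attempt.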
Localization of Distributed Data
• The output of the first layer, an algebraic query on global relations, is the input to the second layer.
• The main role of this layer is to localize the query's data using data distribution information.
• Relations are fragmented into disjoint subsets, called fragments, and each fragment is stored at a different site. Localization replaces each global relation in the query with the operations that reconstruct it from its fragments.
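A minimal sketch of localization, assuming a horizontally fragmented relation. The fragment names, sites, key ranges, and rows below are invented for illustration. The global relation is reconstructed as the union of its fragments, and a selection on the fragmentation key lets the processor skip fragments that cannot contain matching rows.

```python
# Hypothetical horizontal fragmentation of a global relation EMP by ranges of eno.
# Each fragment records its site, its defining range [low, high), and its rows.
FRAGMENTS = [
    {"name": "EMP1", "site": "S1", "low": 0, "high": 100,
     "rows": [{"eno": 7, "ename": "Asha"}]},
    {"name": "EMP2", "site": "S2", "low": 100, "high": 200,
     "rows": [{"eno": 150, "ename": "Ravi"}]},
    {"name": "EMP3", "site": "S3", "low": 200, "high": 300,
     "rows": [{"eno": 250, "ename": "Meena"}]},
]

def localize(fragments):
    """Reconstruct the relation as the union of the given fragments."""
    return [row for f in fragments for row in f["rows"]]

def relevant_fragments(fragments, eno):
    """Reduction: keep only fragments whose defining range can contain eno."""
    return [f for f in fragments if f["low"] <= eno < f["high"]]

def select_by_eno(eno):
    """Answer 'SELECT * FROM EMP WHERE eno = ?' against the fragments."""
    needed = relevant_fragments(FRAGMENTS, eno)
    rows = [r for r in localize(needed) if r["eno"] == eno]
    return [f["name"] for f in needed], rows

print(select_by_eno(150))
```

Only EMP2 is touched for eno = 150, so the other two sites do no work for this query.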
Global Query Optimization
• The input to the third layer is an algebraic query on fragments.
• The goal of this layer is to find an execution strategy for the algebraic fragment query that is close to optimal.
• The previous layers have already optimized the query to some extent, for example by eliminating redundancies.
• Query optimization consists of:
i) Finding the best ordering of operations in the query,
ii) Finding the communication operations which minimize a cost function.
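The idea of choosing communication operations that minimize a cost function can be sketched as below. This is a deliberately small model: the Fragment class, the linear transfer_cost function, and its default constants are assumptions for this example, and the model ignores local processing cost and the cost of shipping the join result.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Fragment:
    name: str
    site: str
    size_bytes: int

def transfer_cost(size_bytes, per_message=100, per_byte=1):
    """Hypothetical linear cost: fixed message overhead plus a per-byte charge."""
    return per_message + per_byte * size_bytes

def cheapest_join_site(left, right):
    """Compare the two 'ship one operand whole' strategies for a join."""
    if left.site == right.site:
        return left.site, 0           # both operands are local, nothing to ship
    strategies = {
        right.site: transfer_cost(left.size_bytes),   # ship left to right's site
        left.site: transfer_cost(right.size_bytes),   # ship right to left's site
    }
    site = min(strategies, key=strategies.get)
    return site, strategies[site]

emp1 = Fragment("EMP1", "S1", 10_000)
asg1 = Fragment("ASG1", "S2", 500_000)
print(cheapest_join_site(emp1, asg1))
```

Under this model the smaller operand is always the one shipped; richer strategies such as semijoins would need the statistics discussed earlier to estimate intermediate sizes.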
Distributed Execution
• The last layer is performed by all the sites having fragments involved in the query.
• Each subquery, called a local query, is executed at one site. It is first optimized using the local schema of that site.
THANK YOU