Design issues and challenges

The document discusses the design issues and challenges in distributed systems, categorizing them into system design, algorithm design, and emerging technology issues. Key challenges include communication, synchronization, fault tolerance, security, and scalability, as well as algorithmic challenges like dynamic distributed graph algorithms and debugging. Additionally, it highlights applications of distributed computing, such as mobile systems, sensor networks, and peer-to-peer computing, each with their own unique challenges.

Uploaded by

nivethitha0264

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views

Design issues and challenges

Uploaded by

nivethitha0264

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

UNIT - I

DESIGN ISSUES AND CHALLENGES IN DISTRIBUTED SYSTEMS

The design of distributed systems has numerous challenges. They can be categorized
into:
 Issues related to system and operating systems design
 Issues related to algorithm design
 Issues arising due to emerging technologies

The above three classes are not mutually

exclusive.

Issues related to system and operating systems design

The following are some of the common challenges to be addressed in designing a
distributed system from system perspective:
 Communication: This task involves designing suitable communication
mechanisms among the various processes in the networks.
Examples: RPC, RMI
 Processes: The main challenges involved are: process and thread management at both
client and server environments, migration of code between systems, design of software and
mobile agents.
 Naming: Devising easy to use and robust schemes for names, identifiers, and addresses
is essential for locating resources and processes in a transparent and scalable manner. The
remote and highly varied geographical locations make this task difficult.
 Synchronization: Mutual exclusion, leader election, deploying physical clocks, global
state recording are some synchronization mechanisms.
 Data storage and access Schemes: Designing file systems for easy and efficient data
storage with implicit accessing mechanism is very much essential for distributed operation
 Consistency and replication: The notion of Distributed systems goes hand in hand with
replication of data, to provide high degree of scalability. The replicas should be handed with
care since data consistency is prime issue.
 Fault tolerance: This requires maintenance of fail proof links, nodes, and processes.
Some of the common fault tolerant techniques are resilience, reliable communication,
distributed commit, check pointing and recovery, agreement and consensus, failure detection,
and self-stabilization.
 Security: Cryptography, secure channels, access control, key management – generation
and distribution, authorization, and secure group management are some of the security measure
that is imposed on distributed systems.

 Applications Programming Interface (API) and transparency: The user friendliness

and ease of use is very important to make the distributed services to be used by wide
community. Transparency, which is hiding inner implementation policy from users, is of the
following types:
 Access transparency: hides differences in data representation
 Location transparency: hides differences in locations y providing uniform access to
data located at remote locations.
 Migration transparency: allows relocating resources without changing names.
 Replication transparency: Makes the user unaware whether he is working on original or
replicated data.
 Concurrency transparency: Masks the concurrent use of shared resources for the user.
 Failure transparency: system being reliable and fault-tolerant.
 Scalability and modularity: The algorithms, data and services must be as distributed as
possible. Various techniques such as replication, aching and cache management, and
asynchronous processing help to achieve scalability.

Algorithmic challenges in distributed computing

 Designing useful execution models and frameworks
The interleaving model, partial order model, input/output automata model and the Temporal
Logic of Actions (TLA) are some examples of models that provide different degrees of
infrastructure.
 Dynamic distributed graph algorithms and distributed routing algorithms
 The distributed system is generally modeled as a distributed graph.
 Hence graph algorithms are the base for large number of higher level
communication, data dissemination, object location, and object search functions.
 These algorithms must have the capacity to deal with highly dynamic graph
characteristics. They are expected to function like routing algorithms.
 The performance of these algorithms has direct impact on user-perceived latency, data
traffic and load in the network.
 Time and global state in a distributed system
 The geographically remote resources demands the synchronization based on logical time.
 Logical time is relative and eliminates the overheads of providing physical time
for applications .Logical time can
(i) capture the logic and inter-process dependencies
(ii) track the relative progress at each process
 Maintaining the global state of the system across space involves the role of time
dimension for consistency. This can be done with extra effort in a coordinated
manner.
 Deriving appropriate measures of concurrency also involves the time dimension, as the
execution and communication speed of threads may vary a lot.
 Synchronization/coordination mechanisms
 Synchronization is essential for the distributed processes to facilitate
concurrent execution without affecting other processes.
 The synchronization mechanisms also involve resource management and
concurrency management mechanisms.
 Some techniques for providing synchronization are:
 Physical clock synchronization: Physical clocks usually diverge in the values due to
hardware limitations. Keeping them synchronized is a fundamental challenge to
common time.
 Leader election: All the processes need to agree on which process will play the role of a
distinguished process or a leader process. A leader is necessary even for many distributed
algorithms because there is often some asymmetry.
 Mutual exclusion: Access to the critical resource(s) has to be coordinated.
 Deadlock detection and resolution: This is done to avoid duplicate work, and deadlock
resolution should be coordinated to avoid unnecessary aborts of processes.
 Termination detection: cooperation among the processes to detect the specific global
state of quiescence.
 Garbage collection: Detecting garbage requires coordination among the processes.
 Group communication, multicast, and ordered message delivery
 A group is a collection of processes that share a common context and collaborate on a
common task within an application domain. Group management protocols are needed for
group communication wherein processes can join and leave groups dynamically, or fail.
 The concurrent execution of remote processes may sometimes violate the semantics and
order of the distributed program. Hence, a formal specification of the semantics of ordered
delivery need to be formulated, and then implemented.
 Monitoring distributed events and predicates
 Predicates defined on program variables that are local to different processes are used for
specifying conditions on the global system state.
 On-line algorithms for monitoring such predicates are hence important.
 An important paradigm for monitoring distributed events is that of event streaming,
wherein streams of relevant events reported from different processes are examined collectively
to detect predicates.
 The specification of such predicates uses physical or logical time relationships.
 Distributed program design and verification tools
Methodically designed and verifiably correct programs can greatly reduce the overhead of
software design, debugging, and engineering. Designing these is a big challenge.
 Debugging distributed programs
Debugging distributed programs is much harder because of the concurrency and replications.
Adequate debugging mechanisms and tools are need of the hour.
 Data replication, consistency models, and caching
 Fast access to data and other resources is important in distributed systems.
 Managing replicas and their updates faces concurrency problems.
 Placement of the replicas in the systems is also a challenge because resources
usually cannot be freely replicated.
 World Wide Web design – caching, searching, scheduling
 WWW is a commonly known distributed system.
 The issues of object replication and caching, pre fetching of objects have to be done
on WWW also.
 Object search and navigation on the web are important functions in the operation of
the web.
 Distributed shared memory abstraction
 A shared memory is easier to implement since it does not involve managing
the communication tasks.
 The communication is done by the middleware by message passing.
 The overhead of shared memory is to be dealt by the middleware technology.
 Some of the methodologies that does the task of communication in shared memory
distributed systems are:
 Wait-free algorithms: The ability of a process to complete its execution irrespective of
the actions of other processes is wait free algorithm. They control the access to shared resources
in the shared memory abstraction. They are expensive.
 Mutual exclusion: Concurrent access of processes to a shared resource or data is
executed in mutually exclusive manner. Only one process is allowed to execute the critical
section at any given time. In a distributed system, shared variables or a local kernel cannot be
used to implement mutual exclusion. Message passing is the sole means for implementing
distributed mutual exclusion.
 Register constructions: Architectures must be designed in such a way that, registers
allows concurrent access without any restrictions on the concurrency permitted.
 Reliable and fault-tolerant distributed systems
The following are some of the fault tolerant strategies:
 Consensus algorithms: Consensus algorithms allow correctly functioning processes to
reach agreement among themselves in spite of the existence of malicious processes. The goal of
the malicious processes is to prevent the correctly functioning processes from reaching
agreement. The malicious processes operate by sending messages with misleading information,
to confuse the correctly functioning processes.
 Replication and replica management: The Triple Modular Redundancy (TMR)
technique is used in software and hardware implementation. TMR is a fault-tolerant form of N-
modular redundancy, in which three systems perform a process and that result is processed by a
majority-voting system to produce a single output.
 Voting and quorum systems: Providing redundancy in the active or passive
components in the system and then performing voting based on some quorum criterion is a
classical way of dealing with fault-tolerance. Designing efficient algorithms for this purpose is
the challenge.
 Distributed databases and distributed commit: The distributed databases should also
follow atomicity, consistency, isolation and durability (ACID) properties.
 Self-stabilizing systems: All system executions have associated good(or legal) states and
bad (or illegal) states; during correct functioning, the system makes transitions among the good
states. A self-stabilizing algorithm guarantee to take the system to a good state even if a bad
state were to arise due to some error. Self-stabilizing algorithms require some in-built
redundancy to track additional variables of the state and do extra work.
 Checkpointing and recovery algorithms: Check pointing is periodically recording the
current state on secondary storage so that, in case of a failure. The entire computation is not lost
but can be recovered from one of the recently taken checkpoints. Check pointing in a distributed
environment is difficult because if the checkpoints at the different processes are not
coordinated, the local checkpoints may become useless because they are inconsistent with the
checkpoints at other processes.
 Failure detectors: The asynchronous distributed do not have a bound on the message
transmission time. This makes the message passing very difficult, since the receiver do not
know the waiting time. Failure detectors probabilistically suspect another process as having
failed and then converge on a determination of the up/down status of the suspected process.
 Load balancing: The objective of load balancing is to gain higher throughput, and
reduce the user perceived latency. Load balancing may be necessary because of a variety off
actors such as high network traffic or high request rate causing the network connection to be a
bottleneck, or high computational load. The following are some forms of load balancing:
 Data migration: The ability to move data around in the system, based on the access
pattern of the users
 Computation migration: The ability to relocate processes in order to perform a
redistribution of the workload.
 Distributed scheduling: This achieves a better turnaround time for the users by using
idle processing power in the system more efficiently.
 Real-time scheduling

Real-time scheduling becomes more challenging when a global view of the system state is
absent with more frequent on-line or dynamic changes. The message propagation delays which
are network-dependent are hard to control or predict. This is an hindrance to meet the QoS
requirements of the network.
 Performance
User perceived latency in distributed systems must be reduced. The common issues
in performance:
 Metrics: Appropriate metrics must be defined for measuring the performance
of theoretical distributed algorithms and its implementation.
 Measurement methods/tools: The distributed system is a complex entity
appropriate methodology and tools must be developed for measuring the performance
metrics.

Applications of distributed computing and newer challenges

The deployment environment of distributed systems ranges from mobile systems to cloud
storage. All the environments have their own challenges:
 Mobile systems
o Mobile systems which use wireless communication in shared broadcast medium
have issues related to physical layer such as transmission range, power, battery
power consumption, interfacing with wired internet, signal processing and
interference.
o The issues pertaining to other higher layers include routing, location management,
channel allocation, localization and position estimation, and mobility
management.
o Apart from the above mentioned common challenges, the architectural differences
of the mobile network demands varied treatment. The two architectures are:
 Base-station approach (cellular approach): The geographical region is divided into
hexagonal physical locations called cells. The powerful base station transmits signals to all
other nodes in its range
 Ad-hoc network approach: This is an infrastructure-less approach which do not have
any base station to transmit signals. Instead all the responsibility is distributed among the
mobile nodes.
 It is evident that both the approaches work in different environment with different
principles of communication. Designing a distributed system to cater the varied need is a great
challenge.
 Sensor networks
o A sensor is a processor with an electro-mechanical interface that is capable of
sensing physical parameters.
o They are low cost equipment with limited computational power and battery life.
They are designed to handle streaming data and route it to external computer
network and processes.
o They are susceptible to faults and have to reconfigure themselves.
o These features introduces a whole new set of challenges, such as position
estimation and time estimation when designing a distributed system .
 Ubiquitous or pervasive computing
o In Ubiquitous systems the processors are embedded in the environment to
perform application functions in the background.
o Examples: Intelligent devices, smart homes etc.
o They are distributed systems with recent advancements operating in wireless
environments through actuator mechanisms.
o They can be self-organizing and network-centric with limited resources.
 Peer-to-peer computing
o Peer-to-peer (P2P) computing is computing over an application layer
network where all interactions among the processors are at a same level.
o This is a form of symmetric computation against the client sever paradigm.
o They are self-organizing with or without regular structure to the network.
o Some of the key challenges include: object storage mechanisms, efficient object
lookup, and retrieval in a scalable manner; dynamic reconfiguration with nodes as
well as objects joining and leaving the network randomly; replication strategies to
expedite object search; tradeoffs between object size latency and table sizes;
anonymity, privacy, and security.
 Publish-subscribe, content distribution, and multimedia
o The users in present day require only the information of interest.
o In a dynamic environment where the information constantly fluctuates there is
great demand for
o Publish: an efficient mechanism for distributing this information
o Subscribe: an efficient mechanism to allow end users to indicate interest
in receiving specific kinds of information
o An efficient mechanism for aggregating large volumes of published
information and filtering it as per the user’s subscription filter.
o Content distribution refers to a mechanism that categorizes the information based
on parameters.
o The publish subscribe and content distribution overlap each other.
o Multimedia data introduces special issue because of its large size.
 Distributed agents
o Agents are software processes or sometimes robots that move around the system
to do specific tasks for which they are programmed.
o Agents collect and process information and can exchange such information with
other agents.
o Challenges in distributed agent systems include coordination mechanisms among
the agents, controlling the mobility of the agents, their software design and
interfaces.
 Distributed data mining
o Data mining algorithms process large amount of data to detect patterns and trends
in the data, to mine or extract useful information.
o The mining can be done by applying database and artificial intelligence
techniques to a data repository.
 Grid computing
 Grid computing is deployed to manage resources. For instance, idle CPU cycles
of machines connected to the network will be available to others.
 The challenges includes: scheduling jobs, framework for implementing quality of
service, real-time guarantees, security.
 Security in distributed systems
The challenges of security in a distributed setting include: confidentiality, authentication
and availability. This can be addressed using efficient and scalable solutions.

Patroni
100% (1)
Patroni
137 pages
SCCM Basic Troubleshooting
100% (1)
SCCM Basic Troubleshooting
11 pages
IT Infrastructure Map
100% (3)
IT Infrastructure Map
16 pages
Message Passing Synchronous & Asynchronous
No ratings yet
Message Passing Synchronous & Asynchronous
11 pages
Design and Issues and DC
No ratings yet
Design and Issues and DC
3 pages
Distributed Systems
No ratings yet
Distributed Systems
4 pages
UNIT I
No ratings yet
UNIT I
17 pages
ds2
No ratings yet
ds2
5 pages
Chapter 1 - Intro
No ratings yet
Chapter 1 - Intro
31 pages
- Unit1 aos
No ratings yet
- Unit1 aos
3 pages
Distributed Systems
No ratings yet
Distributed Systems
35 pages
Distributed Systems Notes
No ratings yet
Distributed Systems Notes
12 pages
Distributed Systems As DS DS
No ratings yet
Distributed Systems As DS DS
7 pages
PDC-2.1_Updated_Design
No ratings yet
PDC-2.1_Updated_Design
121 pages
Assignment on Distribution System
No ratings yet
Assignment on Distribution System
28 pages
Distributed Computing
No ratings yet
Distributed Computing
10 pages
Overview of Distributed Computing
No ratings yet
Overview of Distributed Computing
4 pages
Gate Exam DC
No ratings yet
Gate Exam DC
4 pages
Distributed Systems
No ratings yet
Distributed Systems
13 pages
Characteristics of Distributed System
100% (1)
Characteristics of Distributed System
27 pages
Distributed Systems
100% (1)
Distributed Systems
71 pages
Distributed ProgrammingSolutions
No ratings yet
Distributed ProgrammingSolutions
20 pages
Distributed Computing: Beakal Gizachew Assefa
No ratings yet
Distributed Computing: Beakal Gizachew Assefa
54 pages
ADVANCE OPERATING SYSTEM Short Notes
No ratings yet
ADVANCE OPERATING SYSTEM Short Notes
23 pages
Abstract On Challenges in Distributed Systems
No ratings yet
Abstract On Challenges in Distributed Systems
4 pages
CC ZG526 Course Handout
No ratings yet
CC ZG526 Course Handout
6 pages
Distributed System Assinmnet
No ratings yet
Distributed System Assinmnet
9 pages
Chapter 6 BasicsDS
No ratings yet
Chapter 6 BasicsDS
38 pages
RMCS
No ratings yet
RMCS
127 pages
University of Okara, Okara: Department of Computer Science
No ratings yet
University of Okara, Okara: Department of Computer Science
5 pages
DC UT1 QB Soln
No ratings yet
DC UT1 QB Soln
19 pages
distributed-systems-notes
No ratings yet
distributed-systems-notes
122 pages
Resumen - Cap - 2
No ratings yet
Resumen - Cap - 2
7 pages
Distributive Systems Design
No ratings yet
Distributive Systems Design
30 pages
Cs3351 DC Ques Bank
No ratings yet
Cs3351 DC Ques Bank
10 pages
Distributed Systems
No ratings yet
Distributed Systems
47 pages
DS Ia1
No ratings yet
DS Ia1
34 pages
Chapter 1-Introduction To Distributed Systems
No ratings yet
Chapter 1-Introduction To Distributed Systems
59 pages
CS407 M1 Ktunotes.in
No ratings yet
CS407 M1 Ktunotes.in
9 pages
dc_rev
No ratings yet
dc_rev
11 pages
Adobe Scan 02-Oct-2023
No ratings yet
Adobe Scan 02-Oct-2023
4 pages
Commvault Administration and Best Practices: Definitive Reference for Developers and Engineers
From Everand
Commvault Administration and Best Practices: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Distributed system TYPED NOTES
No ratings yet
Distributed system TYPED NOTES
40 pages
IntroDistribuetComputing
No ratings yet
IntroDistribuetComputing
41 pages
Distributed Systems
No ratings yet
Distributed Systems
121 pages
5ec6f859-83a0-4b48-a986-46fa87aaa36d
No ratings yet
5ec6f859-83a0-4b48-a986-46fa87aaa36d
122 pages
Distributed Systems Overview
No ratings yet
Distributed Systems Overview
8 pages
Building Secure and Reliable Network Applications
No ratings yet
Building Secure and Reliable Network Applications
4 pages
DC exam perp
No ratings yet
DC exam perp
39 pages
ds part b
No ratings yet
ds part b
30 pages
Distributed Computing - Syllabus: Course Code: Cs3551 REGULATION:2021
No ratings yet
Distributed Computing - Syllabus: Course Code: Cs3551 REGULATION:2021
7 pages
Blue-Green Deployment Engineering: Definitive Reference for Developers and Engineers
From Everand
Blue-Green Deployment Engineering: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
module_1
No ratings yet
module_1
21 pages
106106168
No ratings yet
106106168
760 pages
CS3551-Distributed computing Notes_removed (1)
No ratings yet
CS3551-Distributed computing Notes_removed (1)
32 pages
DS Syllabus Introduction (Reference)
No ratings yet
DS Syllabus Introduction (Reference)
44 pages
Chapter-1-Introduction
No ratings yet
Chapter-1-Introduction
53 pages
Chapter 1 - Characterization of Distributed Systems
No ratings yet
Chapter 1 - Characterization of Distributed Systems
20 pages
CS439-CC-2-Parallel Distributed Systems
No ratings yet
CS439-CC-2-Parallel Distributed Systems
37 pages
DC 2marks
No ratings yet
DC 2marks
21 pages
Chapter 1-Introduction (2)
No ratings yet
Chapter 1-Introduction (2)
45 pages
DS ModelQP Solution
No ratings yet
DS ModelQP Solution
44 pages
Efficient Deployment Automation with Fabric: Definitive Reference for Developers and Engineers
From Everand
Efficient Deployment Automation with Fabric: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Nidhi Koshti: (Document Title)
No ratings yet
Nidhi Koshti: (Document Title)
2 pages
Oracle Workflow Tutorial
No ratings yet
Oracle Workflow Tutorial
39 pages
Technote #Cm6 Omnitrend: How To Transfer An Omnitrend Database Into Microsoft SQL Format Using Microsoft Access
No ratings yet
Technote #Cm6 Omnitrend: How To Transfer An Omnitrend Database Into Microsoft SQL Format Using Microsoft Access
5 pages
Nas2002 Advanced-Cyber-Security LTP 1.0 6 Nas2002
No ratings yet
Nas2002 Advanced-Cyber-Security LTP 1.0 6 Nas2002
3 pages
Review of Object Orientation
No ratings yet
Review of Object Orientation
50 pages
Unix Shell Scripting Book
100% (1)
Unix Shell Scripting Book
145 pages
UNIT 1: Database Systems
No ratings yet
UNIT 1: Database Systems
25 pages
The SAP BPC Path To S4 HANA Finance What Does It Mean For You Justin McNeely PDF
No ratings yet
The SAP BPC Path To S4 HANA Finance What Does It Mean For You Justin McNeely PDF
11 pages
Mini Project
No ratings yet
Mini Project
2 pages
Utkarsh Gupta Resume
No ratings yet
Utkarsh Gupta Resume
1 page
Pega CSSA Session 02
No ratings yet
Pega CSSA Session 02
14 pages
unit 5 - part 1
No ratings yet
unit 5 - part 1
25 pages
Adding Customer Using Order Import
No ratings yet
Adding Customer Using Order Import
11 pages
Dokumen - Pub - Understanding Etl Data Pipelines For Modern Data Architectures Early Release 9781098159252
No ratings yet
Dokumen - Pub - Understanding Etl Data Pipelines For Modern Data Architectures Early Release 9781098159252
39 pages
Ariba P2X ProcessFlows PaymentRemittance en XX
No ratings yet
Ariba P2X ProcessFlows PaymentRemittance en XX
3 pages
Chapter 4 Questions and Answers 03 Nov
No ratings yet
Chapter 4 Questions and Answers 03 Nov
4 pages
Improvement of Procurement Performance A
No ratings yet
Improvement of Procurement Performance A
23 pages
Rabobank DevOps DBA Oracle
No ratings yet
Rabobank DevOps DBA Oracle
2 pages
Bitlocker Deployment Guide
No ratings yet
Bitlocker Deployment Guide
80 pages
ANSYS Remote Solve Manager Tutorials r170
No ratings yet
ANSYS Remote Solve Manager Tutorials r170
102 pages
DBMS Lab Manual
No ratings yet
DBMS Lab Manual
28 pages
Blockchain 0.1
No ratings yet
Blockchain 0.1
8 pages
QuickLearner ProblemSolvingProgramDesign
No ratings yet
QuickLearner ProblemSolvingProgramDesign
107 pages
Practical Examples On Database Management Systems
No ratings yet
Practical Examples On Database Management Systems
9 pages
Data Science & AI Certification
No ratings yet
Data Science & AI Certification
29 pages
Shubham Kumar
No ratings yet
Shubham Kumar
1 page
Tip DS On Hadoop
100% (1)
Tip DS On Hadoop
14 pages

Design issues and challenges

Uploaded by

Design issues and challenges

Uploaded by

UNIT - I

DESIGN ISSUES AND CHALLENGES IN DISTRIBUTED SYSTEMS

The above three classes are not mutually

Issues related to system and operating systems design

 Applications Programming Interface (API) and transparency: The user friendliness

Algorithmic challenges in distributed computing

Applications of distributed computing and newer challenges

You might also like