Assignment 4 - 044

Uploaded by

Amirthalakshmi Dheivasigamani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views4 pages

Assignment 4 - 044

Uploaded by

Amirthalakshmi Dheivasigamani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

DISTRIBUTED COMPUTING

CS3551

ASSIGNMENT – 4

NAME: Saraniya P
REG.NO: 310822243044
DEPT: AI&DS
YEAR: III
1. Issues in Failure Recovery:
Failure recovery in distributed systems is complex due to several challenges:
• Unpredictable Failures: Hardware or software failures can occur at any node,
and their impact may cascade across the system.
• Concurrency and Non-Determinism: Processes in distributed systems
operate concurrently and may exhibit non-deterministic behaviour, making
recovery nontrivial.
• Global State Consistency: It is difficult to ensure a consistent global state for
recovery because distributed systems lack a single point of control.
• Partial Failures: Some nodes may fail while others remain operational,
complicating coordination and recovery efforts.
• Communication Failures: Loss or delay of messages between nodes can lead
to inconsistent states and make recovery harder.
• Cost of Recovery: Recovery mechanisms like checkpointing and logging add
computational and storage overhead.

2. Algorithm for Asynchronous Checkpointing and Recovery:

Asynchronous checkpointing allows processes in a distributed system to take
checkpoints independently, avoiding the overhead of coordination.

Steps:
1. Checkpointing:
o Each process periodically saves its local state (checkpoint)
without waiting for other processes.
o Checkpoints include process state and metadata about
dependencies (e.g., sent/received messages).

2. Log Communication:
o Messages exchanged between checkpoints are logged to
ensure they can be replayed during recovery.
o Processes log messages sent and received during execution.
3. Failure Detection:
Upon detecting a failure, the system identifies a set of consistent.
checkpoints for recovery.

4. Recovery:
o Processes roll back to their latest checkpoints.
o Lost messages after the checkpoints are replayed from the logs to
ensure consistency.

Benefits:
• No need for global coordination, reducing latency.
• Suitable for systems with frequent communication or high failure
probabilities.

Drawbacks:
• Risk of cascading rollbacks if dependencies among checkpoints are not
carefully managed.
• Increased storage and communication overhead for logging.

3. Log-Based Rollback Recovery

Log-based rollback recovery is a mechanism to recover a system to a
consistent state by replaying logged events after a failure.

Key Concepts:
• Event Logging:
o Each process logs events such as message sends, receives, and state
changes.
o Logs are stored persistently to survive failures.

• Consistent Recovery Point:

A consistent global state is reconstructed by rolling back processes to
their checkpoints and replaying logs.
Types:
1. Pessimistic Logging:
o Ensures logs are committed to stable storage synchronously before
proceeding, guaranteeing no loss of information.
o Low recovery overhead but high runtime overhead.

2. Optimistic Logging:
o Allows processes to proceed without waiting for logs to be committed,
reducing runtime overhead.
o Recovery may involve complex rollbacks and replays.

3. Causal Logging:
o Ensures logs respect causal dependencies between events.
o Balances runtime performance and recovery complexity.

Steps in Log-Based Rollback Recovery:

1. Detect failure and identify affected processes.
2. Roll back each process to the latest checkpoint.
3. Replay logged events to restore the system to a consistent state.
Advantages:
• Provides precise recovery by replaying only necessary events.
• Can tolerate multiple simultaneous failures if logs are intact.

Challenges:
• Managing and storing logs efficiently in large-scale systems.
• Ensuring that logs capture all necessary events for recovery without
excessive overhead.

Flight Data Recorder (SSFDR)
100% (2)
Flight Data Recorder (SSFDR)
16 pages
1904050001
No ratings yet
1904050001
119 pages
System Recovery
No ratings yet
System Recovery
38 pages
unit 4
No ratings yet
unit 4
94 pages
Unit 4 - DSRM
No ratings yet
Unit 4 - DSRM
5 pages
Unit-3 Part2
No ratings yet
Unit-3 Part2
74 pages
Dc-3551 Unit IV Notes
No ratings yet
Dc-3551 Unit IV Notes
32 pages
4th Unit Topics Recovery
No ratings yet
4th Unit Topics Recovery
73 pages
CS8603 U.iv
No ratings yet
CS8603 U.iv
33 pages
Unit 4 Part 3
No ratings yet
Unit 4 Part 3
21 pages
DC Unit 4 Important
No ratings yet
DC Unit 4 Important
6 pages
Unit 4 Part 2
No ratings yet
Unit 4 Part 2
21 pages
Module 4 - Distributed Shared Memory and Failure Recovery - Sreerag Sanilkumar
No ratings yet
Module 4 - Distributed Shared Memory and Failure Recovery - Sreerag Sanilkumar
14 pages
u4p6
No ratings yet
u4p6
10 pages
Lm2-Rollback & Recovery
No ratings yet
Lm2-Rollback & Recovery
34 pages
Failure Recovery in Distributed Systems
No ratings yet
Failure Recovery in Distributed Systems
24 pages
CheckpointingRecovery ds14
No ratings yet
CheckpointingRecovery ds14
35 pages
CS8603 U.iv
No ratings yet
CS8603 U.iv
33 pages
DC Unit4
No ratings yet
DC Unit4
32 pages
Fault Tolerance:-: Introduction, Process Resilience, Distributed Commit, Recovery
No ratings yet
Fault Tolerance:-: Introduction, Process Resilience, Distributed Commit, Recovery
52 pages
Session 33
No ratings yet
Session 33
4 pages
Checkpointing and Rollback Recovery For Distributed Systems 5cvcuy5txm
No ratings yet
Checkpointing and Rollback Recovery For Distributed Systems 5cvcuy5txm
23 pages
DC UNIT4
No ratings yet
DC UNIT4
33 pages
Unit Iv Recovery
No ratings yet
Unit Iv Recovery
27 pages
DS NOTES Unit 4 PDF
No ratings yet
DS NOTES Unit 4 PDF
36 pages
Distributed-Computing-Module-4-Important-Topics-PYQs
No ratings yet
Distributed-Computing-Module-4-Important-Topics-PYQs
23 pages
Module 4
No ratings yet
Module 4
59 pages
module4_distributed
No ratings yet
module4_distributed
6 pages
Distributed Failure Recovery
No ratings yet
Distributed Failure Recovery
30 pages
CS 194: Distributed Systems
No ratings yet
CS 194: Distributed Systems
15 pages
Unit IV 2 Marks With Answer
No ratings yet
Unit IV 2 Marks With Answer
2 pages
Unit 4_Deadlock Handling & Recovery Techniques & Failuere Classification
No ratings yet
Unit 4_Deadlock Handling & Recovery Techniques & Failuere Classification
55 pages
DU3 1
No ratings yet
DU3 1
54 pages
DS unit_4
No ratings yet
DS unit_4
20 pages
DistributedComputing(University) PartA
No ratings yet
DistributedComputing(University) PartA
19 pages
Concurrent Checkpointing and Recovery in Distributed Systems
No ratings yet
Concurrent Checkpointing and Recovery in Distributed Systems
61 pages
c1cc1cde-bdda-41e7-92a0-5453e98d0676
No ratings yet
c1cc1cde-bdda-41e7-92a0-5453e98d0676
5 pages
Consensus
No ratings yet
Consensus
77 pages
Unit 4 Answer Key
No ratings yet
Unit 4 Answer Key
24 pages
15-440 Distributed Systems: Fault Tolerance, Logging and Recovery Thursday Oct 8, 2015
No ratings yet
15-440 Distributed Systems: Fault Tolerance, Logging and Recovery Thursday Oct 8, 2015
30 pages
DC - Unit IV
No ratings yet
DC - Unit IV
36 pages
Presentation On Consistent Checkpoints & Recovery in Distributed System
100% (1)
Presentation On Consistent Checkpoints & Recovery in Distributed System
26 pages
Lm3 Checkpointing Algorithm
No ratings yet
Lm3 Checkpointing Algorithm
40 pages
Session 32
No ratings yet
Session 32
3 pages
Unit 4 Part 3
No ratings yet
Unit 4 Part 3
33 pages
Chapter 8-Fault Tolerance
No ratings yet
Chapter 8-Fault Tolerance
30 pages
16_issues in Failure Recovery
No ratings yet
16_issues in Failure Recovery
5 pages
Ds chapter 7 (2)
No ratings yet
Ds chapter 7 (2)
21 pages
Distributed Systems - Fault Tolerance
No ratings yet
Distributed Systems - Fault Tolerance
21 pages
4.1.5. Log based roll back Recovery-1
No ratings yet
4.1.5. Log based roll back Recovery-1
12 pages
Coordinated Checkpoint Versus Message Log For Fault Tolerant MPI
No ratings yet
Coordinated Checkpoint Versus Message Log For Fault Tolerant MPI
27 pages
Distributed Systems As DS DS
No ratings yet
Distributed Systems As DS DS
7 pages
Message Passing Synchronous & Asynchronous
No ratings yet
Message Passing Synchronous & Asynchronous
11 pages
Recovery DC
No ratings yet
Recovery DC
6 pages
Chapter 8-Fault Tolerance
100% (1)
Chapter 8-Fault Tolerance
71 pages
Possible Types of Failure
No ratings yet
Possible Types of Failure
16 pages
Chapter 8 Fault Tolerance
No ratings yet
Chapter 8 Fault Tolerance
20 pages
DS UNIT-3 NOTES
No ratings yet
DS UNIT-3 NOTES
35 pages
Rohini 836843492
No ratings yet
Rohini 836843492
3 pages
Distributed Sys Lab Manual
No ratings yet
Distributed Sys Lab Manual
25 pages
Kafka Developer Certified: The Essential Guide
From Everand
Kafka Developer Certified: The Essential Guide
SUJAN
No ratings yet
24 Input Devices of A Computer
100% (1)
24 Input Devices of A Computer
51 pages
CFCBinsDownloader Log
No ratings yet
CFCBinsDownloader Log
3 pages
Bold Italic Underline in .NET RichTextBox (Part 1 Visual Basic)
0% (1)
Bold Italic Underline in .NET RichTextBox (Part 1 Visual Basic)
7 pages
WebSphere® Development Studio ILE RPG Reference Summary
No ratings yet
WebSphere® Development Studio ILE RPG Reference Summary
78 pages
Linux Installation On Virtual Machine (Includes Screenshots)
No ratings yet
Linux Installation On Virtual Machine (Includes Screenshots)
18 pages
D1-Practice-Exercise-12 Binary Search Tree
No ratings yet
D1-Practice-Exercise-12 Binary Search Tree
14 pages
The Good Parts of AWS
No ratings yet
The Good Parts of AWS
176 pages
SCT_UNIT-5
No ratings yet
SCT_UNIT-5
24 pages
Punycode Es6
No ratings yet
Punycode Es6
8 pages
Two Level Predictor
No ratings yet
Two Level Predictor
11 pages
Amazon EC2 Autoscaling
No ratings yet
Amazon EC2 Autoscaling
212 pages
25L3205
No ratings yet
25L3205
46 pages
Comp1L Lec1c
No ratings yet
Comp1L Lec1c
81 pages
Class XLL NIOS Data Entry L1
No ratings yet
Class XLL NIOS Data Entry L1
4 pages
Cheat Sheet
100% (3)
Cheat Sheet
3 pages
Week008-Microprocessor Systems Assessement 2
100% (1)
Week008-Microprocessor Systems Assessement 2
15 pages
SINAMICS Startdrive V16 Supported Drives and Functions en
No ratings yet
SINAMICS Startdrive V16 Supported Drives and Functions en
9 pages
Strategic Analysis of Apple Inc.: Brian Masi
No ratings yet
Strategic Analysis of Apple Inc.: Brian Masi
35 pages
اتاق چت چت ایرانی خانه آنلاین شما چت روم فارسی Iranchati.20140522.214832
No ratings yet
اتاق چت چت ایرانی خانه آنلاین شما چت روم فارسی Iranchati.20140522.214832
1 page
Pic18f45k50 PDF
No ratings yet
Pic18f45k50 PDF
508 pages
Devops Interview Questions With Answers 1
No ratings yet
Devops Interview Questions With Answers 1
12 pages
Dual-Booting Windows Vista and GNU/Linux On The Dell Latitude E4300
No ratings yet
Dual-Booting Windows Vista and GNU/Linux On The Dell Latitude E4300
13 pages
NAT Theory
No ratings yet
NAT Theory
2 pages
File Handling
No ratings yet
File Handling
3 pages
Industrial Training Edited
No ratings yet
Industrial Training Edited
30 pages
Actix One V211 To V30 Upgrade Manual USAEnglish Edn 4
No ratings yet
Actix One V211 To V30 Upgrade Manual USAEnglish Edn 4
24 pages
NCOM31203 - Introduction To Computer Programming - Mid Semester Examinations
No ratings yet
NCOM31203 - Introduction To Computer Programming - Mid Semester Examinations
7 pages
University Online Course Registration System
100% (3)
University Online Course Registration System
25 pages
Hack Terms & Definitions
No ratings yet
Hack Terms & Definitions
6 pages

Assignment 4 - 044

Uploaded by

Assignment 4 - 044

Uploaded by

DISTRIBUTED COMPUTING

2. Algorithm for Asynchronous Checkpointing and Recovery:

3. Log-Based Rollback Recovery

• Consistent Recovery Point:

Steps in Log-Based Rollback Recovery:

You might also like