3 Software Fault Tolerance

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/14

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

15 Terms

1
New cards

replication checks

compare outputs of matching modules

2
New cards

Timing checks

use of timers to check timing constraints

3
New cards

Reversal checks

reverse output and check against inputs

4
New cards

Coding checks

parity, Hamming codes, etc

5
New cards

Reasonable checks

use semantic properties of data

6
New cards

Structural checks

redundancy in data structures

7
New cards

Validity checks

divide by 0, check array bounds, overflow

8
New cards

Interface exception

Invalid service request detected by interface detectors and corrected by service requestor

9
New cards

Local / internal exception

Problems with own internal operations detected by local detectors and corrected by local correctors

10
New cards

Failure exception

When internal errors propagate to interface, and detected by interface detectors. Global correction may be needed, e.g., look for another server

11
New cards

Forward error recovery

Upon detection of error, program attempts to get into a state which is no longer erroneous (we will see how recovery blocks enable this)

12
New cards

Backward error recovery

Upon detection of error, program rolls back to a previously “recorded” good point (we will see how checkpoints enable this) from which it can restart executing

13
New cards

Recovery Line

A recovery line is a set of checkpoints across all processes to which the programs can be rolled back, in the event of failure, to ensure consistent error-free state of the system

OK to checkpoint after a message send
Not ok to checkpoint prior to a message receive (need to process pending messages first)

<p><span>A recovery line is a set of checkpoints across all processes to which the programs can be rolled back, in the event of failure, to ensure consistent error-free state of the system</span></p><p><span>OK to checkpoint after a message send</span><br><span>Not ok to checkpoint prior to a message receive (need to process pending messages first)</span></p>
14
New cards

A recovery line is said to be consistent if

there are no messages that originate (temporally speaking) after the line and terminate before it

In other words, along the recovery line, there can not be a receive without a corresponding send the reverse is allowed, because temporally speaking, receive will occur after send

15
New cards