Module 9 - Business Continuity

0.0(0)

Studied by 0 people

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/37

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

38 Terms

New cards

the process of preparing for and recovering from system outages to keep operations running smoothly

Ensure continuous access to data and services
Use proactive/reactive measures
Automation to reduce downtime and manual effort
Main goal is to maximize availability of apps and info
Ex: if a natural disaster caused a power outage, a good BC plan ensures power kicks in or shifts operations to another data center to keep services running

New cards

Information Availability

the ability of an IT infrastructure to function according to business requirements and customer expectations, during its specified time of operation.

New cards

IA can be defined in terms of:

accessibility, reliability, and timelines

New cards

accessibility

information should be accessible to the right user when required

New cards

reliability

information should be reliable and correct in all aspects. I tis the same as what was stored and there is no alteration or corruption to the information

New cards

Timelines

defines the time window during which information must be accessible

New cards

Causes of information Unavailability

Application failure
Data center outage
Refreshing IT infrastructure
Infrastructure Component Failure
Data Loss

New cards

planned outages

such as installation and upgrades, facility constructions

New cards

unplanned outages

such as human error or natural disaster

New cards

impact of information unavailability

lost productivity, lost revenue, financial performance, other expenses, and damaged reputation

New cards

MTBF

mean time between failure

average time for a system to perform its normal operations until another failure

total uptime/number of failures

New cards

MTTR

average time is takes to repair a failed component

total downtime/number of failures

New cards

IA formula

IA = MTBF / (MTBF + MTTR) or IA = Uptime / (Uptime + Downtime)

New cards

RPO

Recovery Point Objective

point in time to which data must be recovered

New cards

RTO

Recovery Time Objective

Time within which systems and applications must be recovered

New cards

units of RPO and RTO

both are counted in units of time

New cards

usually the lower the RTO and RPO

the higher is the cost of a BC solution

New cards

Disaster Recovery

involved a set of policies for restoring IT infrastructure

New cards

the fundamental principle of DR

maintain a secondary data center or site called a DR site which should be located in a different geographical region

New cards

BC Technology Solutions

implementing FT mechanisms

deploying data protection solutions

automatic failover mechanisms

architecting resilient modern applications

New cards

Fault Tolerance

the ability of an IT system to continue working in the event of a failure

New cards

Key Requirements for FT

fault isolation and eliminating SPOF

New cards

Fault Isolation

contains the scope of a fault so that the other areas of a system are not impacted by the fault.

New cards

Single Point of Failure

refers to any individual or aspect of an infrastructure whose failure can make the entire system or service unavailable

New cards

How to eliminate SPOF

implement redundancy at component level: compute network storage

implement multiple availability zones

New cards

Network FT Mechanisms

link aggregation

NIC teaming

Multipathing

Elastic Load Balancing

New cards

link aggregation

combines link between 2 switches/nodes to enable network traffic failover

New cards

NIC teaming:

groups NIC so that they appear as a single logical NIC to the operating system or hypervisor

New cards

Multipathing

enabling a compute system to use multiple paths for transferring data to a LUN

New cards

Elastic Load Balancing

detecting the unhealthy VM instances and automatically redirects the I/Os to other healthy VM instances.

New cards

Storage Fault Tolerance Mechanisms

RAID

Erasure coding

Dynamic Disk Sparing

Cache protection

New cards

Erasure coding

provide space optimal data redundancy to prevent data loss against multiple disk drive failures
- Set of n disk divided into m disk to hold data and k disks to hold coding infos

New cards

Dynamic Disk Sparing:

hot spare

New cards

Cache protection

- mirroring

New cards

Erasure Coding

Provides space-optimal data redundancy to protect data loss against multiple drive failure

New cards

Cache Protection - Mirroring

Each write to cache is held in two different memory locations on two independent memory cards. • If a cache failure occurs, the write data will still be safe in the mirrored location and can be committed to the storage drive

New cards

FT at Site-Level - Availability Zones

Availability zone is a location with its own set of resources and isolated from other zones

New cards

Dell EMC PowerPath

a family of software products that ensures consistent application availability and performance across I/O paths on physical and virtual platforms. • PowerPath provides automated path management and tools that enable you to satisfy aggressive service-level agreements without investing in additional infrastructure