Definition: The structured process of managing the transition of large software systems from development to implementation, including tracking changes during development and operations.
Key Aspects:
Ensures controlled and documented changes.
Minimizes disruptions during transitions.
Definition: The application of processes to manage changes to both software and hardware configurations.
Key Aspects:
Tracks system configurations.
Ensures consistency across environments.
Definition: Keeping a business operational during and after disruptive events through advanced planning.
Key Components:
Disaster Recovery Plan (DRP): Focuses on recovery after a disaster, prioritizing human safety and critical systems.
Business Impact Analysis (BIA):
Documents the impact of disruptions on operations.
Identifies critical systems and prioritizes backups/restoration.
Single Points of Failure: Systems relying on one component (e.g., routers, power supplies) must be mitigated.
Key Considerations:
Frequency: How often backups occur.
Extent: What data/applications are backed up.
Storage: Where backups are kept (geographic diversity recommended).
Retention: How long backups are stored (follow the "Rule of Three"—rotate backups).
Responsibility: Who ensures backups are created/maintained.
Full Backup: Copies all files/software.
Differential Backup: Copies files changed since the last full backup.
Incremental (Delta) Backup: Backs up only new/changed data since the last backup.
Snapshots: Point-in-time copies of virtual machines (VMs).
Long-Term Storage: Magnetic media degrades; rotate/update backups.
Encryption: Ensure multiple personnel know decryption keys.
Legal/Compliance: Data sovereignty laws may require backups in specific countries.
Site Type | Cost | Recovery Speed | Complexity |
---|---|---|---|
Hot Site | High | Immediate/Few Hours | Low |
Warm Site | Moderate | Days | Moderate |
Cold Site | Low | Weeks | High |
Mutual Aid Agreement: Organizations agree to support each other (risky if both are hit).
Dependencies: Systems required for others to function.
Critical Infrastructure: Core business systems.
Short Outages: Use UPS (Uninterruptible Power Supply).
Extended Outages: Deploy backup generators.
Ensures access to critical data/files remotely during disruptions.
Defines which operations must continue during disruptions.
Key Elements:
Recovery Time Objective (RTO): Target time to resume operations (shorter = higher cost).
Recovery Point Objective (RPO): Maximum acceptable data loss (dictates backup frequency).
Testing: Regularly rehearse the plan.
Configuration Identification: Tagging assets (e.g., hardware/software) as Configuration Items (CIs).
Baseline: A stable reference point for comparison.
Configuration Status Accounting: Tracking CI changes.
Configuration Auditing: Verifying compliance with policies.
Change Control Board (CCB): Approves/reviews changes.
System Problem Report (SPR): Tracks change requests.
Backout Plan: Reverts changes if issues arise.
Levels:
Initial: Chaotic processes.
Managed: Planned processes.
Defined: Standardized processes.
Quantitatively Managed: Metrics-driven.
Optimizing: Continuous improvement.
Development: Where code is written.
Test: Mirrors production for validation.
Staging: Optional; tests partial deployments.
Production: Live systems with real data.
Secure Baseline: Systems with patches/settings applied.
Sandboxing: Isolates systems to prevent issues from spreading.
Integrity Measurement: Detects unauthorized changes.
Plan Ahead: BIA and DRP are critical.
Test Backups/DRP: Ensure they work when needed.
Avoid Single Points of Failure: Redundancy is key.
Document Everything: Changes, configurations, and recovery steps.