By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.
BCP (Business Continuity Planning) and DRP (Disaster Recovery Planning) are your insurance policies for when things go wrong—whether it’s a ransomware attack, a data center flood, or a misconfigured rm -rf / command. In CompTIA Security+, this is Domain 2.5 ("Explain the importance of policies, procedures, and controls").
rm -rf /
Why it matters in production: - If you ignore BCP/DRP, a single disaster can wipe out your company (e.g., Code Spaces was permanently destroyed by a DDoS attack in 2014 because they had no off-site backups). - If you do it right, you can recover from a catastrophic failure in minutes (e.g., Netflix’s "Chaos Monkey" intentionally breaks things to test DRP).
Real-world scenario: You’re a cloud engineer at a fintech startup. Your CEO just asked: "If AWS us-east-1 goes down, how long until our app is back online?" If you can’t answer in minutes, you’re failing at BCP/DRP.
RTO (Recovery Time Objective) How long until the system is back online? Production insight: If your RTO is 4 hours but your database takes 6 hours to restore, you’re screwed.
RPO (Recovery Point Objective) How much data can you afford to lose? Production insight: If your RPO is 15 minutes but your backups run hourly, you’re violating your SLA.
MTD (Maximum Tolerable Downtime) The absolute longest your business can survive without the system. Production insight: If your MTD is 24 hours but your RTO is 36 hours, you need a better plan.
Failover vs. Failback Failover = Switch to a backup system. Failback = Switch back to the primary system. Production insight: If you don’t test failback, you’ll be stuck on the backup system forever.
Hot Site vs. Cold Site vs. Warm Site
Cold Site: Just a room with power (e.g., a rented data center with no equipment). Production insight: Hot sites cost $$$, but cold sites take days to recover.
Backup Types
Differential Backup: Only changes since the last full backup. Production insight: Incremental backups are fast but slow to restore (you need the full backup + every incremental since then).
Redundancy (N+1, 2N, 2N+1)
2N+1: Double + one extra (e.g., 3 servers for 1 workload). Production insight: 2N is expensive but ensures no single point of failure.
Tabletop Exercise A simulated disaster to test your BCP/DRP. Production insight: If you don’t run these, your plan will fail when it matters.
Prerequisites: - AWS account with admin IAM permissions. - A running EC2 instance (or RDS database) to back up.
aws backup create-backup-vault --backup-vault-name "DRP-Vault" --region us-east-1
Verify:
aws backup list-backup-vaults --region us-east-1
Create a file backup-plan.json:
backup-plan.json
{ "BackupPlanName": "DRP-Plan", "BackupPlanRule": [ { "RuleName": "Daily-Backup", "TargetBackupVault": "DRP-Vault", "ScheduleExpression": "cron(0 5 * *-*)", # Daily at 5 AM UTC "StartWindowMinutes": 60, "CompletionWindowMinutes": 180, "Lifecycle": { "DeleteAfterDays": 30 # Keep backups for 30 days } } ] }
Apply the plan:
aws backup create-backup-plan --backup-plan file://backup-plan.json --region us-east-1
Get the backup plan ARN:
aws backup list-backup-plans --region us-east-1
Assign an EC2 instance (replace instance-id and backup-plan-arn):
instance-id
backup-plan-arn
aws backup create-backup-selection \ --backup-selection file://selection.json \ --region us-east-1
Where selection.json is:
selection.json
{ "BackupSelection": { "SelectionName": "EC2-Backup", "IamRoleArn": "arn:aws:iam::123456789012:role/service-role/AWSBackupDefaultServiceRole", "Resources": ["arn:aws:ec2:us-east-1:123456789012:instance/i-1234567890abcdef0"], "BackupPlanId": "backup-plan-arn" } }
aws backup put-backup-vault-access-policy \ --backup-vault-name "DRP-Vault" \ --policy file://replication-policy.json \ --region us-east-1
Where replication-policy.json is:
replication-policy.json
{ "Version": "2012-10-17", "Statement": [ { "Effect": "Allow", "Principal": { "Service": "backup.amazonaws.com" }, "Action": "backup:CopyIntoBackupVault", "Resource": "*", "Condition": { "StringEquals": { "aws:RequestedRegion": "us-west-2" } } } ] }
us-east-1
us-west-2
bash aws backup start-restore-job \ --recovery-point-arn "arn:aws:backup:us-west-2:123456789012:recovery-point:rp-1234567890abcdef0" \ --metadata '{"InstanceType":"t3.micro"}' \ --iam-role-arn "arn:aws:iam::123456789012:role/service-role/AWSBackupDefaultServiceRole" \ --region us-west-2
bash aws ec2 describe-instances --filters "Name=tag:aws:backup:restore-point,Values=rp-1234567890abcdef0" --region us-west-2
Expected output: A new EC2 instance in us-west-2 with the same data as the original.
AdministratorAccess
2N
N+1
Environment=Production
BackupPlan=DRP
aws ec2 start-instances --instance-ids i-12345678
bash aws cloudwatch put-metric-alarm \ --alarm-name "Backup-Failed" \ --metric-name "NumberOfBackupJobsFailed" \ --namespace "AWS/Backup" \ --statistic "Sum" \ --period 300 \ --threshold 1 \ --comparison-operator "GreaterThanOrEqualToThreshold" \ --evaluation-periods 1 \ --alarm-actions "arn:aws:sns:us-east-1:123456789012:Backup-Alerts"
Answer: Incremental backup (you need the full backup + every incremental since then).
"You need a cost-effective, highly available storage for infrequently accessed backups. Which S3 class?"
Answer: S3 Glacier Deep Archive (cheapest, but retrieval takes 12+ hours).
"What’s the difference between a hot site and a cold site?"
"Your company’s database is corrupted. You have a full backup from 24 hours ago and incremental backups every 6 hours. What’s the fastest way to restore?" - Answer: Restore the full backup + the latest incremental backup (not all incrementals).
You’re a cloud engineer at a startup. Your CEO just asked: "If our primary AWS region (us-east-1) goes down, how long until our app is back online?" Your current setup: - A single EC2 instance in us-east-1. - No backups or multi-region setup.
Task: Design a cost-effective BCP/DRP plan with: - RTO < 30 minutes. - RPO < 1 hour.
Solution:1. Enable AWS Backup for the EC2 instance (daily full backups + hourly incrementals).2. Replicate backups to us-west-2 (cross-region).3. Create an AMI of the EC2 instance and copy it to us-west-2.4. Set up a CloudFormation template to launch the AMI in us-west-2 if us-east-1 fails.5. Test failover by terminating the us-east-1 instance and launching the AMI in us-west-2.
Why it works: - RTO < 30 minutes: AMI launch + data restore from backup. - RPO < 1 hour: Hourly incremental backups.
Join 4M+ learners. Unlock unlimited quizzes, wrong-answer tracking, flashcards + reminders, study guides, and 1-on-1 challenges.