AWS Module 7 Notes
Module 7: Storage
Section 1: Core AWS Storage Services Overview
AWS Service | Storage Type | Use Case |
|---|---|---|
EBS | Block | OS volumes, databases on EC2 |
EFS | File | Shared access across EC2s |
S3 | Object | Static files, backups, big data |
Glacier | Object (cold) | Archival, compliance |
Section 2: Instance Store (Ephemeral Storage)
Temporary storage physically attached to EC2 instances.
Data is lost when the instance stops or is terminated; not suitable for persistent storage.
Section 3: Amazon EBS (Elastic Block Store)
Provides persistent block storage volumes for EC2 instances.
Auto-replicated within the same Availability Zone (AZ).
Offers low-latency, high-performance storage.
Supports snapshots for backups (incremental after baseline).
Types of EBS Volumes
General Purpose SSD (gp3/gp2): Default choice for most workloads.
Provisioned IOPS SSD (io2/io1): High-performance for mission-critical apps.
Throughput-Optimized HDD (st1): Ideal for large, sequential workloads.
Cold HDD (sc1): Lowest cost, for infrequent access workloads.
EBS Pricing Considerations
Volumes: Charged by provisioned GB/month.
IOPS: Charged by provisioned IOPS (if using io1/io2).
Snapshots: Charged by GB-month.
Data Transfer: Charged for inter-region transfers.
Section 4: Amazon S3 (Simple Storage Service)
Scalable object storage with global access via URL.
Designed for 99.999999999% durability and 99.99% availability.
Stores objects in buckets, each up to 5 TB.
Offers multiple storage classes:
S3 Standard: For frequently accessed data.
Intelligent-Tiering: Automatically shifts objects between access tiers.
Standard-IA: Infrequently accessed data.
One Zone-IA: Same as IA but in one AZ (cheaper, lower durability).
Glacier / Glacier Deep Archive: Long-term archiving with retrieval delays.
Lifecycle Management
Lifecycle policies can automatically move objects between storage classes (e.g., Standard → IA → Glacier).
S3 Pricing
Charged by:
Storage (GB per month)
Number of requests (e.g., GET, PUT, COPY per 1,000 requests)
Data transferred out of AWS
Section 5: Amazon EFS (Elastic File System)
Scalable, elastic file storage for Linux EC2 instances.
Shared access across multiple instances (NFS protocol).
Automatically grows/shrinks based on usage.
Stores data across multiple AZs by default.
Best for shared web content, big data workloads, or containers.
Section 6: Amazon S3 Glacier & Glacier Deep Archive
Long-term, low-cost archival storage.
Glacier: Retrieval in minutes to hours.
Glacier Deep Archive: Lowest cost, retrieval can take 12+ hours.
Key Concepts
Archive: Individual data object.
Vault: Container for multiple archives.
Retrieval Options
Expedited (1–5 mins): Highest cost
Standard (3–5 hrs): Default
Bulk (5–12 hrs): Lowest cost
Use Cases
Archiving media, compliance records, health data backups
Note: Glacier & Glacier Deep Archive are S3 storage classes, not separate services.
Section 7: S3 vs. Glacier Comparison
Feature | S3 Standard | S3 Glacier |
|---|---|---|
Access Speed | Milliseconds | Minutes to hours |
Cost per GB | Higher | Much lower |
Use Case | Frequent access | Infrequent, archival |
Max Object Size | 5 TB | 40 TB |
Module Summary
This module explained the different AWS storage services:
EBS: Block storage for EC2
S3: Scalable object storage for general-purpose use
EFS: Shared file storage for Linux EC2 instances
S3 Glacier/Deep Archive: Archival solutions for long-term, infrequent access
You now understand their primary functions, storage types, use cases, performance characteristics, and pricing models. This will help you choose the right solution depending on your app’s needs.