1/44
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai |
|---|
No analytics yet
Send a link to your students to track their progress
ECS
Amazon ECS is a fully managed container orchestration service for deploying, managing, and scaling containerized applications.
SNS
Amazon SNS is a fully managed messaging service for decoupling microservices, distributed systems, and serverless applications by sending messages to multiple subscribers.
EMR
Amazon EMR is a cloud big data platform for processing large amounts of data quickly and cost-effectively using tools like Apache Spark, Hadoop, and Presto. Elastic MapReduce
SQS
Amazon SQS is a fully managed message queuing service for decoupling and scaling microservices, distributed systems, and serverless applications by sending, storing, and receiving messages between software components.
Cross-Region Replication (CRR)
CRR is an Amazon S3 feature that automatically replicates objects from a bucket in one AWS region to a bucket in another region, ensuring data availability in case of regional failures.
Versioning
Versioning in Amazon S3 allows keeping multiple versions of an object in the same bucket, ensuring all versions are replicated to the destination bucket in Cross-Region Replication.
Weighted Routing Policy
A routing policy that directs traffic based on different weights assigned to each resource, allowing load distribution based on the assigned weights.
Latency Routing Policy
A routing policy that directs users to the region with the lowest latency, ensuring they are sent to the fastest responding region.
Failover Routing Policy
A routing policy that directs traffic to a backup resource when the primary resource is unavailable, ensuring high availability and reliability.
Geolocation Routing Policy
A routing policy that directs users to different resources based on their geographic location, ensuring users are directed to the closest or most appropriate region.
Versioning and Cross-Region Replication
Enabling versioning for the source bucket and configuring Cross-Region Replication ensures all data versions are replicated to another region, meeting disaster recovery requirements.
Latency
The time for data to travel between a user and a server, crucial for user experience as lower latency means faster response times.
Routing Policies
In AWS Route 53, policies that determine how DNS queries are resolved to guide users to the appropriate resource.
Weighted Routing
Routing traffic to multiple resources based on specified weights, allowing distribution but not automatically choosing the region with the lowest latency.
Failover Routing
Routing traffic to a primary resource unless it becomes unavailable, then routing to a secondary resource, ensuring high availability but not optimizing for fastest response times.
Geolocation Routing
Routing traffic based on the geographic location of the user, directing users from specific countries or regions to particular resources.
Amazon SQS Messages
A metric to track the number of messages in the SQS queue, directly indicating how many messages are waiting to be processed by the ECS cluster.
Amazon ECS Containers
Tracking the number of running ECS containers, related to processing capacity but not directly helping in minimizing the SQS queue size.
Auto Scaling Strategy
Dynamically adjusting the number of ECS tasks based on demand to minimize the SQS queue size, scaling up when the queue grows and down when it shrinks.
Elastic Load Balancer (ELB)
Balancing incoming application traffic across multiple targets, ensuring only healthy instances receive traffic.
Cross-Zone Load Balancing
Configuring the ELB to distribute incoming application traffic across multiple Availability Zones to ensure the website is highly available.
High Availability
The ability of a system to remain accessible even in the event of failures or unavailability of components.
Auto Scaling
A feature that automatically adjusts the number of EC2 instances based on demand, ensuring the system's capacity and availability.
Cross-zone load balancing
A method that allows an ELB to evenly distribute incoming traffic across all instances in all Availability Zones.
Configure CloudWatch to monitor the EC2 instances using VPC flow logs
Amazon CloudWatch collects data and insights from AWS resources. VPC flow logs capture IP traffic information but do not directly contribute to high availability.
Configure the EC2 instance sizes to the maximum available size
Scaling up EC2 instances to the largest available size, which may improve performance but does not ensure high availability.
Configure the ELB to implement Auto Scaling based on health checks
Auto Scaling adjusts the number of EC2 instances based on demand. Health checks trigger the launch of new instances if failures occur, ensuring high availability.
Configure the ELB for cross-zone load balancing
Enables an ELB to evenly distribute traffic across all instances in all Availability Zones, improving load distribution but not directly addressing instance failures.
Scheduled scaling
An option that automatically adjusts the number of EC2 instances based on predefined schedules, suitable for applications with regular seasonal demand changes.
Regular Seasonal Increases in Demand
Demand patterns that are predictable and occur at specific times, such as certain months of the year or specific days of the week.
No Modifications to the Application
Requiring a solution that works with the existing application without any changes to the application code or architecture.
Automatic Adaptation
The scaling strategy automatically adjusts resources to handle increases in demand.
Predictive Scaling
Uses machine learning to forecast demand and adjust the number of EC2 instances in anticipation of traffic changes.
Dynamic Scaling
Adjusts the number of EC2 instances in real-time based on current demand metrics like CPU utilization, memory usage, or request rates.
Scheduled Scaling
Allows increasing or decreasing the number of EC2 instances at specific times based on a predefined schedule, suitable for predictable workloads.
Target Scaling
Maintains a desired target metric, such as CPU utilization, by automatically scaling resources up or down.
Constantly Changing Demands with No Patterns
Resource needs fluctuate unpredictably without predictable times of demand increases or decreases.
Key Concepts: Automatic Adaptation
The scaling strategy must automatically respond to unpredictable changes in demand without manual intervention.
Machine Learning Algorithms
Utilized to predict and adapt to changes in demand, suggesting that demand patterns may be analyzed for improved resource allocation.
Scaling the database layer through the addition of read replicas
Improves database performance by adding read replicas to offload read traffic from the primary database server.
Scaling the web server layer through the addition of web servers
Involves adding more web servers behind a load balancer to distribute traffic, reducing the load on each server and improving response times.
Amazon CloudFront caching to serve web assets to users
A content delivery network that caches static web assets at edge locations to reduce latency and improve delivery speeds for users.
Amazon ElastiCache to cache database results
Provides in-memory caching to improve database performance by caching frequently accessed data.
Amazon Simple Queue Service (Amazon SQS)
A fully managed message queuing service that enables decoupling and scaling of microservices, distributed systems, and serverless applications.
Scheduled
Refers to a mechanism where actions or messages are triggered at specified times.