Design High-Performing Architectures

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/80

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

81 Terms

1
New cards
What is Amazon API Gateway?
A service that manages APIs, letting you send requests to backends like Lambda or other AWS services.
2
New cards
What happens when API Gateway requests exceed the configured throttling limit?
Requests over the limit get an HTTP 429 “Too Many Requests” response.
3
New cards
What are the two throttling settings in API Gateway?
Standard rate (requests per second) and burst (short spikes of extra requests).
4
New cards
How do API Gateway SDK clients handle HTTP 429 responses?
They automatically try the request again.
5
New cards
How do you enable caching in API Gateway?
By creating a cache for a specific API stage and setting its size.
6
New cards
What are two benefits of enabling caching in API Gateway?
Makes APIs faster and reduces traffic to your backend.
7
New cards
What cache settings can you configure in API Gateway?
How the cache key is built and how long data stays in the cache (TTL).
8
New cards
How can you invalidate cached data in API Gateway?
Use API Gateway management APIs to clear the cache for a stage.
9
New cards
What is a VPC endpoint?
A way to privately connect your VPC to supported AWS services without using the internet, NAT, VPN, or Direct Connect.
10
New cards
Why don’t instances need public IPs when using a VPC endpoint?
Because traffic to the service stays within the Amazon network and doesn’t go over the public internet.
11
New cards
What is the main benefit of using a VPC endpoint with S3 and DynamoDB?
You can access them privately using private IP addresses, with no exposure to the public internet.
12
New cards
When would you use a VPC endpoint?
When you want to connect to AWS services privately from your VPC without using the internet.
13
New cards
What is throttling in API Gateway?
A way to limit the number of API requests per second to prevent overload.
14
New cards
What are the two types of throttling limits in API Gateway?
Standard rate (requests per second) and burst (short spikes of extra requests).
15
New cards
What happens when requests exceed the throttling limit in API Gateway?
They get an HTTP 429 “Too Many Requests” response.
16
New cards
How do API Gateway SDK clients handle HTTP 429 responses?
They automatically retry the request.
17
New cards
Why use throttling limits in API Gateway?
To control traffic, protect backend resources, and avoid overload.
18
New cards
How do you provision a cache in API Gateway?
By creating a cache for a specific API stage and choosing its size in gigabytes.
19
New cards
What does enabling caching in API Gateway do?
It improves performance and reduces traffic to your backend.
20
New cards
What cache settings can you control in API Gateway?
How the cache key is built and the time-to-live (TTL) for each method.
21
New cards
How can you clear cached data in API Gateway?
Use the API Gateway management APIs to invalidate the cache for a stage.
22
New cards
What is hot, warm, and cold storage?
Hot = frequently accessed data, Warm = less frequently accessed, Cold = rarely accessed; colder storage is cheaper to store but more expensive to access.
23
New cards
What is Amazon FSx for Lustre used for?
A high-performance, parallel file system for fast processing of hot data workloads.
24
New cards
What is Amazon FSx for Windows File Server?
A fully managed Windows file system supporting SMB, NTFS, and Active Directory integration.
25
New cards
What is Amazon Elastic File System (EFS)?
A fully managed file storage service for easy setup and scalable file storage in AWS.
26
New cards
What is Amazon S3 used for in terms of data access?
Object storage for frequently, infrequently, or rarely accessed data, with multiple storage tiers including Glacier for cold data.
27
New cards
Which AWS service combination can be used for hot and cold storage?
Amazon FSx for Lustre for hot, high-performance data and Amazon S3 (Glacier) for cold, infrequently accessed data.
28
New cards
What is Lambda@Edge?
A service that lets you run Lambda functions to customize CloudFront content at locations closer to viewers without managing servers.
29
New cards
At which points can Lambda@Edge functions run in CloudFront?
Viewer request, Origin request, Origin response, Viewer response.
30
New cards
Why use Lambda@Edge for authentication?
It allows authentication to happen closer to users, reducing latency.
31
New cards
What is an origin failover in CloudFront?
A setup where an origin group has a primary and secondary origin, and CloudFront switches to the secondary if the primary fails.
32
New cards
Which HTTP errors can CloudFront origin failover help prevent?
Errors like HTTP 504 when the primary origin fails.
33
New cards
What is a DynamoDB stream?
An ordered flow of information about changes to items in a DynamoDB table.
34
New cards
What information does a DynamoDB stream record contain?
The primary key of the modified item and optionally "before" and "after" images.
35
New cards
How can DynamoDB Streams trigger actions automatically?
By integrating with AWS Lambda, which can execute code in response to stream events.
36
New cards
How does Lambda process DynamoDB stream records?
Lambda polls the stream and invokes the function synchronously when new records appear.
37
New cards
What role is needed for Lambda to process DynamoDB stream data?
An IAM role with permissions the Lambda function needs at runtime.
38
New cards
How can processed stream data notify users?
The Lambda function can publish messages to an SNS topic, which notifies subscribers via email.
39
New cards
What is Amazon FSx for Windows File Server?
A fully managed Windows file server with native Windows file system, accessible from Windows, Linux, and macOS.
40
New cards
How many compute instances and devices can access an FSx file system concurrently?
Thousands of compute instances and devices.
41
New cards
What is a file share in Amazon FSx?
A specific folder (including subfolders) made accessible to compute instances via the SMB protocol.
42
New cards
What is the first step to migrate an existing file share to FSx?
Migrate the files to Amazon FSx before migrating the file share configuration.
43
New cards
Why use Amazon FSx for migrating enterprise file shares?
It supports lift-and-shift of existing Windows applications with compatibility, performance, and enterprise features.
44
New cards
How can you invoke a Lambda function from an Aurora MySQL DB cluster?
By using a native function or a stored procedure.
45
New cards
Why would you invoke Lambda from Aurora MySQL?
To integrate the database with other AWS services or react to data changes automatically.
46
New cards
What can a Lambda function do after being triggered by Aurora MySQL?
It can send data or event notifications to services like an SQS queue for further processing.
47
New cards
What does the partition key in a DynamoDB table determine?
The logical partitions where the table's data is stored, which affects physical partitions.
48
New cards
What happens if I/O requests are unevenly distributed across partition keys?
It can create "hot" partitions, causing throttling and inefficient use of provisioned throughput.
49
New cards
How does partition-key design affect provisioned throughput usage?
More distinct partition key values spread requests evenly, improving throughput efficiency.
50
New cards
What type of partition keys should you use for optimal performance?
High-cardinality attributes with many distinct values for each item.
51
New cards
What is scheduled scaling in Auto Scaling?
Scaling your application in response to predictable load changes at specified times.
52
New cards
How do you configure scheduled scaling in an Auto Scaling group?
By creating a scheduled action with a start time and specifying new minimum, maximum, and desired group sizes.
53
New cards
Can scheduled scaling actions be recurring or one-time?
Both; you can create one-time or recurring scheduled actions.
54
New cards
Why use scheduled scaling for predictable traffic patterns?
It ensures instances are ready before peak usage, improving performance and availability.
55
New cards
What is AWS Resource Access Manager (RAM)?
A service that lets you securely share AWS resources with other AWS accounts or within your AWS Organization.
56
New cards
Which AWS resources can be shared using RAM?
Transit Gateways, Subnets, License Manager configurations, and Route 53 Resolver rules.
57
New cards
Why use AWS RAM in a multi-account environment?
It eliminates the need to duplicate resources in every account, reducing operational overhead.
58
New cards
How do you share resources using AWS RAM?
Create a Resource Share, specify the resources, and specify the accounts to share with.
59
New cards
What is AWS Organizations?
A service to consolidate and centrally manage multiple AWS accounts, organize them into groups, and apply policy-based controls.
60
New cards
Why combine AWS Organizations with RAM?
To centrally manage accounts while securely sharing resources across them without duplication.
61
New cards
What is AWS Glue?
A fully managed ETL service that moves and transforms data between different data stores.
62
New cards
What is a key advantage of AWS Glue?
Automatic schema discovery and mapping across different data sources.
63
New cards
Why use Apache Parquet for big data processing?
It’s a columnar format with high compression and faster query times, improving efficiency and performance.
64
New cards
Which data processing frameworks are compatible with Parquet?
Spark, Hive, and Hadoop.
65
New cards
How can AWS Glue process .csv files into Parquet and store them in S3?
Use a Glue ETL job to convert the files and store the output in an S3 bucket.
66
New cards
How can S3 events trigger a Glue ETL job?
Set up an S3 Event Notification for PUT events and invoke the Glue job through Amazon SQS.
67
New cards
Which EC2 metrics are readily available in CloudWatch?
CPU utilization, Network utilization, and Disk read.
68
New cards
What metrics are not available by default in CloudWatch?
Memory utilization, disk swap utilization, disk space utilization, page file utilization, and custom log collection.
69
New cards
How can you collect custom metrics for EC2 instances?
By using CloudWatch Monitoring Scripts or installing the CloudWatch Agent.
70
New cards
What is an example of a custom metric for EC2?
Memory utilization.
71
New cards
Why use custom metrics in CloudWatch?
To monitor system-level metrics that are not available by default, like memory or disk usage.
72
New cards
What is Amazon Kinesis Data Streams (KDS)?
A real-time data streaming service that can capture gigabytes of data per second from many sources.
73
New cards
How can AWS Lambda integrate with Kinesis Data Streams?
It can process, transform, or anonymize streaming data in real time before storage.
74
New cards
Why anonymize data using Lambda with Kinesis?
To protect PII and reduce the risk of data breaches or privacy violations.
75
New cards
Where can anonymized streaming data be stored?
In Amazon DynamoDB, a NoSQL database suitable for processed streaming data.
76
New cards
What is AWS Database Migration Service (DMS)?
A service that migrates relational databases, data warehouses, NoSQL databases, and other data stores to AWS or between on-premises and cloud.
77
New cards
What types of migrations can AWS DMS perform?
One-time migrations and ongoing replication to keep sources and targets in sync.
78
New cards
What tool can help migrate database schemas to a new engine?
AWS Schema Conversion Tool (AWS SCT).
79
New cards
What is AWS Elastic Beanstalk?
A service that automatically handles capacity, load balancing, scaling, and application health for deployed applications.
80
New cards
Which programming languages does Elastic Beanstalk support?
Go, Java, .NET, Node.js, PHP, Python, and Ruby.
81
New cards
What is the benefit of Elastic Beanstalk for .NET applications?
It simplifies deploying, managing, and scaling ASP.NET web applications running on IIS.