SAA-C03 / Chapter 9 Data Ingestion, Transformation, and Analytics:

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/19

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

20 Terms

1
New cards

1. What are valid use cases for transforming data when importing it into a data lake?

(Select three.)

A. Imposing consistent timestamps

B. Removing corrupted data

C. Creating a schema

D. Removing duplicate data

E. Visualizing data

2
New cards

2. What AWS Data Lake transform detects duplicate data?

A. MatchFinder

B. FindMatches ML

C. Elastic MapReduce

D. Spark

3
New cards

3. What's the most efficient way to import data from an on-premises

SQL database into an AWS Data Lake?

A. Dump the database into an S3 bucket and then import the data into the data lake.

B. Import the data into RDS and then into the data lake.

C. Use the Glue Connector.

D. Use the JDBC connector.

4
New cards

4. What protocols does AWS Transfer Family support? (Choose two.)

A. SFTP

B. SMB

C. FTP

D. CIFS

E. HTTPS

5
New cards

5. AWS Transfer Family can be used to transfer files to or from which of the following?

(Choose two.)

A. EBS

B. EFS

C. RDP

D. S3

E. DynamoDB

6
New cards

6. What technology does AWS Glue use to search large data sets and perform data transformation?

A. Amazon Athena

B. Apache Spark

C. Apache Elephant Stack

D. AWS Data Lake

7
New cards

7. What is the difference between a data lake and a data warehouse?

(Choose two.)

A. A data warehouse can store unstructured data.

B. A data warehouse is a relational database.

C. A data lake requires structured data.

D. A data lake can store unstructured, schema-less data.

8
New cards

8. Which of the following can AWS Data Lake import from? (Choose two.)

A. EBS

B. ELB

C. CloudFront

D. IAM

E. CloudWatch

9
New cards

9. Which of the following can analyze data in an AWS Data Lake?

(Choose two.)

A. Amazon EMS

B. Athena

C. RedShift Spectrum

D. RedShift

E. S3

10
New cards

10. Which of the following is not an appropriate use of AWS Glue?

A. Searching data

B. Ingesting real-time streaming data

C. Preparing data for analysis

D. Transforming data

11
New cards

11. You're developing an application to predict future weather patterns based on RADAR images. Which of the following Kinesis services is the best choice to support this application?

A. Kinesis Data Streams

B. Kinesis Video Streams

C. Kinesis Data Firehose

D. Kinesis ML

12
New cards

12. You're streaming image data to Kinesis Data Streams and need to retain the data for 30 days. How can you do this? (Choose two.)

A. Create a Kinesis Data Firehose delivery stream.

B. Increase the stream retention period to 14 days.

C. Specify an S3 bucket as the destination.

D. Specify CloudWatch Logs as the destination.

13
New cards

13. Which of the following Kinesis services requires you to specify a destination for the stream?

A. Kinesis Video Streams

B. Kinesis Data Streams

C. Kinesis Data Firehose

D. Kinesis Data Warehouse

14
New cards

14. You're running an on-premises application that frequently writes to a log file. You want to stream this log file to a Kinesis Data Stream. How can you accomplish this with the least effort?

A. Use the CloudWatch Logs Agent.

B. Use the Amazon Kinesis Agent.

C. Write a script that uses the Kinesis Producer Library.

D. Move the application to an EC2 instance.

15
New cards

15. When deciding whether to use SQS or Kinesis Data Streams to ingest data, which of the following should you take into account?

A. The frequency of data

B. The total amount of data

C. The number of consumers that need to receive the data

D. The order of data

16
New cards

16. You want to send streaming log data into Amazon Redshift. Which of the following services should you use? (Choose two.)

A. SQS with a standard queue

B. Kinesis Data Streams

C. Kinesis Data Firehose

D. SQS with a FIFO queue

17
New cards

17. Which of the following is not an appropriate use case for Kinesis?

A. Stock feeds

B. Facial recognition

C. Static website hosting

D. Videoconferencing

18
New cards

18. You need to push 2 MB per second through a Kinesis Data Stream. How many shards do you need to configure?

A. 1

B. 2

C. 4

D. 8

19
New cards

19. Multiple consumers are receiving a Kinesis Data Stream at a total rate of 3 MB per second. You plan to add more consumers and need the stream to support reads of at least 5 MB per second. How many shards do you need to add?

A. 1

B. 2

C. 3

D. 4

20
New cards

20. Which of the following does Kinesis Data Firehose not support?

A. Videoconferencing

B. Transforming video metadata

C. Converting CSV to JSON

D. Redshift