AWS Machine Learning - Specialty Official Practice Question Set (MLS-C01) v2

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/50

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

51 Terms

1
New cards

Pre-splitting Your Data

Used for when you need explicit control over the data in your training and evaluation datasources.

2
New cards

Sequentially Splitting Your Data

This approach is useful if you want to evaluate your ML models on data for a certain date or within a certain time range

3
New cards

Randomly Splitting Your Data

This approach is useful to ensure that the distribution of the data is similar in the training and evaluation datasources.

4
New cards

True

Is important to use the same seed string value for both datasources and the complement flag for one datasource

5
New cards

True

A common pitfall in developing a high-quality ML model is evaluating the ML model on data that is not similar to the data used for training.

6
New cards

True

The model and evaluation are too dissimilar (have extremely different descriptive statistics) to be useful.

7
New cards

This can happen when input data is sorted by one of the columns in the dataset

and then split sequentially.

8
New cards

False

You need to use random splitting in Amazon ML if you have already randomized your input data

9
New cards

groupFiles

Set _ to inPartition to enable the grouping of files within an Amazon S3 data partition.

10
New cards

groupSize

Set _ to the target size of groups in bytes.

11
New cards

False

The groupSize property is required

12
New cards

recurse

Set _ to True to recursively read files in all subdirectories when specifying paths as an array of paths.

13
New cards

False

You need to set recurse if paths is an array of object keys in Amazon S3

14
New cards

Sequence-to-Sequence Algorithm

supervised learning algorithm where the input is a sequence of tokens (for example

15
New cards

Sequence-to-Sequence Algorithm

Algorithm to use for a:

16
New cards

machine translation

17
New cards

Sequence-to-Sequence Algorithm

Algorithm to use for a:

18
New cards

text summarization

19
New cards

True

Amazon SageMaker AI seq2seq uses Recurrent Neural Networks (RNNs) models

20
New cards

False

Amazon SageMaker AI seq2seq does not use Convolutional Neural Network (CNN) models

21
New cards

Data Flow

Create a _ to define a series of ML data prep steps.

22
New cards

Used to combine datasets from different data sources

identify the number and types of transformations you want to apply to datasets

23
New cards

Transform

Clean and your dataset using standard s like string

24
New cards

Examples in usage:

  • text and date/time embedding
  • categorical encoding.
25
New cards

Generate Data Insights

Automatically verify data quality and detect abnormalities in your data with Data Wrangler Data Quality and Insights Report.

26
New cards

True

Amazon SageMaker Canvas supports training a range of model types

27
New cards

Amazon SageMaker Canvas

Canvas custom model on the following types of datasets:

28
New cards
  • Tabular (including numeric

categorical

29
New cards

Numeric prediction

Predicting house prices based on features like square footage

30
New cards

Numeric

31
New cards

Local upload

Amazon S3

32
New cards

2 category prediction

Predicting whether or not a customer is likely to churn

33
New cards

Binary or categorical

34
New cards

Local upload

Amazon S3

35
New cards

3+ category prediction

Predicting patient outcomes after being discharged from the hospital

36
New cards

Categorical

37
New cards

Local upload

Amazon S3

38
New cards

Time series forecasting

Predicting your inventory for the next quarter

39
New cards

Timeseries

40
New cards

Local upload

Amazon S3

41
New cards

Single-label image prediction

Predicting types of manufacturing defects in images

42
New cards

Image (JPG

PNG)

43
New cards

Local upload

Amazon S3

44
New cards

Multi-category text prediction

Predicting categories of products

45
New cards

Target column: binary or categorical

46
New cards

Local upload

Amazon S3

47
New cards

True

In Amazon ML

48
New cards

Area Under the (Receiver Operating Characteristic) Curve (AUC)

Amazon ML provides an industry-standard accuracy metric for binary classification models called _

49
New cards

True

AUC values near 1 indicate an ML model that is highly accurate.

50
New cards

True

Values near 0.5 indicate an ML model that is no better than guessing at random.

51
New cards

True

Values near 0 are unusual to see