WGU D491 Exam Prep: Introduction to Analytics Practice

0.0(0)
studied byStudied by 5 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/110

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

111 Terms

1
New cards

data analytics

The process of analyzing data to extract insights

2
New cards

descriptive analytics

A type of data analytics that summarizes past data

3
New cards

predictive analytics

The process of predicting future outcomes

4
New cards

prescriptive analytics

The process of recommending actions based on data

5
New cards

ETL

Extract, Transform, Load

6
New cards

big data

Data that is too large and complex to be processed by traditional data-processing software

7
New cards

data mining

To extract useful information from large datasets

8
New cards

unstructured data

A collection of social media posts

9
New cards

data warehouse

A system used for reporting and data analysis

10
New cards

regression analysis

A common technique used in predictive analytics

11
New cards

structured data

It is stored in a fixed format

12
New cards

data mart

To provide a subset of data for a specific business function

13
New cards

data visualization tool

An example is Tableau

14
New cards

data analytics process

The first step is data collection

15
New cards

data lake

A storage repository that holds a vast amount of raw data in its native format

16
New cards

descriptive analytics goal

To summarize past data

17
New cards

data cleaning technique

Data transformation

18
New cards

data analyst role

To analyze data and extract insights

19
New cards

business intelligence tool

An example is Power BI

20
New cards

data governance goal

To ensure data quality and compliance

21
New cards

cloud-based analytics benefit

Reduced data storage costs

22
New cards

data dashboard purpose

To provide a visual representation of key performance indicators

23
New cards

machine learning algorithm

An example is Linear regression

24
New cards

real-time analytics advantage

Faster decision-making

25
New cards

missing data handling method

Replacing missing values with the mean or median

26
New cards

hypothesis test purpose

To make inferences about a population based on sample data

27
New cards

categorical variable example

Gender

28
New cards

exploratory data analysis goal

To summarize the main characteristics of the data

29
New cards

statistical test for means

T-test

30
New cards

p-value indication

The probability of observing the data given that the null hypothesis is true

31
New cards

measure of central tendency

Mean

32
New cards

correlation coefficient purpose

To measure the strength and direction of the relationship between two variables

33
New cards

continuous variable example

Temperature

34
New cards

regression analysis purpose

To predict the value of a dependent variable based on one or more independent variables

35
New cards

Summarizing past data

To summarize past data

36
New cards

Predicting dependent variable

To predict the value of a dependent variable based on one or more independent variables

37
New cards

Cleaning and organizing data

To clean and organize data

38
New cards

Storing data securely

To store data securely

39
New cards

Healthcare application of data science

Predicting patient outcomes

40
New cards

Recommendation system in e-commerce

To predict customer preferences and suggest products

41
New cards

Finance application of data science

Predicting stock prices

42
New cards

Sentiment analysis in social media

To analyze and interpret the emotions expressed in text

43
New cards

Marketing application of data science

Predicting customer churn

44
New cards

Natural language processing (NLP)

To analyze and interpret human language

45
New cards

Transportation application of data science

Predicting traffic patterns

46
New cards

Anomaly detection in cybersecurity

To identify unusual patterns that may indicate a security breach

47
New cards

Retail application of data science

Predicting inventory needs

48
New cards

Machine learning in data science

To develop algorithms that can learn from and make predictions on data

49
New cards

Key principle of effective data visualization

Keeping the visualization simple and clear

50
New cards

Purpose of a scatter plot

To compare the relationship between two variables

51
New cards

Categorical data visualization

Bar chart

52
New cards

Advantage of using a dashboard

It allows for interactive exploration of data

53
New cards

Data storytelling

The practice of using data visualizations to convey a narrative

54
New cards

Purpose of a heat map

To display data density or intensity

55
New cards

Common tool for creating data visualizations

Tableau

56
New cards

Goal of data communication

To effectively convey insights and findings from data analysis

57
New cards

Example of a time series visualization

Line chart

58
New cards

Purpose of a pie chart

To display the proportion of categories within a whole

59
New cards

Type of supervised learning

Regression

60
New cards

Goal of unsupervised learning

To find hidden patterns or intrinsic structures in data

61
New cards

Example of a classification algorithm

Decision tree

62
New cards

Purpose of a confusion matrix

To evaluate the performance of a classification model

63
New cards

Common technique in dimensionality reduction

Principal component analysis (PCA)

64
New cards

Advantage of ensemble methods

They improve the accuracy and robustness of predictions

65
New cards

Example of a clustering algorithm

K-means

66
New cards

Purpose of cross-validation

To evaluate the performance of a model

67
New cards

Characteristic of a neural network

It is based on the structure and function of the human brain

68
New cards

Goal of reinforcement learning

To learn optimal actions through trial and error

69
New cards

Common evaluation metric for regression models

Mean Squared Error (MSE)

70
New cards

Purpose of feature scaling

To normalize the range of independent variables

71
New cards

Example of a supervised learning algorithm

Support vector machine (SVM)

72
New cards

Predict continuous values

To predict continuous values

73
New cards

Classify data

To classify data into distinct categories

74
New cards

Reduce dimensionality

To reduce the dimensionality of data

75
New cards

Visualize data

To visualize data

76
New cards

Prevent overfitting

Common method for preventing overfitting in machine learning models

77
New cards

Learning rate

To control how much the model's weights are adjusted with respect to the loss gradient

78
New cards

Unsupervised learning algorithm

Example of an unsupervised learning algorithm is K-means clustering

79
New cards

Random forest advantage

It reduces the risk of overfitting by averaging multiple decision trees

80
New cards

Activation function

Common activation function used in neural networks is Sigmoid

81
New cards

Validation set purpose

To tune the model's hyperparameters

82
New cards

Feature selection technique

Common technique for feature selection is Principal component analysis (PCA)

83
New cards

Convolutional neural network goal

To analyze and interpret images

84
New cards

Reinforcement learning algorithm

Example of a reinforcement learning algorithm is Q-learning

85
New cards

Dropout purpose

To prevent overfitting by randomly dropping units during training

86
New cards

Evaluation metric for classification

Common evaluation metric for classification models is Accuracy

87
New cards

Support vector machine advantage

It works well with high-dimensional data

88
New cards

Handling imbalanced datasets

Common technique for handling imbalanced datasets is Oversampling the minority class

89
New cards

Confusion matrix purpose

To evaluate the performance of a classification model

90
New cards

Deep learning framework

Example of a deep learning framework is TensorFlow

91
New cards

Generative adversarial network goal

To generate new data samples that are similar to the training data

92
New cards

Model selection technique

Common technique for model selection in machine learning is Cross-validation

93
New cards

Learning curve purpose

To evaluate the performance of a model over time

94
New cards

Boosting algorithm example

Example of a boosting algorithm is AdaBoost

95
New cards

Hyperparameter tuning goal

To optimize the performance of a model

96
New cards

Reducing overfitting technique

Common technique for reducing overfitting in neural networks is Using dropout

97
New cards

ROC curve purpose

To evaluate the performance of a classification model

98
New cards

Regularization technique example

Example of a regularization technique is Lasso regression

99
New cards

Convolutional neural network advantage

It is specifically designed to process grid-like data such as images

100
New cards

Evaluation metric for clustering

Common evaluation metric for clustering algorithms is Silhouette score