WGU D492 CompTIA Data+ Latest updated version with accurate solutions 2026



Last updated 2:28 AM on 6/11/26

154 Terms

1
New cards

Data Lake

a centralized repository designed to store, process, and secure large amounts of structured, semistructured, and unstructured data.

2
New cards

Data Lakehouse

Data lakehouses are a combination of a data warehouse and data lakes. They provide a cost-effective and flexible solution for data storage needs, but are not focused on transactional data.

3
New cards

OLTP (online transaction processing)

a technology used for real-time data queries and record creation

4
New cards

OLAP (online analytical processing)

a class of software that allows complex analysis to be conducted on large databases without affecting transactional systems

5
New cards

Data Mart

a data storage technology that serves a specific department or need within a company. It is a subject-oriented relational database that stores transactional data in rows and columns, which makes the data easy to access, organize, and understand. Because it contains historical data, an analyst can use it to identify trends

6
New cards

quantitative (discrete)

countable and with a limited number of values

7
New cards

quantitative (continuous)

measurable and can take any value within a range

8
New cards

qualitative (nominal)

has no natural order, e.g. pink, green, brown

9
New cards

qualitative (ordinal)

follows a natural order, e.g. bad, good, great

10
New cards

SQL

a query language used to interact directly with a database

11
New cards

HTML

the markup language used to structure web pages

12
New cards

XML (Extensible Markup Language)

a markup language used to send data to and from another system

13
New cards

JSON (JavaScript Object Notation)

a text format that represents data as key-value pairs, commonly exchanged between two web applications

14
New cards

delimited file

a delimited flat file contains one or more records set off from each other by a specified delimiter, or separator
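
As a small illustration of a delimited file, the sketch below reads pipe-delimited text with Python's `csv` module. The sample records and field names are made up for the example; a real file would be opened with `open()` instead of `io.StringIO`.

```python
import csv
import io

# Hypothetical pipe-delimited content standing in for a flat file on disk.
raw = "id|name|city\n1|Ana|Lisbon\n2|Ben|Austin\n"

# The delimiter argument tells the reader which separator sets fields apart.
reader = csv.DictReader(io.StringIO(raw), delimiter="|")
records = list(reader)

print(records[0]["name"])
```

Changing `delimiter` to `","` or `"\t"` would handle comma- or tab-delimited files the same way.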

15
New cards

synchronous

the caller must wait for the web service's response before continuing

16
New cards

Asynchronous

allows you to do other tasks while waiting for the response
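
The difference between waiting and not waiting can be sketched with Python's `asyncio`. This is only an illustration: `fetch` is a hypothetical stand-in for a web service call, and `asyncio.sleep` simulates the response delay.

```python
import asyncio

async def fetch(name, delay):
    # asyncio.sleep stands in for waiting on a web service response.
    await asyncio.sleep(delay)
    return f"{name} done"

async def main():
    # Both requests are in flight at the same time, so neither blocks the other.
    # gather returns results in the order the calls were passed in.
    return await asyncio.gather(fetch("a", 0.02), fetch("b", 0.01))

results = asyncio.run(main())
print(results)
```

A synchronous version would call the two requests one after the other and wait for each to finish before starting the next.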

17
New cards

web scraping

the act of extracting data from a website (treat it as a last resort and get permission first)

18
New cards

machine data

data generated automatically by machines such as web servers, sensors, and other devices - can be used for predictive maintenance

19
New cards

sampling

creating a smaller data set from a larger data set
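
A minimal sketch of simple random sampling, assuming a made-up population of 100 record IDs. The seed value is arbitrary and only there so the example is repeatable.

```python
import random

# Hypothetical larger data set: record IDs 1 through 100.
population = list(range(1, 101))

# A seeded generator makes the same sample come back on every run.
rng = random.Random(42)

# sample() draws without replacement, so no record appears twice.
sample = rng.sample(population, 10)
print(len(sample))
```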

20
New cards

data profiling

the process of checking information that is present in the data

1. identify and document the source of data

2. identify the field names and data types

3. determine the fields to be identified for reporting

4. check for the primary, natural, or foreign keys

5. recognize all the data in the data set

21
New cards

parametric

Data with an underlying normal distribution

22
New cards

nonparametric

Data for which the probability distribution is unknown or known not to be normal

23
New cards

noise

Unnecessary data fields that have no value to the analysis

24
New cards

data manipulation

the process of recoding data so that it can be more useful during our processing, correlation, analysis, and reporting

25
New cards

derived variable

a new variable or data point derived or created from existing data
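
For example, a derived variable can be computed from two existing fields. The order records and the `total` field name below are invented for illustration.

```python
# Hypothetical order records with two existing fields.
orders = [
    {"qty": 3, "unit_price": 2.5},
    {"qty": 2, "unit_price": 4.0},
]

# "total" is the derived variable: it is not stored in the source data,
# it is calculated from qty and unit_price.
for order in orders:
    order["total"] = order["qty"] * order["unit_price"]

print([order["total"] for order in orders])
```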

26
New cards

recoded data

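data whose values have been transformed into a different coding scheme, e.g. text categories converted to numeric codes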

27
New cards

data imputation

substitutes missing data with estimated values
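
One common form is mean imputation, sketched below under the assumption that missing values are stored as `None`. Other estimates (median, mode, model-based) are equally valid choices; the ages are made-up data.

```python
from statistics import mean

# Hypothetical column with two missing values.
ages = [30, None, 20, 40, None]

# Estimate the missing values from the ones that were observed.
observed = [a for a in ages if a is not None]
fill_value = mean(observed)

# Substitute the estimate wherever a value is missing.
imputed = [fill_value if a is None else a for a in ages]
print(imputed)
```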

28
New cards

index field

a unique, non-personally identifiable number that can be used as a unique identifier

29
New cards

transposing data

swapping columns for rows, and rows for columns
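
A quick sketch of transposing a small table held as a list of rows. The table contents are invented for the example.

```python
# Hypothetical table: each inner list is one row.
table = [
    ["q1", 10, 20],
    ["q2", 30, 40],
]

# zip(*table) pairs up the first items of every row, then the second, and so on,
# so each original column becomes a row.
transposed = [list(column) for column in zip(*table)]
print(transposed)
```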

30
New cards

appending data

combines data from one data set to another data set

31
New cards

inline append

combines data sets together (discards original)

32
New cards

intermediate append

retains individual data sets, but also creates a new data set with the combined data (keeps original)

33
New cards

data blending

combines data from multiple sources into a single data set or view so it can be analyzed together

34
New cards

conditional logic

any kind of function that checks if there is a logical condition that's being met (if, isnull, and, or)

35
New cards

IF

is a logical function that uses a logical test to validate whether a condition is true or false

36
New cards

ISNULL

returns a specified value if the expression is null

37
New cards

AND

a logical function that tests two conditions and is true only when both are true

38
New cards

OR

tests if either one of two conditions is true

39
New cards

system functions

any functions that are packaged with your reporting tool or analysis tool to perform certain functions inside of that software

40
New cards

aggregate functions

written for a group of records, not just for a single record, and work with a column of data

41
New cards

date functions

derive attributes from date fields, like determining the day of the week, month, or year from a single date
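
Deriving attributes from a single date can be sketched with Python's `datetime` module. The date used here is just an example value, and the dictionary keys are illustrative names.

```python
from datetime import date

# Example date field value.
d = date(2026, 6, 11)

# Each attribute is derived from the one date.
parts = {
    "year": d.year,
    "month": d.month,
    "day": d.day,
    "weekday_index": d.weekday(),  # Monday is 0, Sunday is 6
}
print(parts)
```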

42
New cards

indexing

a field property setting that improves query speed and performance for fields that are commonly queried, sorted, or filtered

43
New cards

parsing

breaks and extracts data out of a field for use
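
As an illustration, the sketch below parses one combined contact field into separate pieces. The field layout and the sample value are assumptions made for the example.

```python
# Hypothetical field holding a name and an email address together.
full = "Rivera, Dana <dana.rivera@example.com>"

# Split the name portion from the email portion at the opening bracket.
name_part, _, email_part = full.partition("<")

# Break the name into last and first, trimming stray spaces.
last, first = [piece.strip() for piece in name_part.split(",")]

# Drop the closing bracket, then extract the domain after the @ sign.
email = email_part.rstrip(">").strip()
domain = email.split("@")[1]
print(first, last, domain)
```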

44
New cards

inner join

selects records that have matching values in both tables

45
New cards

left outer join

returns all the records from the left table, whether they match or not, plus the matching records from the right table

46
New cards

right outer join

returns all the records from the right table, whether they match or not, plus the matching records from the left table

47
New cards

full outer join

everything that intersects, as well as the rest of the data from both sides

48
New cards

cross join

It joins every row of the first table with every row of the second table, resulting in a potentially very large result set
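
The join types can be compared on a tiny in-memory SQLite database. This is a sketch only: the `customers` and `orders` tables and their rows are invented, and right and full outer joins are left out because older SQLite versions do not support them.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE customers (id INTEGER, name TEXT);
    CREATE TABLE orders (customer_id INTEGER, item TEXT);
    INSERT INTO customers VALUES (1, 'Ana'), (2, 'Ben'), (3, 'Cy');
    INSERT INTO orders VALUES (1, 'pen'), (1, 'ink'), (4, 'cap');
""")

# Inner join: only customers that have a matching order.
inner = con.execute(
    "SELECT c.name, o.item FROM customers c "
    "INNER JOIN orders o ON o.customer_id = c.id ORDER BY o.item"
).fetchall()

# Left outer join: every customer, with NULL (None) where no order matches.
left = con.execute(
    "SELECT c.name, o.item FROM customers c "
    "LEFT OUTER JOIN orders o ON o.customer_id = c.id ORDER BY c.id, o.item"
).fetchall()

# Cross join: every customer paired with every order (3 x 3 rows).
cross = con.execute(
    "SELECT c.name, o.item FROM customers c CROSS JOIN orders o"
).fetchall()

print(len(inner), len(left), len(cross))
```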

49
New cards

parameterization

the concept of replacing values within the query with parameters
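
A minimal sketch of a parameterized query using Python's `sqlite3` module. The `sales` table, its rows, and the `total_for` helper are hypothetical; the `?` placeholder is the parameter that replaces a hard-coded value.

```python
import sqlite3

sales_db = sqlite3.connect(":memory:")
sales_db.execute("CREATE TABLE sales (region TEXT, amount REAL)")
sales_db.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 100.0), ("west", 250.0), ("east", 50.0)],
)

def total_for(region):
    # The region value is passed separately from the SQL text,
    # so the same query can be reused with different values.
    row = sales_db.execute(
        "SELECT SUM(amount) FROM sales WHERE region = ?", (region,)
    ).fetchone()
    return row[0]

print(total_for("east"))
```

Passing values as parameters, rather than building the query string by hand, also helps guard against SQL injection.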

50
New cards

temporary table

a table that resides only temporarily on the database, typically in memory, and is removed when the session ends

51
New cards

subquery

a query nested inside another query statement
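
For instance, a subquery can supply a value that the outer query filters on. The sketch below uses an in-memory SQLite database with an invented `scores` table.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE scores (student TEXT, score INTEGER)")
db.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("a", 70), ("b", 90), ("c", 80)],
)

# The inner SELECT computes the average score;
# the outer SELECT keeps only students above that average.
above_avg = db.execute(
    "SELECT student FROM scores "
    "WHERE score > (SELECT AVG(score) FROM scores) "
    "ORDER BY student"
).fetchall()
print(above_avg)
```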

52
New cards

actual execution plan

shows the steps and resources the database actually used when the query ran

53
New cards

estimated execution plan

a list of the steps and resources the database expects to need for a query, produced without running it

54
New cards

exploratory analysis

the goal is to figure out what type of cleaning, profiling and transformation the data needs - it's all about the initial look at a given data set

55
New cards

performance analysis

type of analysis that measures the performance of a particular product, outcome, or scenario against the defined objective

56
New cards

KPIs (Key Performance Indicators)

measurements and goals that help identify whether a business is achieving its objectives (qualitative or quantitative)

57
New cards

gap analysis

analyzes the difference between the present state and a desired or future state (mostly quantitative measures)

58
New cards

delta

the change between where you are and where you want to be

59
New cards

trend analysis

measures the trend on historical data to predict a future outcome

60
New cards

link analysis

determines how a single data point links to other data points

61
New cards

finding standard deviation for a SAMPLE

find the mean, subtract the mean from each value to get the differences, square the differences, then add them up and divide by the number of values minus 1 (n - 1, because this is a SAMPLE) to get the variance. The standard deviation is the square root of the variance.
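
The steps for a sample standard deviation can be worked through in a few lines. The data values are made up; `statistics.stdev` from the standard library is included only as a cross-check.

```python
import math
import statistics

# Hypothetical sample.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Step 1: find the mean.
mean_value = sum(data) / len(data)

# Step 2 and 3: subtract the mean from each value and square the differences.
squared_diffs = [(x - mean_value) ** 2 for x in data]

# Step 4: divide the sum by n - 1 because this is a sample, giving the variance.
sample_variance = sum(squared_diffs) / (len(data) - 1)

# Step 5: the standard deviation is the square root of the variance.
sample_sd = math.sqrt(sample_variance)

# Cross-check against the standard library's sample standard deviation.
print(math.isclose(sample_sd, statistics.stdev(data)))
```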

62
New cards

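z-score

the number of standard deviations a data value lies above or below the mean: (value - mean) / standard deviation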
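
A z-score measures how many standard deviations a value sits from the mean. The sketch below assumes the data set is treated as a whole population (so `pstdev` is used); with a sample, `statistics.stdev` would be the usual substitute. The values are made up.

```python
from statistics import mean, pstdev

# Hypothetical data set, treated here as the full population.
values = [2, 4, 4, 4, 5, 5, 7, 9]

mu = mean(values)       # population mean
sigma = pstdev(values)  # population standard deviation

# z-score of the value 9: distance from the mean in standard deviations.
z = (9 - mu) / sigma
print(z)
```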

63
New cards

frequency

number of times that the given data value appears in the dataset

64
New cards

percentage difference

overall difference relative to the mean of two data points, where b5 and b6 are the two values: |b6 - b5| / ((b5 + b6) / 2) * 100

65
New cards

percentage change

change relative to the original value, where b5 is the old value and b6 is the new value: (b6 - b5) / b5 * 100
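
The two percentage formulas are easy to mix up, so here is a short side-by-side sketch. The function names and the sample values 80 and 100 are chosen for illustration.

```python
def percentage_change(old, new):
    # Change measured against the starting value.
    return (new - old) / old * 100

def percentage_difference(a, b):
    # Difference measured against the average of the two values,
    # so the order of a and b does not matter.
    return abs(a - b) / ((a + b) / 2) * 100

change = percentage_change(80, 100)
difference = percentage_difference(80, 100)
print(change, round(difference, 2))
```

Percentage change depends on which value comes first, while percentage difference gives the same answer either way.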

66
New cards

T-test

compares two groups to determine if there's a significant difference between their means

67
New cards

P-value

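the probability of seeing a difference at least as large as the one observed if it were due to chance alone, i.e. if the null hypothesis were true (below 5%, or 0.05, is commonly treated as significant)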

68
New cards

null hypothesis

assumes that there is NO relationship between the two variables being tested

69
New cards

alternative hypothesis

assumes that there is a relationship between the two variables being tested

70
New cards

type I error

false positive (rejecting a null hypothesis that is actually true)

71
New cards

type II error

false negative (failing to reject a null hypothesis that is actually false)

72
New cards

chi-square statistic

compares the size of the difference between the expected result and the actual result
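
A worked sketch of the chi-square statistic, which sums the squared gap between observed and expected counts, each divided by the expected count. The counts below are invented for the example.

```python
# Hypothetical observed counts for three categories,
# against an expectation of 20 in each.
observed = [18, 22, 20]
expected = [20, 20, 20]

# Sum of (observed - expected)^2 / expected over all categories.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
print(round(chi_square, 2))
```

A larger statistic means the observed counts sit further from what was expected.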

73
New cards

regression analysis

Statistical method used to estimate relationships between a dependent variable and one or more independent variables
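
As one simple case, a linear regression with a single independent variable can be fitted by ordinary least squares. This is a sketch with made-up points, not the only form of regression analysis.

```python
# Hypothetical data: xs is the independent variable, ys the dependent variable.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]

x_bar = sum(xs) / len(xs)
y_bar = sum(ys) / len(ys)

# Least squares slope: covariance of x and y divided by the variance of x.
slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
    (x - x_bar) ** 2 for x in xs
)

# The fitted line passes through the point (x_bar, y_bar).
intercept = y_bar - slope * x_bar
print(slope, intercept)
```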

74
New cards

pie chart

to show percentages

75
New cards

tree map

made for representing hierarchical data

76
New cards

bar chart

horizontal bars, with values measured along the x axis

77
New cards

column chart

vertical bars, with values measured along the y axis

78
New cards

line chart

time based data

79
New cards

scatter plot

to see if your data fits a trend

80
New cards

bubble chart

A type of scatter plot with circular symbols used to compare three variables; the area of the circle indicates the value of a third variable

81
New cards

Histogram

A graph of vertical bars representing the frequency distribution of a set of data (no gaps between the bars)

82
New cards

waterfall chart

shows discrete events over time, where each event builds on the one before it and adds to or subtracts from the running total

83
New cards

stacked chart

The stacked column/bar chart breaks a bar or column into separate portions to represent each data point

84
New cards

static report

report that is not automatically updated

85
New cards

real-time reporting

occurs when receiving up-to-date data

86
New cards

ad hoc report

generated in response to a one-time request

87
New cards

paginated report

a multi-page report that is not suitable for display on a dashboard

88
New cards

wireframe

a series of multiple mockups for multiple screens that are likely connected on a dashboard

89
New cards

narrative

a summary of the report contents and key findings

90
New cards

data creation

when data is acquired, entered, or captured in the system

91
New cards

data acquisition

occurs when existing data produced outside the organization is imported automatically into the system

92
New cards

data entry

occurs when information is manually typed into the system

93
New cards

data capture

occurs when data generated by a device is brought into the organization's systems

94
New cards

data storage

occurs when data is not being actively used

95
New cards

data use

viewing, processing, modifying, manipulating or saving the data

96
New cards

data archival

copying and storing of data that can be used when needed

97
New cards

data destruction

when the data is no longer valuable or has reached its useful life and needs to be destroyed

98
New cards

data steward

the person responsible for ensuring data is properly labeled, identified, collected, and stored

99
New cards

data custodian

a role that's responsible for handling the management of the system on which the data assets are going to be stored

100
New cards

data sovereignty

A term that refers to the legal implications of data stored in different countries or states