CompTIA Data+ Exam DA0-001: Data Analytics Terminology

Description and Tags

A complete vocabulary set based on the CompTIA Data+ Study Guide Glossary, covering database management, statistical analysis, and data governance.

Last updated 3:31 AM on 6/18/26

188 Terms

1
New cards

acceptable use agreement

an agreement that describes not only how data can be used, but also for what purpose

2
New cards

accountability

when data governance plans are being followed and there are accountability measures in place

3
New cards

ad hoc report

a report that is generated in response to a one-time request

4
New cards

Advanced Encryption Standard (AES)

a Federal Information Processing Standards (FIPS)-approved cryptographic algorithm that can be used to protect electronic data

5
New cards

aggregate functions

functions that are written for all or a group of records, as opposed to a single record
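
As a rough illustration of aggregate functions, the sketch below runs COUNT, SUM, and AVG over groups of records using Python's built-in sqlite3 module. The `sales` table and its values are made up for the example.

```python
import sqlite3

# Hypothetical in-memory table used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("East", 100.0), ("East", 250.0), ("West", 75.0)],
)

# COUNT, SUM, and AVG each summarize a group of records, not a single record.
rows = conn.execute(
    "SELECT region, COUNT(*), SUM(amount), AVG(amount) "
    "FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(rows)  # [('East', 2, 350.0, 175.0), ('West', 1, 75.0, 75.0)]
```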

6
New cards

aggregated data

data that has already been compiled and summarized for the purposes of analysis and reporting

7
New cards

alternative hypothesis

the assumption that a relationship exists between two variables

8
New cards

append

to combine data from one data set with another data set

9
New cards

appendix

a part of the narrative that provides additional details related to the report or process that are not essential to the main content

10
New cards

application programming interface (API)

a library of programming utilities used, for example, to enable software developers to access functions of the TCP/IP network stack under a particular operating system

12
New cards

ascending and descending order

a method of sorting in which fields are sorted with the minimum on top (for ascending) or maximum on top (for descending)

13
New cards

automated validation

using the power of software to ensure data achieves a validated result

14
New cards

bar chart

a chart that displays information, listing the categories on the y-axis and the discrete values on the x-axis

15
New cards

bins / binning

defined intervals or "buckets" used to group continuous numerical data into discrete categories.

16
New cards

binning (or bucketing / discretization)

the data preprocessing technique of assigning individual data points to defined intervals (bins)

17
New cards

Which type of graph is binning associated with?

histograms

18
New cards

bin size calculation

To determine a good bin size, take the range of the data (largest value minus the smallest value) and divide it by the number of bins, typically between 5 and 15: $\frac{\text{largest value} - \text{smallest value}}{\text{number of bins}}$.
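
The calculation above can be sketched in a few lines of Python. The helper names `bin_width` and `assign_bin` and the sample ages are assumptions made for this example, and placing the largest value in the last bin is one common convention, not the only one.

```python
def bin_width(values, number_of_bins):
    """Range of the data divided by the number of bins."""
    return (max(values) - min(values)) / number_of_bins

def assign_bin(value, smallest, width, number_of_bins):
    """Return the zero-based index of the bin that holds the value."""
    index = int((value - smallest) / width)
    # The largest value would land one past the end, so keep it in the last bin.
    return min(index, number_of_bins - 1)

ages = [18, 22, 25, 31, 40, 47, 58]   # hypothetical sample data
width = bin_width(ages, 5)            # (58 - 18) / 5 = 8.0
bins = [assign_bin(a, min(ages), width, 5) for a in ages]
print(width, bins)  # 8.0 [0, 0, 0, 1, 2, 3, 4]
```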

19
New cards

bubble chart

a visual that plots points on an x-axis and y-axis similar to a scatter plot, but with the addition of the size of the dot representing a third variable

20
New cards


captioning

designating more meaningful names for fields in a report or dashboard

21
New cards

cardinality

how many possible occurrences of one entity can be associated with the number of occurrences in another

22
New cards

cascade delete

referential integrity setting that deletes all related records when the primary key is deleted
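
A minimal sketch of cascade delete, assuming SQLite through Python's built-in sqlite3 module. The `customers` and `orders` tables are hypothetical, and other database systems configure this setting with their own syntax.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    " id INTEGER PRIMARY KEY,"
    " customer_id INTEGER REFERENCES customers(id) ON DELETE CASCADE)"
)
conn.execute("INSERT INTO customers VALUES (1, 'Ada')")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(10, 1), (11, 1)])

# Deleting the primary key record also deletes the related order records.
conn.execute("DELETE FROM customers WHERE id = 1")
remaining_orders = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
print(remaining_orders)  # 0
```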

23
New cards

cascade update

referential integrity setting that updates all related records when the primary key is changed

24
New cards

causal relationship

a relationship in which one variable is proven to have an effect on another

25
New cards

chi-square statistic

a value that compares the size of the difference between the expected result and the actual result

26
New cards

chi-square test

a test used to determine if a difference exists between groups; produces the chi-square statistic
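
As a hedged illustration, the chi-square statistic is commonly computed as the sum of (observed - expected)^2 / expected across categories. The function name and the counts below are made up for the example, and the sketch stops short of the p-value lookup that a full test would include.

```python
def chi_square_statistic(observed, expected):
    """Sum of squared differences between observed and expected counts,
    each scaled by the expected count."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

observed = [18, 22, 20, 40]   # hypothetical actual counts per group
expected = [25, 25, 25, 25]   # counts expected if there were no difference
stat = chi_square_statistic(observed, expected)
print(round(stat, 2))  # 12.32
```

A larger statistic means the actual results sit further from the expected results.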

27
New cards

column chart

a chart that displays information, listing the categories on the x-axis and the discrete values on the y-axis

28
New cards

conceptual data model

the conceptual view of what should exist in a data system and how it could be related

29
New cards

confidence interval

a calculation of values that describes the certainty or uncertainty of an estimate made on the analysis

30
New cards

continuous data

a characteristic of quantitative data that identifies data that can be measured and can take on any value

31
New cards

cross validation

determining whether data collected across different methods is consistent and accurate

32
New cards

custom sorts

sorting when you create the data set to include the value and the sort order you need for your visualization

34
New cards

correlation

the statistical association between two (or more) equal variables that tells us whether, when one variable changes, the other(s) tend to change too

35
New cards

data at rest

data that is being stored

36
New cards

data custodian

the person who manages the system on which the data assets are stored

37
New cards

data dictionary

a document that serves as the authority on all definitions that have been agreed upon for the organization, as well as key metrics

38
New cards

data governance

a large umbrella term for a framework used to govern data in an organization

39
New cards

data in transit

data that is actively being transferred

40
New cards

data in use

data that has been transmitted and is now present in memory or being queried

41
New cards

data lake

a technology for storing large amounts of structured and unstructured types of information in their original format

42
New cards

data lakehouse

a data management system that combines the best of both data warehousing and data lakes

43
New cards

data mart

a subset of the data warehouse that is dedicated to a specific department or group

44
New cards

data owner

the person who holds the ultimate responsibility for maintaining the confidentiality, integrity, and availability of the information asset

45
New cards

data steward

the person who is primarily responsible for data quality

46
New cards

data validation

the process of confirming the type, structure, and accurate representation of the data

47
New cards

data verification

the process of confirming that the data is accurate or true

48
New cards

data warehouse

a technology that is dedicated to the storage of company data from a wide range of sources for reporting and decision-making purposes

49
New cards

discrete data

a characteristic of quantitative data that identifies data that can be counted and can only take on a certain number of values

50
New cards

delimited files

files in which some form of character separates each field of data from the other data fields
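
For illustration, the sketch below reads a pipe-delimited file with Python's built-in csv module. The sample text and field names are invented, and a comma or tab delimiter would work the same way by changing the `delimiter` argument.

```python
import csv
import io

# Hypothetical file contents; "|" separates each field from the next.
pipe_delimited = "id|name|city\n1|Ada|London\n2|Grace|New York\n"

reader = csv.DictReader(io.StringIO(pipe_delimited), delimiter="|")
records = list(reader)
print(records[1]["city"])  # New York
```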

51
New cards

delta load

the method of loading new data into a data system and updating any existing data that has changed since the last load
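
A delta load can be sketched as an insert-or-update pass over only the records that changed. The `delta_load` helper and the dictionary standing in for a data system are assumptions for this example; real systems usually rely on timestamps or change tracking to find the changed records.

```python
def delta_load(target, changes):
    """Insert new records and update changed ones, keyed by record id.

    Returns the number of records inserted and updated.
    """
    inserted, updated = 0, 0
    for key, row in changes.items():
        if key not in target:
            inserted += 1
        elif target[key] != row:
            updated += 1
        else:
            continue  # unchanged record, nothing to load
        target[key] = row
    return inserted, updated

warehouse = {1: {"name": "Ada", "city": "London"}}
since_last_load = {
    1: {"name": "Ada", "city": "Paris"},       # existing record that changed
    2: {"name": "Grace", "city": "New York"},  # brand-new record
}
result = delta_load(warehouse, since_last_load)
print(result)  # (1, 1)
```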

52
New cards

denormalized data

data that has not gone through a normalization process and contains repetitive data

53
New cards

dependent variable

the variable we are measuring when comparing two groups

54
New cards

dimension table

a table that holds attributes or the categorical information that supports the fact tables

56
New cards

domain integrity

the acceptable values for a field

57
New cards

duplicated data

data that is repeated within the same data set

58
New cards

dynamic report

a report that is connected to the data and can be refreshed on demand or regularly updated automatically; also known as a real-time report

59
New cards

ELT (Extract, Load, Transform)

the process that occurs when moving data from source systems to data lakes, which hold data in preparation for transformation

60
New cards

empirical rule

the tendency of most data points in a normal distribution to fall within three standard deviations of the mean, on either the positive or negative side of the curve
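
The empirical rule is usually stated in terms of standard deviations: roughly 68%, 95%, and 99.7% of values in a normal distribution fall within one, two, and three standard deviations of the mean. The sketch below checks those shares with Python's built-in `statistics.NormalDist`; the helper name `share_within` is an assumption for the example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Share of a normal distribution within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {share_within(k):.4f}")
# within 1 standard deviation(s): 0.6827
# within 2 standard deviation(s): 0.9545
# within 3 standard deviation(s): 0.9973
```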

61
New cards

entity integrity

the unique identifier of a record as defined using a primary key field

62
New cards

ETL (Extract, Transform, Load)

the process that occurs when moving data from source systems to data warehouses by extracting data from the source, transforming the data, and then loading it to the warehouse

63
New cards

exploratory analysis

analysis that determines the main characteristics of a data set

64
New cards

Extensible Markup Language (XML)

a system for structuring documents so that they are human- and machine-readable; information within the document is placed within tags, which describe how information within the document is structured

65
New cards

fact table

a table that holds the "facts" about a particular business process or event and contains keys to relate to the other tables

66
New cards

field definitions

descriptive information about what each field contains, intended to clarify field names that may be ambiguous

67
New cards

flat files

delimited files that are exported out of a system

68
New cards

full load

the method of loading all data into a data system for the very first time

69
New cards

gap analysis

the study of a present state, desired state, and the gaps that exist between the two

70
New cards

goodness of fit

a chi-square test applied to a single variable to analyze how well the actual values match the expected values

71
New cards

hard-coded filters

filters that are coded into the view or the visual

72
New cards

HyperText Markup Language (HTML)

a system of coded tags that identify the structure of the document files used for web pages

73
New cards

imputing

replacing missing data with an estimated value
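
One simple form of imputing is to fill missing entries with the mean of the known values. The sketch below assumes missing values are marked with `None`; the function name and readings are invented, and the median or a model-based estimate could be used instead.

```python
from statistics import mean

def impute_with_mean(values):
    """Replace None entries with the mean of the values that are present."""
    known = [v for v in values if v is not None]
    fill = mean(known)
    return [fill if v is None else v for v in values]

readings = [10, None, 14, 12, None]   # hypothetical data with gaps
imputed = impute_with_mean(readings)
print(imputed)  # [10, 12, 14, 12, 12]
```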

74
New cards

independent variable

the variable that is different between two groups that we are comparing

75
New cards

index field

a field that applies a unique number to a record

76
New cards

indexing

a field property setting that tells the database that a field needs to be indexed

77
New cards

infographic

any combination of visuals, artwork, photos, and language that tells the story of your data in a compelling and graphically appealing way

78
New cards

inline append

an append query that combines data sets until all are combined

79
New cards

intellectual property (IP)

intangible products of human thought and ingenuity

80
New cards

interactive filters

filters that allow the consumer to adjust a slicer or filter option on a dashboard to narrow down the data they want to see

81
New cards

intermediate append

an append query that creates a combined data set but also retains the separate data sets

82
New cards

JavaScript

an object-oriented, event-driven programming language that allows us to interact with websites

83
New cards

key performance indicators (KPIs)

measurements/goals that are established to help identify whether a business is achieving its objectives

84
New cards

key value pair

a type of non-relational structure that establishes a unique identifier or key field and maps it to a value

85
New cards

legend

a labeling element that lets you know which color represents which value in a visual

86
New cards

lifecycle of data

the five stages of the life of data: create, store, use, archive, and destroy

87
New cards

line graph

a graph that consists of either a single horizontal line or a group of multiple lines that represent different data points at different times; also known as a run chart

88
New cards

link analysis

analysis that helps us determine how a single data point links to other data points

89
New cards

logical data model

a more detailed view of the conceptual model that includes data fields and the relationships between them

90
New cards

logical functions

functions that check if a condition is met and return a result based on whether or not the condition is met

91
New cards

masking

the act of hiding the original value of data by showing something else in its place; also known as anonymization

92
New cards

master data management

tools and processes that are used to create the single source of truth or the "golden record" for the data that is considered critical at the organization

93
New cards

mean

the average of a set of numbers, calculated by adding all the values and then dividing that sum by the total number of values: $\frac{\text{Sum of Values}}{\text{Total Number of Values}}$.

94
New cards

measures of central tendency

mathematical functions used to find the center of a data set, including the mean, median, and mode

95
New cards

measures of dispersion

mathematical functions used to determine the distribution of a data set, also known as measures of variability (e.g., standard deviation, variance, range)

96
New cards

median

the middle number within a group of sorted numbers

97
New cards

memorandum of understanding (MOU)

an acceptable use agreement that establishes the rules of engagement between two parties and defines roles and expectations

98
New cards

merge fields function

a function used to combine different fields to create and display a single consolidated field; also known as the CONCATENATE function

99
New cards

mockup

to draw out a potential layout

100
New cards

mode

the number that shows up most often in a data set
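
The three measures of central tendency defined in this set (mean, median, and mode) can be compared on one small data set using Python's built-in statistics module. The scores are made up for the example.

```python
from statistics import mean, median, mode

scores = [70, 85, 85, 90, 100]   # hypothetical, already sorted for readability

print(mean(scores))    # 86  (430 divided by 5 values)
print(median(scores))  # 85  (the middle value of the sorted list)
print(mode(scores))    # 85  (the value that appears most often)
```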