Chapter 8: Data Management

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/178

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 8:47 PM on 1/14/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

179 Terms

1
New cards

analytics process model

provides a statistical analysis using a set of processes to solve system problems and find a new market opportunity

2
New cards

archival backup

storing the data in different servers/sites

3
New cards

asynchronous call

client sends a request without waiting for the response

4
New cards

atomicity, consistency, isolation, durability (ACID)

properties that impose a number of constraints to ensure that stored data are reliable and accurate

5
New cards

attribute

column header

6
New cards

blockchain DBMS

database that stores data as a block data structure and each block is connected to other blocks by providing cryptographic security and immutability

7
New cards

business intelligence (BI)

set of activities, techniques, and tools aimed at understanding patterns in past data to predict the future

8
New cards

CAP theorem

states that a distributed computer system cannot guarantee consistency, availability, and partition tolerance at the same time

9
New cards

centralized DBMS architecture

data are maintained on a centralized server at a single location

10
New cards

change data capture (CDC)

technology that detects any data update event and keeps track of versions

11
New cards

cloud DBMS architecture

DBMS and database are hosted by a third-party cloud provider

12
New cards

cluster

single computer that can dramatically improve the performance, availability, and scalability over a single, more powerful machine and at a lower cost by using commodity hardware

13
New cards

cognitive computing

technology tries to simulate human’s way in solving problems

14
New cards

column-oriented database

database that stores data in column families or tables and is built to manage petabytes of data across a massive, distributed system

15
New cards

computer scientist

person who has theoretical and practical knowledge of computer science.

16
New cards

concurrency control

coordination of transactions that execute simultaneously on the same data so that they do not cause inconsistencies because of mutual interference

17
New cards

connection manager

manages reports, books, objects, and batches

18
New cards

data

information and facts that are stored digitally by a computer

19
New cards

data accuracy

whether the data values stored for an object are the correct values and are often correlated with other DQ dimensions

20
New cards

Data as a Service (DaaS)

data management technique that uses the cloud to store, process, and manage data

21
New cards

data completeness

degree to which all data in a specific dataset are available with a minimum percentage of missing data

22
New cards

data compliance

process that ensures that data practices align with external legal requirements and industry standards

23
New cards

data consistency

keeping data consistent as it moves between various parts of the system

24
New cards

data consolidation

use of ETL to capture data from multiple sources and integrate it into a single store such as a data warehouse

25
New cards

data control language (DCL)

language used to control access to data stored in a database

26
New cards

data description language (DDL)

language used to create and modify the object structure in a database

27
New cards

data description language (DDL) compiler

translates statements in a high-level language into low-level instructions that the query evaluation engine understands

28
New cards

data dictionary

set of information describing the contents, format, and structure of a database

29
New cards

data federation

use of enterprise information integration (EII) to provide a unified view over data sources

30
New cards

data governance

set of clear roles, policies, and responsibilities that enables the enterprise to manage and safeguard data quality

31
New cards

data integration

providing a consistent view of all organization data

32
New cards

data lake

large data repository that stores raw data and can be set up without having to first define the data structure and schema

33
New cards

data management

study of managing data effectively

34
New cards

data manipulation language (DML)

language used to manipulate and edit data in a database

35
New cards

data mart

scaled-down version of a data warehouse aimed at meeting the information needs of a homogeneous small group of end users

36
New cards

data model

abstract model that contains a set of concepts to describe the structure of a database, the operations for manipulating these structures, and certain constraints that the database should obey

37
New cards

data owner

person with the authority to ultimately decide on the access to, and usage of, the data

38
New cards

data propagation

use of enterprise application integration (EAI) corresponding to the synchronous or asynchronous propagation of updates in a source system to a target system

39
New cards

data quality (DQ)

measure of how well the data represents its purpose or fitness for use

40
New cards

data quality dimension

includes accuracy, completeness, consistency, and accessibility

41
New cards

data query language (DQL)

language used to make various queries in a database

42
New cards

data redundancy

happens when the same piece of data is held in two separate places in the database

43
New cards

data replication

storing data in more than one site to improve the data availability and retrieval performance

44
New cards

data scientist

person who has theoretical and practical knowledge of managing data

45
New cards

data security

pertains to guaranteeing data integrity, guaranteeing data availability, authentication, access control, guaranteeing confidentiality, auditing, mitigating, and vulnerabilities

46
New cards

data steward

person who ensures that the enterprise's actual business data and the metadata are accurate, accessible, secure, and safe

47
New cards

data swamp

data stored without organization to make retrieval easy

48
New cards

data virtualization

technique that hides the physical location of data and uses data integration patterns to produce a unified data view

49
New cards

data warehouse

centralizes an enterprise’s data from its databases; it supports the flow of data from operational systems to analytics/decision systems by creating a single repository of data from various sources both internal and external

50
New cards

database administrator (DBA)

person responsible for the implementation and monitoring of a database and ensuring databases run efficiently

51
New cards

database application

program or piece of software designed to collect, store, access, retrieve, and manage information efficiently and securely

52
New cards

database architecture

representation of the design that helps design, develop, implement, and maintain the DBMS

53
New cards

database designer

person responsible for creating, implementing, and maintaining the database management system

54
New cards

database language

used to write instructions to access and update data in the database

55
New cards

database management system (DBMS)

approach where metadata are stored in a catalog

56
New cards

database normalization

process of structuring a relational database to reduce data redundancy and improve data integrity

57
New cards

database recovery

activity of setting the database in a consistent state without any data loss in the event of a failure or when any problem occurs

58
New cards

database security

using a set of controls to secure data, guaranteeing a high level of confidentiality

59
New cards

database transaction

sequence of read/write operations considered to be an atomic unit

60
New cards

database user

person with the privileges to access, analyze, update, and maintain the data

61
New cards

DBMS interface

main line of communication between the database and the user

62
New cards

DBMS utility

utility for managing and controlling database activities such as loading utility, reorganization utility, performance-monitoring utilities, user management utilities, backup and recovery utility

63
New cards

deep learning network

machine learning method based on artificial neural networks

64
New cards

denormalizing

process of merging several normalized data tables into an aggregated, denormalized data table

65
New cards

descriptive analytics

patterns of customer behavior

66
New cards

disk storage

memory device that stores the data such as hard disks, flash memory, magnetic disks, optical disks, and tapes

67
New cards

distributed transaction

set of operations that are performed across multiple database systems

68
New cards

domain constraint

defines the domain of values for an attribute

69
New cards

enterprise search

process of making content stemming from databases by offering tools that can be used within the enterprise

70
New cards

entity integrity constraint

specifies that no primary key contains a null

71
New cards

equi-join

join that combines tables based on matching values in specified columns

72
New cards

exploratory analysis

process of summarizing and visualizing data for initial insight

73
New cards

extraction, transformation, and loading (ETL)

data integration that combines data from multiple sources, fixes the data format, and loads the data into a data warehouse

74
New cards

fact constellation

more than one fact table connected to other smaller dimension tables

75
New cards

fat client variant

where presentation logic and application logic are handled by the client; common in cases where it makes sense to couple an application’s workflow

76
New cards

federated DBMS

provides a uniform interface to multiple underlying data sources

77
New cards

flat file database

database that uses a simple structure to store data in a text file; each line in the file holds one record

78
New cards

full-text search

selection of individual text documents from a collection of documents according to the presence of a single or a combination of search terms in the document

79
New cards

functional dependency (FD)

constraint that specifies the relationship between two sets of attributes and provides a formal tool for the analysis of relational schemas

80
New cards

garbage in, garbage out (GIGO)

quality of output is determined by the quality of the input

81
New cards

graph-based database

database that represents data as a network of related nodes or objects to facilitate data visualizations and graph analytics

82
New cards

Hadoop

distributed data infrastructures that leverage clusters to store and process massive amounts of data

83
New cards

heuristics optimization

mathematical technique for processing a query quickly

84
New cards

hierarchical DBMS

data model in which the data are organized into a treelike model, DML is procedural and record-oriented, the query processor is logical, and internal data models are intertwined

85
New cards

hierarchical model

model in which data are stored in the form of records and organized into a tree structure

86
New cards

horizontal fragmentation (sharding)

rows that satisfy a query predicate, global view with UNION query, and common in NoSQL databases

87
New cards

immediate backup

storing the copies in disks

88
New cards

in-memory DBMS

stores all data in internal memory instead of slower external storage

89
New cards

indexed organization

uses a key, similar to relative organization, but the key is unique and fixed

90
New cards

informatics

study, design, and development of information technology for the good of people, organizations, and society

91
New cards

information architect

(also, data architect or information analyst) a person responsible for designing the conceptual data model (blueprints) to bridge the gap between the business processes and the IT environment

92
New cards

information retrieval

searching for information in documents using retrieval models that specify matching functions and query representation

93
New cards

inner join

represents the intersection of two tables

94
New cards

key constraint

specifies that all the values of the primary key must be unique

95
New cards

key-value store

simple database that uses an associative array such as Redis, DynamoDB, and Cosmos DB

96
New cards

logical data independence

separates any changes in the data from the data format

97
New cards

logical design

designing a database based on a specific data model but independent of physical details

98
New cards

macro life cycle

includes feasibility analysis, requirements collection and analysis, design, implementation, and validation and acceptance testing

99
New cards

MapReduce

open-source software framework used to apply complex queries

100
New cards

master data management (MDM)

series of processes, policies, standards, and tools to help organizations define and provide a single point of reference for all data that are mastered