analytics process model
provides a statistical analysis using a set of processes to solve system problems and find a new market opportunity
archival backup
storing the data in different servers/sites
asynchronous call
client sends a request without waiting for the response
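A minimal sketch of the definition above, using Python's standard `asyncio` module. The `fetch_report` coroutine and its delay are hypothetical stand-ins for a slow server; the point is only that the client continues working after sending the request and collects the response later.

```python
import asyncio

async def fetch_report(delay: float) -> str:
    # Hypothetical slow server call, simulated with a sleep.
    await asyncio.sleep(delay)
    return "report ready"

async def client() -> list:
    events = []
    # Send the request; create_task returns immediately without waiting.
    pending = asyncio.create_task(fetch_report(0.01))
    events.append("request sent")
    events.append("client keeps working")
    # Collect the response only when it is actually needed.
    events.append(await pending)
    return events

events = asyncio.run(client())
```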
atomicity, consistency, isolation, durability (ACID)
properties that impose a number of constraints to ensure that stored data are reliable and accurate
attribute
column header
blockchain DBMS
database that stores data in blocks, with each block cryptographically linked to other blocks to provide security and immutability
business intelligence (BI)
set of activities, techniques, and tools aimed at understanding patterns in past data to predict the future
CAP theorem
states that a distributed computer system cannot guarantee consistency, availability, and partition tolerance at the same time
centralized DBMS architecture
data are maintained on a centralized server at a single location
change data capture (CDC)
technology that detects any data update event and keeps track of versions
cloud DBMS architecture
DBMS and database are hosted by a third-party cloud provider
cluster
group of connected computers that work together as a single system and can dramatically improve performance, availability, and scalability over a single, more powerful machine, at a lower cost, by using commodity hardware
cognitive computing
technology that tries to simulate the way humans solve problems
column-oriented database
database that stores data in column families or tables and is built to manage petabytes of data across a massive, distributed system
computer scientist
person who has theoretical and practical knowledge of computer science
concurrency control
coordination of transactions that execute simultaneously on the same data so that they do not cause inconsistencies because of mutual interference
connection manager
manages reports, books, objects, and batches
data
information and facts that are stored digitally by a computer
data accuracy
whether the data values stored for an object are the correct values; often correlated with other DQ dimensions
Data as a Service (DaaS)
data management technique that uses the cloud to store, process, and manage data
data completeness
degree to which all data in a specific dataset are available with a minimum percentage of missing data
data compliance
process that ensures that data practices align with external legal requirements and industry standards
data consistency
keeping data consistent as it moves between various parts of the system
data consolidation
use of ETL to capture data from multiple sources and integrate it into a single store such as a data warehouse
data control language (DCL)
language used to control access to data stored in a database
data description language (DDL)
language used to create and modify the object structure in a database
data description language (DDL) compiler
translates statements in a high-level language into low-level instructions that the query evaluation engine understands
data dictionary
set of information describing the contents, format, and structure of a database
data federation
use of enterprise information integration (EII) to provide a unified view over data sources
data governance
set of clear roles, policies, and responsibilities that enables the enterprise to manage and safeguard data quality
data integration
providing a consistent view of all organization data
data lake
large data repository that stores raw data and can be set up without having to first define the data structure and schema
data management
study of managing data effectively
data manipulation language (DML)
language used to manipulate and edit data in a database
data mart
scaled-down version of a data warehouse aimed at meeting the information needs of a homogeneous small group of end users
data model
abstract model that contains a set of concepts to describe the structure of a database, the operations for manipulating these structures, and certain constraints that the database should obey
data owner
person with the authority to ultimately decide on the access to, and usage of, the data
data propagation
use of enterprise application integration (EAI) corresponding to the synchronous or asynchronous propagation of updates in a source system to a target system
data quality (DQ)
measure of how well the data represents its purpose or fitness for use
data quality dimension
includes accuracy, completeness, consistency, and accessibility
data query language (DQL)
language used to make various queries in a database
data redundancy
happens when the same piece of data is held in two separate places in the database
data replication
storing data in more than one site to improve the data availability and retrieval performance
data scientist
person who has theoretical and practical knowledge of managing data
data security
pertains to guaranteeing data integrity, guaranteeing data availability, authentication, access control, guaranteeing confidentiality, auditing, and mitigating vulnerabilities
data steward
person who ensures that the enterprise's actual business data and the metadata are accurate, accessible, secure, and safe
data swamp
data stored without the organization needed to make retrieval easy
data virtualization
technique that hides the physical location of data and uses data integration patterns to produce a unified data view
data warehouse
centralizes an enterprise’s data from its databases; it supports the flow of data from operational systems to analytics/decision systems by creating a single repository of data from various sources both internal and external
database administrator (DBA)
person responsible for the implementation and monitoring of a database and ensuring databases run efficiently
database application
program or piece of software designed to collect, store, access, retrieve, and manage information efficiently and securely
database architecture
representation of the design that helps design, develop, implement, and maintain the DBMS
database designer
person responsible for creating, implementing, and maintaining the database management system
database language
used to write instructions to access and update data in the database
database management system (DBMS)
software used to define, create, use, and maintain a database; in the database approach, its metadata are stored in a catalog
database normalization
process of structuring a relational database to reduce data redundancy and improve data integrity
database recovery
activity of setting the database in a consistent state without any data loss in the event of a failure or when any problem occurs
database security
using a set of controls to secure data, guaranteeing a high level of confidentiality
database transaction
sequence of read/write operations considered to be an atomic unit
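A minimal sketch of "atomic unit", using Python's built-in `sqlite3` module. The `account` table, the `CHECK` constraint, and the `transfer` helper are hypothetical illustrations: when the second write of a transfer fails, the first write is rolled back with it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst as one all-or-nothing unit."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The debit violated the CHECK constraint; the credit was undone too.
        return False

ok = transfer(conn, 1, 2, 30)       # both writes succeed
failed = transfer(conn, 1, 2, 500)  # debit fails, whole transfer is undone
balances = dict(conn.execute("SELECT id, balance FROM account"))
```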
database user
person with the privileges to access, analyze, update, and maintain the data
DBMS interface
main line of communication between the database and the user
DBMS utility
utility for managing and controlling database activities such as loading utility, reorganization utility, performance-monitoring utilities, user management utilities, backup and recovery utility
deep learning network
machine learning method based on artificial neural networks
denormalizing
process of merging several normalized data tables into an aggregated, denormalized data table
descriptive analytics
analytics that summarizes past data to describe what happened, such as patterns of customer behavior
disk storage
memory device that stores the data such as hard disks, flash memory, magnetic disks, optical disks, and tapes
distributed transaction
set of operations that are performed across multiple database systems
domain constraint
defines the domain of values for an attribute
enterprise search
process of making content stemming from databases searchable by offering tools that can be used within the enterprise
entity integrity constraint
specifies that no primary key contains a null
equi-join
join that combines tables based on matching values in specified columns
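As an illustration of an equi-join, the sketch below joins two hypothetical tables, `customer` and `purchase`, on equal `customer_id` values using Python's built-in `sqlite3` module. The table and column names are assumptions made for this example only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE purchase (purchase_id INTEGER PRIMARY KEY,
                       customer_id INTEGER, item TEXT);
INSERT INTO customer VALUES (1, 'Ada'), (2, 'Grace'), (3, 'Edgar');
INSERT INTO purchase VALUES (10, 1, 'disk'), (11, 2, 'tape'), (12, 1, 'cable');
""")

# The join condition is an equality test on the shared column.
rows = conn.execute("""
    SELECT c.name, p.item
    FROM customer AS c
    JOIN purchase AS p ON c.customer_id = p.customer_id
    ORDER BY p.purchase_id
""").fetchall()
```

Edgar has no matching purchase, so he does not appear in `rows`; only rows with matching values in both tables are combined.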
exploratory analysis
process of summarizing and visualizing data for initial insight
extraction, transformation, and loading (ETL)
data integration that combines data from multiple sources, fixes the data format, and loads the data into a data warehouse
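The three ETL steps can be sketched in a few lines of Python. Everything here is a hypothetical illustration: the two in-memory source lists, the `to_iso` helper, and the `dim_customer` table stand in for real source systems and a real data warehouse.

```python
import sqlite3
from datetime import datetime

# Extract: two hypothetical sources that format dates differently.
crm_rows = [{"customer": "Ada", "signup": "2024-01-15"}]
shop_rows = [{"customer": "Grace", "signup": "15/02/2024"}]

def to_iso(text):
    """Transform: normalize either source date format to ISO 8601."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {text!r}")

def etl(sources, warehouse):
    """Load: write the cleaned rows into a single warehouse table."""
    warehouse.execute(
        "CREATE TABLE IF NOT EXISTS dim_customer (customer TEXT, signup TEXT)"
    )
    for rows in sources:
        cleaned = [(r["customer"], to_iso(r["signup"])) for r in rows]
        warehouse.executemany("INSERT INTO dim_customer VALUES (?, ?)", cleaned)
    warehouse.commit()

warehouse = sqlite3.connect(":memory:")
etl([crm_rows, shop_rows], warehouse)
loaded = warehouse.execute(
    "SELECT customer, signup FROM dim_customer ORDER BY customer"
).fetchall()
```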
fact constellation
more than one fact table connected to other smaller dimension tables
fat client variant
where presentation logic and application logic are handled by the client; common in cases where it makes sense to couple an application’s workflow
federated DBMS
provides a uniform interface to multiple underlying data sources
flat file database
database that uses a simple structure to store data in a text file; each line in the file holds one record
full-text search
selection of individual text documents from a collection of documents according to the presence of a single or a combination of search terms in the document
functional dependency (FD)
constraint that specifies the relationship between two sets of attributes and provides a formal tool for the analysis of relational schemas
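A small sketch of what a functional dependency X -> Y means in practice: rows that agree on X must also agree on Y. The `fd_holds` helper and the sample `employees` rows are hypothetical. Note that checking sample rows can only refute a dependency; it cannot prove that the dependency holds for the schema in general.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if lhs -> rhs is not violated by the given rows."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        value = tuple(row[a] for a in rhs)
        # The first time a key appears, remember its value; afterwards,
        # any different value for the same key violates the dependency.
        if seen.setdefault(key, value) != value:
            return False
    return True

employees = [
    {"emp_id": 1, "dept": "Sales", "dept_phone": "555-0100"},
    {"emp_id": 2, "dept": "Sales", "dept_phone": "555-0100"},
    {"emp_id": 3, "dept": "IT", "dept_phone": "555-0199"},
]

dept_determines_phone = fd_holds(employees, ["dept"], ["dept_phone"])
dept_determines_emp = fd_holds(employees, ["dept"], ["emp_id"])
```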
garbage in, garbage out (GIGO)
quality of output is determined by the quality of the input
graph-based database
database that represents data as a network of related nodes or objects to facilitate data visualizations and graph analytics
Hadoop
distributed data infrastructures that leverage clusters to store and process massive amounts of data
heuristics optimization
mathematical technique for processing a query quickly
hierarchical DBMS
DBMS based on a data model in which the data are organized into a treelike model; the DML is procedural and record-oriented, there is no query processor, and the logical and internal data models are intertwined
hierarchical model
model in which data are stored in the form of records and organized into a tree structure
horizontal fragmentation (sharding)
splitting a table into fragments, each holding the rows that satisfy a query predicate; the global view is reconstructed with a UNION query; common in NoSQL databases
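To illustrate the definition above, the sketch below keeps two fragments of a hypothetical customer table, split by the predicate on `region`, and rebuilds the global view with a UNION query. It uses Python's built-in `sqlite3` module in a single in-memory database; in a real deployment each fragment would live on a different node.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Fragment 1: rows satisfying region = 'EU'
CREATE TABLE customer_eu (id INTEGER PRIMARY KEY, name TEXT, region TEXT);
-- Fragment 2: rows satisfying region = 'US'
CREATE TABLE customer_us (id INTEGER PRIMARY KEY, name TEXT, region TEXT);
INSERT INTO customer_eu VALUES (1, 'Ada', 'EU'), (3, 'Edgar', 'EU');
INSERT INTO customer_us VALUES (2, 'Grace', 'US');

-- Global view: the union of all fragments.
CREATE VIEW customer AS
    SELECT * FROM customer_eu
    UNION
    SELECT * FROM customer_us;
""")

all_rows = conn.execute(
    "SELECT id, name, region FROM customer ORDER BY id"
).fetchall()
```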
immediate backup
storing the copies on disks
in-memory DBMS
stores all data in internal memory instead of slower external storage
indexed organization
uses a key, similar to relative organization, but the key is unique and fixed
informatics
study, design, and development of information technology for the good of people, organizations, and society
information architect
(also, data architect or information analyst) a person responsible for designing the conceptual data model (blueprints) to bridge the gap between the business processes and the IT environment
information retrieval
searching for information in documents using retrieval models that specify matching functions and query representation
inner join
join that returns only the rows with matching values in both tables, representing the intersection of the two tables
key constraint
specifies that all the values of the primary key must be unique
key-value store
simple database that stores data as an associative array of key-value pairs; examples include Redis, DynamoDB, and Cosmos DB
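The associative-array idea can be sketched with a Python dictionary. The `KeyValueStore` class below is a hypothetical toy, not the API of Redis, DynamoDB, or Cosmos DB; it only shows the basic put, get, and delete operations keyed by a unique key.

```python
class KeyValueStore:
    """Toy in-memory key-value store backed by a dictionary."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # Writing to an existing key overwrites the previous value.
        self._data[key] = value

    def get(self, key, default=None):
        # Lookups are by key only; there is no query over the values.
        return self._data.get(key, default)

    def delete(self, key):
        # Return True if the key existed and was removed.
        return self._data.pop(key, None) is not None

store = KeyValueStore()
store.put("user:1", {"name": "Ada"})
store.put("user:1", {"name": "Ada Lovelace"})
current = store.get("user:1")
removed = store.delete("user:1")
missing = store.get("user:1")
```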
logical data independence
separates any changes in the data from the data format
logical design
designing a database based on a specific data model but independent of physical details
macro life cycle
includes feasibility analysis, requirements collection and analysis, design, implementation, and validation and acceptance testing
MapReduce
open-source software framework used to apply complex queries
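As a rough illustration of the programming model behind MapReduce, the sketch below counts words in plain Python on a single machine. The function names (`map_phase`, `shuffle`, `reduce_phase`) and the sample documents are assumptions for this example; a real framework would run the map and reduce steps in parallel across a cluster.

```python
from collections import defaultdict

def map_phase(document):
    # Map: emit a (key, value) pair for every word in one document.
    return [(word.lower(), 1) for word in document.split()]

def shuffle(pairs):
    # Shuffle: group all emitted values by key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    # Reduce: combine the values for one key into a single result.
    return key, sum(values)

documents = ["data lake data mart", "data warehouse"]
mapped = [pair for doc in documents for pair in map_phase(doc)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
```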
master data management (MDM)
series of processes, policies, standards, and tools to help organizations define and provide a single point of reference for all data that are mastered