Introduction to Data Management Tools

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/27

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering data management tools, big data characteristics, database structures, and analytical software mentioned in the lecture notes.

Last updated 8:18 PM on 6/7/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

28 Terms

1
New cards

Data

Raw, unorganized facts that lack meaning until they are processed, organized, and structured with context.

2
New cards

Information

Data that has been given context and organized so that it has meaning, such as the average training score of an entire department.

3
New cards

Data Management (DM)

The practices, architectural techniques, and tools used to achieve consistent access to and delivery of data across all subject areas and structure types in an enterprise.

4
New cards

Data Governance

A concern with data policies, procedures, access control, backup and recovery, and classification standards to keep business-critical data safe and confidential.

5
New cards

Data Discovery (Data Mining)

The examination of huge sets of data to find patterns, connections, outliers, and hidden relationships to assist in making informed business decisions.

6
New cards

Business Intelligence (BI)

A broad range of tools and practices used to extract, analyze, and report information to enable better strategic decision-making and predictions.

7
New cards

Data Lakes

Storage systems that hold large amounts of unstructured data in their raw form to allow for flexible analysis.

8
New cards

Machine Learning Algorithms

Tools that identify patterns and anomalies in data sets to make predictions based on that data.

9
New cards

Database Management Systems (DBMS)

Software systems used to create and manage databases where data are stored in computer files called tables.

10
New cards

Tables

The most important part of a database, consisting of records (rows) and fields (columns) used to store and query data.

11
New cards

Primary Key

A field in a database table that uniquely identifies a specific record, such as a unique student ID number.

12
New cards

Foreign Key

A field in a database table that provides a link between two different tables in a relational database.

13
New cards

Schema

The organization, layout, reference, or blueprint of a database that defines tables, fields, constraints, keys, and integrity.

14
New cards

Big Data

Large and expansive collected data sets that are often disparate and not in neat, tidy database tables.

15
New cards

Volume

One of the 4 Vs; the characteristic of big data referring to the enormous amount of digital data that grows exponentially.

16
New cards

Variety

One of the 4 Vs; indicates that data comes from both structured areas and fragmented, unstructured sources.

17
New cards

Veracity

One of the 4 Vs; refers to the quality and trustworthiness of data, requiring scrubbing to remove discrepancies.

18
New cards

Velocity

One of the 4 Vs; the accelerating speed at which data is produced over a given time period.

19
New cards

Structured Data

Data that resides in fixed formats, is typically well-labeled with traditional fields and records, and is easily searchable.

20
New cards

Unstructured Data

Unorganized data that cannot be easily read by a computer because it is not in rows and columns; it accounts for 80%80\% of all data.

21
New cards

Semi-structured Data

Data that contains both consistent structured elements and unstructured components, such as email and HTML files.

22
New cards

Data Warehouse

A digital storehouse used to consolidate disparate data in a central location, potentially holding yottabytes of data.

23
New cards

Yottabyte

A unit of data storage equal to one trillion (101210^{12}) terabytes.

24
New cards

Data Mart

A smaller data set designed to support the specific needs of a single department, such as sales or human resources.

25
New cards

ETL (Extract, Transform, and Load)

A set of tools used to standardize data across systems by pulling it from sources, cleaning it, and moving it into a warehouse.

26
New cards

Hadoop

An infrastructure for storing and processing large, unstructured, or semi-structured data sets across a distributed file system of multiple servers.

27
New cards

Structured Query Language (SQL)

The most widely used standard computer language for relational databases, allowing programmers to manipulate and query data.

28
New cards

Tableau

Software that produces interactive data visualizations and dashboards to simplify raw data into business insights.