Introduction to Data Management Tools

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/27

flashcard set

Earn XP

Description and Tags

A comprehensive set of vocabulary flashcards covering the core concepts of data management, big data, storage structures, and analytical tools as presented in the lecture transcript.

Last updated 8:13 PM on 6/7/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

28 Terms

1
New cards

Data Management (Gartner Group)

The practices, architectural techniques, and tools for achieving consistent access to and delivery of data across the spectrum of data subject areas and data structure types in the enterprise.

2
New cards

Data

Raw, unorganized facts that might not mean much until they are processed, organized, and structured into a context.

3
New cards

Information

Data that has context and meaning after being processed and organized, such as the average training score of an entire department.

4
New cards

Database Management Systems (DBMS)

Software systems used to create and manage databases where data are stored in computer files called tables.

5
New cards

Tables

The most important part of a database where data is held; they consist of records (rows) and fields (columns).

6
New cards

Records

The rows in a database table.

7
New cards

Fields

The columns in a database table.

8
New cards

Primary Key

A field in a database table that uniquely identifies a record in that table, such as a unique student ID number.

9
New cards

Foreign Key

A field in a database table that provides a link between two tables in a relational database.

10
New cards

Schema

The organization or layout of a database that defines tables, fields, constraints, keys, and integrity; it serves as the database blueprint.

11
New cards

Big Data

Large and expansive collected data sets from disparate sources including smartphone metadata, internet usage records, and social media activity.

12
New cards

Volume

One of the 4 Vs of big data referring to the sheer amount of data that requires significant resources to hold and manage.

13
New cards

Variety

One of the 4 Vs of big data referring to data coming from both structured and unstructured areas and in various forms.

14
New cards

Veracity

One of the 4 Vs of big data referring to the quality and trustworthiness of the data, and whether it represents what is believed.

15
New cards

Velocity

One of the 4 Vs of big data referring to the accelerating speed of data production over a given time period.

16
New cards

Data Mining

The examination of huge sets of data to find patterns, connections, outliers, and hidden relationships to make informed business decisions; also called data discovery.

17
New cards

Structured Data

Data residing in fixed formats that are typically well-labeled with traditional fields and records, making them easily searchable and queryable.

18
New cards

Unstructured Data

Unorganized data that cannot be easily read by a computer because it is not in rows and columns, such as video, audio, and social media posts; it accounts for 80%80\% of all data.

19
New cards

Semi-structured Data

Data that falls between structured and unstructured, containing both consistent structured elements (like tags or headers) and unstructured content, such as email and HTML.

20
New cards

Data Warehouse

A digital location used to consolidate disparate data in a central location for enterprise-wide needs, often holding yottabytesyottabytes of data.

21
New cards

Data Mart

A smaller data storage system designed to support the specific needs of a single department, such as sales or human resources.

22
New cards

Yottabytes

A unit of data storage equivalent to 1trillion1\,trillion terabytes; each terabyte is 1,000gigabytes1,000\,gigabytes.

23
New cards

ETL

An acronym for extract, transform, and load; it describes tools used to standardize data across systems and prepare them for querying in a warehouse or mart.

24
New cards

Hadoop

An infrastructure for storing and processing large sets of data across multiple servers using a distributed file system, designed for unstructured and semi-structured data.

25
New cards

Structured Query Language (SQL)

The most widely used standard computer language for relational databases, allowing programmers to manipulate and query data.

26
New cards

Tableau

A business analytics platform that produces interactive data visualizations, such as graphs and charts, to simplify raw data into information.

27
New cards

Business Intelligence (BI)

A broad range of tools and practices used to extract, analyze, and report information to assist in making critical strategic business decisions and predictions.

28
New cards

Data Governance

The management of data policies, procedures, access control, backup and recovery, and classification standards to ensure data remains confidential and safe.