Introduction to Data Management Tools

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/32

flashcard set

Earn XP

Description and Tags

Flashcards covering the fundamentals of data management, database structures, big data attributes, and various analytical tools like SQL, ETL, and Hadoop.

Last updated 8:59 PM on 6/7/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

33 Terms

1
New cards

Data Management (DM)

According to Gartner Group, it consists of the practices, architectural techniques, and tools for achieving consistent access to and delivery of data across the spectrum of data subject areas and data structure types in the enterprise.

2
New cards

Data

Raw, unorganized facts that need context and processing to become useful.

3
New cards

Information

Data that has been processed, organized, and structured into a context that gives it meaning.

4
New cards

Database Management Systems (DBMS)

Software systems used to create and manage databases where data are stored in computer files called tables.

5
New cards

Tables

Storehouses of data within a database consisting of records (rows) and fields (columns).

6
New cards

Record

A row within a database table.

7
New cards

Field

A column within a database table.

8
New cards

Primary key

A field in a database table that uniquely identifies a record in the table, such as a unique student ID number.

9
New cards

Foreign key

A field in a database table that provides a link between two tables in a relational database.

10
New cards

Schema

The organization or layout of a database that defines the tables, fields, constraints, and keys; it serves as the blueprint of the database.

11
New cards

Big Data

Large and expansive collected data sets from various sources like smartphone metadata, social media, and internet usage records that are sifted for patterns and trends.

12
New cards

Volume

One of the 4 Vs of big data referring to the main characteristic of the sheer mass of digital data that requires significant resources to manage.

13
New cards

Variety

One of the 4 Vs of big data referring to the fact that data comes in both structured and unstructured forms from various fragmented sources.

14
New cards

Veracity

One of the 4 Vs of big data referring to the quality and trustworthiness of the data, and whether it represents what it is believed to show.

15
New cards

Velocity

One of the 4 Vs of big data referring to the accelerating speed at which data is produced over a given time period.

16
New cards

Data mining (Data discovery)

The examination of huge sets of data to find patterns, connections, outliers, and hidden relationships to help businesses make informed decisions.

17
New cards

Structured data

Data that resides in fixed formats, is typically well-labeled with traditional fields and records, and fits easily into relational databases.

18
New cards

Unstructured data

Unorganized data that cannot be easily read by a computer because it is not stored in traditional rows and columns; it accounts for 80%80\% of all data.

19
New cards

Semi-structured data

Data that contains both structured elements and unstructured components, with common examples being email and HTML files.

20
New cards

Business Intelligence (BI)

A broad range of tools and practices that extract, analyze, and report information to assist in critical business strategic decision-making and pattern prediction.

21
New cards

Data lakes

Storage systems that hold large amounts of unstructured data in their raw form to allow for flexible analysis.

22
New cards

Structured Query Language (SQL)

The most widely used standard computer language for relational databases, used by programmers to manipulate and query data tables.

23
New cards

Data Warehouse

A digital location used to consolidate disparate data from across an entire enterprise; can hold up to yottabytes of data.

24
New cards

Data Mart

A smaller data set or storage system designed to support the specific needs of a single department, such as sales or human resources.

25
New cards

Terabyte

A unit of data storage equal to 1,000gigabytes1,000\,gigabytes.

26
New cards

Yottabyte

A unit of data storage equal to one trillion terabytes (1×1012terabytes1 \times 10^{12}\,terabytes).

27
New cards

ETL

An acronym for extract, transform, and load; describing tools used to standardize data across systems and move it into a central location.

28
New cards

Extract (ETL)

The first step in ETL where data is taken from its original source, such as CRM or ERP systems.

29
New cards

Transform (ETL)

The second step in ETL where extracted data is reformatted or cleaned (e.g., removing decimals) to fit into a structured database table.

30
New cards

Load (ETL)

The final step in ETL where transformed data is transferred into the data warehouse or data mart.

31
New cards

Hadoop

An infrastructure for storing and processing large, distributed sets of unstructured and semi-structured data across multiple servers.

32
New cards

Tableau

A business intelligence tool that produces interactive data visualizations, such as graphs and charts, to simplify raw data into information.

33
New cards

Dashboard

A tool used for the presentation of complex data in an easier-to-understand visual form, often providing an overall picture of business efficiency.