Data, Data Lakes, & Databases – IDS 200 CH1

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/21

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards summarizing key terms and definitions from the lecture on data, data lakes, and databases.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

22 Terms

1
New cards

Data

Raw facts generated by business events such as purchases, shipments, or employee actions.

2
New cards

Record

A single representation of an event or thing, composed of a set of fields.

3
New cards

Field

An individual attribute or piece of information stored within a record.

4
New cards

Information

Data that has been processed into a usable form, such as a single record, subset, or aggregate.

5
New cards

Information System

A system designed to store specific kinds of data and provide standard methods for access and use.

6
New cards

Operational Data

Data created and used to perform day-to-day business operations, typically stored for quick lookups.

7
New cards

Analytical Data

Regrouped operational data used to answer performance questions rather than run daily operations.

8
New cards

Structured Data

Data in which every record has the same predefined set of fields, e.g., customer accounts.

9
New cards

Unstructured Data

Data whose records do not share identical fields, e.g., webpages or email documents.

10
New cards

Semi-Structured Data

Data with consistent groups of fields but varying counts or formats within those groups (e.g., orders with variable line items).

11
New cards

Scale (Volume & Velocity)

The growth in amount and speed of data that requires automated processing, often reaching gigabytes, terabytes, or petabytes.

12
New cards

Aggregate

A combined measure—such as a sum, average, minimum, or maximum—derived from multiple records.

13
New cards

Data Lake

A repository holding all operational data in varied formats—structured and unstructured—for later processing.

14
New cards

ETL (Extract, Transform, Load)

The process of pulling data from a lake, converting it as needed, and loading it into end systems or databases.

15
New cards

Database

An application that lets users record, modify, delete, and query data, often fed by the data lake.

16
New cards

Table

A set of records of the same type stored together within a database.

17
New cards

Primary Key

A unique identifier assigned to each record in a table to distinguish it from others.

18
New cards

Relational Database

A database model that stores data in related tables allowing multi-table queries; best suited for structured data.

19
New cards

Relationship (Database)

A linkage between tables based on common fields (e.g., Customer.zip = ZipCode.zip) enabling combined queries.

20
New cards

Non-Relational Database (NoSQL)

A database model focused on single-table lookups, sacrificing some query flexibility for speed or storage efficiency.

21
New cards

Eventual Consistency

A design principle in many NoSQL systems where temporary data conflicts are tolerated to gain performance or capacity.

22
New cards

Database Design

The planning process that determines what data to collect, how to store and group it, and how users will access it.