Data, Databases & Big Data EZ

Description and Tags

A set of flashcards covering lecture material on data management, DBMS, Big Data characteristics, Data Warehousing, and Knowledge Management Systems.

Last updated 5:04 PM on 5/16/26

18 Terms

1

Why is managing data considered difficult for modern businesses?

Data management is difficult because data grows very fast, comes from many sources (internal, customers, internet), includes new types like blogs and sensors, can degrade over time, and must comply with government rules like GDPR.

2

What is clickstream data?

Clickstream data refers to the record of what users do on websites.

3

What is Data Governance?

Data Governance is the management of data across an entire company, including rules, policies, security, storage, and usage to ensure data is handled properly.

4

What is the goal of Master Data?

The goal of Master Data is to create one correct version of core data, such as customers, products, and employees, providing a "one version of truth."

5

According to the lecture, what is the hierarchy of data from smallest to largest units?

Bit → Byte → Field → Record → File → Database

6

What is a Database Management System (DBMS)?

A DBMS is software that manages databases by solving problems like data redundancy, data isolation, and data inconsistency while improving security, integrity, and independence.

7

In the relational database model, what are rows and columns called?

Rows are called records and columns are called attributes.

8

What is the difference between a Primary Key and a Foreign Key?

A Primary Key is a field (or set of fields) that uniquely identifies each row in a table, whereas a Foreign Key is a field in one table that refers to the Primary Key of another table, linking the two tables together.
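The key relationship can be illustrated with a minimal sketch using Python's built-in `sqlite3` module. The `customers` and `orders` tables, their columns, and the sample rows are hypothetical and not taken from the lecture; they only show one way the two kinds of key might work together.

```python
import sqlite3

# Hypothetical in-memory database; table and column names are assumptions.
conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.execute(
    "CREATE TABLE customers ("
    " customer_id INTEGER PRIMARY KEY,"  # primary key: unique per row
    " name TEXT NOT NULL)"
)
conn.execute(
    "CREATE TABLE orders ("
    " order_id INTEGER PRIMARY KEY,"
    # foreign key: must match an existing customers.customer_id
    " customer_id INTEGER NOT NULL REFERENCES customers(customer_id),"
    " total REAL NOT NULL)"
)

conn.execute("INSERT INTO customers VALUES (1, 'Ada')")
conn.execute("INSERT INTO orders VALUES (100, 1, 25.0)")


def try_insert_orphan_order(connection):
    """Return True if an order for a non-existent customer is rejected."""
    try:
        connection.execute("INSERT INTO orders VALUES (101, 999, 10.0)")
    except sqlite3.IntegrityError:
        return True
    return False


orphan_rejected = try_insert_orphan_order(conn)

# The foreign key lets the two tables be joined back together.
joined = conn.execute(
    "SELECT c.name, o.total FROM orders o "
    "JOIN customers c ON c.customer_id = o.customer_id"
).fetchall()
```

Here the order pointing at customer 999 is refused because no such primary key exists, while the valid order can be joined to its customer.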

9

What are the characteristics of Big Data known as the "3 V’s"?

Volume (huge size), Velocity (arrives very fast), and Variety (different types of data).

10

What tools are used to manage and process Big Data?

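Tools include NoSQL databases (which handle flexible, non-relational data), Hadoop (an open-source framework for distributed storage and processing), and MapReduce (a programming model for processing large data sets in parallel).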
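As a rough illustration of the MapReduce idea, the sketch below counts words in plain Python on a single machine. It is not Hadoop code; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`) and the sample documents are assumptions chosen for the example, and a real cluster would run the map and reduce steps on many nodes at once.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: combine the values for each key into a single total."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over all documents."""
    pairs = []
    for doc in documents:
        pairs.extend(map_phase(doc))
    return reduce_phase(shuffle_phase(pairs))


counts = word_count(["big data big value", "data arrives fast"])
```

Because each document is mapped independently, the map step could in principle be spread across machines, which is what makes the model suitable for very large data sets.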

11

What is the difference between a Data Warehouse and a Data Mart?

A Data Warehouse is a large storage system for historical company data used for strategic decision-making, while a Data Mart is a smaller, cheaper version designed for a specific department (e.g., Marketing).

12

What are the four key characteristics of a Data Warehouse?

1. Integrated (combines many systems); 2. Time Variant (stores historical data); 3. Non-Volatile (stable/not changed); 4. Multidimensional (analysis from different perspectives).

13

What does the acronym ETL represent in the data integration process?

Extract (take data from systems), Transform (clean and organise data), and Load (move data into the warehouse).
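The three ETL steps can be sketched as three small Python functions. Everything here is an assumption made for illustration: the `source_rows` list stands in for an operational system, the cleaning rules are invented, and an in-memory SQLite database plays the role of the warehouse.

```python
import sqlite3

# Hypothetical raw records, standing in for an operational source system.
source_rows = [
    {"customer": " Ada ", "amount": "25.50"},
    {"customer": "grace", "amount": "10"},
    {"customer": "", "amount": "7.00"},  # missing name: dropped in transform
]


def extract():
    """Extract: take a copy of the data from the source system."""
    return list(source_rows)


def transform(rows):
    """Transform: tidy names, convert amounts to numbers, drop bad rows."""
    cleaned = []
    for row in rows:
        name = row["customer"].strip().title()
        if not name:
            continue
        cleaned.append((name, float(row["amount"])))
    return cleaned


def load(rows, connection):
    """Load: write the cleaned rows into the warehouse table."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL)"
    )
    connection.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    connection.commit()


warehouse = sqlite3.connect(":memory:")
load(transform(extract()), warehouse)
loaded = warehouse.execute(
    "SELECT customer, amount FROM sales ORDER BY customer"
).fetchall()
```

Keeping the steps as separate functions mirrors the acronym and makes it easy to change one stage, for example the cleaning rules, without touching the others.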

14

What is OLAP and how is it used?

OLAP stands for Online Analytical Processing; it allows businesses to analyse trends, compare information, and create reports quickly using a multidimensional data cube.
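A multidimensional cube can be pictured as facts indexed by several dimensions. The sketch below is a simplified, assumed model in plain Python, not an OLAP product: the dimensions (`year`, `region`, `product`), the sales figures, and the helper names `roll_up` and `slice_cube` are all hypothetical.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, region, product, sales amount).
sales = [
    ("2024", "North", "Laptop", 1200),
    ("2024", "North", "Phone", 800),
    ("2024", "South", "Laptop", 1000),
    ("2025", "North", "Laptop", 1500),
    ("2025", "South", "Phone", 700),
]
DIMENSIONS = ("year", "region", "product")


def roll_up(facts, by):
    """Total the sales amount for each combination of the chosen dimensions."""
    indexes = [DIMENSIONS.index(d) for d in by]
    totals = defaultdict(int)
    for fact in facts:
        key = tuple(fact[i] for i in indexes)
        totals[key] += fact[-1]
    return dict(totals)


def slice_cube(facts, dimension, value):
    """Keep only the facts where one dimension has a fixed value."""
    i = DIMENSIONS.index(dimension)
    return [fact for fact in facts if fact[i] == value]


by_year = roll_up(sales, by=("year",))
laptops_by_region = roll_up(
    slice_cube(sales, "product", "Laptop"), by=("region",)
)
```

The same facts answer both "how did each year compare?" and "where do laptops sell?", which is the sense in which a cube supports analysis from different perspectives.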

15

What is the difference between Explicit Knowledge and Tacit Knowledge?

Explicit Knowledge is easy to record and share (e.g., manuals, reports), while Tacit Knowledge is personal experience and skills that are harder to record (e.g., leadership, problem-solving).

16

What are the six steps in the Knowledge Management System (KMS) cycle?

1. Create, 2. Capture, 3. Refine, 4. Store, 5. Manage, and 6. Disseminate.

17

What is SQL and what is its purpose?

SQL (Structured Query Language) is used to search, update, and manage data within a database.
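The three uses named above (search, update, manage) map onto common SQL statements. The sketch below runs them through Python's built-in `sqlite3` module; the `products` table and its rows are hypothetical examples.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Manage: define a table and add rows (hypothetical product data).
conn.execute(
    "CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT, price REAL)"
)
conn.executemany(
    "INSERT INTO products (name, price) VALUES (?, ?)",
    [("Pen", 1.5), ("Notebook", 4.0), ("Bag", 30.0)],
)

# Search: SELECT returns the rows that match a condition.
cheap = conn.execute(
    "SELECT name FROM products WHERE price < ? ORDER BY name", (5.0,)
).fetchall()

# Update: change a value in rows that already exist.
conn.execute("UPDATE products SET price = ? WHERE name = ?", (2.0, "Pen"))

# Manage: remove rows that are no longer needed.
conn.execute("DELETE FROM products WHERE name = ?", ("Bag",))

remaining = conn.execute(
    "SELECT name, price FROM products ORDER BY id"
).fetchall()
```

The `?` placeholders pass values separately from the SQL text, which is the usual way to avoid building queries by string concatenation.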

18

In an Entity Relationship Diagram (ERD), what are the three types of relationships based on cardinality?

One-to-One (1:1), One-to-Many (1:M), and Many-to-Many (M:M).
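In a relational database, a Many-to-Many relationship is commonly implemented with a third "junction" table that breaks it into two One-to-Many relationships. The sketch below assumes hypothetical `students`, `courses`, and `enrollments` tables to show the idea; it is one possible design, not something prescribed by the lecture.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE students (student_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE courses (course_id INTEGER PRIMARY KEY, title TEXT);
    -- Junction table: each row pairs one student with one course.
    CREATE TABLE enrollments (
        student_id INTEGER REFERENCES students(student_id),
        course_id INTEGER REFERENCES courses(course_id),
        PRIMARY KEY (student_id, course_id)
    );
    INSERT INTO students VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO courses VALUES (10, 'Databases'), (20, 'Statistics');
    INSERT INTO enrollments VALUES (1, 10), (1, 20), (2, 10);
    """
)

# One student can take many courses...
courses_for_ada = [
    row[0]
    for row in conn.execute(
        "SELECT c.title FROM enrollments e "
        "JOIN courses c ON c.course_id = e.course_id "
        "WHERE e.student_id = 1 ORDER BY c.title"
    )
]

# ...and one course can have many students.
students_in_databases = [
    row[0]
    for row in conn.execute(
        "SELECT s.name FROM enrollments e "
        "JOIN students s ON s.student_id = e.student_id "
        "WHERE e.course_id = 10 ORDER BY s.name"
    )
]
```

Each side of the junction table is a One-to-Many relationship: one student has many enrollment rows, and one course has many enrollment rows.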