Data Warehousing and Dimensional Modeling

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/46

flashcard set

Earn XP

Description and Tags

These flashcards cover essential vocabulary relevant to dimensional modeling and data mining within data warehousing.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

47 Terms

1
New cards

Data Warehouse

A centralized repository for storing large amounts of data, optimized for query and analysis.

2
New cards

Dimensional Modeling

A data modeling technique used to design data warehouses with optimized data retrieval.

3
New cards

Fact Table

Central table in a dimensional model that stores measurable business data.

4
New cards

Dimension Table

Table that contains attributes related to the facts, used to categorize facts.

5
New cards

Star Schema

A type of database schema that consists of a central fact table surrounded by dimension tables.

6
New cards

Snowflake Schema

A more complex schema compared to star schema which normalizes dimension tables into multiple related tables.

7
New cards

Galaxy Schema

Also known as fact constellation schema, contains multiple fact tables sharing common dimension tables.

8
New cards

Fact Constellation Schema

A schema that includes multiple fact tables and associated dimension tables.

9
New cards

Conformed Dimension

Dimension that has the same meaning across different fact tables.

10
New cards

Outrigger Dimension

A dimension that connects different dimension tables to provide additional attributes.

11
New cards

Shrunken Dimension

A reduced granularity of a dimension to optimize performance.

12
New cards

Role-Playing Dimension

A dimension that has multiple valid relationships with other tables.

13
New cards

Degenerate Dimension

Dimension that is derived from fact table attributes and does not have its own dimension table.

14
New cards

Junk Dimension

A dimension that combines multiple low cardinality attributes to reduce complexity.

15
New cards

Swappable Dimension

Dimension with multiple versions that can be swapped at query time.

16
New cards

Step Dimension

Dimension that identifies the step in a process, with each step assigned a number.

17
New cards

Attributes

Characteristics of a dimension that describe the data and can be used for filtering.

18
New cards

Data Mining

The process of analyzing large datasets to identify patterns and insights for decision-making.

19
New cards

Classification

Data mining technique used to categorize data based on predefined labels.

20
New cards

Clustering

Grouping similar data points in data mining without prior labels.

21
New cards

Association Rule Mining

Technique to find relationships between variables in large datasets.

22
New cards

Anomaly Detection

Identifying data points that deviate from expected patterns in data mining.

23
New cards

Regression Analysis

Predicting numeric values based on historical data.

24
New cards

Text Mining

Analyzing unstructured text data to extract meaningful information.

25
New cards

Neural Networks

Complex models that learn patterns from large datasets in data mining.

26
New cards

SQL

Structured Query Language used for managing and querying relational databases.

27
New cards

Data Segmentation

Dividing a dataset into distinct subgroups for analysis.

28
New cards

Fraud Detection

Analyzing data to uncover and prevent fraudulent activities.

29
New cards

Risk Assessment

Evaluating potential risks based on data analysis.

30
New cards

Compliance Analysis

Ensuring adherence to regulations through data examination.

31
New cards

Data Integration

Combining data from different sources into a coherent data set.

32
New cards

Performance Analysis

Evaluating performance metrics to inform business decisions.

33
New cards

Customer Segmentation

Categorizing customers into groups for targeted analysis.

34
New cards

OLAP Systems

Online Analytical Processing systems designed for complex queries and data analysis.

35
New cards

Fact Metrics

Quantifiable measurements that represent business performance.

36
New cards

Dimension Attributes

Qualitative fields that provide context to the facts in a data model.

37
New cards

Database Normalization

Organizational process of structuring a relational database to reduce redundancy.

38
New cards

Data Retrieval Optimization

Techniques to improve the speed and efficiency of data access.

39
New cards

Data Analysis Tools

Software applications used for performing extensive data analysis.

40
New cards

Historical Data

Past data utilized for forecasting and predictive analysis.

41
New cards

Financial Risk Assessment

Analyzing data to identify potential financial threats to an organization.

42
New cards

Data Visualization

The graphical representation of information and data to communicate insights.

43
New cards

Business Intelligence

Technologies and strategies for analyzing business data to enhance decision-making.

44
New cards

Granularity

The level of detail represented in data.

45
New cards

Query Complexity

An assessment of how intricate a database query is to execute.

46
New cards

Data Governance

Managing the availability, usability, integrity, and security of data used in an organization.

47
New cards

Operational Data Store (ODS)

Database designed for day-to-day operations and immediate data access.