Modeling a Data Warehouse

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/31

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

32 Terms

1
New cards

The 7 W’s of Business Questions

  • WHO is involved?

  • WHAT did they do? To WHAT is it done?

  • WHEN did it happen?

  • WHERE did it take place?

  • HoW many or much was recorded – HoW can it be measured?

  • WHY did it happen?

  • HoW did it happen – in what manner?

2
New cards

What is a warehouse?

Efficiently store things so you can easily find them later when they are needed

3
New cards

Business Intelligence

  • Infrastructure for collecting, storing, analyzing data produced by business

  • Databases, data warehouses, data marts

4
New cards

Data Warehouse

  • Tools and techniques for analyzing data

  • OLAP, statistics, models, data mining

Purpose: Consolidate and integrate data from multiple sources (including various databases) into a central repository designed for analysis.

Structure: Often uses dimensional modeling (fact and dimension tables) for efficient querying.

Benefit: Provides a historical and holistic view of the organization’s data over time, enabling deeper insights.

5
New cards

Database Design Approaches

  • Database Design Approaches

  • DW/BI – Data Warehouse / Business Intelligence

6
New cards

OLTP – On-Line Transaction Processing

  • Operational systems to support the execution of business processes

  • Execute individual business processes

  • Processes are stable, predictable, pre-defined, real-time

  • Optimized for transaction processing: (Select, Insert, Update, Delete)

7
New cards

DW/BI – Data Warehouse / Business Intelligence

  • Support the evaluation of business processes

  • Evaluate effects of multiple business processes

  • Optimized for query processing: (Select)

  • Ad-hoc, querying very large amounts of data

8
New cards

Normalization (Normal Forms) Refresher

  • First Normal Form (1NF)

  • Second Normal Form (2NF)

  • Third Normal Form (3NF)

9
New cards

First Normal Form

  • All attributes have an ‘atomic’ value

  • Remove redundant columns and fields, and add a primary key

10
New cards

Second Normal Form (2NF)

  • Lessens redundancy of 1NF, by eliminating partial dependencies

11
New cards

Third Normal Form (3NF)

  • Further reduces data duplication, by removing transitive dependencies

    • (Transitive = Indirect dependency)

12
New cards

OLTP Advantages

  • Very efficient for data entry

  • Normalization minimizes duplication (3NF)

13
New cards

OLTP Disadvantages

  • ERD Diagrams can be very complex!

  • Many joins and combinations

14
New cards

data cube

Grouping of data in a multidimensional matrix.

  • Arrange relevant information together

  • Expedite query response times

  • Analyze large datasets quickly

15
New cards

Dimensional Modeling

Define business processes and individual events in terms of:

  • Facts

  • Dimensions

16
New cards

fact

Measurements. A quantitative piece of information.

17
New cards

Dimensions

Descriptions of the objects. One of the 7 W’s.

18
New cards

star

Single fact table referenced by several dimension tables.

19
New cards

Design Considerations

  • Data Driven Analysis

  • Report Driven Analysis

  • Reactive BI Design

  • Proactive BI Design

20
New cards

Data Driven Analysis

Analyze operational data sources

Starts with the raw data and explores it to uncover patterns, trends, or insights without a predefined set of questions. The analysis evolves based on what the data reveals.

21
New cards

Report Driven Analysis

Analyze business intelligence needs

Begins with a specific set of reporting requirements or business questions. The data warehouse and BI tools are designed to generate predefined reports that answer these questions.

22
New cards

Reactive BI Design

Wait for OLTP design and data

an established company might have an existing OLTP system and opt for a reactive approach

23
New cards

Proactive BI Design

Design DW/BI in parallel with OLTP

For example, a startup might favor a proactive design to quickly gain business insights

24
New cards

So how do you start the design?

‘Chicken or Egg’ conumdrum

25
New cards

agile modeling

Early and frequent delivery of working models, focusing on stakeholder input and value

  • collaborative

  • Incremental

  • Iterative

Early and Frequent Delivery: Delivering working models incrementally rather than waiting for a complete design.

Collaborative Approach: Involving stakeholders directly in the modeling process to ensure the design meets their needs.

Incremental and Iterative: Gradually refining the model based on ongoing feedback and new requirements.

26
New cards

collaborative

Model directly with stakeholders

27
New cards

Incremental

Provide more requirements when they are better understood by stakeholders

28
New cards

Iterative

Understanding existing requirements better and improve existing schemas refactoring:

  • Correcting mistakes

  • Adding missing attributes

29
New cards

Describe the 7 W’s of Business Analytics

  • Who

  • What

  • When

  • Why

  • How Many

  • How did it work

30
New cards

Descibe the difference between OLTP and DW/BI

OLTP = Execution

DW/BI = Evaluation

31
New cards

Understand fact and dimension tables (Star Schema)

  • Fact = Measurments

  • Dimensions = Descriptions

32
New cards

Designing and Modeling BI Database

  • agile with stakeholders: Collaborative, Incremental, Iterative