Data Warehousing Vocabulary Practice

Description and Tags

Comprehensive vocabulary flashcards covering Data Warehousing definitions, features, processing types, architectural approaches (Inmon vs. Kimball), and schema designs (Star vs. Snowflake).

Last updated 3:53 PM on 5/14/26

32 Terms

New cards

Data Warehousing

A collection of an organisation's data, gathered in one place, that helps analysts make informed decisions.

OLAP

Online Analytical Processing: tools for analysing data in a multidimensional space, supporting data generalisation and data mining.

Subject-oriented

A feature of a data warehouse focusing on modelling and analysis of data around a specific subject rather than ongoing operations.

Integrated

A feature of data warehouses constructed by combining data from many different sources such as relational databases or flat files.

Time-variant

A feature where the data collected in a warehouse is identified with a particular time period.

Non-volatile

A feature where previous data is not erased when new data is added to the data warehouse.
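
The time-variant and non-volatile features can be pictured together as an append-only store in which every row is stamped with the period it describes. The sketch below is only an illustration under that assumption; the class name `AppendOnlyStore` and its methods are hypothetical, not part of any warehouse product.

```python
class AppendOnlyStore:
    """Rows are only ever added; each carries the period it describes."""

    def __init__(self):
        self._rows = []

    def load(self, period, product, price):
        # Non-volatile: new data is appended, old rows are never changed.
        self._rows.append((period, product, price))

    def as_of(self, period, product):
        """Return the latest price recorded on or before `period`."""
        matches = [r for r in self._rows if r[1] == product and r[0] <= period]
        return max(matches)[2] if matches else None


store = AppendOnlyStore()
store.load("2023", "Laptop", 900)
store.load("2024", "Laptop", 950)
```

Because the 2023 row survives the 2024 load, `store.as_of("2023", "Laptop")` still answers with the older price.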

Information Processing

A type of warehouse processing that deals with querying, basic statistical analysis, and reporting using crosstabs, tables, charts, or graphs.

Analytical Processing

Processing of information using OLAP operations such as slice and dice, drill down, drill up, and pivoting.
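
The OLAP operations named above can be tried out on a tiny in-memory data set. This is a minimal sketch in plain Python, assuming made-up sales rows; the function names (`slice_`, `dice`, `aggregate`, `pivot`) are illustrative and do not come from any OLAP tool.

```python
from collections import defaultdict

# Hypothetical sales facts: (year, quarter, region, product, amount)
SALES = [
    (2023, "Q1", "North", "Laptop", 100),
    (2023, "Q1", "South", "Laptop", 80),
    (2023, "Q2", "North", "Phone", 50),
    (2024, "Q1", "North", "Laptop", 120),
    (2024, "Q1", "South", "Phone", 70),
]
FIELDS = ("year", "quarter", "region", "product", "amount")
ROWS = [dict(zip(FIELDS, r)) for r in SALES]


def slice_(rows, dim, value):
    """Slice: fix one dimension to a single value."""
    return [r for r in rows if r[dim] == value]


def dice(rows, **allowed):
    """Dice: keep rows whose dimensions fall in the given value sets."""
    return [r for r in rows if all(r[d] in vals for d, vals in allowed.items())]


def aggregate(rows, dims):
    """Sum amount grouped by dims. Fewer dims = drill up, more = drill down."""
    totals = defaultdict(int)
    for r in rows:
        totals[tuple(r[d] for d in dims)] += r["amount"]
    return dict(totals)


def pivot(rows, row_dim, col_dim):
    """Pivot: rotate two dimensions into a row-by-column grid of totals."""
    grid = defaultdict(dict)
    for (rv, cv), total in aggregate(rows, (row_dim, col_dim)).items():
        grid[rv][cv] = total
    return dict(grid)


by_year = aggregate(ROWS, ("year",))                     # drill up
by_year_quarter = aggregate(ROWS, ("year", "quarter"))   # drill down
north_only = slice_(ROWS, "region", "North")
laptops_2023 = dice(ROWS, year={2023}, product={"Laptop"})
region_by_year = pivot(ROWS, "region", "year")
```

Drill up and drill down are the same grouping at different levels of detail, which is why one `aggregate` helper covers both here.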

Data Mining

Knowledge discovery by finding hidden patterns and associations, constructing analytical models, and performing classification and predictions.

Enterprise Data Warehouse (EDW)

A data warehouse environment that serves an entire enterprise. It may include an operational data store and both physical and virtual data marts.

Data Marts

Subsets of data used by individual departments or groups, often building on a dimensional data model.

Operational data store

A hybrid form of data warehouse containing integrated information from many different databases.

Update Driven Approach

An alternative approach in which information from multiple heterogeneous sources is integrated in advance and made available for direct querying. It offers high performance because the data is copied, processed, integrated, annotated, summarised and restructured in a semantic data store ahead of time.
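
The idea of integrating in advance can be sketched in a few lines. The example below assumes two invented sources (`crm` and `web_shop`) and hypothetical helper names; it only illustrates that the integration work happens before any query arrives.

```python
# Hypothetical heterogeneous sources with inconsistent customer naming.
SOURCES = {
    "crm": [{"customer": "alice", "spend": 40}],
    "web_shop": [{"customer": "Alice", "spend": 60}, {"customer": "bob", "spend": 25}],
}


def build_warehouse(sources):
    """Run in advance: integrate and summarise all sources into one store."""
    summary = {}
    for name, rows in sources.items():
        for row in rows:
            key = row["customer"].lower()          # reconcile naming differences
            entry = summary.setdefault(key, {"spend": 0, "sources": set()})
            entry["spend"] += row["spend"]         # summarise
            entry["sources"].add(name)             # annotate with provenance
    return summary


WAREHOUSE = build_warehouse(SOURCES)   # done ahead of any query


def total_spend(customer):
    """Query time: a direct lookup that never touches the source systems."""
    return WAREHOUSE[customer.lower()]["spend"]
```

The query function is cheap because the expensive reconciliation already ran in `build_warehouse`.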

Data extraction

The warehouse tool function of gathering data from many different sources.

Data cleaning

The warehouse tool function of finding and correcting data errors.

Data transformation

The warehouse tool function of converting source data into the specific data warehouse format.
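
Extraction, cleaning and transformation are usually chained together. The sketch below shows one possible shape of that chain using only the standard library; the sample sources, the choice to drop bad rows, and the target format (title-cased name, amount stored as an integer number of cents) are all assumptions made for illustration.

```python
import csv
import io

# Two hypothetical sources: a CSV flat file and rows from an operational database.
CSV_SOURCE = "customer,amount,date\nalice , 10.50,2024-01-05\nBOB,abc,2024-01-06\n"
DB_SOURCE = [{"customer": "Carol", "amount": 7, "date": "2024-01-07"}]


def extract():
    """Extraction: gather raw rows from every source."""
    rows = list(csv.DictReader(io.StringIO(CSV_SOURCE)))
    rows.extend(DB_SOURCE)
    return rows


def clean(rows):
    """Cleaning: trim whitespace and drop rows whose amount is not numeric."""
    cleaned = []
    for row in rows:
        try:
            amount = float(str(row["amount"]).strip())
        except ValueError:
            continue  # a real tool might log or repair the row instead
        cleaned.append({"customer": str(row["customer"]).strip(),
                        "amount": amount, "date": row["date"]})
    return cleaned


def transform(rows):
    """Transformation: convert to the (assumed) warehouse format."""
    return [(r["customer"].title(), round(r["amount"] * 100), r["date"]) for r in rows]


warehouse_rows = transform(clean(extract()))
```

The row with the non-numeric amount is rejected during cleaning, so only two rows reach the warehouse format.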

Metadata

Known as 'data about data', it defines warehouse objects and acts as a directory to help locate the contents of a data warehouse.

Normalisation

A process for converting complex data structures into simple, stable data structures with minimal redundancy, typically to a normal form such as 3NF or 4NF.
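
A small worked example may help here. The sketch below splits flat order records, where customer details repeat on every row, into a customers table and an orders table. The data and the `normalise` helper are hypothetical, and the result is only a simplified step towards a normal form, not a full 3NF design.

```python
# Hypothetical flat order records: customer details repeat on every row.
FLAT_ORDERS = [
    {"order_id": 1, "customer": "Alice", "city": "Leeds", "total": 30},
    {"order_id": 2, "customer": "Alice", "city": "Leeds", "total": 45},
    {"order_id": 3, "customer": "Bob", "city": "York", "total": 20},
]


def normalise(flat_rows):
    """Split flat rows into a customers table and an orders table."""
    customers = {}   # customer name -> customer record
    orders = []
    for row in flat_rows:
        if row["customer"] not in customers:
            customers[row["customer"]] = {
                "customer_id": len(customers) + 1,
                "name": row["customer"],
                "city": row["city"],
            }
        orders.append({
            "order_id": row["order_id"],
            "customer_id": customers[row["customer"]]["customer_id"],
            "total": row["total"],
        })
    return list(customers.values()), orders


customers, orders = normalise(FLAT_ORDERS)
```

After the split, Alice's city is stored once, so a change of address touches one row instead of every order.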

Inmon Approach

A top-down approach to data warehouse design that uses entity-relationship modelling and normalisation. Time, cost and maintenance are high.

Kimball Approach

A bottom-up approach to data warehouse design that focuses on dimensional modelling, allowing separate models for individual areas of interest. Time, cost and maintenance are low.

Denormalisation

A design technique that groups data into fewer tables; it offers better performance when reading data for analytical purposes because it reduces query complexity.
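
To see why fewer tables can simplify analytical reads, the sketch below copies customer attributes onto every order row and then totals sales by city without any join. The tables and helper names are hypothetical and chosen only for illustration.

```python
# Hypothetical normalised tables.
CUSTOMERS = {1: {"name": "Alice", "city": "Leeds"}, 2: {"name": "Bob", "city": "York"}}
ORDERS = [
    {"order_id": 1, "customer_id": 1, "total": 30},
    {"order_id": 2, "customer_id": 1, "total": 45},
    {"order_id": 3, "customer_id": 2, "total": 20},
]


def denormalise(orders, customers):
    """Copy customer attributes onto every order row (one wide table)."""
    return [{**o, **customers[o["customer_id"]]} for o in orders]


WIDE = denormalise(ORDERS, CUSTOMERS)


def total_by_city(wide_rows):
    """Analytical read with no join: everything needed is on each row."""
    totals = {}
    for row in wide_rows:
        totals[row["city"]] = totals.get(row["city"], 0) + row["total"]
    return totals


city_totals = total_by_city(WIDE)
```

The trade-off is visible in `WIDE`: Alice's name and city now appear on two rows, which is the redundancy that normalisation would remove.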

Fact Tables

Tables that contain the measurements of a business, such as sales, purchase orders, or shipment information.

Dimension Tables

Tables that store descriptions of the dimensions of the business, such as products, customers, vendors, or stores.

Transaction Fact Tables

A type of fact table used to record information related to specific events, such as individual product sales.

Snapshot Fact Tables

A type of fact table that records information applying to specific moments in time, such as year-end accounts.
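
The difference between transaction and snapshot fact tables shows up clearly when one is derived from the other. The sketch below assumes invented sales events and opening stock levels, and a hypothetical `month_end_snapshot` helper that records the stock position at the end of a month.

```python
from collections import defaultdict

# Transaction facts: one row per event (date, product, quantity sold).
TRANSACTIONS = [
    ("2024-01-30", "Laptop", 2),
    ("2024-01-31", "Laptop", 1),
    ("2024-02-02", "Laptop", 4),
    ("2024-02-15", "Phone", 3),
]
OPENING_STOCK = {"Laptop": 10, "Phone": 5}   # hypothetical starting levels


def month_end_snapshot(transactions, opening_stock, month):
    """Snapshot fact rows: stock level per product at the end of `month`."""
    sold = defaultdict(int)
    for date, product, qty in transactions:
        if date[:7] <= month:          # ISO dates compare correctly as text
            sold[product] += qty
    return [(month, product, start - sold[product])
            for product, start in sorted(opening_stock.items())]


jan_snapshot = month_end_snapshot(TRANSACTIONS, OPENING_STOCK, "2024-01")
feb_snapshot = month_end_snapshot(TRANSACTIONS, OPENING_STOCK, "2024-02")
```

The transaction rows answer "what happened?", while each snapshot row answers "what was the position at that moment?".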

Surrogate primary keys

A single-column integer key that stands in for a natural primary key and is used to map fact table rows to specific rows in dimension tables.
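
One simple way to picture surrogate keys is a lookup that hands out the next integer the first time a natural key is seen. The class `SurrogateKeyMap` and the (supplier code, SKU) natural keys below are hypothetical; real warehouses typically generate these keys in the database or the loading tool.

```python
from itertools import count


class SurrogateKeyMap:
    """Assign a stable integer surrogate key to each natural key."""

    def __init__(self):
        self._next = count(1)
        self._keys = {}

    def key_for(self, natural_key):
        # First sighting of a natural key allocates the next integer.
        if natural_key not in self._keys:
            self._keys[natural_key] = next(self._next)
        return self._keys[natural_key]


product_keys = SurrogateKeyMap()
# Natural keys here are hypothetical (supplier code, SKU) pairs.
sales = [(("ACME", "SKU-9"), 3), (("BOLT", "SKU-2"), 1), (("ACME", "SKU-9"), 5)]
fact_rows = [(product_keys.key_for(nk), qty) for nk, qty in sales]
```

The fact rows carry only the small integer, while the multi-part natural key stays in the dimension table.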

Star Schema

A schema in which a central fact table links to the surrounding dimension tables through keys; the dimension tables are denormalised, duplicating some data to simplify queries.
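
A star schema can be tried out with Python's built-in `sqlite3` module. The sketch below is one possible layout; the table and column names (`fact_sales`, `dim_product`, `dim_store`) and the sample rows are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, name TEXT, category TEXT);
CREATE TABLE dim_store   (store_key INTEGER PRIMARY KEY, city TEXT, region TEXT);
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    store_key   INTEGER REFERENCES dim_store(store_key),
    amount      REAL
);
INSERT INTO dim_product VALUES (1, 'Laptop', 'Computing'), (2, 'Phone', 'Mobile');
INSERT INTO dim_store   VALUES (1, 'Leeds', 'North'), (2, 'Bristol', 'South');
INSERT INTO fact_sales  VALUES (1, 1, 900.0), (2, 1, 400.0), (1, 2, 950.0);
""")

# Every dimension is exactly one join away from the fact table.
star_totals = conn.execute("""
    SELECT p.category, s.region, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_key = f.product_key
    JOIN dim_store   AS s ON s.store_key = f.store_key
    GROUP BY p.category, s.region
    ORDER BY p.category, s.region
""").fetchall()
```

Note that `category` and `region` are stored as plain text on the dimension rows, which is the denormalisation that keeps the query to one join per dimension.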

Snowflake Schema

A multi-dimensional structure that normalises data within a star schema, splitting dimension tables into a series of normalised tables. Queries are more complex because they must join through the additional tables.
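
The extra join that a snowflake schema introduces can be seen in a short `sqlite3` sketch. Here the product dimension is split so that categories live in their own table; all table names, column names and rows are illustrative assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_category (category_key INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_product (
    product_key  INTEGER PRIMARY KEY,
    name         TEXT,
    category_key INTEGER REFERENCES dim_category(category_key)
);
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    amount      REAL
);
INSERT INTO dim_category VALUES (1, 'Computing'), (2, 'Mobile');
INSERT INTO dim_product  VALUES (1, 'Laptop', 1), (2, 'Tablet', 1), (3, 'Phone', 2);
INSERT INTO fact_sales   VALUES (1, 900.0), (2, 300.0), (3, 400.0);
""")

# Reaching the category now takes two joins instead of one.
snowflake_totals = conn.execute("""
    SELECT c.category, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product  AS p ON p.product_key = f.product_key
    JOIN dim_category AS c ON c.category_key = p.category_key
    GROUP BY c.category
    ORDER BY c.category
""").fetchall()
```

Each category name is stored once, at the cost of a longer join path for any query that groups by category.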

Process flow

Extract and load data
Clean and transform data
Backup and archive data
Manage queries and direct them to the appropriate data sources

Benefits of star schema

It is denormalised, so queries are simpler because data connects through the fact table. It removes the join bottleneck of a normalised schema and is used by OLAP cubes.

Challenges of star schema

Data integrity is weaker because of denormalisation, and the schema is too simple to handle complex queries or many-to-many relationships.

Accumulating snapshot tables

A type of fact table that records a running tally of data, for example a year-to-year sales figure.
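
A running tally, as this card describes it, can be computed with `itertools.accumulate`. The monthly figures and the `running_tally` helper below are hypothetical and serve only to show the cumulative pattern.

```python
from itertools import accumulate

MONTHLY_SALES = [("2024-01", 120), ("2024-02", 90), ("2024-03", 150)]  # hypothetical


def running_tally(monthly):
    """Pair each period with the cumulative total up to and including it."""
    periods = [period for period, _ in monthly]
    totals = accumulate(amount for _, amount in monthly)
    return list(zip(periods, totals))


tally = running_tally(MONTHLY_SALES)
```

Each row of `tally` holds the total so far, so the last row always carries the figure for the whole range.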