IB Comp Sci Option A Databases HL

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/33

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

34 Terms

1
New cards

Relational

Stores data in tables with relationships between them.

2
New cards

Object-oriented

Stores data as objects, similar to object-oriented programming languages.

3
New cards

Network

Based on a traditional hierarchical database, Data is represented using a graph, with records and links. In Networsk Child Records can have multiple parent records

4
New cards

Spatial

A Type of Databse Optimized for storing and querying data representing objects defined in a geometric space.

5
New cards

Multi-dimensional

Data is organized into dimensions, often used in data warehousing and OLAP (Online Analytical Processing)

6
New cards

Data Definition

The process of specifying the structure of a database — including the tables, fields (columns), data types, and relationships.

7
New cards

Data Manipulation

The process of adding, changing, retrieving, or removing data in a database.

8
New cards

Data Integrity

The accuracy, consistency, and reliability of data in the database.

9
New cards

Data Warehouse

A subject-oriented, integrated, time-variant, and non-volatile collection of data is used in decision-making.


A large Centralized repository for storing integrated data from multiple sources.

10
New cards

Subject-oriented

Organized around major subjects like sales, product.

11
New cards

Integrated

Data is collected from various sources and merged into a coherent whole.

12
New cards

Time-variant

Data is kept for historical analysis and has a time dimension.

13
New cards

Non-volatile

Once entered into the warehouse, data is not updated.

14
New cards

Strategic Planning

Using data to make long-term decisions.

15
New cards

Business Modelling

Creating data models to simulate different business scenarios.

16
New cards

Time Dependency

Data in a warehouse is valid for a specific range of time.

17
New cards

Real-time Updates

Data is constantly refreshed from operational systems.

18
New cards

Advantages of Data Warhouse

Centralizes data management

Supports decision-making with complex queries.

Enables historical analysis and trend identification over time

Allows for faster query performance due to optimized structure (e.g., de-normalized schema)

Doesnt Effect Daily Operations

19
New cards

Extract

Pulling data from various sources.

Such as:

Operational Databases

CRM (Customer Relationship Management) System

ERP (Enterprise Resource Planning) system

Spreadsheets & CSV files

20
New cards

Transform

Extracted data is cleaned, restructured, and converted to match the format and rules of the target data warehouse.

Deduplicate

Clean (Remove Invalid or Missing Data)

Standardize (Make sure all data follows same rules)

21
New cards

Loading

Inserting data into the final target database or data warehouse.

22
New cards

Data Cleaning

Enhancing data quality by rectifying inconsistent, incomplete, or inaccurate data.

23
New cards

Pattern Discovery

Identifying patterns or correlations in large datasets using methods like cluster analysis, associations, classifications, sequential patterns, forecasting.

24
New cards

Benefits of Pattern Analysis

Fraud detection in banking

optimizing retail marketing strategies.

Disease Detection in Healthcare

25
New cards

Predictive Modelling

Using statistical techniques for prediction like decision tree induction and neural networks.

26
New cards

Database Segmentation

Dividing a database into distinct segments based on certain criteria.

27
New cards

Link Analysis

Analyzing connections between nodes in a network to identify relationships and patterns.

28
New cards

Deviation Detection

Identifying unexpected or rare items, events, or observations that raise suspicions by differing significantly.

29
New cards

Decision tree induction

A predictive modelling technique where a system learns from historical data by building a tree-like structure of decisions and outcomes.

30
New cards

Cluster Analysis

A method of grouping similar data points together based on shared characteristics, without pre-labeled categories.

  • Uses Unsupervised learning

  • Example: Segmenting customers into groups based on shopping habit

31
New cards

Associations

Rule Learning, an (If, Then)

Identifies relationships between variables by finding items that frequently occur together in a dataset.

  • Often used in market basket analysis

  • Example: “If a customer buys bread and butter, they are likely to buy jam”

32
New cards

Classifications

Assigns data into predefined categories or classes based on learned patterns from labeled training data.

  • Uses Supervised learning

  • Example: Predicting if an email is spam or not based on its contents

33
New cards

Sequential patterns

Identifies frequently occurring ordered sequences of events or items in a dataset.

  • Focuses on time-based patterns

  • Example: A user who buys a phone is likely to buy a case a week later, then a charger

34
New cards

forecasting

Predicts future values based on patterns found in historical data, using techniques like regression or time series analysis.

  • Example: Stock Market Prices