***All data apprenticeship

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/14

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 11:38 AM on 6/18/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

15 Terms

1
New cards

What are the 7 data principles

  • Data minimization

  • Storage limitation

  • Accuracy

  • Accountability

  • Purpose limitation

  • Integrity and Confidentiality

  • Lawfulness, fairness, and transparency

2
New cards

What is a data warehouse?

A data warehouse is a centralized repository that stores large volumes of structured data from various sources, optimized for analytics and reporting.

3
New cards

What is a data lake?

A data lake is a storage repository that holds vast amounts of raw, unstructured, or semi-structured data in its native format until it is needed for analysis.

4
New cards

What is a database?

A database is an organised, digital collection of information designed for easy access, management, and updating.

5
New cards

What is an ERD?

An ERD (Entity-Relationship Diagram) is a visual representation of the relationships between entities in a database. It illustrates how data entities are related to one another within a system, typically used in database design.

6
New cards

What is statistics?

Statistics is the branch of mathematics that deals with collecting, analyzing, interpreting, presenting, and organizing data.

7
New cards

What is inferential statistics?

Inferential statistics uses a sample to make conclusions about a population.

8
New cards

What is descriptive analysis?

Using descriptive statistics, plus tables and charts, to summarise and present a dataset so it's understandable — answering "what happened" rather than "why" or "what's next."

9
New cards

What is descriptive statistics?

They are the numbers that summarise a dataset. They tell you what the data looks like, not why it's like that or what happens next.

10
New cards

What is Pearson's correlation test?

Pearson's correlation test measures the strength and direction of the linear relationship between two variables.

11
New cards

What is predictive analytics?

Predictive analytics refers to a set of techniques that analyze historical data and identify patterns to predict future outcomes.

12
New cards

What is prescriptive analytics?

Prescriptive analytics provides recommendations for actions based on predictive analytics and data analysis, helping organizations make informed decisions.

13
New cards

What is RMSE?

Root Mean Square Error (RMSE) is a commonly used metric to measure the differences between predicted values by a model and the actual values. It represents the square root of the average of the squared differences.

14
New cards

What is overfitting in statistical models?

Overfitting occurs when a statistical model learns not only the underlying patterns in the training data but also the noise, leading to poor generalization to new, unseen data. This situation often results in high accuracy on the training set but low accuracy on the validation or test set.

15
New cards

What is statistics?

Statistics is the science of collecting, analyzing, interpreting, presenting, and organizing data. It provides methodologies for understanding and making decisions based on numerical information.