Theme 1 ~ Foundations of Data Fluency

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/42

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 12:43 AM on 3/25/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

43 Terms

1
New cards

What is Data

Raw form of what informs our information, once its organized and process it turns into valuable insights

2
New cards

What three things influence data reliability

Reviews, Best seller badges, Photos

3
New cards

Human Basis

Search words, past uses & preferences, filling in what we don’t know with assumptions

4
New cards

Platform Basis

Sponsorships, location and delivery access, past purchases, popularity and assumptions about you

5
New cards

Visible data

Information that can be easily observed, collected, or quantified.

6
New cards

Invisible data

Information that is hidden, less accessible, or not immediately measurable.

7
New cards

Steps of the Decision making process

  1. Define Goal (feature matters most) 

  2. Collected data 

  3. Compared Options 

  4. Watch for Bias 

  5. Apply Decision Rule

8
New cards

Quantitative

Measurable, countable data

9
New cards

Qualitive

Non numerical, descriptive data that provides meaning

10
New cards

Structured Data Sets

Excel sheet vibes, organised columns and rows, consistent format with predefined categories

11
New cards

Pros of Structured Data Sets

Easier to analyse, consistent, more accurate (rules), Easy to store / link with others

12
New cards

Cons of Structured Data Sets

Real world data is typically more complex, you need to know upfront what your getting

13
New cards

Unstructured Data Set

No fixed format (emails, social media content etc)

14
New cards

Pros of Unstructured

Lots of info, better insights into human behaviour

15
New cards

Cons of Unstructured

Getting it to store and link is very hard

16
New cards

Semi Structured Data Set

Put anchors to grab what is streamlined while allowing the  unstructured bits

17
New cards

Pros of Semi

Flexible, adaptable, easy to store some

18
New cards

Cons of Semi

Inconsistent and can be hard to operate on

19
New cards

Format

Structure and encoding of data set (JPEG, link etc)

20
New cards

Modality

Fundamental type of information or the method by which it was collected (text, images etc)

21
New cards

Granularity

Level of detail given within the data set

22
New cards

Individual Granularity

Each record is a single entity, allows you to do individual information (variability increase)

23
New cards

Aggregate Level Granularity

Data is summarised or grouped (Averages of several entities),  trends, averages

24
New cards

Sensitivity

Level of risk or harm if that data is lost / leaked

25
New cards

Static Temporal

Cross-sectional, one point in time, can't use to infer things, Snapshot of time

26
New cards

Dynamic Temporal

Collected over time, longitudinal, time series

27
New cards

Spatial

Anything related to location or physical space

28
New cards

5 V’s of Big Data

  • Volume – The sheer amount of data generated. 

  • Velocity – The speed at which data is created and processed. 

  • Variety – The diversity of data types and sources. 

  • Veracity – The quality, accuracy, and trustworthiness of data. 

  • Value – The usefulness and actionable insights derived from data.

29
New cards

Small Data

Focused, curated dataset with higher control, and interpretability

30
New cards

Open Data

freely available for anyone to use, often provided by the governments or organizations thou portals

31
New cards

Closed Data

restricted, requiring payment or permissions to access

32
New cards

Absolute Change

Simply subtracting one from another

33
New cards

Relative Change

Percent change, typically over time

34
New cards

Probability/Risk

Calculated chance that something occurs, or doesn't 

35
New cards

Uncertainty/Margin of Error

A number estimate with a range (typically analysed good or bad)

36
New cards

Sampling

What is/is not included in the data results (think online revs, health norms)

37
New cards

Correlation

Two variables are related and tend to change together, not ALWAYS directly related. Try avoid saying words like x causes y, x leads to y, etc

38
New cards

Causation

One variable directly causes a change in another, there is a guaranteed cause and affect

39
New cards

Spurious Correlations

Happen when two variables follow the same trend BUT have absolutely no meaningful causal connection

40
New cards

Probability

How likely something is to happen [0,1]

41
New cards

Margin of Error

The natural gap between a prediction/estimate says will happen vs what actually happens

42
New cards

Confidence Interval

A range of values that gives a span for the true answer, showing how much uncertainty the estimate has

43
New cards

Risk

Risk is the chance that something negative or unwanted might happen and the consequences if it happens

Explore top notes

Explore top flashcards

flashcards
BH E3- Review
136
Updated 327d ago
0.0(0)
flashcards
AP Lang terms: "R-A"
65
Updated 1211d ago
0.0(0)
flashcards
Unit 1 Words (Classics 30)
44
Updated 796d ago
0.0(0)
flashcards
Anatomy Chapter 14 Quiz #3
26
Updated 23d ago
0.0(0)
flashcards
Frans juillet-decembre
112
Updated 54d ago
0.0(0)
flashcards
Verbos Útiles
100
Updated 756d ago
0.0(0)
flashcards
BH E3- Review
136
Updated 327d ago
0.0(0)
flashcards
AP Lang terms: "R-A"
65
Updated 1211d ago
0.0(0)
flashcards
Unit 1 Words (Classics 30)
44
Updated 796d ago
0.0(0)
flashcards
Anatomy Chapter 14 Quiz #3
26
Updated 23d ago
0.0(0)
flashcards
Frans juillet-decembre
112
Updated 54d ago
0.0(0)
flashcards
Verbos Útiles
100
Updated 756d ago
0.0(0)