[q] Topic 5 - Linear Model

Compendium of vocabulary terms and definitions regarding data fundamentals, linear regression, probability, and hypothesis testing as presented in the DATA1001/1901 lecture notes.

Last updated 9:08 AM on 6/3/26

30 Terms

1
What is Tidy Data?

Data where each column is a variable and each row is an observation.
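
As a rough illustration of this definition, the sketch below reshapes a small "wide" table into one row per observation. The table, the column names, and the helper `to_tidy` are all hypothetical and are not taken from the lecture notes.

```python
# Hypothetical "wide" table: one row per student, one column per quiz.
# Here the quiz columns are really values of a single variable ("quiz").
wide = [
    {"student": "A", "quiz1": 7, "quiz2": 9},
    {"student": "B", "quiz1": 5, "quiz2": 6},
]

def to_tidy(rows, id_key, variable_name, value_name):
    """Reshape wide rows so each output row is one observation."""
    tidy_rows = []
    for row in rows:
        for key, value in row.items():
            if key == id_key:
                continue  # the identifier is repeated on every output row
            tidy_rows.append(
                {id_key: row[id_key], variable_name: key, value_name: value}
            )
    return tidy_rows

# Each row is now one (student, quiz) observation with three variables.
tidy = to_tidy(wide, id_key="student", variable_name="quiz", value_name="score")
```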

2
What does Tidy ≠ clean mean?

Tidy data structure does not imply absence of errors.

3
What are Qualitative Variables?

Categorical variables whose values are labels or categories rather than meaningful numbers.

4
What are Quantitative Variables?

Numerical variables with meaningful values.

5
What is Sampling Bias?

Bias that arises when the sample does not represent the target population.

6
What is Response Bias?

Bias from inaccurate answers, for example those prompted by poorly worded survey questions.

7
What is Non-response Bias?

Bias from certain demographics not responding to surveys.

8
What is Data Linkage?

Combining datasets about the same individuals.

9
What is Measurement Error Formula?

Measurement = exact value + chance error + bias.

10
What is Chance Error?

Random measurement fluctuations.

11
What is Standard Deviation (SD)?

A measure of spread: the typical distance of data points from the mean. It is always non-negative.

12
What is Linear Transformation (Mean)?

New Mean = a + b × (old mean).

13
What is Linear Transformation (SD)?

New SD = |b| × (old SD).
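
The two linear-transformation rules (for the mean and for the SD) can be checked numerically. The sketch below uses made-up Celsius temperatures and the population SD (`statistics.pstdev`); the data and the helper name `transform_summary` are assumptions for illustration only.

```python
import statistics

def transform_summary(data, a, b):
    """Return (mean, SD) of a + b*x, computed directly from the transformed data."""
    transformed = [a + b * x for x in data]
    return statistics.mean(transformed), statistics.pstdev(transformed)

# Hypothetical data: Celsius readings converted to Fahrenheit (a = 32, b = 1.8).
celsius = [10.0, 15.0, 20.0, 25.0, 30.0]
old_mean = statistics.mean(celsius)
old_sd = statistics.pstdev(celsius)

new_mean, new_sd = transform_summary(celsius, a=32, b=1.8)

# A negative multiplier flips the data but the SD stays non-negative (|b| rule).
_, flipped_sd = transform_summary(celsius, a=0, b=-2)
```

Comparing `new_mean` with `32 + 1.8 * old_mean` and `new_sd` with `1.8 * old_sd` should show agreement up to floating-point rounding; the shift `a` does not affect the SD.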

14
What is Correlation Coefficient (r)?

Unitless number between -1 and +1 that measures the strength and direction of linear association.
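
A minimal sketch of how r might be computed by hand, and why it is unitless: rescaling either variable by a positive constant leaves r unchanged. The function name `correlation` and the data are hypothetical.

```python
import math

def correlation(xs, ys):
    """Pearson correlation coefficient r of paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical paired measurements.
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]
weights_kg = [52.0, 58.0, 69.0, 75.0, 88.0]

r_metric = correlation(heights_cm, weights_kg)

# Changing units (cm to inches, kg to pounds) should not change r.
heights_in = [h / 2.54 for h in heights_cm]
weights_lb = [w * 2.2046 for w in weights_kg]
r_imperial = correlation(heights_in, weights_lb)
```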

15
What is Regression Line?

Line that minimizes the sum of squared residuals.

16
What is Residual?

Actual value minus predicted value: residual = y - ŷ.
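
To connect the regression line with its residuals, here is a small sketch that fits a least-squares line and then computes actual minus predicted for each point. The data and the helper names `fit_line` and `residuals` are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x  # the line passes through (mean_x, mean_y)
    return intercept, slope

def residuals(xs, ys, intercept, slope):
    """Residual = actual y minus the y predicted by the line."""
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]

# Hypothetical data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

intercept, slope = fit_line(xs, ys)          # slope 0.6, intercept 2.2
res = residuals(xs, ys, intercept, slope)    # these sum to (about) zero
```

For a least-squares line with an intercept, the residuals always sum to zero, which is a quick sanity check on a fit.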

17
What is Homoscedasticity?

Constant spread of residuals across all values of x.

18
What is R²?

Proportion (often quoted as a percentage) of the variation in y explained by x.

19
What is RMS Error?

Typical size of the residuals; for a least-squares line it equals the SD of the residuals.
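
R² and the RMS error can both be read off the residuals of a fitted line. The sketch below assumes a simple least-squares fit on made-up data; `r_squared_and_rms` is a hypothetical helper, and the RMS error here divides by n (the population convention).

```python
import math

def r_squared_and_rms(xs, ys):
    """Fit a least-squares line, then return (R^2, RMS error)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)

    r_squared = 1 - ss_res / ss_tot   # share of variation in y explained by x
    rms_error = math.sqrt(ss_res / n) # root of the mean squared residual
    return r_squared, rms_error

# Hypothetical data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r_squared, rms_error = r_squared_and_rms(xs, ys)

# Equivalent shortcut: RMS error = sqrt(1 - r^2) * SD of y.
sd_y = math.sqrt(sum((y - sum(ys) / len(ys)) ** 2 for y in ys) / len(ys))
shortcut = math.sqrt(1 - r_squared) * sd_y
```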

20
What is 68-95-99.7 Rule?

In a Normal distribution, about 68%, 95%, and 99.7% of the data lie within 1, 2, and 3 SDs of the mean, respectively.
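
The percentages behind the 68-95-99.7 rule can be recovered from the standard Normal curve. This sketch uses `statistics.NormalDist` from the Python standard library; the helper `within_k_sd` is a name introduced here for illustration.

```python
from statistics import NormalDist

std_normal = NormalDist(mu=0, sigma=1)

def within_k_sd(k):
    """Area under the standard Normal curve between -k and +k SDs."""
    return std_normal.cdf(k) - std_normal.cdf(-k)

# Areas within 1, 2, and 3 SDs of the mean.
areas = [within_k_sd(k) for k in (1, 2, 3)]
```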

21
What are Binomial Distribution Requirements?

A fixed number of independent trials, each with two possible outcomes and the same probability of success.
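
When those requirements hold, the chance of exactly k successes follows the binomial formula. A minimal sketch, assuming a fair coin flipped 10 times (the function name `binomial_pmf` and the scenario are illustrative):

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(exactly k successes in n independent trials, success chance p each)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Hypothetical scenario: 10 flips of a fair coin.
n, p = 10, 0.5
p_five_heads = binomial_pmf(5, n, p)                   # 252 / 1024
total = sum(binomial_pmf(k, n, p) for k in range(n + 1))  # probabilities sum to 1
```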

22
What is Central Limit Theorem (CLT)?

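For large enough samples, the distribution of the sample mean is approximately Normal, whatever the shape of the population (provided its variance is finite).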
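
A small simulation can make the theorem concrete. The sketch below draws repeated samples from a skewed (exponential) population with mean 1 and SD 1, then looks at the sample means. The sample size, number of repetitions, seed, and function name are arbitrary choices for illustration.

```python
import random
import statistics

def sample_means(n, reps, seed=0):
    """Means of `reps` samples of size n from an exponential(1) population."""
    rng = random.Random(seed)
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(reps)
    ]

means = sample_means(n=50, reps=2000)

centre = statistics.fmean(means)   # should sit near the population mean, 1
spread = statistics.pstdev(means)  # should sit near 1 / sqrt(50), about 0.14

# If the means are roughly Normal, about 68% fall within 1 SD of the centre.
share_within_1_sd = sum(abs(m - centre) <= spread for m in means) / len(means)
```

Even though individual exponential draws are strongly right-skewed, a histogram of `means` should look close to bell-shaped.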

23
What is Prosecutor's Fallacy?

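Confusing the probability of the evidence given innocence, P(evidence | innocent), with the probability of innocence given the evidence, P(innocent | evidence).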
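
A worked example with invented numbers shows how far apart the two probabilities can be. Assume a city of 1,000,000 people with one guilty person, a forensic match that an innocent person produces with probability 1 in 10,000, and a guilty person who always matches. All of these figures, and the function name, are hypothetical.

```python
def prob_innocent_given_match(population, p_match_if_innocent, guilty=1):
    """P(innocent | match), assuming every guilty person matches."""
    innocent = population - guilty
    expected_innocent_matches = innocent * p_match_if_innocent
    expected_guilty_matches = guilty  # assumption: P(match | guilty) = 1
    return expected_innocent_matches / (
        expected_innocent_matches + expected_guilty_matches
    )

p_match_if_innocent = 1 / 10_000
p_innocent_if_match = prob_innocent_given_match(1_000_000, p_match_if_innocent)
```

Under these assumptions P(match | innocent) is tiny, yet P(innocent | match) is about 99%, because roughly 100 innocent people are expected to match alongside the one guilty person.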

24
What is P-value?

Probability of results at least as extreme as those observed, assuming the null hypothesis is true.
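
As an illustration, consider testing whether a coin is fair after seeing 60 heads in 100 flips. The one-sided p-value is the chance of 60 or more heads if the coin really is fair. The scenario and the function name `binomial_upper_tail` are assumptions for this sketch.

```python
from math import comb

def binomial_upper_tail(n, k, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p): a one-sided p-value under H0."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

# Hypothetical observation: 60 heads in 100 flips, H0: the coin is fair.
p_value = binomial_upper_tail(n=100, k=60)
```

A small `p_value` says the observed result would be unusual under H0; it is not the probability that H0 is true.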

25
What is Null Hypothesis (H₀)?

Claim of no difference or association.

26
What is Alternative Hypothesis (H₁)?

Claim of difference or association.

27
What is Chi-Squared Test of Independence?

Test for association between two categorical variables.
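
The test statistic compares observed counts with the counts expected if the two variables were independent. The sketch below computes only the statistic and its degrees of freedom (not the p-value) for a made-up contingency table; `chi_squared_statistic` is a hypothetical helper.

```python
def chi_squared_statistic(table):
    """Return (statistic, degrees of freedom) for a contingency table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence: row total * column total / n.
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected

    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df

# Hypothetical 2x2 table of counts (rows: group, columns: response).
associated = [[10, 20], [20, 10]]
stat, df = chi_squared_statistic(associated)

# A table whose rows are exactly proportional shows no association at all.
independent_stat, _ = chi_squared_statistic([[10, 20], [20, 40]])
```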

28
What is Confidence Interval (CI)?

Range produced by a procedure that captures the true parameter in a specified percentage of repeated samples.
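
The "percentage of times" reading can be illustrated by simulation: build many 95% intervals from repeated samples and count how often they contain the true mean. This sketch assumes a Normal population, uses the large-sample interval mean ± 1.96 × SE, and picks an arbitrary true mean, SD, sample size, and seed.

```python
import math
import random
import statistics

def mean_ci(sample, z=1.96):
    """Approximate 95% CI for the mean: sample mean +/- z * standard error."""
    centre = statistics.fmean(sample)
    se = statistics.stdev(sample) / math.sqrt(len(sample))
    return centre - z * se, centre + z * se

def coverage(true_mean=10.0, sd=2.0, n=40, reps=1000, seed=1):
    """Share of simulated intervals that contain the true mean."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        sample = [rng.gauss(true_mean, sd) for _ in range(n)]
        low, high = mean_ci(sample)
        hits += low <= true_mean <= high
    return hits / reps

observed_coverage = coverage()
```

The observed coverage should land near 95%; any single interval either contains the true mean or it does not.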

29
What is Extrapolation?

Predicting outside the range of the observed data.

30
What is Causation vs Association?

x predicting y does not mean x causes y.