Statistics Review Notes

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/16

flashcard set

Earn XP

Description and Tags

These flashcards cover essential concepts related to statistics, including measures of central tendency, variability, correlation, and regression analysis.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

17 Terms

1
New cards

What is a contingency table used for?

To display the frequency distribution of two categorical variables.

2
New cards

What does the five-number summary include?

Minimum, Q1 (lower quartile), median, Q3 (upper quartile), and maximum.

3
New cards

What is the formula for calculating the z-score?

z = (X - mean) / standard deviation.

4
New cards

What does a boxplot visually summarize?

The five-number summary and outliers of a data set.

5
New cards

What is the significance of the IQR?

The IQR (interquartile range) is a measure of statistical dispersion and is resistant to outliers.

6
New cards

Define skewness in distributions.

Skewness indicates the direction and degree of asymmetry of a distribution.

7
New cards

What are outliers?

Extreme values that differ significantly from other observations in a dataset.

8
New cards

How do you find the range of a dataset?

Range is calculated as max - min.

9
New cards

What does the coefficient of determination (R²) indicate?

The proportion of the variance in the dependent variable that is predictable from the independent variable(s).

10
New cards

What are z-scores used for in statistics?

To measure the standard deviations away from the mean.

11
New cards

What information does the correlation coefficient (r) provide?

The strength and direction of a linear relationship between two variables.

12
New cards

What does a negative correlation indicate?

As one variable increases, the other variable tends to decrease.

13
New cards

What is the 68-95-99.7 rule?

In a normal distribution, approximately 68% of values fall within one standard deviation, 95% fall within two, and 99.7% fall within three standard deviations from the mean.

14
New cards

How can you determine whether to use mean or median?

Use mean for symmetric distributions and median for skewed distributions.

15
New cards

What is linear regression used for?

To predict the value of one variable based on the value of another variable.

16
New cards

What does 'sensitive to outliers' mean for standard deviation and range?

Values can be significantly affected by extreme data points.

17
New cards

What is a lurking variable?

A variable that is not included in the analysis but affects both the independent and dependent variables.