Chapter 1-3

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/24

flashcard set

Earn XP

Description and Tags

Flashcards covering key vocabulary from introductory statistics.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

Categorical Variable

Places an individual into a group or category (e.g., eye color, genre of music).

2
New cards

Discrete Quantitative Variable

Takes specific numerical values with gaps in-between (e.g., number of students in a class, shoe size).

3
New cards

Continuous Quantitative Variable

Takes any decimal value over a certain range of values (e.g., height, weight).

4
New cards

Mean

The sum of the data divided by the number of observations; a measure of center.

5
New cards

Median

The value with ≤ 50% of the other data below it and ≤ 50% above it; a measure of center.

6
New cards

Range

Maximum value minus minimum value; a measure of variability/spread.

7
New cards

Interquartile Range (IQR)

Q3 minus Q1; a measure of variability/spread.

8
New cards

Outlier

Any observation greater than Q3 + (1.5 * IQR) or less than Q1 – (1.5 * IQR).

9
New cards

Standard Deviation

Approximately how much, on average/typically, the values vary from the mean; a measure of spread/variability.

10
New cards

Variance

The square of standard deviation; a measure of spread/variability.

11
New cards

Percentile

The value in a distribution with that percent of the observations less than it.

12
New cards

Z-Score

Measures how many standard deviations above or below the mean a specific observation is.

13
New cards

Z-Distribution

The normal distribution with a mean of 0 and a standard deviation of 1.

14
New cards

Explanatory Variable

May help explain or predict changes in a response variable; the x-variable or “input”.

15
New cards

Response Variable

Measures the outcome of a study; the y-variable or “output.”

16
New cards

Scatterplot

Shows the relationship between two quantitative variables measured on the same individuals.

17
New cards

Correlation Coefficient (r)

Measures the strength and direction of the linear relationship between two quantitative variables; a number between -1 and 1.

18
New cards

Regression Line

A line that describes how a response variable y changes as an explanatory variable x changes; used to predict y from x in a scatterplot.

19
New cards

Residual

The difference between an observed y value from the actual data and a predicted y value from the regression line; prediction errors of the regression line.

20
New cards

Least-Squares Regression Line

The line that minimizes the sum of the squared residuals.

21
New cards

Residual Plot

Scatterplot with the residuals on the y-axis instead of the response variable; used to check the appropriateness of a linear model.

22
New cards

Standard Deviation of Residuals (s)

The approximate average size of the residuals or prediction errors; indicates how well the regression line fits the data.

23
New cards

Coefficient of Determination (r^2)

The proportion of the variation in the values of y that is explained by the least-squares regression line of y vs. x.

24
New cards

Influential Point

Any point which significantly affects slope, y-intercept, correlation coefficient r, standard deviation of residuals s, or coefficient of determination r^2 of the linear regression.

25
New cards

High-Leverage Point

Any point with an x value that is significantly above or below the rest of the x values in the data set.