Unit 2 Exploring Bivariate Data Vocabulary

0.0(0)

Studied by 9 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/34

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

35 Terms

New cards

Response Variable

Measures an outcome of a study

New cards

Explanatory variable

May help predict or explain changes in a response variable

New cards

Two-way table

A table of counts or relative frequencies that summarizes data on the relationship between two categorical variables for some group of individuals

New cards

Marginal relative frequency

Gives the percentage or proportion of individuals in a two-way table that have a specific value for one categorical variable. A [blank] relative frequency is calculated by dividing a row or column total by the total for the entire two-way table

New cards

Joint relative frequency

Gives the percentage or proportion of individuals in a two-way table that have a specific value for one categorical variable and a specific value for another categorical variable. A [blank] relative frequency is calculated by dividing the value in one cell by the total for the entire two-way table

New cards

Conditional relative frequency

Gives the percentage or proportion of individuals that have a specific value for one categorical variable among a group of individuals that share the same value of another categorical variable (the condition). A [blank] relative frequency is calculated by dividing the value in one cell of a two-way table by the total for the appropriate row or column

New cards

Segmented bar graph

Displays the distribution of a categorical variable as segments of a bar, with the area of each segment proportional to the number of individuals in the corresponding category

New cards

Mosaic plot

A modified segmented bar graph in which the width of each bar is proportional to the number of individuals in the corresponding category

New cards

Association

A [blank] between two variables if knowing the value of one variable helps us predict the value of the other. If knowing the value of one variable does not help us predict the value of the other, there is no [blank] between the variables

New cards

Scatterplot

Shows the relationship between two quantitative variables measured for the same individuals. The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each individual in the data set appears as a point in the graph

New cards

Positive association

Two variables have a [blank] association when the values of one variable tend to increase as the values of the other variable increase

New cards

Negative association

Two variables have a [blank] association when the values of one variable tend to decrease as the values of the other variable increase

New cards

No association

There is [blank] association between two variables if knowing the value of one variable does not help us predict the value of the other variable

New cards

Correlation

Gives the direction and measures the strength of the linear association between two quantitative variables

New cards

Correlation symbol

New cards

Correlation (formula)

New cards

Regression line

A line that models how a response variable y changes as an explanatory variable x changes. [blank] lines are expressed in the form ŷ = a + bx, where ŷ (pronounced “y-hat”) is the predicted value of y for a given value of x

New cards

y-hat or regression line

New cards

Extrapolation

The use of a regression line for a prediction outside the interval of x-values to obtain the line. The further we [blank], the less reliable the predictions become

New cards

Residual

The difference between the actual value of y and the value of y predicted by the regression line. That is, [blank] = actual y – predicted y (or ŷ)

New cards

y-intercept

Predicted value of y when x = 0

New cards

Slope

The among by which the predicted value of y changes when x

New cards

The slope

New cards

The y-intercept

New cards

Least-squares regression line

The line that makes the sum of the squared residuals as small as possible

New cards

Slope (formula)

New cards

y-intercept (formula)

New cards

Residual plot

A scatterplot that displays the residuals on the vertical axis and the values of the explanatory variable (or the predicted values) on the horizontal axis

New cards

Coefficient of determination

Measures the percent reduction in the sum of squared residuals when using the least-squares regression line to make predictions, rather than the mean value of y. In other words, [blank] measures the proportion (or percentage) of the variation in the response variable that is accounted for (or explained) by the explanatory variable in the linear model

New cards

r²

Coefficient of determination

New cards

Standard deviation of the residuals

Measures the size of a typical residual. That is, [blank] measures the typical distance between the actual y-values and the predicted y-values

New cards

Standard deviation of the residuals

New cards

High leverage

Points with [blank] in regression have much larger or much smaller x-values than the other points in the data set

New cards

Outlier

A point in a regression that does not follow the pattern of the data and the has a large residual

New cards

Influential point

Any point in a regression that, if removed, substantially changes the slope, y intercept, correlation, coefficient of determination, or standard deviation of the residuals