Stats Vocab Ch 1


54 Terms

1

Categorical Variable

Variable that represents categories that place data into groups

2

Quantitative Variable

Variable for which the numbers act as numerical values with known units

3

Distribution

The possible values of a variable and the frequency that each value occurs

4

Frequency Table

Lists the categories for a categorical variable and displays the counts for each category

5

Relative Frequency Table

Lists the categories for a categorical variable and displays the proportion/percentage for each category
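
A minimal Python sketch of a frequency table and a relative frequency table. The pet data and the variable names are made up for illustration and are not part of the card set.

```python
from collections import Counter

# Hypothetical categorical data: favorite pet reported by 10 students.
pets = ["dog", "cat", "dog", "fish", "dog", "cat", "dog", "cat", "fish", "dog"]

# Frequency table: category -> count.
freq = Counter(pets)

# Relative frequency table: category -> proportion of all observations.
total = sum(freq.values())
rel_freq = {category: count / total for category, count in freq.items()}

# Print one row per category: name, count, percentage.
for category, count in freq.most_common():
    print(f"{category:<5} {count:>3} {rel_freq[category]:>6.0%}")
```

The proportions in a relative frequency table should always add up to 1 (100%).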

6

Bar Chart

Display where bars represent the count of each category for a categorical variable

7

Relative Frequency Bar Chart

Display where bars represent the proportion/percentage of each category for a categorical variable

8

Segmented Bar Chart (Stacked Bar Chart)

Display where one bar represents a "whole" that is proportionally divided by each category for a categorical variable

9

Pie Chart

Display where one circle represents a "whole" that is proportionally divided by each category for a categorical variable

10

Comparative Display

Display (of any type) that is used to directly compare two or more distributions at once

11

Stem & Leaf Plot

Display that shows both the distribution and the individual data values for a quantitative variable as shared "stems" with individual "leaves"

12

Dot Plot

Display where a dot is graphed on a single axis for each data value, stacking repeated values, thus showing the distribution of a quantitative variable

13

Histogram

Display where bars represent the count of values falling into intervals ("bins") for a quantitative variable, showing its distribution

14

Relative Frequency Histogram

Display where bars represent the proportion/percentage of values falling into intervals ("bins") for a quantitative variable, showing its distribution

15

Cumulative Relative Frequency Plot

Display where a line shows the percentage of observations that are less than or equal to particular values for a quantitative variable
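
A minimal Python sketch of the values such a plot would graph, assuming a made-up list of quiz scores. The function name is hypothetical.

```python
def cumulative_relative_frequency(data, value):
    """Proportion of observations less than or equal to `value`."""
    return sum(1 for x in data if x <= value) / len(data)

# Hypothetical quiz scores.
scores = [4, 6, 7, 7, 8, 8, 9, 10]

# One point of the plot per distinct score: (score, cumulative percentage).
for cutoff in sorted(set(scores)):
    print(cutoff, f"{cumulative_relative_frequency(scores, cutoff):.1%}")
```

The percentages never decrease from left to right and reach 100% at the maximum value.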

16

Percentile

The nth percentile is the value that falls above n% of the data (for example, the 90th percentile is above 90% of the data, marking off the top 10% of the data)

17

Context

Identifies what is being described/compared/analyzed

18

Shape

Describes the "look" of the distribution

19

Mode(s)

The most commonly occurring value(s) in a distribution, seen as hump(s) in displays. Can be unimodal (one mode), bimodal (two modes), or multimodal (three or more)

20

Uniform

A distribution that is roughly flat in shape, meaning there is no consequential mode

21

Symmetric

A distribution whose left & right halves from the center are approximately the same

22

Skewed

A distribution that is not symmetric and has a longer tail on one side. The direction of the skew is the side where the tail is (tail on left = skewed left, tail on right = skewed right)

23

Center

A value that attempts to summarize the entire distribution with a single number

24

Mean

The arithmetic average of a distribution. Sum of all the data values divided by the number of data values.
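
A minimal Python sketch of the calculation, assuming a small made-up data set. The standard library's statistics.mean is shown only as a cross-check.

```python
from statistics import mean

# Hypothetical data values.
values = [2, 4, 4, 5, 10]

# Sum of all the data values divided by the number of data values.
by_hand = sum(values) / len(values)

print(by_hand, mean(values))
```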

25

Median

The middle value of a distribution, where half the data is above and half of the data is below this value (50th percentile)

26

Spread

Describes how tightly the data is clustered around the center

27

Standard Deviation

Roughly the typical distance of a data value from the mean; calculated as the square root of the average squared deviation from the mean
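
A minimal Python sketch of the calculation, assuming made-up data and the sample convention of dividing by n - 1 (an assumption; some courses divide by n for a full population). It also computes the literal average distance from the mean to show that the two numbers are related but not identical.

```python
import math
from statistics import stdev

# Hypothetical data values with mean 5.
values = [2, 4, 4, 5, 10]
n = len(values)
m = sum(values) / n

# Squared deviations from the mean.
squared_devs = [(x - m) ** 2 for x in values]

# Sample standard deviation: divide by n - 1, then take the square root.
s = math.sqrt(sum(squared_devs) / (n - 1))

# Literal "average distance from the mean" (mean absolute deviation).
mean_abs_dev = sum(abs(x - m) for x in values) / n

print(s, stdev(values), mean_abs_dev)
```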

28

Quartile

One of three values (Q1, median, Q3) that divide a data set into four equal parts

29

1st Quartile (Lower)

The median of the lower half of the distribution (25th percentile), known as Q1

30

3rd Quartile (Upper)

The median of the upper half of the distribution (75th percentile), known as Q3

31

Interquartile Range (IQR)

The difference between the third and first quartiles (Q3 - Q1), which is the range covered by the middle 50% of the data
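
A minimal Python sketch of the quartiles and the IQR using the median-of-halves definitions from the quartile cards. It assumes at least two data values and, when the count is odd, leaves the overall median out of both halves (one common textbook convention; software may use other rules). The function name and data are hypothetical.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, median, Q3) using the median-of-halves method."""
    ordered = sorted(data)
    half = len(ordered) // 2
    lower = ordered[:half]    # values below the middle position
    upper = ordered[-half:]   # values above the middle position
    return median(lower), median(ordered), median(upper)

# Hypothetical data values.
values = [1, 3, 4, 6, 7, 8, 10, 12, 15]
q1, med, q3 = quartiles(values)
iqr = q3 - q1

print(q1, med, q3, iqr)
```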

32

Range

The difference between the maximum and minimum values in a data set

33

Outlier

A data value that falls outside of the overall pattern of the rest of the data, specifically more than 1.5 × IQR below Q1 or above Q3 (these cutoffs form your fences)
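
A minimal Python sketch of the fence rule, assuming made-up data whose quartiles (Q1 = 4, Q3 = 8) are supplied directly. The function name is hypothetical.

```python
def fences(q1, q3):
    """Return (lower fence, upper fence) for the 1.5 * IQR rule."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Hypothetical data with Q1 = 4 and Q3 = 8.
values = [1, 4, 5, 6, 7, 8, 21]
low, high = fences(4, 8)

# Any value beyond either fence is flagged as an outlier.
outliers = [x for x in values if x < low or x > high]

print(low, high, outliers)
```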

34

Resistant

A calculated summary statistic is resistant if outliers have little to no effect on it. For example, medians and IQRs are resistant, while means, standard deviations, and ranges are not
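
A minimal Python sketch of resistance, assuming made-up salary figures: adding one extreme value moves the mean a lot but barely moves the median.

```python
from statistics import mean, median

# Hypothetical salaries in thousands of dollars.
typical = [40, 45, 50, 55, 60]
with_outlier = typical + [500]  # one extreme value added

print(mean(typical), median(typical))
print(mean(with_outlier), median(with_outlier))
```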

35

5-Number Summary

Reports the minimum, Q1, median, Q3, and maximum of a distribution

36

Boxplot

Display that shows the 5-Number Summary as a central box, whiskers, and outliers, effectively dividing the data into quartiles

37

Shifting

Adding or subtracting a constant to every data value, which adds or subtracts that same constant to all measures of position and leaves measures of spread unchanged

38

Rescaling

Multiplying or dividing every data value by a constant, which multiplies or divides all measures of position by that same constant and all measures of spread by its absolute value
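
A minimal Python sketch comparing shifting (previous card) with rescaling, assuming made-up data, a shift of +10, and a scale factor of 3. The helper name summarize is hypothetical.

```python
from statistics import mean, median, stdev

def summarize(data):
    """Return (mean, median, standard deviation, range) for a list of numbers."""
    return mean(data), median(data), stdev(data), max(data) - min(data)

values = [2, 4, 4, 5, 10]            # hypothetical data
shifted = [x + 10 for x in values]   # shifting: add 10 to every value
rescaled = [x * 3 for x in values]   # rescaling: multiply every value by 3

print(summarize(values))
print(summarize(shifted))    # mean and median go up by 10; spread unchanged
print(summarize(rescaled))   # mean, median, and spread are all tripled
```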

39

Standardized Values

Values for which the units have been systematically eliminated, allowing for comparison, even if the original variables had different scales and/or units

40

z-Score

Standardized value that identifies how many standard deviations a value is from the mean; z-scores don't change a distribution's shape, but force the mean to 0 and standard deviation to 1
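
A minimal Python sketch of standardizing, assuming made-up data and the sample standard deviation: each z-score is (value - mean) / standard deviation, and the resulting z-scores have mean 0 and standard deviation 1.

```python
from statistics import mean, stdev

# Hypothetical data values (mean 5, sample standard deviation 3).
values = [2, 4, 4, 5, 10]
m, s = mean(values), stdev(values)

# z-score: how many standard deviations each value is from the mean.
z_scores = [(x - m) / s for x in values]

print([round(z, 2) for z in z_scores])
print(round(mean(z_scores), 6), round(stdev(z_scores), 6))
```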

41

Scatterplot

Display that shows the relationship between two quantitative variables measured for the same subjects on an x-y coordinate plane

42

Association

Relationship between two quantitative variables, described by SDFOC (Strength, Direction, Form, Outliers, Context)

43

Strength

Describes how well/closely the data follows the identified pattern of a scatterplot

44

Direction

A positive direction means that one variable increases as the other increases...a negative direction means that one variable decreases as the other increases

45

Form

Describes the overall shape of a scatterplot; we focus on linear vs. non-linear

46

Outliers

Points that fall outside of the overall pattern of a scatterplot

47

Context

Identifies the two variables for which an association is being described

48

Explanatory Variable

The variable that is thought to explain or predict the response variable (x-axis)

49

Response Variable

The variable that is thought to be explained/predicted by the explanatory variable (y-axis)

50

Correlation Coefficient (r)

The number that describes both the direction and strength of the linear association between two quantitative variables, from -1 to 1, where -1 is a perfectly negative linear association and 1 is a perfectly positive linear association
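
A minimal Python sketch of one common way to compute r, as the sum of the products of paired z-scores divided by n - 1. The paired data (hours studied and quiz score) and the function name are made up for illustration.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation r: sum of products of paired z-scores, divided by n - 1."""
    n = len(xs)
    mx, my = mean(xs), mean(ys)
    sx, sy = stdev(xs), stdev(ys)
    return sum(((x - mx) / sx) * ((y - my) / sy) for x, y in zip(xs, ys)) / (n - 1)

# Hypothetical paired data: hours studied (x) and quiz score (y).
hours = [1, 2, 3, 4, 5]
scores = [2, 4, 5, 4, 5]

r = correlation(hours, scores)
print(round(r, 3))
```

Data that fall exactly on a line with positive slope give r = 1, and exactly on a line with negative slope give r = -1.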

51

Linear Model via Least Squares Regression Line of Best Fit

A linear equation that is used to simplify and represent an association, found via the line that minimizes the sum of the squared residuals
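
A minimal Python sketch of the standard least squares formulas for a line ŷ = a + bx, where the slope is b = Σ(x - x̄)(y - ȳ) / Σ(x - x̄)² and the intercept is a = ȳ - b·x̄. The data and function name are made up for illustration.

```python
def least_squares_line(xs, ys):
    """Return (intercept, slope) of the line minimizing the sum of squared residuals."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = my - slope * mx   # the line always passes through (x-bar, y-bar)
    return intercept, slope

# Hypothetical paired data: hours studied (x) and quiz score (y).
hours = [1, 2, 3, 4, 5]
scores = [2, 4, 5, 4, 5]

a, b = least_squares_line(hours, scores)
print(round(a, 2), round(b, 2))
```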

52

Predicted Value (ŷ)

The predicted y-value found for each x-value by substituting that x into the linear model producing the points (x, ŷ)...a "hat" in Statistics means that a value is predicted

53

Residual

The difference between an observed data value and the predicted value from the model...Residual = observed - predicted = y - ŷ
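
A minimal Python sketch of predicted values and residuals, assuming a hypothetical linear model ŷ = 2.2 + 0.6x and made-up data. A positive residual means the model underpredicted; a negative one means it overpredicted.

```python
def predict(x, intercept=2.2, slope=0.6):
    """Predicted value y-hat from a hypothetical linear model."""
    return intercept + slope * x

# Hypothetical paired data: hours studied (x) and quiz score (y).
hours = [1, 2, 3, 4, 5]
scores = [2, 4, 5, 4, 5]

# Residual = observed - predicted = y - y-hat.
residuals = [y - predict(x) for x, y in zip(hours, scores)]

print([round(e, 2) for e in residuals])
```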

54

Coefficient of Determination (R^2)

The square of the correlation coefficient, which gives the percentage of the variability of y that is accounted for by the least squares regression on x, from 0% to 100%. Provides an overall measure of how strong the regression is in linearly relating y to x.
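
A minimal Python sketch of R^2 as the share of variability in y accounted for by the line, computed as 1 - (sum of squared residuals) / (total sum of squares about the mean of y). The data and the line ŷ = 2.2 + 0.6x are made up for illustration, with that line assumed to be the least squares fit for these points.

```python
# Hypothetical paired data: hours studied (x) and quiz score (y).
hours = [1, 2, 3, 4, 5]
scores = [2, 4, 5, 4, 5]

# Predictions from the assumed least squares line y-hat = 2.2 + 0.6x.
predicted = [2.2 + 0.6 * x for x in hours]

mean_y = sum(scores) / len(scores)
sse = sum((y - p) ** 2 for y, p in zip(scores, predicted))  # unexplained variation
sst = sum((y - mean_y) ** 2 for y in scores)                # total variation in y

r_squared = 1 - sse / sst
print(f"{r_squared:.0%}")
```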
