Stats Vocab

0.0(0)

Studied by 0 people

0.0(0)

Call with Kai

Knowt Play

New

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/91

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

92 Terms

New cards

Individual

Objects described by a set of data (people, animals, things, etc.)

New cards

Variable

Any characteristic of an individual; can take different values.

New cards

Categorical variable

Places an individual into a group or category.

New cards

Quantitative variable

Takes numerical values for which arithmetic operations make sense.

New cards

Distribution

Tells what values a variable takes and how often.

New cards

Frequency table

Displays counts for each category.

New cards

Relative frequency table

Displays proportions or percentages for each category.

New cards

Two-way table

Describes two categorical variables.

New cards

Marginal distribution

Distribution of one variable among all individuals.

New cards

Conditional distribution

Distribution of one variable given a specific value of another variable.

New cards

Association

When knowing one variable helps predict another.

New cards

Dotplot

Graph showing each data value as a dot above a number line.

New cards

Stemplot

Displays data to show shape and distribution while retaining actual values.

New cards

Histogram

Displays distribution of a quantitative variable using bars.

New cards

Shape

Describes symmetry, skewness, peaks, and gaps.

New cards

Center

Describes typical value (mean, median).

New cards

Spread

Describes variability (range, IQR, standard deviation).

New cards

Outlier

Value that falls outside the overall pattern.

New cards

Resistant measure

Statistic not strongly affected by extreme values (median, IQR).

New cards

Density curve

Curve above the horizontal axis with area 1; shows distribution.

New cards

Median of a density curve

Divides area into two equal halves.

New cards

Mean of a density curve

Balance point of the curve.

New cards

Normal distribution

Symmetric, bell-shaped curve defined by mean (μ) and SD (σ).

New cards

68–95–99.7 Rule

Describes data within 1, 2, and 3 SDs of the mean.

New cards

Standard normal distribution

Normal distribution with mean 0 and SD 1.

New cards

z-score

Standardized value showing distance from mean in SDs: z = (x−μ)/σ.

New cards

Normal probability plot

Graph to assess Normality of data.

New cards

Scatterplot

Graph showing relationship between two quantitative variables.

New cards

Explanatory variable

Helps explain or predict changes in the response variable.

New cards

Response variable

Measures the outcome of a study.

New cards

Form

Overall pattern (linear, curved).

New cards

Direction

Indicates positive or negative association.

New cards

Strength

Describes how closely points follow a pattern.

New cards

Correlation (r)

Measures direction and strength of linear relationship.

New cards

Least-squares regression line (LSRL)

Line minimizing squared residuals: ŷ = a + bx.

New cards

Slope (b)

Change in predicted y for each 1-unit increase in x.

New cards

y-intercept (a)

Predicted value when x = 0.

New cards

Residual

Observed − predicted value (y − ŷ).

New cards

Coefficient of determination (r²)

Proportion of variation in y explained by x.

New cards

Residual plot

Graph of residuals versus x; checks fit of regression.

New cards

Influential point

Point that greatly changes correlation or slope if removed.

New cards

Population

Entire group we want to study or describe.

New cards

Sample

Subset of individuals from the population.

New cards

Census

Collects data from every individual in the population.

New cards

Sample survey

Collects data from a sample to generalize to the population.

New cards

Bias

Systematic error producing unrepresentative samples.

New cards

Voluntary response sample

People choose to participate; often biased.

New cards

Convenience sample

Chooses individuals easiest to reach; biased

New cards

Simple random sample (SRS)

Every group of n individuals has equal chance of selection.

New cards

Stratified random sample

Divides population into strata; SRS taken from each.

New cards

Cluster sample

Divides population into clusters; randomly selects clusters.

New cards

Undercoverage

Some groups left out of the sampling frame.

New cards

Nonresponse

Selected individuals can’t be contacted or refuse participation.

New cards

Response bias

Pattern of inaccurate answers due to wording or interviewer.

New cards

Observational study

Observes individuals without imposing treatment.

New cards

Experiment

Deliberately imposes treatment to measure response.

New cards

Explanatory variable (factor)

Variable manipulated in an experiment

New cards

Treatment

Specific condition applied to subjects.

New cards

Experimental units (subjects)

Individuals on which experiment is done.

New cards

Control group

Used for comparison; may receive placebo.

New cards

Random assignment

Uses chance to assign treatments; balances variables.

New cards

Replication

Using enough subjects to reduce chance variation.

New cards

Double-blind experiment

Neither subjects nor those interacting know treatments.

New cards

Statistically significant

Effect too large to be due to chance

New cards

Block design

Subjects grouped by similarity; treatments assigned within blocks

New cards

Matched pairs design

Compares two treatments using similar or same subjects.

New cards

Standard Deviation

The context typically varies by SD from the mean of mean.

New cards

Percentile:

percentile % of context are less than or equal to value.

New cards

z-score:

Specific value with context is z-score standard deviations above/below the mean.

New cards

Describe a distribution:

Be sure to address shape, center, variability, and outliers (in context).

New cards

Correlation (r):

The linear association between x-context and y-context is weak/moderate/strong

(strength) and positive/negative (direction).

New cards

Residual:

The actual y-context was residual above/below the predicted value when x-context = #.

New cards

y-intercept:

The predicted y-context when x = 0 context is y-intercept.

New cards

Slope:

The predicted y-context increases/decreases by slope for each additional x-context.

New cards

Standard Deviation of Residuals (s):

The actual y-context is typically about s away from the value

predicted by the LSRL.

New cards

Coefficient of Determination (r2):

About r2% of the variation in y-context can be explained by the

linear relationship with x-context.

New cards

Describe the relationship:

Be sure to address strength, direction, form and unusual features (in context).

New cards

Probability P(A):

After many many context, the proportion of times that context A will occur is about P(A).

New cards

Conditional Probability P(A|B):

Given context B, there is a P(A|B) probability of context A.

New cards

Expected Value (Mean, μ):

If the random process of context is repeated for a very large number of, the average number of x-context we can expect is expected value. (decimals OK).

New cards

Binomial Mean (μX):

After many, many trials the average # of success context out of n is μ#.

New cards

Binomial Standard Deviation (σX):

The number of success context out of n typically varies by σ#

from the mean of μ#.

New cards

Standard Deviation of Sample Proportions (σp%):

The sample proportion of success context typically varies by σ&' from the true proportion of p.

New cards

The sample proportion of success context typically varies by σ&' from the true proportion of p.

The sample mean amount of x-context typically varies

by σ*̅from the true mean of μ#.

New cards

Confidence Interval (A, B):

We are % confident that the interval from A to B captures the true

parameter context.

New cards

Confidence Level:

If we take many, many samples of the same size and calculate a confidence interval for each, about confidence level % of them will capture the true parameter in context

New cards

p-value:

Assuming H0 in context (H0), there is a p-value probability of getting the observed result

or less/greater/more extreme, purely by chance.

New cards

Conclusion for a Significance Test:

Because p-value p-value < / > α we reject / fail to reject H0. We

do / do not have convincing evidence for Ha in context.

New cards

Type 1 Error:

The H0 context is true, but we find convincing evidence for Ha context.

New cards

Type II Error:

The Ha context is true, but we don’t find convincing evidence for Ha context.

New cards

Power:

If Ha context is true at a specific value there is a power probability the significance test will

correctly reject H-.

New cards

Standard Error of the Slope (SEb):

The slope of the sample LSRL for x-context and y-context

typically varies from the slope of the population LSRL by about SE2.