AP Statistics Vocabulary Flashcards

Description and Tags

Flashcards for reviewing AP Statistics vocabulary terms from Woody Nivens' notes.

107 Terms

1

Alternative Hypothesis

States that a treatment has had an effect or caused a change in the population.

2

Bias

Describes a study which systematically favors certain outcomes.

3

Binomial Distribution

The distribution of the probabilities of X successes out of n independent trials, calculated using p as the probability of success on any single trial – written B(n, p).
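
As a rough illustration of B(n, p), the sketch below computes P(X = k) from the standard binomial formula C(n, k) · p^k · (1 − p)^(n − k). The function name `binomial_pmf` and the coin-flip numbers are assumptions made for this example, not part of the original notes.

```python
from math import comb

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ B(n, p): k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Assumed example: 10 fair coin flips, exactly 3 heads (120 / 1024).
prob_three_heads = binomial_pmf(3, 10, 0.5)

# The probabilities over k = 0..n should add up to 1.
total = sum(binomial_pmf(k, 10, 0.5) for k in range(11))
```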

4

Blind

Describes an experiment in which the subjects do not know which treatment they are getting.

5

Blocking

A statistical design which creates groups that are similar in some way, and then randomizes the treatments within each block.

6

Central Limit Theorem

States that when an SRS of size n is drawn from any population with mean µ and standard deviation σ, the sampling distribution of the sample mean is approximately normal when n is large, with mean µ and standard deviation σ/√n.
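
The mean and standard deviation claims can be checked exactly on a tiny example. The sketch below assumes a population consisting of one fair die and samples of 3 independent rolls (a stand-in for an SRS from a very large population); it lists every possible sample rather than simulating. It only checks the mean µ and the standard deviation σ/√n, not the normal shape.

```python
from itertools import product
from math import sqrt
from statistics import mean, pstdev

population = [1, 2, 3, 4, 5, 6]   # assumed population: one fair die
n = 3                             # assumed sample size

mu = mean(population)
sigma = pstdev(population)

# Every equally likely sample of n independent draws, and its sample mean.
sample_means = [sum(sample) / n for sample in product(population, repeat=n)]

mean_of_means = mean(sample_means)    # should match mu
sd_of_means = pstdev(sample_means)    # should match sigma / sqrt(n)
expected_sd = sigma / sqrt(n)
```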

7

Chi-Square Distributions

A family of skewed-right distributions which take on only positive values and are defined by their degrees of freedom.

8

Chi-Square Goodness-of-Fit Test

Used to determine if a population has a certain hypothesized distribution.

9

Chi-Square Test for Homogeneity

Used to determine if the distribution of a categorical variable is the same across several populations.

10

Chi-Square Test for Independence

Used to determine if there is a relationship between two categorical variables – also known as Chi-Square Test for Association.

11

Coefficient of Determination

Tells what percent of the variation in the response variable is explained by its linear relationship with the explanatory variable – symbolized as r².

12

Complement of an Event

The set of all outcomes in the sample space that are not part of the event.

13

Conditional Probability

The probability of an event occurring if it is known that another specific event has already occurred.
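
One way to make this concrete is the usual formula P(A | B) = P(A and B) / P(B). The sketch below applies it to a single roll of a fair die; the events A and B and the helper `probability` are assumptions chosen only for illustration.

```python
from fractions import Fraction

# Assumed example: one roll of a fair six-sided die.
sample_space = {1, 2, 3, 4, 5, 6}
event_a = {2, 4, 6}   # A: the roll is even
event_b = {4, 5, 6}   # B: the roll is greater than 3

def probability(event: set) -> Fraction:
    """Probability of an event when all outcomes are equally likely."""
    return Fraction(len(event & sample_space), len(sample_space))

# P(A | B) = P(A and B) / P(B)
p_a_given_b = probability(event_a & event_b) / probability(event_b)
```

Here knowing that B occurred raises the probability of A from 1/2 to 2/3.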

14

Confidence Interval

An interval estimate of a parameter calculated using a sample from that population.
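
As one possible example, the sketch below builds a large-sample z-interval for a population proportion: p̂ ± z* · √(p̂(1 − p̂)/n). The function name `proportion_interval` and the sample counts are assumptions, and the method presumes the usual large-sample conditions hold.

```python
from math import sqrt
from statistics import NormalDist

def proportion_interval(successes: int, n: int, confidence: float = 0.95):
    """Large-sample z-interval for a population proportion."""
    p_hat = successes / n
    # Critical value z* leaving (1 - confidence) / 2 in each tail.
    z_star = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z_star * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin

# Assumed data: 60 successes in a sample of 100.
low, high = proportion_interval(60, 100)
```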

15

Confidence Level

The proportion of confidence intervals that would capture the true parameter if many intervals were calculated from samples of the same size.

16

Confounding Variable

A variable which could affect the result of a statistical test but has not been controlled for.

17

Continuous Random Variable

A random variable which takes on all values in an interval of numbers.

18

Control Group

Any group of subjects who receive either a placebo or no treatment at all during an experiment.

19

Correlation

Measures the direction and strength of the linear relationship between two quantitative variables – symbolized as r.
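
A minimal sketch of the calculation, using the common formula r = (1/(n − 1)) Σ z_x · z_y, where z_x and z_y are standardized values. The function name `correlation` and both data sets are made up for illustration.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Sample correlation r: sum of products of standardized values over n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum(((x - x_bar) / s_x) * ((y - y_bar) / s_y)
               for x, y in zip(xs, ys)) / (n - 1)

# Assumed data: a perfectly linear, increasing relationship.
r_perfect = correlation([1, 2, 3, 4], [3, 5, 7, 9])

# Assumed data: a weaker, still positive relationship.
r_weaker = correlation([1, 2, 3, 4, 5], [2, 1, 4, 3, 5])
```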

20

Critical Value

A value (z-score, t-score, or χ² value) used in a hypothesis test to help determine if the null hypothesis should be rejected.

21

Cumulative Distribution Function

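A function which gives the probability that a random variable X takes a value less than or equal to x – for a discrete random variable, this is the sum of the probabilities of all possible values up to and including x.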

22

Degrees of Freedom

A value used to help determine significance for a t-test or a Chi-Square test – measured as n-1 in most cases, or (r-1)(c-1) when dealing with two-way tables.

23

Dependent Trials

Trials whose probability is affected by the outcome of previous trials.

24

Dependent Variable

See Response Variable

25

Density Curve

A curve used to represent a distribution – a density curve is always on or above the horizontal axis and has a total area of exactly 1 underneath it.

26

Discrete Random Variable

A random variable with countable outcomes.

27

Disjoint Events

Events which cannot occur at the same time – also known as Mutually Exclusive Events.

28

Distribution

A list of what values a variable takes on and how often it takes on each one of those values.

29

Double Blind

Describes an experiment in which neither the subjects nor the researcher know which treatment each subject is getting.

30

Empirical Rule

Also known as the 68-95-99.7 rule – approximates what percent of the data falls within 1, 2, or 3 standard deviations of the mean in any normal distribution.
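
The three percentages can be recovered from the standard normal curve. The sketch below uses Python's `statistics.NormalDist`; the helper name `share_within` is an assumption for this example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k: float) -> float:
    """Proportion of a normal distribution within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

# Proportions within 1, 2, and 3 standard deviations of the mean.
within_one = share_within(1)
within_two = share_within(2)
within_three = share_within(3)
```

The results are close to 0.68, 0.95, and 0.997, which is where the rule gets its name.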

31

Expected Value

See Mean

32

Experimental Units

The individuals on which an experiment is conducted – if the test is being conducted on humans, the units are called Subjects.

33

Explanatory Variable

Attempts to explain the observed outcomes in a statistical study – also known as the Independent Variable.

34

Exploratory Data Analysis

Uses graphs and numerical summaries to describe the variables in a data set and the relationships among them.

35

Factor

Any explanatory variable in an experiment.

36

Five Number Summary

A method to describe a data set using the minimum, first quartile, median, third quartile, and maximum points in the data set.

37

Geometric Distribution

A distribution of probabilities of when the first successful outcome occurs in a probability experiment.
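
For a rough picture, the probability that the first success lands on trial k is (1 − p)^(k − 1) · p, assuming independent trials with the same success probability p. The function name `geometric_pmf` and the die example below are illustrative assumptions.

```python
def geometric_pmf(k: int, p: float) -> float:
    """P(first success occurs on trial k), with success probability p per trial."""
    return (1 - p) ** (k - 1) * p

# Assumed example: rolling a fair die until the first six appears.
p_six = 1 / 6
first_six_on_third_roll = geometric_pmf(3, p_six)   # (5/6)^2 * (1/6)

# Chance the first six arrives somewhere in the first 10 rolls.
within_ten = sum(geometric_pmf(k, p_six) for k in range(1, 11))
```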

38

Hypothesis Test

A type of inference used to assess whether sample data are consistent with an assumed value of a population parameter – also known as a Significance Test.

39

Independent Trials

Trials whose probabilities are not affected by the outcome of previous trials.

40

Independent Variable

See Explanatory Variable

41

Individuals

People or objects described by a set of data.

42

Inference

The statistical process of drawing conclusions about a population by examining data from a sample.

43

Influential Point

A point which, if removed from the data set, would markedly change the regression equation for that data set.

44

Interquartile Range (IQR)

The difference between the third and first quartiles of a data set.

45

Law of Large Numbers

States that as increased numbers of observations are drawn from any population, the mean of the observations eventually approaches the mean of the population as closely as we would like to estimate it, and remains that close or closer.

46

Least Squares Regression Line

A regression line which makes the sum of the squares of the vertical distances from the data points to the line as small as possible.
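
A minimal sketch of how such a line can be found, using the standard formulas slope = Σ(x − x̄)(y − ȳ) / Σ(x − x̄)² and intercept = ȳ − slope · x̄. The function name `least_squares_line` and the data are assumptions made for this example.

```python
from statistics import mean

def least_squares_line(xs, ys):
    """Return (intercept, slope) minimizing the sum of squared vertical distances."""
    x_bar, y_bar = mean(xs), mean(ys)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
             / sum((x - x_bar) ** 2 for x in xs))
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Assumed data for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
intercept, slope = least_squares_line(xs, ys)

# Observed y minus predicted y for each point; these differences sum to zero.
residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
```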

47

Level

A specific value of a factor in an experiment.

48

Matched Pairs

A statistical design which compares two treatments using pairs of similar subjects – this is often done by giving the same subjects each treatment over a different time period.

49

Mean

The “average” of a data set – also known as the Expected Value.

50

Median

The point at which 50% of the data is above and 50% of the data is below.

51

Mutually Exclusive Events

See Disjoint Events

52

Nonresponse

A type of bias that occurs when an individual chosen for a sample cannot be contacted or chooses not to participate.

53

Normal Distribution

A symmetric, bell-shaped distribution in which approximately 68% of the data lies within one standard deviation of the mean, 95% lies within two standard deviations of the mean, and 99.7% lies within three standard deviations of the mean.

54

Null Hypothesis

States that either a treatment has had no effect on a population, or that the population has not changed.

55

Observation

Any single point from a data set.

56

Outlier

An individual observation that falls outside the pattern of the data set – often defined as any value more than 1.5 × IQR below Q1 or above Q3.
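
The 1.5 × IQR rule can be sketched as follows. Quartile conventions differ between textbooks and calculators, so the choice of `method="inclusive"` in `statistics.quantiles` is an assumption, as are the function name `find_outliers` and the data.

```python
from statistics import quantiles

def find_outliers(data):
    """Flag values more than 1.5 * IQR below Q1 or above Q3."""
    q1, _median, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [x for x in data if x < lower_fence or x > upper_fence]

# Assumed data: nine typical values and one unusually large one.
outliers = find_outliers([1, 2, 3, 4, 5, 6, 7, 8, 9, 50])
```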

57

P-value

The probability of obtaining a result as extreme as or more extreme than the one actually observed, assuming the null hypothesis is true.
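
As a hedged example, the sketch below computes a two-sided P-value for a one-sample z-test of a mean with known σ. The function name `z_test_p_value` and all of the numbers are assumptions chosen for illustration.

```python
from math import sqrt
from statistics import NormalDist

def z_test_p_value(sample_mean, mu_0, sigma, n):
    """Two-sided P-value for a one-sample z-test of H0: mu = mu_0 with known sigma."""
    z = (sample_mean - mu_0) / (sigma / sqrt(n))
    # Area in one tail beyond |z| under the standard normal curve.
    one_tail = 1 - NormalDist().cdf(abs(z))
    return 2 * one_tail

# Assumed numbers: H0 says mu = 100, sigma = 15, and 36 observations average 105.
p_value = z_test_p_value(105, 100, 15, 36)
```

With these numbers the test statistic is z = 2, and the P-value comes out a little under 0.05.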

58

Parameter

A number that describes a population.

59

Percentile

Tells what percent of a data set falls below the given observation.

60

Placebo

A false treatment which should have no effect on an experiment – placebos should appear to be the same as the actual treatment.

61

Pooled Procedures

Occurs when separate samples are combined into a single sample for analysis – this should only be done if it is known that the variances of the two populations are equal.

62

Population

The entire group of individuals that we want information about.

63

Power of a Hypothesis Test

The probability that the test will reject the null hypothesis when the null hypothesis is false – the power is equal to 1 minus (probability of a Type II error for the given alternative).
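
To show how power depends on the specific alternative, the sketch below works out the power of an upper-tailed one-sample z-test with known σ. The function name `power_upper_tail_z` and every number used are assumptions for this example only.

```python
from math import sqrt
from statistics import NormalDist

def power_upper_tail_z(mu_0, mu_alt, sigma, n, alpha=0.05):
    """Power of the z-test of H0: mu = mu_0 vs Ha: mu > mu_0 when mu is really mu_alt."""
    se = sigma / sqrt(n)
    # Reject H0 when the sample mean exceeds this cutoff.
    cutoff = mu_0 + NormalDist().inv_cdf(1 - alpha) * se
    # Type II error: the sample mean stays at or below the cutoff although mu = mu_alt.
    beta = NormalDist(mu_alt, se).cdf(cutoff)
    return 1 - beta

# Assumed numbers: mu_0 = 100, true mean 105, sigma = 15, n = 36, alpha = 0.05.
power = power_upper_tail_z(100, 105, 15, 36)
```

When the alternative value equals the null value, the same function returns α, the chance of rejecting a true null hypothesis.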

64

Probability

The proportion of times an outcome would occur over a large number of trials.

65

Probability Distribution Function

A function which assigns a probability for each possible value for any discrete random variable X.

66

Proportion

Tells what percent of a data set falls into a given category.

67

Qualitative Variable

A variable which takes on a non-numeric description.

68

Quantitative Variable

A variable which takes on a numeric value.

69

Quartiles

Observations which fall at the 25th, 50th, and 75th percentiles of a data set.

70

Range

The difference between the maximum and minimum values of a data set.

71

Random

When individual outcomes are uncertain, but there is a pattern to the distribution of the outcomes over time.

72

Random Variable

A variable whose value is a numeric outcome of a random phenomenon.

73

Randomization

Using the laws of probability to make selections by chance – this is done to select members for a sample and also to assign treatments to experimental units in experiments.

74

Regression Line

A straight line that describes how a response variable changes as the explanatory variable changes.

75

Residual

The difference between an observed value of a response variable and its predicted value from a regression equation.

76

Response Variable

Measures the outcome of a statistical study – also known as the Dependent Variable.

77

Robustness

A measure of how much the P-value of a test is affected if the conditions of the hypothesis test are not met.

78

Sample

A part of the population used to gather information about the entire population.

79

Sample Space

A list of all possible outcomes for a random event.

80

Sampling Distribution

A distribution of values taken by a statistic in all possible samples of the same size from the same population.

81

Sampling Frame

A list from which a sample is chosen – ideally the sampling frame consists of the entire population.

82

Significance Level

The threshold probability, symbolized as α, at or below which a P-value leads to a result being declared statistically significant.

83

Significance Test

See Hypothesis Test

84

Simple Random Sample (SRS)

A sample in which every member of the population has the same probability to be chosen, and every group of size n has the same probability to be chosen.

85

Simulation

A method for collecting data which uses the laws of probability to represent all possible outcomes of an experiment.

86

Skewed

Describes a distribution whose histogram extends much farther to one side of the mean than the other – the distribution is said to be skewed in the direction of this “tail”.

87

Standard Deviation

Square root of the variance – used as a common measure of spread for a data set.
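
A short worked sketch of the calculation for a sample, where the variance divides the sum of squared deviations by n − 1. The data values are assumed, and the hand calculation is compared against `statistics.stdev` as a check.

```python
from math import sqrt
from statistics import mean, stdev

data = [2, 4, 4, 4, 5, 5, 7, 9]   # assumed data

x_bar = mean(data)
# Sample variance: squared deviations from the mean, divided by n - 1.
variance = sum((x - x_bar) ** 2 for x in data) / (len(data) - 1)
standard_deviation = sqrt(variance)

# The library function should agree with the hand calculation.
library_value = stdev(data)
```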

88

Standard Error

The standard deviation of a sampling distribution, usually estimated from sample data – measures how much a statistic typically varies from sample to sample.

89

Standard Normal Distribution

A normal distribution with a mean of zero and a standard deviation of one.

90

Standardized Score

See z-Score

91

Statistic

A number that describes a sample.

92

Statistically Significant

An observed effect so large that it would be unlikely to occur by chance alone if the null hypothesis were true.

93

Stratified Random Sample

A sample chosen by splitting the population into several well-defined groups, then taking an SRS from each group.

94

Subjects

See Experimental Units

95

Symmetric

Describes a distribution whose histogram has its left and right sides as mirror images of each other.

96

t-Distributions

A family of symmetric, bell-shaped distributions with a standard deviation larger than that of the standard normal distribution – each t-distribution is defined by its degrees of freedom, so its specific shape changes as the sample size changes.

97

Treatment

A specific experimental condition applied to an experimental unit or subject.

98

Treatment Group

A group of subjects who receive an actual treatment during an experiment.

99

Type I Error

When the null hypothesis is rejected but it is in fact true – the probability of a Type I Error is the significance level for that test.

100

Type II Error

When the null hypothesis is not rejected but it is in fact false – the probability of a Type II Error must be calculated for a specific alternative test value.