AP Statistics Cumulative AP Exam Study Guide

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/100

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards for AP Statistics Exam Review.

Statistics

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

101 Terms

1
New cards

Statistics

The science of collecting, analyzing, and drawing conclusions from data.

2
New cards

Descriptive Statistics

Methods of organizing and summarizing statistics.

3
New cards

Inferential Statistics

Making generalizations from a sample to the population.

4
New cards

Population

An entire collection of individuals or objects.

5
New cards

Sample

A subset of the population selected for study.

6
New cards

Variable

Any characteristic whose value changes.

7
New cards

Categorical (Qualitative) Variable

Basic characteristics.

8
New cards

Numerical (Quantitative) Variable

Measurements or observations of numerical data.

9
New cards

Discrete Variable

Listable sets (counts).

10
New cards

Continuous Variable

Any value over an interval of values (measurements).

11
New cards

Univariate

One variable.

12
New cards

Bivariate

Two variables.

13
New cards

Multivariate

Many variables.

14
New cards

Symmetrical Distribution

Data on which both sides are fairly the same shape and size. “Bell Curve”

15
New cards

Uniform Distribution

Every class has an equal frequency (number) “a rectangle”

16
New cards

Skewed Distribution

One side (tail) is longer than the other side. The skewness is in the direction that the tail points (left or right)

17
New cards

Bimodal Distribution

Data of two or more classes have large frequencies separated by another class between them. “double hump camel”

18
New cards

S.O.C.S.

How to describe numerical graphs: Shape, Outliers, Center, Spread

19
New cards

Parameter

Value of a population (typically unknown).

20
New cards

Statistic

A calculated value about a population from a sample(s).

21
New cards

Median

The middle point of the data (50th percentile) when the data is in numerical order.

22
New cards

Mean

μ is for a population (parameter) and x is for a sample (statistic).

23
New cards

Mode

Occurs the most in the data. There can be more then one mode, or no mode at all if all data points occur once.

24
New cards

Variability

Allows statisticians to distinguish between usual and unusual occurrences.

25
New cards

Range

A single value – (Max – Min).

26
New cards

IQR

Interquartile range – (Q3 – Q1).

27
New cards

Standard Deviation

σ for population (parameter) & s for sample (statistic) – measures the typical or average deviation of observations from the mean – sample standard deviation is divided by df = n-1

28
New cards

Variance

Standard deviation squared.

29
New cards

Resistant

Not affected by outliers.

30
New cards

Non-Resistant

Affected by outliers.

31
New cards

Z-Score

Is a standardized score that tells you how many standard deviations from the mean an observation is.

32
New cards

Normal Curve

Is a bell shaped and symmetrical curve.

33
New cards

Empirical Rule (68-95-99.7)

Measures 1σ, 2σ, and 3σ on normal curves from a center of μ.

34
New cards

Boxplots

Are for medium or large numerical data. It does not contain original observations. Always use modified boxplots where the fences are 1.5 IQRs from the ends of the box (Q1 & Q3). Points outside the fence are considered outliers.

35
New cards

5-Number Summary

Minimum, Q1 (1st Quartile – 25th Percentile), Median, Q3 (3rd Quartile – 75th Percentile), Maximum

36
New cards

Sample Space

Is collection of all outcomes.

37
New cards

Event

Any sample of outcomes.

38
New cards

Complement

All outcomes not in the event.

39
New cards

Union

A or B, all the outcomes in both circles. A ∪ B

40
New cards

Intersection

A and B, happening in the middle of A and B. A ∩ B

41
New cards

Mutually Exclusive (Disjoint)

A and B have no intersection. They cannot happen at the same time.

42
New cards

Independent

If knowing one event does not change the outcome of another.

43
New cards

Experimental Probability

Is the number of success from an experiment divided by the total amount from the experiment.

44
New cards

Law of Large Numbers

As an experiment is repeated the experimental probability gets closer and closer to the true (theoretical) probability.

45
New cards

Correlation Coefficient – (r)

Is a quantitative assessment of the strength and direction of a linear relationship. Values – [-1, 1]

46
New cards

Least Squares Regression Line (LSRL)

Is a line of mathematical best fit that minimizes the deviations (residuals) from the line. Used with bivariate data.

47
New cards

Residuals (error)

Is vertical difference of a point from the LSRL. All residuals sum up to “0”.

48
New cards

Residual Plot

A scatterplot of (x (or ŷ) , residual). No pattern indicates a linear relationship.

49
New cards

Coefficient of Determination (r^2)

Gives the proportion of variation in y (response) that is explained by the relationship of (x, y).

50
New cards

Extrapolation

LRSL cannot be used to find values outside of the range of the original data.

51
New cards

Influential Points

Are points that if removed significantly change the LSRL.

52
New cards

Outliers

Are points with large residuals.

53
New cards

Census

A complete count of the population.

54
New cards

Sampling Frame

Is a list of everyone in the population.

55
New cards

Sampling Design

Refers to the method used to choose a sample.

56
New cards

SRS (Simple Random Sample)

One chooses so that each unit has an equal chance and every set of units has an equal chance of being selected.

57
New cards

Stratified Sample

Divide the population into homogeneous groups called strata, then SRS each strata.

58
New cards

Systematic Sample

Use a systematic approach (every 50th) after choosing randomly where to begin.

59
New cards

Cluster Sample

Based on location. Select a random location and sample ALL at that location.

60
New cards

Bias

Error – favors a certain outcome, has to do with center of sampling distributions – if centered over true parameter then considered unbiased

61
New cards

Voluntary Response

People choose themselves to participate.

62
New cards

Convenience Sampling

Ask people who are easy, friendly, or comfortable asking.

63
New cards

Undercoverage

Some group(s) are left out of the selection process.

64
New cards

Non-response

Someone cannot or does not want to be contacted or participate.

65
New cards

Experimental Unit

Single individual or object that receives a treatment.

66
New cards

Factor

Is the explanatory variable, what is being tested.

67
New cards

Level

A specific value for the factor.

68
New cards

Response Variable

What you are measuring with the experiment.

69
New cards

Treatment

Experimental condition applied to each unit.

70
New cards

Control Group

A group used to compare the factor to for effectiveness – does NOT have to be placebo.

71
New cards

Placebo

A treatment with no active ingredients (provides control).

72
New cards

Blinding

A method used so that the subjects are unaware of the treatment (who gets a placebo or the real treatment).

73
New cards

Double Blinding

Neither the subjects nor the evaluators know which treatment is being given.

74
New cards

Replication

Uses many subjects to quantify the natural variation in the response.

75
New cards

Randomization

Uses chance to assign the subjects to the treatments.

76
New cards

Completely Randomized Design

All units are allocated to all of the treatments randomly.

77
New cards

Randomized Block Design

Units are blocked and then randomly assigned in each block –reduces variation.

78
New cards

Matched Pairs Design

Units are matched up by characteristics and then randomly assigned.

79
New cards

Confounding Variables

Are where the effect of the variable on the response cannot be separated from the effects of the factor being tested.

80
New cards

Random Variable

A numerical value that depends on the outcome of an experiment.

81
New cards

Discrete Random Variable

A count of a random variable.

82
New cards

Continuous Random Variable

A measure of a random variable.

83
New cards

Fair Game

A fair game is one in which all pay-ins equal all pay-outs.

84
New cards

Binomial Distributions

Properties- two mutually exclusive outcomes, fixed number of trials (n), each trial is independent, the probability (p) of success is the same for all trials.

85
New cards

Geometric Distributions

Properties -two mutually exclusive outcomes, each trial is independent, probability (p) of success is the same for all trials. (NOT a fixed number of trials)

86
New cards

Sampling Distribution

Of a statistic is the distribution of all possible values of all possible samples.

87
New cards

Central Limit Theorem

When n is sufficiently large (n > 30) the sampling distribution is approximately normal even if the population distribution is not normal.

88
New cards

Confidence Intervals

Used to estimate the unknown population parameter.

89
New cards

Margin of Error

The smaller the margin of error, the more precise our estimate.

90
New cards

Hypothesis Testing

Tells us if a value occurs by random chance or not.

91
New cards

Null Hypothesis

H0 is the statement being tested. Null hypothesis should be “no effect”, “no difference”, or “no relationship”

92
New cards

Alternate Hypothesis

Ha is the statement suspected of being true.

93
New cards

P-Value

Assuming the null is true, the probability of obtaining the observed result or more extreme.

94
New cards

Level of Significance

α is the amount of evidence necessary before rejecting the null hypothesis.

95
New cards

Type I Error

Is when one rejects H0 when H0 is actually true. (probability is α)

96
New cards

Type II Error

Is when you fail to reject H0, and H0 is actually false. (probability is β)

97
New cards

Power of a Test

Is the probability that the test will reject the null hypothesis when the null hypothesis is false.

98
New cards

Chi-squared Test

Is used to test counts of categorical data.

99
New cards

Goodness of Fit Test

Is for univariate categorical data from a single sample. Does the observed count “fit” what we expect.

100
New cards

Test of Independence

Bivariate categorical data from one sample. Are the two variables independent or dependent?