AP Statistics Cumulative AP Exam Study Guide

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/121

flashcard set

Earn XP

Description and Tags

AP Statistics Exam Flashcards

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

122 Terms

1
New cards

Statistics

The science of collecting, analyzing, and drawing conclusions from data.

2
New cards

Descriptive Statistics

Methods of organizing and summarizing statistics

3
New cards

Inferential Statistics

Making generalizations from a sample to the population.

4
New cards

Population

An entire collection of individuals or objects.

5
New cards

Sample

A subset of the population selected for study.

6
New cards

Variable

Any characteristic whose value changes.

7
New cards

Data

Observations on single or multi-variables.

8
New cards

Categorical Variable

Basic characteristics (qualitative).

9
New cards

Numerical Variable

Measurements or observations of numerical data (quantitative).

10
New cards

Discrete Variable

Listable sets (counts).

11
New cards

Continuous Variable

Any value over an interval of values (measurements).

12
New cards

Univariate

One variable.

13
New cards

Bivariate

Two variables.

14
New cards

Multivariate

Many variables.

15
New cards

Symmetrical Distribution

Data on which both sides are fairly the same shape and size. “Bell Curve”

16
New cards

Uniform Distribution

Every class has an equal frequency (number) “a rectangle”

17
New cards

Skewed Distribution

One side (tail) is longer than the other side. The skewness is in the direction that the tail points (left or right)

18
New cards

Bimodal Distribution

Data of two or more classes have large frequencies separated by another class between them. “double hump camel”

19
New cards

Shape (S.O.C.S.)

Overall type (symmetrical, skewed right left, uniform, or bimodal)

20
New cards

Outliers (S.O.C.S.)

Gaps, clusters, etc.

21
New cards

Center (S.O.C.S.)

Middle of the data (mean, median, and mode)

22
New cards

Spread (S.O.C.S.)

Refers to variability (range, standard deviation, and IQR)

23
New cards

Parameter

Value of a population (typically unknown)

24
New cards

Statistic

A calculated value about a population from a sample(s).

25
New cards

Median

The middle point of the data (50th percentile) when the data is in numerical order.

26
New cards

Mean

μ is for a population (parameter) and x is for a sample (statistic).

27
New cards

Mode

Occurs the most in the data. There can be more then one mode, or no mode at all if all data points occur once.

28
New cards

Variability

Allows statisticians to distinguish between usual and unusual occurrences.

29
New cards

Range

A single value – (Max – Min)

30
New cards

IQR

Interquartile range – (Q3 – Q1)

31
New cards

Standard Deviation

Measures the typical or average deviation of observations from the mean – sample standard deviation is divided by df = n-1

32
New cards

Variance

Standard deviation squared

33
New cards

Resistant

Not affected by outliers.

34
New cards

Trimmed Mean

Use a % to take observations away from the top and bottom of the ordered data. This possibly eliminates outliers.

35
New cards

Z-Score

Is a standardized score. This tells you how many standard deviations from the mean an observation is. It creates a standard normal curve consisting of z-scores with a μ = 0 & σ = 1.

36
New cards

Normal Curve

Is a bell-shaped and symmetrical curve.

37
New cards

Empirical Rule (68-95-99.7)

Measures 1σ, 2σ, and 3σ on normal curves from a center of μ.

38
New cards

Boxplots

Are for medium or large numerical data. It does not contain original observations. Always use modified boxplots where the fences are 1.5 IQRs from the ends of the box (Q1 & Q3). Points outside the fence are considered outliers. Whiskers extend to the smallest & largest observations within the fences.

39
New cards

5-Number Summary

Minimum, Q1 (1st Quartile – 25th Percentile), Median, Q3 (3rd Quartile – 75th Percentile), Maximum

40
New cards

Sample Space

Is collection of all outcomes.

41
New cards

Event

Any sample of outcomes.

42
New cards

Complement

All outcomes not in the event.

43
New cards

Union

A or B, all the outcomes in both circles.

44
New cards

Intersection

A and B, happening in the middle of A and B.

45
New cards

Mutually Exclusive (Disjoint)

A and B have no intersection. They cannot happen at the same time.

46
New cards

Independent

If knowing one event does not change the outcome of another.

47
New cards

Experimental Probability

Is the number of success from an experiment divided by the total amount from the experiment.

48
New cards

Law of Large Numbers

As an experiment is repeated the experimental probability gets closer and closer to the true (theoretical) probability. The difference between the two probabilities will approach “0”.

49
New cards

Conditional Probability

Takes into account a certain condition.

50
New cards

Correlation Coefficient (r)

Is a quantitative assessment of the strength and direction of a linear relationship.

51
New cards

Least Squares Regression Line (LSRL)

Is a line of mathematical best fit. Minimizes the deviations (residuals) from the line. Used with bivariate data.

52
New cards

Residuals (error)

Is vertical difference of a point from the LSRL. All residuals sum up to “0”.

53
New cards

Residual Plot

A scatterplot of (x (or ŷ) , residual). No pattern indicates a linear relationship.

54
New cards

Coefficient of Determination (r^2)

Gives the proportion of variation in y (response) that is explained by the relationship of (x, y). Never use the adjusted r^2.

55
New cards

Extrapolation

LSRL cannot be used to find values outside of the range of the original data.

56
New cards

Influential Points

Are points that if removed significantly change the LSRL.

57
New cards

Outliers

are points with large residuals.

58
New cards

Census

A complete count of the population.

59
New cards

Sampling Frame

Is a list of everyone in the population.

60
New cards

Sampling Design

Refers to the method used to choose a sample.

61
New cards

SRS (Simple Random Sample)

One chooses so that each unit has an equal chance and every set of units has an equal chance of being selected.

62
New cards

Stratified Sampling

Divide the population into homogeneous groups called strata, then SRS each strata.

63
New cards

Systematic Sampling

Use a systematic approach (every 50th) after choosing randomly where to begin.

64
New cards

Cluster Sample

Based on location. Select a random location and sample ALL at that location.

65
New cards

Random Digit Table

Each entry is equally likely and each digit is independent of the rest.

66
New cards

Random # Generator

Calculator or computer program

67
New cards

Bias

Error – favors a certain outcome, has to do with center of sampling distributions – if centered over true parameter then considered unbiased

68
New cards

Voluntary Response

People choose themselves to participate.

69
New cards

Convenience Sampling

Ask people who are easy, friendly, or comfortable asking.

70
New cards

Undercoverage

Some group(s) are left out of the selection process.

71
New cards

Non-response

Someone cannot or does not want to be contacted or participate.

72
New cards

Response Bias

False answers – can be caused by a variety of things

73
New cards

Wording of the Questions

Leading questions.

74
New cards

Observational Study

Observe outcomes without giving a treatment.

75
New cards

Experiment

Actively imposes a treatment on the subjects.

76
New cards

Experimental Unit

Single individual or object that receives a treatment.

77
New cards

Factor

Is the explanatory variable, what is being tested

78
New cards

Level

A specific value for the factor.

79
New cards

Response Variable

What you are measuring with the experiment.

80
New cards

Treatment

Experimental condition applied to each unit.

81
New cards

Control Group

A group used to compare the factor to for effectiveness – does NOT have to be placebo

82
New cards

Placebo

A treatment with no active ingredients (provides control).

83
New cards

Blinding

A method used so that the subjects are unaware of the treatment (who gets a placebo or the real treatment).

84
New cards

Double Blinding

Neither the subjects nor the evaluators know which treatment is being given.

85
New cards

Control (Principles of Experimental Design)

Keep all extraneous variables (not being tested) constant

86
New cards

Replication (Principles of Experimental Design)

Uses many subjects to quantify the natural variation in the response.

87
New cards

Randomization (Principles of Experimental Design)

Uses chance to assign the subjects to the treatments.

88
New cards

Completely Randomized Design

All units are allocated to all of the treatments randomly

89
New cards

Randomized Block Design

Units are blocked and then randomly assigned in each block –reduces variation

90
New cards

Matched Pairs Design

Are matched up units by characteristics and then randomly assigned. Once a pair receives a certain treatment, then the other pair automatically receives the second treatment. OR individuals do both treatments in random order (before/after or pretest/post-test). Assignment is dependent

91
New cards

Confounding Variables

Are where the effect of the variable on the response cannot be separated from the effects of the factor being tested – happens in observational studies – when you use random assignment to treatments you do NOT have confounding variables!

92
New cards

Random Variable

A numerical value that depends on the outcome of an experiment.

93
New cards

Discrete Random Variable

A count of a random variable

94
New cards

Continuous Random Variable

A measure of a random variable

95
New cards

Discrete Probability Distributions

Gives values & probabilities associated with each possible x.

96
New cards

Fair Game

A fair game is one in which all pay-ins equal all pay-outs.

97
New cards

Binomial Distributions

Two mutually exclusive outcomes, fixed number of trials (n), each trial is independent, the probability (p) of success is the same for all trials,

98
New cards

Random variable (Binomial)

Is the number of successes out of a fixed # of trials. Starts at X = 0 and is finite.

99
New cards

Geometric Distributions

Two mutually exclusive outcomes, each trial is independent, probability (p) of success is the same for all trials. (NOT a fixed number of trials)

100
New cards

Random Variable (Geometric)

When the FIRST success occurs. Starts at 1 and is ∞.