AP Statistics Semester 1 Final Review

Tags and Description

Study guide and note content from units 1 - 7

105 Terms

1

mean

values in a data set added up and divided by the number of included values

2

median

middle value when data set is placed in ascending order

3

standard deviation

measure that quantifies how spread out the data values are around the mean

4

variance

standard deviation squared
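
As a quick check on the four summary cards above, here is a minimal sketch using Python's standard `statistics` module. The data set `scores` is made up for illustration, and the sample versions of standard deviation and variance (dividing by n - 1) are assumed.

```python
import statistics

# Hypothetical data set, chosen only for illustration
scores = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(scores)            # sum of values / number of values
median = statistics.median(scores)        # middle of the sorted data (average of the two middle values here)
sample_sd = statistics.stdev(scores)      # sample standard deviation (divides by n - 1)
sample_var = statistics.variance(scores)  # sample variance = standard deviation squared

print(mean, median, sample_sd, sample_var)
```

Squaring `sample_sd` reproduces `sample_var` up to rounding, which is the relationship stated on the variance card.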

5

quantitative

numerical values - values that can be averaged

6

qualitative/categorical

values that are generally words, or grouped numbers - cannot “average”

7

range

the difference between the highest and lowest values in a data set

8

first quartile

middle value between the minimum and the median (the median of the bottom half)

9

third quartile

middle value between the median and the maximum (the median of the top half)

10

Interquartile range (IQR)

the third quartile minus the first quartile (Q3 - Q1)

11

outliers

extreme values that are more than 1.5 x IQR below the first quartile or above the third quartile
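
The quartile, IQR, and outlier cards fit together as one calculation. The sketch below assumes the "median of each half" convention from the quartile cards (the overall median is left out of both halves when n is odd); the function names and the data are hypothetical.

```python
import statistics

def five_number_summary(data):
    """Return (min, Q1, median, Q3, max), with quartiles as medians of the halves."""
    values = sorted(data)
    n = len(values)
    half = n // 2
    lower = values[:half]        # bottom half (excludes the median when n is odd)
    upper = values[n - half:]    # top half (excludes the median when n is odd)
    return (values[0], statistics.median(lower), statistics.median(values),
            statistics.median(upper), values[-1])

def find_outliers(data):
    """Flag values more than 1.5 x IQR below Q1 or above Q3."""
    _, q1, _, q3, _ = five_number_summary(data)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [x for x in data if x < low_fence or x > high_fence]

data = [1, 3, 4, 5, 6, 7, 8, 9, 30]   # hypothetical values
summary = five_number_summary(data)
outliers = find_outliers(data)
print(summary, outliers)
```

Here Q1 = 3.5 and Q3 = 8.5, so the IQR is 5 and the upper fence is 16; only the value 30 falls outside the fences. Calculators and software may use slightly different quartile conventions.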

12

resistant

resists the effects of outliers (e.g. median, IQR)

13

nonresistant

influenced by the existence of outliers (e.g. mean, standard deviation, range)

14

boxplot

displays the general distribution of data using the five-number summary (minimum, first quartile, median, third quartile, maximum)

15

dotplot

each value is represented by a dot above a number line - good for seeing individual values in a small data set

16

histogram

displays data grouped into bins of the same width; the height of each bar shows the frequency of values in that bin

17

bar chart

similar to histogram, but used for categorical variables - bars don’t touch and x-axis values are not continuous

18

stem-and-leaf plot

displays all but the last digit of each individual value as a stem, and last digit is the leaf - key must be included

19

statistical inference

methods for drawing conclusions about a population from sample data, with a stated degree of confidence

20

population

entire group of individuals to which the data is being generalized

21

sample

part of the group that is being studied

22

simple random sample

every possible sample of size n has the same chance of being selected

23

probability sample

each member of the population has a known chance greater than zero of being selected

24

stratified random sample

dividing a population into groups of similar members (strata) and then choosing an SRS within each group to form the full sample

25

multistage sample design

sampling in successive stages, e.g. selecting t counties, then x townships within those counties, then y blocks in each township, and finally z households

26

cluster random sample

total population is divided into groups (clusters), a random sample of the clusters is selected, and every member of the chosen clusters is included
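
To contrast the sampling designs on the last few cards, here is a minimal sketch using Python's `random` module. The population of 12 students, the grade labels, and the cluster size are all hypothetical choices made for illustration.

```python
import random

random.seed(1)  # fixed seed so the illustration is repeatable

# Hypothetical population: 12 students, each tagged with a grade level
population = [(f"student{i}", "junior" if i < 6 else "senior") for i in range(12)]

# Simple random sample: every possible group of 4 is equally likely
srs = random.sample(population, 4)

# Stratified random sample: split by grade (the strata), then take an SRS of 2 from each
strata = {}
for person in population:
    strata.setdefault(person[1], []).append(person)
stratified = [p for group in strata.values() for p in random.sample(group, 2)]

# Cluster sample: split into 4 intact groups of 3, then randomly pick 2 whole groups
clusters = [population[i:i + 3] for i in range(0, 12, 3)]
cluster_sample = [p for group in random.sample(clusters, 2) for p in group]

print(srs, stratified, cluster_sample, sep="\n")
```

The stratified sample always contains two juniors and two seniors, while the SRS and the cluster sample can come out unbalanced by chance.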

27

bias

a flaw in the design of a study that systematically favors certain outcomes

28

voluntary response sample

sample that consists of people who choose themselves by responding to a general appeal

29

nonresponse

individual chosen for the sample can’t be contacted or refuses to cooperate

30

confounding variables

two variables whose effects on a response variable cannot be distinguished from each other

31

convenience sample

sample made from groups that are easiest to reach

32

response bias

when a respondent lies about sensitive information or telescopes the timing of an event

33

observational study

data collector observes and measures variables of interest, but does not attempt to influence the responses

34

statistically significant

an observed effect too large to attribute plausibly to chance

35

experiment

A study that deliberately imposes a treatment on units and measures the response; the most effective way to show a cause-and-effect relationship between two or more variables

36

double-blind experiment

experiment where neither the subject nor the data collector knows which treatment the subject is receiving

37

matched pairs

special case of randomized block design used when the experiment has only two treatment conditions

38

blocking

grouping similar units to allow one to draw more specific, separate conclusions

39

experimental units

members on which an experiment is done

40

subjects

experimental units that are human beings

41

treatment

condition applied to a member or group

42

factor

an explanatory variable in an experiment (an experiment may have several)

43

level

specific value of a factor

44

placebo

dummy treatment with no active ingredient that can still have an effect (the placebo effect)

45

control group

group that receives no treatment or a sham (placebo) treatment, used as a baseline for comparison

46

randomization

use of chance to divide experimental units into groups

47

principles of experimental design

  1. Control - basis for comparison

  2. Randomization - fair choice of experimental units/subjects

  3. Replication - need to ensure that results continue to tell the same story
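
Randomization in particular is easy to show in code. The sketch below is one possible way to randomly assign subjects to groups; the function name `randomly_assign`, the subject labels, and the two treatment names are hypothetical.

```python
import random

def randomly_assign(units, treatments, seed=None):
    """Shuffle the units, then deal them out evenly across the treatments."""
    rng = random.Random(seed)
    shuffled = list(units)
    rng.shuffle(shuffled)                      # chance, not the experimenter, decides the order
    groups = {t: [] for t in treatments}
    for i, unit in enumerate(shuffled):
        groups[treatments[i % len(treatments)]].append(unit)
    return groups

subjects = [f"subject{i}" for i in range(1, 21)]   # 20 hypothetical subjects
groups = randomly_assign(subjects, ["treatment", "placebo"], seed=42)
print(groups)
```

The placebo group plays the role of the control, and using 10 subjects per group rather than 1 reflects replication.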

48

hidden bias

occurs when the experimenter does not treat all the subjects the exact same way

49

Median formula

position of the median in the ordered data set: (N + 1)/2
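
Note that (N + 1)/2 gives a position in the sorted list, not the median itself. A minimal sketch of how that position is used, with a hypothetical helper name:

```python
def median_by_position(data):
    """Find the median by locating position (N + 1)/2 in the sorted data."""
    values = sorted(data)
    n = len(values)
    position = (n + 1) / 2                 # 1-based position in the ordered list
    if position.is_integer():              # odd N: a single middle value
        return values[int(position) - 1]
    lower = values[int(position) - 1]      # even N: position falls between two values
    upper = values[int(position)]
    return (lower + upper) / 2

print(median_by_position([7, 1, 3]))       # position 2 -> 3
print(median_by_position([1, 2, 3, 4]))    # position 2.5 -> 2.5
```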

50

Standardized Score (z-score)

the number of standard deviations a value is from the mean of its respective data set
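
In formula form this is z = (value - mean) / standard deviation. A short sketch, with made-up test-score numbers:

```python
def z_score(value, mean, sd):
    """Number of standard deviations that value lies from the mean."""
    return (value - mean) / sd

# Hypothetical example: a score of 85 on a test with mean 70 and standard deviation 10
z = z_score(85, 70, 10)
print(z)  # 1.5
```

A negative z-score means the value is below the mean.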

51

normal distribution

bell-shaped curve centered at the mean of the data, with approximately 68%, 95%, and 99.7% of values falling within 1, 2, and 3 standard deviations of the mean
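
The usual shorthand for how a normal curve is distributed is the 68-95-99.7 rule. A minimal sketch that checks those percentages with `statistics.NormalDist` from the Python standard library:

```python
from statistics import NormalDist

standard = NormalDist(mu=0, sigma=1)   # standard normal curve

# Area within k standard deviations of the mean, for k = 1, 2, 3
within = {k: standard.cdf(k) - standard.cdf(-k) for k in (1, 2, 3)}

for k, area in within.items():
    print(f"within {k} sd: {area:.4f}")
```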

52

types of distribution problems

  1. raw value → formula → z-score → normalcdf → percentile

  2. percentile → invNorm → z-score → algebra → raw value
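
Both problem types can be sketched in Python. The helpers `normalcdf` and `inv_norm` below are hypothetical stand-ins written to mimic the calculator functions, built on `statistics.NormalDist`; the mean of 100 and standard deviation of 15 are made-up values.

```python
from statistics import NormalDist

def normalcdf(lower, upper, mean=0.0, sd=1.0):
    """Area under the normal curve between lower and upper."""
    dist = NormalDist(mean, sd)
    return dist.cdf(upper) - dist.cdf(lower)

def inv_norm(area, mean=0.0, sd=1.0):
    """Value with the given area to its left under the normal curve."""
    return NormalDist(mean, sd).inv_cdf(area)

mean, sd = 100, 15   # hypothetical distribution

# Type 1: raw value -> formula -> z-score -> normalcdf -> percentile
raw = 130
z = (raw - mean) / sd
percentile = normalcdf(float("-inf"), z)

# Type 2: percentile -> invNorm -> z-score -> algebra -> raw value
z_90 = inv_norm(0.90)
raw_90 = mean + z_90 * sd    # solve z = (x - mean) / sd for x

print(z, percentile, z_90, raw_90)
```

A raw value of 130 has z = 2 and sits at roughly the 97.7th percentile; the 90th percentile corresponds to z of about 1.28.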

53

symmetric, normal shape

bell-shaped; mean, median, and mode are approximately the same

54

symmetric, but not normal

mean and median are the same, mode may be different

55

skewed left

values drag out to the left (smaller numbers)

56

skewed right

values drag out to the right (larger numbers)

57

Best descriptive statistics when distribution is symmetric

mean and standard deviation

58

Best descriptive statistics when distribution is skewed

median and IQR

59

statistic

value that describes a sample (e.g. sample mean, sample standard deviation)

60

parameter

value that describes a population (e.g. population mean, population standard deviation)

61

sampling distribution

the sampling distribution of a statistic is the distribution of values taken by the statistic in all possible samples of the same size from the same population

62

steps to create a sampling distribution

  1. Take a large number of samples of the same size from the same population

  2. Calculate the p-hat or x-bar for each sample

  3. Make a histogram of these values

  4. Examine the distribution displayed in the histogram for overall pattern (shape), center, and spread
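
The four steps above can be sketched as a small simulation. The population (10,000 random integers from 1 to 100), the sample size of 25, and the 1,000 repetitions are all assumptions chosen for illustration.

```python
import random
import statistics
from collections import Counter

rng = random.Random(0)
population = [rng.randint(1, 100) for _ in range(10_000)]  # hypothetical population

# Steps 1 and 2: take many samples of the same size and compute x-bar for each
sample_size = 25
x_bars = [statistics.mean(rng.sample(population, sample_size)) for _ in range(1000)]

# Step 3: crude text histogram of the x-bars, using bins of width 5
bins = Counter(int(x // 5) * 5 for x in x_bars)
for start in sorted(bins):
    print(f"[{start:3d}, {start + 5:3d}) {'*' * (bins[start] // 10)}")

# Step 4: examine center and spread (shape can be read off the histogram)
center = statistics.mean(x_bars)
spread = statistics.stdev(x_bars)
print(center, spread)
```

The x-bars pile up around the population mean, and their spread is much smaller than the spread of the individual population values.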

63

Bias versus Unbiased

A statistic is unbiased if the mean of its sampling distribution equals the true value of the parameter. If samples are collected randomly, the sample means center on the population mean -- this is considered unbiased

64

variability

across the many samples that make up a sampling distribution, the larger the size of each sample, the closer the sample means tend to be to the population mean (bigger sample = less variability)

65

Central Limit Theorem

The sampling distribution of the sample means from any population (regardless of its shape) will be approximately normal, provided the sample size of the individual samples is large enough (generally 30+)
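
A simulation makes this concrete. The sketch below assumes a strongly right-skewed population (exponential with mean 1 and standard deviation 1) and compares sample means for n = 2 and n = 40; using "mean minus median" as a rough skewness check is a simplification chosen for this example.

```python
import random
import statistics

rng = random.Random(1)

def sample_mean(n):
    """Mean of one sample of size n from a right-skewed exponential population."""
    return statistics.fmean([rng.expovariate(1.0) for _ in range(n)])

means_n2 = [sample_mean(2) for _ in range(5000)]
means_n40 = [sample_mean(40) for _ in range(5000)]

def skew_gap(values):
    """Mean minus median: close to 0 when a distribution is symmetric."""
    return statistics.fmean(values) - statistics.median(values)

print("n = 2 :", skew_gap(means_n2), statistics.stdev(means_n2))
print("n = 40:", skew_gap(means_n40), statistics.stdev(means_n40))
```

With n = 40 the gap between mean and median nearly vanishes, and the standard deviation of the sample means is close to 1/sqrt(40), about 0.158.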

66

Sample means

The mean of the sampling distribution of x-bar equals the population mean, so x-bar (the sample mean) is an unbiased estimator of the population mean

67

Sampling distribution requirements

  1. SRS

  2. n is greater than or equal to 30 (sample size of each individual sample is n)

68

Sample proportions

mean of the sampling distribution of p-hat is p (therefore p-hat is an unbiased estimator of p)

69

Sample proportion requirements

  1. SRS

  2. np and n(1-p) are both greater than or equal to 10
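
A minimal sketch of the large-counts check, together with the center and spread of the sampling distribution of p-hat. The standard deviation formula sqrt(p(1-p)/n) is the standard one but is not stated on the cards, and the function name and example values are hypothetical.

```python
import math

def p_hat_sampling_distribution(n, p):
    """Return (large counts condition met?, mean of p-hat, sd of p-hat)."""
    large_counts_ok = n * p >= 10 and n * (1 - p) >= 10
    mean = p                                # p-hat is an unbiased estimator of p
    sd = math.sqrt(p * (1 - p) / n)
    return large_counts_ok, mean, sd

ok, mean, sd = p_hat_sampling_distribution(n=100, p=0.3)
too_small, _, _ = p_hat_sampling_distribution(n=20, p=0.1)   # np = 2, fails the check
print(ok, mean, sd, too_small)
```

The SRS requirement cannot be checked by a formula; it depends on how the data were collected.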

70

1 proportion z-test

testing a hypothesis regarding the proportion of a single population -- looking for evidence to reject Ho and statistically support Ha

71

2 prop. z-test

testing a hypothesis regarding the equivalence of the proportions of two populations -- determining if the evidence shows statistically a difference or higher/lower value between the two proportions

72

1 & 2 prop z-test: step 1

hypothesis; null and alternative hypothesis, and defining the parameter(s)

73

1 prop. z-test: step 2

type and conditions

A) one-proportion z-test

B) conditions (1. SRS, 2. successes and failures greater than or equal to 10)

74

2 prop z-test: step 2

type and conditions

A) two-proportion z-test

B) conditions (1. SRS, 2. success and failures greater than or equal to 5, 3. fair to believe the two populations are independent of each other)

75

1 & 2 prop. z-test: step 3

calculations; z-score, p-value

76

1 & 2 prop. z-test: step 4

conclusion; “based on our evidence [p-value compared to significance level], we [reject/fail to reject] the null hypothesis, so there [is/isn’t] significant evidence to support the alternative hypothesis [in context].”
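
The calculation and conclusion steps can be sketched in Python. The formulas are the standard one- and two-proportion z statistics (the two-proportion version assumes the pooled proportion, as used when Ho says the proportions are equal); the function names, counts, and the 0.05 significance level are hypothetical.

```python
import math
from statistics import NormalDist

def p_value(z, alternative="two-sided"):
    """Tail area of the standard normal curve for the given alternative."""
    cdf = NormalDist().cdf
    if alternative == "greater":
        return 1 - cdf(z)
    if alternative == "less":
        return cdf(z)
    return 2 * (1 - cdf(abs(z)))

def one_prop_z_test(successes, n, p0, alternative="two-sided"):
    p_hat = successes / n
    z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)   # standard error uses p0 from Ho
    return z, p_value(z, alternative)

def two_prop_z_test(x1, n1, x2, n2, alternative="two-sided"):
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)                    # combined proportion under Ho
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    return z, p_value(z, alternative)

alpha = 0.05
z1, p1_value = one_prop_z_test(60, 100, p0=0.5)       # hypothetical: 60 successes in 100
z2, p2_value = two_prop_z_test(60, 100, 50, 100)      # hypothetical: 60/100 vs 50/100

for label, p in (("1-prop", p1_value), ("2-prop", p2_value)):
    decision = "reject" if p < alpha else "fail to reject"
    print(f"{label}: p-value = {p:.4f}, so we {decision} the null hypothesis")
```

In this example the one-proportion test gives z = 2 and rejects Ho at the 0.05 level, while the two-proportion test does not.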

77

1 prop. z-interval

using sample proportion to estimate a range of values that are likely to contain the population proportion

78

2 prop. z-interval

using our sample proportions to estimate a range of values that are likely to contain the difference in population proportions

79

1 & 2 prop. z-interval: step 1

defining in a sentence the population proportion/difference in proportions that we are hoping to estimate (e.g. “estimate the true proportion”)

80

1 prop. z-interval: step 2

type and conditions

A) one proportion z-interval

B) conditions (1. SRS, 2. success and failures greater than or equal to 10)

81

2 prop. z-interval: step 2

type and conditions

A) two proportion z-interval

B) conditions (1. SRS, 2. success and failures greater than or equal to 5, 3. fair to believe the two populations are independent of each other)

82

1 & 2 prop. z-interval: step 3

calculation; (calculator or formula)

83

1 & 2 prop. z-interval: step 4

interpretation; “we are ___% confident that our interval (___, ___) contains the true proportion/difference in proportions of [parameter of interest].”
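
For the "formula" route in the calculation step, here is a minimal sketch of both intervals using the standard formulas (statistic plus or minus z* times the standard error). The function names and the counts are hypothetical.

```python
import math
from statistics import NormalDist

def z_star(confidence):
    """Critical value leaving (1 - confidence)/2 in each tail."""
    return NormalDist().inv_cdf(1 - (1 - confidence) / 2)

def one_prop_z_interval(successes, n, confidence=0.95):
    p_hat = successes / n
    margin = z_star(confidence) * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin

def two_prop_z_interval(x1, n1, x2, n2, confidence=0.95):
    p1, p2 = x1 / n1, x2 / n2
    se = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)   # unpooled for intervals
    margin = z_star(confidence) * se
    return (p1 - p2) - margin, (p1 - p2) + margin

low1, high1 = one_prop_z_interval(60, 100)           # hypothetical: 60 successes in 100
low2, high2 = two_prop_z_interval(60, 100, 50, 100)  # hypothetical: 60/100 vs 50/100
print(f"We are 95% confident that ({low1:.3f}, {high1:.3f}) contains the true proportion")
print(f"We are 95% confident that ({low2:.3f}, {high2:.3f}) contains the true difference")
```

The second interval contains 0, so these made-up data are consistent with no difference between the two proportions.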

84

Type I Error

Rejecting Ho when Ho is true

85

Type II Error

Failing to reject Ho when Ha is true

86

Power

The probability of correctly rejecting Ho when Ha is true (1 minus the probability of a Type II error)

87

How can you increase power?

  • Increase n (the best option)

  • Increase α (the significance level)

  • Move Ho and Ha further apart

  • Decrease σ (the standard deviation)
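
Each of the four levers (sample size n, significance level alpha, distance between Ho and Ha, and standard deviation sigma) can be checked numerically. The sketch below assumes a simplified setting that is not on the cards: a one-sided z-test for a mean with known sigma. All numbers are hypothetical.

```python
import math
from statistics import NormalDist

def power_one_sided_z(mu0, mu_a, sigma, n, alpha=0.05):
    """Power of a z-test of Ho: mu = mu0 against Ha: mu > mu0 when the true mean is mu_a."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)            # rejection cutoff under Ho
    shift = (mu_a - mu0) / (sigma / math.sqrt(n))     # how far Ha sits from Ho, in standard errors
    return 1 - std_normal.cdf(z_crit - shift)

baseline = power_one_sided_z(mu0=100, mu_a=105, sigma=15, n=25)
bigger_n = power_one_sided_z(mu0=100, mu_a=105, sigma=15, n=100)
bigger_alpha = power_one_sided_z(mu0=100, mu_a=105, sigma=15, n=25, alpha=0.10)
further_apart = power_one_sided_z(mu0=100, mu_a=110, sigma=15, n=25)
smaller_sigma = power_one_sided_z(mu0=100, mu_a=105, sigma=10, n=25)

print(baseline, bigger_n, bigger_alpha, further_apart, smaller_sigma)
```

Each change raises the power above the baseline of roughly 0.51. Note that increasing alpha also raises the chance of a Type I error.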

88

Calculator function: x → z → %

normalcdf

89

Calculator function: % → z → x

invNorm

90

1 sample t-test

Testing a hypothesis regarding the mean of a single population -- looking for evidence to reject Ho and statistically support Ha

91

2 sample t-test

Testing a hypothesis regarding the equivalence of the means of two populations -- determining if the evidence shows statistically a difference or higher/lower value between the two means

92

1/2 sample t-test: step 1

null and alternative hypothesis, define the parameter

93

1 sample t-test: step 2

Types and Conditions:

A) 1-sample t-test

B) Conditions (1. SRS, 2. Normality)

94

2 sample t-test: step 2

Types and conditions:

A) 2-sample t-test

B) Conditions (1. SRS’s, 2. Normality, 3. Independence between the two samples)

95

1/2 sample t-test: step 3

Calculations:

  • test statistic

  • degrees of freedom

  • p-value

96

1/2 sample t-test: step 4

conclusion; “based on our evidence [p-value compared to significance level], we [reject/fail to reject] the null hypothesis, so there [is/isn’t] significant evidence to support the alternative hypothesis [in context].”
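
The test statistic and degrees of freedom from the calculation step can be sketched with the standard library. The two-sample version assumes unpooled variances and reports both the conservative degrees of freedom (smaller n minus 1) and the Welch approximation that calculators typically use; the function names and data are hypothetical. Python's standard library has no t distribution, so the p-value would still come from a calculator or a t table.

```python
import math
import statistics

def one_sample_t(data, mu0):
    """Return (t statistic, degrees of freedom) for Ho: mu = mu0."""
    n = len(data)
    standard_error = statistics.stdev(data) / math.sqrt(n)
    t = (statistics.mean(data) - mu0) / standard_error
    return t, n - 1

def two_sample_t(a, b):
    """Return (t statistic, conservative df, Welch df) for Ho: mu_a = mu_b."""
    n1, n2 = len(a), len(b)
    v1 = statistics.variance(a) / n1
    v2 = statistics.variance(b) / n2
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(v1 + v2)
    conservative_df = min(n1, n2) - 1
    welch_df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, conservative_df, welch_df

t1, df1 = one_sample_t([2, 4, 4, 4, 5, 5, 7, 9], mu0=4)          # hypothetical data
t2, df2_low, df2_welch = two_sample_t([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
print(t1, df1)
print(t2, df2_low, df2_welch)
```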

97

1 sample t-interval

Using our sample mean to estimate a range of values that are likely to contain the population mean

98

2 sample t-interval

using our sample means to estimate a range of values that are likely to contain the difference in population means

99

1/2 sample t-interval: step 1

Defining the parameter we are estimating (“estimate the true mean/difference”)

100

1 sample t-interval: step 2

Type and conditions:

A) 1-sample t-interval

B) Conditions (1. SRS, 2. Normality)
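
Once the conditions are checked, the interval itself is x-bar plus or minus t* times s/sqrt(n). A minimal sketch follows; because Python's standard library has no t distribution, the critical value `t_star` is passed in by hand (2.365 is the 95% value for 7 degrees of freedom from a standard t table). The function name and data are hypothetical.

```python
import math
import statistics

def one_sample_t_interval(data, t_star):
    """Confidence interval for a population mean, given the critical value t*."""
    n = len(data)
    x_bar = statistics.mean(data)
    margin = t_star * statistics.stdev(data) / math.sqrt(n)
    return x_bar - margin, x_bar + margin

data = [2, 4, 4, 4, 5, 5, 7, 9]     # hypothetical sample, n = 8 so df = 7
low, high = one_sample_t_interval(data, t_star=2.365)
print(f"95% t-interval for the true mean: ({low:.2f}, {high:.2f})")
```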

