Stat 250 final

0.0(0)
studied byStudied by 5 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/187

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

188 Terms

1
New cards

data

observations gathered for analysis; numerical or non-numerical

2
New cards

cases

subjects we obtain data about

3
New cards

when making comparisons, we would like to

determine association and determine causation

4
New cards

statistical inference

the process of using data from a sample to gain information about the population

5
New cards

sampling bias

occurs when the method of selecting a sample causes the sample to differ from the population in some relevant way

6
New cards

we try to obtain a sample that is (?) of the population

representative

7
New cards

simple random sampling

all groups of the population have the same chance of being chosen

8
New cards

is a voluntary sample a good sampling method?

no

9
New cards

association

values of one variable tends to be related to values of the other variable

10
New cards

causation

changing the value of the explanatory variable influences the value of the response variable

11
New cards

T/F: association automatically means causation

no; association does not imply causation

12
New cards

confounding variable

third variable that is associated with both the explanatory and response variable

13
New cards

observational study

a study in which the researcher does not actively control the value of any variable, but simply observes the values as they naturally exist

14
New cards

experiment

a study in which the researcher actively controls one or more of the explanatory variables; aka randomized experiment

15
New cards

which study can find causation

experiment

16
New cards

do confounding variables exist in a randomized experiement

no

17
New cards

three explanations for why association may be observed in sample data

  1. there is a causal relationship or association

  2. there is an association, but it is due to confounding variables

  3. there is no association; it is random chance

18
New cards

how do you avoid confounding variables

use random assignment

19
New cards

randomized comparative experiment

randomizing cases into different groups and then comparing results to response variable

20
New cards

matched pairs experiment

each case gets both treaments in random order and examine individual differences

21
New cards

random sampling vs random assignment

each unit has same chance of being chosen vs placing units into groups by chance

22
New cards

random selection allows us to

make generalizations about the population

23
New cards

random assignment allows us to

make conclusions about causality

24
New cards

frequency

number of times a value is observed in a data set

25
New cards

relative frequency

number of times a value is observed divided by the total number of observations

26
New cards

the sum of all relative frequencies is

1

27
New cards

one categorical variable summary statistics

frequency table, proportion

28
New cards

one categorical variable visualization

bar or pie chart

29
New cards

two categorical variable summary statistics

two-way table, difference is proportions

30
New cards

two categorical variable visualization

segmented or side-by-side bar chart

31
New cards

mode

the category that occurs most frequently

32
New cards

segmented bar chart

the height of each bar represents the frequency of one categorical variable and the segmented colors split each bar by the other categorical variable

33
New cards

side by side bar chart

separate bar charts are given for each group of one of the categorical variables

34
New cards

visualizing one quantitative variable

dot plot or histogram

35
New cards

shapes

symmetric or skewed

36
New cards

measures of center

mean or median

37
New cards

right skewed

tail of distribution extends out to the right

38
New cards

left skewed

tail of distribution extends out to the left

39
New cards

resistance

we say a statistic is resistant if it is unaffected by extreme values

40
New cards

which measure of center is impacted by utliers

mean

41
New cards

which measure of center is not impacted by outliers

median

42
New cards

left skewed mean vs median

mean < median

43
New cards

right skewed mean vs median

mean > median

44
New cards

standard deviation

a number that measures how far away the typical observation is from the mean

45
New cards

a larger standard deviation means

the data values are more spread out and have more variability

46
New cards

T/F: standard deviation is not affected by outliers and skeweness

false

47
New cards

IQR

Q3-Q1

48
New cards

only use the standard deviation as the measure of spread when you are using the (?) as the measure of center

mean

49
New cards

use this measure of center and this measure of spread when skewed

median and IQR

50
New cards

boxplot

graphical representation of the five-number summary

51
New cards

left skewed box plot

median line to the right of the box; left whisker longer

52
New cards

right skewed box plot

median line to the left of the box; right whisker longer

53
New cards

standard error

the standard deviation of the sample statistics

54
New cards

a low standard error means

statistics vary little from sample to sample

55
New cards

as the sample size increases, the variability of sample statistics tends to (?) and sample statistics tend to be (?) to the true value of the population parameter

decrease; closer

56
New cards

does the shape of the population affect the center of each sampling distribution?

no

57
New cards

confidence interval

captures parameter for a specified proportion of all samples

58
New cards

confidence interval formula

sample statistic ± critical value*(SE)

59
New cards

confidence interval interpretation

we are 95% confident that an interval captures the true population parameter

60
New cards

95% rule

if a distribution is approximately symmetric and bell-shaped, about 95% of the data should fall within 2 standard deviations of the mean

61
New cards

95% rule formula

statistic ± 2(SE)

62
New cards

bootstrapping

technique or simulating a sampling distribution when you do not have a population from which to sample

63
New cards

bootstrap sample

sample with replacement from the original sample using the same sample size

64
New cards

bootstrap distribution shape

bell shaped and symmetric

65
New cards

bootstrap distribution center

centered at sample statistic value

66
New cards

bootstrap confidence interval

when symmetric and bell-shaped, statistic ± 2(SE)

67
New cards

as the confidence level increases, the width of the confidence interval…

increases

68
New cards

as the sample size increases, the width of the confidence interval…

decreases

69
New cards

statistical test

used to determine whether results from a sample are convincing enough to allow us to conclude something about the population

70
New cards

goal of a hypothesis test

asses evidence provided by the sample data to test a claim made about a population parameter

71
New cards

null hypothesis

H-naught

72
New cards

alternative hypothesis

H-A

73
New cards

null hypothesis meaning

no change; always equals zero

74
New cards

alternative hypothesis meaning

claim for which we seek evidence; different from zero

75
New cards

hypothesis tests are always written in (?) notation

population parameter

76
New cards

two-sided hypothesis

H0: p1 = p2

HA: p1 does not equal p2

77
New cards

left-sided hypothesis

H0: p1 = p2

HA: p1 < p2

78
New cards

right-sided hypothesis

H0: p1 = p2

HA: p1 > p2

79
New cards

the null hypothesis is assumed to be (?) throughout the hypothesis test

true

80
New cards

how do you determine the p-value by hand?

count how many points were observed that are greater than or equal to the sample and divide that number by 100

81
New cards

p-value

proportion of samples that would give a statistic as extreme as the observed sample result when the null hypothesis is true

82
New cards

two-tailed p-value

when the alternative hypothesis contains a does not equal sign, the p-value is twice the proportion of the smallest tail

83
New cards

p-value < a

reject null hypothesis

84
New cards

p-value > a

do not reject the null hypothesis

85
New cards

smaller p values mean the sample results are

statistically significant

86
New cards

formal hypothesis test has only two possible conclusions

reject or do not reject the null hypothesis

87
New cards

possible significance levels

0.05, 0.01, 0.1

88
New cards

conclusions of hypothesis tests

conclude in terms of H-A in context of the question

89
New cards

type I error

occurs when we reject a true null hypothesis

90
New cards

type II error

occurs when we do not reject a false null hypothesis

91
New cards

if a = 0.05, there is a (?)% chance of a type I error

5

92
New cards

as a sample size increases, statistics in the randomization distribution will be more closely concentrated around the..

null value

93
New cards

a larger sample size (?) the chance of making a type II error

decreases

94
New cards

two methods of statistical inference

confidence intervals and hypothesis tests

95
New cards

sampling distribution

shows distribution of sample statistics obtained from a population, centered at true value of population parameter

96
New cards

bootstrap distribution

simulates a distribution of sample statistics for the population, centered at value of original sample statistic

97
New cards

randomization distribution

simulates a distribution of sample statistics for a population in which the null hypothesis is true, centered at value stated in null hypothesis

98
New cards

(-L, -U)

does not capture 0; reject the null hypothesis

99
New cards

(L, U)

does not capture 0; reject the null hypothesis

100
New cards

(-L, U)

captures 0; do not reject the null hypothesis