QTM Test 3

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/116

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

117 Terms

1
New cards

Central limit theorem for sample mean

When we collect sufficiently large sample of n independent observations from a population with mean and a standard deviation, the sampling distribution of will be nearly normal with mean and standard error

2
New cards

Two conditions for central limit theorem

  1. Independent observations (random sample, <10% of pop)

  2. Normality: if n< 30 and no clear outliers, we assume, the sample came from a nearly normal population

    f n ≥ 30 and there are no extreme outliers, then we
    assume the sampling distribution is nearly normal

3
New cards

To test for mean we use

T distribution

4
New cards

What is t distribution used for

  • used for large and small samples because p-values are determined for each sample size using degrees of freedom (aka based on sample size)

5
New cards

Properties of t distribution

  • bell shaped

  • Symmetric so t is + or -

  • Mean =0, s >1

  • Less peaked

  • Different curve for each degree of freedom

6
New cards

Hypothesis test: one mean t-test

H0: mean =#

HA: mean =/ #

7
New cards

Conditions for hypothesis test: one mean t-test

  1. Independent observations

  2. approximately normal

8
New cards

Areas on t table get ____ as you move right

smaller

9
New cards

Paired data

each observation in one set is related to or
corresponds with exactly one observation in
the second sample

10
New cards

Example of pairs data

Before/after data (SAT before and after)

11
New cards

To analyze paired data, look at the

Difference in outcomes of each pair of observations

12
New cards

Hypotheses for mean of differences

H0=average difference (before-after) = 0

Ha= average difference(before-after) =/0

13
New cards

Paired t-test aka

mean of differences

14
New cards

Conditions for paired data mean test

  1. Independent observations

  2. Normal distribution

15
New cards

What line on test 3 formula sheet is for paired data

3

16
New cards

If you can’t find degrees of freedom in paired mean problem

Go to nearest number that is SMALLER (ex: if df=199, you go to 150 NOT 200)

17
New cards

Interpretation of inference alt vs null claim

Alt: “support”

Null: “reject”

18
New cards

Type 1 vs Type 2 Error

19
New cards

If CI is + or -, mean difference is

Higher/lower

20
New cards

Mean of differences

From dependent or matched pair samples (aka paired t-test)

21
New cards

difference of two means

From independent samples

22
New cards

Look at means of both samples

Mean 1= mean 2 (mean 1-mean 2=0)

23
New cards

Can difference of two means have different sample sizes? What about mean of differences?

Yes

No

24
New cards

Conditions for 2 means

  1. Both sample approx normal

  2. Independent observations within the samples

  3. Independent observations BETWEN SAMPLES, meaning two samples are not related

25
New cards

Which degrees of freedom do you use for two means

Smaller N-1

26
New cards

How do you recognize two means

Two of everything: SD, mean, samples

27
New cards

ANOVA

Analysis of variance

Used to compare means from two or more groups by using variance

28
New cards

Variance

Measures spread of data

29
New cards

Between vs within in anova formula is

On the top vs on the bottom

30
New cards

Variability within groups in anova is also known as

Sampling error

31
New cards

Variability between samples is

variability within samples is

Due to different treatments

Due to regular sampling error

32
New cards

F distribution is for

ANOVA

33
New cards

F distribution is always

Right skewed

34
New cards

Large F statistic

Higher variability between each group

35
New cards

Repeated measures

Use the same group of subjects with each treatment

36
New cards

One way ANOVA

Samples are compared using one category/factor/characteristic

37
New cards

Conditions for ANOVA

  1. Observations should be independent within groups

  2. Observations should be independent between routes

  3. The observations within each group should be nearly normal

  4. Variances across groups should be about equal

38
New cards

Post hoc

After the fact

39
New cards

ANOVA table: Row 1

Variability between sections

40
New cards

ANOVA table: Row 2

Residuals (aka error aka WITHIN)

41
New cards

K in Anova

Number of categories

42
New cards

Degrees of freedom of sections for ANOVA

K-1

43
New cards

Degrees of freedom in residuals in Anova

N-K

44
New cards

Sum sq

Sum of squares

45
New cards

How to find sum of squares in ANOVA

Difference, square, add up (WHAT THOUGH)

46
New cards

Mean sq

Subtract every grade from mean square and add up

Get by divide sum sq by df

47
New cards

Pr(>+F)

Probability of getting f or more extreme aka P VALUE

48
New cards

Power of the test

Probability of correctly rejecting the null (1-beta)

49
New cards

To increase power of a test

Increase sample size

Increase alpha (but type 1 more likely)

Decrease standard deviation

50
New cards

sampling distribution would be

the mean and SD of several means

51
New cards

formal definition of sampling distribution

distribution of a statistic across an infinite number of samples

52
New cards

SE of sample means

SD/sq root of sample size

53
New cards

population vs sampling distribution

population: more spread out because larger SE

sampling distribution: less spread out

54
New cards

most frequently used test statistic

t

55
New cards

t is more accurate for

small samples

56
New cards

what parameter is needed for t distribution

standard deviation

57
New cards

as degrees of freedom increase, the t distribution

approaches standard normal (aka peak goes higher)

58
New cards

unique aspect of tails of t distributions

tails are thicker, meaning more observations are more likely to fall beyond 2 SD from mean

59
New cards

t score and z scores are similar in that

both measure spread on one

60
New cards

how to find estimated p value

look on df row
for your test statistic, t.s., then go
straight up to the top for p-value for interval like chi square

61
New cards

interpreting inference format

there is not enough/enough evidence to support (alt)/reject (null) the claim that _____________. Then compare different.

62
New cards

interpretation of p value

if the mean ___ is ___,
the probability of getting our sample mean of _____ or more extreme is less than [pvalue], which is highly unlikely/likely.

63
New cards

interpreting confidence interval

we are % confidence that the true population mean is between _____ and _______

64
New cards

type 1 vs type 2 errors

type 1: reject null but you should’ve failed to reject

type 2: fail to reject but you should’ve rejected

65
New cards

how to find sample size of the mean

first formula on formula sheet (always round up)

66
New cards

how to get two sets of data into one

subtract

67
New cards

Parameter for paired t test

Average difference
between the reading and writing scores of
all high school students.

68
New cards

Point estimate for paired t test

Average difference between
the reading and writing scores of sampled
high school students.

69
New cards

subscript for paired data

d

70
New cards

hypothesis for two means test

H0: Mean 1- Mean 2 =0

HA: Mean 1- Mean 2 =/ 0

71
New cards

example of interpreting beta value (beta =0.15)

if beta = 0.15, the probability of failing to reject the null hypothesis when you should have rejected it is 0.15

the power would then be 1-0.15=0.85 so probability of correctly rejecting the false null hypothesis is .85

72
New cards

rejecting null means you found a difference so

if you tested hypothesis 100
times, [power %] of the tests would
detect the difference

73
New cards

hypothesis test for ANOVA

H0: μ1 = μ2 = ... = μk
HA: At least one mean is different

74
New cards

how to get f in ANOVA

mean sq between/mean sq within

75
New cards

how to set up post hoc tests

find Bonferroni alpha (a*) => used as new alpha for comparison

76
New cards

steps for post hoc tests

  1. get means and populations for each option

  2. plug into formula with whichever two data you are comparing (have to do for all combinations!)

77
New cards

how to get sample error from anova table?

mean sq from residual

78
New cards

overall anova conclusion

some combinations will work and some may not so write that in conclusion and make inference based on data

79
New cards

what is linear regression?

Determines if a linear relationship exists
If one numerical variable can predict another

80
New cards

can linear regression show you causation

no

81
New cards

how to model relationship between two variables

linear regression line w/ points in (x,y) form

82
New cards

x vs y in linear regression

x= explanatory variable

y= response variable

83
New cards

points in linear regression model

usually do not fall exactly on line, instead cluster around the line

84
New cards

why is regression line called a regression line

Regress means to move
towards a previous or less
formed state, so the
points would all be moving
toward the line in a
perfect world

85
New cards

correlation coefficient

described with R

scale from -1 to 1, with -1 and 1 having the strongest correlation (line) and 0 being the weakest (no correlation)

86
New cards

different names for regression line

line of best fit, least squares line

87
New cards

residuals of error

how far the points are from the line

difference between observed data and predicted data

88
New cards

how to calculate residual

observed – predicted (formula with e)

89
New cards

variables in residual formula (y and y-hat)

y= observed data.
y-hat= predicted data.

90
New cards

Residual > 0

predicted data is an
underestimate.

91
New cards

Residual < 0

the
predicted value is an
overestimate.

92
New cards

The Least Squares Method

square all the residuals/errors, add them

93
New cards

purpose of least squares method

to find the best line of fit

94
New cards

why least square method?

  • Most commonly used
    • Easier to compute
    • Highlights the errors

95
New cards

basis formula for line

y=mx+b (m is slope and b is y intercept)

96
New cards

how to find slope for regression line

formula with b1=

97
New cards

how to find intercept for regression line

formula with b0=

98
New cards

interpret the slope

For each additional % _____ in ______, we would
expect the % _______ to decrease/increase (based on positive or negative) on average by ___% (slope)

99
New cards

interpret the intercept

situation when explanatory variable (x) is 0, ___% (intercept) of situation

100
New cards

interpretations of the intercept tend to be

unbelievable