Unit 6 : Inference for Proportions

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/34

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

35 Terms

1
New cards

H0

The old value that is hypothesized (assumed) to be true. Unless we get evidence otherwise, we assume this is still true.

H0: p = old/hypothesized value. This is also called the null hypothesis

2
New cards

In general, Ha will be one of :

Ha : P < hypothesized value

Ha : P > hypothezied value

Ha : P =/ hypothesized value

3
New cards

Test Statistic

Tells us how much evidence we have in favor of Ha (or how much evidence we have against Ho… how many Se’s the sample phat form the hypothesized value p

4
New cards

The test statistic is given by :

( sample statistic - null hypothesis value ) / SE

5
New cards

Sample statistic

the same estimate = phat

6
New cards

Hypothesized value

The H-0 value being tested = p

7
New cards

The average error between the sample result p^ and assumed true p is called

the standard error.

8
New cards

A large test statistic ~2 or ~3 means that

phat ia far from p, 2 or even 3 times the average amount of uncertainty due to randomness, so there is strong evidence in favor of Ha and we will likely reject H0

9
New cards

The p-value is the

tail area beyond the test statistic on the side specified by Ha.

10
New cards

The test statistic follows a __

standard normal distribution with mean = 0 and SD = 1. These are the default values for pnorm in R so we don’t need to include them

11
New cards

To get the p-value, use

pnorm (test statistic)

12
New cards

The P-value is the

tail area beyond the test statistic, on the side specified by Ha. This is the probability of getting a sample value the same as our current value or more extreme, under the assumption that H0 is true.

13
New cards

P VALUE < ALPHA :

Sufficient evidence to reject H0 (conclude Ha) —- IF THE P VALUE IS LOW, REJECT THAT HO!!!! (conclude Ha)

14
New cards

P-Value > ALPHA

Insufficient evidence to reject Ho (can’t conclude Ha), DO NOT REJECT THE HO

15
New cards

Small P-Value means

The observed sample result p^ is very unlikely. A small P-value means sufficient evidence to reject Ho and conclude Ha.

16
New cards

Large P-value means

The observed sample result p^ is quite reasonable, and the small difference could just be ordinary random variation. This large p-value means there is insufficient evident to reject H0 or to conclude Ha. Continue to give Ho the benefit of the doubt, but never ‘accept’ or ‘conclude’ Ho

17
New cards

If Ha points to the right, the P-value is the area on the right side. How do we compute this in R?

1-pnorm(test statistic)

18
New cards

Since normal distributions are symmetrical, the tail areas on the left and ride side are equal, so this means whenever Ha is one sided, we can use ___

pnorm(NEGATIVE TEST STATISTIC) to get the p-value

19
New cards

If Ha is =/, it is called

2-sided. This is DOUBLE the tail area on the smallest side.

20
New cards

To calculate p-value for a two sided Ha

2 x pnorm(negative value of test statistic)

21
New cards

The rejection threshold alpha is called the

significance level, and it is usually at 0.05, but sometimes 0.01

22
New cards

Alpha at 0.05 reduces what error

Type II

23
New cards

Alpha at 0.01 reduces what error

Typw I

24
New cards

In order for the test on a Normal Distribution to be valid, the sample size needs to be

at least np >= 10 and also n(1-p) >= 10.

25
New cards

How does increasing the sample size affect the P-value?

Larger sample sizes decrease the standard error, leading to larger test statistics, and smaller P-values, stronger evidence for Ha (easier to reject Ho)

26
New cards

Type I error

𝐻0 is really true, but a bad random sample leads
us to conclude 𝐻𝑎. This can be reduced by using 𝛼 = 0.01
(make it harder to reject 𝐻0).

27
New cards

Type II errror

𝐻𝑎 is really true, but a bad random sample leads
us to conclude 𝐻0. This can be reduced by using 𝛼 = 0.10
(make it easier to reject 𝐻0).

28
New cards

The critical value, z*

The value of the multiplier of standard errors that will produce the true P - within a normal curve. 95% of the time the true parameter p will be within 1.96 standard errors of the sample result phat

29
New cards

How to obtain z*

Use the function qnorm to find the cutoff value with 95% inside.

Since 0.95 is on the inside, there must be 0.05 outside, split equally with 0.025 above and 0.025 below. We could use qnorm 0.025 or qnorm 0.975. We discard the negative sign, it doesnt matter when doing +-

30
New cards

A confidence interval captures

the true p from the whole population

31
New cards

How should we interpret a confidence interval?

We are 95% confident the true proportion of ALL x who y is between (CI)

32
New cards

95% Confident means

we’re 95% confident in this procedure to successfully capture the true p. That is, the procedure’s success rate over many different samples is 95%

33
New cards

To increase the confidence level, you should increase

the critical value z*. A wider confidence interval has a greater chance of capturing the true p.

34
New cards

The margin of error, ME, gives

the maximum error between p and p hat, for the given confidence level. The wider the level, the greater the margin of error. More confudence implies a greater success rate in capturing the true value but this implies a greater margin of error and it might be too wide to be useful

35
New cards

CI & Hypothesis Tests

Since they’re two sided, they correspond with two sided hypothesis tests

Check to see if the Ho value is contained inside the interval :

If the Ho value is inside the interval we cannot reject it: there is insufficient evidence to reject Ho and we will have P-value > alpha

But if the Ho value is outside the interval, there is sufficient evidence to reject Ho, and P-value < alpha