Unit 7 : Confidence Intervals and Hypothesis Tests for Means

0.0(0)
studied byStudied by 0 people
0.0(0)
call with kaiCall with Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/18

flashcard set

Earn XP

Description and Tags

Slideshow

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

19 Terms

1
New cards

Estimating the population mean requires

different formulas than proportions since the mean is calculated from a quantitative variable, whereas the proportion is calculated from a categorical variable, but the underlying principles are the same

2
New cards

The population (true) mean is called Miu, we estimate it based on

the sample aberage.

3
New cards

For a mean, instead of a normal / z curve we use

a t-curve

4
New cards

The collection of y(bars_ from many different samples is called

the sampling distribution of y bar, and is a t-distribution with degrees of freedon N-1

5
New cards

standard deviation can be found as

s / sqrt(n)

6
New cards

How does the t-curve differ from the normal curve?

It’s also bell shaped, but has longer tails than the z-curve. This is due to the uncertainty in not knowing the true Std. Dev, and results in wider confidence intervals.

7
New cards

confidence intervals when estimating the population mean

the critical value is = t* with DF = n-1

8
New cards

In r we find the cutoff value for 95% confidence using

qt (0.025, dof) or qt(0.975,df)

9
New cards

How do we interpret confidence intervals here?

There is 95% confidence that the true average deliver time for all deliveries is between 25.75 and 29.85 minutes

10
New cards

95% confidence here means

if we took a large number of random samples of size 50 under similar conditions, 95% of these samples which would produce confidence intervals contain the true average delivery time.

11
New cards

For this test, we use Ho: Miu = hypothesized value, and what are the Ha’s?

Ha : Miu > hypothesized value

Ha : Miu < hypothesized value

Ha : Miu =/ hypothesized value

12
New cards

Test statistic is given by :

(sample statistic - null hypothesis value (H0)) / SE

13
New cards

how to compute the p-value in R

using pt instead of pnorm, with DF = n-1, either 1-sided or 2- sided depending on Ha.

14
New cards

> and < mean the t test has how many sides?

one sided!!! because of this you use pt instaed of pnorm and enter the negative of the test statistic together with the Df

15
New cards

If we are doing a 2 sided t-test, what do we do?

use pt and enter the negative of the test statistic together with the df, then multiple by 2

2*pt(-TS, DF) this gives you the p-value

16
New cards

For tests of the mean, the same size should be

N is >= 30. Skewed data is okay if n>= 30. If the data follows a normal distribution, the sample size can be less than 30.

17
New cards

The sample size condition can be met in one of two ways (just one has to be true) : Either :

  1. The sample size must be n>= 30 OR

  2. The original data must follow a Normal distribution

If this condition is not met, the sampling distribution won’t be a t-curve and the P-value/CI willl be wrong.

THIS REPLACES THE CONDITION NP>= 10 FOR PROPORTIONS. THERE IS NO P FOR AVERAGES

18
New cards

R function for t test

t.test(data$variable, alternative = …, mu = …, conf.level = …)

19
New cards

Mu is

the value of the hypothesized mean (Ho being tested)