Statistical Analysis and Chi-Square Test Flashcards

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/52

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

53 Terms

1
New cards

What are the three main questions statistical inferences address?

1) Validity of a given result, 2) How significant?, 3) How close is observed result to an expected result?

2
New cards

What is a statistical population?

A large body of data containing all the data points of interest

3
New cards

What is a statistical sample?

Smaller collections of data points that are drawn from the much larger population

4
New cards

What do we use samples for?

To infer or estimate things about the population

5
New cards

What is relative frequency?

Proportion of particular observation relative to the total number of observations

6
New cards

What formula represents relative frequency?

Relative frequency = (frequency of specific observation) / (total number of observations)

7
New cards

How do scientists make inferences about populations?

Scientists make inferences about populations based on information obtained from smaller samples

8
New cards

What can be calculated from samples?

Event frequencies (and relative frequencies) can be calculated from samples

9
New cards

How can frequency distributions of samples be used?

Frequency distributions of samples can be used to predict the frequency distributions of populations

10
New cards

What happens to frequency distributions as sample size increases?

As sample size becomes larger, more likely to approximate a bell shape curve

11
New cards

What type of distribution approximates many frequency distributions in biology?

The normal distribution

12
New cards

Where is the normal distribution centered?

Around the mean/average value (x̄)

13
New cards

What percentage of data is on each side of the mean in a normal distribution?

50% right, 50% left

14
New cards

What does standard deviation measure?

Measure of the average deviation of data points from the sample mean (s)

15
New cards

What percentage of values lie within x̄ ± s in a normal distribution?

~68% of all values

16
New cards

What percentage of values lie within x̄ ± 2s in a normal distribution?

~95% of all values

17
New cards

What percentage of values lie within x̄ ± 3s in a normal distribution?

Nearly all of the measurements

18
New cards

How can area under the normal distribution curve be interpreted?

Area under the curve can be treated as probabilities

19
New cards

What is the total area under a normal distribution curve?

p = 1.0 (total area equals 1)

20
New cards

What is the normal distribution?

A type of probability distribution that applies to many events and phenomena in nature

21
New cards

What can standard deviation be used to analyze?

The numerical spread of a dataset, and to set specific boundary values underneath a curve

22
New cards

How can area underneath relative frequency distribution curves be interpreted?

As probabilities

23
New cards

What is the basic difference between statistical sample and statistical population?

Statistical population is a large body of data containing all data points of interest; statistical sample is a smaller portion of the total statistical population

24
New cards

What are frequency histograms?

Show data points that are placed into bins of equal sizes; histograms can either show absolute or relative frequencies

25
New cards

What do statistical tests involve the use of?

Probability distributions (one example is a normal distribution)

26
New cards

What type of distribution do many natural phenomena follow?

Normal distribution - mound shaped distribution for continuous variables

27
New cards

What does the interval x̄ ± s contain?

~68% of measurements

28
New cards

What does the interval x̄ ± 2s contain?

~95% of measurements

29
New cards

What does the interval x̄ ± 3s contain?

Nearly all of the measurements

30
New cards

What does area under the normal distribution curve represent?

Probability, where the total area = 1

31
New cards

What is a null hypothesis (H₀)?

A stated hypothesis which says there is no difference or no effect

32
New cards

What is an alternative hypothesis (Hₐ)?

A hypothesis which says that there is a difference

33
New cards

What will a statistical test do with the null hypothesis?

Either reject, or fail to reject, the stated null hypothesis

34
New cards

What is a goodness of fit statistical test?

A particular type of test that allows us to determine whether or not an observed result is consistent with an expected result

35
New cards

What is a specific example of a goodness of fit test?

Chi-square goodness of fit test

36
New cards

What is the chi-square goodness of fit test used for?

To detect the goodness of fit for data that fit into several predefined 'categories' such as different genotypes

37
New cards

What is the chi-square distribution also known as?

A probability distribution function

38
New cards

What determines the shape of the chi-square distribution function?

The number of categories 'k' being considered

39
New cards

How should area under any chi-square curve be treated?

As a probability, where total area equals a probability of 1

40
New cards

What p-value is typically used to reject the null hypothesis?

p = 0.05

41
New cards

What must the chi-square statistic be to reject the null hypothesis?

At least as high as the critical values at p = 0.05

42
New cards

What determines the number of degrees of freedom?

Number of categories 'k'

43
New cards

How are degrees of freedom calculated?

k - 1 (number of categories minus 1)

44
New cards

How many degrees of freedom are used for Hardy-Weinberg questions?

Always use 1 degree of freedom (row one)

45
New cards

For two alleles and 3 different genotypes, how many degrees of freedom are used?

One degree of freedom

46
New cards

What alpha value corresponds to the chi-square threshold value?

0.05 or less

47
New cards

When can you reject the null hypothesis in chi-square tests?

If calculated chi-square value is larger than threshold value

48
New cards

In chi-square goodness of fit tests for Hardy-Weinberg questions, what do the observed (O) and expected (E) values refer to?

The numbers of organisms, NOT the genotype frequencies

49
New cards

What shape does the normal distribution have?

Bell-shaped or mound-shaped curve

50
New cards

What makes a sample representative for statistical inference?

Being a smaller collection drawn from the larger population that can be used to estimate characteristics of that population

51
New cards

What is the relationship between sample size and normal distribution approximation?

Larger sample sizes are more likely to approximate a bell-shaped curve

52
New cards

Why is standard deviation important in normal distributions?

It allows you to determine what percentage of data falls within specific ranges from the mean

53
New cards

What does it mean when we say area under curves represents probability?

The proportion of area under a curve segment corresponds to the probability of values falling in that range