Biostatistics 281 Chapters 1-6 MIDTERM Study Guide

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/120

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

121 Terms

1
New cards

Statistics

  • A way of learning from data 

  • Concerned with all elements of study design, data collection, and analysis of numerical data

  • Requires judgement

2
New cards

Biostatistics

Statistics applied to biological and health problems

3
New cards

Data Judges

Judge and confirm clue

Use statistical inference

4
New cards

Data Detectives

Uncover patterns and clues

Use exploratory data analysis and descriptive statistics

5
New cards

Goals of Biostatistics

Improvement of the intellectual content of data

Organization of data into understandable forms

Reliance of tests of experience as a standard of validity

6
New cards

Data Collection Form

  • Observation:

    • Unit upon which measurements are made, can be individual or aggregate

  •  Variable:

    • The generic thing we measure 

    • Examples: Age, HIV status

  • Value: 

    • A realized measurement

    •  Examples: “27”, “positive”

7
New cards

Data Table

Each row corresponds to an observation

Each column contains information on a variable

Each cell in the table contains a value

8
New cards

Data Dictionary

knowt flashcard image
9
New cards

Types of Measurements

The assigning of numbers and codes according to prior-set rules.

10
New cards

Categorical - Nominal

No implied order of categories

E.g., Race, sex, colors

11
New cards

Categorical - Ordinal

Categories can be placed in some order

E.g., Likert Scales (Strongly agree, agree, no opinion, disagree, strongly disagree)

12
New cards

Numerical - Discrete

Counts or whole numbers

E.g., number of patients, number of children in a family.

13
New cards

Numerical - Continuous

Can take on any value within a range

E.g., height, weight, temperature

14
New cards

Population vs. Sample

Saves time

Saves money

Allows resources to be devoted to greater scope and accuracy

15
New cards

Sampling Methods

knowt flashcard image
16
New cards

Precision - Imprecision

Inability to be replicated

17
New cards

Precision - Bias

Tendency to overestimate or underestimate the true value of an object.

18
New cards

Many different types of biases exist, below are a few examples

  • Selection Bias

  • Detection Bias

  • Omitted Variable Bias

  • Attrition Bias

  • Non-Response Bias

  • Response Bias

19
New cards

Symmetry

Degree to which shape reflects a mirror image of itself around its center

<p><span style="background-color: transparent;">Degree to which shape reflects a mirror image of itself around its center</span></p>
20
New cards

Modality

Number of peaks

21
New cards

Kurtosis

Steepness of the mound (width of tails)

<p><span style="background-color: transparent;">Steepness of the mound (width of tails)</span></p>
22
New cards

Departures

Outliers

23
New cards

Skew Example

<p></p>
24
New cards

Two examples of chart

knowt flashcard image
25
New cards

Arithmetic Average (Mean)

Gravitational Center

26
New cards

Median

Middle Value

27
New cards

EXAMPLE - order the data from lowest to highest

  • 5 11 21 24 27 28 30 42 50 52

  • The median has a depth of (n + 1) ÷ 2 on the ordered array

  • When n is even, average the points adjacent to this depth 

  • For illustrative data: n = 10, median’s depth = (10+1) ÷ 2 = 5.5 

  • The median falls between 27 and 28

28
New cards

What is included in the spread?

  • Range

  • Inter-Quartile Range

  • Standard Deviation

  • Variance

29
New cards

Range

  • Minimum to maximum

  • The easiest but not the best way to describe spread (e.g., standard deviation, etc.) 

  • The range is “from 5 to 52” 

    • 5 11 21 24 27 28 30 42 50 52

30
New cards

Frequency Table

  • Frequency

    • Count

  • Relative Frequency

    • Proportion or %

  • Cumulative Frequency

    • % less than or equal to level

  • When data are sparse, group data into class intervals

  • Create 4 to 12 class intervals

  • Classes can be uniform or non-uniform (TRY TO KEEP IT UNIFORM!)

  • End point convention: e.g., first class interval of 0 to 10 will include 0 but exclude 10 (0 to 9.99)

  • Talley frequencies

  • Calculate relative frequency

  • Calculate cumulative frequency

<ul><li><p><span style="background-color: transparent;">Frequency</span></p><ul><li><p><span style="background-color: transparent;">Count</span></p></li></ul></li><li><p><span style="background-color: transparent;">Relative Frequency</span></p><ul><li><p><span style="background-color: transparent;">Proportion or %</span></p></li></ul></li><li><p><span style="background-color: transparent;">Cumulative Frequency</span></p><ul><li><p><span style="background-color: transparent;">% less than or equal to level</span></p></li></ul></li></ul><ul><li><p><span style="background-color: transparent;">When data are sparse, group data into class intervals</span></p></li><li><p><span style="background-color: transparent;">Create 4 to 12 class intervals</span></p></li><li><p><span style="background-color: transparent;">Classes can be uniform or non-uniform (TRY TO KEEP IT UNIFORM!)</span></p></li><li><p><span style="background-color: transparent;">End point convention: e.g., first class interval of 0 to 10 will include 0 but exclude 10 (0 to 9.99)</span></p></li><li><p><span style="background-color: transparent;">Talley frequencies</span></p></li><li><p><span style="background-color: transparent;">Calculate relative frequency</span></p></li><li><p><span style="background-color: transparent;">Calculate cumulative frequency</span></p></li></ul><p></p>
31
New cards

Frequency Table Example #1

  • Uniform class intervals table (width 10) for data:

    • 05, 11, 21, 24, 27, 28, 30, 42, 50, 52

  • Create a Frequency Table

<ul><li><p><span style="background-color: transparent;">Uniform class intervals table (width 10) for data:</span></p><ul><li><p><span style="background-color: transparent;">05, 11, 21, 24, 27, 28, 30, 42, 50, 52</span></p></li></ul></li></ul><ul><li><p><span style="background-color: transparent;">Create a Frequency Table</span></p></li></ul><p></p>
32
New cards

Histogram

  • A histogram is a frequency chart for a quantitative measurement

  • Notice how the bars touch

<ul><li><p><span style="background-color: transparent;">A histogram is a frequency chart for a<strong><u> quantitative </u></strong>measurement</span></p></li><li><p><span style="background-color: transparent;">Notice how the bars touch</span></p></li></ul><p></p>
33
New cards

Bar Chart

A bar chart with non-touching bars is reserved for categorical measurements

34
New cards

Pie Chart

knowt flashcard image
35
New cards

Summary statistics

  • Central Location 

    • Mean 

    • Median

    • Mode

  • Spread

    • Range and interquartile range (IQR)

    • Variance and standard deviation

  • Shape

36
New cards

Notation

  • n = sample size 

  • X = the variable (e.g., ages of subjects) 

  • xi = the value of individual i for variable X

  • Σ = sum all values (capital sigma) 

    • Example (ages of participants): 

      • 21 42 5 11 30 50 28 27 54 52 

        • n = 10

        • X = AGE variable 

        • x1 = 21, x2 = 42, ..., x10 = 52 

        • Σxi = x1 + x2 + ... + x10 = 21 + 42 + ... + 52 = 290

37
New cards

Central Location (Sample Mean)

  • “Arithmetic Average”

  • Traditional Measure of Central Location

  • Sum the values and divide by n

  • X refers to the sample mean

<ul><li><p><span style="background-color: transparent;">“Arithmetic Average”</span></p></li><li><p><span style="background-color: transparent;">Traditional Measure of Central Location</span></p></li><li><p><span style="background-color: transparent;">Sum the values and divide by <em>n</em></span></p></li><li><p><span style="background-color: transparent;">X refers to the sample mean</span></p></li></ul><p></p>
38
New cards

Central Location (Sample Mean) EXAMPLE

knowt flashcard image
39
New cards

The mean is the balancing point of a distribution

gravitational center

  • Susceptible to skews

  • Can be used to predict…

    • Randomly values from sample

    • Random values from population

    • Population mean

40
New cards

Central Location (Median)

  • Order the data from lowest to highest 

    • 5 11 21 24 27 28 30 42 50 52 

  • The median has a depth of (n + 1) ÷ 2 on the ordered array 

  • When n is even, average the points adjacent to this depth 

  • For illustrative data: n = 10, median’s depth = (10+1) ÷ 2 = 5.5 

  • The median falls between 27 and 28. Median = 27.5

  • The median is more resistant to skews and outliers than the mean; it is more robust. 

  • It is less impacted by skewness and outliers. 

  • 1362 1439 1460 1614 1666 1792 1867 

    • Mean = 1636 

    • Median = 1614 

  • 1362 1439 1460 1614 1666 1792 9867 

    • Mean = 2743

    • Median = 1614

41
New cards

Central Location (Mode)

  • The mode is the most commonly encountered value in the dataset

  • This data set has a mode of 7

    • {4, 7, 7, 7, 8, 8, 9} 

  • This data set has no mode 

    • {4, 6, 7, 8} 

  • The mode is useful only in large data sets with repeating values

42
New cards

Effect of a Skew on the Mean, Median, and Mode:

knowt flashcard image
43
New cards

Note how the mean gets pulled toward the longer tail more than the median

  • mean = median → symmetrical distribution 

  • mean > median → positive skew 

  • mean < median → negative skew

44
New cards

Spread

  • Two distributions can be quite different yet can have the same mean. 

  • This data compares particulate matter in air samples (μg/m3) at two sites. 

  • Both sites have a mean of 36, but Site 1 exhibits much greater variability. 

  • We would miss the high pollution days if we relied solely on the mean.

<ul><li><p><span style="background-color: transparent;">Two distributions can be quite different yet can have the same mean.&nbsp;</span></p></li><li><p><span style="background-color: transparent;">This data compares particulate matter in air samples (μg/m3) at two sites.&nbsp;</span></p></li><li><p><span style="background-color: transparent;"><strong><u>Both sites have a mean of 36</u></strong>, but Site 1 exhibits much greater variability.&nbsp;</span></p></li><li><p><span style="background-color: transparent;">We would miss the high pollution days if we relied solely on the mean.</span></p></li></ul><p></p>
45
New cards

Spread (Range)

  • Range 

  • Maximum – minimum 

  • Site 1 range is from 22 to 68 (range of 46) 

  • Site 2 range is from 32 to 40 (range of 8) 

  • Beware: the sample range will tend to underestimate the population range. 

  • Always supplement the range with at least one addition measure of spread

<ul><li><p><span style="background-color: transparent;">Range&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Maximum – minimum&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Site 1 range is from 22 to 68 (range of 46)&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Site 2 range is from 32 to 40 (range of 8)&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Beware: the sample range will tend to underestimate the population range.&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Always supplement the range with at least one addition measure of spread</span></p></li></ul><p></p>
46
New cards

Spread (Quartiles)

  • Quartile 1 (Q1): 

    • Cuts off bottom quarter of data 

    • Median of the lower half of the data set 

  • Quartile 3 (Q3) 

    • Cuts off top quarter of data 

    • Median of the upper half of the data set 

  • Interquartile Range (IQR) 

    • Q3 – Q1 

    • Covers the middle 50% of the distribution

<ul><li><p><span style="background-color: transparent;">Quartile 1 (Q1):&nbsp;</span></p><ul><li><p><span style="background-color: transparent;">Cuts off bottom quarter of data&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Median of the lower half of the data set&nbsp;</span></p></li></ul></li><li><p><span style="background-color: transparent;">Quartile 3 (Q3)&nbsp;</span></p><ul><li><p><span style="background-color: transparent;">Cuts off top quarter of data&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Median of the upper half of the data set&nbsp;</span></p></li></ul></li><li><p><span style="background-color: transparent;">Interquartile Range (IQR)&nbsp;</span></p><ul><li><p><span style="background-color: transparent;">Q3 – Q1&nbsp;</span></p></li><li><p><span style="background-color: transparent;">Covers the middle 50% of the distribution</span></p></li></ul></li></ul><p></p>
47
New cards

Spread (Quartiles) - Example

  • You are given a SRS of metabolic rates (cal/day), n = 7 

  • 1362 1439 1460 1614 1666 1792 1867 

  • When n is odd, include the median in both halves of the data set. 

  • Bottom half: 1362 1439 1460 1614 

    • Median = 1449.5 (Q1) 

  • Top half: 1614 1666 1792 1867 

    • Median = 1729 (Q3)

48
New cards

Five Point Summary

  • Q0 (the minimum) 

  • Q1 (25th percentile) 

  • Q2 (median) 

  • Q3 (75th percentile) 

  • Q4 (the maximum)

49
New cards

Standard Deviation

  • Most common descriptive measures of spread 

  • Based on deviations around the mean

50
New cards

Standard Deviation EXAMPLE

This data set has a mean of 36.

<p><span style="background-color: transparent;">This data set has a mean of 36.</span></p>
51
New cards

FORMULAS

knowt flashcard image
52
New cards

Example of using the formulas

knowt flashcard image
53
New cards

Sample -> Population

knowt flashcard image
54
New cards

68-95-99.7 Rule

knowt flashcard image
55
New cards

68-95-99.7 RULE (Example)

knowt flashcard image
56
New cards

Chebyshev’s Rule

  • ALL Distributions 

  • Chebychev’s rule says that at least 75% of the values will fall in the range μ ± 2σ.

  • Example: 

    • A distribution with μ = 30 and σ = 10 has at least 75% of the values in the range 30 ± (2)(10) = 10 to 50

57
New cards

Surveys

  • Describe population characteristics 

  • Example 

    • A study of the prevalence of hypertension in a population

58
New cards

Comparative Studies

  • Determine relationships between variables 

  • Example 

    • A study to address whether weight gain causes hypertension

59
New cards

Outline of Studies

knowt flashcard image
60
New cards

Comparative Studies

  • Comparative designs study the relationship between an explanatory variable and response variable.

  • Example: 

    • The first test you decide not to study and you get a C+. 

    • The second test you decide to study and get an A.

  •  Explanatory

    • What you do to cause change. 

  • Response

    • What you’re hoping to change.

<ul><li><p><span style="background-color: transparent;">Comparative designs study the relationship between an <u>explanatory variable</u> and <u>response variable</u>.</span></p></li><li><p><span style="background-color: transparent;">Example:&nbsp;</span></p><ul><li><p><span style="background-color: transparent;">The first test you decide not to study and you get a C+.&nbsp;</span></p></li><li><p><span style="background-color: transparent;">The second test you decide to study and get an A.</span></p></li></ul></li><li><p><span style="background-color: transparent;">&nbsp;Explanatory</span></p><ul><li><p><span style="background-color: transparent;">What you do to cause change.&nbsp;</span></p></li></ul></li><li><p><span style="background-color: transparent;">Response</span></p><ul><li><p><span style="background-color: transparent;">What you’re hoping to change.</span></p></li></ul></li></ul><p></p>
61
New cards

Comparative Studies - Experimental

Investigators assign the subjects to groups

62
New cards

Comparative Studies - Observational

Investigator does not assign the subjects to groups

63
New cards

Comparative Study Comparison

  • In the experimental design, the investigators controlled who was and who was not exposed.

  • In the non-experimental design, the study subjects (or their physicians) decided on whether or not subjects were exposed

64
New cards

Experimental Principles

  • Controlled comparison

  • randomized 

  • replication

65
New cards

Controlled Trial

  • Control Group = Non-exposed. 

  • You can’t know how a treatment causes change without comparing it to someone who didn’t take the treatment. 

  • You won’t know if studying before an exam helps without first not studying for it.

  • You cannot judge effects of a treatment without a control group because: 

    • Many factors contribute to a response 

    • Conditions change on their own over time 

    • The placebo effect and other passive intervention effects are operative

66
New cards

Randomization

  • Refers to randomly putting people into treatment groups. 

  • Balances lurking variables among treatments groups, mitigating their potentially confounding effects

67
New cards

Replication

The results of a study conducted on a larger number of cases are generally more reliable than smaller studies

68
New cards

Ethics Outline

  • Informed Consent

  • Beneficence

  • Equipoise

  • Institutional Review Board

  • Additional Ethical Principles

69
New cards

Informed Consent

Biostatisticians should obtain informed consent from research participants before collecting data. Informed consent involves providing participants with information about the study, including its purpose, procedures, risks, and benefits, and obtaining their voluntary agreement to participate.

70
New cards

Beneficence

Biostatisticians should maximize benefits and minimize harms to research participants and society. This principle involves ensuring that research is conducted in a way that promotes the well-being of participants and society.

71
New cards

Equipoise

Biostatisticians should ensure that the research question is scientifically valid and that the study design is appropriate to answer the question. This principle involves ensuring that the study is designed in a way that minimizes bias and confounding.

72
New cards

Institutional Review Board

Biostatisticians should work with IRBs to ensure that research is conducted in an ethical manner. IRBs are responsible for reviewing research proposals to ensure that they meet ethical standards and that the rights and welfare of research participants are protected.

73
New cards

Additional Ethical Principles

  • Integrity of data and methods 

  • Responsibilities to stakeholders 

  • Responsibilities to research subjects, data subjects, or those directly affected by statistical practices

74
New cards

Normal Distributions

  • Continuous random variables are described with smooth probability density functions (pdfs)

  • Normal pdfs are recognized by their familiar bell-shape

  • Example: Age distribution of a pediatric population

75
New cards

Parameters μ and σ

  • Normal pdfs are a family of distributions

  • Family members identified by parameters μ (mean)and σ (standard deviation)

76
New cards

μ controls location

knowt flashcard image
77
New cards

σ controls spread

knowt flashcard image
78
New cards

68-95-99.7 Rule

Normal Distribution

  • 68% of data in the range μ ± σ

  • 95% of data in the range μ ± 2σ

  • 99.7% of data the range μ ± 3σ

<p><span style="background-color: transparent;"><strong><u>Normal Distribution</u></strong></span></p><ul><li><p><span style="background-color: transparent;">68% of data in the range μ ± σ</span></p></li><li><p><span style="background-color: transparent;">95% of data in the range μ ± 2σ</span></p></li><li><p><span style="background-color: transparent;">99.7% of data the range μ ± 3σ</span></p></li></ul><p></p>
79
New cards

Reexpression of Non-Normal Variables

  • Many variables are not Normal

  • We can re-express non-Normal variables with a mathematical transformation to make them more Normal

  • Example of mathematical transforms include logarithms, exponents, square roots, and so on.

80
New cards

Logarithms

  • Logarithms are exponents of their base

  • There are two main logarithmic bases

  • common log10

(base 10)

  • natural ln

(base e)

<ul><li><p><span style="background-color: transparent;">Logarithms are exponents of their base</span></p></li><li><p><span style="background-color: transparent;">There are two main logarithmic bases</span></p></li><li><p><span style="background-color: transparent;">common log<sub>10</sub></span></p></li></ul><p><span style="background-color: transparent;">(base 10)</span></p><ul><li><p><span style="background-color: transparent;">natural ln</span></p></li></ul><p><span style="background-color: transparent;">(base <em>e</em>)</span></p>
81
New cards

Landmarks

  • log10(1) = 0

(because 100 = 1)

  • log10(10) = 1

(because 101 = 10)

82
New cards

Statistical inference

The act of generalizing from a sample to a population with calculated degree of certainty

<p><span style="background-color: transparent;">The act of generalizing from a <strong><u>sample</u></strong> to a <strong><u>population</u></strong> with calculated degree of certainty</span></p>
83
New cards

Population

  • Represents everyone

  • Mean → μ

  • Standard Deviation → σ

84
New cards

Sample

  • A subset of the population

  • Mean → x ̅ (x-bar)

  • Standard Deviation → s

85
New cards

Sampling Behavior of A Mean

  • How precisely does a given sample mean reflect the underlying population mean?

  • To answer this question, we must establish the sampling distribution of x-bar

  • The sampling distribution of x ̅ is the hypothetical distribution of means from all possible samples of size n taken from the same population

86
New cards

Finding 1 (central limit theorem)

The sampling distribution of x-bar tends toward Normality even when the population distribution is not Normal. This effect is strong in large samples.

87
New cards

Finding 2 (unbiasedness)

The expected value of x-bar is μ

88
New cards

Finding 3 (square root law)

knowt flashcard image
89
New cards

Standard Deviation (Error) of the Mean

  • The standard deviation of the sampling distribution of the mean has a special name: it is called the “standard error of the mean” (SE)

  • The square root law says the SE is inversely proportional to the square root of the sample size:

  • What do you think would happen if we increase our sample size?

  • Our SE or standard error of the mean would go down!

  • As, ↑n → ↓σ

<ul><li><p><span style="background-color: transparent;">The standard deviation of the sampling distribution of the mean has a <strong>special name</strong>: it is called the “standard error of the mean” (<em>SE</em>)</span></p></li><li><p><span style="background-color: transparent;">The square root law says the <em>SE</em> is inversely proportional to the square root of the sample size:</span></p></li><li><p><span style="background-color: transparent;">What do you think would happen if we increase our sample size?</span></p></li><li><p><span style="background-color: transparent;">Our <em>SE</em> or standard error of the mean would go down!</span></p></li><li><p><span style="background-color: transparent;">As, <u>↑n → ↓σ</u></span></p></li></ul><p></p>
90
New cards

Law of Large Numbers

  • As a sample gets larger and larger, the sample mean tends to get closer and closer to the μ

  • This tendency is known as the Law of Large Numbers

91
New cards

Statistical Inference

Generalizing from a sample to a population with calculated degree of certainty

92
New cards

Two forms of statistical inference

  • Hypothesis testing

  • Estimation

93
New cards

Introduction to Hypothesis Testing:

  • Hypothesis testing is also called significance testing

  • The objective of hypothesis testing is to test claims about parameters

  • For example, does a clinical study of a new cholesterol-lowering drug provide robust evidence of a beneficial effect in patients at risk for heart disease.

  • A drug is considered to have a beneficial effect on a population of patients if the population average effect is large enough to be clinically important. It is also necessary to evaluate the strength of the evidence that a drug is effective; in other words, is the observed effect larger than would be expected from chance variation alone?

  • A method for calculating the probability of making a specific observation under a working hypothesis, called the null hypothesis.

  • By assuming that the data come from a distribution specified by the null hypothesis, it is possible to calculate the likelihood of observing a value.

  • If the chances of such an extreme observation are small, there is enough evidence to reject the null hypothesis in favor of an alternative hypothesis.

94
New cards

Hypothesis Testing Steps

  1. Formulating null and alternative hypotheses

  2. Specifying a significance level (α)

  3. Calculating the test statistic

  4. Calculating the p-value

  5. Drawing a conclusion

95
New cards

Null and Alternative Hypothesis

  • The null hypothesis (H0) often represents either a skeptical perspective or a claim to be tested.

  • The alternative hypothesis (HA) is an alternative claim and is often represented by a range of possible parameter values.

The logic behind rejecting or failing to reject the null hypothesis is similar to the principle of presumption of innocence in many legal systems. In the United States, a defendant is assumed innocent until proven guilty; a verdict of guilty is only returned if it has been established beyond a reasonable doubt that the defendant is not innocent. In the formal approach to hypothesis testing, the null hypothesis (H0) is not rejected unless the evidence contradicting it is so strong that the only reasonable conclusion is to reject H0 in favor of HA.

96
New cards

two-sided alternative.

The alternative hypothesis HA : μ ≠ 0 is called a two-sided alternative.

97
New cards

Specifying a Significance Level (a)

  • It is important to specify how rare or unlikely an event must be in order to represent sufficient evidence against the null hypothesis. This should be done during the design phase of a study, to prevent any bias that could result from defining ’rare’ only after analyzing the results.

  • When testing a statistical hypothesis, an investigator specifies a significance level, α, that defines a ’rare’ event. Typically, α is chosen to be 0.05, though it may be larger or smaller, depending on context

98
New cards

Calculating the Test Statistic

  • The test statistic quantifies the number of standard deviations between the sample mean x ̅ and the population mean μ.

  • The perfect Normal distribution has a mean of 0 (μ = 0) and a standard deviation of 1 (σ = 1).

  • If your data’s distribution does not match these parameters exactly, then you need to standardize to make it fit. Different standardization formulas exist depending on the specific statistical analysis that is being conducting.

  • For example, a one sample t-test, the following formula would be used:

t=(x ̅-μ_0)/(s∕√n)

  • s represents the sample standard deviation and n represents the number of observations in the sample.

99
New cards

Calculating the P-Value

  • The p-value is the probability of observing a sample mean as or more extreme than the observed value, under the assumption that the null hypothesis is true.

  • The p-value can either be calculated from software or from the normal probability tables. For the weight-difference example, the p-value is vanishingly small:

<ul><li><p><span style="background-color: transparent;">The <strong><em>p</em>-value</strong> is the probability of observing a sample mean as or more extreme than the observed value, under the assumption that the null hypothesis is true.</span></p></li><li><p><span style="background-color: transparent;">The <em>p</em>-value can either be calculated from software or from the normal probability tables. For the weight-difference example, the <em>p</em>-value is vanishingly small:</span></p></li></ul><p></p>
100
New cards

Drawing a Conclusion

  • To reach a conclusion about the null hypothesis, directly compare p and α. Note that for a conclusion to be informative, it must be presented in the context of the original question; it is not useful to only state whether or not H0 is rejected.

  • If p > α, the observed sample mean is not extreme enough to warrant rejecting H0; more formally stated, there is insufficient evidence to reject H0.

  • A high p-value suggests that the difference between the observed sample mean and μ0 can reasonably be attributed to random chance.

  • If pα, there is sufficient evidence to reject H0 and accept HA.

  • Thus, the data support the conclusion that on average, the difference between actual and desired weight is not 0 and is positive; people generally seem to feel they are overweight.

Explore top flashcards

322 Exam 1
Updated 991d ago
flashcards Flashcards (78)
abdomen
Updated 815d ago
flashcards Flashcards (29)
Exam 2 Top 300
Updated 620d ago
flashcards Flashcards (56)
25.1!!!
Updated 205d ago
flashcards Flashcards (23)
georgaphy
Updated 989d ago
flashcards Flashcards (42)
Theatre Post 1950
Updated 535d ago
flashcards Flashcards (32)
Substance Abuse
Updated 4d ago
flashcards Flashcards (41)
322 Exam 1
Updated 991d ago
flashcards Flashcards (78)
abdomen
Updated 815d ago
flashcards Flashcards (29)
Exam 2 Top 300
Updated 620d ago
flashcards Flashcards (56)
25.1!!!
Updated 205d ago
flashcards Flashcards (23)
georgaphy
Updated 989d ago
flashcards Flashcards (42)
Theatre Post 1950
Updated 535d ago
flashcards Flashcards (32)
Substance Abuse
Updated 4d ago
flashcards Flashcards (41)