Biostats Exam 1

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/171

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

172 Terms

1
New cards

Population

the entire set of individuals, animals, objects or events that can be uniquely defined by a characteristic or set of characteristics

too large to measure or observe each member of the population

2
New cards

what are examples of a population?

1. All US males between 45-65 years of age

2. All deer in the state of Virginia

3. All shipwrecks on the bottom of the Atlantic Ocean

3
New cards

what is a sample?

subset of the population of interest

4
New cards

What makes a sample a good representation of the population?

if it is random (every member of the population has equal chance of being observed

5
New cards

What are examples of samples?

1. A sample of 1000 US males between. 45-65 years old

2. A sample of 100 deer caught and released int he state of VA

6
New cards

What are population parameters?

specific characteristics of the population

7
New cards

What is a statistic?

an estimator of a population characteristics/ parameter based upon the values observed in the sample

8
New cards

What are examples of statistics?

1. Average weight of 100 US males between 45-65 years old

2. Proportion of 50 deer caught in the state of VA with a wasting disease

9
New cards

What are convenience samplings?

-non random sampling approach

-involved using respondents who are "convenient" to the researcher or easily accessed

-subjects recruited mainly by location or membership

10
New cards

Why might convenient sampling not be useful?

extremely high chance of producing a biased statistical results that are not generalizable (the sample does not represent the population of interest)

11
New cards

What is a simple random sample?

-every member of the population is given an equal chance of being selected

-sampling frame creates then randomly, a subset of individuals are selected to be in the sample

-once the person is selected, an attempt is made to gather their data

<p>-every member of the population is given an equal chance of being selected</p><p>-sampling frame creates then randomly, a subset of individuals are selected to be in the sample</p><p>-once the person is selected, an attempt is made to gather their data</p>
12
New cards

What is systemic random sampling?

randomly picks a place in the sampling frame to begin and systemically select subjects from there

<p>randomly picks a place in the sampling frame to begin and systemically select subjects from there</p>
13
New cards

What is stratified random sampling?

-the sampling frame is subdivided by groups and then random sampling occurs within groups

-suppose an organization has 20,000 employees, 90% female and 10% males

-sampling could be stratified by female and male to make certain the sample is 90% female and 10% male

14
New cards

What is random sampling considered?

gold standard, providing the best chance to produce unbiased statistics and results that generalize to the population of interest

15
New cards

what does a case control study compare?

Those with the disease or condition under the study (cases) and a very similar group of people who do not have the disease or condition (controls)

16
New cards

What is a case control study?

researchers examine the history of the subjects (socioeconomic, educational, occupations, medical history, lifestyles) to examine which factors may be associat4ed with the disease or condition

17
New cards

What are retrospective studies?

Case-control studies choose population samples after the fact based on presence or absence of disease. E.g. People with lung cancer vs without to identify risk factors. More practical for rare diseases.

18
New cards

What are cohort studies?

-a study approach that follows a set of research participants over a period of time, sometimes for decades

-recruit and follow one group of participants who share a common characteristic, such as a particular occupation or demographic similarity

-can be retrospective or prospective

19
New cards

What is a randomized clinical trial?

-study in which the participants are divided randomly (flip a coin) into groups

-each group will receive a different intervention or treatment

-using chance to assign individuals to treatment means that the groups will likely be balanced on demographics and other factors

20
New cards

what does randomization allow for?

fair, unbiased comparison of the treatments; at the time of the trial, it is unknown which treatment is best

21
New cards

What does the statistics calculated depend on?

depends on the nature of the data (level of measurement) you collect within your sample

22
New cards

what is nominal level?

-lowest level of measurement

-named for characteristics

-no natural order (concepts of greater than or less than do NOT apply)

23
New cards

What are examples of nominal levels?

-sex (female/male)

-eye color (blue, brown, hazel, other)

-blood type (O, A, B, AB)

-race

-nationality

24
New cards

What is ordinal level?

-second lowest on the levels of measurement

-categories or levels that can be ranked or ordered

-distance between the levels/ characteristics is not constant and has no meaning (subtraction is useless)

25
New cards

what are examples of ordinals?

-Grades (A-F)

-restaurant ratings (5 star)

-Pain rating 1 to 10

26
New cards

What is interval level (levels of measurement)?

-second highest level

-measurements that can be ordered and distance between measurements has meaning (subtraction is valid) but zero does NOT imply absence of the trait)

27
New cards

What is an example of interval levels of measurement?

celsius (amount of heat between 30˚ and 25˚ is equal to the amount of heat between 10˚ and 5˚ but 0˚C does not imply NO heat but implies that water freezes)

28
New cards

What is the ratio level?

-highest level up on the levels of measurement

-measurements that can ordered and distance between measurements has meaning (subtraction is valid) and zero does imply the absence of a trait

29
New cards

what is an example of ratio level?

temperature in Kelvin, the amount of heat between 30˚ and 25˚ is equal to the amount of heat between 10˚ and 5˚ and 0˚ K is NO heat

30
New cards

what is the most common statistic for nominal levels of measurements?

proportions

31
New cards

what is the most common statistic for ordinal levels of measurements?

proportions, but sometimes sample mean and variance are reported

32
New cards

what is the most common statistic for interval and ratio levels of measurements?

sample means and variacnes

33
New cards

What kind of chart/ graph is used for nominal or ordinal variables?

-a bar chart most appropriate

-other options: pie chart (each category receives a proportional size fo the pie)

-comparing heigths is easier than angles

34
New cards

What kind of chart/ graph is best to show the distribution of continuous variables?

box and whisker plots (sometimes just called box plots) or histograms

35
New cards

What kind of chart/ graph is best to show association between two continuous variables (such as interval or ratio variables)?

scatterplot with one variable as the x axis and the other as the y axis, and a point at the (x,y) to represent the data

36
New cards

What is a sample mean?

a statistic that measures central tendency or central location; average of the data points

37
New cards

What is the formula for sample means?

knowt flashcard image
38
New cards

Imagine you have a ruler and you place an equal amount of weight at each datapoint. If you want to balance that ruler on your finger, where do you think the balance point is?

Data points: 73, 92, 105, 111, 120, 122, 127, 129, 131, 136, 138

111.92

<p>111.92</p>
39
New cards

What are we interested in in addition to the center of the data?

interested in the dispersion of the data, how 'spread out' the data is

40
New cards

what is sample variance?

-a common measure of dispersion of data/ how spread out the data is

-measures the averaged squared deviation of the data points from their sample mean

41
New cards

whaat is the formula for sample variance?

knowt flashcard image
42
New cards

what are degrees of freedom?

number of unconstrained data points in variance calculations

43
New cards

What are outliers?

unusual data points that fall far away (above or below) from the rest of the data

44
New cards

what is the p th percentile?

a value where p percent of the data fall at or below that value

45
New cards

what is the median (50th percentile)?

-a value where 50% of the data fall at or below that values

-the position is (n+1)/2

<p>-a value where 50% of the data fall at or below that values</p><p>-the position is (n+1)/2</p>
46
New cards

what is the first quartile?

-25th percentile, 25% of the data falls at or below that value

-position: (n+1)/4

47
New cards

what is the third quartile?

-75th percentile, 75% of the data fall at or below that value

-position: 3(n+1)/4

48
New cards

what is the median?

a robust measure of center

49
New cards

What is interquartile rnage?

-Q3-Q1

-robust measure of spread/dispersion

50
New cards

What is right skewed in histograms?

tail is towards the right, mean > median

<p>tail is towards the right, mean &gt; median</p>
51
New cards

What is left skewed in histograms?

tail towards the left, mean < median

<p>tail towards the left, mean &lt; median</p>
52
New cards

What is right skewed in a boxplot?

2 outliers, mean > median

<p>2 outliers, mean &gt; median</p>
53
New cards

what is left skewed in a boxplot?

multiple left sided outliers, mean < median

<p>multiple left sided outliers, mean &lt; median</p>
54
New cards

what is probability?

-the measure of how likely a specific event/ outcome will occur in an experiment

-all must be between 0 and 1, inclusive

55
New cards

What does it mean if the probability of an event (E) is 0?

the event is impossible, P(E)=0

56
New cards

What does it mean if the probability of an event (E) is 1?

the event is certain to happen, P(E)=1

57
New cards

The closer the probability is to 0 indicates...

the more unlikely the event is to be observed

58
New cards

The closer the probability is to 1 indicates...

the more likely the event is to be observed

59
New cards

What would you expect is P(E) = 0.5?

EXPECT 50% of repeated experiments to produce that outcome

60
New cards

What is the conditional probability equation?

P(A/B) = (P(A and B))/P(B)

<p>P(A/B) = (P(A and B))/P(B)</p>
61
New cards

What does it mean is P(A/B) = P(A)?

events A and B are said to be independent

the probability of A is not altered by conditioning to a specific group or event

62
New cards

what is science looking for in terms of probability and events?

looking for dependence among events, such as the probability of the event of stroke being dependent on high BP

63
New cards

What is a probability distribution?

Fully describes the probability of any outcome in the sample space

64
New cards

How does probability distribution guide expectations?

use reasonable and unreasonable expectations

understand and observe--> helps predict outcome

65
New cards

what are the characteristics of probability distributions?

-mean (center location)

-Variance (dispersion)

-Median/ Q1/ Q3 (percentiles)

-shape

66
New cards

what is sample mean X from your data estimating in a probability distribution?

estimating the probability distribution mean (u)

67
New cards

The observations (data) in your sample are inheriting characteristics from the probability distribution, so the sample mean X from your data is...

estimating the probability distribution's mean (u)

68
New cards

The observations (data) in your sample are inheriting characteristics from the probability distribution, so the sample variance s^2 from your data is...

estimating the probability distribution's variance (sigma^2)

69
New cards

The observations (data) in your sample are inheriting characteristics from the probability distribution, so the shape you see in a histogram/ boxplot is...

estimating the shape of the probability distribution

70
New cards

what are the statistics you calculate from data estimating?

estimating the parameters of the probability distribution that represents the population

71
New cards

Why do statisticians use probability distributions?

to model how the outcomes are distributed for the population

estimate 'population parameters'

72
New cards

What is a discrete uniform graph?

-equal amount of space between each line (symmetric)

-mean: the middle of the

<p>-equal amount of space between each line (symmetric)</p><p>-mean: the middle of the</p>
73
New cards

What is the Binomial Distribution?

-common distribution

-models outcomes that can only take on 2 values

-ex: prescribed a statin, either a person has been prescribed one or not

-assumes n independent trials where each trial has the probability pi of observing an event

-outcome X is the total number of events observed from n trials

74
New cards

what is the Binomial distribution equation?

pi= probability of observing an event

X= total number of events observed from n trial

n= independent trials

<p>pi= probability of observing an event</p><p>X= total number of events observed from n trial</p><p>n= independent trials</p>
75
New cards

what is the EXCEL equation for binomial distirbution?

=Binom.dist(4,10,.365,0)

X=4

pi= 0.365

n= 10

76
New cards

What is the mean equation for binomial distribution?

u= n*pi

77
New cards

what is the variance equation for binomial distribution?

(sigma^2)= npi(1-pi)

78
New cards

What is normal distribution?

a probability distribution used for continuous outcomes, such as height, IQ, serum cholesterol

79
New cards

What is the equation for normal distribution?

X~N(u,sigma)

80
New cards

What is standard normal distribution?

-normal distribution with u=0 and sigma=1

-Z~N(0,1)

-probability is measured area under the curve, meaning you can only assess probability for a 'range' such as P(Z<1)

81
New cards

When is standardizing used in normal distribution?

used to change the mean to 0 and standard deviation to 1, making it easier to calculate

82
New cards

What is the equation for standardizing?

Z= (X-u)/ (sigma)

83
New cards

What does the Central Limit Theorem state?

If n is sufficiently large, then the sample mean X-bar will follow an approximate Normal distribution with mean = u and standard deviation = (sigma)/ (square root n)

Assume you observe n independent and identically distributed observation (x1, x2, ... xn) from a probability distribution with the mean = u and standard deviation = sigma

84
New cards

why is the central limit theorem important?

states statistics follow probability distribution

85
New cards

For a sufficiently large sample size (n), what is the sampling distribution formula?

knowt flashcard image
86
New cards

What is the standard error of the mean?

the standard deviation of the sampling distribution of the mean

<p>the standard deviation of the sampling distribution of the mean</p>
87
New cards

what is the null hypothesis?

H0= u=u0

person is innocent

88
New cards

What is the alternative hypothesis?

Ha= u ≠ ua

89
New cards

What is a type 1 error?

when we reject the null hypothesis when it's true

ex: person convicted when they are innocent

90
New cards

What is a type 2 error?

we fail to reject the null hypothesis when it is false

ex: declaring a person NOT GUILTY when the person is truly guilty

91
New cards

what is the p-value?

probability of observing the test statistics or one even more extreme given the null hypothesis is true

92
New cards

what does alpha represent in hypothesis testing?

represent the probability of a Type 1 error rate

probability of rejecting the null hypothesis when it is true

often set at 0.05 or lower

93
New cards

what does beta represent in hypothesis testing?

-represents the probability of a Type II error

94
New cards

how are alpha and beta related?

inversely related

when you choose a very low level of alpha, you actually increase beta

95
New cards

what is Statistical Power?

probability of rejecting the null hypothesis Ho given it is false; compliment of beta, so power = 1-beta

96
New cards

what will be the effect of power if delta (uA-uO) increases?

power will increase

97
New cards

what will be the effect of power if delta (uA-uO) decreases?

power will decrease

98
New cards

what will be the effect of power if n (sample size) increases?

power will increase

99
New cards

what will be the effect of power if n (sample size) decreases?

power will decrease

100
New cards

what will be the effect of power if alpha (type 1 error) increases?

power will increase