Biostatistics for the Health Sciences - Everything for 1st Test

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/119

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

120 Terms

1
New cards

Nominal

order of categories irrelevant (also called unordered)

2
New cards

Ordinal

order of categories is meaningful (also called ordered)

3
New cards

Binary

Special case of categorical variable - only 2 possible values (also called dichotomous)

4
New cards

Discrete

Values equal to integers

5
New cards

Continuous

Values on a continuum

6
New cards

Is nominal data categorical or ordinal?

Categorical

7
New cards

Is blood type categorical or ordinal

Categorical

8
New cards

Is died of cancer binary or nonbinary

Binary

9
New cards

What are types of categorical data

Nominal, ordinal, binary

10
New cards

What are types of quantitative data

Discrete and continuous

11
New cards

Continuous examples

Blood pressure

Weight

Age

Lead

12
New cards

Quantitative examples

Number of babies out of 100 births who have low birth weight

Number of admissions to the emergency room

13
New cards

For a discrete variable, it isn’t sensible to

consider a value between two numbers (e.g. 1.5 heart attacks doesn’t make sense)

14
New cards

Although a continuous variable may be measured in whole numbers, it is still sensible

to consider a value between two numbers (age – 16.5 years old)

15
New cards

A quantitative measurement may be categorized and treated as a categorical variable

for the purpose of summaries (>25 years etc.)

16
New cards

Categorical variables are sometimes called ____, especially in stats classes

factors

17
New cards

A categorical with inherent logical ordering (age brackets) may be treated as

nominal in some analysis

18
New cards

Categorical data is usually summarized by:

  • The proportion (percent) of observations in each of the categories

  • The number in each category (frequency / count)

  • Important to provide the totals (denominators of percentages)

  • N or n is often used to represent the total

  • N = sample size

19
New cards

Pie graphs represent a

categorical variable pictorial

20
New cards

Pie graphs display

data as a percentage of the whole

21
New cards

Pie graphs require

proportional reasoning

22
New cards

Pie graphs are especially difficult with

Ordinal data

23
New cards

Pie graphs are

not the best way to summarize data, but are common in media/non-expert reports

24
New cards

Pie graphs are most appropriate

And better when

When the aim is to convey the relative size of the parts of a whole

Better when there are not too many categories (3-7)

25
New cards

Bar graphs present a

summary measure for each category by a bar

26
New cards

BARGRAPHS: For an ordinal categorical variable, order the bars

in the order of the categories

27
New cards

BARGRAPHS: For a nominal categorical variable, choose

an ordering that aids understanding (for ex, alphabetical or lowest to highest)

28
New cards

For quantitative data, we are usually interested in

The distribution of the observations

  • What are the most common or average values (center of the data?)

  • How spread out are the data? (variability of the data)

  • Are there some values far from the bulk of the data (outliers?)

29
New cards

Strategies for distribution of quantitative data

  • Visualize the distribution of the data with a graph

  • Summarize key aspects of the distribution with descriptive statistics, numerical descriptions of the center and spread of the data

30
New cards

Bar graphs can be

Stacked

31
New cards

Stacked bar graph methods

Can do totals, percent out of 100, or other

32
New cards

The histogram is a

graphical display of the distribution of quantitative data

33
New cards

Histogram: Horizontal scale (x) corresponds

to the values of the quantitative variable

34
New cards

Histogram: The x-axis is broken into a

contiguous series of sub-intervals (“classes” or “bins”)

35
New cards

Histogram: Bars are drawn that indicate the

frequencies or percentages of observations within each interval

36
New cards

Histogram:

  • The area of each bar corresponds to the number of observations in each bin

  • If all bins have same width, then heights of the of bars also correspond to the number of observations in each bin

37
New cards

Information from a histogram

  • Typical values

  • How much variation is present

  • Shape of the distribution

  • Unusual or outlying values

  • Approximate frequency or percentage in a given range

38
New cards

Histograms can be sensitive to bin width (or cut-points)

Rule of thumb:

use number of bins = square root of number of observations

39
New cards

In a histogram, each observation is given

one unit of area

40
New cards

With unequal bin widths, easier

to use a bar graph

41
New cards
  • Unequal intervals are really a

  • Similarly for ordinal variable, more common to use term

  • bar chart – does not necessarily show shape of distribution

  • bar chart (concept of the distribution not the same as for quantitative variables)

42
New cards

We reserve the term histogram for

quantitative variables

43
New cards
term image

Opposite of everyday use

<p><span style="background-color: transparent;">Opposite of everyday use</span></p>
44
New cards

One difference between histogram and bar-graph is that the number of observations is shown

by the area under the curve, or the integral of the category

45
New cards

Reason would have longer data is

less observations

46
New cards

If you have the data, use

equal size bins

47
New cards

In a proper histogram, each observation is

given one unit of area so the histogram reflects the shape of the distribution

48
New cards

In a bar graph, observations may not always be given the same unit of area so the bar graph

may not reflect the shape of the distribution

49
New cards

Histogram needs to show

SHAPE OF DISTRIBUTION!! (not equal sized bins)

50
New cards

Histograms for

quantitative data and to represent shape

51
New cards

Similarly for ordinal variable, more common to use the

term bar chart

52
New cards

We will reserve the term histogram for

quantitative variables

53
New cards

Stem and leaf plot:

another graph for quantitative data;

54
New cards

STEM AND LEAF:

Decimal point is 1 digit to the left of |

55
New cards

(S&P) 0 | 2 2 3 =

0.02, 0.02, 0.03

56
New cards

Numerical summaries for quantitative data

Central tendency and variation

57
New cards

Central tendency

The “middle” of the data

58
New cards

Variation

How “spread out” the data is

59
New cards

X bar

average/mean

60
New cards

Median

Middle point; half bigger half smaller

61
New cards

Mode

Most common value in the dataset

62
New cards

X bar = formula

(summation i=1 to n (Xsubi)) over n

Xsubi means the ith ordered observation in the data

63
New cards

Reasons why something can be the mode in a lead detection study

Lowest point to be detected could be why

64
New cards

Mean is sensitive to

Outliers

65
New cards

For right (positively) skewed data:

mean > median

66
New cards

For left (negatively) skewed data:

median > mean

67
New cards

Median is ___ to outliers

resistant

68
New cards

If symmetrical

Mean=median

69
New cards

Range

Smallest and largest, sometimes shown as difference between them

70
New cards

Interquartile range

25th and 75th percentiles, sometimes shown as difference between them

71
New cards

Computing IQR for 25th percentile

33 x .25.= 8.25, choose the 9th value

Value with 25% of data below it

72
New cards

Range and standard deviation

are sensitive to outliers

73
New cards

IQR is

less sensitive to outliers

74
New cards

B&W: top of box =

75th percentile

75
New cards

B&W: line in middle of box =

median

76
New cards

B&W: line in bottom of box =

25th percentile

77
New cards

B&W: whisker at top

largest value less than Q3 + 1.5 IQR

78
New cards

B&W: whisker at bottom

Smallest value greater than Q1 - 1.5IQR

79
New cards

B&W: dots

Outliers

80
New cards

Can use ___ box plots to show categories

Side by side

81
New cards

Cumulative incidence

Is the proportion (fraction) of individuals newly acquiring the disease (outcome) over a specified period of time

82
New cards

Cumulative incidence =

number of new cases / number at risk

83
New cards

Contingency table

Summarizes the information from two categorical variables (think treatment, cold-yes, cold-no, and total)

84
New cards

Risk factor

Variable that may increase or decrease the chance (risk) of outcome

85
New cards

Difference between treatment, risk factor, and exposure

Treatment is just type, but can be risk factor if it increases/decreases risk; exposure is just whether they were exposed (and would curtail exposed / unexposed categories)

86
New cards
term image
knowt flashcard image
87
New cards

Relative risk formula

knowt flashcard image
88
New cards

Relative risk table

knowt flashcard image
89
New cards

Relative risk is

Risk of treated over risk of untreated, max can be 1

Summary measure of association between risk factor and outcome; 0 <= RR <= infinity

90
New cards

If RR <1

Treatment is associated with lower risk of outcome

91
New cards

If RR > 1

Treatment is associated with higher risk of outcome

92
New cards

If RR = 1

No association of treatment with outcome

93
New cards

Depending on study design, RR may have a

Causal interpretation:

If <1, treatment lowers risk; treatment is beneficial

If RR >1, treatment increases risk

If RR = 1, no effect on outcome

94
New cards

RR = 1.1

10% higher risk

95
New cards

RR = 2.5

150% higher risk

96
New cards

RR = 0.6

40% lower risk

97
New cards

Risk difference formula

Risk treated - risk untreated = (a over a+b), - (c over c+d)

98
New cards

Risk difference, RD

Summary measure of association between risk factor and outcome

99
New cards

__ <= RD <= __

-1, 1

100
New cards

If RD < 0

Treatment is associated with lower risk of outcome