Statistics Terms & Definitions: EBCR3 EX2 Comprehensive Study

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/150

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

151 Terms

1
New cards

You're reading a study that is evaluating the effect of different vitamin supplements on the development of colon cancer. The independent variable is _________ and the dependent variable is ___________.

A. Vitamin supplements; colon cancer

B. Colon cancer; vitamin supplements

C. A continuous variable; a nominal variable

D. A nominal variable; a continuous variable

A. Vitamin supplements; colon cancer

2
New cards

In the study from KC #1, data on the use of vitamin supplements (VS) is divided into "never taken", "taken 1-10 days/month", "taken 11-20 days/month", or "taken >20 days/month". The development of colon cancer (CC) over the study period is classified as "yes" or "no". VS is best described as a ____variable and CC is best described as a_____ variable.

A. Continuous; nominal

B. Ordinal, nominal

C. Nominal, continuous

D. Nominal, ordinal

B. Ordinal, nominal

3
New cards

Two types of numeric variables?

Continuous (interval; no meaningful zero OR ratio; meaningful zero)

Discrete (integer data)

4
New cards

Categorical data types? (two)

Nominal

Ordinal

5
New cards

Which measure of central tendency is most sensitive to outliers in the data?

A. Mean

B. Median

C. Mode

A. Mean

6
New cards

When are medians preferred?

With numeric or continuous data that is not normally distributed; also for some ordinal data.

7
New cards

When are means or averages preferred?

Numeric or continuous data that is normally distributed

8
New cards

When data is arranged in numeric order, the middle 50% of data values is called the?

A. Range

B. Standard deviation

C. The interquartile

C. The interquartile range

9
New cards

Describe the interquartile range (IQR)

Middle 50% of the data values when arranged in numerical order

10
New cards

Describe a standard deviation

Measures the spread of data around the mean

The larger the SD, the more spread out or variable the data is

11
New cards

A frequency distribution that contains data values which are much higher than the mean ("outliers") is

A. Left skewed

B. Right skewed

C. Normal

B. Right skewed

<p>B. Right skewed</p>
12
New cards

What does left skewed data look like on a graph?

(has outliers much lower than the mean)

<p>(has outliers much lower than the mean)</p>
13
New cards

Normal distribution is otherwise known as what?

Gaussian distribution

14
New cards

T/F: In normally distributed data, the mean/median/mode are the same value.

True

15
New cards

What percentage of data values are within one standard deviation of the mean in gaussian distribution? Two standard deviations?

68% in one

95% in two

16
New cards

T/F: As the SD (variation in the data) increases, the curve becomes flatter and vice versa.

True

17
New cards

An investigator would like to compare Drug A (new) to Drug B (standard of care) in the treatment of rheumatoid arthritis. The null hypothesis for this study is:

A. Drug A is superior to Drug B

B. Drug A is inferior to Drug B

C. Drug A is no different than Drug B

C. Drug A is no different than Drug B

18
New cards

Population parameters (4 examples)

1. Mean

2. Variance

3. Standard deviation

4. Odds ratio

19
New cards

Describe point estimates in inferential statistics

Sometimes called statistics, estimate the corresponding population parameter

20
New cards

T/F: Inferential statistics starts with a research question.

True

21
New cards

An investigator, Dr. Z, would like to compare Drug A (new) to Drug B (standard of care) in the treatment of lung cancer. Alpha is 0.05 and beta is 0.20. Dr. Z would like to detect a 20% difference in in the response rate between A and B.

What is the likelihood of concluding there is a difference when there really isn't one?

A. 5%

B. 95%

C. 20%

D. 80%

A. 5%

22
New cards

Describe the significance of an alpha level

Expression of the likelihood of concluding there is a difference when there really isn't one

Type 1 error

False positive

23
New cards

Describe the significance of a beta level

Expression of the likelihood of concluding there is

24
New cards

Define "risk"

Risk = number of subjects who develop an event (incidence) divided by total number in that group

25
New cards

Define "relative risk"

Ratio of risk in the exposed treatment group divided by the risk in the unexposed or control group

26
New cards

Relative risk measurements are inappropriate in which of the following study types

A. Case-control studies

B. Cohort studies

A. Case-control studies

27
New cards

How to calculate risk in exposed group?

Exposed with event / Exposed with event + Exposed without event

28
New cards

How to calculate risk in unexposed group?

Unexposed with event / Unexposed with event + Unexposed without event

29
New cards

How to calculate relative risk between exposed and unexposed groups?

Risk in exposed / Risk in unexposed

30
New cards

How to interpret a relative risk ratio of > 1?

Risk of outcome is increased in the treatment or exposed group

31
New cards

How to interpret a relative risk ratio of < 1?

Risk of outcome is decreased in the treatment or exposed group

32
New cards

How to calculate absolute risk reduction?

Risk with standard treatment (%) - Risk with new treatment (%)

33
New cards

Describe number needed to treat/harm (NNT/NNH)

Number of patients needed to be treated in order for one patient to benefit or number of patients needed to be treated in order for one patient to experience an adverse event

34
New cards

How to calculate NNT?

1/ARR

round up for NNT, round down for NNH

35
New cards

Define "Odds ratio"

Odds of an event occurring in "exposed" individuals divided by the odds of an event occurring in "unexposed" individuals

36
New cards

In what study type are odds ratios most commonly used?

Case-control studies

37
New cards

Your PLRP group will be doing a chart review study to evaluate the incidence of hepatotoxicity of a new antibiotic recently approved for children hospitalized for meningitis. You will compare this to the standard antibiotic used.

Your PLRP mentor asks what statistical test you will use to analyze your results.

Which of the following tests is the correct choice?

A. Chi-square

B. Unpaired t-test

C. ANOVA

D. Logistic regression

A. Chi-square

38
New cards

During a journal club presentation, your preceptor asks about the appropriate statistical test that should be used when evaluating the difference in pain scores (measured by the scale shown below) before and after a group of patients are given a new combination opioid and NSAID analgesic.

Which test would be correct?

A. Fisher's exact test

B. Paired t-test

C. Wilcoxan signed-rank test

D. Mann Whitney U test

D. Wilcoxan signed-rank test

Two sets of paired ordinal data

39
New cards

Describe "paired" data

Samples are dependent

(before and after evaluation in the same group of individuals)

40
New cards

Describe "unpaired" data

Samples are independent

(two different groups of individuals)

41
New cards

When do we use parametric tests?

When the dependent variable is numeric AND normally distributed

(i.e. t-test, ANOVA)

42
New cards

When do we use nonparametric tests?

When the dependent variable is numeric, but NOT normally distributed

(i.e. Wilcoxan rank sum, Kruskal-Wallis)

43
New cards

Which two tests are used for unpaired nominal data? (<2 groups)

Chi squared test

Fisher's exact test

44
New cards

Which test is used in paired nominal data? (<2 groups)

McNemar's test

45
New cards

Which two tests are used for unpaired ordinal data? (<2 groups)

Wilcoxon two sample test

Wilcoxan rank sum test (aka Mann Whitney U test)

46
New cards

Which test is used in paired ordinal data? (<2 groups)

Wilcoxon paired sign rank test

47
New cards

Which test do we use for unpaired continuous data? (<2 groups)

Unpaired t-test

48
New cards

Which test do we use for paired continuous data? (<2 groups)

Paired t-test

49
New cards

When do we use a Fisher's Exact test as opposed to a chi-squared test?

Small sample sizes (<5)

50
New cards

When do we use a Marginal Homogeneity test?

Paired nominal data (>2 groups)

51
New cards

When do we use a Kruskal-Wallis test?

Unpaired ordinal data (>2 groups)

52
New cards

When do we use a Friedman test?

Paired ordinal data (>2 groups)

53
New cards

When do we use an ANOVA test?

Unpaired continuous data (>2 groups)

54
New cards

When do we use a repeated-measure ANOVA?

Paired continuous data (>2 groups)

55
New cards

You're preparing for a journal club presentation fo a study assessing the mean change in cholesterol in 3 different groups of patients. Results show: A=12mg/dL, B=16mg./dL, C=24 mg/dL. An ANOVA test was used to evaluate the results. The p value is less than alpha so you can conclude a statistically significant result.

The appropriate conclusion based on the ANOVA test is:

A. C is better than B

B. B is better than A

C. C is better than A

D. One of the groups is different from at least one of the other groups

D. One of the groups is different from at least one of the other groups

56
New cards

T/F: When testing more than 2 groups, determination of between which two groups a difference lies is not possible.

True

57
New cards

T/F: As you conduct more and more tests, your chance of a false positive increases (alpha increases).

True

58
New cards

What variety of test do we use to determine which groups are different from each other (after testing more than 2 groups)?

Post hoc test

59
New cards

Examples of post hoc tests?

Bonferroni test

Tukey's test

60
New cards

Predictor = ____________

A. Independent variable

B. Dependent variable

A. Independent variable

61
New cards

If our predictor is continuous and our outcome is continuous, what test do we use?

Correlation/linear regression

62
New cards

If our predictor is continuous and our outcome is categorical, what test do we use?

Logistic regression

63
New cards

If our predictor is categorical and our outcome is continuous, what test do we use?

ANOVA/General linear model

64
New cards

If our predictor is categorical and our outcome is categorical, what test do we use?

Kappa or kendall/spearman correlation

65
New cards

Describe "correlation" (positive and negative)

Increase in independent variable -> increase in dependent variable, positive correlation

Increase in independent variable -> decrease in dependent variable, negative correlation

66
New cards

What is a phase III study?

A prospective study comparing the effect of an investigational intervention to a control in human beings

67
New cards

T/F: Phase III studies can be comprised of animal participants.

False

68
New cards

T/F: Phase III studies usually employ randomization and blinding.

True

69
New cards

What is the general purpose of phase III studies?

The purpose is generally to demonstrate the clinical use of the intervention and establish a safety profile.

70
New cards

Describe "clinical equipoise"

A state of genuine uncertainty as to the advantages/disadvantages of one therapy versus another

71
New cards

When is a phase III trial needed?

When clinical equipoise exists

When Phase II data exists to suggest a new therapy might be better than the standard, but more evidence is needed

72
New cards

Challenges associated with Phase III trial design?

Medical care has improved dramatically; thus harder to show major therapeutic differences

Requires a larger sample size to achieve respectable power

Newer therapies may offer other advantages besides better efficacy; similar efficacy but lower cost, better compliance, fewer side effects etc.

73
New cards

Key characteristics of superiority designs?

Generally ask "Is new therapy different (hopefully better) than standard?"

"How much is "better/different?" (effect size)

Statistical analyses are generally two-tailed (interested in either outcome) (null hypothesis being that there is no difference)

May be one-tailed - easier to find a difference; if the results are in the unexpected direction, cannot state the difference is significant

74
New cards

When can selection bias occur?

Can occur if randomization is predictable or known (also called allocation bias)

Can also occur if subjects selected for the study are different than population of interest

75
New cards

When can accidental bias occur?

Can occur if the randomization process does not achieve balance of risk factors

More of an issue in smaller studies

76
New cards

When can investigator or assessment bias occur?

Can occur when investigator is assessing outcome in unblinded studies

77
New cards

When can follow up bias occur?

Lost to follow up occurs preferentially in one group over the other

78
New cards

T/F: Phase III interventions can be one or a combination of diagnostic, preventative or therapeutic drugs, biologics, devices, or procedures.

True

79
New cards

How are interventions applied in Phase III trials?

Standard, prespecified method to all participants throughout the course of the study

80
New cards

Randomization minimizes what?

Allocation bias and confounding

81
New cards

T/F: An independent party should be responsible for developing and implementing the randomization process.

True

82
New cards

T/F: Randomization is usually generated by a computer program that is "invisible" to investigators.

True

83
New cards

Three types of fixed randomization?

Simple

Blocked

Stratified

84
New cards

Describe "simple" randomization

Basic, like flipping a coin or using a random number generator

Easy

Can result in imbalance of the arm allocation, especially if the sample size is small

85
New cards

Advantage + disadvantage of block randomization?

Advantage: Allocation is equal at anytime during the trial

Disadvantage: May be able to predict final assignment in each block if block size is known (allocation bias)

86
New cards

T/F: Stratified randomization requires that prognostic factors be known at the time of randomization.

True

87
New cards

T/F: The number of strata in stratified randomization is restricted by sample size.

True

88
New cards

What is an alternative to stratification?

Prespecify a statistical adjustment for potential confounding factors (i.e. regression analysis)

89
New cards

When are "dummy" designs employed?

Used when control and intervention medications are given by different routes or on different schedules

As a method of maintaining blinding

90
New cards

What is the most commonly used Phase III randomized trial design?

Parallel trials

91
New cards

What is the intention behind parallel trial designs?

Subject stays on the same treatment arm throughout the study

92
New cards

Describe "crossover" randomized trial designs

Participants are their own controls since each patient is in the control arm and the experimental treatment arm

Requires a smaller sample size than do parallel trials

Washout period may be appropriate

93
New cards

In which of the following Phase III randomized trial designs are losses to follow up most detrimental

A. Crossover

B. Parallel

A. Crossover

94
New cards

T/F: Crossover trials are a good design for "waxing and waning" diseases or diseases that require urgent treatment.

False; not really good to withhold treatment in these participants.

95
New cards

Key features of composite outcomes?

Outcomes should be related

Generally reduces the sample size needed

MACE events are a good example

Interpreting composite outcomes becomes problematic if results are inconsistent

96
New cards

Measurement of outcome variables should be what? Especially when?

Should be standardized, validated, and free of bias; ESPECIALLY if trial is not double blinded

97
New cards

Describe a surrogate outcome/endpoint?

For example decrease in LDL-C -> reduction in ASCVD

An outcome that achieves something if the outcome is not specific, like MACE

98
New cards

Which of the following outcome types are typically described as "exploratory?"

A. Primary

B. Secondary

C. Tertiary

C. Tertiary

99
New cards

T/F: As more statistical tests (comparisons) are done, a "significant" finding may be found just by chance, thus increasing the likelihood of making a type I error.

True

100
New cards

What can be done to adjust for multiple comparisons?

Use a smaller alpha (Bonferroni's correction: if k comparisons are being made, then for each comparison, alpha should be divided by k)

Hierarchal procedures (Holm's and Hochberg's procedures) - these are rank comparisons done by level of importance