finding and appraising evidence for diagnostic tests and clinical measures

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/60

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

61 Terms

1
New cards

diagnostic tests

“special tests”: clincial examination techniques

tests performed and/or interpreted by others (ex. radiographs, labs)

2
New cards

measures

techniques we perform to quantify a pts impairment, activity limitation, or participation restriction

3
New cards

index test

new test or tool that researchers evaluate for diagnostic accuracy as per a specific condition

4
New cards

reference standard

current best available method for diagnosing a condition

used as a benchmark for the index test

5
New cards

gold standard

ultimate standard

best when this is ref standard

6
New cards

prospective design without randomization

consecutive pts who meet eligibility criteria are recruited

participants may or may not have target condition

all participants receive index test and ref standard

results from index and ref are determiend

7
New cards

prospective design w/ randomization

consecutive pts who meet eligibility criteria are recruited

participants may or may not have target condition

random assignment assigns participants to index, all receive same ref standard

results from each index and ref standard are determined

8
New cards

case control

pts who meet eligibility criteria and are known to have or not have target condition are recruited

all participants receive index test and same ref standard

results from index test and ref standard are determined

9
New cards

rater reliability

are ppl conducting the measurements achieving consistent measures?

types: intra (within) and interrater (between)

10
New cards

test reliability

given no real change, are measures stable over time?

type: test-retest

ex. ppl answering survey of pt satisfaction respond similarly when given same survey 2 weeks apart

11
New cards

equivalence reliability

do different measurement techniques of the same phenomenon provide equivalent results?

type: parallel (alternate)

ex. spanish translation of a test of items to diagnose dementia provide same results as original english

12
New cards

internal consistency reliability

do all items or components of measurement technique provide info about phenomenon?

type: split half reliability, item reliability

13
New cards

split half reliability example

correlation btwn scores on the even and odds item on a pt questionnaire of self-perception of health is high

14
New cards

item reliability example

each item on a pt questionnaire of self-perception of health correlates highly w/ total scorecard

15
New cards

nominal data statistics

percent agreement

kappa (k)

16
New cards

ordinal data statistics

percent agreement

weighted kappa (kw)

17
New cards

interval ratio data statistics

percent agreement

intraclass correlation coefficient (ICC)

18
New cards

percent agreement

values range from 0 to 100% (higher values=more agreeemnt)

influenced by prevalence

does not remove chance agreement (values inflated)

cannot determine probalbity (p value) of % agreement or confidence intervals

19
New cards

kappa (k)

values range from -1.0-1.0 (higher values=more agreement; negative values=worse than chance)

removes chance agreement (values are not inflated due to chance agreement)

influenced by prevalence

can determine probability (p value) of kappa and CIs

20
New cards

weighted kappa

values range from -1.0 and 1.0 (higher values indicate more agreement; negative values indicate agreement worse than chance)

removes chance agreement (values are not inflated due to ^)

can determine probability (p value) of kappa and CIs

21
New cards

intraclass correlation coefficient (ICC)

can examine where sources of error arise (raters, subjects)

3 models, each w/ 2 types

higher values=more agreement (1.0=perfect; 0.0 and negative=no agreement)

can have more than 2 raters and unequal observations

infl. by range of measures

role of chance agreement

can determine probability (p value) of ICC and CIs

22
New cards

face validity

tool or method of measurement appears appropriate for the stated purpose

assessment is subjective rather than statistical

23
New cards

construct validity

a phenomenon (ex. QoL) may have several aspects (domains, facets)—all domains must be measured

constructs not typically measured directly, can be assessed statistically and by other types of validity

24
New cards

content validity

for each domain of a phenomenon, items/content are developed to assess that domain

experts are asked if items adequately represent domain, if items are clear

25
New cards

criterion related validity

results of new test or measure are compared to a criterion

26
New cards

concurrent validity

new test/measure is given close in time to a well-established test or measure of same phenomenon

correlation coefficients are often used to assess concurrent validity

27
New cards

predictive validity

predictions based on measurements or scores from the new test are used to make predictions of a future behavior or outcome

28
New cards

sensitivity

ability of the test to correctly identify (+ test result/true positive) in someone with the disorder

formula: patients w/ disorder who test positive/all pts with disorder

29
New cards

specificity

ability of the test to correctly identify (- test result/true negative) in someone without the disorder

formula: pts w/o disorder who test negative/all pts without disorder

30
New cards

positive predictive value (PPV)

ability of the test to correctly determine % of people with disorder from all of the people w/ positive test results

formula: pts w disorder who test positive/all pts who test positive

31
New cards

negative predictive value (NPV)

ability of the test to correctly determine % of people without the disorder from all of the people with a negative test result

formula: pts w/o disorder who test negative/all pts who test negative

32
New cards

positive likelihood ratio (LR+)

likelihood that a positive test result was observed in a person with the disorder vs. a person without disorder of interest

probability of identifying true positive

formula: sensitivity/1-specificity

33
New cards

negative likelihood ratio (LR-)

likelihood that a negative test result is observed in a person with the disorder vs. in a person without the disorder of interest

probability of identifying true negatives

formula: 1-sensitivity/specificity

34
New cards

likelihood ratio

combines both sensitivity and specificity into a single measure of diagnostic performance

35
New cards

ROCs

used to compare diagnostic tests across different thresholds of sensitivity and specificity

<p>used to compare diagnostic tests across different thresholds of sensitivity and specificity</p>
36
New cards
<p>top purple line</p>

top purple line

positive upper limit

37
New cards
<p>red line</p>

red line

zero/no difference

38
New cards
<p>blue line</p>

blue line

mean difference btwn 2 exposures

39
New cards
<p>bottom purple line</p>

bottom purple line

negative lower limit

40
New cards
<p>distance btwn red and blue lines</p>

distance btwn red and blue lines

bias

41
New cards

pre-test probability

prevalence (%)

42
New cards

pre-test odds

what you think the odesa re that the pt has the disorder before you conduct the diagnostic test

43
New cards

post-test odds

what you think the odds are that the pt has the disorder after you conduct the diagnostic test

44
New cards

post-test probability

probability of the disorder once the test results are obtained

=pretest odds*LR+

45
New cards

LR+ > 10 or LR- <0.10

large effect

46
New cards

LR+ = 5-10 or LR- = 0.10-0.20

moderate effect

47
New cards

LR+ = 2-5 or LR- =0.20-0.50

small effect, sometimes important

48
New cards

LR+ = 1-2 or LR- =0.50-1.0

negligible

49
New cards

p value

probability that the result occurred due to chance or otherwise

50
New cards

confidence interval

range of values within which the true value is estimated to lie within a certain (usually 95%) probability

51
New cards

should you use this evidence?

is the study high quality (e.g. does the design minimize bias?)?

are the results important enough to use?

is the test or measure of interest available, practical, and safe for application in the clinical setting?

was your pt represented in the study sample?

can you estimate the pre test probability of the disorder and is it worth proceeding w/ the test especially if costly?

pt values or circumstances—risk of injury, important benefits, cost effectiveness, belief in evidence, previous experiences

52
New cards

verification bias

relates to reference standard

not all participants receive ref standard

53
New cards

incorporation bias

relates to ref standard

some or all of ref standard is incorporated within index test

54
New cards

differential verification bias

relates to ref standard

some participants receive different ref standard

55
New cards

observer bias

relates to examiners giving index test or ref standard

examiners conducting either ref standard or index test know clinical presentation of the participant

56
New cards

referral filter

relates to participants

possible participants are referred to the study due to suspicion of disease/disorder

57
New cards

diagnostic/revivew bias

relates to examiners giving index test or ref standard

examiners conducting either ref standard or index test/measure know result of other test/measures

58
New cards

spectrum bias

relates to participants and study design when a case control study is used

participants do not have a wide range of disease severity, comorbidities, or demographics that interact with the index or ref test

59
New cards

disease progression bias

relates to participants

the target condition changes in severity (either worsened or gets better) btwn administration of the index and reference tests

60
New cards

QUADAS tool

appraising a diagnostic study

61
New cards

assessment of study credibility-diagnostic tests or measures

did the investigators include subjects w/ all levels or stages of condition being evaluated by the index test? (spectrum bias)

was the time btwn index & ref tests short enough to rule out change in condition? (disease progression bias)

were all participants included in final analysis? if data was missing, was proportion of tests missing significant? (selection bias)—sensitivity testing

were the selection criteria for participants clearly described? (selection bias)

were the people interpreting either the ref standard and/or the index test blinded to result of the opposite test? (review bias)

was the reference standard independent of the index test? (incorporation bias)

was the ref standard given to all participants? (verification bias)

were individuals interpreting each test/measures results unaware (mask/blind) of the other tests results? (measurement bias)

was the time btwn application of the index test and the gold standard comparison test short enough to minimize the opportunity for change in the subjects’ condition? (disease progression bias)