finding and appraising evidence for diagnostic tests and clinical measures

New cards

diagnostic tests

“special tests”: clinical examination techniques

tests performed and/or interpreted by others (ex. radiographs, labs)

New cards

measures

techniques we perform to quantify a pt's impairment, activity limitation, or participation restriction

New cards

index test

new test or tool that researchers evaluate for diagnostic accuracy for a specific condition

New cards

reference standard

current best available method for diagnosing a condition

used as a benchmark for the index test

New cards

gold standard

ultimate standard

ideal when this is used as the ref standard

New cards

prospective design without randomization

consecutive pts who meet eligibility criteria are recruited

participants may or may not have target condition

all participants receive index test and ref standard

results from index and ref are determined

New cards

prospective design w/ randomization

consecutive pts who meet eligibility criteria are recruited

participants may or may not have target condition

participants are randomly assigned to one of the index tests; all receive the same ref standard

results from each index and ref standard are determined

New cards

case control

pts who meet eligibility criteria and are known to have or not have target condition are recruited

all participants receive index test and same ref standard

results from index test and ref standard are determined

New cards

rater reliability

are ppl conducting the measurements achieving consistent measures?

types: intrarater (within) and interrater (between)

New cards

test reliability

given no real change, are measures stable over time?

type: test-retest

ex. ppl answering survey of pt satisfaction respond similarly when given same survey 2 weeks apart

New cards

equivalence reliability

do different measurement techniques of the same phenomenon provide equivalent results?

type: parallel (alternate)

ex. Spanish translation of a test used to diagnose dementia provides the same results as the original English version

New cards

internal consistency reliability

do all items or components of measurement technique provide info about phenomenon?

type: split half reliability, item reliability

New cards

split half reliability example

correlation btwn scores on the even- and odd-numbered items on a pt questionnaire of self-perception of health is high

New cards

item reliability example

each item on a pt questionnaire of self-perception of health correlates highly w/ total score

New cards

nominal data statistics

percent agreement

kappa (k)

New cards

ordinal data statistics

percent agreement

weighted kappa (kw)

New cards

interval/ratio data statistics

percent agreement

intraclass correlation coefficient (ICC)

New cards

percent agreement

values range from 0 to 100% (higher values=more agreement)

influenced by prevalence

does not remove chance agreement (values inflated)

cannot determine probability (p value) of % agreement or confidence intervals

New cards

kappa (k)

values range from -1.0 to 1.0 (higher values=more agreement; negative values=worse than chance)

removes chance agreement (values are not inflated due to chance agreement)

influenced by prevalence

can determine probability (p value) of kappa and CIs
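
The difference btwn percent agreement and kappa can be sketched in a few lines of Python. This is a minimal illustration for two raters and nominal data, not a full statistical routine; the function names and the ratings below are hypothetical and exist only for this example.

```python
from collections import Counter

def percent_agreement(rater_a, rater_b):
    """Share of cases on which the two raters gave the same rating."""
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)

def cohens_kappa(rater_a, rater_b):
    """Kappa for two raters: observed agreement corrected for chance agreement."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: for each category, multiply the two raters'
    # marginal proportions, then sum over all categories.
    categories = set(rater_a) | set(rater_b)
    p_chance = sum((counts_a[c] / n) * (counts_b[c] / n) for c in categories)
    # Undefined (division by zero) if chance agreement is exactly 1.
    return (p_observed - p_chance) / (1 - p_chance)

# Hypothetical data: two raters scoring a special test on the same 10 pts.
rater_a = ["pos", "pos", "pos", "pos", "neg", "neg", "neg", "neg", "neg", "neg"]
rater_b = ["pos", "pos", "pos", "neg", "neg", "neg", "neg", "neg", "neg", "pos"]

print(round(percent_agreement(rater_a, rater_b), 2))
print(round(cohens_kappa(rater_a, rater_b), 2))
```

With these made-up ratings the raters agree on 8 of 10 pts, but kappa comes out lower than the raw percent agreement because part of that agreement is expected by chance.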

New cards

weighted kappa

values range from -1.0 to 1.0 (higher values indicate more agreement; negative values indicate agreement worse than chance)

removes chance agreement (values are not inflated due to chance agreement)

can determine probability (p value) of kappa and CIs

New cards

intraclass correlation coefficient (ICC)

can examine where sources of error arise (raters, subjects)

3 models, each w/ 2 types

higher values=more agreement (1.0=perfect; 0.0 and negative=no agreement)

can have more than 2 raters and unequal observations

influenced by range of measures

role of chance agreement

can determine probability (p value) of ICC and CIs

New cards

face validity

tool or method of measurement appears appropriate for the stated purpose

assessment is subjective rather than statistical

New cards

construct validity

a phenomenon (ex. QoL) may have several aspects (domains, facets)—all domains must be measured

constructs not typically measured directly, can be assessed statistically and by other types of validity

New cards

content validity

for each domain of a phenomenon, items/content are developed to assess that domain

experts are asked if items adequately represent domain, if items are clear

New cards

criterion related validity

results of new test or measure are compared to a criterion

New cards

concurrent validity

new test/measure is given close in time to a well-established test or measure of same phenomenon

correlation coefficients are often used to assess concurrent validity

New cards

predictive validity

measurements or scores from the new test are used to predict a future behavior or outcome

New cards

sensitivity

ability of the test to correctly identify someone with the disorder (+ test result/true positive)

formula: patients w/ disorder who test positive/all pts with disorder

New cards

specificity

ability of the test to correctly identify someone without the disorder (- test result/true negative)

formula: pts w/o disorder who test negative/all pts without disorder
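
The sensitivity and specificity formulas can be worked through with a 2x2 table. The sketch below assumes hypothetical counts (50 pts with the disorder, 100 without); the function and variable names are illustrative only.

```python
def sensitivity(true_pos, false_neg):
    """Pts w/ disorder who test positive / all pts with disorder."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Pts w/o disorder who test negative / all pts without disorder."""
    return true_neg / (true_neg + false_pos)

# Hypothetical 2x2 table (index test vs. ref standard).
true_pos, false_neg = 45, 5    # 50 pts with the disorder
true_neg, false_pos = 80, 20   # 100 pts without the disorder

print(sensitivity(true_pos, false_neg))
print(specificity(true_neg, false_pos))
```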

New cards

positive predictive value (PPV)

of all the people w/ a positive test result, the % who truly have the disorder

formula: pts w disorder who test positive/all pts who test positive

New cards

negative predictive value (NPV)

of all the people with a negative test result, the % who truly do not have the disorder

formula: pts w/o disorder who test negative/all pts who test negative
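
PPV and NPV come from the same 2x2 table as sensitivity and specificity, but they are read across the test-result rows. The sketch below uses hypothetical counts and names. The second function is an added illustration (not from the cards) of a known property: with sensitivity and specificity held fixed, PPV falls as prevalence falls.

```python
def predictive_values(true_pos, false_pos, false_neg, true_neg):
    """Return (PPV, NPV) from the four cells of a 2x2 table."""
    ppv = true_pos / (true_pos + false_pos)   # among all positive tests
    npv = true_neg / (true_neg + false_neg)   # among all negative tests
    return ppv, npv

def ppv_from_prevalence(sens, spec, prevalence):
    """PPV implied by sensitivity, specificity, and a given prevalence."""
    true_pos_rate = sens * prevalence
    false_pos_rate = (1 - spec) * (1 - prevalence)
    return true_pos_rate / (true_pos_rate + false_pos_rate)

# Hypothetical 2x2 table: 50 pts with the disorder, 100 without.
ppv, npv = predictive_values(true_pos=45, false_pos=20, false_neg=5, true_neg=80)
print(round(ppv, 3), round(npv, 3))

# Same test (sens 0.90, spec 0.80) applied where only 5% have the disorder.
print(round(ppv_from_prevalence(0.90, 0.80, 0.05), 3))
```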

New cards

positive likelihood ratio (LR+)

likelihood that a positive test result was observed in a person with the disorder vs. a person without disorder of interest

probability of identifying true positive

formula: sensitivity/(1-specificity)

New cards

negative likelihood ratio (LR-)

likelihood that a negative test result is observed in a person with the disorder vs. in a person without the disorder of interest

probability of identifying true negatives

formula: (1-sensitivity)/specificity
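
Both likelihood ratios follow directly from sensitivity and specificity: LR+ = sensitivity/(1-specificity) and LR- = (1-sensitivity)/specificity. A minimal sketch, assuming a hypothetical test with sensitivity 0.90 and specificity 0.80 (function names are illustrative):

```python
def positive_lr(sens, spec):
    """LR+ = sensitivity / (1 - specificity)."""
    return sens / (1 - spec)

def negative_lr(sens, spec):
    """LR- = (1 - sensitivity) / specificity."""
    return (1 - sens) / spec

# Hypothetical accuracy values for an index test.
sens, spec = 0.90, 0.80

print(round(positive_lr(sens, spec), 2))
print(round(negative_lr(sens, spec), 3))
```

Note the parentheses: without them, "1-sensitivity/specificity" would be evaluated as 1 minus a ratio, which is a different quantity.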

New cards

likelihood ratio

combines both sensitivity and specificity into a single measure of diagnostic performance

New cards

ROCs

used to compare diagnostic tests across different thresholds of sensitivity and specificity
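
One way to see what an ROC curve plots: move the cutoff for a "positive" result and recompute sensitivity and the false positive rate (1-specificity) each time. The sketch below assumes made-up test scores and ref standard results, and assumes that scores at or above the threshold count as positive; all names are hypothetical.

```python
def roc_points(scores, has_disorder, thresholds):
    """Return (threshold, sensitivity, 1 - specificity) for each threshold."""
    points = []
    for t in thresholds:
        pairs = list(zip(scores, has_disorder))
        true_pos = sum(s >= t and d for s, d in pairs)
        false_neg = sum(s < t and d for s, d in pairs)
        false_pos = sum(s >= t and not d for s, d in pairs)
        true_neg = sum(s < t and not d for s, d in pairs)
        sens = true_pos / (true_pos + false_neg)
        false_pos_rate = false_pos / (false_pos + true_neg)
        points.append((t, sens, false_pos_rate))
    return points

# Hypothetical index test scores and ref standard results for 8 pts.
scores = [2, 3, 4, 5, 6, 7, 8, 9]
has_disorder = [False, False, False, True, False, True, True, True]

for threshold, sens, fpr in roc_points(scores, has_disorder, [4, 6, 8]):
    print(threshold, sens, fpr)
```

Raising the threshold in this example lowers sensitivity and also lowers the false positive rate, which is the trade-off an ROC curve displays.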

New cards

top purple line

positive upper limit

New cards

red line

zero/no difference

New cards

blue line

mean difference btwn 2 exposures

New cards

bottom purple line

negative lower limit

New cards

distance btwn red and blue lines

bias

New cards

pre-test probability

prevalence (%)

New cards

pre-test odds

what you think the odds are that the pt has the disorder before you conduct the diagnostic test

New cards

post-test odds

what you think the odds are that the pt has the disorder after you conduct the diagnostic test

New cards

post-test probability

probability of the disorder once the test results are obtained

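post-test odds = pre-test odds*LR (LR+ for a positive result, LR- for a negative result)

post-test probability = post-test odds/(1 + post-test odds)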
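
Going from pre-test probability to post-test probability takes three steps: convert probability to odds, multiply the odds by the likelihood ratio, then convert the resulting odds back to a probability. A minimal sketch, assuming a hypothetical pre-test probability of 30% and an LR+ of 4.5 (names are illustrative):

```python
def probability_to_odds(probability):
    return probability / (1 - probability)

def odds_to_probability(odds):
    return odds / (1 + odds)

def post_test_probability(pre_test_probability, likelihood_ratio):
    """Apply a likelihood ratio (LR+ or LR-) to a pre-test probability."""
    pre_test_odds = probability_to_odds(pre_test_probability)
    post_test_odds = pre_test_odds * likelihood_ratio
    return odds_to_probability(post_test_odds)

# Hypothetical values: 30% pre-test probability, positive test with LR+ = 4.5.
result = post_test_probability(0.30, 4.5)
print(round(result, 3))
```

Multiplying pre-test odds by the LR gives post-test odds, not a probability, so the last conversion step is needed before reporting a percentage.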

New cards

LR+ > 10 or LR- <0.10

large effect

New cards

LR+ = 5-10 or LR- = 0.10-0.20

moderate effect

New cards

LR+ = 2-5 or LR- =0.20-0.50

small effect, sometimes important

New cards

LR+ = 1-2 or LR- =0.50-1.0

negligible
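
The four LR cards above can be collected into a small lookup. This is only a sketch: the cards share boundary values (e.g. 5 appears in both the small and moderate ranges), so assigning a boundary value to the larger category is an assumption made here, and the function names are hypothetical.

```python
def interpret_positive_lr(lr):
    """Effect size label for an LR+, using the card thresholds."""
    if lr > 10:
        return "large"
    if lr >= 5:
        return "moderate"
    if lr >= 2:
        return "small, sometimes important"
    return "negligible"

def interpret_negative_lr(lr):
    """Effect size label for an LR-, using the card thresholds."""
    if lr < 0.10:
        return "large"
    if lr <= 0.20:
        return "moderate"
    if lr <= 0.50:
        return "small, sometimes important"
    return "negligible"

print(interpret_positive_lr(4.5))
print(interpret_negative_lr(0.125))
```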

New cards

p value

probability of obtaining a result at least this extreme due to chance alone, assuming there is no true effect

New cards

confidence interval

range of values within which the true value is estimated to lie with a certain (usually 95%) level of confidence
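
As an illustration, a 95% CI can be put around a sensitivity estimate. The sketch below uses the simple normal-approximation (Wald) interval, chosen here only because it is short; it is known to behave poorly with small samples or proportions near 0 or 1, and other methods exist. The counts and function name are hypothetical.

```python
import math

def wald_interval(successes, n, z=1.96):
    """Approximate CI for a proportion; z=1.96 corresponds to 95%."""
    p = successes / n
    standard_error = math.sqrt(p * (1 - p) / n)
    lower = max(0.0, p - z * standard_error)
    upper = min(1.0, p + z * standard_error)
    return lower, upper

# Hypothetical: 45 of 50 pts with the disorder tested positive (sensitivity 0.90).
lower, upper = wald_interval(45, 50)
print(round(lower, 3), round(upper, 3))
```

A narrower interval (e.g. from a larger sample) means a more precise estimate of the test's true sensitivity.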

New cards

should you use this evidence?

is the study high quality (e.g. does the design minimize bias?)?

are the results important enough to use?

is the test or measure of interest available, practical, and safe for application in the clinical setting?

was your pt represented in the study sample?

can you estimate the pre test probability of the disorder and is it worth proceeding w/ the test especially if costly?

pt values or circumstances—risk of injury, important benefits, cost effectiveness, belief in evidence, previous experiences

New cards

verification bias

relates to reference standard

not all participants receive ref standard

New cards

incorporation bias

relates to ref standard

some or all of ref standard is incorporated within index test

New cards

differential verification bias

relates to ref standard

some participants receive different ref standard

New cards

observer bias

relates to examiners giving index test or ref standard

examiners conducting either ref standard or index test know clinical presentation of the participant

New cards

referral filter

relates to participants

possible participants are referred to the study due to suspicion of disease/disorder

New cards

diagnostic/review bias

relates to examiners giving index test or ref standard

examiners conducting either ref standard or index test/measure know result of other test/measures

New cards

spectrum bias

relates to participants and study design when a case control study is used

participants do not have a wide range of disease severity, comorbidities, or demographics that interact with the index or ref test

New cards

disease progression bias

relates to participants

the target condition changes in severity (either worsens or improves) btwn administration of the index and reference tests

New cards

QUADAS tool

appraising a diagnostic study

New cards

assessment of study credibility-diagnostic tests or measures

did the investigators include subjects w/ all levels or stages of condition being evaluated by the index test? (spectrum bias)

was the time btwn index & ref tests short enough to rule out change in condition? (disease progression bias)

were all participants included in final analysis? if data was missing, was proportion of tests missing significant? (selection bias)—sensitivity testing

were the selection criteria for participants clearly described? (selection bias)

were the people interpreting the ref standard and/or the index test blinded to the result of the other test? (review bias)

was the reference standard independent of the index test? (incorporation bias)

was the ref standard given to all participants? (verification bias)

were individuals interpreting each test's/measure's results unaware (masked/blinded) of the other test's results? (measurement bias)

was the time btwn application of the index test and the gold standard comparison test short enough to minimize the opportunity for change in the subjects’ condition? (disease progression bias)