Statistical Analysis of Quantitative Data

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/94

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

95 Terms

New cards

Purposes of statistical analysis in quantitative research

to describe the data (e.g., sample characteristics), to estimate population values, to test hypotheses, to provide evidence regarding measurement properties of quantified variables

New cards

Nominal

lowest level; involves using numbers simply to categorize attributes

New cards

Ordinal

ranks people on an attribute

New cards

Interval

ranks people on an attribute and specifies the distance between them

New cards

Ratio

highest level; ratio scales, unlike interval scales, have a meaningful zero and provide information about the absolute magnitude of the attribute

New cards

Nominal (categorical)

names; lowest level of measurement, data categorized into groups/categories, categories should be mutually exclusive

(ex. Male = 1 , Female = 2, NB = 3)

New cards

Ordinal (ranked)

2nd lowest level of measurement; numeric values on continuum but not equal, ordered and ranked but intervals not equal distance (ex: likert scale, ex: numeric pain scale 0-10-- differences cannot be specified)

New cards

Interval definition

3rd level of measurement; continuum of numeric values where intervals between numbers are equal but lacks a "true zero" (ex. fahrenheit scale 0 does not = no temperature....), datum ranked and attributed distance, between are equal, (equal intervals b/t numbers)

New cards

Ratio definition

highest level; numeric values assigned to data, begin with zero and have = intervals (ex. age, # hours studied for a test, many biophysiologic--pulse, ex. amount of money in your bank account-- 0 balance = NO money)

New cards

Levels of measurement high to low

ratio (absolute zero exists), interval (distance b/t numbers is meaningful), ordinal (numbers can be ranked or ordered), nominal (numbers are only names)

New cards

Descriptive statistics

used to describe and synthesize data; parameters and statistics

New cards

Parameters

descriptor for a population

New cards

Statistics

descriptive index from a sample

New cards

Inferential statistics

used to make inferences about the population based on sample data; assumes random sampling

New cards

Frequency distributions

a systematic arrangement of numeric values on a variable from lowest to highest and a count of the number of times (and/or percentage) each value was obtained

New cards

Frequency distributions can be described in terms of

shape, central tendency, variability

New cards

Frequency distributions presentation

in a table (Ns and percentages) or graphically (e.g., frequency polygons)

New cards

Shapes of distributions

symmetric, skewed (asymmetric)

New cards

Positive skew

long tail points to the right

New cards

Negative skew

long tail points to the left

New cards

Modality (number of peaks)

unimodal, bimodal, multimodal

New cards

Unimodal

1 peak

New cards

Bimodal

2 peaks, normal distribution (a bell-shaped curve)

New cards

Multimodal

2+ peaks

New cards

Central tendency

index of "typicalness" of a set of scores that comes from center of the distribution

New cards

Mode

the most frequently occurring score in a distribution (ex. 2, 3, 3, 3, 4, 5, 6, 7, 8, 9; mode = 3)

New cards

Median

the point in a distribution above which and below which 50% of cases fall (ex. 2, 3, 3, 3, 4 | 5, 6, 7, 8, 9; median = 4.5)

New cards

Mean

equals the sum of all scores divided by the total number of scores (ex. 2, 3, 3, 3, 4, 5, 6, 7, 8, 9; mean = 5.0)

New cards

Purpose of median

useful mainly as descriptor of typical value when distribution is skewed (e.g., household income)

New cards

Purpose of mean

most stable and widely used indicator of central tendency

New cards

Variability

the degree to which scores in a distribution are spread out or dispersed

New cards

Homogeneity

little variability

New cards

Heterogeneity

great variability

New cards

Range

highest value minus lowest value

New cards

Standard deviation (SD)

average deviation of scores in a distribution

New cards

Bivariate descriptive statistics

used for describing the relationship between two variables

New cards

Two common approaches for bivariate

crosstabs (contingency tables) and correlation coefficients

New cards

Correlation coefficients

describes intensity and direction of a relationship

New cards

Range of correlation coefficients

range from −1.00 to +1.00

New cards

Negative relationship

(0.00 to −1.00): one variable increases in value as the other decreases (e.g., amount of exercise and weight)

New cards

Positive relationship

(0.00 to +1.00): both variables increase (e.g., calorie consumption and weight)

New cards

The greater the absolute value of the coefficient...

...the stronger the relationship (ex. r - -0.45 is stronger than r = +0.40)

New cards

Correlation matrix

can be displayed to show all pairs of correlations with multiple variables

New cards

Pearson's r

(the product–moment correlation coefficient): computed with continuous measures

New cards

Spearman's rho

used for correlations between variables measured on an ordinal scale

New cards

Describing risk

clinical decision making for EBP may involve the calculation of risk indexes, so that decisions can be made about relative risks for alternative treatments or exposures

New cards

Frequently used indexes for describing risk

absolute risk, absolute risk reduction (ARR), odds ratio (OR), numbers needed to treat

New cards

Inferential statistics are used to

make objective decisions about population parameters using sample data, provide a means for drawing inferences about a population, given data from a sample, based on laws of probability, sampling error, uses the concept of theoretical distributions (ex. the sampling distribution of the mean error)

New cards

Sampling distribution of the mean

a theoretical distribution of means for an infinite number of samples drawn from the same population

New cards

Characteristics of sampling distribution of the mean

always normally distributed, its mean equals the population mean, its standard deviation is called the standard error of the mean (SEM)-- SEM is estimated from a sample SD and the sample size

New cards

Point estimation

a single descriptive statistic that estimates the population value (e.g., a mean, percentage, or OR)

New cards

Interval estimation

a range of values within which a population value probably lies-- involves computing a confidence interval (CI)

New cards

CIs reflect

how much risk of being wrong researchers take

New cards

Confidence intervals

indicate the upper and lower confidence limits and the probability that the population value is between those limits (ex. a 95% CI of 40 to 50 for a sample mean of 45 indicates there is a 95% probability that the population mean is between 40 and 50)

New cards

Hypothesis testing

helps researchers to make objective decisions about whether results are likely to reflect chance differences or hypothesized effects

New cards

Hypothesis testing involves

statistical decision making to either:

accept the null hypothesis or

reject the null hypothesis

New cards

Probability

likelihood or chance that an event will occur in a given situation; expressed as lower case p with values expressed as per cents (ex. p = 0.34)

New cards

Alpha

level of significance = probability of making a Type I error

New cards

Alpha characteristics

threshold at which statistical significance reached; determined prior to data analysis; α (alpha) = 0.05 unless otherwise stated

New cards

Decision theory

assumes all groups in a study are components of the same population; up to researcher to prove there really is a difference

New cards

Use of decision theory

to test the assumption of no difference (null hypothesis) a cut-off point is selected prior to analysis-- referred to as level of significance or alpha

New cards

Test research hypothesis

done statistically, disprove null hypothesis; reject null

New cards

Significant result

there is a difference between the two groups not as a result of chance

New cards

Non-significant result

any difference or relationship could have been purely chance

New cards

Statistically significant result

if the value of the test statistic indicates that the null hypothesis is improbable

New cards

Nonsignificant result

means that any observed difference or relationship could have happened by chance

New cards

Statistical decisions

are either correct or incorrect

New cards

Type I error

rejection of a null hypothesis when it should not be rejected; a false-positive result

New cards

Risk of error is controlled by

level of significance (alpha), e.g., α =.05 or .01

New cards

Type II error

false negative; failure to reject a null hypothesis when it should be rejected

New cards

Power

the ability of a test to detect true relationships

New cards

By convention, power should be at least

New cards

Larger samples =

greater power

New cards

Type I errors occur only

when findings are statistically significant (ss)

New cards

Type II errors occur only

when findings not ss

New cards

Power analysis

analysis for 4 parameters-- level of significance (alpha), sample size, power (beta), effect size; tells you how many subjects needed to start and finish the study

New cards

Hypothesis testing procedures

select an appropriate statistical test, specify the level of significance (e.g., α = .05), compute a test statistic with actual data, determine degrees of freedom (df) for the test statistic, compare the computed test statistic to a theoretical value

New cards

Bivariate statistical tests

t-tests, analysis of variance (ANOVA), chi-squared test, correlation coefficients, effect size indexes

New cards

t-Test

tests the difference between two means

New cards

t-Test for independent groups

between-subjects test

(for example, means for men vs. women)

New cards

t-Test for dependent (paired) groups

within-subjects test

(for example, means for patients before and after surgery)

New cards

Analysis of variance (ANOVA)

tests the difference between more than two means; sorts out the variability of an outcome variable into two components: variability due to the independent variable and variability due to all other sources; variation between groups is contrasted with variation within groups to yield an F ratio statistic

New cards

Types of ANOVA

one-way ANOVA (e.g., 3 groups), multifactor (e.g., two-way) ANOVA, repeated measures ANOVA (RM-ANOVA): within subjects

New cards

Chi-squared test

tests the difference in proportions in categories within a contingency table; compares observed frequencies in each cell with expected frequencies—the frequencies expected if there was no relationship

New cards

Correlation coefficient test

Pearson's r is both a descriptive and an inferential statistic; tests that the relationship between two variables is not zero

New cards

Effect size indexes

an important concept in power analysis; summarize the magnitude of the effect of the independent variable on the dependent variable; in a comparison of two group means (i.e., in a t-test situation), the effect size index is d

New cards

Effect size indexes by convention

d ≤ .20, small effect

d = .50, moderate effect

d ≥ .80, large effect

New cards

Reliability assessment

test-retest reliability, interrater reliability, internal consistency reliability

New cards

Validity assessment

content validity, construct validity, criterion validity

New cards

Research article info for hypothesis testing

the test used, the value of the calculated statistic, degrees of freedom, level of statistical significance

New cards

A researcher measures the weight of people in a study involving obesity and Type 2 diabetes. What type of measurement is being employed?

ratio

3 multiple choice options

New cards

T/F-- A bell-shaped curve is also called a normal distribution

true

New cards

The researcher subtracts the lowest value of data from the highest value of data to obtain:

range

3 multiple choice options

New cards

T/F-- A correlation coefficient of −.38 is stronger than a correlation coefficient of +.32

true

New cards

Which test would be used to compare the observed frequencies with expected frequencies within a contingency table?

chi-squared test

3 multiple choice options