Describing Variation & Distribution of Data

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/62

flashcard set

Earn XP

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

63 Terms

1
New cards

Variable

A measure of a single characteristic that can vary.

2
New cards

Causes of Variations

Factors such as biologic differences, genes, nutrition, environmental exposures, age, sex, race, presence or absence of disease, and extent of disease that contribute to variations in medical data.

3
New cards

Measurement Error

Errors in measurement techniques that can lead to variations in results.

4
New cards

Quantitative Data

Data represented by numbers and measurements.

5
New cards

Qualitative Data

Data represented by words.

6
New cards

Types of Variables

Nominal, Dichotomous, Ordinal, Continuous, and Ratio variables.

7
New cards

Frequency Distributions

Tables showing the frequency of values in a variable.

8
New cards

Range of a Variable

The difference between the lowest and highest observations of a variable.

9
New cards

Parameters of a Frequency Distribution

Measures of Central Tendency (Mean, Median, Mode) and Measures of Dispersion (Mean Absolute Deviation, Variance, Standard Deviation).

10
New cards

Skewness

Horizontal stretching of a frequency distribution leading to longer tails on one side (left or right).

11
New cards

Kurtosis

Vertical stretching or flattening of a frequency distribution.

12
New cards

Variable

A measure of a single characteristic that can vary

13
New cards

Causes of Variations

Biologic differences
Presence or absence of disease and extent of disease
Different conditions of measurement
Different techniques of measurement
Measurement error

14
New cards

Biologic differences

Genes, Nutrition, Environmental, Exposures, Age, Sex, Race

15
New cards

Different conditions of measurement

Often account for the variations observed in medical data

16
New cards

Measurement error

Can also cause variation

17
New cards

Types of Errors

Systematic Error and Random Error

18
New cards

Systematic Error

Can distort data systematically in one direction.

19
New cards

Random Error

Does not introduce bias

20
New cards

Quantitative Data 

Numbers and measurement

21
New cards

Qualitative Data

Generally use words

22
New cards

Nominal Variables

Naming or categoric variables that are not based on measurement scales or rank order.

23
New cards

Dichotomous (Binary) Variables

Variables with only two levels

24
New cards

Ordinal (Ranked) Variables

Data that can be characterized in terms of three or more qualitative values

25
New cards

Continous (Dimensional) Variables

Observation differs over time

26
New cards

Ratio Variables

If a continous scale has true 0 point

27
New cards

Frequency Distributions of Continuous Variable

Can be shown by creating a table that lists the values of the variable according to the frequency with which the value occurs.

28
New cards

Range of a variable

Range is the distance between the lowest and highest observations of the variable.

29
New cards

Real Frequency Distributions

Obtained from actual data or sample

30
New cards

Theoretical Frequency Distributions

Calculated using assumptions about the population from which the sample was obtained

31
New cards

Normal Distribution

Bell-shaped curve

32
New cards

Normal Distribution

Also called the Gaussian distribution after Johan Karl Gauss

33
New cards

Measures of Central Tendency

Mean
Median
Mode

34
New cards

Mean

Average value

35
New cards

Median

Middlemost or halfway value

36
New cards

Mode

Most frequent value

37
New cards

Mean Absolute Deviation

Does not have mathematical properties (as based form many statistical tests)

38
New cards

Variance

39
New cards

Standard Deviation

Square root of the variance

40
New cards

Standard Deviation

Used to describe the amount of spread in the frequency distribution

41
New cards

Standard Deviation

Average of deviations from the mean

42
New cards

Skewness

A horizontal stretching of a frequency distribution to one side or the other, so that one tail of observations is longer and has more observations than the other tail

43
New cards

Skewed to the left

When histogram or a frequency polygon has a longer tail on the left side of the diagram

44
New cards

Skewed to the left

Negatively skewed distribution

45
New cards

Skewed to the right

When histogram or a frequency polygon has a longer tail on the right side of the diagram

46
New cards

Skewed to the right

Positvely skewed distribution

47
New cards

Kurtosis

Characterized by a vertical stretching or flattening of the frequency distribution

48
New cards

Continous (Dimensional) Variables

Continous scales

49
New cards

Leptokurtic

Distribution with heavy tails.

50
New cards

Platykurtic

Distribution with light tails.

51
New cards

Mesokurtic

Distribution with moderate tails, similar to a normal distribution.

52
New cards

Graphs

It provide a visual way to understand the distribution and variation in the data.

53
New cards

Histogram

A bar graph that shows the frequency of data points within specified ranges (bins).

54
New cards

Box Plot (Box-and-Whisker Plot)

Displays the median, quartiles, and potential outliers. It helps visualize the spread and skewness of the data.

55
New cards

Dot Plot

Shows individual data points and their frequency.

56
New cards

Stem-and-Leaf Plot

Similar to a histogram but retains the original data values.

57
New cards

Density Plot

A smoothed version of the histogram, often used to estimate the probability density function of the data.

58
New cards

Five-Number Summary

Consists of the minimum, Q1, median, Q3, and maximum.

59
New cards

Summary Table

Includes mean, median, mode, range, variance, standard deviation, and other relevant statistics.

60
New cards

Outliers

Data points that significantly differ from the rest of the dataset.

61
New cards

Side-by-Side Box Plots

Useful for comparing the spread and central tendency of multiple groups.

62
New cards

Multiple Histograms

Placing histograms side by side or overlaying them for comparison.

63
New cards

Summary Statistics Comparison

Comparing means, medians, ranges, and standard deviations.