Describing Variation & Distribution of Data

63 Terms

1

Variable

A measure of a single characteristic that can vary.

2

Causes of Variations

Factors such as biologic differences, genes, nutrition, environmental exposures, age, sex, race, presence or absence of disease, and extent of disease that contribute to variations in medical data.

3

Measurement Error

Errors in measurement techniques that can lead to variations in results.

4

Quantitative Data

Data represented by numbers and measurements.

5

Qualitative Data

Data represented by words.

6

Types of Variables

Nominal, Dichotomous, Ordinal, Continuous, and Ratio variables.

7

Frequency Distributions

Tables showing the frequency of values in a variable.

8

Range of a Variable

The difference between the lowest and highest observations of a variable.

9

Parameters of a Frequency Distribution

Measures of Central Tendency (Mean, Median, Mode) and Measures of Dispersion (Mean Absolute Deviation, Variance, Standard Deviation).

10

Skewness

Horizontal stretching of a frequency distribution leading to longer tails on one side (left or right).

11

Kurtosis

Vertical stretching or flattening of a frequency distribution.

12

Variable

A measure of a single characteristic that can vary

13

Causes of Variations

Biologic differences
Presence or absence of disease and extent of disease
Different conditions of measurement
Different techniques of measurement
Measurement error

14

Biologic differences

Genes, nutrition, environmental exposures, age, sex, and race

15

Different conditions of measurement

Often account for the variations observed in medical data

16

Measurement error

Can also cause variation

17

Types of Errors

Systematic Error and Random Error

18

Systematic Error

Can distort data systematically in one direction.

19

Random Error

Makes some observations too high and others too low, so it adds scatter but does not introduce bias
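
The contrast between the two error types can be sketched with a small simulation. This is only an illustration: the true value, the size of the systematic offset, and the spread of the random error are all made-up numbers, not values from the cards.

```python
import random
import statistics

TRUE_VALUE = 120.0        # hypothetical true systolic pressure (mm Hg)
SYSTEMATIC_OFFSET = 5.0   # hypothetical miscalibration: every reading is 5 too high
RANDOM_SD = 2.0           # hypothetical spread of the random error

rng = random.Random(42)   # fixed seed so the run is repeatable
noise = [rng.gauss(0.0, RANDOM_SD) for _ in range(10_000)]

# Random error only: readings scatter on both sides of the true value.
random_only = [TRUE_VALUE + e for e in noise]
# Random plus systematic error: the same scatter, shifted in one direction.
with_systematic = [TRUE_VALUE + SYSTEMATIC_OFFSET + e for e in noise]

mean_random_only = statistics.fmean(random_only)
mean_with_systematic = statistics.fmean(with_systematic)

print(f"random error only:     mean = {mean_random_only:.2f}")
print(f"with systematic error: mean = {mean_with_systematic:.2f}")
```

With many readings the random errors largely cancel and the mean stays close to the true value, while the systematic offset survives averaging unchanged.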

20

Quantitative Data 

Numbers and measurement

21

Qualitative Data

Generally expressed in words

22

Nominal Variables

Naming or categoric variables that are not based on measurement scales or rank order.

23

Dichotomous (Binary) Variables

Variables with only two levels

24

Ordinal (Ranked) Variables

Data that can be characterized in terms of three or more qualitative values that have a clearly implied order (for example, from better to worse)

25

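Continuous (Dimensional) Variables

Variables measured on a continuous scale, where a value can in principle fall anywhere within a range (for example, height, weight, or blood pressure)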

26

Ratio Variables

A continuous variable whose scale has a true zero point, so that ratios between values are meaningful
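
Why the true zero point matters can be shown with temperature. The readings below are hypothetical; the sketch only assumes the standard Celsius-to-Kelvin conversion (add 273.15).

```python
def celsius_to_kelvin(celsius):
    """Convert degrees Celsius to kelvins (Kelvin has a true zero point)."""
    return celsius + 273.15

warm_c, cool_c = 20.0, 10.0  # hypothetical readings in degrees Celsius

# On the Celsius scale the ratio looks like "twice as hot"...
celsius_ratio = warm_c / cool_c
# ...but on the Kelvin scale, which starts at absolute zero, it is only about 1.04.
kelvin_ratio = celsius_to_kelvin(warm_c) / celsius_to_kelvin(cool_c)

print(f"Celsius ratio: {celsius_ratio:.2f}")
print(f"Kelvin ratio:  {kelvin_ratio:.3f}")
```

Only the Kelvin ratio describes a physical relationship, because the zero of the Celsius scale is an arbitrary reference point.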

27

Frequency Distribution of a Continuous Variable

Can be shown by creating a table that lists the values of the variable according to the frequency with which the value occurs.
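
A minimal sketch of such a table, using Python's `collections.Counter`. The heart-rate readings and the function name `frequency_table` are invented for illustration.

```python
from collections import Counter

# Hypothetical heart-rate readings (beats per minute); illustrative only.
heart_rates = [72, 68, 72, 75, 80, 68, 72, 75, 90, 72]

def frequency_table(values):
    """Return (value, count) pairs sorted by value."""
    return sorted(Counter(values).items())

table = frequency_table(heart_rates)

print("value  count")
for value, count in table:
    print(f"{value:>5}  {count:>5}")
```

Each row pairs one observed value with the number of times it occurs, which is all a frequency distribution table needs to record.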

28

Range of a variable

Range is the distance between the lowest and highest observations of the variable.
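
As a quick worked example (with made-up readings), the range is just the highest observation minus the lowest:

```python
def value_range(values):
    """Distance between the lowest and highest observations."""
    return max(values) - min(values)

# Hypothetical heart-rate readings (beats per minute).
readings = [72, 68, 72, 75, 80, 68, 72, 75, 90, 72]

spread = value_range(readings)  # 90 - 68
print(spread)
```

Because it uses only the two most extreme observations, a single unusual reading can change the range a great deal.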

29

Real Frequency Distributions

Obtained from actual data or a sample

30

Theoretical Frequency Distributions

Calculated using assumptions about the population from which the sample was obtained

31

Normal Distribution

Bell-shaped curve

32

Normal Distribution

Also called the Gaussian distribution, after Johann Carl Friedrich Gauss
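
One way to get a feel for the bell shape is to ask how much of the distribution lies near the mean. The sketch below uses `statistics.NormalDist` from the Python standard library; the standard normal parameters (mean 0, standard deviation 1) are chosen only for convenience.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1 (illustrative choice).
dist = NormalDist(mu=0.0, sigma=1.0)

def share_within(k):
    """Proportion of the distribution within k standard deviations of the mean."""
    return dist.cdf(k) - dist.cdf(-k)

within_one = share_within(1)
within_two = share_within(2)

print(f"within 1 SD: {within_one:.4f}")
print(f"within 2 SD: {within_two:.4f}")
```

Roughly 68% of a normal distribution falls within one standard deviation of the mean and roughly 95% within two, which is why the curve is tall in the middle and thin in the tails.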

33

Measures of Central Tendency

Mean
Median
Mode

34

Mean

Average value

35

Median

Middlemost or halfway value

36

Mode

Most frequent value
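
The three measures of central tendency can be computed with the standard `statistics` module. The data set is hypothetical and deliberately has a longer right tail, so the three values differ.

```python
import statistics

# Hypothetical observations; illustrative only.
values = [2, 3, 3, 5, 7, 10]

mean_value = statistics.mean(values)      # sum (30) divided by count (6)
median_value = statistics.median(values)  # midpoint of the two middle values, 3 and 5
mode_value = statistics.mode(values)      # 3 occurs twice, every other value once

print(f"mean = {mean_value}, median = {median_value}, mode = {mode_value}")
```

Here the mean (5) is pulled above the median (4) by the single large observation, while the mode (3) sits where the data are most concentrated.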

37

Mean Absolute Deviation

The average of the absolute deviations of the observations from the mean; it does not have the mathematical properties on which many statistical tests are based
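
A short sketch of the calculation, assuming the usual definition (average of the absolute distances from the mean). The function name and data are invented for illustration.

```python
import statistics

def mean_absolute_deviation(values):
    """Average absolute distance of each observation from the mean."""
    center = statistics.fmean(values)
    return sum(abs(x - center) for x in values) / len(values)

# Hypothetical observations with mean 5.
values = [2, 3, 3, 5, 7, 10]

# Absolute deviations are 3, 2, 2, 0, 2, 5, which sum to 14.
mad = mean_absolute_deviation(values)
print(f"mean absolute deviation = {mad:.3f}")
```

Taking absolute values keeps positive and negative deviations from cancelling each other out.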

38

Variance

The average of the squared deviations from the mean

39

Standard Deviation

Square root of the variance

40

Standard Deviation

Used to describe the amount of spread in the frequency distribution

41

Standard Deviation

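Roughly the typical deviation from the mean: the square root of the average squared deviation (the simple average of the signed deviations is always zero)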
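
The relationship between deviations, variance, and standard deviation can be worked through on a small hypothetical data set. The sketch uses the standard `statistics` module, where `pvariance` and `pstdev` divide by N and `variance` divides by N - 1.

```python
import math
import statistics

# Hypothetical observations with mean 5.
values = [2, 3, 3, 5, 7, 10]
center = statistics.fmean(values)

# Signed deviations cancel out, which is why they are squared first.
sum_of_deviations = sum(x - center for x in values)

# Squared deviations are 9, 4, 4, 0, 4, 25, which sum to 46.
population_variance = statistics.pvariance(values)  # 46 / 6
sample_variance = statistics.variance(values)       # 46 / 5
population_sd = statistics.pstdev(values)           # square root of 46 / 6

print(f"sum of deviations   = {sum_of_deviations}")
print(f"population variance = {population_variance:.3f}")
print(f"sample variance     = {sample_variance:.3f}")
print(f"population SD       = {population_sd:.3f}")
```

The standard deviation is reported more often than the variance because taking the square root returns the result to the original units of measurement.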

42

Skewness

A horizontal stretching of a frequency distribution to one side or the other, so that one tail of observations is longer and has more observations than the other tail

43

Skewed to the left

When a histogram or frequency polygon has a longer tail on the left side of the diagram

44

Skewed to the left

Negatively skewed distribution

45

Skewed to the right

When a histogram or frequency polygon has a longer tail on the right side of the diagram

46

Skewed to the right

Positively skewed distribution
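
The sign convention (negative for a left tail, positive for a right tail) can be checked numerically. The sketch below assumes one common definition, the moment coefficient of skewness (third central moment divided by the 1.5 power of the second); the cards do not say which formula is intended, and the data are hypothetical.

```python
import statistics

def skewness(values):
    """Moment coefficient of skewness: m3 / m2**1.5 (population moments)."""
    center = statistics.fmean(values)
    n = len(values)
    m2 = sum((x - center) ** 2 for x in values) / n
    m3 = sum((x - center) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

right_tailed = [2, 3, 3, 5, 7, 10]         # long tail toward high values
left_tailed = [-x for x in right_tailed]   # mirror image: long tail toward low values

print(f"right-tailed: {skewness(right_tailed):+.3f}")
print(f"left-tailed:  {skewness(left_tailed):+.3f}")
```

Mirroring the data flips the sign but not the size of the coefficient, which matches the left/negative and right/positive naming on the cards.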

47

Kurtosis

Characterized by a vertical stretching or flattening of the frequency distribution

48

Continuous (Dimensional) Variables

Measured on continuous scales

49

Leptokurtic

Distribution with heavy tails.

50

Platykurtic

Distribution with light tails.

51

Mesokurtic

Distribution with moderate tails, similar to a normal distribution.
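
The three labels can be tied to a number. The sketch below assumes the excess-kurtosis convention (fourth central moment over the squared second moment, minus 3), under which a normal distribution scores 0, heavier tails score above 0, and lighter tails score below 0. The convention and both data sets are assumptions made for illustration.

```python
import statistics

def excess_kurtosis(values):
    """Population excess kurtosis: m4 / m2**2 - 3 (0 for a normal distribution)."""
    center = statistics.fmean(values)
    n = len(values)
    m2 = sum((x - center) ** 2 for x in values) / n
    m4 = sum((x - center) ** 4 for x in values) / n
    return m4 / m2 ** 2 - 3

light_tails = [1, 2, 3, 4, 5]                     # evenly spread, no extreme values
heavy_tails = [-10, 0, 0, 0, 0, 0, 0, 0, 0, 10]   # mostly central, two extreme values

print(f"light tails: {excess_kurtosis(light_tails):+.2f}")  # negative, platykurtic
print(f"heavy tails: {excess_kurtosis(heavy_tails):+.2f}")  # positive, leptokurtic
```

Samples from a roughly normal population would give a value near 0, the mesokurtic case.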

52

Graphs

Provide a visual way to understand the distribution and variation in the data.

53

Histogram

A bar graph that shows the frequency of data points within specified ranges (bins).
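
The counting step behind a histogram can be sketched without a plotting library. The ages, the bin width of 10, and the function name `histogram_counts` are all hypothetical choices.

```python
def histogram_counts(values, bin_start, bin_width, n_bins):
    """Count how many values fall into each equal-width bin [low, low + width)."""
    counts = [0] * n_bins
    for value in values:
        index = int((value - bin_start) // bin_width)
        if 0 <= index < n_bins:  # values outside the bins are ignored
            counts[index] += 1
    return counts

# Hypothetical ages in years.
ages = [21, 25, 29, 30, 34, 35, 38, 41, 47, 52]

# Bins: 20-29, 30-39, 40-49, 50-59.
counts = histogram_counts(ages, bin_start=20, bin_width=10, n_bins=4)

for i, count in enumerate(counts):
    low = 20 + i * 10
    print(f"{low}-{low + 9}: {'#' * count}")
```

The printed rows are a text version of the bars. A different bin width would give a different picture of the same data.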

54

Box Plot (Box-and-Whisker Plot)

Displays the median, quartiles, and potential outliers. It helps visualize the spread and skewness of the data.

55

Dot Plot

Shows individual data points and their frequency.

56

Stem-and-Leaf Plot

Similar to a histogram but retains the original data values.
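
A minimal sketch of how the plot keeps the original values. It assumes non-negative whole numbers, with the tens digit as the stem and the units digit as the leaf; the ages are hypothetical.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group non-negative integers by tens digit (stem) and units digit (leaf)."""
    stems = defaultdict(list)
    for value in sorted(values):
        stems[value // 10].append(value % 10)
    return dict(stems)

# Hypothetical ages in years.
ages = [21, 25, 29, 30, 34, 35, 38, 41, 47, 52]

plot = stem_and_leaf(ages)
for stem, leaves in plot.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```

Each row has the outline of a histogram bar, yet every observation can still be read back (stem 3 with leaf 4 is 34).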

57

Density Plot

A smoothed version of the histogram, often used to estimate the probability density function of the data.

58

Five-Number Summary

Consists of the minimum, Q1, median, Q3, and maximum.
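
The five values can be computed with `statistics.quantiles` from the standard library (Python 3.8 or later). Quartile conventions differ between textbooks and software; the `"inclusive"` method used here is one choice among several, and the data are hypothetical.

```python
import statistics

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum)."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return (min(values), q1, median, q3, max(values))

# Hypothetical observations.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

summary = five_number_summary(data)
print(summary)
```

These are the same five values that a box plot draws: the box spans Q1 to Q3, with a line at the median.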

59

Summary Table

Includes mean, median, mode, range, variance, standard deviation, and other relevant statistics.

60

Outliers

Data points that significantly differ from the rest of the dataset.
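
The card does not say how large a difference counts as significant. One widely used convention, assumed here purely for illustration, flags values more than 1.5 times the interquartile range (IQR) below Q1 or above Q3. The data and the function name are hypothetical.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - k * iqr
    high_fence = q3 + k * iqr
    return [v for v in values if v < low_fence or v > high_fence]

# Hypothetical observations with one unusually large value.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]

outliers = iqr_outliers(data)
print(outliers)
```

A flagged value is only a candidate for closer inspection. Whether it reflects measurement error or genuine variation has to be judged from the context.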

61

Side-by-Side Box Plots

Useful for comparing the spread and central tendency of multiple groups.

62

Multiple Histograms

Placing histograms side by side or overlaying them for comparison.

63

Summary Statistics Comparison

Comparing means, medians, ranges, and standard deviations.
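
A small sketch of such a comparison for two hypothetical groups. The helper `summarize` and both data sets are invented; the sample standard deviation (`statistics.stdev`) is used as one reasonable choice.

```python
import statistics

def summarize(values):
    """Collect a few summary statistics for one group."""
    return {
        "mean": statistics.fmean(values),
        "median": statistics.median(values),
        "range": max(values) - min(values),
        "sd": statistics.stdev(values),  # sample standard deviation (divides by n - 1)
    }

# Two hypothetical groups with the same center but different spread.
group_a = [4, 5, 5, 6, 5]
group_b = [1, 3, 5, 7, 9]

summary_a = summarize(group_a)
summary_b = summarize(group_b)

for name in ("mean", "median", "range", "sd"):
    print(f"{name:>6}: A = {summary_a[name]:.2f}   B = {summary_b[name]:.2f}")
```

Both groups share a mean and median of 5, so comparing only central tendency would miss that group B is far more spread out.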

