biostatistics

0.0(0)

Studied by 0 people

0.0(0)

Call with Kai

Knowt Play

New

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/47

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

48 Terms

New cards

Statistics

the science of collecting, analyzing, and interpreting numerical data to understand patterns, test hypotheses, and make decisions about populations based on sample data.

New cards

Data

individual pieces of information, often numerical, collected through observation, measurement, or experimentation and used for statistical analysis.

New cards

Population

the entire group of individuals, events, or objects that a researcher wants to study or draw conclusions about.

New cards

Sample

a subset of the population selected for analysis, used to estimate characteristics of the whole population.

New cards

Parameter

a fixed, often unknown value that describes a characteristic of a population, such as the true mean or proportion.

New cards

Statistic

a numerical value calculated from sample data that estimates a population parameter.

New cards

Sampling

the process of selecting a subset of individuals from a population to represent the whole in statistical analysis.

New cards

Sampling Bias

a systematic error that occurs when the sample is not representative of the population, leading to distorted results.

New cards

Sampling Variability

the natural variation in sample statistics that occurs when different samples are drawn from the same population.

New cards

Random Sampling

a method where every member of the population has an equal chance of being selected, minimizing bias.

New cards

Simple Random Sample

a type of random sampling where each individual and each possible sample has an equal probability of selection.

New cards

Probability

a numerical measure between 0 and 1 that expresses the likelihood of an event occurring.

New cards

Properties of Probability

probabilities range from 0 to 1, with 0 meaning impossible, 1 meaning certain, and the sum of all possible outcomes equaling 1.

New cards

Random Process

a process whose outcome is unpredictable in the short term but follows predictable patterns over time, governed by probability.

New cards

Variable

a characteristic or attribute that can take on different values among individuals in a population.

New cards

Categorical Variable

a variable whose values are expressed in distinct categories, such as sex or religion.

New cards

Nominal Variable

a categorical variable with no inherent order among categories, like blood type or ethnicity.

New cards

Ordinal Variable

a categorical variable with a meaningful order among categories, such as severity of illness or education level.

New cards

Binary Variable

a categorical variable with only two possible values, such as yes/no or male/female.

New cards

Quantitative Variable

a variable measured on a numerical scale, including discrete and continuous types.

New cards

Discrete Variable

a quantitative variable that takes on countable, isolated values, such as number of children.

New cards

Continuous Variable

a quantitative variable that can take on any value within a range, such as height or blood pressure.

New cards

Describing Categorical Variables

typically summarized using frequencies and percentages, and visualized with bar plots or pie charts.

New cards

Describing Quantitative Variables

summarized using measures of central tendency and variability, and visualized with histograms or box plots.

New cards

Frequency Distribution

a table or graph showing how often each value or category occurs in a dataset.

New cards

Relative Frequency

the proportion of observations in each category relative to the total number of observations.

New cards

Cumulative Frequency

the running total of frequencies up to a certain category or interval.

New cards

Bar Plot

a graph using rectangular bars to represent the frequency or percentage of categorical data.

New cards

Pie Chart

a circular graph divided into slices to show proportions of categorical data.

New cards

Histogram

a graph displaying the distribution of a continuous variable using adjacent bars for intervals.

New cards

Skewness

a measure of asymmetry in a distribution, with positive skew indicating a tail on the right and negative skew indicating a tail on the left.

New cards

Kurtosis

a measure of the "tailedness" or peakedness of a distribution, indicating how heavy or light the tails are.

New cards

Measures of Central Tendency

statistics that describe the center or typical value of a dataset, including mean, median, and mode.

New cards

Mean

the arithmetic average of a dataset, calculated by summing all values and dividing by the number of observations.

New cards

Median

the middle value in an ordered dataset, dividing the data into two equal halves.

New cards

Mode

the value that occurs most frequently in a dataset, useful for identifying common values or peaks in distributions.

New cards

Range

the difference between the maximum and minimum values in a dataset, providing a crude measure of variability.

New cards

Centiles

values that divide a dataset into 100 equal parts; the 25th centile is the first quartile (Q1), the 50th is the median (Q2), and the 75th is the third quartile (Q3).

New cards

Interquartile Range (IQR)

the range between the 25th and 75th centiles, measuring the spread of the middle 50% of data and reducing the influence of outliers.

New cards

Box Plot

a graphical summary of a dataset showing the median, quartiles, IQR, and potential outliers; useful for comparing distributions.

New cards

Outlier

a data point that lies far outside the range of most other values, possibly indicating variability, error, or special cases.

New cards

Measures of Variability

statistics that describe the spread or dispersion of data, including range, variance, standard deviation, and coefficient of variation.

New cards

Variance

the average of squared deviations from the mean, reflecting how much values differ from the average.

New cards

Standard Deviation (SD)

the square root of the variance, indicating the average distance of values from the mean and widely used to assess variability.

New cards

Coefficient of Variation

a standardized measure of dispersion, calculated as SD divided by mean, useful for comparing variability across datasets with different units or scales.

New cards

Normal Distribution

a symmetric, bell-shaped distribution defined by its mean and SD, commonly found in biological and health-related variables.

New cards

Properties of the Normal Distribution

symmetrical about the mean, mean equals median and mode, defined entirely by mean and SD, with 68% of data within ±1 SD, 95% within ±2 SD, and 99.7% within ±3 SD.

New cards

Fitting a Normal Distribution

once sample mean and SD are known, a normal curve can be fitted to the data to model its behavior and make predictions.