1/56
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
What is a population?
A collection of things having some quantifiable characteristic in common
Objects, animals, people
A theoretical concept, often can't be counted
Dynamic, constant movement in and out
The group we want to study
How do we define populations?
Inclusion criteria
Exclusion criteria
Confounders
What is inclusion criteria?
The common characteristics of the population
What is exclusion criteria?
The common characteristics that the members of the population lack
What are confounders?
Characteristics of a population that could potentially affect study results
Things to watch for
Related to risk factor and outcome
What are inferential statistics?
Based on studies of populations
Results should apply to all individuals belonging to that population
Take care when extrapolating results from studies on a specific population to those who don't meet the inclusion and exclusion criteria for that population
What is internal validity?
The study was done right
What is external validity?
The study means something
What is a sample?
The group we actually study
A group of individuals that represents the population
What 2 concepts are essential to sample selection?
Random sampling
Law of independence
What is random sampling?
Every member of the population has an equal chance of being selected
What is the law of independence?
Selection of one member does not influence the change of choosing any other
An independent, random sample is chosen in such a way that...
Every possible combination of size N has an equal chance of being selected
What is a parameter?
A numerical (true) value that summarizes data for the population
What value we want to know
Stable, even though the population itself is dynamic
Not constant
What is a statistic?
The estimate of the parameter in a sample
The value for our study
What is a confidence interval?
Statistic +/- margin of error
Based on our study, what we think the range is for the parameter
With a 95% CI... you are ___ that 95% of __ will produce an __ around the __ that contains the __ for that population
Confident
Samples
Interval
Statistic
Parameter
A larger sample size gets...
More of the population you are sampling
More accurate statistic to the parameter
Narrower confidence interval
Each piece of data we collect is called a...
Variable
Each variable is measured in each member of the sample
Ex: weight, breed, BCS
How do we describe variables?
The type of variable (categorical, quantitative, qualitative)
The scale of measurement (binary, nominal, ordinal, interval, ratio)
The distribution of the values (normal: guassian or parametric, or nonparametric)
All 3 descriptions of variables combine to tell us...
What statistics we can use
What are qualitative variables?
Freeform, no structure
What are quantitative variables?
Numbers that reflect a value
What are the 2 types of quantitative variables?
Discrete: limited choices (size, sex, breed)
Continuous: extensive range of possible values (weight)
What are categorical variables?
No numerical value even if the categories have been assigned a number
What are the types of categorical variables?
Binary or dichotomous: 2 choices (Y/N, T/F, +/-) (intact)
Multinomial: more than 2 choices (breed)
Ordinal: more than 2 choices and ordered (BCS)
What are scales found in quantitative data?
Interval: continuous variables, difference between values is consistent (age in years, wt in kg)
Ratio: compare variables using different scales, relative distance from 1, uncommon in stats (test positivity)
How do we describe continuous variables?
Distribution
What is a distribution?
The pattern you see when you graph the frequency of the variable's different values
What is a histogram?
A bar graph showing the frequency of each value
What are things to look for in distribution?
Skew and outliers
What is skew?
Is the distribution symmetric
If not symmetric, it is skewed, there are more values in one direction than the other
Right skew means a tail to the right (positive skewness) and left skew means a tail to the left (negative skewness)
What is an outlier
A point that is really different from everything else
Error is the most common reason
What are the 2 ways we can describe distributions?
Central tendency
Variability
What are the measures of central tendency?
Mode: most common value
Median: 50th percentile, half higher and half lower
Mean: the average
What are the measures of variability?
Standard deviation: average distance of values from the mean
Variance: standard deviation squared
Range
Percentiles
What measure of central tendency should be used when a distribution is all over the place?
Mode
What measure of central tendency should be used when a distribution is skewed, but not all over the place?
Median
If we have a normal distribution how will our measures of central tendency look?
They will be the same
What are normal/gaussian/parametric distributions?
Symmetrically distributed
Describe with mean and SD
Mean = median = mode
95% of observations are within ~2 SD of the mean
What are nonparametric distributions?
Non-symmetrical
Describe with median and percentiles
How do we describe distributions with categorical data?
Describe with counts and percentages
How do we describe qualitative distributions?
Thematic analysis (repetitions)
Quotations (examples)
What are descriptive statistics?
Who is in the study and what are their characteristics
Describe each variable by type, scale, and distribution
Using tables and graphs
Why are descriptive statistics beneficial?
Have an idea of who the population is
Helps you compare your study sample to your target population
What are we evaluating in our inferential statistics?
Make comparisons with the data set (are values different between groups, is an event more common in one group)
Draw conclusions based on the results, hypothesis testing
Ask the question, is X associated with Y?
What is variable X?
Explanatory variable
AKA independent variable, predictor variable, risk factor, exposure of interest
Is correlated with the response variable, but not necessarily causing an effect on it
What is variable Y?
Response variable
AKA outcome variable, dependent variable, event of interest
What is linear regression?
Continuous predictor, continuous outcome
y = mx + b
y = outcome
m = slope
b = intercept where x is 0
m = 0 means...
There is no relationship between x and y
For every 1 unit change in x...
There will be an m unit change in y
What is multivariable regression?
Creates an equation to explain the relationship between the outcome, predictors, and other things like confounders
Allows us to consider multiple risk factors simultaneously
Often requires a larger data set
What is logistic regression?
Similar to linear, outcome is now dichotomous
Regression coefficients can be used to produce odds ratio
Slope variables are not really interpretable by themselves
Almost all statistics can be done using...
Generalized linear models
What do we need to know for generalized linear models?
The question we are asking
The distribution of the outcome variable
Any confounders we want to control for
Any population structure/clustering to account for
Probability is...
Between 0 (never going to happen) and 1 (guaranteed to happen)
P-value is...
Probability we use to decide if something we observed is real or just happened by chance