STSCI 2150 Prelim 1

0.0(0)
studied byStudied by 6 people
0.0(0)
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/58

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 9:01 PM on 9/30/24
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

59 Terms

1
New cards

Statistics

Measurement, variation/variability, & comparison

2
New cards

Goals of statistics

Estimate the values of important parameters, measure variability, compare groups, test hypotheses

3
New cards

Sample of convenience

Collection of individuals that happen to be available at the time; bad because study sample and population of interest should be as similar as possible

4
New cards

Variable

Characteristic measured on individuals drawn from a population under study

5
New cards

Data

Measurements of 1+ variables made on a collection of individuals

6
New cards

Data formatting

Rows contain observations, columns correspond to variables

7
New cards

Response & explanatory variable

Response variable is predicted or explained from the explanatory variable
Response~outcome; explanatory~predictor

8
New cards

Populations

Parameters (Greek letters); the "true" value
Include all existing organisms of interest

9
New cards

Samples

Estimates (Roman letters); an approximation or guess about the "truth" based on a sample group chosen methodically & randomly

10
New cards

Parameters vs. estimates

Population parameters are constants; estimates are random variables that change from sample to the next

11
New cards

Bias

Systematic discrepancy between estimates and the true population characteristic; if biased, not accurate

12
New cards

What makes an estimate accurate (unbiased)?

If the average of estimates obtained is centered on the true population value

13
New cards

Volunteer bias

Volunteers for a study are likely to be different on average from the population

14
New cards

How can we tell when an estimate is biased?

If experiment is repeated and estimate is consistently too high or too low

15
New cards

Precision

Measure of how far apart repeated estimates might be; a mathematical concept that can be derived

16
New cards

How do we know if estimates are precise?

Precise estimates are relatively close to each other

17
New cards

Effect of larger sample on precision

Larger samples yield more precise estimates

18
New cards

Properties of a good sample

Random selection of individuals (representative of the population of interest), independent selection of individuals, sufficiently large (large samples yield more precise estimates)

19
New cards

Random sample

Each member of population has an equal and independent chance of being selected

20
New cards

Experimental vs. observational studies

Experimental: researcher randomly assigns individuals to treatment groups; more powerful and can help determine cause-and-effect relationships
Observational: assignment of treatments is not made by researcher; can only assess associations between variables

21
New cards

Categorical variables

Dichotomous (binary), ordinal (ordered categories; ex. cancer stage, education level, recovery from an operation), nominal (categories have no natural ordering; ex. drug treatment, region, species type)

22
New cards

Numerical (quantitative) variables

Continuous (can be measured; ex. age, weight, miles traveled) or discrete (can be counted; ex. number of offspring, number of days)

23
New cards

Why does the type of variable matter?

Determines the way it's summarized (average vs. percentages), the type of statistical analysis, the graphical display format

24
New cards

Graphing categorical variables

Bar graph

25
New cards

Graphing numerical variables

Histogram, box plot, dot plot (usually for smaller amounts of data)

26
New cards

Features of a box plot

Center line in box: median
Top edge of box: 3rd quartile/75th percentile
Bottom edge of box: 1st quartile/25th percentile
Whiskers: extend to smallest and largest non-extreme observations
Extreme observations: observations beyond 1.5 IQR from box edges

27
New cards

Graphing two numerical variables

Scatter plot, line graph, map

28
New cards

Graphing two categorical variables

Mosaic plot (always preferred!), grouped bar graph

29
New cards

Graphing a categorical and numerical variable

Multiple (stacked) histograms, side-by-side box plots

30
New cards

Bad graphs

Pie charts & 3D graphs (hard to make accurate interpretations & perspective skews visual perception)

31
New cards

Common graphing errors

Truncation of y-axis, cumulative graphs, ignoring conventions, mislabeled or missing axes

32
New cards

Guidelines for good graphics

Show the data
Represent magnitudes accurately
Draw graphical elements clearly, minimizing clutter (maximize data-to-ink ratio)
Make displays easy to interpret
Clearly identify axes
All figures should have captions
Be sure that figures with color are effective in b&w

33
New cards

How is location (central tendency) measured?

Mean, median, mode

34
New cards

Sample mean

Average of observations, center of gravity; find sum of observations and divide by count

35
New cards

Median

Middle measurement in a set of ordered data; if even number of observations average the two middle values

36
New cards

Mode

Most frequent measurement

37
New cards

How is variability (width or spread) measured?

Range, standard deviation, variance, interquartile range

38
New cards

Range

Maximum value - minimum value
Poor measure of distribution width/biased estimator of true population range; small samples tend to give lower estimates of range than large samples

39
New cards

Sample variance

s² = (x1 - x̄ )² + (x2 - x̄ )² + ... + (xn - x̄ )² / (n-1)
(Almost) average squared difference from the mean; in original units squared

40
New cards

Standard deviation

s = sqrt(s²)
Sigma is the true standard deviation, s is the sample standard deviation
Related to the average distance between the mean and each observation; measures variability/spread of a distribution

41
New cards

Interpreting standard deviation in a normal distribution (bell curve)

2/3 (66.6%) of data falls within 1 standard deviation of the mean
95% of data falls within 2 standard deviations of the mean

42
New cards

Skew

A measurement of asymmetry; direction of skew refers to pointy tail of distribution

43
New cards

Mean vs. median in skewed data

Right-skewed data: sample mean > sample median
Left-skewed data: sample mean < sample median

44
New cards

Estimating standard deviation from a histogram

Eyeball 2.5% of sample size from top (U) and bottom (L)
s ~= (U - L) / 4

45
New cards

Does repeating an experiment result in the same results?

No, because of random variation (especially with small samples)

46
New cards

What is the key concept behind a sampling distribution, confidence intervals, and standard errors?

Variability and uncertainty of samples (and sample means)

47
New cards

Effect of increasing sample size on spread and variation

Larger sample size reduces the spread/variation of the sampling distribution of an estimate

48
New cards

Sampling distribution

Probability distribution of all values for an estimate we might have obtained when the population was sampled; illustrates how much the sample mean could vary and what values are typical
Only known in theory or via simulations

49
New cards

Standard error

Quantifies the innate variability of an estimator (uncertainty); the standard deviation of the estimator's sampling distribution

50
New cards

Standard error of the mean

SE = s/sqrt(n)
Sigma notation rarely used because the true standard deviation is rarely known, in most cases we only have a sample

51
New cards

Is the sample mean a good estimator of the population mean?

Yes

52
New cards

Do larger samples fix bias?

No

53
New cards

Confidence interval

Quantifies uncertainty about a parameter; an interval estimate of plausible values (not point) for the true population mean

54
New cards

How is a 95% confidence interval worded?

We are 95% confident that the true population value lies within the interval...
If we drew repeated samples, we expect that about 95% of the confidence intervals would contain the true population value
(NOT probability)

55
New cards

2 standard error rule of thumb

Assuming a normally distributed population and/or sufficiently large sample size, the interval of 2 standard errors above and below the mean roughly estimate the 95% confidence interval for the mean

56
New cards

Confidence intervals of different samples...

Different samples yield different intervals
Intervals vary in width
~95% of the time contains true parameter value, ~5% of time doesn't (but in reality we never know)

57
New cards

Effect of increasing sample size on confidence intervals

Larger sample size yields narrower confidence intervals

58
New cards

99% confidence intervals vs. 95% confidence intervals

99% intervals are wider than 95% intervals

59
New cards

Distribution of the data

Set of all values in sample

Explore top flashcards

OMM II Terms (4)
Updated 1012d ago
flashcards Flashcards (40)
peds exam 1
Updated 739d ago
flashcards Flashcards (95)
Cognition 2
Updated 1044d ago
flashcards Flashcards (60)
Final practice
Updated 1157d ago
flashcards Flashcards (106)
EP Test 1
Updated 1158d ago
flashcards Flashcards (63)
E1T2: La familia
Updated 87d ago
flashcards Flashcards (74)
OMM II Terms (4)
Updated 1012d ago
flashcards Flashcards (40)
peds exam 1
Updated 739d ago
flashcards Flashcards (95)
Cognition 2
Updated 1044d ago
flashcards Flashcards (60)
Final practice
Updated 1157d ago
flashcards Flashcards (106)
EP Test 1
Updated 1158d ago
flashcards Flashcards (63)
E1T2: La familia
Updated 87d ago
flashcards Flashcards (74)