BIOL2300 ch. 1-5

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/60

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:11 AM on 2/7/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

61 Terms

1
New cards

statistics

the study of methods for measuring aspects of populations from samples and for quantifying the uncertainity of the measurements

2
New cards

population

all the individual units of interest

3
New cards

sample

subset of units taken from the population

4
New cards

parameter

true value that describes an entire population; absolute ā€œtruthā€

5
New cards

estimate

an approximation of the parameter

always subject to error because it doesn’t include every individual in the population

6
New cards

random sample

each member of a population has an equal and independent chance of being selected

minimizes bias; makes it possible to calculate amount of sampling error

7
New cards

sample of convenience

a collection of individuals that are easily available to the researcher

8
New cards

sampling error

the natural, unavoidable difference (due to chance) between an estimate and the population parameter being estimatedĀ 

reducing sample size decreases precision

9
New cards

bias

a systematic discrepancy between the estimates

10
New cards

list 4 types of bias

  1. volunteer bias

  2. survivorship bias

  3. recall bias

  4. observer bias

11
New cards

accuracy

how close an estimate is to the true value (parameter)

12
New cards

precision

how consistent/reproducible measurements are from one sample/measurement to anotherĀ 

sampling variability affects precision

13
New cards

categorical data

qualitative characteristics of individuals (non-numerical)

14
New cards

what are the 2 types of categorical data?

  1. nominal

  2. ordinal

15
New cards

nominal

no inherent order (categorical)

16
New cards

ordinal

inherent order (categorical)

17
New cards

numerical data

quantitative measurements (numerical values)

18
New cards

what are the 2 types of numerical data?

  1. discrete

  2. continuous

19
New cards

discrete, provide examples

indivisible units (numerical)

i.e. # of teeth; # of siblings; # of vehicles

20
New cards

continuous, provide examples

Ā any real number within a range (numerical)

i.e. height, weight, temperature

21
New cards

absolute frequency distribution (n)

the count of observations in each category/interval

22
New cards

example of absolute frequency distribution

Ex. absolute frequency of age surveyed across students in a class;
Age 19: 37 students
Age 22: 5 students

23
New cards

relative frequency distribution (f) or (p)

the proportion/percentage of observations in each category/interval

24
New cards

example of relative frequency distribution

Ex. relative frequency of age surveyed across students in a class

Age 19: 40.7% of students

Age 22: 5.5% of students

25
New cards

experimental study

researcher assigns treatments randomly

26
New cards

observational study

assignments not made by researcher

27
New cards

confounding variable

masks/distorts the causal relationship between measured variables in a study

28
New cards

example of confounding variable

E.x. Coffee consumption and the incidence of lung cancer (does not actually cause lung cancer, smoking does)

29
New cards

how should categorical data be displayed?

  • frequency table

  • bar graph

30
New cards

how should numerical data be displayed?

  • frequency table

  • box plot

  • histogram

31
New cards

positive skew (right-skewed)

  • mean > median

  • long tail on right side

<ul><li><p>mean &gt; median</p></li><li><p>long tail on right side</p></li></ul><p></p>
32
New cards

negative skew (left-skewed)

  • mean < median

  • long tail on left side

33
New cards

how should relative frequency be displayed?

  • bar graph

  • pie chart

34
New cards

what should be used to visualize the association between categorical variables and compare their frequencies?

contingency table

35
New cards

how should the association between categorical variables be displayed?

  • grouped bar

  • mosaic plot

36
New cards

how should the association between numerical variables be displayed? label the axes

scatter plot

  • x-axis: explanatory variable

  • y-axis: response variable

37
New cards

how should the association between categorical and numerical variables be displayed?

  • strip chart

  • violin plot

38
New cards

normal distribution (histogram)

symmetric bell-shaped distribution

  • mean/median/mode are very similar

  • does not show percentiles

39
New cards

mean

the average value; typical value

40
New cards

median

middle value when data is ordered least to greatest

41
New cards

standard deviation (SD)

how spread out the data is around the mean; spread of individual observations

  • Never negative

  • The sum is zero

  • SD = Square root of variance

<p>how spread out the data is around the mean; spread of individual observations</p><ul><li><p><span style="background-color: transparent;"><span>Never negative</span></span></p></li><li><p><span style="background-color: transparent;"><span>The sum is zero</span></span></p></li><li><p><span style="background-color: transparent;"><strong><span>SD = Square root of variance</span></strong></span></p></li></ul><p></p>
42
New cards

mode

value that appears most frequently in a data set

43
New cards

z-score

indicates how many standard deviation a value is above or below the mean

<p>indicates how many standard deviation a value is above or below the mean</p>
44
New cards

paired/dependent observations

data collected multiple times from the same individuals

45
New cards

positive association

two variables move in the same direction (both increase or both decrease)

46
New cards

negative association

two variables move in opposite directions

47
New cards

variance

a numerical measure of how spread out a set of data points is around their average (mean)

<p><span style="background-color: transparent;"><span>a numerical measure of how spread out a set of data points is around their average (mean)</span></span></p>
48
New cards

coefficient of variation (CV)

the standard deviation (SD) expressed as a percentage (%) of the mean

SD/mean * 100

49
New cards

how to find median in a data set

knowt flashcard image
50
New cards

quartiles

values that partition the data into quarters

51
New cards

interquartile range (IQR)

difference between the 3rd (Q3) and 1st (Q1) quartile of the data

  • spans 50% of the data

52
New cards

how to find IQR in a data set

knowt flashcard image
53
New cards

features of box and whiskers plot

  • shows max/min, median, skewness, spread of data, quartiles, and interquartile range

54
New cards

cumulative relative frequency (%) or (N)

how many observations are less than or equal to a given valueĀ 

55
New cards

how to calculate cumulative relative frequency for a data set

  1. Order data from smallest to largest

  2. Start with first frequency; add frequencies and sum

56
New cards

how to calculate the proportion of observations in a given category for a data set?

the most important descriptive statistic for a categorical variable

# of observations in category/total # of observations

57
New cards

sampling distribution

describes how an estimate varies across repeated samples from the same population

  • as # of observations increases, the spread (and uncertainty) decreases

  • is less variable than the distribution of individual observations

58
New cards

standard error (SE)

measures the uncertainty of an estimateĀ 

  • reflects precision of estimate

  • as sample size increases, the standard error of mean decreases

59
New cards

confidence interval

a range of values surrounding the sample estimate that is likely to contain the population parameter

captures the true mean in 95% of repeated samples

60
New cards

2SE rule

  • a rough approximation of the 95% confidence interval for a mean can be calculated as the sample mean +/- 2 standard errors (SE)

61
New cards

error bars

lines on a graph extending outward from the sample estimate to show uncertainty about the value of the parameter being estimated

  • displays uncertainty, not spread of data