Data-based and Statistical Reasoning (12)

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/27

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:27 PM on 4/9/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

28 Terms

1
New cards

Measures of Central Tendency

provides a single value representation for the middle of a group of data

2
New cards

Arithmetic Mean or Average

is a measure of central tendency that equally weighs all values; it is most affected by outliers

3
New cards

Median

is the value that lies in the middle of the data set. Fifty percent of data points are above and below the median

4
New cards

Mode

is the data point that appears most often; there may be multiple (or zero) of these in a data set

5
New cards

Normal Distribution

is symmetrical. The mean, median, and mode are all the same in the normal distribution

6
New cards

Standard Distrubution

is a normal distribution with a mean of zero and a standard deviation of one; it is used for most calculations. 68% if data points occur within one standard deviation of the mean, 95% within two, and 99% within three

7
New cards

Skewed Distribution

have differences in their mean, median, and mode; the skew direction is the direction of the tail of the distribution

8
New cards

Bimodal Distribution

have multiple peaks, although not necessarily multiple modes, strictly speaking. It may be useful to perform data analysis on the two groups separately

9
New cards

Range

is the difference between the largest and smallest values in a data set

10
New cards

Interquartile Range

is the difference between the value of the third quartile and first quartile; this can be used to determine outliers

11
New cards

Standard Deviation

is a measurement of variability about the mean; this can also be used to determine outliers

12
New cards

Outliers

may be a result of true population variability, measurement error, or a non normal distribution

13
New cards

Independent Events

the probability of this does not change based on the outcomes of other events

14
New cards

Dependent Events

the probability of this changes depending on the outcome of other events

15
New cards

Mutually Exclusive Outcomes

cannot occur simultaneously

16
New cards

Exhaustive

when a set of outcomes is _________, there are no other possible outcomes

17
New cards

Hypothesis Tests

use a known distribution to determine whether a hypothesis of no difference (the null hypothesis) can be rejected

18
New cards

p-value; Significance Level (alpha)

whether or not a finding is statistically significant is determined by the comparison of a _________ to the selected __________

19
New cards

0.05

a significance level of ______ is commonly used

20
New cards

Confidence Intervals

are a range of values about a sample mean that are used to estimate the population mean

21
New cards

Confidence Level

a wider interval is associated with a higher _________ (95% is common)

22
New cards

Pie Charts (Circle Charts) and Bar Charts

are both used to compare categorical data

23
New cards

Histograms and Box Plots (Box-and-whisker Plots)

are both used to compare numerical data

24
New cards

Maps

are used to compare up to two demographic indicators

25
New cards

Linear, Semilog, and Log-log Plots

can be distinguished by their axes

26
New cards

Slope

can be calculated most easily from linear plots

27
New cards

Tables

may contain related or unrelated categorical data

28
New cards

Correlation and Causation

are separate concepts that are linked by Hill’s criteria