STATS 7 - Quiz 2 (LO3, LO4, LO5, LO6)

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/41

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 7:18 PM on 4/22/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

42 Terms

1
New cards

Categorical variable

The value of the variable consists of groups and categories that don’t have any logical ordering.

2
New cards

Quantitative variable

The raw data consists of numerical measurements on each individual/units or they consist of counts

3
New cards

Easiest way to identify categorical or quantitative variable?

Categorical variables are statements and quantitative variables are numbers

4
New cards

Bar graphs

Each characteristic, or level, is represented by a bar. The height of a bar represents either the count of individuals with that characteristic, the frequency, or the percent of individuals with that characteristic, the relative frequency.

5
New cards

Pie charts

A pie chart can only represent how one categorical variable breaks down into its components. Each characteristic is represented by a slice, and the size of a slice represents what percent of the whole is made up by that characteristic.

6
New cards

Histograms

A summary graph for a single quantitative variable. Useful to understand the pattern of variability in the data, especially for large data sets

7
New cards

Line graphs (time series plot)

Use them when there is a meaningful sequence, like time. The line connecting the points helps emphasize any change over time

8
New cards

Symmetric distribution

knowt flashcard image
9
New cards

Left skew

knowt flashcard image
10
New cards

Right skew

knowt flashcard image
11
New cards

Outliers

An important kind of deviation in the distribution of the data is an outlier. Outliers are observations that lie outside the overall pattern of a distribution. Always look for outliers and try to explain them.

12
New cards

Graphing time series

Data collected over time are displayed in a time plot, with time on the horizontal axis and the variable of interest on the vertical axis

We look for a possible trend and possible cyclical variations

13
New cards

Long-term trend

Variables change steadily over time

14
New cards

Seasonal components

Trend to be high in certain months or seasons and low in other years

15
New cards

Irregular Cycles and Random Fluctuations

Irregular cycles may be explained by social, environmental, political, or other factors

Unexplainable random fluctuations are part of the natural variabiltiy present in all measurements

16
New cards

Mean (or average)

Add all values and then divide by the number of values

17
New cards

Median

The middle value after the numbers have been put in order

18
New cards

Mode

Most common value in a dataset

19
New cards

Comparing median and mean

The median is a measure of center that is resistant to skew and outliers. The mean is not.

20
New cards

Variability

Describes how spread out or close together the data values are.

21
New cards

Minimum

The smallest value in the data set

22
New cards

Maximum

The largest value in the data set

23
New cards

Range

The difference between the maximum and minimum

24
New cards

First Quartile (Q1)

The median of the values below the median in the sorted data set

25
New cards

Third Quartile (Q3)

The median of the values above the mean in the sorted set

26
New cards

Five number summary

Minimum, Q1, Median, Q3, Maximum

27
New cards

Boxplots

Visual representation of five number summary

28
New cards

IQR

Q3 - Q1

29
New cards

Lower Bound

Q1 - (1.5 * IQR)

30
New cards

Upper Bound

Q3 + 1.5 * IQR

31
New cards

Standard Deviation

A useful measure of how spread out the numbers in a data set are

32
New cards

How to find standard deviation

Find mean, find deviation of each value to mean (value - mean), square the deviations, divide sum by number of values - 1 to find variance, square root the variance to find standard deviation

33
New cards

Contingency Table

Represent the counts and values of one variable in the rows and represent the counts and values of another variable in the columns

<p>Represent the counts and values of one variable in the rows and represent the counts and values of another variable in the columns</p>
34
New cards

Relative risk or relative odds

Involves comparing one category to another category

35
New cards

percentage with the trait

(number with trait / total) x 100%

36
New cards

Proportion with the trait

number with trait / total

37
New cards

Odds of having that trait

number with trait / number without trait to 1

38
New cards

Relative risk

Relative risk of an outcome for two categories of an explanatory variable is the ratio of the risks of the risks of each category. (So the risk of getting allergy from owning a pet vs risk of getting allergy from not owning a pet - double calculation)

39
New cards

Baseline risk

Risk without the treatment or behavior. Difficult to read

40
New cards

Increased risk

Comparing relative risk vs baseline risk to see if your chance of gaining something has changed or not

41
New cards

Odds ratio

Ratio of the odds, like the relative risk of odds

42
New cards

Simpson’s paradox

Third variable is missing due to combining of data leading to false conclusions