Ch12: Data-Based and Statistical Reasoning

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/53

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

54 Terms

1
New cards
Measures of Central Tendency
Describe “middle” of data
2
New cards
Outlier
Extremely large or small value compared to other data

Median and mean far apart
3
New cards
Outlier Causes

1. Statistical anomaly
2. Measurement error
3. Distribution not approximated to normal distribution
4
New cards
Mean/Average
Use when values are close together

Outliers shift mean
5
New cards
Median
Midpoint of data in ascending order (50% above and 50% below point)

Use when outliers present
6
New cards
Mode
Value appearing most in data
7
New cards
Normal Distribution
Symmetrical

All measures of central tendency equal
8
New cards
Standard Distribution (Bell Curve)
Transformed normal distribution

Mean = 0 and SD = 1
9
New cards
Skewed Distribution
Asymmetrical (tail on one side)

Different mean, median, mode values
10
New cards
Negatively Skewed Distribution
Tail on left side

Mean < Median
11
New cards
Positively Skewed Distriution
Tail on right side

Mean > Median
12
New cards
Bimodal Distribution
2 peaks
13
New cards
Range
Diff between largest and smallest values

Heavy outlier influence
14
New cards
Interquartile Range (IQR)
Diff between third and first quartile

Determine outliers
15
New cards
IQR: Quartiles
Relative to median

Divide data into groups containing 1/4 of set
16
New cards
IQR: Q1
n x 1/4

Whole Number: Mean of value at position and next highest position

Decimal: Round up
17
New cards
IQR: Q2
Median
18
New cards
IQR: Q3
n x 3/4

Whole Number: Mean of value at position and next highest position

Decimal: Round up
19
New cards
IQR: Outliers
Values 1.5 x IQR below Q1 or above Q3
20
New cards
Standard Deviation (SD)
Measure variability around mean

1/4 of range
21
New cards
SD: Outliers
Values 3 x SD away from mean
22
New cards
SD: Normal Distribution
68% data in 1 SD of mean

95% data in 2 SD of mean

99% data in 3 SD of mean
23
New cards
Independent Events
No effect on each other

Probabilities never change
24
New cards
Dependent Events
Affect each other

Probabilities change depending on prior event outcome
25
New cards
Mutually Exclusive Outcomes
Cannot occur together
26
New cards
Non-Mutually Exclusive Outcomes
Can occur together
27
New cards
Exhaustive Outcomes
Only possible outcomes of event
28
New cards
Hypothesis Testing
Test difference between 2 pop with known distribution
29
New cards
Null Hypothesis
2 pop equal (no difference)
30
New cards
Alternative Hypothesis
Nondirectional: Pop not equal

Directional: Pop A diff in specific way from pop B
31
New cards
P-Value
Test statistic to determine if results obtained by random chance
32
New cards
Significant Level (α)
Compare to p-value

Usually 0.05
33
New cards
p-value > α
Fail to reject null hypothesis

No statistically significant diff between pops
34
New cards
p-value < α
Reject null hypothesis

Statistically significant diff between pops
35
New cards
Type I Error
Report diff between groups when no diff exists (reject null when null true)

α
36
New cards
Type II Error
Report no diff between groups when diff exists (fail to reject null when null false)

β
37
New cards
Power
Reject null when null false

1-β
38
New cards
Confidence
Fail to reject null when null true

1-α
39
New cards
Confidence Intervals
Range of confidence for mean

Sample mean ± SD x z-score
40
New cards
Charts
Visual representation of info
41
New cards
Charts: Pie/Circle
Relative amounts for categorical data

Usually for demographics
42
New cards
Charts: Bar
Categorical data

Bar proportional to value
43
New cards
Charts: Histogram
Numerical data

Determine mode from data distribution
44
New cards
Charts: Box Plot
Numerical data

Boxes: Q1 and Q3

Middle Lines: Median

Whiskers: 1.5 x IQR

Points: Outliers
45
New cards
Charts: Maps
Geographic representation
46
New cards
Graphs
Use axes to provide numerical data
47
New cards
Graphs: Linear
Relationship between 2 variables

Can be linear, parabolic, exponential, logarithmic
48
New cards
Graphs: Semilog
Log graph to linear with axis ratio change (10 exponents)

1 log axis and 1 linear
49
New cards
Graphs: Log-Log
Both axis use constant ratio change

2 log axes
50
New cards
Correlation
Connection/relationship between data
51
New cards
Positive Correleation
Var trend together (both increase)
52
New cards
Negative Correlation
Var trend in opposite dir (1 increase, 1 decrease)
53
New cards
Correlation Coefficient
Between -1 and +1

Show relationship strength
54
New cards
Causation
1 var cause changes in other

Can only be proven with Hill’s criteria