Stats Unit 1 Vocab

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/38

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

39 Terms

1
New cards

Descriptive stats

describe/display sample data

2
New cards

Inferential stats

use sample data to make inference about larger population

3
New cards

Population

  • Size N, set of all subjects we want to describe

  • defined by researcher

  • large, varied, and hard to describe

4
New cards

parameters

  • characteristics of a population

  • μ- mean, σ- stdev, ρ- correlation

5
New cards

sample

  • size n, subset of a population

6
New cards

statistics

  • characteristics of a sample (estimates of parameters of a pop)

  • X̄- mean, s-stdev, r-correlation

7
New cards

sampling error

happens by chance, reduce by increase sample (size n)

8
New cards

bias

systematic discrepancy, eliminate by taking a random sample

9
New cards

categorical variable

  • define membership in a group

  • (univariate) describe using frequencies or proportions/%

10
New cards

categorical nominal

categorical variable that has no inherent order (last movie seen, etc)

11
New cards

categorical ordinal

categorical variable that exists in an order (ie fruit ripeness, class)

12
New cards

numeric variable

  • quantitative measurement that typically has magnitude and units

  • (univariate) describe using a measure of center (mean, median, mode) and a measure of spread (stdev, IQR)

13
New cards

numeric discrete

variable that only exists in individual increments, doesn’t typically have decimals (# of texts sent, shoe size (only have .5 increments))

14
New cards

numeric continuous

exists as any real number (decimals typically make sense); hours slept, height, etc

15
New cards

confounding variable

unmeasured variables that may mask/distract from the causal relationships between variables of interest (homeless people cause car crashes)

16
New cards

univariate

describe 1 variable’s distribution

17
New cards

bivariate

describe relationship between 2 variables

18
New cards

explanatory variable

(bivariate) independent/predictor variable; see if it explains or predicts another variable

19
New cards

response variable

(bivariate) dependent/outcome variable, what you want to explain/predict (variable of interest)

20
New cards

standard deviation

measure of spread, gives average distance for each value to X̄ (mean)

21
New cards

Inner quartile range (IQR)

measure of spread, distance between 1st (25th percentile) and 3rd quartile (75th percentile)

Q3-Q1

22
New cards

percentile

percentage of values in a set of data scores that fall below a given value (25th percentile- 25% of the values are below the given value)

23
New cards

5 number summary

set of numbers that explain distribution

{min, Q1, median, Q3, max}

24
New cards

Describing numeric variables

  • include shape (symmetric or skewed) and a measure of center and spread (depends on shape)

25
New cards

symmetric

describe using x̄ (mean) and s (stdev)

26
New cards

skewed

describe using median and IQR

pos/neg based on which direction the tail points

27
New cards

contingency/2-way frequency table

table to display 2 categorical variables

compare the marginal distribution to the conditional distribution

28
New cards

marginal distribution

percentage out of the totals, probability of one variable

29
New cards

conditional distribution

probability of one variable given another

30
New cards

grouped bar chart

used to display 2 categorical variables

<p>used to display 2 categorical variables</p>
31
New cards

mosaic plot

used to display 2 categorical variables, better for when group sizes differ

<p>used to display 2 categorical variables, better for when group sizes differ</p>
32
New cards

grouped box plot

used to display one categorical, one numeric variable

<p>used to display one categorical, one numeric variable</p>
33
New cards

grouped histogram

used to display one categorical, one numeric

<p>used to display one categorical, one numeric</p>
34
New cards

pearson correlation (r)

tells strength and direction of linear relationships

35
New cards

law of large numbers

probability of an even is what its relative frequency will converge on after infinite trials

36
New cards

or probability

pr(a or b) = pr(a) + pr(b) - pr(a and b)

  • removes overlap

37
New cards

mutually exclusive events

pr(a and b) =0

38
New cards

and probability

pr(a and b) = pr(a) * pr(b|a)

  • to remove overlap

39
New cards

conditional probability

pr(b|a)

  • probability of b, given a has already happened

  • reduces sample space