Stats Unit 1 Vocab

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall with Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/42

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No study sessions yet.

43 Terms

1
New cards

Individual

An object described in a set of data.

2
New cards

Variable

A attribute that can take dif. values for dif. individuals

3
New cards

Categorical Variable

Assigns labels that place individuals into a specific group called category

4
New cards

Quantitative Variable

Takes # values (quantities) ex. counts, measures, age

5
New cards

Discrete Vriaable

A quantitative variable that takes a fixed set of possible values w/ gaps between them. (ex. 1 - 2 - 3)

6
New cards

Continuous Variable

A quantitative variable that can take any value in an interval on the number line (ex. 0- infinity)

7
New cards

Distribution

Tells us what values the variable takes and how often

8
New cards

Frequency Table

shows # of individuals having each value

9
New cards

Relative Frequency Table

shows proportion/percent of individuals having each value

10
New cards

Bar Graph

shows each category as a bar, heights of bars show category frequencies/relative frequencies

11
New cards

Pie Chart

shows each category as a slice of a pie. Areas of slices are proportional to category frequencies/relative frequencies

12
New cards

Two-way-Table

Table of counts that summarizes data on the relationship between two categorical variable for some group of individuals.

13
New cards

Marginal Relative Frequency

Gives the %/proportion of individuals that have a specific value for one categorical variable. (tells you only about one of the variables in a two-way table)

14
New cards

Joint Relative Frequency

Gives the %/proportion of individuals w/ specific values for two categorical variables.

15
New cards

Conditional Relative Frequency

Gives the proportion of individuals that has a specific value for one categorical variable among individuals who share the same value of another categorical variable (the condition).

16
New cards

Association

There is association between two variable if knowing value of one, helps predict the other.

17
New cards

Graphs

Use a side by side bar graph, a segmented bar graph or a mosaic plot to compare the distribution of a categorical variable for two or more groups.

18
New cards

Dotplot

Show each data value as a dot above its location on a #-line

19
New cards

Symmetric Distribution

right side of dot plot = roughly a mirrored version on the left

20
New cards

Skewed Right

Right side is much longer and has less values

21
New cards

Skewed Left

Left side has less values and is much longer

22
New cards

Graph Shape

Ex. The distribution is skewed right with a single peak at 1 goal. There is a gap between 5 and 9.

23
New cards

Outliers

An observation that falls outside of the overall pattern. Ex. Games where 9&10 goals were scored appear to be outliers.

24
New cards

Center

The median is 2 goals scored.

25
New cards

Variability

The data vary from 1 to 10 goals scored.

26
New cards

Stemplot

shows each data value separated into two parts.

A stem: consisted of all but the final digit

A Leaf: the final digit

Stems are ordered from lowest to highest in a vertical column

Leaves are ordered from smallest to largest number from the appropriate stems

<p>shows each data value separated into two parts.</p><p></p><p>A stem: consisted of all but the final digit</p><p>A Leaf: the final digit</p><p>Stems are ordered from lowest to highest in a vertical column</p><p>Leaves are ordered from smallest to largest number from the appropriate stems</p>
27
New cards

Histograms

Shows each interval of values as a bar.

Heights of bars show the frequencies or relative frequencies of value in each interval.

28
New cards

Mean

The average of all the individual data values.

29
New cards

Statistic

A # that describes some characteristic of a sample.

30
New cards

Parameter

A # that describes some characteristic of a population.

31
New cards

Resistant

A statistical measure is resistant if it isn’t sensitive to extreme values.

32
New cards

Median

The midpoint of a distribution, half the observations are smaller, half are larger.

33
New cards

Comparing mean and median

Mean < Median : skewed left

Mean = Median : Roughly Symmetric

Mean > Median : skewed right

34
New cards

Range

The distance between the min. & max. value

Range = max - min

Ex. The data varies from 1 to 10 with a range of 9.

35
New cards

St. dev

measures the typical distance of values in a distribution from the mean

  • always greater of equal to 0

  • 0 when all values in a dsitrubtion = same

  • increases with increased variation from mean

  • measures variation about the mean

36
New cards

St dev formula

( value - mean)² = variance

square root variance = st. dev

37
New cards

Quartiles

the values that divide the distribution into 4 groups of roughly equal size. Arrange data least to greatest to find.

38
New cards

1st Quartile

the median of the data values that are left of the median

39
New cards

3rd Quartile

The median of the data values to the right of the median

40
New cards

IQR (interquartile range)

measures the variability in the middle half of the distribution

the distance between the 1st and 3rd quartiles of a distribution

IQR = Q3 - Q1

41
New cards

How to identify outliers

If it falls more than 1.5 x IQR above the 3rd or below the 1st quartile

Low Outliers < Q1 - (1.5 x IQR)

High Outliers > Q3 - (1.5 x IQR)

42
New cards

5 Number Summary

The 5 #;s of a distribution of quantitative data consists of the

minimum, Q1, median, Q3, maximum

43
New cards

Boxplot

A visual representation of the 5 # summary