intro to statistics


50 Terms

1

frequency

number of values that fall within a given interval

2

interval

a range of values with a numerical width (ex. 50 ≤ x < 60)

3

modal class

the class (interval) with the highest frequency

4

continuous variable

can take any value within a range, so there are infinitely many possibilities (ex. time)

5

difference between stratified and quota sampling

The main difference is that in stratified sampling, you draw a random sample from each subgroup (probability sampling). In quota sampling, you select a predetermined number or proportion of units from each subgroup in a non-random manner (non-probability sampling).

6

discrete variable

can only take particular, separate values (ex. number of siblings)

7

mean

the sum of all the values divided by the number of values

8

median

also the 50th percentile

9

mode

value with the highest frequency

10

cumulative frequency

the running total of the frequencies: each frequency is added to the sum of all the frequencies that came before it
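
A minimal sketch of the running total described above. The frequencies are made-up values for five class intervals, not data from this set.

```python
from itertools import accumulate

# Hypothetical frequencies for five class intervals (made-up values).
frequencies = [3, 7, 12, 6, 2]

# Each entry is that class's frequency plus every frequency before it.
cumulative = list(accumulate(frequencies))
print(cumulative)  # [3, 10, 22, 28, 30]
```

The last cumulative frequency always equals the total number of values.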

11

percentile

the score below which a certain percentage of the data lies. ex. if you score in the 95th percentile, that means 95% of others scored less than you

12

cumulative frequency graph

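a graph of cumulative frequency plotted against the upper boundary of each class. used to estimate the median, quartiles, and percentiles
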
13

upper quartile

75th percentile

14

lower quartile

25th percentile

15

interquartile range

Q3 − Q1 (upper quartile minus lower quartile)

16

spread/variation

how far the data values are spread out from the mean

17

range

the difference between the max and min values

18

box and whisker plot

a diagram showing the minimum, lower quartile, median, upper quartile, and maximum of a data set

19

upper boundary

upper quartile + (1.5)(IQR). any data larger than this is an outlier

20

lower boundary

lower quartile - (1.5)(IQR). any data smaller than this is an outlier
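
The upper and lower boundary cards can be sketched together as follows. The data are made up, and `quartiles` is a hypothetical helper that assumes the median-of-halves convention for quartiles; other conventions give slightly different Q1 and Q3.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, Q3) using the median-of-halves convention (an assumption)."""
    s = sorted(values)
    half = len(s) // 2
    lower_half = s[:half]
    # For an odd count, the middle value is left out of both halves.
    upper_half = s[half:] if len(s) % 2 == 0 else s[half + 1:]
    return median(lower_half), median(upper_half)

data = [2, 4, 5, 7, 8, 9, 11, 30]  # made-up values
q1, q3 = quartiles(data)
iqr = q3 - q1

lower_boundary = q1 - 1.5 * iqr
upper_boundary = q3 + 1.5 * iqr

# Anything outside the two boundaries counts as an outlier.
outliers = [x for x in data if x < lower_boundary or x > upper_boundary]
print(lower_boundary, upper_boundary, outliers)  # -3.75 18.25 [30]
```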

21

standard deviation σx

a measure of how spread out the values are around the mean. the smaller it is, the more consistent the values are

22

variance

standard deviation squared
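
A short check of the link between standard deviation and variance, using Python's standard `statistics` module. The data are made up, and the population versions (`pstdev`, `pvariance`) are assumed here because they correspond to σx rather than the sample value Sx.

```python
from math import isclose
from statistics import pstdev, pvariance

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up values with mean 5

sigma = pstdev(data)        # population standard deviation
variance = pvariance(data)  # population variance

# The variance is the standard deviation squared.
print(sigma, variance, isclose(sigma ** 2, variance))
```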

23

finding σx on a calculator

stat → edit → put the values in L1 (if using frequencies, put the midpoint of each interval in L1 and the frequencies in L2) → stat → calc → 1: 1-Var Stats (FreqList = L2 if using frequencies) → calculate → enter. σx is listed in the results

24

simple random sampling

everyone has an equal chance of being selected. ex. drawing names out of a hat

25

systematic sampling

selects members at regular intervals. ex. polling every 20th customer. interval = population size / sample size
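
A minimal sketch of the interval rule above, assuming a made-up population of 100 numbered customers and a random starting point within the first interval.

```python
import random

population = list(range(1, 101))  # hypothetical customers numbered 1 to 100
sample_size = 5

# interval = population size / sample size
interval = len(population) // sample_size  # 20

# Pick a random start within the first interval, then take every 20th member.
start = random.randrange(interval)
sample = population[start::interval]
print(sample)
```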

26

stratified sampling

sample has the same proportion from each stratum (group with commonality) as the population does, selected randomly
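
One way to sketch proportional, random selection from each stratum. The strata names and sizes are invented for illustration. Rounding is used to get whole numbers, so for other sizes the stratum counts may not add up exactly to the target.

```python
import random

# Hypothetical strata: 60 members in one group, 40 in another.
strata = {
    "year 9": list(range(0, 60)),
    "year 10": list(range(60, 100)),
}
sample_size = 10
total = sum(len(members) for members in strata.values())

# Each stratum contributes in proportion to its share of the population,
# and members within a stratum are chosen at random.
sample = {
    name: random.sample(members, round(sample_size * len(members) / total))
    for name, members in strata.items()
}
print({name: len(chosen) for name, chosen in sample.items()})
```

Quota sampling would use the same counts per stratum but fill them without random selection.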

27

quota sampling

sample has the same proportion from each stratum (group with commonality) as the population does, but members are selected non-randomly (chosen by the researcher)

28

cluster sampling

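the population is divided into groups (clusters) that are each likely to be representative of the entire population, and some clusters are randomly selected to be sampled instead of sampling across the whole population. ex. surveying employees from some Walmart stores instead of employees throughout the chain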

29

multistage sampling

several levels of random sampling

30

voluntary sampling

members of a sample group are invited to participate

31

convenience sampling

members are chosen out of convenience

32

sampling bias

sample does not accurately represent population due to sampling technique/members chosen

33

non-response bias

groups are underrepresented because they chose not to respond

34

measurement bias

the collection process affects the dependent variable. ex. coercion, multiple choice survey

35

response bias

members give inaccurate answers

36

a sample has mean x̄ and standard deviation σ. if d is subtracted from every value…

  • mean = x̄ − d

  • standard deviation = σ (unchanged)

  • variance = σ² (unchanged)

37

a sample has mean x̄ and standard deviation σ. if every value is multiplied by p…

  • mean = px̄

  • standard deviation = pσ (|p|σ if p is negative)

  • variance = p²σ²
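
Both transformation rules (subtracting a constant, multiplying by a constant) can be checked numerically. This is a sketch with made-up data and arbitrary choices d = 3 and p = 2, using population standard deviation.

```python
from math import isclose
from statistics import mean, pstdev, pvariance

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up values: mean 5, standard deviation 2
d, p = 3, 2                      # arbitrary constants for illustration

shifted = [x - d for x in data]  # subtract d from every value
scaled = [p * x for x in data]   # multiply every value by p

# Subtracting d moves the mean but leaves the spread unchanged.
print(mean(shifted), pstdev(shifted), pvariance(shifted))

# Multiplying by p scales the mean and standard deviation by p, the variance by p squared.
print(mean(scaled), pstdev(scaled), pvariance(scaled))
```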

38

correlation

the relationship between 2 variables

39

correlation coefficient ( r )

measures the strength and direction of linear correlation. ranges from -1 to 1

40

+r

positive correlation

41

-r

negative correlation

42

r=-1

perfect negative correlation

43

r=1

perfect positive correlation

44

r=0

no linear correlation

45

r is close to 1 or -1

strong positive/negative correlation
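
The r cards above can be illustrated by computing the coefficient directly. This sketch assumes the standard Pearson formula; `pearson_r` is a hypothetical helper name and the data are made up.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
r_up = pearson_r(xs, [2, 4, 6, 8, 10])    # y rises exactly with x: r = 1
r_down = pearson_r(xs, [10, 8, 6, 4, 2])  # y falls exactly with x: r = -1
r_mixed = pearson_r(xs, [2, 4, 5, 4, 5])  # positive but not perfect
print(r_up, r_down, r_mixed)
```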

46

finding r on graphing calculator

stat → edit → put the x and y values in L1 and L2 → stat → calc → 8 → calculate. make sure diagnostics are switched on first (DiagnosticOn in the catalog), otherwise r is not shown

47

line of best fit

A line that represents the trend or pattern in a scatter plot. It minimizes the distance between the data points and the line, showing the overall relationship between the variables. It must pass through the mean point (x̄, ȳ).

48

linear regression

method for finding the line of best fit. Linear regression is used for predicting and for understanding the effect of the independent variable on the dependent variable. on a graphing calculator, follow the same steps as for finding r but choose 4 instead of 8
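
A sketch of least-squares linear regression for a line y = mx + c, with made-up data. It assumes the usual least-squares formulas for the gradient and intercept, and confirms that the fitted line passes through the mean point.

```python
from math import isclose

xs = [1, 2, 3, 4, 5]  # made-up independent variable values
ys = [2, 4, 5, 4, 5]  # made-up dependent variable values

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Least-squares gradient and intercept for y = m*x + c.
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
sxx = sum((x - mean_x) ** 2 for x in xs)
m = sxy / sxx
c = mean_y - m * mean_x

# The line of best fit passes through the mean point.
passes_through_mean = isclose(m * mean_x + c, mean_y)
print(round(m, 2), round(c, 2), passes_through_mean)
```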

49

x = my + c

the regression line of x on y. it minimizes the horizontal distances of the points from the line, and is found by switching the x and y columns

50

r²

the closer to 1, the stronger the correlation