STAT1450 Test 1

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/74

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

75 Terms

1
New cards

population

collection of all elements (people or things) of interest

2
New cards

example of population

ALL cscc students, ALL toddlers at daycare

3
New cards

sample

subset of the population actually surveyed- usually identified by the sample size (n=#)

4
New cards

example of sample

50 cscc students, 12 toddlers at daycare

5
New cards

parameter

summary values that comes from population data, usually noted by greek letters (mu=populaiton mean, sigma= population standard deviation)

6
New cards

statistic

summary values that come from sample data

7
New cards

descriptive statistics

facts/outcomes of collected data, describes what was collected (volleyball highlights)

8
New cards

inferential statistics

uses descriptive stats to make predictions or generalize the population (predictions for who will win the game)

9
New cards

data collection methods

census, simulation, experiment, sampling

10
New cards

census

collect “as is” data from the entire population (collect data from all stat1450 sp25 students)

11
New cards

simulation

collect “pretend” data- reduces times, risk and cost (flight simulation, study effects of meds on mice)

12
New cards

experiment

impose a treatment and record the response, often compares control (placebo) vs. experimental groups

13
New cards

sampling

collect “as is” data from a subset of the population (survey 30 randomly selected hair salon customers)

14
New cards

sampling methods

simple random, convenience, cluster, stratified, systematic

15
New cards

simple random

requires randomly selecting from a list of the entire population so that element of the population has the same change of being selected; this is the least biased sampling methods but the most costly/timely

16
New cards

example of simple random

drawing names from a hat, computer random dialing

17
New cards

convenience

survey those that are easily accessible, this is the most biased methods

18
New cards

example of convenience method

survey one class, phone in survey, magazine survey

19
New cards

cluster

sample all from a few pre-existing locations; in theory, each location is a good representation of the population (age, gender, etc)

20
New cards

example of cluster method

survey all students in 3 classes on campus

21
New cards

stratified

characteristic is determined that might bias the survey results (gender, politics, age) and sample equal/proportionate numbers from each

22
New cards

example of stratified method

20 male and 20 female, 100 democrat and 100 republican

23
New cards

systematic

sample every nth element of the population (every 3rd package on the assembly line)

24
New cards

qualitative data

words or numbers that describe words

25
New cards

examples of qualitative data

hometown, rank mood from 1-10, phone number, room number

26
New cards

quantitative data

number that represents counts or measurements

27
New cards

examples of quantitative data

age, gpa, number of siblings, “how much?”, total $

28
New cards

discrete data

finite number of outcome in any given interval, graph displays gaps

29
New cards

continuous data

infinite number of possible outcomes, graph displays no gaps/breaks

30
New cards

levels of measurement

nominal, ordinal, interval, ratio

31
New cards

nominal

“name”, qualitative data (words) that aren’t ranked, names something EX: hometown, zip code, favorite color

32
New cards

ordinal

“order”, qualitative data (words), that have an inherent rank, ranks words EX: military rank, class rank, letter grade

33
New cards

interval

quantitative data that has an arbitrary scale, zero does not mean absence of EX: clock time, temperature, year

34
New cards

ratio

quantitative data where zero means ‘nothing exists’, highest level of measurement EX: age, $, height, gpa

35
New cards

formula to find class width

(max-min)/ # of classes; round up to next whole #

36
New cards

histogram

  • bar graph

  • quantitative data

  • bars touch

  • horizontal axis: midpoints

  • vertical axis: frequency

37
New cards

relative frequency histogram

  • bar graph

  • bars touch

  • HA: midpoints

  • VA: relative frequency

38
New cards

polygon

  • line graph (connect dots)

  • HA: midpoints

  • VA: frequency

  • goes up and down

39
New cards

ogive

  • line graph

  • HA: upper class limits

  • VA: cumulative frequency

  • always increases/ stays level

  • NEVER decreases

40
New cards

stem and leaf plot rules

  • numeric order

  • can’t skip values

  • at least 3 stems required

41
New cards

time series plot

  • quantitative data

  • show how something changes over time

  • uses time units along HA (days, years, months)

  • uses collected data along VA

42
New cards

dot plot

  • quantitative data

  • used to display shape

  • stacked dots for data

43
New cards

box plot rule about outliers

there if an outlier if it is further than 1.5 from box

44
New cards

pareto chart

  • qualitative data

  • HA: categories

  • gaps between bars

  • go in decreasing height

  • VA: frequency/ rel. frequency

  • displays mode

  • shape is data dependent

  • common mode is brown hair color

45
New cards

pie chart

  • qualitative data

  • circle graph

  • separate parts of a whole

  • each section displays the amount

  • all add to 100%

46
New cards

measures of center

mean, media, mode

47
New cards

mean

average, “not resistant measure”→ affected by extreme data

48
New cards

median

middle piece of data, considered “resistant measure”→ not affected by extreme data

49
New cards

mode

data that occurred most often, no mode if all values occurred the same number of times

50
New cards

trimmed mean

average caluclated from a “trimmed” data set, trime % of data from both low and high end of sorted list, then find mean of the remaining values

  • Multiply % times n (sample number) to determine how many values to deleter from each end of the list

51
New cards

outliers

extreme values that pull the mean away from the center

52
New cards

best when no outliers exist

mean

53
New cards

best when outliers exist

median

54
New cards

measures of variation (spread)

range, standard deviation

55
New cards

range

max - min

56
New cards

standard deviation

average distance that each peice of data is away from the mean

  • Sx= sample standard deviation

  • Sigmax= population standard deviation

57
New cards

uniform distribution

no mode, mean=median

58
New cards

normal distribution

1 mode= mean= median (all in center top of peak)

59
New cards

skewed right

“positively skewed”, mean>median

60
New cards

skewed left

“negatively skewed”, mean<median

61
New cards

bimodal

2 modes, mean=median

62
New cards

empirical rule

only applies to normal distributions for k=1,2,3 respectively

63
New cards

empirical rule of 68%

mu (sample) ±1sigma (stdev sx)= 68%

64
New cards

empirical rule of 95%

mu ± 2sigma(stdev)= 95%

65
New cards

empirical rule of 99.7%

mu ± 3sigma=99.7%

66
New cards

chebyshev’s theorem

used when distribution is not normal, at least 1-(1/k²) of the data lie within k standard deviation of the mean

67
New cards

chebyshev’s theorem of 75%

mu (sample) ± 2(stdev)= 75%

68
New cards

chebyshev’s theorem of 89%

mu (sample) ± 3(stdev)= 89%

69
New cards

chebyshev’s theorem of 94%

mu (sample) ± 4(stdev)= 94%

70
New cards

coefficient of variation formula

stdev/mean; the smaller the CV- the more consistent the data

71
New cards

measures of relative position

percentiles, quartiles, z-scores, unusual data

72
New cards

percentiles

percent of data below a given value; P= #’s below given #/ total amount of #’s times 100

73
New cards

quartiles

divide data into quarters

  • Q1: 25% of data is below this

  • Q2: median, 50% of data is below this

  • Q3: 75% of data is below this

74
New cards

z-scores

number of standard deviations a value is from the mean

  • z= sample - mean/ stdev

  • pos= above mean

  • neg= below mean

  • 0= exactly on the mean

75
New cards

unusual data

data that is more than 2 standard deviation from the mean, left or right