Purdue STAT 301 Exam 1

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/74

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

75 Terms

1
New cards

Categorical variable

a variable that is described in WORDS (ex: eye color)

2
New cards

Quantitative variable

a variable that is described in NUMBERS (ex: weight)

3
New cards

Bar graphs are used for

categorical data

4
New cards

Pie charts are used for

quantitative data

5
New cards

Stemplots

are tables in which NUMERICAL (quantitative) data values are divided into "stems" that can have multiple "leaves"

6
New cards

Histograms

are graphs consisting of vertical bars that touch each other and represent the frequency distribution of a set of data (quantitative)

7
New cards

bar graph

knowt flashcard image
8
New cards

pie chart

knowt flashcard image
9
New cards

Stemplot

knowt flashcard image
10
New cards

Histogram

knowt flashcard image
11
New cards

Outliers

extreme values that don't appear to belong with the rest of the data

12
New cards

influential observation

an observation that has a strong influence or effect on the regression results

13
New cards

1 peak=

unimodal

14
New cards

2 peaks=

bimodal

15
New cards

more than 2 peaks=

multimodal

16
New cards

symmetric

mean is equal to median

<p>mean is equal to median</p>
17
New cards

right skewed

mean is greater than median

<p>mean is greater than median</p>
18
New cards

left skewed

mean is less than median

<p>mean is less than median</p>
19
New cards

Center of Distribution

Described by the mean, median, or mode, it is in some way the middle of the distribution.

20
New cards

Spread of Distribution

Described by Range, Interquartile Range, or Standard Deviation, the spread says how "wide" the distribution is.

21
New cards

Outliers

Any point that falls outside the pattern of the association should be considered an outlier.

22
New cards

Influential Points

A point is influential if it has a big effect on a calculation, such as the correlation or equation of the least-squares regression line. Points separated in the x-direction are often influential.

23
New cards

Mean

only used when data is NOT skewed, only used with continuous data (ratio or interval)

24
New cards

Median

used when data IS SKEWED, aka 50% percentile, normal distribution curve, can be used with all the data types

25
New cards

range

the difference between the highest and lowest scores in a distribution

26
New cards

IQR (interquartile range)

measure of statistical dispersion, being equal to the difference between the upper and lower quartiles (IQR = Q3 − Q1)

27
New cards

Variance

a difference between what is expected and what actually occurs (standard deviation squared)

28
New cards

standard deviation

a computed measure of how much scores vary around the mean score (square root of variance)

29
New cards

resistant measure

A statistic that is not affected very much by extreme observations.

30
New cards

5 number summary

minimum, Q1, median, Q3, maximum

31
New cards

1.5 IQR Rule

used for identifying outliers, any values that are more than 1.5 times the IQR lower than the first quartile or higher than the third quartile are called outliers

32
New cards

Boxplots (modified)

knowt flashcard image
33
New cards

Boxplots (side-by-side)

knowt flashcard image
34
New cards

unit/subject

one member of the entities being studied

35
New cards

Population vs. Sample

The population is the whole group versus a sample which are parts of the population.

36
New cards

Census

the official count of a population

37
New cards

Experiment

A study is an experiment ONLY if researchers impose a treatment upon the experimental units.

38
New cards

Observational Study

In an observational study, researchers make no attempt to influence the results and cannot conclude cause-and-effect.

39
New cards

non-random sampling

an alternative sampling method to random sampling, where the sample is not chosen at random.

40
New cards

voluntary response sample

A sample which involves only those who want to participate in the sampling

41
New cards

simple random sample (SRS)

every member of the population has a known and equal chance of selection

42
New cards

stratified random sampling

separation of the target population into different groups, called strata, and the selection of samples from each stratum

43
New cards

Multistage Random Sampling

a sample design in which the elements of the sampling frame are subdivided and the sample is chosen in more than one stage

44
New cards

anecdotal evidence

an informal observation that has not been systematically tested

45
New cards

undercoverage bias

occurs when some groups in the population are left out of the process of choosing the sample

46
New cards

nonresponse bias

bias introduced to a sample when a large fraction of those sampled fails to respond

47
New cards

response bias

people do not respond honestly

48
New cards

sampling variability

the natural tendency of randomly drawn samples to differ from each other

49
New cards

Parameter vs. Statistic

a characteristic or measure of a POPULATION vs. a characteristic or measure of SAMPLE

50
New cards

sampling distribution

the distribution of values taken by the statistic in all possible samples of the same size from the same population

51
New cards

treatments

the experimental conditions imposed by the experimenter

52
New cards

Factors

used during an experiment in order to determine their effect on the response variable

53
New cards

Factor levels

factors can only assume a limited number of possible values

54
New cards

explanatory variable

a variable that we think explains or causes changes in the response variable

55
New cards

response variable

a variable that measures an outcome or result of a study

56
New cards

control group

the group that does not receive the experimental treatment

57
New cards

Placebo

something which has a positive mental effect, but no physical effect

58
New cards

Bias

something that causes an inaccuracy in statistics and should be avoided

59
New cards

3 principals of experimental design

control, randomization, replication

60
New cards

completely randomized design

the treatments are assigned to all the experimental units completely by chance

61
New cards

block design

the random assignment of individuals to treatments is carried out separately within each block (group A and group B)

62
New cards

matched pairs design

A method of assigning subjects to groups in which pairs of subjects are first matched on some characteristic and then individually assigned randomly to groups.

63
New cards

double-blind experiment

an experiment in which neither the experimenter nor the participants know which participants received which treatment

64
New cards

review board

screening committees at research institutions that evaluate all research projects relative to their potential harm to participants

65
New cards

informed consent

an ethical principle that research participants be told enough to enable them to choose whether they wish to participate

66
New cards

Confidentiality

the act of holding information in confidence, not to be released to unauthorized individuals

67
New cards

ethics of doing experiments with animals

reduce, refine, replace

68
New cards

Causation

A cause and effect relationship in which one variable controls the changes in another variable.

69
New cards

lurking variable

a variable that is not among the explanatory or response variables in a study but that may influence the response variable

70
New cards

68-95-99.7 rule

in a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean

71
New cards

standard normal distribution

a normal distribution of z scores

72
New cards

sample mean symbol

73
New cards

standard deviation symbol

weird looking "o" that is actually a Greek letter that I do not know the name of (sorry)(maybe sigma?)

74
New cards

population mean symbol

μ

75
New cards

Central Limit Theorem (CLT)

the sampling distribution derived from a simple random sample will be approximately normally distributed