Purdue STAT 301 Exam 1

Last updated 3:27 PM on 9/15/25

75 Terms

Categorical variable

a variable that places each individual into a CATEGORY or group, usually described in words (ex: eye color)

Quantitative variable

a variable that takes NUMERICAL values for which arithmetic such as averaging makes sense (ex: weight)

Bar graphs are used for

categorical data

Pie charts are used for

categorical data (the categories must be parts of one whole)

Stemplots

are tables in which NUMERICAL (quantitative) data values are divided into "stems" that can have multiple "leaves"
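
A minimal sketch of how a stemplot is built, assuming non-negative whole numbers with the tens digit as the stem and the ones digit as the leaf. The function name `stemplot` and the exam scores are made up for illustration.

```python
from collections import defaultdict

def stemplot(values):
    """Return stemplot rows for non-negative integers (stem = tens, leaf = ones)."""
    leaves = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        leaves[stem].append(leaf)
    rows = []
    # List every stem between the smallest and largest, even if it has no leaves.
    for stem in range(min(leaves), max(leaves) + 1):
        row = f"{stem} | " + "".join(str(leaf) for leaf in leaves.get(stem, []))
        rows.append(row.rstrip())
    return rows

scores = [52, 67, 68, 71, 75, 75, 79, 83, 88, 94]  # made-up exam scores
print("\n".join(stemplot(scores)))
# 5 | 2
# 6 | 78
# 7 | 1559
# 8 | 38
# 9 | 4
```

Empty stems are kept on purpose so that gaps in the data stay visible.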

Histograms

are graphs consisting of vertical bars that touch each other and represent the frequency distribution of a set of data (quantitative)
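
A rough sketch of the counting behind a histogram, assuming equal-width classes that include their left endpoint. The helper `histogram_counts` and the weights are hypothetical, and the bars are printed sideways as text rather than drawn vertically.

```python
def histogram_counts(values, start, width, n_bins):
    """Count the values in each class [start + i*width, start + (i+1)*width)."""
    counts = [0] * n_bins
    for v in values:
        i = int((v - start) // width)
        if 0 <= i < n_bins:  # values outside the classes are ignored
            counts[i] += 1
    return counts

weights = [112, 118, 125, 131, 134, 138, 142, 147, 155, 171]  # made-up data
counts = histogram_counts(weights, start=110, width=20, n_bins=4)
for i, c in enumerate(counts):
    lo = 110 + 20 * i
    print(f"{lo}-{lo + 20}: {'#' * c}")
# 110-130: ###
# 130-150: #####
# 150-170: #
# 170-190: #
```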

bar graph

pie chart

Stemplot

Histogram

Outliers

extreme values that don't appear to belong with the rest of the data

influential observation

an observation that has a strong influence or effect on the regression results

1 peak=

unimodal

2 peaks=

bimodal

more than 2 peaks=

multimodal

symmetric

the two sides of the distribution are roughly mirror images; mean is approximately equal to median

right skewed

long tail on the right; mean is typically greater than median

left skewed

long tail on the left; mean is typically less than median
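
A small check of the mean-versus-median comparison for the three shapes, using made-up data sets. The helper `compare_center` is hypothetical, and the comparison is only a rule of thumb; it does not replace looking at a graph.

```python
from statistics import mean, median

def compare_center(values):
    """Describe how the mean compares with the median."""
    m, med = mean(values), median(values)
    if m > med:
        return "mean > median (suggests right skew)"
    if m < med:
        return "mean < median (suggests left skew)"
    return "mean = median (consistent with symmetry)"

symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 2, 2, 3, 3, 3, 4, 4, 20]        # long tail of large values
left_skewed = [1, 17, 17, 18, 18, 18, 19, 19, 20]  # long tail of small values

for name, data in [("symmetric", symmetric),
                   ("right skewed", right_skewed),
                   ("left skewed", left_skewed)]:
    print(f"{name}: {compare_center(data)}")
```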

Center of Distribution

Described by the mean, median, or mode, it is in some way the middle of the distribution.

Spread of Distribution

Described by Range, Interquartile Range, or Standard Deviation, the spread says how "wide" the distribution is.

Outliers

Any point that falls outside the pattern of the association should be considered an outlier.

Influential Points

A point is influential if it has a big effect on a calculation, such as the correlation or equation of the least-squares regression line. Points separated in the x-direction are often influential.
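
To see how one point separated in the x-direction can drive a calculation, here is a sketch that computes the correlation with and without such a point. The data are made up and `correlation` is a hand-written helper, not a library call.

```python
from math import sqrt

def correlation(xs, ys):
    """Pearson correlation r computed from deviations about the means."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
ys = [3, 1, 4, 2, 3]  # almost no linear pattern

r_without = correlation(xs, ys)
r_with = correlation(xs + [20], ys + [15])  # add one point far out in x

print(f"r without the point: {r_without:.2f}")
print(f"r with the point:    {r_with:.2f}")
```

With these numbers r jumps from roughly 0.14 to roughly 0.97, so the single added point accounts for nearly all of the apparent association.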

Mean

the arithmetic average; best used when data are NOT strongly skewed and have no outliers (it is not resistant); requires quantitative data (interval or ratio)

Median

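the middle value of the ordered data, aka the 50th percentile; preferred when data ARE SKEWED or have outliers because it is resistant; equals the mean for a symmetric distribution such as the normal curve; can be used with ordinal or quantitative data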

range

the difference between the highest and lowest scores in a distribution

IQR (interquartile range)

measure of statistical dispersion, being equal to the difference between the upper and lower quartiles (IQR = Q3 − Q1)

Variance

the average of the squared deviations of the observations from their mean (standard deviation squared); the sample variance divides by n − 1

standard deviation

a computed measure of how much scores vary around the mean score (square root of variance)
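
A short worked example of the sample variance and standard deviation, assuming the usual n − 1 divisor for a sample. The data and function names are made up for illustration.

```python
from math import sqrt

def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    xbar = sum(values) / n
    return sum((x - xbar) ** 2 for x in values) / (n - 1)

def sample_sd(values):
    """Standard deviation is the square root of the variance."""
    return sqrt(sample_variance(values))

data = [2, 4, 4, 4, 5, 5, 7, 9]  # mean is 5, squared deviations sum to 32
print(f"variance = {sample_variance(data):.3f}")  # 32 / 7
print(f"sd       = {sample_sd(data):.3f}")
```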

resistant measure

A statistic that is not affected very much by extreme observations.
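
A quick illustration with made-up numbers: adding one extreme value moves the mean a lot but barely moves the median, which is why the median counts as resistant and the mean does not.

```python
from statistics import mean, median

incomes = [30, 32, 35, 38, 40]   # made-up values, in thousands
with_outlier = incomes + [500]   # add one extreme observation

print(mean(incomes), median(incomes))            # 35 35
print(mean(with_outlier), median(with_outlier))  # 112.5 36.5
```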

5 number summary

minimum, Q1, median, Q3, maximum
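
A sketch of computing the five-number summary, assuming the common intro-course convention that Q1 and Q3 are the medians of the lower and upper halves of the ordered data (leaving out the overall median when n is odd). Software often uses other quartile rules, so results can differ slightly. The function name and data are made up, and the helper assumes at least two observations.

```python
from statistics import median

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]            # observations below the median position
    upper = data[half + n % 2:]    # observations above the median position
    return (data[0], median(lower), median(data), median(upper), data[-1])

data = [1, 3, 4, 6, 7, 8, 10, 12, 15, 40]
print(five_number_summary(data))  # (1, 4, 7.5, 12, 40)
```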

1.5 IQR Rule

used for identifying outliers: any value more than 1.5 × IQR below the first quartile or more than 1.5 × IQR above the third quartile is flagged as a suspected outlier
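
The rule can be sketched as follows, assuming quartiles are taken as the medians of the lower and upper halves of the ordered data (other quartile conventions can shift the fences a little). The function names and data are made up.

```python
from statistics import median

def quartiles(values):
    """Q1 and Q3 as medians of the lower and upper halves of the sorted data."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]
    upper = data[half + len(data) % 2:]
    return median(lower), median(upper)

def iqr_outliers(values):
    """Return the values beyond the fences Q1 - 1.5*IQR and Q3 + 1.5*IQR."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [v for v in values if v < low_fence or v > high_fence]

data = [1, 3, 4, 6, 7, 8, 10, 12, 15, 40]
# Q1 = 4, Q3 = 12, IQR = 8, so the fences are -8 and 24.
print(iqr_outliers(data))  # [40]
```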

Boxplots (modified)

Boxplots (side-by-side)

unit/subject

one member of the entities being studied

Population vs. Sample

The population is the entire group of interest; a sample is the part of the population that is actually examined.

Census

an attempt to collect data from every individual in the population (ex: an official population count)

Experiment

A study is an experiment ONLY if researchers impose a treatment upon the experimental units.

Observational Study

In an observational study, researchers make no attempt to influence the results and cannot conclude cause-and-effect.

non-random sampling

an alternative sampling method to random sampling, where the sample is not chosen at random.

voluntary response sample

A sample which involves only those who want to participate in the sampling

simple random sample (SRS)

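a sample of size n chosen so that every possible group of n individuals has an equal chance of being the sample (so every member of the population has a known and equal chance of selection)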
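
A minimal sketch of drawing an SRS with Python's standard `random.sample`, which selects without replacement. The wrapper `simple_random_sample`, the ID numbers, and the seed are made up for illustration.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Draw n distinct individuals; each group of n is equally likely."""
    rng = random.Random(seed)  # a seed only makes the draw repeatable
    return rng.sample(population, n)

student_ids = list(range(1, 201))  # hypothetical sampling frame of 200 students
chosen = simple_random_sample(student_ids, n=10, seed=301)
print(sorted(chosen))
```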

stratified random sampling

separation of the target population into different groups, called strata, and the selection of samples from each stratum
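
A sketch of stratified random sampling, assuming a separate SRS is drawn inside each stratum. The strata, sample sizes, and the function name `stratified_sample` are hypothetical.

```python
import random

def stratified_sample(strata, n_per_stratum, seed=None):
    """Draw an SRS of the requested size from every stratum."""
    rng = random.Random(seed)
    return {name: rng.sample(members, n_per_stratum[name])
            for name, members in strata.items()}

strata = {
    "freshmen": [f"F{i}" for i in range(1, 41)],  # 40 made-up students
    "seniors": [f"S{i}" for i in range(1, 21)],   # 20 made-up students
}
sample = stratified_sample(strata, {"freshmen": 4, "seniors": 2}, seed=301)
print(sample)
```

Sampling 10% from each stratum here keeps both groups represented in proportion to their size.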

Multistage Random Sampling

a sample design in which the elements of the sampling frame are subdivided and the sample is chosen in more than one stage

anecdotal evidence

an informal observation that has not been systematically tested

undercoverage bias

occurs when some groups in the population are left out of the process of choosing the sample

nonresponse bias

bias introduced to a sample when a large fraction of those sampled fails to respond

response bias

people do not respond honestly

sampling variability

the natural tendency of randomly drawn samples to differ from each other

Parameter vs. Statistic

a characteristic or measure of a POPULATION vs. a characteristic or measure of SAMPLE

sampling distribution

the distribution of values taken by the statistic in all possible samples of the same size from the same population
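
Listing all possible samples is rarely practical, so a simulation can approximate the idea. The sketch below draws many samples of the same size from a made-up population and collects the sample means; the function name, population, and seed are assumptions for illustration.

```python
import random
from statistics import mean, pstdev, stdev

def simulate_sample_means(population, n, n_samples, seed=None):
    """Draw n_samples SRSs of size n and return the mean of each one."""
    rng = random.Random(seed)
    return [mean(rng.sample(population, n)) for _ in range(n_samples)]

population = list(range(1, 101))  # made-up population with mean 50.5
sample_means = simulate_sample_means(population, n=25, n_samples=1000, seed=301)

print(f"population mean:      {mean(population)}")
print(f"mean of sample means: {mean(sample_means):.2f}")
print(f"population sd:        {pstdev(population):.2f}")
print(f"sd of sample means:   {stdev(sample_means):.2f}")
```

The sample means differ from one sample to the next (sampling variability), center near the population mean, and vary much less than individual values do.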

treatments

the experimental conditions imposed by the experimenter

Factors

the explanatory variables in an experiment, manipulated in order to determine their effect on the response variable

Factor levels

the specific values a factor takes in an experiment; each factor can only assume a limited number of possible values

explanatory variable

a variable that we think explains or causes changes in the response variable

response variable

a variable that measures an outcome or result of a study

control group

the group that does not receive the experimental treatment

Placebo

a dummy treatment with no active ingredient; any improvement it produces (the placebo effect) comes from the subject's expectations, not from a physical effect

Bias

a systematic tendency of a study design or sampling method to favor certain outcomes, so that results consistently miss the truth in the same direction; should be avoided

3 principles of experimental design

control, randomization, replication

completely randomized design

the treatments are assigned to all the experimental units completely by chance
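
One way to sketch a completely randomized design is to shuffle all the units and deal them out to the treatments in turn. The function name, unit labels, and treatment names below are made up.

```python
import random

def completely_randomized(units, treatments, seed=None):
    """Shuffle every unit, then deal them out to the treatments in turn."""
    rng = random.Random(seed)
    shuffled = list(units)
    rng.shuffle(shuffled)
    groups = {t: [] for t in treatments}
    for i, unit in enumerate(shuffled):
        groups[treatments[i % len(treatments)]].append(unit)
    return groups

groups = completely_randomized(range(1, 13), ["A", "B", "C"], seed=301)
for treatment, members in groups.items():
    print(treatment, sorted(members))
```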

block design

the random assignment of individuals to treatments is carried out separately within each block (group A and group B)
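
A sketch of a block design under the assumption that the blocks are already formed: the shuffle and assignment are repeated separately inside each block. The block names, unit labels, and the function `randomized_block` are hypothetical.

```python
import random

def randomized_block(blocks, treatments, seed=None):
    """Randomly assign treatments separately within each block."""
    rng = random.Random(seed)
    assignment = {}
    for block_name, units in blocks.items():
        shuffled = list(units)
        rng.shuffle(shuffled)  # randomization happens inside the block only
        assignment[block_name] = {
            t: shuffled[i::len(treatments)] for i, t in enumerate(treatments)
        }
    return assignment

blocks = {
    "group A": ["a1", "a2", "a3", "a4"],
    "group B": ["b1", "b2", "b3", "b4", "b5", "b6"],
}
plan = randomized_block(blocks, ["drug", "placebo"], seed=301)
print(plan)
```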

matched pairs design

A method of assigning subjects to groups in which pairs of subjects are first matched on some characteristic and then individually assigned randomly to groups.
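
A small sketch of the random step in a matched pairs design, assuming the pairs have already been matched on some characteristic. A virtual coin flip decides which member of each pair gets the treatment; the names and the function are made up.

```python
import random

def assign_matched_pairs(pairs, seed=None):
    """Within each matched pair, pick at random who receives the treatment."""
    rng = random.Random(seed)
    result = []
    for first, second in pairs:
        if rng.random() < 0.5:  # coin flip for this pair
            first, second = second, first
        result.append({"treatment": first, "control": second})
    return result

pairs = [("Ana", "Ben"), ("Cal", "Dee"), ("Eli", "Fay")]  # hypothetical matched subjects
assignments = assign_matched_pairs(pairs, seed=301)
for row in assignments:
    print(row)
```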

double-blind experiment

an experiment in which neither the experimenter nor the participants know which participants received which treatment

review board

screening committees at research institutions that evaluate all research projects relative to their potential harm to participants

informed consent

an ethical principle that research participants be told enough to enable them to choose whether they wish to participate

Confidentiality

the act of holding information in confidence, not to be released to unauthorized individuals

ethics of doing experiments with animals

reduce, refine, replace

Causation

A cause and effect relationship in which one variable controls the changes in another variable.

lurking variable

a variable that is not among the explanatory or response variables in a study but that may influence the response variable

68-95-99.7 rule

in a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean
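
The percentages in the rule can be checked with `statistics.NormalDist` from the Python standard library (Python 3.8 or later). The helper `within_k_sd` is a made-up name.

```python
from statistics import NormalDist

def within_k_sd(k, mu=0.0, sigma=1.0):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    dist = NormalDist(mu, sigma)
    return dist.cdf(mu + k * sigma) - dist.cdf(mu - k * sigma)

for k in (1, 2, 3):
    print(f"within {k} sd: {within_k_sd(k):.4f}")
# within 1 sd: 0.6827
# within 2 sd: 0.9545
# within 3 sd: 0.9973
```

The proportions do not depend on which mean and standard deviation are used, which is why the rule works for any normal model.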

standard normal distribution

the normal distribution with mean 0 and standard deviation 1; it is the distribution of z-scores, z = (x − μ)/σ
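
A brief sketch of standardizing a value and looking up its proportion on the standard normal curve, using `statistics.NormalDist` (Python 3.8 or later). The test-score numbers are made up.

```python
from statistics import NormalDist

def z_score(x, mu, sigma):
    """Number of standard deviations x lies above (+) or below (-) the mean."""
    return (x - mu) / sigma

# Hypothetical scores that follow a normal model with mean 100 and sd 15.
z = z_score(130, mu=100, sigma=15)
below = NormalDist().cdf(z)  # NormalDist() defaults to mean 0, sd 1

print(z)                # 2.0
print(f"{below:.3f}")   # about 0.977 of scores fall below 130
```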

sample mean symbol

x̅

standard deviation symbol

σ (the Greek letter sigma) for the population standard deviation; s for the sample standard deviation

population mean symbol

μ
