statistics 121 test 1

Last updated 2:16 PM on 9/26/22

103 Terms

New cards

population

the entire group of individuals that is the target of our interest; generally too big to actually measure or observe

New cards

sample

subgroup of the population which we can examine or observe, measure and collect data from

New cards

individual

single entity that is being observed

New cards

variable

characteristic measured on each individual

New cards

quantitative variable

variable whose possible values are meaningful numbers

New cards

categorical variable

variable whose possible responses are non-quantitative categories (words/labels/attributes)

New cards

measurement

value of a variable for an individual

New cards

data

measurements for a set of individuals (Goal of Statistics: convert this to useful information)

New cards

data set

data identified with contextual information (who was observed, what was measured, why the study was done), often given in a table

New cards

EDA (exploratory data analysis) goals

- organize and summarize data
- discover features, patterns and striking deviations
- interpret patterns in context
- include visual displays and numerical values

New cards

single variable pattern

distribution of a variable: summary of data one variable at a time (all the possible values and how often they occur)

New cards

process of statistical problem solving

1. Collect data
2. Summarize data
3. Interpret data

New cards

parameter

numerical fact about the variable in the population

New cards

statistic

numerical fact about the variable in the sample
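The contrast between a parameter and a statistic can be sketched in a few lines. The population values, the sample size of 4, and the seed below are made up for illustration; in practice the population is usually too large to measure, so the parameter is unknown.

```python
import random
import statistics

# Hypothetical population: heights (cm) of all 10 members of a small club.
population = [158, 162, 165, 167, 170, 171, 174, 176, 180, 187]

# Parameter: a numerical fact computed from the whole population.
parameter_mean = statistics.mean(population)

# Statistic: the same kind of fact, computed from a sample only.
rng = random.Random(121)  # arbitrary fixed seed so the draw is repeatable
sample = rng.sample(population, 4)
statistic_mean = statistics.mean(sample)

print("parameter (population mean):", parameter_mean)
print("statistic (sample mean):", statistic_mean)
```

The statistic changes from sample to sample, while the parameter stays fixed.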

New cards

convenience sampling

select individuals in the easiest possible way

New cards

volunteer response sampling

individuals select themselves

New cards

quota sampling

force the sample to meet specified quotas

New cards

simple random sample (SRS)

every possible set of a specified size has an equal chance of being selected
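A minimal sketch of drawing an SRS, assuming a hypothetical population of 20 numbered individuals and a sample size of 5. Python's `random.sample` draws without replacement, so every subset of the requested size is equally likely.

```python
import random

# Hypothetical population: ID numbers for 20 individuals.
population = list(range(1, 21))

rng = random.Random(121)  # arbitrary fixed seed so the draw is repeatable

# Draw 5 individuals without replacement; each possible set of 5
# has the same chance of being selected.
srs = rng.sample(population, 5)
print(srs)
```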

New cards

cluster sampling

a random sample of clusters is taken and all individuals in selected clusters are included in sample
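As a rough illustration, cluster sampling can be sketched as follows. The dorm names, residents, and the choice of 2 clusters are hypothetical.

```python
import random

# Hypothetical clusters: residents grouped by dorm.
clusters = {
    "dorm A": ["A1", "A2", "A3"],
    "dorm B": ["B1", "B2"],
    "dorm C": ["C1", "C2", "C3", "C4"],
    "dorm D": ["D1", "D2", "D3"],
}

rng = random.Random(121)  # arbitrary fixed seed

# Step 1: take a random sample of clusters (not of individuals).
chosen_clusters = rng.sample(sorted(clusters), 2)

# Step 2: include every individual from each selected cluster.
cluster_sample = [person for name in chosen_clusters for person in clusters[name]]

print(chosen_clusters)
print(cluster_sample)
```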

New cards

stratified random sample

select a random sample (SRS) from each stratum and combine these SRSs together
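A minimal sketch of a stratified random sample, assuming two made-up strata and an assumed sample size of 2 per stratum. Unlike cluster sampling, every stratum contributes to the sample.

```python
import random

# Hypothetical strata: students grouped by class year.
strata = {
    "freshman": ["F1", "F2", "F3", "F4", "F5", "F6"],
    "senior": ["S1", "S2", "S3", "S4"],
}

rng = random.Random(121)  # arbitrary fixed seed
per_stratum = 2           # assumed sample size taken from every stratum

# Take an SRS inside each stratum, then combine the SRSs.
stratified_sample = []
for name, members in strata.items():
    stratified_sample.extend(rng.sample(members, per_stratum))

print(stratified_sample)
```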

New cards

multi-stage sample

take a sample at each hierarchical level of the population

New cards

treatment

the condition applied to a subject in an experiment (one of the subcategories/values of the explanatory variable)

New cards

lurking variables

variables that affect both the explanatory and response variables but are not measured or included as a planned factor in the study

New cards

control

an effort to reduce the effects of lurking variables

New cards

confounding

situation in which effects of lurking variables cannot be distinguished from effects of factors

New cards

historical comparison experiments

study involving only one treatment, where treated subjects are compared to untreated subjects from some external source

New cards

unreplicated experiments

only one subject is assigned to each treatment

New cards

confounded experiments

treatment groups are handled differently in some way OTHER than the treatment

New cards

undercoverage

some individuals have no possibility of being selected

New cards

non-response

some selected individuals choose not to be in the sample because they refuse to provide information or cannot be contacted

New cards

misleading response

people lie or give inaccurate answers (often about sensitive issues)

New cards

interviewer effect

person asking questions influences responses (for in-person/phone surveys)

New cards

question order effect

the order in which questions are asked promotes certain responses

New cards

question wording

the way a question is asked leads, misleads, or confuses respondents

New cards

open questions

allow for almost unlimited possible responses (short answer), less restrictive but more difficult to analyze

New cards

closed questions

limit response options (multiple choice); easier to analyze but may be biased by the options provided. Should include an "other/unsure" option

New cards

observational studies

individuals are not assigned to treatments (they are self-selected into groups); causation cannot be concluded

New cards

experiment

study in which individuals are assigned to treatments; causation can be concluded if the experiment is valid

New cards

subject

individual to which treatment is applied

New cards

response variable

characteristic measured on each subject; outcome of interest

New cards

explanatory variable

characteristic/measurement that is used to predict or explain changes in the response variable; variable we think could help us know about the response (measured earlier or more easily); independent variable

New cards

factor

planned explanatory variable

New cards

comparison

two or more groups; controls lurking variables by including comparison treatments

New cards

randomization

randomly assign subjects to groups; neutralizes effects of lurking variables by assigning subjects to treatments using a random device
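Random assignment with a "random device" can be sketched with a shuffle. The subject labels, the two treatment names, and the equal group sizes are assumptions made for the example.

```python
import random

# Hypothetical subjects and treatments.
subjects = [f"subject_{i}" for i in range(1, 9)]

rng = random.Random(121)  # arbitrary fixed seed

# Shuffle a copy of the subject list, then split it in half so that
# chance alone decides who receives which treatment.
shuffled = subjects[:]
rng.shuffle(shuffled)
half = len(shuffled) // 2
assignment = {"drug": shuffled[:half], "placebo": shuffled[half:]}

for treatment, group in assignment.items():
    print(treatment, group)
```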

New cards

replication

two or more subjects in each group; assign more than one subject to each treatment to detect important effects

New cards

double blinding

neither subjects nor the researchers in direct contact with the subjects know which treatment is received

New cards

placebo effect

favorable response of a human subject to a placebo because of trust in the medical provider or belief that the treatment will work

New cards

diagnostic bias

diagnosis of subjects is biased by preconceived notions about the effectiveness of the treatment (person administering treatments expects certain responses)

New cards

lack of realism

realism is compromised by the conditions of the study

New cards

hawthorne effect

people in an experiment behave differently than they normally would, so results do not reflect real life

New cards

non-compliance

subjects fail to submit to the assigned treatment or refuse to follow the protocol of the experiment

New cards

principles of data ethics

• safety and well-being of the subjects must be protected
• all individuals must give their informed consent before data are collected
• individual data must be kept confidential

New cards

randomized controlled experiment

randomly assign subjects to treatments, grouped by treatment

New cards

randomized block design

randomly assign to treatments within blocks, grouped by treatment or by block
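In a randomized block design the shuffle happens separately inside each block. The sketch below assumes a hypothetical blocking variable (age group) with four subjects per block and two treatments.

```python
import random

# Hypothetical blocks: subjects grouped by a blocking variable (age group).
blocks = {
    "under 40": ["U1", "U2", "U3", "U4"],
    "40 and over": ["O1", "O2", "O3", "O4"],
}

rng = random.Random(121)  # arbitrary fixed seed

design = {}
for block, members in blocks.items():
    # Randomize within the block only, so each treatment gets
    # the same number of subjects from every block.
    shuffled = members[:]
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    design[block] = {
        "treatment A": shuffled[:half],
        "treatment B": shuffled[half:],
    }

for block, groups in design.items():
    print(block, groups)
```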

New cards

benefits of randomized block design (RBD)

- removes confounding of lurking variables
- reduces chance variation by removing variation associated with the blocking variable
- yields more precise estimates of chance variation

New cards

matched pairs

two treatments; matched individuals or two measurements per subject

New cards

three principles of experiments (applied to matched pairs)

- randomly assign two treatments to two individuals or randomize the order of treatment application to each individual
- replication = number of pairs
- compare the two treatments

New cards

analysis of distribution of quantitative data

- always plot data first
- look for an overall pattern and for striking deviations
- look at shape, center, spread of distribution
- add numerical summaries to supplement graph
- if pattern is regular, use mathematical model to describe data

New cards

symmetric and bell shaped distribution examples

blood pressure, IQ, biological factors

New cards

symmetric and bell shaped distribution

mean, median, and mode are approximately equal

New cards

right skewed distribution

concentration of data on left, tail extends to the right; mean > median

New cards

right skewed distribution examples

salary, home price, children, economic variables

New cards

left skewed distribution

concentration of data on right and the tail on the left; median > mean

New cards

left skewed distribution examples

test scores, olympic high jump

New cards

bimodal distribution

a distribution with two modes

New cards

bimodal distribution examples

speed limits, restaurant patrons

New cards

flat or uniform distribution

frequencies are roughly equal across all values on the graph

New cards

flat or uniform distribution examples

rolling a die, day of the month born

New cards

center

typical, middle value; half of data to each side

New cards

spread

consistency/inconsistency of data; look for maximum and minimum

New cards

outliers

values that are far outside most of data
- is data point miscoded?
- unusual conditions?
- should data point be excluded?

New cards

mode

most frequently occurring score, corresponds to a peak

New cards

median

the middle score in a distribution; half the scores are above it and half are below it

New cards

mean

center of gravity; the arithmetic average of a distribution, obtained by adding the scores and then dividing by the number of scores
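The three measures of center can be computed with Python's standard `statistics` module. The scores below are made up for illustration.

```python
import statistics

# Hypothetical scores.
scores = [2, 3, 3, 5, 7]

mode = statistics.mode(scores)      # most frequently occurring score
median = statistics.median(scores)  # middle score of the sorted data
mean = statistics.mean(scores)      # sum of scores / number of scores

print(mode, median, mean)  # 3 3 4
```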

New cards

mean vs median

- construct graph to evaluate skewness and outliers
- use median if distribution is markedly skewed or outliers are present
- use mean if distribution is roughly symmetric
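A small numeric illustration of why the median is preferred when outliers are present. The salary figures (in thousands) are hypothetical.

```python
import statistics

# Hypothetical salaries in thousands of dollars.
salaries = [30, 32, 35, 38, 40]
with_outlier = salaries + [500]  # add one extreme value

print(statistics.mean(salaries), statistics.median(salaries))          # 35 35
print(statistics.mean(with_outlier), statistics.median(with_outlier))  # 112.5 36.5
```

One extreme value pulls the mean from 35 to 112.5, while the median moves only from 35 to 36.5.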

New cards

range

maximum - minimum

New cards

interquartile range (IQR)

the difference between the third and first quartiles (Q3 - Q1); the spread of the middle 50% of the data

New cards

standard deviation

roughly the typical distance of values from the mean (the square root of the average squared deviation)
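The description above is informal. A sketch of the usual sample formula, s = sqrt(sum of (x - mean)^2 / (n - 1)), is shown below; the n - 1 divisor is an assumption about which version the course uses, and the data are made up.

```python
import math
import statistics

# Hypothetical data set.
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)
mean = sum(data) / n

# Square each deviation from the mean, average with n - 1,
# then take the square root to return to the original units.
squared_deviations = [(x - mean) ** 2 for x in data]
s = math.sqrt(sum(squared_deviations) / (n - 1))

print(s)
print(statistics.stdev(data))  # library version of the same sample formula
```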

New cards

first quartile (Q1)

a number for which 25% of the data is less than that number; same as the median of the data which are less than the overall median

New cards

second quartile (Q2)

median

New cards

third quartile (Q3)

a number for which 75% of the data is less than that number; same as the median of the part of the data which is greater than the median

New cards

5 number summary vs 2 number summary

use the 5 number summary for skewed distributions, and the 2 number summary (mean and standard deviation) for roughly symmetric distributions

New cards

5 number summary

minimum, Q1, median, Q3, maximum
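A sketch of the five number summary that follows the quartile definitions on these cards (Q1 and Q3 as the medians of the lower and upper halves of the data). The helper name `five_number_summary` and the data are hypothetical, and statistical software may use slightly different quartile conventions.

```python
import statistics

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]          # values below the median position
    upper = data[half + n % 2:]  # skips the middle value when n is odd
    return (
        data[0],
        statistics.median(lower),
        statistics.median(data),
        statistics.median(upper),
        data[-1],
    )

data = [1, 3, 4, 6, 8, 9, 12]
minimum, q1, median, q3, maximum = five_number_summary(data)

iqr = q3 - q1                   # interquartile range
data_range = maximum - minimum  # range

print((minimum, q1, median, q3, maximum))  # (1, 3, 6, 9, 12)
print(iqr, data_range)                     # 6 11
```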

New cards

random phenomenon

individual outcome unpredictable, but outcomes from large number of repetitions follow regular pattern

New cards

sample space

the set of all possible outcomes

New cards

event

a collection of possible outcomes

New cards

probability of an outcome

The proportion of times that an outcome occurs in many, many repetitions of the random phenomenon

New cards

probability rules

- 0 ≤ P(A) ≤ 1: the probability of any event A is between 0 and 1

New cards

theoretical probability

number of favorable outcomes divided by total number of possible outcomes
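For equally likely outcomes, the definition is a simple count. The example event (rolling an even number on a fair six-sided die) is chosen for illustration.

```python
from fractions import Fraction

# Sample space for one roll of a fair six-sided die.
sample_space = [1, 2, 3, 4, 5, 6]

# Event: the roll is even.
favorable = [outcome for outcome in sample_space if outcome % 2 == 0]

# Favorable outcomes divided by total possible outcomes.
p_even = Fraction(len(favorable), len(sample_space))
print(p_even)  # 1/2
```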

New cards

empirical probability

number of times the outcome occurred divided by the total number of repetitions

New cards

law of large numbers

As the number of repetitions of a probability experiment increases, the proportion with which a certain outcome is observed gets closer to the theoretical probability of the outcome
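A small simulation can illustrate the law of large numbers. It assumes a fair six-sided die, tracks the outcome "roll a six", and uses an arbitrary seed and arbitrary checkpoints.

```python
import random

rng = random.Random(121)  # arbitrary fixed seed
theoretical = 1 / 6       # P(six) on a fair die

checkpoints = (10, 100, 1_000, 10_000, 100_000)
proportions = {}
sixes = 0

for roll in range(1, 100_001):
    if rng.randint(1, 6) == 6:
        sixes += 1
    if roll in checkpoints:
        # Empirical probability so far: occurrences / repetitions.
        proportions[roll] = sixes / roll

for n, p in proportions.items():
    print(f"{n:>7} rolls: empirical = {p:.4f}, theoretical = {theoretical:.4f}")
```

The empirical proportions for small numbers of rolls can be far from 1/6, while the later checkpoints tend to settle close to it.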

New cards

probability

the long-run relative frequency with which an event will occur

New cards

probability distribution

all possible events and their associated probabilities
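As an illustration, the probability distribution for the sum of two fair dice (an example chosen here, not taken from the cards) can be built by counting equally likely outcomes.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Count how many of the 36 equally likely (die1, die2) pairs give each sum.
counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))

# Convert counts to probabilities.
distribution = {total: Fraction(c, 36) for total, c in sorted(counts.items())}

for total, prob in distribution.items():
    print(total, prob)

# The probabilities across all possible values add up to 1.
print(sum(distribution.values()))  # 1
```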

New cards

random variable

a variable whose value is a numerical outcome of a random phenomenon

New cards

continuous random variable

a random variable that can take on any value in an interval; its possible values cannot all be listed