descriptive statistics
the process of describing/summarizing sample/population data collected
inferential statistics
the process of making predictions (inferences) about population parameters using sample data
population
the entire group of subjects of interest in a statistical study
sample
a subset of the population of interest from which data are actually collected
variables
characteristics that vary among subjects
quantitative
numerical values that represent different magnitudes of the variable
qualitative
values that are categorical without a specific order or magnitude
nominal data
measurements that are categorical/qualitative and unordered (no category is greater than or smaller than any other)
ordinal data
measurements that are categorical/qualitative and ordered (however, there is no particular defined distance between the levels of data)
discrete variables
values that form a set of separate numbers
continuous variables
values that can form an infinite continuum of possible real number values
n
sample size
simple random sample
method for creating a sample population in which each possible sample within that population has the same probability of being selected
sampling frame
list of all subjects in a population
random numbers
numbers generated by a computer to facilitate the selection of random samples in SRS
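A minimal sketch of drawing a simple random sample with computer-generated random numbers; the sampling frame of 100 subjects and the sample size of 10 are hypothetical:

```python
import random

# Hypothetical sampling frame: a list of all subjects in the population
sampling_frame = [f"subject_{i}" for i in range(1, 101)]  # 100 subjects

n = 10  # desired sample size

# random.sample gives each possible sample of size n the same chance of being
# selected, which is the defining property of a simple random sample
srs = random.sample(sampling_frame, n)
print(srs)
```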
sample survey
method of collecting data by posing questions to the subjects in a sample
experiment
a study in which one or more conditions are systematically changed or assigned to subjects and the resulting outcome is measured
treatments
different conditions within an experiment
observational studies
study of a population/sample without any manipulation of conditions
sampling error
how much a statistic differs from the parameter it predicts, because samples vary from one another and from the larger population
sampling bias
occurs when a sample is collected in such a way that some members of the intended population have a lower or higher probability of being sampled than others, which can distort the data collected
nonprobability sampling
methods for which it is not possible to determine the probabilities of the possible samples (e.g., volunteer sampling)
selection bias
bias introduced by the selection of individuals, groups, or data for analysis in such a way that proper randomization is not achieved, thereby failing to ensure that the sample obtained is representative of the population intended to be analyzed
undercoverage
when the sample selected for a study lacks representation from some groups in the population
response bias
wide range of tendencies for participants to respond inaccurately or falsely to questions
nonresponse bias
inability to gather data from certain subjects within a sample, either because they refuse to participate or because they are unreachable
systematic random sampling
process of selecting subjects by choosing one subject at random from the first k names in a sampling frame and then selecting every kth subject listed after it, where k is roughly the population size divided by the sample size (see the sketch below)
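A rough sketch of systematic random sampling, assuming a hypothetical frame of 1,000 subjects and a target sample of 100, so the skip interval k is 10:

```python
import random

# Hypothetical sampling frame of N = 1000 subjects; target sample size n = 100
sampling_frame = list(range(1, 1001))
n = 100
k = len(sampling_frame) // n              # skip interval: k = N / n = 10

start = random.randint(0, k - 1)          # random start among the first k names
systematic_sample = sampling_frame[start::k]  # then every kth subject after it
print(len(systematic_sample))             # 100
```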
stratified random sampling
process of selecting subjects by dividing the population into separate groups called strata and then selecting a simple random sample from each stratum
proportional stratified random sampling
occurs when the sampled strata proportions are the same as those in the entire population
disproportional stratified random sampling
occurs when the sampled strata proportions differ from the population proportions
cluster sampling
process for selecting subjects in which the population is divided into a large number of clusters and a simple random sample of the clusters is selected
multistage sampling
process for selecting samples in which multiple sampling methods are utilized
frequency distribution
list of the possible values of a variable together with the number of observations at each value
relative frequency distribution
a frequency distribution that reports proportions or percentages instead of counts
histogram
frequency distribution for quantitative variables segmented by intervals
stem-and-leaf plots
observations presented with their leading digit (stem) and final digit (leaf)
population distribution
frequency distributions for populations
sample data distribution
frequency distribution for samples
symmetrical distribution types
U-shaped and bell-shaped
skewed distributions
when the extreme ends of data frequencies form “tails” that elongate the shape
mean (average)
sum of the observations divided by the # of observations
y-bar
sample mean
properties of the mean
(1) highly influenced by outliers
(2) pulled in the direction of the longer tail of a skewed distribution
(3) the “point of balance” on a number line when an equal weight is at each observation point
weighted average
the combined mean of two data sets with sample sizes n1 and n2 and sample means y1 and y2: (n1y1 + n2y2) / (n1 + n2)
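A quick numeric check of the combined-average formula, with hypothetical sample sizes and means:

```python
# Hypothetical samples: n1 = 20 observations with mean 4.0, n2 = 30 with mean 6.0
n1, ybar1 = 20, 4.0
n2, ybar2 = 30, 6.0

combined_mean = (n1 * ybar1 + n2 * ybar2) / (n1 + n2)
print(combined_mean)  # (80 + 180) / 50 = 5.2
```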
median
the observation that falls in the middle of the ordered sample
properties of the median
(1) valid for quantitative and ordinal data
(2) for symmetric distributions, median and mean are the same
(3) in skewed distributions, it lies less far out in the tail than the mean does
(4) insensitive to the distances of the observations from the middle (only uses order of the data)
(5) not affected by outliers
best case use for mean
highly discrete data (only a few distinct values); distributions that are close to symmetric or only mildly skewed
best case use for median
highly skewed data
mode
the value that occurs most frequently
bimodal distribution
when two distinct clusters of data occur within a data distribution
range
the difference between the largest and smallest observations within a data set
standard deviation
a measure of the amount of variation of the values of a variable about its mean; found by taking the square root of the sum of squared deviations from the sample mean y-bar divided by n - 1; denoted by s
variance
standard deviation squared; denoted by s-squared
sum of squares
the sum of all calculated deviations squared; the larger the deviations, the larger the sum of squares and the larger s tends to be
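A small sketch that computes the sum of squares, variance, and standard deviation from the definitions above for a made-up sample, and compares the result with Python's standard library:

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]            # made-up sample
n = len(data)
ybar = sum(data) / n                       # sample mean (y-bar) = 5.0

sum_of_squares = sum((y - ybar) ** 2 for y in data)   # 32.0
variance = sum_of_squares / (n - 1)        # s squared
s = variance ** 0.5                        # sample standard deviation

print(ybar, sum_of_squares, variance, round(s, 3))
print(statistics.stdev(data))              # same s, from the standard library
```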
properties of standard deviation
(1) s always greater than or equal to 0
(2) s = 0 when all the observations have the same value
(3) the greater the variability about the mean, the larger is the value of s
(4) if every observation is multiplied by a constant c, s is multiplied by |c| (rescaling the data rescales s)
percentiles
the pth percentile is the point such that p% of the observations fall at or below it and (100 - p)% fall above it
lower quartile
25th percentile
median (second quartile)
50th percentile
upper quartile
75th percentile
IQR (interquartile range)
difference between the upper and lower quartiles; describes the spread of the middle half of the observations (increases as variability increases); not as sensitive as the standard deviation is to outliers; for bell-shaped distributions, the IQR is approximately (4/3)s
boxplot
graph display that captures the center (median) and variability (quartiles); the whiskers extend to the minimum and maximum values that are not outliers, and outliers are marked separately
outlier
falls more than 1.5(IQR) above the upper quartile or more than 1.5(IQR) below the lower quartile
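A sketch of the quartile, IQR, and 1.5(IQR) outlier rule for a made-up sample; note that software packages use slightly different quartile conventions, so exact quartile values can vary:

```python
import statistics

data = [1, 3, 4, 5, 5, 6, 7, 8, 9, 30]        # made-up sample with one large value

q1, q2, q3 = statistics.quantiles(data, n=4)  # lower quartile, median, upper quartile
iqr = q3 - q1

lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [y for y in data if y < lower_fence or y > upper_fence]

print(q1, q2, q3, iqr)
print(outliers)  # 30 is flagged as an outlier here
```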
z-score
the # of standard deviations that an observation falls from the mean
association
exists between two variables if certain values of one variable tend to go with certain values of the other
bivariate analysis
an analysis of association between two variables (usually explanatory and response variables)
explanatory variable
the variable that defines groups (independent variable)
response variable
the outcome variable (dependent variable)
contingency table
displays the # of subjects observed at different combinations of possible outcomes for the two variables; illustrates contingency between explanatory variable and outcome
scatterplot
graph that plots two quantitative variables against each other, using one point to represent each observation
correlation
describes the strength of association between variables in terms of how closely the data follows a straight-line trend
regression analysis
analysis method that provides a straight-line formula for predicting the value of y given a value of x
μ
population mean; average of the observations for the entire population
σ
population standard deviation; describes the variability of those observations about the population mean
probability
the proportion of times that the outcome would occur in a very long sequence of observations
probability of A not occurring
P(not A) = 1 - P(A)
probability of A or B
P(A or B) = P(A) + P(B) - P(A and B); when A and B are disjoint (cannot occur together), this reduces to P(A) + P(B)
probability of A and B
P(A and B) = P(A) x P(B given A); contains a conditional probability
probability of A and B (independent)
P(A and B) = P(A) x P(B)
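A small enumeration over two fair dice that checks the probability rules above; the events A and B are chosen only for illustration:

```python
from fractions import Fraction
from itertools import product

# Sample space for two fair dice; every one of the 36 outcomes is equally likely
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Exact probability of an event over the equally likely outcomes."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

def A(o):  # event A: the first die shows 6
    return o[0] == 6

def B(o):  # event B: the two dice sum to at least 10
    return o[0] + o[1] >= 10

print(prob(lambda o: not A(o)) == 1 - prob(A))  # complement rule: P(not A) = 1 - P(A)

# general "or" rule: P(A or B) = P(A) + P(B) - P(A and B)
print(prob(lambda o: A(o) or B(o))
      == prob(A) + prob(B) - prob(lambda o: A(o) and B(o)))

# multiplication rule: P(A and B) = P(A) x P(B given A)
p_b_given_a = prob(lambda o: A(o) and B(o)) / prob(A)
print(prob(lambda o: A(o) and B(o)) == prob(A) * p_b_given_a)
```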
histogram
graphic display for probability distribution where the probability of a value is represented by the height of a bar
mean of a probability distribution (discrete)
the sum of each possible value of the variable times its probability of occurrence
E(y)
expected value of y; also known as the mean of a probability distribution
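A minimal example of the expected value E(y) as the sum of y times P(y), for a hypothetical discrete distribution:

```python
# Hypothetical discrete probability distribution for y
distribution = {0: 0.1, 1: 0.2, 2: 0.4, 3: 0.3}

assert abs(sum(distribution.values()) - 1.0) < 1e-9  # probabilities sum to 1

expected_value = sum(y * p for y, p in distribution.items())
print(expected_value)  # E(y) = 0*0.1 + 1*0.2 + 2*0.4 + 3*0.3 = 1.9
```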
normal probability distribution
symmetric, bell-shaped, and characterized by its mean (μ) and standard deviation (σ); 0.68 of observations fall within 1 standard deviation, 0.95 within 2 SDs, and 0.997 within 3 SDs
Empirical Rule
for bell-shaped histograms, about 68% of the data fall within 1 SD of the mean, 95% falls within 2 SDs of the mean, and 99.7% of data falls within 3 SDs of the mean
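A quick check of the Empirical Rule percentages against exact standard normal probabilities, using the identity P(|Z| <= k) = erf(k / sqrt(2)) from the standard library:

```python
from math import erf, sqrt

# probability that a normal observation falls within k standard deviations of the mean
for k in (1, 2, 3):
    within_k = erf(k / sqrt(2))
    print(k, round(within_k, 4))
# prints roughly 0.6827, 0.9545, 0.9973 -- the 68-95-99.7 percentages
```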
z-score
represents the # of SDs that observed value y falls from the mean: z = (y - μ) / σ
standard normal distribution
a normal distribution with the mean μ = 0 and the SD σ = 1
covariance
the average of the cross products of deviations from the population means for two jointly distributed random variables, weighted by the joint probabilities for pairs of values
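A sketch of covariance as the average cross product of deviations from the means, using made-up paired data; dividing by n gives the population version and by n - 1 the sample version:

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]   # made-up paired data
y = [2.0, 1.0, 4.0, 3.0, 5.0]

n = len(x)
x_mean = sum(x) / n
y_mean = sum(y) / n

cross_products = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
cov_population = cross_products / n        # divide by n for the population version
cov_sample = cross_products / (n - 1)      # divide by n - 1 for the sample version

print(cov_population, cov_sample)          # 1.6 and 2.0 for this data
```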
sampling distribution of a statistic
the probability distribution that specifies probabilities for the possible values that statistic can take (i.e. sample proportion or sample mean)
sample mean y in relation to population mean μ
fluctuates from sample to sample (the sample mean y varies in value from sample to sample, with its sampling distribution centered at μ)
standard error
the standard deviation of the sampling distribution of the sample mean y; describes how y varies from sample to sample; denoted by σ(y) and, for random sampling, equal to σ/√n
as n increases, the standard error
decreases
as n increases, the sampling distribution gets
narrower (the sample statistic, such as a sample proportion, tends to fall closer to the population parameter; there is less probability of an extreme result)
Central Limit Theorem
for random sampling with a large size n, the sampling distribution of the sample mean y is approximately a normal distribution; for most cases, n of 30 is sufficient
implication of Central Limit Theorem
the bell shape of the sampling distribution applies no matter the shape of the population distribution
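A simulation sketch of the Central Limit Theorem: repeated samples of size 30 are drawn from a skewed (exponential) population with mean 1 and σ = 1, and the resulting sample means are roughly bell-shaped with standard deviation near σ/√n. The population choice and seed are arbitrary:

```python
import random
import statistics
from math import sqrt

random.seed(0)

n = 30             # sample size
repeats = 10_000   # number of simulated random samples

# each entry is the mean of one random sample of size n from the skewed population
sample_means = [
    statistics.fmean([random.expovariate(1.0) for _ in range(n)])
    for _ in range(repeats)
]

print(round(statistics.fmean(sample_means), 3))   # close to the population mean, 1
print(round(statistics.stdev(sample_means), 3))   # close to sigma / sqrt(n)
print(round(1 / sqrt(n), 3))                      # theoretical standard error, about 0.183
# A histogram of sample_means would look roughly bell-shaped even though the population is skewed.
```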