GLBL 2121 Vocabulary

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/92

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

93 Terms

1
New cards

descriptive statistics

the process of describing/summarizing sample/population data collected

2
New cards

inferential statistics

the process of making predictions about population parameters using the data

3
New cards

population

the entire group of interest in regards to a statistical study

4
New cards

sample

a section of the entire group of interest of study

5
New cards

variables

characteristics that vary among subjects

6
New cards

quantitative

numerical values that represent different magnitudes of the variable

7
New cards

qualitative

values that are categorical without a specific order or magnitude

8
New cards

nominal data

measurements that are categorical/qualitative and unordered (no category is greater than or smaller than any other)

9
New cards

ordinal data

measurements that are categorical/qualitative and ordered (however, there is no particular defined distance between the levels of data)

10
New cards

discrete variables

values that form a set of separate numbers

11
New cards

continuous variables

values that can form an infinite continuum of possible real number values

12
New cards

n

sample size

13
New cards

simple random sample

method for creating a sample population in which each possible sample within that population has the same probability of being selected

14
New cards

sampling frame

list of all subjects in a population

15
New cards

random numbers

numbers generated by a computer to facilitate the selection of random samples in SRS

16
New cards

sample survey

method of collecting data involving supplying samples with questions

17
New cards

experiment

data gathered from the process of systematically changing a/certain condition(s) and measuring the output

18
New cards

treatments

different conditions within an experiment

19
New cards

observational studies

study of a population/sample without any manipulation of conditions

20
New cards

sampling error

how much the statistic differs from the parameter it predicts because samples are variable from the larger population

21
New cards

sampling bias

a sample is collected in such a way that some members of the intended population have a lower or higher sampling probability and may thus have adverse effects on the data collected

22
New cards

nonprobability sampling

methods for which it is not possible to determine the probabilities of the possible samples (i.e. volunteer sampling)

23
New cards

selection bias

bias introduced by the selection of individuals, groups, or data for analysis in such a way that proper randomization is not achieved, thereby failing to ensure that the sample obtained is representative of the population intended to be analyzed

24
New cards

undercoverage

when the sample selected for a study lacks representation from some groups in the population

25
New cards

response bias

wide range of tendencies for participants to respond inaccurately or falsely to questions

26
New cards

nonresponse bias

inability to gather data from certain subjects within a sample, either because of their refusal to participate or they are unreachable

27
New cards

systematic random sampling

process of selecting subjects by choosing a subject at random from the first nth name within a sampling frame and then selecting every nth subject listed after that one.

28
New cards

stratified random sampling

process of selecting subjects by dividing the population into separate groups called strata and then selecting a simple random sample from each stratum

29
New cards

proportional stratified random sampling

occurs when the sampled strate proprtions are the same as those in the entire population

30
New cards

disproportional stratified random sampling

occurs when the sampled strate proportions differ from the population proportions

31
New cards

cluster sampling

process for selecting subjects in which the population is divided into large number of clusters and a simple random sample is selected from amongst the clusters

32
New cards

multistage sampling

process for selecting samples by which mulitple sampling methods are utilized

33
New cards

frequency distribution

list of possible values for a variable with # of observations for that variable

34
New cards

relative frequency distribution

frequency distribution but with percentages/proportions

35
New cards

histogram

frequency distribution for quantitative variables segmented by intervals

36
New cards

stem-and-leaf plots

observations presented with their leading digit (stem) and final digit (leaf)

37
New cards

population distribution

frequency distributions for populations

38
New cards

sample data distribution

frequency distribution for samples

39
New cards

symmetrical distribution types

U-shaped and bell-shaped

40
New cards

skewed distributions

when the extreme ends of data frequencies form “tails” that elongate the shape

41
New cards

mean (average)

sum of the observations divided by the # of observations

42
New cards

y-bar

sample mean

43
New cards

properties of the mean

(1) highly influenced by outliers

(2) influenced by skewed distributions

(3) the “point of balance” on a number line when an equal weight is at each observation point

44
New cards

weighted average

where two sets of data with sample sizes n1 and n2 with sample means y1 and y2 are combined: (n1y1 + n2y2) / (n1 + n2)

45
New cards

median

the observation that falls in the middle of the ordered sample

46
New cards

properties of the median

(1) valid for quantitative and ordinal data

(2) for symmetric distributions, median and mean are the same

(3) in skewed distributions, it lies less farther out along the tail than the mean

(4) insensitive to the distances of the observations from the middle (only uses order of the data)

(5) not affected by outliers

47
New cards

best case use for mean

highly discrete data; distribution is close to symmetric or only mildly skewed

48
New cards

best case use for median

highly skewed data

49
New cards

mode

the values that occurs most frequently

50
New cards

bimodal distribution

when two distinct clusters of data occur within a data distribution

51
New cards

range

the difference between the largest and smallest observations within a data set

52
New cards

standard deviation

a measure of the amount of variation of the values of a variable about its mean; found by computing the squared sum of squared deviations from the y-bar mean and dividing by the sample size n—1; denoted by s

53
New cards

variance

standard deviation squared; denoted by s-squared

54
New cards

sum of squares

the sum of all calculated deviations squared; the larger the deviations, the larger the sum of squares and the larger s tends to be

55
New cards

properties of standard deviation

(1) s always greater than or equal to 0

(2) s = 0 when all the observations have the same value

(3) the greater the variability about the mean, the larger is the value of s

(4) standard deviation can be “rescaled” by multiples

56
New cards

percentiles

indicate the percentage of observations that fall below or at that point; the percentage of data falling above it = (100 - p)%

57
New cards

lower quartile

25th percentile

58
New cards

quantile

50th percentile

59
New cards

upper quartile

75th percentile

60
New cards

IQR (interquartile range)

different between the upper and lower quartiles; describes the spread of the middle half of the observations (increases as variability increases); not as sensitive as standard deviation is to outliers; for bell-shaped distributions, the IQR is approximately (4/3)s

61
New cards

boxplot

graph display that captures center (median) and variability (quartiles); extends to minimum and maximum but does not encapsulate outliers

62
New cards

outlier

falls more than 1.5(IQR) above the upper quartile and more than 1.5 IQR below the lower quartile

63
New cards

z-score

the # of standard deviations that an observation falls from the mean

64
New cards

association

exists between two variables if certain values of one variable tend to go with certain values of the other

65
New cards

bivariate analysis

an analysis of association between two variables (usually explanatory and response variables)

66
New cards

explanatory variable

the variable that defines groups (independent variable)

67
New cards

response variable

the outcome variable (dependent variable)

68
New cards

contingency table

displays the # of subjects observed at different combinations of possible outcomes for the two variables; illustrates contingency between explanatory variable and outcome

69
New cards

scatterplot

graph that plots data between bivariate quantitative variables using one dot to represent one occurence of that outcome

70
New cards

correlation

describes the strength of association between variables in terms of how closely the data follows a straight-line trend

71
New cards

regression analysis

analysis method that provides a straight-line formula for predicing the value of y given a value of x

72
New cards

μ

population mean; average of the observations for the entire population

73
New cards

σ

population standard deviation; describes the variability of those observations about the population mean

74
New cards

probability

the proportion of times that the outcome would occur in a very long sequence of observations

75
New cards

probability of A not occurring

P (not A) = 1-P(A)

76
New cards

probability of A or B

P (A or B) = P(A) + P(B)

77
New cards

probability of A and B

P (A and B) = P(A) x P(B given A); contains a conditional probability

78
New cards

probability of A and B (independent)

P(A and B) = P(A) x P(B)

79
New cards

histogram

graphic display for probability distribution where the probability of a value is represented by the height of a bar

80
New cards

mean of a probability distribution (discrete)

sum of total observations times their probability of occurence

81
New cards

E(y)

expected value of y; also known as the mean of a probability distribution

82
New cards

normal probability distribution

symmetric, bell-shaped, and characterized by its mean (μ) and standard deviation (σ); 0.68 of observations fall within 1 standard deviation, 0.95 within 2 SDs, and 0.997 within 3 SDs

83
New cards

Empirical Rule

for bell-shaped histograms, about 68% of the data fall within 1 SD of the mean, 95% falls within 2 SDs of the mean, and 99.7% of data falls within 3 SDs of the mean

84
New cards

z-score

represents the # of SDs that observed value y falls from the mean

85
New cards

standard normal distribution

a normal distribution with the mean μ = 0 and the SD σ = 1

86
New cards

covariance

represents the average of the cross products about the population means between bivariate variable distributions (joint probabilities for pairs of random variables)

87
New cards

sampling distribution of a statistic

the probability distribution that specifies probabilities for the possible values that statistic can take (i.e. sample proportion or sample mean)

88
New cards

sample mean y in relation to population mean μ

fluctuates (sample mean y varies in value from sample to sample)

89
New cards

standard error

the standard deviation of the sampling distribution y; describes how sample mean y varies from sample to sample; denoted by σ(y)

90
New cards

as n increases, the standard error

decreases

91
New cards

as n increases, the sampling distribution gets

narrower (sample proportion falls closer to the population proportion; less probability of getting any other wild answer)

92
New cards

Central Limit Theorem

for random sampling with a large size n, the sampling distribution of the sample mean y is approximately a normal distribution; for most cases, n of 30 is sufficient

93
New cards

implication of Central Limit Theorem

the bell shape of the sampling distribution applies no matter the same of the population distribution