Stats 107 FINAL

5.0(1)
studied byStudied by 5 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/102

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

103 Terms

1
New cards

individual/unit

the objects described by a set of data, may be people but also be may animals or things

2
New cards

variable

any characteristic of an individual, can take different values for different individuals

3
New cards

categorical variable

places an individual into one of several groups or categories

4
New cards

numerical/quantitative variables

takes numerical values for which arithmetic operations such as adding and averaging make sense.

5
New cards

population

in a statistical study is the entire group of individuals about which we want information

6
New cards

sample

part of the population from which we actually collect information and is used to draw conclusions about the whole

7
New cards

sample survey

survey some group of individuals by studying only some of its members, selected not because they are of special interest but because they represent the larger group

kind of observational study

8
New cards

response variable

a variable that measures an outcome or result of a study

9
New cards

observational study

observes individuals and measures variables of interest but does not intervene the responses. purpose is to describe some group or situation

10
New cards

census

a sample survey that attempts to include the entire population in the sample

11
New cards

experiment

deliberately imposes some treatment on individuals in order to observe their response. purpose is to study whether the treatment causes a change in the the response

12
New cards

biased

systematically favors certian outcomes

13
New cards

convenience sampling

selection of whichever individuals are easiest to reach

14
New cards

voluntary response sample

chooses itself by responding to general appeal. write-in or call-in opinion polls are examples of this

15
New cards

convenience sampling and voluntary response sample are often…

biased

16
New cards

simple random sampling (SRS)

of size “n” consists of “n” individuals from the population chosen in such a way that every set of “n” individuals has an equal chance to be the sample actually selected

17
New cards

a SRS gives…

each individual an equal chance to be chosen (avoiding bias0 and every possible sample an equal chance to be chosen

18
New cards

what are the different ways to obtain a simple random sample?

names in a hat, random integer generator, table of random digits

19
New cards

what are two steps to obtain a simple random sample?

1) assign a numerical label to every individual in the population

2) use random digits to select labels at random

20
New cards

parameter

a number that describes the population. it is a fixed number, but in practice we don’t know the actual value of this number because we cannot access the entire population

21
New cards

statistic

a number that describes a sample. the value can be determined and is known once we have taken a sample, but its value can change from sample to sample. often used to estimate an unknown parameter

22
New cards

variability

describes how the values of the sample statistic will vary when we take many samples

23
New cards

what does large variabilty mean?

the result of the sampling is not repeatable

24
New cards

what does a good sampling method have?

both small bias and small variability

25
New cards

to reduce bias…

use random sampling

26
New cards

to reduce the variability of an SRS…

use a larger sample

27
New cards

large samples…

almost always give an estimate that is close to the truth

28
New cards

margin of error

represents the natural sampling variability

29
New cards

to cut the margin of error in half…

we must use a sample four times as large

30
New cards

confidence statement

has two parts: a margin of error and a level of confidence. the margin of error says how close the sample statistic lies to the population parameter. the level of confidence says what percentage of all possible samples satisfy the margin of error

31
New cards

the conclusion of a confidence statement…

always applies to the population, not to the sample

32
New cards

our conclusion about the population is…

never completely certain

33
New cards

a sample survey can choose…

to use a confidence level other than 95%

34
New cards

a smaller margin of error with the same confidence?

take a larger sample

35
New cards

the variability of a statistic from a random sample is essentially…

unaffected by the size of the population as long as the population is at least 20 times larger than the sample

36
New cards

confidence statement

to say how accurate our conclusions about the population are

37
New cards

sampling errors

errors caused by the act of taking a sample. they cause sample results to be different from the results of a census of the population

38
New cards

random sampling error

the variation due to chance in choosing a random sample. the margin of error in a confidence statement includes only random sampling error

39
New cards

nonsampling errors

errors not related to the act of selecting a sample from the population. they can be present even in a census

40
New cards

undercoverage

occurs when some groups in the population have no chance of being included in the sample

41
New cards

sampling frame

list of individuals from which we will draw our sample

42
New cards

nonresponse

the failure to obtain data from an individual selected for a sample. most happens because subjects can’t be contacted or because some subjects who are contacted refuse to cooperate

43
New cards

what is the most serious problem facing sample surveys?

nonresponse

44
New cards

processing errors

mistakes in mechanical tasks such as doing arithmetic or entering responses into a computer

45
New cards

multiple inclusions

occur if some population members appear multiple times in the sampling frame so that they have a higher chance of being sampled

46
New cards

erroneous inclusions

can occur if the frame includes units that are not in the population of interest so that the invalid units have a chance of being in the sample

47
New cards

probability sample

a sample chosen by chance. we must know what samples are possible and what chance, or probability, each possible sample has

48
New cards

frame errors

can occur because the sampling frame is not an accurate representation of the population

49
New cards

response errors

incorrect answers by respondents

50
New cards

what are examples of nonsampling errors?

processing errors, response errors,

51
New cards

explanatory variable

a variable that we think explains or causes changes in the response variable

52
New cards

subjects

individuals studied in an experiment

53
New cards

treatment

any specific experimental condition applied to the subjects

54
New cards

lurking variable

a variable that has an important effect on the relationship among the variables in a study but is not one of the explanatory variables

55
New cards

confounded

when two variables’ effects on a response cannot be distinguished from each other. may be explanatory variables or lurking variables

56
New cards

placebo

dummy treatment with no active ingredients

57
New cards

double-blind

an experiment in which neither the subjects nor the physicians recording the symptoms know which treatment was received

58
New cards

randomized comparitive experiment

one that compares just two treatments

59
New cards

control group

placebo group, comparing the treatment and control group allows us to control the effects of lurking variables

60
New cards

what are the basic principles of statistical design of experiments?

control the effects of lurking variables, randomize, and use enough subjects

61
New cards

statistically significant

an observed effect of a size that would rarely occur by chance

62
New cards

a good comparative study…

measures and adjusts for confounding variables

63
New cards

discrete data

data that is counted

64
New cards

continuous data

unit of measure

65
New cards

qualitative

descriptive

66
New cards

completely randomized experimental design

all the experimental subjects are allocated at random among all treatments

67
New cards

matched pairs design

compares just two treatment. choose pair of subjects that are as closely matched as possible. assign one of the treatments to each subject in a pair by tossing a coin or reading odd and even digits

68
New cards

block

group of experimental subjects that are known before the experiment to be similar in some way that is expected to affect the response to the treatments

69
New cards

block design

the random assignment of subjects to treatments is carried out separately within each block

70
New cards

Hawthorne Effect

the tendency of some people to work harder and perform better when they are participants in an experiment. individuals may change their behavior due to the attention they are receiving from researchers rather than because of any manipulation of independent variables

71
New cards

what is the most common weaknesss in experiments?

we can’t generalize the conclusions widely

72
New cards

measure

a property of a person or thing when we assign a value to represent the property

73
New cards

instrument

used to make a measurement

74
New cards

units

used to record the measurements

75
New cards

valid

measure of property if it is relevant or appropriate as a representation of that property

76
New cards

rate

(a fraction, proportion, or percentage) at which something occurs is a more valid measure than a simple count of occurrences

77
New cards

predictive validity

it can be used to predict success on tasks that are related to the property measured

78
New cards

reliable

if the random error is small

79
New cards

random error

repeated measurements on the same individual give different results

80
New cards

a reliable measurement process has what variance?

a small variance

81
New cards

the average of several repeated measurements of the same individual is more or less reliable?

more reliable (less variable) than a single measurement

82
New cards

what to look for when determining if the numbers make sense?

missing information, inconsistencies, incorrect arithmetic, implausible, too regular or agree too well, hidden agenda

83
New cards

distribution of a variable

tells us what values it takes and how often it takes these values

84
New cards

pie chart

show how a whole is divided into parts

85
New cards

bar graph

all bars are the same width and do not touch each other

can also compare the size of the categories that are not parts of one whole

86
New cards

what does the height of the bars in a bar graph represent?

amount of people in that category

87
New cards

pie charts and bar graphs show the distribution of?

categorical variables

88
New cards

pictogram

a bar graph in which pictures replace the bars

89
New cards

line graph

used to display how a quantitative variable changes over time

time on x-axis

90
New cards

trend

a long-term upward or downward movement over time

91
New cards

seasonal variation

a pattern that repeats itself at known regular intervals of time

92
New cards

seasonally adjusted

the expected seasonal variation is removed before the data are published

93
New cards

linear correlation coefficient

r, it measures the strength and direction of linear relationship between x and y

94
New cards

R squared

coefficient of determination, measures the explanatory ability of the line of best fit

95
New cards

skewed right

bump is on the left and it tapers off to the right

96
New cards

skewed left

bump is on the right and it tapers off to the left

97
New cards

symmetrical

peak in the middle

98
New cards

uniform

bars are the same height

99
New cards

to describe the overall pattern of a distribution:

describe the center and the variability and describe the shape of the histogram with a few words

100
New cards

histogram

bars touch each other, quantitative variables