Unit 1 Ap statistics vocabulary/concepts

studied byStudied by 73 people
5.0(5)
get a hint
hint

cluster sampling

1 / 58

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

59 Terms

1

cluster sampling

A probability sampling technique in which clusters of participants within the population of interest are selected at random, followed by data collection from all individuals in each cluster

New cards
2

advantages/disadvantages of cluster sampling

cluster sampling is widely used because of its cost- effectiveness and ease of implementation. In many cases, the only representative sampling frame available to researchers is one based on clusters.

disadvantages: a primary disadvantage of cluster sampling is that the clusters often are homogeneous. the more homogeneous the cluster, the less precise the sample estimates. another concern is the appropriateness of the designated cluster factor used to identify the sampling unites within clusters.

New cards
3

stratified sampling

a variation of random sampling; the population is divided into subgroups of characteristics (similar people get grouped together)

New cards
4

Stratified sampling advantages/disadvantages

Advantages: Sample accurately reflects the population structure; Guarantees proportional representation of groups within a population; Disadvantages: Very costly and time-consuming. Also, people must be clearly classified into distinct strata. Selection within each stratum suffers from same disadvantages as simple random sampling

New cards
5

undercoverage bias

occurs when some groups in the population are left out of the process of choosing the sample

New cards
6

non-response bias

Bias introduced into survey results because individuals refuse to participate or are not reachable

New cards
7

What term do you have to put when using calculator to generate random numbers

"regenerate if repeated values"

New cards
8

steps to SRS using random digits table

  1. ASSIGN each member of the population from 01 to N.*

2. Determine the population size and sample size.

3. Select a starting point on the random number table. (randint)

4.) Select the first n numbers (however many numbers are in your sample) whose last X digits are between 0 and N. For instance, if N is a 3-digit number, then X would be 3.

5. Continue this way through the table until you have selected your entire sample, whatever your n is

New cards
9

systematic sampling

select some starting point and then select every kth (e.g. every 3rd) element in the population

New cards
10

advantages/disadvantages of systematic sampling

Advantages: Simple and quick to use, no need to label each member of the sample, suitable for large samples and large populations; Disadvantages: It can introduce bias if the sampling frame is not random (e.g. numbering off people for dodgeball and students stand in a certain position to be on the same team)

New cards
11

Sample survey procedure

figure out → 1.) population we want to describe 2.) what we want to measure 3.) how to sample

New cards
12

Census

the entire population

New cards
13

Population

everybody within the character of interest -> does not actually mean the entire population unless the character of interest is the entire population of the world

New cards
14

collecting a census advantages/disadvantages

advantages - most accurate representation of the data no need to even sample; disadvantages - near impossible to do when the population gets bigger

New cards
15

What are some bad sampling methods

convenience sample, voluntary sample, etc

New cards
16

what makes a bad sampling method - bad

they overestimate or underestimate the data that are not able to be generalized to the vast population.

New cards
17

random sampling

a method of poll selection that gives each person in a group the same chance of being selected

New cards
18

Simple Random Sample (SRS)

a sample in which each set of n elements in the population has an equal chance of selection

New cards
19

confounding

When two or more variables are associated in such a way that their effects on a response variable cannot be distinguished from each other. (testing for two things at once, like the sitting and closing eyes heart rate test that we did during class)

New cards
20

Control group

Experimental group whose primary purpose is to provide a baseline for comparing the effects of the other treatments. Depending on the purpose of the experiment, a control group may be given a placebo or an active treatment (called a double blind)

New cards
21

Sampling with replacement

any number can be counted multiple times (e.g. lottery, prize selection the same person can get three pieces of candy)

New cards
22

Sample without replacement

a number cannot be repeated (ignore duplicates) - the same item cannot be selected more than once

New cards
23

How do both experimental and observational studies choose their study subjects

Both types of studies use random sampling to select participants

New cards
24

Experimental study versus observational study

experimental studies involve actively assigning participants specific tasks/treatments - manipulating a variable; observational studies observe participants’ behaviors without directing them/observing data that already exists without changing anything

New cards
25

Setting of experimental study versus observational study

Experimental studies are often conducted in structured environments like labs; observational studies are often conducted in natural environments without researcher control

New cards
26

Cost and length difference between experimental study versus observational study

Experimental studies are generally more expensive when there are more controls; observational studies are usually less expensive but can go on for longer

New cards
27

treatment

a specific condition that is applied to the individuals in an experiment → consider the experimental group

New cards
28

experimental units

smallest collection of individuals to which treatments are applied. When the units are people, they are often called “subjects”

New cards
29

response variable

dependant variable → measures an outcome of a study

New cards
30

explanatory variable

independent variable → the treatment/the condition; explains or predicts changes in a response variable

New cards
31

advantages of an experiment over an observational study

causality establishment, control over variables, precision and accuracy, takes less time, replicability, manipulation of variables, experimental conditions(allow researchers to create conditions that might not naturally occur or take too long to occur)

New cards
32

double blind

neither the subjects nor those who interact with them that measure the response variable know which treatment a subject received (real versus placebo)

New cards
33

Inference for sampling

Drawing conclusions that go beyond the data at hand → making inferences/generalizations about the entire populations which are likely based on the info that is put out

New cards
34

Completely randomized design

Design in which the experimental units are assigned to the treatments completely by chance

New cards
35

Block

Group of experimental units that are known before the experiment to be similar in some way that is expected to affect the response to the treatments

New cards
36

Statistically significant

Observed effect so large that it would rarely occurred by chance (specified as the P value which is 5 percent)

New cards
37

Placebo

A fake treatment (typically used on the control group to signify the effects of the real treatment)

New cards
38

Prospective study

A prospective study watches for outcomes, such as the development of a disease, during the study period and relates this to other factors such as suspected risk or protection factor(s). The study usually involves taking a cohort of subjects and watching them over a long period.

New cards
39

What are effective means to display categorical data

bar chart, pie chart

New cards
40

What are effective means to display quantitative data

Histograms, percentage polygon, line graphs, cumulative percentage graphs

New cards
41

response-bias

The response bias refers to our tendency to provide inaccurate, or even false, answers to self-report questions, such as those asked on surveys or in structured interviews; response bias is also introduced based on how the question is worded.

New cards
42

population paramater

a population parameter is a number that describes something about an entire group or population; e.g. the mean, median, standard deviation etc

New cards
43

principles of experimental design

comparison, random assignment, control, replication

New cards
44

replication (in an experiment)

replicating a treatment to numerous different experimental units

New cards
45

common response variable

a type of lurking/confounding variable → neither explanatory nor response, but still affects the relationship between both of these variables

New cards
46

Control (in an experiment)

When conducting an experiment, a control is an element that remains unchanged or unaffected by other variables

New cards
47

what is always the first step of any type of sampling (cluster, stratified etc)

defining the population you would like to take your sample from

New cards
48

what are the three types of experimental designs

completely randomized design, matched pairs design, and repeated measure design

New cards
49

matched pair design

Pairs of people who share similar characteristics are each tested with differing treatments. Then, their results are compared to see if the treatment has an effect on what is being measured.

<p>Pairs of people who share similar characteristics are each tested with differing treatments. Then, their results are compared to see if the treatment has an effect on what is being measured.</p>
New cards
50

repeated measure design

Repeated measures design is a research design that involves multiple measures of the same variable taken on the same or matched subjects either under different conditions or over two or more time periods

<p>Repeated measures design is a research design that involves multiple measures of the same variable taken on the same or matched subjects either under different conditions or over two or more time periods</p>
New cards
51

what are some examples of nonsampling error

data entry errors, biased survey questions (which might lead to response bias), biased processing/decision making, non-responses, inappropriate analysis conclusions, and false information provided by respondents.

New cards
52

sampling frame

a list of the items or people forming a population from which a sample is taken.

New cards
53

what is the diference between block design and stratified random sampling

block design is a specific method use for designing an experiment while stratified random sampling is a method of gathering a sample

New cards
54

sampling variability

results vary from sample to sample

New cards
55

margin of error

how far off our sample from the truth of the population

New cards
56

factor

another word for explanatory variable

New cards
57

how should you choose which variable to block for

blocks should be formed based on the most important sources of variability.

New cards
58

levels

number of variations within a factor (explanatory variable)

New cards
59

population of generalization

the people that you are pulling your sample from

New cards

Explore top notes

note Note
studied byStudied by 5 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 10 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 8 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 5 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 12 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 5 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 14 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 26493 people
Updated ... ago
4.8 Stars(224)

Explore top flashcards

flashcards Flashcard74 terms
studied byStudied by 20 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard24 terms
studied byStudied by 27 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard36 terms
studied byStudied by 17 people
Updated ... ago
5.0 Stars(2)
flashcards Flashcard25 terms
studied byStudied by 3 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard74 terms
studied byStudied by 24 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard38 terms
studied byStudied by 23 people
Updated ... ago
4.3 Stars(3)
flashcards Flashcard84 terms
studied byStudied by 35 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard68 terms
studied byStudied by 89 people
Updated ... ago
5.0 Stars(3)