TOPIC #8: Sample surveys

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/20

flashcard set

Earn XP

Description and Tags

Module 3, DATA1001

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

21 Terms

1
New cards

how a population is surveyed

census

2
New cards

sample is a…

subset of a population

3
New cards

limitations of a census

  • time consuming to collect information about every unit of a population

  • resource intensive

  • costly

  • difficult to enforce participation.

4
New cards

parameter

numerical fact about the population which we are interested in e.g population mean

5
New cards

estimate (or statistic)

calculation of sample values which best predicts the parameter, e.g sample mean

6
New cards

selection bias

systematic tendency to exclude or include on type of person from the sample

7
New cards

non-response bias

caused by participants who fail to complete surveys

8
New cards

interviewer bias

when the interviewer has to make a choice of particpants in the survey or when characteristics of the interviewer have an effect on the answer given by participants

9
New cards

measurement bias

when the form of the question in the survey affects the response to the question.

10
New cards

examples of measurement bias

recall bias: people forget details

social desirability bias: people may not tell the truth

lack of clarity in the question: people may misinterpret the question, or certain words may be ambiguous and so mean different things to different people.

11
New cards

simple random sampling

probability sampling technique which involves drawing individuals from a population at random without replacement - each individual has an equally random chance of being selected from the population

12
New cards

stratified sampling

sampling from a population that is partitioned into subpopulations (e.g gender, age, location, etc). sampling is done independently in each subpopulation (stratum), ensuring that the sample is representative of the population with respect to certain characteristics.

13
New cards

multistage cluster sampling

takes samples in stages and simple random sampling is performed within each stage.

14
New cards

quota sampling

non-probabilistic version of stratified sampling, assembled sample is required to have a pre-determined number of individuals with respect to certain characteristics

may result in unintentional bias from the interviewers when they choose subjects to survey

15
New cards

convenience sampling

non probability sampling technique, subjects are selected because it is convenient to select them. (not recommended as the partcipants that are easy to recruit may not be representative of the population of interest)

16
New cards

sample surveys always involve ….. error, because…..

chance, each sample is just one possible draw from the population.

17
New cards

why does the SE from the box model need to be adjusted to get the exact SE for a survey?

because in practice surveys are drawn without replacement and the same person can’t be interviewed twice.

18
New cards

correction factor (finite population correction)

SEwithout replacement= correction factor x SEwith replacement

where:
correction factor = root [(population size - sample size) / (population size - 1)]

19
New cards

how do you construct a confidence interval from a single sample survey?;

  1. estimate the box population proportion (mean) to be equal to the sample proportion.

  2. follow the box model and the CLT, we can then assume the sample proportion (mean) follows a normal distribution, eith EV = meanbox, and SE = SDbox/root n)

20
New cards

bootstrapping

a computational process that allows us to estimate the properties of the population, by using the properties of a particular sample. it is a method of resampling with replacement from a sample to estimate variability and CIs.

21
New cards

what is a confidence interval

A range of plausible values for the population parameter based on sample data