lec 8 - sampling framework and measurement error

Survey: around 3 open ended, mostly closed

Sampling Process:

DRAWING AN EFFECTIVE SAMPLE

1. Define the Target Population

2. Identify a Sampling Frame

main idea: ask the right people
use the right way
to the right number of people

Population: total of all the elements sharing the characteristic of interest

Census: collecting data from all members of the population

Sample: subset of observations from the target population

Represents the target population
used to infer characteristics of target population

Sampling Frame: specific list from which the sample is to be drawn

Telephone book
- target population: everyone
- excluding households without a landline or telephone
Student Email List
- Target population: students
- excluding no internet access, those who didn’t sign up
Campus Intercept
- Target population: students
- excludes online students, not everyone is on campus on certain times of days

Sampling Frame Error: occurs when a biased sub-population is used to select a sample

Example: Roosevelt vs Landon. despite 2.27 million responses, they used car registrations and telephone directories (tend to be richer)'

3. Select a Sampling Method

Probability Sampling: randomly drawn, all members of the population have a known and non-zero probability of being included in the sample

simple random - made using a random number, all equal probabilities
systematic - using a skip interval, random starting point, selecting the k-th element in each chosen interval, efficient
cluster - select from a set of divided clusters, randomly select from within the cluster, less costly and more efficient
stratified - choose from subsets that you create based on an important characteristic (ex: marketing and finance, selected from each stratum) less random error than SRS
- Proportional stratified - sample reflects the original proportion (60-30-10) split
- Disproportional stratified - sample doesn’t reflect original population and is instead equal. use to emphasize the voice of smaller groups

Non-Probability sampling: Members are selected based on certain criteria (non-random). The probability of being included in the sample is not systematically known.

convenience - select people who are easily accessible
judgement - based on researcher’s judgment
quota - made from subsets with certain characteristics, simple to stratified but elements are not selected randomly
snowball - based on referrals from earlier respondents, useful for understanding low incidence or rare population

4. Determine a Sample Size

relates to accuracy vs how it represents.
- too small - large sampling error
- too large - time-consuming and costly
Rule of Thumb: commonly used sample sizes in marketing research

Incidence rate: probability of finding qualified respondents

response rate: probability of qualified respondents completing the survey

Measurement Error: built up of true characteristic and measurement error (random vs systematic)

Random Error: Deviation from the truth because we use a sample. cannot be fully eliminated, but reduced by increasing sample size

Systematic Error: results from mistakes or problems in the questionnaire or flaws in the execution of the sampling. cannot be reduced by increasing sample size

Non-sampling error:
- Measurement Instrument Bias: due to poorly design questions
- Interviewer Bias: affects responses “do you really think so?”
- Response Bias: respondents answer incorrectly
- Nonresponse bias: participants differ from non-participants
Sampling design error:
- Population specification Error: when research does not correctly choose survey target
- Sampling Frame Error: wrong sub-population is used for sample selection
- Sample Selection Error: sampling units are selected in a biased way