The population of interest is public high school students in her county who identify as American Indian/Alaskan Native, Asian, Black, Hispanic, White, or multi-ethnic. She should choose the representative sample randomly.
Whenever we take a sample from a population, there is the potential to introduce sampling bias. It is important to be aware of the potential sources of bias and to take steps to minimize them in the way that we sample.
Here are the four main sources of bias to consider when sampling from a population:
Undercoverage occurs when some groups of the population are left out of the sampling process and the individuals in these groups do not have an equal chance of being selected for the sample. For example, a sample survey of households in a country may miss people who are homeless, prison inmates, or students living in dorms.
Non-response bias occurs when an individual chosen for a sample cannot be contacted or decides not to participate in the study or research. This type of bias occurs after the sample has been selected and can bias the data collected.
Response bias is defined as a systematic pattern of inaccurate responses to questions. This type of bias can occur when a person does not understand a question or feels influenced to respond to a question in a certain way. Response bias can also occur as a result of the wording of questions that are of a sensitive nature.
Voluntary response bias occurs when individuals choose for themselves whether to take part, so the sample is not random or representative of the population. The people who volunteer for a study or survey may be more inclined to respond to questions or report certain behaviors.
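To make the last of these concrete, here is a minimal simulation sketch. Everything in it is an assumption made for illustration, not data from the study: the population is synthetic, 30% of people are assumed to hold a strong opinion, and the response rates (60% for people with a strong opinion, 20% for everyone else) are invented. The function name `simulate_voluntary_response` is hypothetical.

```python
import random

def simulate_voluntary_response(seed=0, population_size=10_000, sample_size=500):
    """Compare a simple random sample with a voluntary-response sample."""
    rng = random.Random(seed)

    # Synthetic population: True means the person holds a strong opinion.
    # The 30% share is an assumption for illustration only.
    population = [rng.random() < 0.30 for _ in range(population_size)]
    true_share = sum(population) / population_size

    # Simple random sample: every person has the same chance of selection.
    srs = rng.sample(population, sample_size)
    srs_share = sum(srs) / sample_size

    # Voluntary response: people with a strong opinion are assumed to be
    # three times as likely to respond (60% versus 20%).
    volunteers = [p for p in population if rng.random() < (0.60 if p else 0.20)]
    voluntary_share = sum(volunteers) / len(volunteers)

    return true_share, srs_share, voluntary_share

true_share, srs_share, voluntary_share = simulate_voluntary_response()
print(f"population: {true_share:.2f}")
print(f"random sample: {srs_share:.2f}")
print(f"voluntary sample: {voluntary_share:.2f}")
```

Under these assumed response rates, the voluntary sample overstates the share of people with a strong opinion, while the random sample stays close to the population value.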
Some potential sources of bias could be people lying, the location of the school, people who don’t know their ethnicity, and a specific ethnicity or race being randomly chosen more often than others.
Some potential sources of bias could be only non-white people answering or only white people answering. Voluntary response bias is another, because some people may be more inclined to respond than others.
She can select a representative sample by randomly selecting 35 students from each high school. This minimizes bias because she gets an equal-sized sample from each school.
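The selection described in this answer could be carried out along these lines. This is only a sketch under stated assumptions: the three school names, the enrollment figures, and the student ID format are hypothetical placeholders, and `sample_per_school` is a helper name invented for the example. In practice the rosters would come from each school's enrollment list.

```python
import random

def sample_per_school(rosters, per_school=35, seed=None):
    """Draw the same number of students, at random and without
    replacement, from each school's roster."""
    rng = random.Random(seed)
    return {
        school: rng.sample(students, per_school)
        for school, students in rosters.items()
    }

# Hypothetical rosters: school names, sizes, and ID format are invented.
rosters = {
    "School A": [f"A-{i:04d}" for i in range(1, 1201)],
    "School B": [f"B-{i:04d}" for i in range(1, 801)],
    "School C": [f"C-{i:04d}" for i in range(1, 451)],
}

chosen = sample_per_school(rosters, per_school=35, seed=42)
for school, students in chosen.items():
    print(school, len(students), students[:3])
```

Within each school every student has the same chance of being picked. If the schools differ a lot in enrollment, as in the invented numbers above, taking 35 from each gives students at smaller schools a higher chance of selection than students at larger ones, which is worth keeping in mind when combining the results.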
The population of interest is all high-school-aged students (ages 14-18) in her county, and the variable of interest is their race/ethnicity. This sample of students is no longer appropriate for question 4 because the population is now all high-school-aged students in the county rather than the students at 3 public high schools.