Sampling Methods and Population Parameters

Defined as the degree of uncertainty in sample estimates.
Margin of error formula: ext{Margin of Error} = rac{1}{ ext{sqrt}(n)}
- Where n is the sample size.

Simple random sample: A method where every unit in the sampling frame has an equal chance of being included:
- Every possible combination of units has the same probability of selection.
Importance: Ensures fair representation in the sample.

Upcoming discussion topics include:
- Stratified random sampling
- Cluster sampling
- Systematic sampling
- Discussion on poor sampling methods and cautionary tales on data collection.

Example presented of sampling from 1,600 people watching a YouTube channel yielding a response of 24%:
- Calculation of margin of error:
- ext{Margin of Error} = rac{1}{ ext{sqrt}(1600)} = 2.5 ext{%}
- Range: 24% ± 2.5% (i.e., between 21.5% and 26.5%).
Question arose regarding the actual population parameter:
- Census: The only way to know the true population parameter completely.
- Importance of understanding that estimates can only provide an approximation of the truth.

Acknowledged that we live in a world of uncertainty regarding statistical estimates:
- No guarantee of accurate population parameters without a full census.
- Statistical methods aim to provide credibility and plausibility but cannot achieve 100% certainty.

Definition: Population divided into distinct strata, from which random samples are drawn for each strata.
Example criteria for strata:
- Ethnicity, gender, or other relevant demographic factors.
Advantages:
- More accurate estimates within each strata.
- Individual estimates for each strata.
- Potential cost savings if strata are geographically distinct.
Disadvantages:
- Difficulty in defining appropriate strata.
- Risk of bias if strata are incorrectly defined.
- More complex implementation compared to simple random sampling.

Definition: Population divided into clusters; entire clusters are randomly selected.
Advantage:
- No need for a complete list of individuals; only clusters are needed.
Example: Sampling entire floors in a dormitory.
Disadvantages:
- Inherent risk of bias; results may not represent the entire population poorly.
- Less precision due to similarity within clusters.

Definition: Sampling method where the sampling frame is ordered and units are selected at regular intervals.
Example: Taking every nth unit from a list.
Disadvantages:
- Potential periodicity bias if there are underlying trends corresponding to the intervals used.
Requires a complete sampling frame for implementation.

Combination of different sampling methods:
- Useful for large-scale studies.
- Example: Stratifying first by region, then randomly sampling within those strata.

Importance of methodologies in surveys and polls:
- Random digit dialing as a common method for surveys.
- Issues of coverage error related to those without telephones.

Recent poll conducted by Curia Market Research:
- Sample size of 1,000 people — margin of error calculated using:
  ext{Margin of Error} = rac{1}{ ext{sqrt}(n)}
  ightarrow ext{where } n = 1000 yielding approximately 3.16%.
Critique on methodology of Curia Market Research by statisticians.

Sampling Error: Difference between the sample estimate and the true population value due to the sample selection method.
- Measurable through margin of error.
Non-Sampling Error: Errors not associated with the sampling process itself;
- Arises from flaws in survey design, methodology, and sampling frame inaccuracies.
- Cannot be quantified, making it difficult to validate results.

Awareness of the difference between these errors aids in improving the accuracy and credibility of statistical findings.
Next steps include discussions on how to minimize non-sampling errors.

Discussion also previewed for upcoming lectures covering selection methods and the implications of sampling errors in survey research.
Acknowledge the complex nature of data collection and continuous need for critical thinking in statistical evaluation.