sampling types

Sampling Types

Sampling in Statistics: Refers to techniques used to select a subset of individuals from a population to estimate characteristics of the whole population.
Types of Sampling:
  - Simple Random Sampling
  - Systematic Sampling
  - Stratified Sampling
  - Cluster Sampling
  - Convenience Sampling

Sampling Methods in Statistics

1. Simple Random Sampling

A sample of size n is taken such that every possible sample of size n has an equal chance of being chosen.
Often synonymous with "random sample".

2. Systematic Sampling

Selects every kth subject from a list or queue.

3. Stratified Sampling

The population is divided into strata (groups) that share similar characteristics, then a random sample is taken from each stratum.

4. Cluster Sampling

The population is divided into clusters (groups), then a number of clusters are randomly selected, with all members from chosen clusters surveyed.

5. Convenience Sampling

Data is collected from individuals who are easiest to access rather than following a random selection process.

Summary of Sampling Methods

Simple Random Sample: Every subject has an equal chance of selection.
- Example: A local health clinic randomly selects 100 patients from registered lists for a diabetes study.
Exemplar of Systematic Sampling: Select every kth subject (e.g., every 10th person).

Sampling Errors

Sampling Error: Occurs when there are discrepancies between sample results and the true population results due to chance.
Non-Sampling Error: Arises from human error, such as incorrect data entries, biased questions, or inappropriate statistical methods.
Nonrandom Sampling Error: Results from using a non-random sampling method, like a convenience sample.

Types of Studies

A. Observational Studies

No manipulation or intervention on subjects or variables.
Risk factors are variables presumed to relate to potential outcomes.

Study Designs:

Cross-sectional study: Measures both exposure and outcome simultaneously.
Retrospective (case-control) study: Looks back at subjects with known outcomes to assess prior exposure.
Prospective (longitudinal or cohort) study: Follows subjects over time to assess the development of outcomes based on their initial risk factors.

Definitions and Examples of Studies Conducted

Diabetes Study: Random selection of 100 patients to check for diabetes indicators.
Blood Pressure Study: Random selection of 200 individuals without stratification to check average blood pressure levels.
Diabetes Risk Study: Population categorized into age groups for proportional representation in sampling.

Notation for Probabilities

P denotes probability.
Specific events are denoted as A, B, C, etc.
P(A) represents the probability of event A occurring.

Approaches to Finding Probability

Relative Frequency Approximation
Classical Approach
Subjective Probability

Relative Frequency Probability Example

Example: Skydiving - 3,000,000 jumps with 21 deaths.
- P(skydiving death) = [ \frac{21}{3000000} = 0.000007 ]

Classical Probability Example - Gender of Children

Sample space for three children: {bbb, bbg, bgb, bgg, gbb, gbg, ggb, ggg}
P(three same gender) = [ \frac{2}{8} = 0.25 ]

Subjective Probability Example - Acute Appendicitis

Estimated probability of 0.001 based on experience and lack of historical data.

Incidence and Prevalence

Incidence: Number of new cases of a disease developing in a population at risk over time; indicates disease risk.
Prevalence: Total number of existing cases of a disease at a specific time; indicates disease burden.

Rates of Mortality & Morbidity

Infants: Babies born alive.
Neonates: Infants under 28 days old.

Contingency Table

A matrix format table displaying frequency distribution for variables.

Absolute Risk Reduction

[ ext{Absolute Risk Reduction} = | P( ext{event in treatment}) - P( ext{event in control}) | ]

Relative Risk Definition

Ratio comparing risk of disease in those with a risk factor against those without.

Interpreting Relative Risk

Value of 1: No difference in risk.
Value greater than 1: Increased risk for treatment group.

Example: Salk Vaccine Study

Treatment Group with polio: 33 out of 200,712, Control Group: 115 out of 201,114
[ P_t = \frac{33}{200712} \text{ and } P_c = \frac{115}{201114} ]
Relative Risk: [ RR = \frac{0.000164}{0.000571} = 0.287 ]

Interpretation of Study Results

Vaccines reduce polio risk; rate for vaccinated kids is 0.287 compared to placebo.
Reciprocal indicates placebo group is 3.48 times more likely to contract polio.

Odds Ratio Definition

[ \text{Odds Ratio} = \frac{\text{odds in favor of treatment group}}{\text{odds in favor of control group}} ]

Example of Odds Ratio

Retrospective Study of newborns indicates rehospitalization odds of [ \text{Odds Ratio} = \frac{457 \cdot 2860}{3199 \cdot 260} = 1.571 ]
Interpretation: Higher rehospitalization risk for newborns discharged early.

Conclusion

Relative Risk is typically used for prospective studies, while Odds Ratio can be used in both prospective and retrospective studies.