Sampling Theory

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/46

flashcard set

Earn XP

Description and Tags

Flashcards on Sampling Theory

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

47 Terms

1
New cards

Census

The oldest form of data collection, aiming for organization of tax payments or political representation, now supplemented with a wide variety of relevant information.

2
New cards

Survey

A well-targeted data collection method with a limited but precise scope, used in various fields like health, quality of life, unemployment, and agriculture.

3
New cards

Psychometric Concepts

Aspects such as reliability and validity that capture the quality of a survey.

4
New cards

Standardized Measurements

Measurement instruments that collect data in a standardized fashion, ensuring good psychometric properties like validity and reliability.

5
New cards

Probability Sampling

A family of probabilistic methods by which a subset of units from the sample frame is selected.

6
New cards

Survey Population

The collection of units (individuals) about which the researcher wants to make quantitative statements.

7
New cards

Sample Frame

The set of units (individuals) that has a non – zero probability of being selected.

8
New cards

Sample

The subset of units that have been selected.

9
New cards

Estimand

The true population quantity.

10
New cards

Estimator

A (stochastic) function of the sample data, with the aim to “come close” to the estimand.

11
New cards

Estimate

A particular realization of the estimator, for the particular sample taken.

12
New cards

List Quality

Considerations for a sample frame that requires knowing how a list has been composed and how updating takes place.

13
New cards

Simple Random Sampling

A sampling standard method, studied to compare other methods with.

14
New cards

Systematic Sampling

A sampling method chosen to increase precision and/or to ensure sampling with certainty for a subgroup of units.

15
New cards

Stratification

A sampling method performed to increase precision of population-level estimates or to allow for estimation at sub-population level.

16
New cards

Selection Probability

The probability of an individual to be selected, which should be known or estimable, but does not have to be constant.

17
New cards

Population Average

Y = (1/N) * Σ(I=1 to N) YI

18
New cards

Population Total

Y = Σ(I=1 to N) YI

19
New cards

Population Variance

σY^2 = (1/N) * Σ(I=1 to N) (YI - Y)^2 or SY^2 = (1/(N-1)) * Σ(I=1 to N) (YI - Y)^2

20
New cards

Population Covariance

σXY = (1/N) * Σ(I=1 to N) (XI - X)(YI - Y) or SXY = (1/(N-1)) * Σ(I=1 to N) (XI - X)(YI - Y)

21
New cards

Population Correlation

ρXY = σXY / (σX * σY) = SXY / (SX * SY)

22
New cards

Sampling Mechanisms

Assigning a probability PS (s = 1, …, S) to each sample.

23
New cards

Sample Fraction

f = n/N, relevant only in finite populations.

24
New cards

Expectation

The average of all possible estimates.

25
New cards

Bias

Y – E(^y)

26
New cards

Variance

The averaged squared deviation of a random variable around its mean σ ^y 2 = E( ^y−E ( ^y )) 2 = ∑s=1 S Ps(^ys−∑s=1 S Ps ^ys)2

27
New cards

Standard Error

In the specific case of an estimator, the standard deviation is termed the standard error.

28
New cards

MSE (Mean Squared Error)

MSE(^y) = σ ^y 2 + [bias(^y)]²

29
New cards

Sample Fraction f

f = n/N

30
New cards

Variance of the Mean

σy^2 = (1/n) * σY^2 (with replacement) or σy^2 = (1/n) * (1-f) * SY^2 (without replacement)

31
New cards

Variance of the Total

σ^y^2 = (N^2/n) * σY^2 (with replacement) or σ^y^2 = (N^2/n) * (1-f) * SY^2 (without replacement)

32
New cards

Population Proportion (P)

Denoted by P or . The proportion of units belonging to the subgroup, at the population level P = (1/N) * Σ(I=1 to N) ZI

33
New cards

Population Variance for the Proportion

σZ^2 = (N/(N-1)) * PQ ≃ PQ

34
New cards

Estimating the Size of a Subgroup

Estimated from a sample of size n by n^g = N × ^p, with variance σ^ng^2 = N^2 * (1/n) * (1-f) * (N/(N-1)) * PQ

35
New cards

Delta Method

A method used to assess the random error in a function of random variables, which can use covariances of other variables.

36
New cards

Relative Variance

RVar(X) = Var(X) / X^2 and RCov(X, Y) = Cov(X, Y) / (X * Y)

37
New cards

SRS

Simple Random Sampling

38
New cards

Auxiliary variable

Variables used for correcting mechanisms such as stratification.

39
New cards

Stratification

Partitioning the population in subgroups according to the levels of an auxiliary variable, so that the survey variable is more homogeneous within such a subgroup, or stratum, than in the population as a whole.

40
New cards

Post-stratification

Stratified analysis of a sample that was taken in an un-stratified way.

41
New cards

Multi-stage sampling

Hierarchy of units that is selected. Starting with a primary sampling units (PSU), then with the secondary sampling units (SSU) are subselected, then within which are tertiary sampling units (TSU) are subselected etc.

42
New cards

Weights

Arises naturally in a variety of context such as stratification, clustering and probabilities of section. Represented as ∑wi yi/ ∑wi and N ∑wi yi/ ∑wi

43
New cards

Design Effect

The ratio of two variances: The variance of an estimator taking design aspects into account and The variance of the SRS estimator

44
New cards

PROC SURVEYSELECT

A software that Allows SRS, URS, SYS, SEQ and PPS which can be combined with STRATIFICATION.

45
New cards

Missing Completely at Random (MCAR)

A missing framework represented by f(Ri |ψ).

46
New cards

Missing at Random (MAR)

A missing framework represented by f(Ri |Yi 0 , ψ)

47
New cards

Multiple Imputation

A process involving fillign in missing values M times, analyzing the M complete data sets by using standard procedures and combining the results from the M analyses into a single inference