Statistics and Sampling - Lecture Flashcards

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/86

flashcard set

Earn XP

Description and Tags

A comprehensive set of flashcards covering sampling design, study components, data types, measurement, surveys, and common biases.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

87 Terms

1
New cards

What questions should you ask about the 'Source' in a study?

Is the source reputable? Is the journal peer-reviewed? Who funded the study? Could funders have a vested interest in the outcome?

2
New cards

What does the 'Researchers' component refer to in the seven components?

The people actively collecting data; their behavior or expectations can influence responses.

3
New cards

What does 'Individuals or Objects Studied' require in terms of representative sampling?

A representative sample that replicates the population demographics; specify state/country, major, year, and how they were selected.

4
New cards

What does 'Measurements' involve in the seven components?

Some questions are hard to measure; some are ambiguous; definitions and measurement quality matter.

5
New cards

What does 'Setting' include in a study?

When, where, and how data were collected (season/time of day, public/private/lab, method like email/phone/face-to-face).

6
New cards

What are 'Extraneous Differences' and how are they addressed?

Differences between groups other than the treatment; random assignment helps control them in experiments.

7
New cards

What is meant by 'Magnitude' in the context of study results?

The size of observed differences; avoid vague language and report meaningful, clear estimates (e.g., risk reduction numbers).

8
New cards

What is 'Random Assignment' and why is it used?

Assigning participants to treatment or control groups by chance to protect against systematic differences between groups.

9
New cards

What is a 'Frame' in sampling?

A list or method used to define the population from which the sample is drawn.

10
New cards

What is a 'Census'?

Survey attempting to include every unit in the population.

11
New cards

Differentiate 'Population' vs 'Sample' vs 'Unit'.

Population is the entire group of interest; Sample is a subset from which data are collected; Unit is a single individual/object (e.g., person, household, school).

12
New cards

What is a 'Representative Sample'?

A sample that reflects the population's characteristics, allowing generalization to the population.

13
New cards

What is an 'Observational study' vs an 'Experiment'?

E study: observational, no manipulation. Experiment: manipulation of one or more explanatory variables to observe the effect on a response variable.

14
New cards

What is 'Data' in research terms?

Information gathered about subjects; can be numerical (measurements) or categorical (types).

15
New cards

What is a 'Sample Survey'?

An observational study where a sample is used to estimate characteristics of the whole population.

16
New cards

What is a 'Unit' in data collection?

A single individual or object on which data are collected (e.g., person, household, school).

17
New cards

What is a 'Placebo' in experiments?

A fake treatment with no active ingredients used to monitor the placebo effect.

18
New cards

What is a 'Control Group' in experiments?

Receives no treatment or a placebo; used for comparison with the treatment group.

19
New cards

What is a 'Treatment Group'?

Receives the actual treatment being tested.

20
New cards

What is a 'Lurking Variable'?

A variable not included in the study that can affect the results.

21
New cards

What is a 'Confounding Variable'?

An extraneous variable related to both the explanatory and response variables that may explain observed differences.

22
New cards

What is an 'Interacting Variable'?

A variable that changes the way the explanatory variable affects the response.

23
New cards

What is 'Measurement Validity'?

Whether a measurement accurately reflects the concept of interest without distortion.

24
New cards

What is 'Reliability' in measurement?

Consistency of a measurement; results are reproducible under the same conditions.

25
New cards

What is 'Bias (Measurement Context)'?

Systematic deviation from the true value due to the measurement context or method.

26
New cards

What is 'Natural Variability'?

Inherent fluctuations in values across individuals over time, even without measurement error or bias.

27
New cards

What is a 'Case-Control' study?

An observational study comparing those with a condition (cases) to those without (controls), often retrospective.

28
New cards

What is a 'Meta-Analysis'?

A study that combines results from multiple studies on the same topic to draw a comprehensive conclusion.

29
New cards

What is a 'Case Study'?

Detailed examination of a single individual or group; not generally generalizable.

30
New cards

What is a 'Double-Blind' study?

Neither participants nor researchers know who receives which treatment.

31
New cards

What is a 'Single-Blind' study?

Participants do not know which treatment they receive; researchers may know.

32
New cards

What is a 'Randomized Controlled Trial' (RCT)?

Participants are randomly assigned to treatment or control groups to test treatment effects.

33
New cards

What is a 'Matched Pairs Design'?

Subjects are paired and each pair receives different treatments.

34
New cards

What is a 'Simple Random Sample'?

Every unit has an equal chance of being selected; every possible sample of the same size is equally likely.

35
New cards

What is a 'Stratified Random Sample'?

Population is divided into strata; random samples are drawn from each stratum to improve representation.

36
New cards

What is a 'Cluster Sample'?

Population divided into clusters; some clusters are randomly selected; all individuals in chosen clusters are surveyed.

37
New cards

What is a 'Systematic Sample'?

Select every kth item from a list, starting at a random point.

38
New cards

What is a 'Volunteer Sample'?

Participants self-select into the study; typically biased and not representative.

39
New cards

What is a 'Convenience/Haphazard Sample'?

Non-random sample chosen for ease of access; prone to bias and not typically representative.

40
New cards

What is 'Random Digit Dialing'?

Randomly generating telephone numbers to reach a representative sample.

41
New cards

What is the 'Margin of Error' in sampling?

A measure of precision of an estimate due to sampling; e.g., ±3%; about 95% of samples would yield an interval containing the true population value.

42
New cards

What is a 'Frame out of date' error?

Non-sampling error where the frame no longer reflects the population, leading to bias.

43
New cards

Open-ended vs Closed-ended questions—what's the difference?

Open-ended allows free-form responses; closed-ended provides fixed choices and is easier to analyze.

44
New cards

What is a 'Pilot Study'?

A small preliminary version of a study to test procedures, questions, and measurement tools.

45
New cards

What is a 'Bias' in survey design?

Tendency for measurements to misrepresent the truth due to wording, sampling, or data collection methods.

46
New cards

What is the 'Desire to Please' bias?

Participants give answers they think researchers want to hear rather than their true beliefs.

47
New cards

What is 'Ordering of Questions' bias?

The sequence of questions can influence how participants respond.

48
New cards

What is 'Confidentiality' vs 'Anonymous' in surveys?

Confidentiality means researchers know identities but won’t share; Anonymous means researchers do not know identities, increasing honesty.

49
New cards

What is a 'Five-number Summary'?

Minimum, first quartile (Q1), median, third quartile (Q3), maximum.

50
New cards

What is a 'Boxplot' used for?

A graphical representation of the five-number summary and potential outliers.

51
New cards

What is the 'Mean' in data?

The average value; sum of measurements divided by the number of measurements.

52
New cards

What is the 'Median' in data?

The middle value when data are ordered; if even, the average of the two middle values.

53
New cards

What is the 'Mode' in data?

The most frequently occurring value.

54
New cards

What is the 'Range' in data?

Difference between the highest and lowest value.

55
New cards

What is 'Standard Deviation'?

A measure of the spread of data around the mean.

56
New cards

What is 'Variance'?

The square of the standard deviation; a measure of data dispersion.

57
New cards

What does 'Skewness' describe?

Asymmetry in the distribution of data.

58
New cards

What does 'Symmetry' mean in distributions?

A distribution where two halves are mirror images; mean ≈ median in symmetric distributions.

59
New cards

What is 'Left-skewed' (negatively skewed) data?

Longer left tail; mean < median.

60
New cards

What is 'Right-skewed' (positively skewed) data?

Longer right tail; mean > median.

61
New cards

What is a 'Likert scale'?

An ordinal scale (e.g., strongly disagree to strongly agree) used for attitudes or opinions.

62
New cards

What is a “Categorical” Variable?

A variable that represents distinct categories without a natural order, such as gender or marital status.

63
New cards

What are the two subsets of a Categorical Variable?

Nominal, Ordinal

64
New cards

What is a “nominal” variable? What is an example?

A type of categorical variable that labels distinct categories without any inherent order. None are better/worse than the other, just different

Examples: Student Numbers, Major, ethnicity, home state

65
New cards

What is a “ordinal” Variable? What is an example?

A type of categorical variable that represents categories with a meaningful order or ranking, where the distances between the ranks may not be uniform.
Examples: Education level, satisfaction rating, class rank, likert.

66
New cards

What is a “measurement” Variable?

A type of variable that quantifies with meaningful numbers or traits, such as temperature, height, weight, age, etc.

67
New cards

What are two types of Measurement variables?

Interval and Ratio

68
New cards

What is an “interval” variable? What is an example?

A type of measurement variable where the difference between values is meaningful, but there is no true zero point, such as temperature in Celsius or Fahrenheit. Also, differences are consistent (30 apart) but NOT in a ratio, no multiplication or division.

69
New cards

What is an “Ratio” variable? What is an example?

A type of measurement variable where both differences and ratios between values are meaningful, such as weight or height, # of somethings, which includes a true zero point.

70
New cards

What is “continuous” vs. “discrete” and which variable does this effect?

Continuous variables can take any value within a range, while discrete variables can only take specific, separate values. This distinction affects variables in measurement, where continuous could include height or weight, and discrete could include the number of individuals.

71
New cards

What is the definition of “statistics?

Science of collecting, describing, and interpreting data. Involving understanding how data is gathered, summarized, and used to draw conclusions.

72
New cards

What is “Quantitative Data”?

Data that represents counts or measurements.

73
New cards

What is a “dotplot”?

Graph w/ dots representing data values

74
New cards

What is a “stem-and-leaf” plot?

Short-hand way of writing down a list of numbers

75
New cards

What is a “mean”

The average of a set of numbers, calculated by dividing the sum of the values by the total number of values.

76
New cards

What is the “median”?

The middle value of a data set when arranged in ascending or descending order, or the average of the two middle values if there is an even number of observations.

77
New cards

What is the “mode”?

The value that appears most frequently in a data set.

78
New cards

What is the “range”?

The difference between the highest and lowest values in a data set, providing a measure of variability.

79
New cards

What is “standard deviation”?

A statistic that measures the dispersion or spread of a set of values around the mean, indicating how much individual data points differ from the average.

80
New cards

What is “variance”? How do you calculate it?

Variance quantifies the degree of spread in a data set, calculated by squaring the standard deviation.

81
New cards

What are “quartiles”?

Quartiles divide a data set into four equal parts, each containing 25% of the data points, helping to describe the distribution and spread.

82
New cards

What is the “Lower Quartile” and what symboles?

The Lower Quartile, also known as Q1 or QL, is the median of the lower half of a data set, separating the lowest 25% of the data from the rest.

83
New cards

What is the “upper quartile” and what symbols?

The Upper Quartile, also known as Q3 or QU, is the median of the upper half of a data set, separating the highest 25% of the data from the rest.

84
New cards

What is the “Five-number summary”?

The Five-number summary consists of the minimum, lower quartile (Q1), median, upper quartile (Q3), and maximum of a data set, providing a quick overview of its distribution.

85
New cards

What is “skewness”?

Description of how the values in a data set are spread out

86
New cards

Describe “Skewed to the left”?

Negatively skewed, smaller values more spread out than larger, mean is smaller than the median.

87
New cards

Describe “Skewed to the right?”

Positively skewed, larger values more spread out than smaller, mean is larger than the median.