Measurements, Mistakes, and Misunderstandings

Open vs Closed Questions:
- Open Questions:
- Respondents provide answers in their own words.
- Example: "Describe the most important world events of the last 50 years."
- Allows for detailed responses but difficult to summarize and categorize.
- Closed Questions:
- Respondents choose from a list of predefined options.
- Example: "Which of the following events impacted the world most?"
- Easier to summarize but risks missing crucial information.

Open Questions:
- Advantages:
- Capture more nuanced information from respondents.
- Disadvantages:
- Difficult to analyze; can introduce bias in categorizing responses.
- Respondents may not provide their ideal answer on the spot but think of better responses later.
Closed Questions:
- Advantages:
- Easier to analyze and quantify data.
- Disadvantages:
- Limited options may fail to include respondents’ true thoughts.
- People may be influenced by the "recency effect" where they choose options listed later in the survey.

Discussion on societal events:
- Examples proposed: COVID-19 pandemic and September 11 attacks.
- Importance of collecting data to understand relationships; e.g., between height and happiness (the latter being much harder to define).

Measurement of physical attributes (e.g., height) is straightforward compared to subjective concepts (e.g., happiness).
Difficulty in defining what is being measured can skew results significantly:
- Example - Unemployment Measurement in New Zealand:
- Definition of unemployed: working-age individuals who:
  - do not have a job;
  - have actively sought work in the last four weeks;
  - are available to work.
Statistics of Unemployment Rate:
- Calculated as:
  ext{Unemployment Rate} = rac{ ext{Number of Unemployed}}{ ext{Total Labor Force}}
Contextual variations, such as underemployment and discouraged workers, may lead to misconceptions regarding the true rate of unemployment.

Happiness:
- Challenging to measure because it lacks a standardized metric.
Intelligence:
- IQ tests are a narrow measure and controversial due to:
- cultural bias,
- variability in scoring due to lack of standardization.
Examples of Intelligence Characteristics:
- Cultural context can radically change what skills and knowledge are valued.

Variables:
- Definition: Elements that can take on different values.
- Types of Variables:
- Categorical Variables: Place responses into distinct categories (e.g., gender, age categories).
- Measurement (Quantitative) Variables: Numerical values measured on a number line (e.g., height, income).
  - Discrete Measurement Variables: Whole numbers (e.g., number of students).
  - Continuous Measurement Variables: Any number in a range (e.g., height, weight).

Valid Measurement:
- Requires quantifiable numbers, standard units, and measurable properties.
- Example: Usain Bolt’s speed of 44.72 km/h is valid due to quantifiability and recognition as a measurable property.
- Invalid measure: an ambiguous statement without units or quantifiable data.
Reliability:
- Indicates whether measurements are consistent and repeatable.
- Characteristics: Consistency, precision, reproducibility.
- Comparison of accurate automated systems vs. manual timing methods in sports.

Bias: Systematic skewing of data leading to inaccurate conclusions.
- Factors causing bias: human error, faulty equipment, or flawed data collection methods.
- Example of biased measurement: automated systems malfunctioning and yielding consistently incorrect results.

Aim: Quantifying uncertainty in data for accurate predictions and analyses.
Understanding populations and samples:
- Population: Entire group of interest with specific characteristics (parameters).
- Sample: Subset taken from the population for analysis (statistics).
Importance of unbiased, reliable, and valid data for effective statistical analysis and inferences.

Future emphasis on data collection, descriptive statistics, and ethics of data analysis.

Mention of resources (like Netflix shows on bias) related to data ethics that provide practical insights into the subject matter.
Encouragement to keep informed and engaged as the course progresses into technical aspects of data and statistical analysis.