STATS OVERVIEW: Quantitative Research Design

Quantitative Research Design Notes

Today's Agenda

  • Quantitative Research

  • Variables

  • Data Collection

  • Sampling

  • Descriptive Research

  • Standard Scores

  • Data Analysis & Report Writing

The Research Process

  • Research Design: An outlined plan for data collection and analysis; the purpose of the study determines the design that will be used.

  • Six Steps of the Research Process:

    1. Identify the topic

    2. Review literature to narrow the problem

    3. Develop research design

    4. Collect data

    5. Analyze data

    6. Report results

Variables/Constructs

  • Definition: Any trait, attribute, or characteristic that varies across individuals.

  • Something a researcher intends to explore and understand.

  • Quantitative Research: Uses "Variables" (measurable traits).

  • Qualitative Research: Uses "Constructs" (conceptual ideas).

Defining Variables
  • Operationalizing (for quantitative research):

    • Stating its specific definition for the purpose of the intended research.

    • Precisely defining what the variable includes and does not include.

    • Stating how the variable will be identified and measured.

    • Example: Mental Health

      • Could be operationally defined as a DSM diagnosis.

      • Could be operationally defined by a score on a specific wellness scale.

  • Conceptually Defining (for qualitative research):

    • Assigning a dictionary-type meaning.

    • Example: Mental Health

      • Could be conceptually defined as "a person's condition regarding their psychological and emotional well-being."

Research Questions

  • Definition: Questions that the researcher seeks to answer.

  • Quantitative Research Questions:

    • Reflect the relationship between variables.

    • Are posed in the form of a question.

  • Qualitative Research Questions:

    • Are broad, open-ended questions.

    • Answer the "how" or "what" about a particular phenomenon.

Quantitative Research Overview

  • Key Characteristics:

    • Defining the variables.

    • Posing research questions or hypotheses that reflect the relationship between variables.

    • Collecting data in numeric values.

    • Conducting statistical analyses to determine whether proposed relationships exist.

  • Note: Understanding quantitative research does not require being an expert in advanced math; software like SPSS and JASP assist with analyses.

Variables in Quantitative Research
  • Purpose: To determine the relationship between variables.

  • All quantitative research has at least one of each of the following:

    • Independent Variable (IV):

      • Hypothesized to be the cause.

      • Researchers manage or manipulate the IV.

    • Dependent Variable (DV):

      • Hypothesized to be the outcome or the effect of the IV.

  • Relationship: The IV causes changes to the DV.

  • Examples for Identification (IV and DV):

    1. Study: "Does the frequency of counseling affect counseling outcome?" ext{IV} = ext{Frequency of counseling} ext{DV} = ext{Counseling outcome}

    2. Study: "Do different CBT techniques reduce depression?" ext{IV} = ext{Different CBT techniques} ext{DV} = ext{Depression}

    3. Study: "Can one's success in college be predicted by income, intelligence, and race?" ext{IV} = ext{Income, Intelligence, Race} ext{DV} = ext{Success in college}

    4. Study: "Does mindfulness have a significant relationship with client perceived empathy?" ext{IV} = ext{Mindfulness} ext{DV} = ext{Client perceived empathy}

Data Collection in Quantitative Research

  • First Step: Identify the targeted population.

    • Population: An entire unit or group; includes all units of interest in the study (e.g., all students, all clients, all employees).

  • Reality: Researchers usually cannot gather data from the whole population.

    • Solution: Use sampling strategies to draw a smaller group of individuals (a "sample") that may represent the population.

    • Goal: Conduct statistical analyses on the sample and then support that the findings are generalizable to the larger population.

Sampling in Counseling Research
  • Challenges: Counseling researchers often work with protected populations (e.g., individuals in medical, clinical, or correctional institutions; elders or minors), which limits sampling options.

  • Common Sampling Strategies:

    • Random sampling

    • Nonprobability sampling

    • Convenience sampling

    • Snowball sampling

  • Adequate Sample Size: "It depends" on the research; tools like G*Power can help determine appropriate sample size.

Sample Bias
  • Definition: Issues that arise from the inability to identify or gain access to a true representation of the population, often leading to underrepresenting or overrepresenting certain members.

  • Causes of Bias:

    • Inability to identify a population.

    • Difficulty gaining access to an identified population.

    • Underrepresenting or overrepresenting certain members of the population.

    • Volunteerism: Participants may have a personal interest or connection to the topic, which can skew results.

  • Addressing Sample Bias:

    • Reporting participants' demographic characteristics.

    • Discussing how certain sample biases may influence the results in the research report.

  • Relationship between Population, Sampling, and Statistics:

    • Population o Sampling o Sample o Descriptive Statistics o Inferential Statistics o Population (generalization)

Descriptive Statistics

  • Definition: Describe the basic features of the data in a study.

  • Key Components:

    • Frequency Distribution

    • Measures of Central Tendency

    • Variability

Scales of Measurement
  • Definition: Classification of what numbers mean to a particular variable.

  • Crucial for managing statistical software calculations.

  • Types of Scales:

    • Nominal:

      • Assigned to groups or categories.

      • Numbers serve as names or labels.

      • Examples: Gender (1=cisgender woman, 2=cisgender man), marital status, religion.

    • Ordinal:

      • Measurement by rank or order.

      • No assumed equal interval between places.

      • Examples: Letter grade (1=A, 2=B, 3=C…), ranking.

    • Interval:

      • Equal interval between scale points.

      • Arbitrary zero point (zero does not mean absence of a quantity).

      • Examples: Temperature, IQ test scores.

    • Ratio:

      • Possesses a true zero point (zero means the complete absence of a quantity).

      • Examples: Age, years of counseling experience.

Frequency Distribution
  • Purpose: To organize and summarize data to show how often different values or ranges of values occur.

  • Frequency Table: Organizes data into categories or intervals, showing absolute and relative frequencies.

    • Example Structure:
      | Years of Experience (X) | Frequency (f) | Percentage | Cumulative Percentage |
      | :-------------------- | :------------ | :--------- | :-------------------- |
      | 13-15 | 1 | 10.0% | 100.00% |
      | 10-12 | 0 | 0.0% | 90.0% |
      | 7-9 | 1 | 10.0% | 90.0% |
      | 4-6 | 5 | 50.0% | 80.0% |
      | 1-3 | 3 | 30.0% | 30.0% |
      | N=10 | | 100.00%| |

  • Graphing Frequencies:

    • Histogram: Bar graph showing quantities or percentages within specified ranges.

    • Polygon (Frequency Polygon): Line graph connecting the midpoints of the tops of the bars of a histogram.

Measures of Central Tendency
  • Purpose: To find a single value that best represents the center or typical value of a distribution.

  • Mean:

    • Most commonly used method of describing central tendency.

    • The arithmetic average of all scores.

    • Calculated by summing all values and dividing by the number of values.

    • Formula: \bar{X} = \frac{\sum X}{N}

  • Median:

    • The midpoint of an ordered list of values.

    • Found by listing values in rank order and identifying the point below which one-half of the scores lie.

    • Example 1 (odd number of scores): 2, 3, 3, 5, 7, 8, 9 (Median = 5).

    • Example 2 (even number of scores): 2, 3, 3, 5, 6, 7, 8, 9 (Median = (5+6)/2 = 5.5).

  • Mode:

    • Reports the most frequent score in the variable.

    • Most useful when studying nominal variables.

    • Not often a useful indicator of central tendency in a continuous distribution.

Types of Distribution
  • Symmetrical Distribution:

    • Often described as a "Bell Curve" or normal distribution.

    • Mean, Median, and Mode are approximately equal and located at the center.

  • Skewed Distribution: The "tail tells the tale" in terms of skewness.

    • Positive Skew (Right Skew):

      • The tail extends to the right (positive direction).

      • Mean > Median > Mode.

      • Indicates a few very high scores pulling the mean upwards.

    • Negative Skew (Left Skew):

      • The tail extends to the left (negative direction).

      • Mean < Median < Mode.

      • Indicates a few very low scores pulling the mean downwards.

  • Multimodal Distribution: Has more than one mode (peaks in frequency).

Standard Normal Distribution ("Bell Curve")
  • A specific type of symmetrical distribution representing many naturally occurring phenomena.

  • Properties:

    • Mean = 0, Standard Deviation (SD) = 1.

    • Specific percentages of data fall within certain standard deviations from the mean:

      • Approx. 68.2\% of data within \pm 1 SD.

      • Approx. 95.4\% of data within \pm 2 SD.

      • Approx. 99.7\% of data within \pm 3 SD.

Variability

  • Definition: The extent to which the scores in a distribution differ from each other; how the scores are spread out.

  • Homogenous Distribution: Lacking variability (scores are close together).

  • Heterogeneous Distribution: Having much variability (scores are spread out).

  • Three Frequently Used Measures of Variability:

    • Range

    • Variance

    • Standard Deviation

Measures of Variability
  • Range:

    • Calculated by taking the highest score and subtracting the lowest score.

    • Represents the distance or difference between the largest and smallest scores.

    • Considered unstable because it relies on only two scores.

  • Variance:

    • The average of the squared deviations from the mean.

    • Represents how close the scores in the distribution are to the mean (larger variance indicates more spread).

  • Standard Deviation (SD):

    • The square root of the variance.

    • Indicates the average difference between individual scores and the group mean.

    • It is in the same units as the original data, making it more interpretable than variance.

Standard Scores

  • Standardization: A process of converting individual raw scores in a distribution to a common scale.

Z-score
  • Definition: A number indicating the distance an individual raw score is above or below the mean in standard deviation units.

  • Purpose:

    • Tells how large or small an individual score is relative to other scores in the distribution.

    • Allows comparison of individual scores from two variables that use different scales for measurement.

  • Properties: When a distribution is converted to z-scores, its mean becomes 0 and its standard deviation becomes 1.

  • The absolute value of a z-score indicates the distance of the raw score from the mean of the distribution.

T-score
  • Definition: Standardized scores widely used to report performance on standardized tests and inventories (e.g., personality assessments).

  • Properties:

    • A T-score of 50 represents the mean.

    • A difference of 10 from the mean indicates a difference of one standard deviation.

  • Formula for Conversion from Z-score: T = 50 + 10z

Basic Data Analysis and Reporting

  • The next step after collecting and describing data involves further statistical analysis and report writing based on the research design.