PH102: Introduction to SPSS - Summarizing Categorical Variables

PH102: Introduction to SPSS - Summarizing Categorical Variables

This lecture introduces the fundamental concepts of datasets and variable types within the context of statistical analysis using SPSS, focusing specifically on how to summarize and visualize categorical data.

Today's Objectives

  • Understand Dataset Structure: Comprehend the general structure of a dataset and its specific representation in SPSS.

  • Learn Variable Types in SPSS: Differentiate between various variable types like continuous and categorical (including binary, ordinal, and nominal) and understand how to work with them in SPSS.

  • Summarize Categorical Variables: Learn methods to summarize binary and categorical variables, including calculating counts (nn), percentages (%\%), creating cross-tabulations, and generating bar plots.

  • Read and Explore Data in SPSS:

    • Opening Data Files: Navigate via File > Open > Data.

    • Descriptive Statistics: Utilize Analyze > Descriptive Statistics > Frequencies and Analyze > Descriptive Statistics > Crosstabs.

    • Graphical Representation: Create Graphs > Bar > Simple and Graphs > Bar > Cluster plots.

  • Hands-on Application: Begin reading and exploring data in SPSS using the provided Hypertension Dataset.

What is a Dataset?

  • A dataset is essentially the collection of data from a study.

  • It is typically organized as a rectangular data table where:

    • Each row represents an observation.

      • An observation is the unit of study, referring to an individual, subject, or participant.

    • Each column represents a variable.

      • A variable is a specific measurement or property recorded for each observation.

Variable Types

  • Continuous (or Numerical) Variables:

    • Take values along a number line.

    • Examples: Age (measured in years), serum cholesterol (measured in mg/dL).

    • Significance: The unit of measure is crucial for interpretation.

  • Categorical Variables:

    • Observations fall into one of several distinct values or categories.

    • Ordinal Categories:

      • Categories can be ranked or ordered.

      • Example: Cancer stage (I, II, III, IV) – Stage IV is