Unit 1C: Sets and Venn Diagrams

Sets and Notation

  • A set is a collection of objects; the members of a set are the specific objects within it.

  • Sets are written by listing members inside braces: { }.

  • Use three dots, …, to indicate a continuing pattern if there are too many members to list.

  • A Venn diagram uses circles to represent sets.

  • Example concepts from the transcript:

    • The set of countries larger in land area than the United States can be written as {Russia, Canada}.
    • The set of natural numbers greater than 5 is {6, 7, 8, …}; the dots indicate the list continues to ever-larger numbers.
  • Key general statements:

    • The set whales is a subset of the set mammals. ({Whales} ⊆ {Mammals})
    • The set of fish is disjoint from the set of mammals. ({Fish} ∩ {Mammals} = ∅)
    • The men who are not doctors concept shows that sets can overlap (e.g., doctors, women). In Venn diagrams, overlap means some members belong to both sets.
  • Quick visual intuition:

    • Subset: every member of A is also a member of B.
    • Disjoint: A and B share no members.
    • Overlapping: A and B share some members.

Subset, Disjoint, and Overlapping Relationships

  • A may be a subset of B (or vice versa), meaning all members of A are also members of B.
    • Notation: A \subseteq B
  • A may be disjoint from B, meaning that the two sets have no members in common.
    • Notation: A \cap B = \emptyset
  • A and B may be overlapping sets, meaning that the two sets share some of the same members.
    • Notation: A \cap B \neq \emptyset
  • Practical note: Venn diagrams illustrate these relationships with circles and their intersections.

Example: Venn Diagrams and Political Parties

  • Describe the relationship between Democrats and Republicans (party affiliations):
    • In the typical case, a person can be registered for only one party, so the sets Democrats and Republicans are disjoint.
    • The region outside both circles represents people who are neither Democrats nor Republicans — i.e., those registered for other parties, independents, or not registered.
    • This example emphasizes interpreting all regions of the diagram:
    • Democrats only, Republicans only, both (if overlap existed), and neither.

Example: Sets of Numbers

  • Task: Draw a Venn diagram showing relationships among natural numbers, whole numbers, integers, rational numbers, and real numbers.
  • Question: Where are irrational numbers found in this diagram? (Irrationals are real numbers not rational; i.e., \mathbb{R} \setminus \mathbb{Q}.)
  • Conceptual takeaway:
    • There is a nested containment among standard number sets:
    • \mathbb{N} \subseteq \mathbb{W} \subseteq \mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R}
    • Irrational numbers lie in \mathbb{R} \setminus \mathbb{Q}, outside the rational numbers but inside the real numbers.

Categorical Propositions (Structure and Notation)

  • Categorical propositions must have the structure of a complete sentence.
  • One set appears in the subject (S); the other appears in the predicate (P).
  • Example: “All whales are mammals.”
    • S = whales, P = mammals.
    • We typically rewrite this as All\ S\ are\ P with S and P replacing the actual sets: where S = whales and P = mammals.
  • Common forms (with S and P as placeholders):
    • All S are P
    • No S are P
    • Some S are P
    • Some S are not P

Categorical Propositions (2): Forms and Interpretations

  • All S are P
  • No S are P
  • All whales are mammals (example from the text)
  • No fish are mammals (example from the text)
  • These examples illustrate how the subject and predicate sets relate in Venn-like logic, though the diagrammatic representation requires a Venn diagram for visual interpretation.

Categorical Propositions (3): Examples

  • Some S are P
  • Some S are not P
  • Examples from the text:
    • Some doctors are psychiatrists
    • Some foods are not vegetables
  • These provide the existential and non-existential forms that can be translated into diagram regions or logical expressions.

Constructing a Venn Diagram from a Statement

  • Statement: “Some dogs can swim.”
  • Rephrase to: “Some dogs are animals that can swim.”
  • Set definitions for the diagram:
    • S = \text{dogs}
    • P = \text{animals that can swim}
  • The Venn diagram should show an intersection between S and P, indicating that there exist dogs that are also animals capable of swimming.

Example: Smoking and Pregnancy (Two-Way Table and Venn Diagram)

  • Context: A study investigates whether a pregnant mother’s smoking status affects birth weight outcome (low vs normal).
  • The table (Table 1.1) contains four numbers corresponding to the four combinations of two binary variables: smoking status (smoker vs nonsmoker) and baby birth weight status (low vs normal).
  • a) Key facts summarized from the table:
    • 18 babies were born with low birth weight to smoking mothers.
    • 132 babies were born with normal birth weight to smoking mothers.
    • 14 babies were born with low birth weight to nonsmoking mothers.
    • 186 babies were born with normal birth weight to nonsmoking mothers.
  • b) Venn diagram representation:
    • The circles represent the sets: smoking mothers and low birth weight babies.
    • Each region corresponds to one of the entries in Table 1.1.
  • c) Brief interpretation from the diagram:
    • Normal birth weight babies were much more common than low birth weight babies in both smoking and nonsmoking groups.
    • Smoking mothers had a lower proportion of normal birth weight babies and a higher proportion of low birth weight babies.
    • This pattern suggests that smoking increases the risk of delivering a low birth weight baby, a conclusion supported by careful statistical analysis in this and other studies.
  • Additional notes: Two-way tables help assess associations between two categorical variables and visualize joint distributions.

Blood Types: Three Sets with Numbers (Antigens A, B, and Rh)

  • Blood type classification:
    • Three antigens: A, B, and Rh (present or absent for each).
    • Blood type designation depends on which antigens are present:
    • A: only A antigen present
    • B: only B antigen present
    • AB: both A and B antigens present
    • O: neither A nor B antigens present
    • Rh status adds positive (present) or negative (absent) to each type.
  • The eight blood types that result, with their approximate percentages in the U.S. population:
    • A Positive: 34\%
    • B Positive: 8\%
    • AB Positive: 3\%
    • O Positive: 35\%
    • A Negative: 8\%
    • B Negative: 2\%
    • AB Negative: 1\%
    • O Negative: 9\%
  • A three-set Venn diagram (A, B, Rh) can illustrate how the eight regions correspond to the eight blood types, with each region representing a unique combination of the presence/absence of A, B, and Rh:
    • Region for A+, Region for A−, Region for B+, Region for B−, Region for AB+, Region for AB−, Region for O+, Region for O−.
  • Practical takeaway:
    • The percentages provide a probabilistic snapshot of blood type distribution in the population.
    • The three-set Venn diagram links antigen presence to the observed blood types in a compact visual form.

Connections and Real-World Relevance

  • Sets and Venn diagrams provide a formal framework for classifying objects and visualizing relationships between groups.
  • Subset, disjoint, and overlapping relations underpin many logical and probabilistic reasoning tasks used in statistics, computer science, and data analysis.
  • Categorical propositions (All/No/Some) translate everyday statements into formal logical forms useful for reasoning and proof.
  • Two-way tables and Venn diagrams are practical tools for analyzing the relationship between two categorical variables, illustrating concepts such as association, independence, and conditional proportions.
  • Blood type data exemplify how multiple binary attributes (A/B and Rh) interact to yield a complete set of categories; real-world data often require multi-set or multi-variable representations.

Mathematical Highlights (LaTeX Formatted)

  • Subset relation: A \subseteq B
  • Disjointness: A \cap B = \emptyset
  • Overlap: A \cap B \neq \emptyset
  • Nested number sets: \mathbb{N} \subseteq \mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R}
  • Irrationals within reals: \mathbb{R} \setminus \mathbb{Q}
  • Two-way table basics: counts for each combination (e.g., smoking vs birth weight) and derived totals.
  • Blood type data: eight possible combinations corresponding to the presence/absence of A, B, and Rh antigens; listed percentages as above.

Quick Reference Formulas

  • Subset and intersection relations often used in Venn diagrams:

    • A \subseteq B
    • A \cap B = \emptyset
    • A \cap B \neq \emptyset
  • Number set inclusions:

    • \mathbb{N} \subseteq \mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R}
    • \mathbb{R} \setminus \mathbb{Q} (irrationals)
  • From the smoking and pregnancy example, compute conditional proportions as needed:

    • Total smoking: 18 + 132 = 150
    • Total nonsmoking: 14 + 186 = 200
    • Total low birth weight: 18 + 14 = 32
    • Total normal birth weight: 132 + 186 = 318
    • Grand total: 350
  • Conditional probability examples (optional additions):

    • Probability(low birth weight | smoking) = \frac{18}{150} = 0.12
    • Probability(low birth weight | nonsmoking) = \frac{14}{200} = 0.07
  • Note: The above notes mirror the content and examples from the provided transcript, organized into a comprehensive, study-ready format with explicit definitions, examples, and real-world connections.