Unit 1C: Sets and Venn Diagrams
Sets and Notation
A set is a collection of objects; the members of a set are the specific objects within it.
Sets are written by listing members inside braces: { }.
Use three dots, …, to indicate a continuing pattern if there are too many members to list.
A Venn diagram uses circles to represent sets.
Example concepts from the transcript:
- The set of countries larger in land area than the United States can be written as {Russia, Canada}.
- The set of natural numbers greater than 5 is {6, 7, 8, …}; the dots indicate the list continues to ever-larger numbers.
Key general statements:
- The set whales is a subset of the set mammals. ({Whales} ⊆ {Mammals})
- The set of fish is disjoint from the set of mammals. ({Fish} ∩ {Mammals} = ∅)
- The men who are not doctors concept shows that sets can overlap (e.g., doctors, women). In Venn diagrams, overlap means some members belong to both sets.
Quick visual intuition:
- Subset: every member of A is also a member of B.
- Disjoint: A and B share no members.
- Overlapping: A and B share some members.
Subset, Disjoint, and Overlapping Relationships
- A may be a subset of B (or vice versa), meaning all members of A are also members of B.
- Notation: A \subseteq B
- A may be disjoint from B, meaning that the two sets have no members in common.
- Notation: A \cap B = \emptyset
- A and B may be overlapping sets, meaning that the two sets share some of the same members.
- Notation: A \cap B \neq \emptyset
- Practical note: Venn diagrams illustrate these relationships with circles and their intersections.
Example: Venn Diagrams and Political Parties
- Describe the relationship between Democrats and Republicans (party affiliations):
- In the typical case, a person can be registered for only one party, so the sets Democrats and Republicans are disjoint.
- The region outside both circles represents people who are neither Democrats nor Republicans — i.e., those registered for other parties, independents, or not registered.
- This example emphasizes interpreting all regions of the diagram:
- Democrats only, Republicans only, both (if overlap existed), and neither.
Example: Sets of Numbers
- Task: Draw a Venn diagram showing relationships among natural numbers, whole numbers, integers, rational numbers, and real numbers.
- Question: Where are irrational numbers found in this diagram? (Irrationals are real numbers not rational; i.e., \mathbb{R} \setminus \mathbb{Q}.)
- Conceptual takeaway:
- There is a nested containment among standard number sets:
- \mathbb{N} \subseteq \mathbb{W} \subseteq \mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R}
- Irrational numbers lie in \mathbb{R} \setminus \mathbb{Q}, outside the rational numbers but inside the real numbers.
Categorical Propositions (Structure and Notation)
- Categorical propositions must have the structure of a complete sentence.
- One set appears in the subject (S); the other appears in the predicate (P).
- Example: “All whales are mammals.”
- S = whales, P = mammals.
- We typically rewrite this as All\ S\ are\ P with S and P replacing the actual sets: where S = whales and P = mammals.
- Common forms (with S and P as placeholders):
- All S are P
- No S are P
- Some S are P
- Some S are not P
Categorical Propositions (2): Forms and Interpretations
- All S are P
- No S are P
- All whales are mammals (example from the text)
- No fish are mammals (example from the text)
- These examples illustrate how the subject and predicate sets relate in Venn-like logic, though the diagrammatic representation requires a Venn diagram for visual interpretation.
Categorical Propositions (3): Examples
- Some S are P
- Some S are not P
- Examples from the text:
- Some doctors are psychiatrists
- Some foods are not vegetables
- These provide the existential and non-existential forms that can be translated into diagram regions or logical expressions.
Constructing a Venn Diagram from a Statement
- Statement: “Some dogs can swim.”
- Rephrase to: “Some dogs are animals that can swim.”
- Set definitions for the diagram:
- S = \text{dogs}
- P = \text{animals that can swim}
- The Venn diagram should show an intersection between S and P, indicating that there exist dogs that are also animals capable of swimming.
Example: Smoking and Pregnancy (Two-Way Table and Venn Diagram)
- Context: A study investigates whether a pregnant mother’s smoking status affects birth weight outcome (low vs normal).
- The table (Table 1.1) contains four numbers corresponding to the four combinations of two binary variables: smoking status (smoker vs nonsmoker) and baby birth weight status (low vs normal).
- a) Key facts summarized from the table:
- 18 babies were born with low birth weight to smoking mothers.
- 132 babies were born with normal birth weight to smoking mothers.
- 14 babies were born with low birth weight to nonsmoking mothers.
- 186 babies were born with normal birth weight to nonsmoking mothers.
- b) Venn diagram representation:
- The circles represent the sets: smoking mothers and low birth weight babies.
- Each region corresponds to one of the entries in Table 1.1.
- c) Brief interpretation from the diagram:
- Normal birth weight babies were much more common than low birth weight babies in both smoking and nonsmoking groups.
- Smoking mothers had a lower proportion of normal birth weight babies and a higher proportion of low birth weight babies.
- This pattern suggests that smoking increases the risk of delivering a low birth weight baby, a conclusion supported by careful statistical analysis in this and other studies.
- Additional notes: Two-way tables help assess associations between two categorical variables and visualize joint distributions.
Blood Types: Three Sets with Numbers (Antigens A, B, and Rh)
- Blood type classification:
- Three antigens: A, B, and Rh (present or absent for each).
- Blood type designation depends on which antigens are present:
- A: only A antigen present
- B: only B antigen present
- AB: both A and B antigens present
- O: neither A nor B antigens present
- Rh status adds positive (present) or negative (absent) to each type.
- The eight blood types that result, with their approximate percentages in the U.S. population:
- A Positive: 34\%
- B Positive: 8\%
- AB Positive: 3\%
- O Positive: 35\%
- A Negative: 8\%
- B Negative: 2\%
- AB Negative: 1\%
- O Negative: 9\%
- A three-set Venn diagram (A, B, Rh) can illustrate how the eight regions correspond to the eight blood types, with each region representing a unique combination of the presence/absence of A, B, and Rh:
- Region for A+, Region for A−, Region for B+, Region for B−, Region for AB+, Region for AB−, Region for O+, Region for O−.
- Practical takeaway:
- The percentages provide a probabilistic snapshot of blood type distribution in the population.
- The three-set Venn diagram links antigen presence to the observed blood types in a compact visual form.
Connections and Real-World Relevance
- Sets and Venn diagrams provide a formal framework for classifying objects and visualizing relationships between groups.
- Subset, disjoint, and overlapping relations underpin many logical and probabilistic reasoning tasks used in statistics, computer science, and data analysis.
- Categorical propositions (All/No/Some) translate everyday statements into formal logical forms useful for reasoning and proof.
- Two-way tables and Venn diagrams are practical tools for analyzing the relationship between two categorical variables, illustrating concepts such as association, independence, and conditional proportions.
- Blood type data exemplify how multiple binary attributes (A/B and Rh) interact to yield a complete set of categories; real-world data often require multi-set or multi-variable representations.
Mathematical Highlights (LaTeX Formatted)
- Subset relation: A \subseteq B
- Disjointness: A \cap B = \emptyset
- Overlap: A \cap B \neq \emptyset
- Nested number sets: \mathbb{N} \subseteq \mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R}
- Irrationals within reals: \mathbb{R} \setminus \mathbb{Q}
- Two-way table basics: counts for each combination (e.g., smoking vs birth weight) and derived totals.
- Blood type data: eight possible combinations corresponding to the presence/absence of A, B, and Rh antigens; listed percentages as above.
Quick Reference Formulas
Subset and intersection relations often used in Venn diagrams:
- A \subseteq B
- A \cap B = \emptyset
- A \cap B \neq \emptyset
Number set inclusions:
- \mathbb{N} \subseteq \mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R}
- \mathbb{R} \setminus \mathbb{Q} (irrationals)
From the smoking and pregnancy example, compute conditional proportions as needed:
- Total smoking: 18 + 132 = 150
- Total nonsmoking: 14 + 186 = 200
- Total low birth weight: 18 + 14 = 32
- Total normal birth weight: 132 + 186 = 318
- Grand total: 350
Conditional probability examples (optional additions):
- Probability(low birth weight | smoking) = \frac{18}{150} = 0.12
- Probability(low birth weight | nonsmoking) = \frac{14}{200} = 0.07
Note: The above notes mirror the content and examples from the provided transcript, organized into a comprehensive, study-ready format with explicit definitions, examples, and real-world connections.