Unit 5- Activity 1: Comprehensive Study Notes on Categorical Variables

Relationship Between Categorical Variables Overview

  • Conceptual Framework: This unit focuses on analyzing the connection between two categorical variables using contingency tables. These tables allow for the calculation of probabilities to determine how categories within one variable relate to categories within another.

  • Mathematical Definitions:     * Marginal Probability: The probability of a single categorical outcome based on the total row or column count relative to the grand total. It focuses on one variable at a time (e.g., the total percentage of males in a group regardless of other factors).     * Conditional Probability: The probability of an outcome given that another specific condition is already met. This narrows the denominator to a specific row total or column total (e.g., the percentage of females, specifically within the group of highly motivated individuals).

Case Study: Motivation Level and Gender (Slide 8)

  • Raw Data Table:

Gender

Low Motivation

Medium Motivation

High Motivation

Total

Female

3232

5252

1515

9999

Male

2828

5555

2626

109109

Total

6060

107107

4141

208208

  • Analysis and Multiple Choice Questions:     * Q1: Percentage of Males (Marginal Probability):         * Formula: Total MalesGrand Total\frac{\text{Total Males}}{\text{Grand Total}}         * Calculation: 1092080.52403\frac{109}{208} \approx 0.52403         * Answer: d) 52.4%52.4\%     * Q2: Percentage of Males reporting High (Conditional Probability):         * Formula: Male and High MotivationTotal Male\frac{\text{Male and High Motivation}}{\text{Total Male}}         * Calculation: 261090.2385\frac{26}{109} \approx 0.2385, which rounds to 23.9%23.9\%         * Answer: a) 23.9%23.9\%     * Q3: Percentage of High Motivated who are Female (Conditional Probability):         * Formula: Female and High MotivationTotal High Motivation\frac{\text{Female and High Motivation}}{\text{Total High Motivation}}         * Calculation: 15410.36585\frac{15}{41} \approx 0.36585         * Answer: b) 36.6%36.6\%     * Q4: Least common motivation level for both genders:         * Comparison of totals: Low (6060), Medium (107107), High (4141).         * Answer: c) High

  • Short Answer Analysis:     * Q5: Total Reporting High Motivation: 4141     * Q6: Percentage of Overall Medium Motivation (Marginal Probability):         * Formula: Total Medium MotivationGrand Total\frac{\text{Total Medium Motivation}}{\text{Grand Total}}         * Calculation: 107208×10051.44%\frac{107}{208} \times 100 \approx 51.44\%     * Q7: Comparison of Low Motivation Rates:         * Female Low Rate: 3299×10032.3%\frac{32}{99} \times 100 \approx 32.3\%         * Male Low Rate: 28109×10025.7%\frac{28}{109} \times 100 \approx 25.7\%         * Comparison: Females have the higher rate of low motivation by approximately 32.3%25.7%=6.6%32.3\% - 25.7\% = 6.6\%.

Case Study: Teachers’ Quality Based on School Type (Slide 13)

  • Raw Data Table:

School Type

Low Quality

Medium Quality

High Quality

Total

Private

99

3030

2525

6464

Public

1010

155155

3636

201201

Total

1919

185185

6161

265265

  • Calculated Proportions:     * a) Percentage of Medium Quality Teachers in Private Schools:         * Calculation: 3064×100=46.875%\frac{30}{64} \times 100 = 46.875\%     * b) Percentage of High Quality Teachers in Public Schools:         * Calculation: 36201×10017.91%\frac{36}{201} \times 100 \approx 17.91\%

Case Study: Laptop Sales per Gender (Slide 15)

  • Raw Data Table:

Group

PC Sale

Mac Sale

Total

Males

6565

4545

110110

Females

3535

8585

120120

Total

100100

130130

230230

  • Short Answer Analysis:     * a) Percentage of All Sales that are MAC:         * Calculation: 130230×10056.52%\frac{130}{230} \times 100 \approx 56.52\%     * c) Percentage of PC Customers who are Male:         * Calculation: 65100×100=65%\frac{65}{100} \times 100 = 65\%     * d) Percentage of Sales from Female Customers (Marginal Probability):         * Calculation: 120230×10052.17%\frac{120}{230} \times 100 \approx 52.17\%     * e) Percentage of MAC Customers who are Female:         * Calculation: 85130×10065.38%\frac{85}{130} \times 100 \approx 65.38\%

Activity: Instagram Users by Age Group (Slide 15/Slide 19)

  • Raw Data Table:

Age Group

Use Instagram: Yes

Use Instagram: No

Total

18—29

225225

127127

352352

30—49

211211

317317

528528

50—64

114114

430430

544544

65+

5252

477477

529529

Total

602602

13511351

19531953

  • Multiple Choice Questions Analysis:     * Q1: Percentage of all respondents who DO NOT use Instagram:         * Formula: Total NoGrand Total\frac{\text{Total No}}{\text{Grand Total}}         * Calculation: 135119530.69175\frac{1351}{1953} \approx 0.69175         * Answer: d) 69.2%69.2\%     * Q2: Percentage of all respondents in the 18–29 age group:         * Formula: Total 18—29Grand Total\frac{\text{Total 18—29}}{\text{Grand Total}}         * Calculation: 35219530.18023\frac{352}{1953} \approx 0.18023         * Answer: b) 18.0%18.0\%     * Q3: Percentage of those aged 30–49 who use Instagram:         * Formula: 30—49 YesTotal 30—49\frac{\text{30—49 Yes}}{\text{Total 30—49}}         * Calculation: 2115280.39962\frac{211}{528} \approx 0.39962         * Answer: a) 40%40\%     * Q4: Percentage of those aged 65+ who do not use Instagram:         * Formula: 65+ NoTotal 65+\frac{\text{65+ No}}{\text{Total 65+}}         * Calculation: 4775290.9017\frac{477}{529} \approx 0.9017         * Answer: d) 90.2%90.2\%

Activity: Commuting and Parking Tickets Case Study

  • Survey Narrative: In a statistics project, students were surveyed about commuting by car and receiving parking tickets.     * Total who commute by car = 2525. From this group, 1919 received a ticket.     * Total who do not commute by car = 2828. From this group, 77 received a ticket.

  • Variable Identification:     * Explanatory Variable: Commute to school by car (Yes or No). This is the variable that predicts or influences the other.     * Response Variable: Whether the student had ever received a parking ticket (Yes or No). This is the outcome variable being measured.

  • Completed Contingency Table:

Group

Received Ticket: Yes

Received Ticket: No

Total

Commute by Car: Yes

1919

66 (Calculated: 251925 - 19)

2525

Commute by Car: No

77

2121 (Calculated: 28728 - 7)

2828

Total

2626

2727

5353