Untitled Notes

QMB 3302 - Week 4 Study Notes

What-if Analysis and Breakeven Analysis

Many business problems require analysis to find the appropriate level of some activity. This could be to:
- Maximize profit
- Minimize cost
- Achieve breakeven (no profit, no loss)

Example 1: Breakeven Analysis at Quality Sweaters

Quality Sweaters Company specializes in selling hand-knitted sweaters.
The company plans to:
- Print a $20,000 catalog plus $0.10 per catalog printed.
- Mail catalogs at $0.15 each (includes postage and marketing data).
- Include a direct reply envelope costing $0.20 per envelope used by the respondent.
Average customer order value: $40
Variable cost per order (labor/material): 80% of order value = $32
Total catalogs planned to mail: 100,000

Key Questions for Analysis:

How does a change in response rate impact profit?
For what response rate does the company break even?

Creating the Spreadsheet Model for Breakeven Analysis

To create a range name in Excel:
- Highlight the desired cell(s)
- Click the Name Box next to the Formula Bar and type a name.
To use a range name:
- Select a cell (e.g., cell G2), access the Use in Formula dropdown in the Formulas ribbon, and select Paste List.

Impact of Response Rate on Profit

A data table, known as a what-if table, can show the effect of changes in inputs on outputs.
A one-way data table can be set up to examine the effect of response rate (input) on profit (output).

Determining the Company's Breakeven Point

Breakeven point lies between 5% and 6% response rates.
To find the exact breakeven point using Goal Seek in Excel:
- Go to What-If Analysis dropdown and select Goal Seek. Fill out the dialog box:
- Set cell: Profit
- To value: 0
- By changing cell: Response_rate
Result from Goal Seek:
- Respond Rate: 5.77%, Profit: $0
- If response rate > 5.77%, the company makes a profit.
- If response rate < 5.77%, the company incurs a loss.

Formula Auditing Tool in Excel

The Formula Auditing tool helps visualize relationships between parts in the spreadsheet model.
Example: Traces dependents related to the Number of responses in cell E5, impacting:
- Total Revenue (cell E8)
- Total Variable Cost of Orders (cell E11)

Descriptive Analytics Overview

Explanation of Descriptive Statistics:
- Summarizing data for a categorical variable.
- Utilizes labels or names for categories (e.g., types of soft drinks).
- Summarizing data for quantitative variables utilizing numerical values (e.g., Age).
- Summarizing data for two variables to understand their relationships.

Summarizing Categorical Data

Types of Data Summarization:
- Frequency Distribution
- Relative Frequency Distribution
- Percent Frequency Distribution
- Graphical Representations: Bar Chart, Pie Chart

Frequency Distribution

A tabular summary showing the frequency (number of observations) for each category.
Provides insights that visual observation of raw data cannot yield.

Example: Marada Inn Quality Ratings

Guests rated accommodations as:
- Excellent
- Above Average
- Average
- Below Average
- Poor

Relative and Percent Frequency Distributions

Relative Frequency: Proportion of total data items in each class.
Percent Frequency: Relative frequency expressed as a percentage.
- Example: For a class with 1/20 items, relative frequency is 0.05, and percent frequency is 5%.

Bar Chart Representation

Displays qualitative data effectively:
- One axis for class labels, another for frequency or percent.
- Fixed-width bars above class labels, separated to reflect distinct categories.

Pie Chart Representation

Graphical display for relative and percent frequency distributions:
- Circle subdivided using relative frequencies for each class.
- E.g., A relative frequency of 0.25 corresponds to 90 degrees in the pie chart.

In-Class Exercise: Excel

Students to download an Excel exercise file from Canvas to practice.

Summarizing Quantitative Data

Methods include:
- Frequency Distribution
- Relative Frequency and Percent Frequency Distributions
- Histogram
- Cumulative Distributions.

Frequency Distribution - Quantitative Data Example

Study of parts costs for engine tune-ups based on 50 customer invoices focusing on:
1. Non-overlapping classes
2. Width of each class
3. Class limits

Guidelines for Class Limits

Each data point must belong to one class only:
- Lower class limit: smallest data value in the class
- Upper class limit: largest data value in the class
Open-end classes only require one limit.

Relative Frequency Insights from Hudson Auto Repair Example

Analysis indicates specific cost ranges and their frequencies (e.g., 32% of costs are in $70-$79 class).

Cumulative Distributions

Cumulative frequency distribution represents the count of items with values ≤ upper limit.
Cumulative relative distributions show the proportion below limits, while cumulative percent shows percentage.

Histogram Explanation

Graphical representation of quantitative data:
- Variable on horizontal axis, rectangles above each class interval corresponding to frequency.
- No gaps between rectangles, unlike bar graphs.

Skewness in Histograms

Types of Skewness:
- Moderately Skewed Left (tail on left)
- Symmetrical (equal tails)
- Moderately Right Skewed (tail on right)

In-Class Exercise: Excel

Another exercise is accessible via Canvas.

Summarizing Two Variables - Crosstabulation

Method for understanding relationships between two variables:
- Can be used for categorical with quantitative, both categorical, or both quantitative variables.
The table’s margins define classes for each variable.

Example: Finger Lakes Homes

Homes sold across styles and prices analyzed:
- Highest count: split-level under $250,000.
- Very few A-Frame homes priced > $250,000.

Crosstabulation - Row Percentages

Converting entries to percentages for better insight:
- Example of row percentages revealing relationships.

Crosstabulation - Column Percentages

Calculating column percentages:
- Example using home styles and their prices under and over $250,000.

Simpson’s Paradox in Crosstabulation

Caution in summarizing data: aggregated data can reverse conclusions observed in unaggregated data.

Summarizing Two Variables - Scatter Diagrams

Graphical presentation for two quantitative variable relationships:
- One variable on horizontal and another on vertical axis.
- Points suggest overall relationship, with trendline showing approximation.

Example: Panthers Football Team

Analyzing relationship between interceptions and points scored.
Insights:
- The data shows a positive correlation; as interceptions rise, points scored also rise, though not perfectly linear.

Tabular and Graphical Methods Summary

Methods comparing categorical and quantitative data with various tabular and graphical approaches.

Descriptive Statistics: Numerical Measures Recap

Key Measures:
- Central Tendency (Location) Measures
- Variability (Dispersion) Measures
- Sample Statistics vs. Population Parameters (Sample statistic = point estimator for population parameter).

Mean (Average)

Critical measure of central location:
- Defined as the total of all data values divided by the number of values.
- Sample mean denoted as $ar{x}$, population mean as $bc$.

Median Definition

Value separating the higher half from the lower half of data:
Most effective when dealing with extreme values.
Examples provided with odd and even number observations to illustrate calculation.

Mode Explanation

The mode represents the most frequently occurring value in a dataset:
- Data can be unimodal, bimodal, or multimodal based on frequency distribution.

Percentiles Overview

A percentile indicates how data is spread across the range from smallest to largest:
- The p-th percentile has at least p percent of the values at or below it.
- Steps involve arranging data in ascending order and locating p-th percentile.

Quartiles Explanation

Quartiles are specific percentiles that segment data:
- 1st Quartile = 25th Percentile
- 2nd Quartile = 50th Percentile = Median
- 3rd Quartile = 75th Percentile

Measures of Variability

Important to assess variability alongside central measures:
- Common measures of variability include:
1. Range
2. Interquartile Range (IQR)
3. Variance
4. Standard Deviation

Range Definition

The difference between maximum and minimum values:
- Calculation: Range = largest value - smallest value

Interquartile Range (IQR) Explanation

Difference between the first and third quartiles:
- Represents the spread of the middle 50% of data, less sensitive to extremes.

Variance Definition

Measures how much the data varies from the mean:
- Sample variance considers sample's differences and requires an adjustment (n-1).

Standard Deviation Definition

The standard deviation is the square root of the variance, facilitating easier interpretation due to compatible units with original data.

Z-Score Understanding

Standardized value indicating how many standard deviations an observation is from the mean:
- A value < mean gives negative z-score, = mean gives 0, and > mean gives positive.
- Excel’s STANDARDIZE function can compute z-scores.

Detecting Outliers

An outlier possesses a z-score < -3 or > +3, indicating an unusual value:
- May result from errors in data recording or legitimate variability.

Summary of Data Functions in Excel

Functions available for various measures in Excel:
- Mean: =average(data_range)
- Median: =median(data_range)
- Mode: =mode(data_range)
- Percentile: =percentile(data_range, p/100)
- Quartiles: =quartile(data_range, 1) and =quartile(data_range, 3)
- Range: =max(data_range) - min(data_range)
- IQR: =quartile(data_range, 3) - quartile(data_range, 1)
- Variance: =varp(data_range) / =var(data_range)
- Standard Deviation: =stdevp(data_range) / =stdev(data_range)