ADMN3121H - CHAPTER 12 LECTURE SLIDES (Blackboard)

Chapter 12: Data Analytic Thinking and Prediction

Learning Objectives

  • Explain how management accountants can work with data to create value.

  • Identify the questions management wants to ask and the relevant data.

  • Explain the elements of a decision tree model.

  • Describe how to refine a decision tree model to ensure data represents the business context.

  • Explain how to validate predictions of full versus refined decision trees.

  • Evaluate predictions of different data science models to choose the best one for business needs and visualize and communicate model insights.

  • Describe how to use and deploy data science models.

Data Science Basics in Management Accounting

  • Data Science: Use of data analytics to draw conclusions from data.

    • Intersects with:

      1. Computer science and data skills

      2. Math and statistics

      3. Subject matter expertise in a specific area along with management accounting knowledge.

Predictive Modeling

  • Predictive Modeling: A data science technique to make predictions using past or current data.

    • Utilizes large data sets to train sophisticated algorithmic models.

    • Models learn from training data to predict new records based on features of interest.

    • Helps understand cost drivers; primary goal is value creation across the value chain.

Data Science Framework

  • A seven-step decision-making process for applying machine learning techniques:

    1. Gain a business understanding of the problem.

    2. Obtain and explore data.

    3. Prepare data.

    4. Build a model.

    5. Evaluate the model.

    6. Visualize and communicate insights.

    7. Deploy the model.

Step 1: Gain a Business Understanding of the Problem

  • Essential questions for management accountants to examine for business decisions.

  • Forming questions is critical for a successful data science undertaking.

  • Evaluate vast data availability to determine the worth of analyses.

Step 2: Obtain and Explore Relevant Data

  • Management accountant's role is to evaluate data quality:

    • Objectivity of data: Is it estimated or carefully measured?

    • Relevance of data and costs for decisions.

  • Exploratory Data Analysis: Provides insight into a data set through numeric analysis (mean, median, etc.).

Step 3: Prepare the Data

  • Organizing and processing data:

    • Identifying additional needed data and measuring variables.

    • Data Leakage: Exclude data not available during the analysis.

    • Ensure features used for predictive modeling are appropriate and legal considerations involving personal data.

Step 4: Build a Model

  • Various models, from regression to neural networks, are used to analyze data.

  • Models fit data flexibly and rely on computational power.

  • Collaboration between management accountants and data scientists is key in model development.

Functional Relationship Models and Decision Trees

  • Functional Relationship Models: Assume specific relationships between feature and target variables.

  • Decision Trees: Easy to interpret and build, segmenting the target variable using rules.

    • Decision Tree Algorithm: Subdivides data based on features.

    • Recursive Partitioning: Continual subdivision until pure rectangles are created.

Decision Tree Illustration

  • Visualization of decision trees highlighting:

    • Decision nodes (indicated by circles) and their connecting paths.

    • Terminal nodes (rectangles) signify groupings that are pure.

Measuring Impurity: Gini Impurity

  • Gini Impurity: Measures the purity of observation collections.

    • High impurity indicates a mixed set; lower impurity suggests dominance of one class.

    • Steps to Calculate Gini Impurity:

      1. Establish baseline Gini impurity.

      2. Assess new Gini impurities for potential cuts.

Refining the Decision Tree

  • Management accountants help in refinement by addressing:

    1. Overfitting: Models adhering too closely to noise in data.

    2. Pruning: Limiting the tree's growth to a predetermined depth.

Overfitting and Pruning

  • Overfitting: Reduces model effectiveness by capturing random noise, affecting future predictions.

    • Recognizing overfitting is crucial for management accountants.

  • Pruning: Controls tree growth to enhance model performance, raising questions about optimal depth.

Validating and Choosing Models

  • Data scientists apply techniques to compare full vs. pruned decision trees:

    1. Cross-validation using prediction accuracy.

    2. Cross-validation using maximum likelihood.

    3. Testing on holdout samples.

Cross-Validation Techniques

  • Cross-Validation for Prediction Accuracy: Comparing model predictions on known outcomes.

  • Cross-Validation for Maximum Likelihood: Uses likelihood values to gauge model performance.

Model Evaluation and Criteria

  • Evaluate models based on:

    • Likelihood values, feature variable relevance, and misclassification rates.

    • ROC Curve and Confusion Matrix as tools for visual evaluation.

Visual Representation of Model Insights

  • Visualizing insights helps communicate model value:

    • Decision trees show separation of classes.

    • ROC curves depict model classification accuracy.

    • Confusion matrices outline predicted vs. actual classifications.

Step 7: Deploy the Model

  • Collaboration with managers to operationalize models:

    • Evaluation of necessary modifications and balance between quantitative and qualitative assessments.

    • Sensitivity evaluation of payoffs based on decisions.

    • Understanding statistical tools is key in creating value through informed decision-making.