ACCT 331: WEEK 9

ACCT 331: INTRODUCTION TO APPLIED ARTIFICIAL INTELLIGENCE

Course Information

Location: Schreiber #302
Schedule: Tuesday & Thursday, 1:00-2:15
Course Code: 10304
Additional Notes: www.company.com

MATHEMATICS IN ARTIFICIAL INTELLIGENCE

AI is firmly rooted in mathematics:
- It is not magic, but rather mathematics.
- Emphasis on quantitative methods used to model, predict, and interpret data.

NEWS YOU CAN USE

Walmart and OpenAI Collaboration

Partnership Overview:
- Walmart partners with OpenAI to enhance shopper experiences.
Key Insights:
- Customers can make purchases within ChatGPT.
- AI-first shopping moves from reactive to proactive.
- Enhancements in product catalogues and customer service.
- Educational initiatives for associates to promote AI literacy.

JPMorgan Chase AI Research Associate Program - Internship

Job Information:
- Position: AI Research Associate
- Location: New York, NY (383 Madison Ave, New York, NY, 10179, US)
- Salary Range: $135,000.00-$155,000.00
Description:
- Focus on cutting-edge research involving AI, machine learning (ML), and cryptography.
- Collaborate with teams to integrate AI solutions in finance.

Data Science Internship at IBM Quantum

Opportunity: Summer internship available at IBM Quantum.
Contact: Daniella Pombo, daniellapombo@ibm.com

AI Internship at CME Group

Position: Fall intern focusing on AI & Data.
Requirements: Strong background in CS/AI/ML, effective communication skills.
Contact: vijay.pillai@cmegroup.com

AI'S IMPACT ON VENTURE CAPITAL AND APPLICATION DEVELOPMENT

Insight from Byron Deeter, Bessemer Venture Partners:
- AI is the major technological opportunity of our times, surpassing the cloud computing revolution.
- Future of successful companies predicted to emerge in consumer-facing AI applications, with focus on productivity enhancement.

DATA MANAGEMENT AND ANALYTICS IN AI

Current Trends:
- AI dominates headlines, yet demand remains for data professionals in data science and analytics fields.
- Need for roles like data engineers and cloud data architects increases due to complexity of AI systems.

KEY FINDINGS FROM CISCO AI RESEARCH

Adoption Insights:
- AI-ready companies outperform peers and are more likely to see success in innovation.
Barriers:
- Rising workloads and lack of centralized data hinder AI adoption.

SCHEDULE AND TOPICS

Important Dates:
- SKP Release: 10/23/25; Due: 10/30/25
- Week 12 Topics:
- Model evaluation for classification
- Model selection and interpretability
- Fairness in machine learning use cases

MEASURING ERROR OF REGRESSION MODELS

Assessing Model Accuracy

Objective: Assess how well model predictions match actual data.
Key Metrics:
1. R² Statistic: Indicates the proportion of variance explained by the model.
2. Mean Absolute Error (MAE): Average magnitude of errors in predictions.
3. Mean Squared Error (MSE): Average of squared differences between predicted and actual values.
4. Root Mean Squared Error (RMSE): Square root of MSE, providing error metric in same units as original data.

Error Function

Definition: Compares predictions with actual outcomes, returning smaller values for better predictions and larger values for poorer predictions.

REGRESSION MODEL FORMULA

Assumed model relationship: Y = f(X) + oldsymbol{oldsymbol{ ext{ϵ}}}
Linear approximation: Y = eta0 + eta1X + oldsymbol{oldsymbol{ϵ}}

MEASURING ACCURACY FOR CLASSIFICATION MODELS

Key Considerations:
- Evaluation of prediction correctness is essential for model selection and feature refinement.

Classification Metrics

Accuracy: Determines overall correctness of predictions.
Precision: $ext{Precision} = rac{TP}{TP + FP}$ , focusing on minimizing false positives.
Recall (Sensitivity): $ext{Recall} = rac{TP}{TP + FN}$ , focusing on minimizing false negatives.
F1-Score: $F1 = 2 imes rac{ ext{Precision} imes ext{Recall}}{ ext{Precision} + ext{Recall}}$ , achieving balance between precision and recall.
ROC and AUC: Measure aggregate performance across different classification thresholds. The AUC indicates overall classifier performance, with values closer to 1 indicating better performance.

CONFUSION MATRIX

Definition: Visual representation of prediction performance broken down by True Positives (TP), False Positives (FP), True Negatives (TN), and False Negatives (FN).
- Key Metrics Derived:
- Accuracy: $(TP + TN) / (TP + FN +TN + FP)$
- Precision: $TP / (TP + FP)$
- Recall: $TP / (TP + FN)$
- F1 Score: $2 imes (Precision imes Recall) / (Precision + Recall)$
Interpretation:
- Diagonal elements reflect correct predictions, while off-diagonal elements reflect classification errors.

ERROR IN CLASSIFICATION MODELS

False Positive (FP): Incorrectly predicting true outcome as positive.
False Negative (FN): Incorrectly predicting a negative outcome as positive.
Examples:
- Cancer diagnosis: High FP leads to unnecessary anxiety.
- Fraud detection: High FN leads to financial losses.

ACCURACY EXPLAINED

Accuracy Formula: $ext{Accuracy} = rac{TP + TN}{TP + TN + FP + FN}$
Concerns with Imbalanced Classes: Accuracy is insufficient; should accompany other metrics for robust evaluation.

METRICS TRADE-OFF AND INTERPRETATION

Interpreting Metrics:
- High Precision & Low Recall: Model identifies few positives reliably.
- High Recall & Low Precision: Model identifies most positives but includes negatives.
- F1-Score: Provides single metric balancing Precision and Recall.

CLASSIFICATION USE CASE: MEDICAL DIAGNOSIS

Key Applications

Cancer detection from imaging (mammography).
Cardiac risk prediction from lab results.
Neurological disorder identification from clinical data.

Algorithms

Convolutional Neural Networks (CNN)
Random Forest
Support Vector Machines (SVM)
XGBoost

Challenges

Data imbalance in rare disease diagnosis.
Need for interpretability in clinical applications.

MODEL SELECTION IN MACHINE LEARNING

Overview

Purpose: Selecting the best model for task performance from candidates.
Stages:
- Comparing algorithms (e.g., decision trees vs. neural networks).
- Evaluating hyperparameters for model optimization.

Factors to Consider

Model training duration and explainability.
Impact of data preparation steps, pipeline considerations, and maintainability.

ETHICS AND FAIRNESS IN AI

Importance: Models must aim for fairness and avoid discrimination based on sensitive attributes (race, gender, etc.).
Legal & Ethical Obligations: Comply with regulations for fairness, transparency, and accountability.
Bias Sources: Unfair models emerge from biased data, inappropriate feature selections, and black-box nature of complex models.

LEGISLATIVE REGULATIONS

Recent regulations emphasize data protection, algorithmic transparency, fairness, and ethical AI use.

PRODUCT MANAGEMENT IN AI CONTEXT

Basket Composition & Product Substitution Models

Model 1: K-means clustering for product basket analysis to ensure co-stocking.
Model 2: Predictive modeling for substitution patterns using XGBoost regression.

PERFORMANCE METRICS - CASE STUDY**

Insights on Retail Impact

Demand patterns affected by COVID-19 and holiday peaks; elevated OOS rates observed.
Operational excellence achieved notable stock-out reductions.

Key Metrics

Customer satisfaction increases from improved product availability.

KEY TAKEAWAYS FOR AI PROJECTS

Establish clear objectives and metrics before model development.
Invest in A/B testing and experimental design to validate performance.
Thoroughly explore data for informed modeling decisions.

ADDITIONAL RESOURCES

Investigate readings about deep learning and large language models.
Review ethical implications and regulatory responsibilities in AI deployment.

CLOSING REMARKS

For Q&A: Steven Keith Platt, Director of Analytics and Executive Lecturer of Applied AI, Loyola University Chicago.
Contact: splatt1@luc.edu