MIST 5440 - Final Exam Review Guide

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/121

There's no tags or description

Looks like no tags are added yet.

Last updated 7:23 PM on 4/30/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

122 Terms

New cards

What do we mean by AI for not bad? What is the distinction with AI for good?

AI for NOT Bad

Avoid ethical pitfalls in one’s pursuit of one’s goals
Risk mitigation

AI for Good

To create positive social impact

New cards

What are the consequences of ethical risks (happen at scale, reputational, regulatory, and legal)?

AI scales — therefore risks affect & can harm a large number of individuals
These ethical risks create significant reputational, regulatory, and legal risks that are costly in terms of money, resources, and money to address and loss of reputation and consumer trust
- Many of these arise and are realized when AI is in use
Differentiate between intended or anticipated, and unintended risks

New cards

Why are corporate codes of conduct and current regulations not adequate to cover AI ethical risks?

Corporate Codes of Conduct

Corporate codes of conduct cover employees’ behavior, but AI ethical risks are not realized because of bad behavior
They result from:
- Not thinking through the consequences
- Not monitoring AI “in the wild“
- Not knowing what one should be on the lookout for when developing or procuring AI

Current Regulations

New regulations are needed as there are new ways to break the law
Some techniques for mitigating AI ethical risks can be:
- Legally compliant but ethically risky
- Ethically sound but illegal AI

New cards

What are the big three AI ethical challenges?

Contents of AI Ethics Program

Bias
Lack of Explainability
Privacy

(but other AI Ethics risks exist as well and should not be overlooked)

New cards

What is the difference between content and structure for developing AI ethics programs?

Content defines what ethical risks and values an organization cares about
Structure is the system of processes and governance used to identify, evaluate, and mitigate those risks in practice.

New cards

Be able to define content and structure and discuss their importance in AI ethics programs

Content

Reflect a deep understanding of all the ethical risks we are trying to avoid
What does the organization see as good or bad
“What are the risks we’re trying to avoid?”

Structure

Leave no stone unturned in looking for them
Formal mechanisms for identifying and mitigating ethical risks
“How do we operationalize ethical risk mitigation”

New cards

Why are misperceptions about the nature of ethics a major obstacle to organizational buy-in to developing AI ethics programs? What are these misperceptions?

Getting buy-in requires that people understand what AI ethics is, and especially what ethics is
Ethics is based on well-founded standards of right and wrong that prescribe what humans ought to do, usually in terms of rights, obligations, benefits to society, fairness, or specific virtues
Distinction between ethical right or wrong and people’s ethical beliefs
- If ethics (what is right or wrong) is subjective, then there is no such thing as responsible ethical inquiry because no one can possibly be incorrect in their conclusions

New cards

What specific misperceptions are preventing genuine organizational buy-in to developing AI ethics programs?

Three reasons why ethics are misperceived as subjective:

Ethics is subjective because people disagree about what is right and wrong
Science delivers us truth. Ethics isn’t science so it doesn’t deliver us truth
Ethics requires an authority figure to say what’s right or wrong; otherwise, it is subjective

New cards

Why is it not advisable to focus on consumer ethical beliefs (perceptions) as the basis for the organization’s AI ethics program?

Consumers’ perceptions are too coarse-grained for the fine-grained problems you are facing
Your problems are ones that your consumers have not even thought about yet
Consumers are looking for ethical leadership, and a mere appeal to the sentiment of the day does not meet the bar

The approach will alienate both those who are not particularly concerned about the ethical risks of AI within your organization and those who are, leading to a lack of compliance and turnover, respectively

New cards

Key Takeaway from getting buy-in for AI ethics programs

Creating an AI ethics program requires buy-in from the top to the bottom & requires education about ethics (and AI) to genuine organizational buy-in

New cards

What is responsible AI?

A governance framework that documents how a specific organization is addressing the challenges around artificial intelligence (AI) from both an ethical and legal point of view.

Resolving ambiguity for where responsibility lies if something goes wrong is an important driver for responsible AI initiatives

New cards

Why is there a movement towards the necessity of responsible AI practices?

When not designed in a thoughtful and responsible manner, AI systems can be biased insecure, and not compliant with existing laws, even going so far as to violate human rights. AI presents a significant risk of financial and reputational harm for companies that haven't thought through their strategies and roadmaps

New cards

We saw examples of Responsible AI frameworks for many organizations. What are the common (essential) characteristics of these frameworks?

Promote AI benefits and mitigate AI harm/risks by ensuring:

Fairness
Interpretability
Privacy
Security
Reliability

Do so through creating governance structures and accountability and by translating principles to actionable guidelines and tools

New cards

What is fairness?

Impartial and just treatment or behavior without favoritism or discrimination

“Fair“ does not mean “equal“ even though equal opportunity for equal benefit is central to its meaning. Fairness means that everybody in the group has an equal opportunity to benefit

New cards

What is a major challenge to achieving fairness?

Deciding what fairness actually means

When a particular approach is the right one to use

New cards

Equality v. Equity

Equality

Everybody is treated the same
Fails to take into account that not every one of us starts from the same place and that some might need different support that others do

Equity

Giving everybody equal access to the same outcomes
Adjusts for unfair disadvantages to ensure fair, just outcomes

<p>Equality</p><ul><li><p><strong>Everybody is treated the same</strong></p></li><li><p>Fails to take into account that not every one of us starts from the same place and that some might need different support that others do</p></li></ul><p>Equity</p><ul><li><p>Giving everybody equal access to the same outcomes</p></li><li><p><strong>Adjusts for unfair disadvantages</strong> to ensure fair, just outcomes</p></li></ul><p></p>

New cards

Disparate Treatment v. Disparate Impact

Disparate Treatment

Liability could be imposed if there is an explicit classification based on the protected attribute or if there was an intent/motive to discriminate

Disparate Impact

Even if the policy is neutral on its face, if there is a disproportionately adverse impact on minority groups, liability will be imposed.

<p>Disparate Treatment</p><ul><li><p>Liability could be imposed if there is an explicit classification based on the protected attribute or if there was an intent/motive to discriminate</p></li></ul><p>Disparate Impact</p><ul><li><p>Even if the policy is neutral on its face, if there is a disproportionately adverse impact on minority groups, liability will be imposed.</p></li></ul><p></p>

New cards

What are the three steps needed to define and quantify fairness in building fair ML systems?

What is the right definition of fair outcome for the specific use case?
Who selects and defines what is a fair outcome and for whom?
What are the metrics we use?

New cards

What is the Group Fairness vs. Individual Fairness tradeoff?

Group fairness ensures equality between groups, while individual fairness ensures consistency between similar people, and optimizing one often means compromising the other.

New cards

Group Fairness Approaches

Aim to achieve the same outcomes across different demographics, or more generally, a set of protected population classes.

The population that receives a given assessment by the algorithm (let it be positive or negative) should reflect the whole population and its demographics.
The types of mistakes the model makes, and the severity of these errors, should be evenly distributed across the population.
BUT... being fair with respect to parity can seem highly unfair from a single individual’s viewpoint.

New cards

Individual Fairness

Advocates treating similar individuals similarly

New cards

Individual Fairness: The Unaware Approach

Algorithm is blinded or unaware of any identifiable factors and prohibited attributes by law such as gender, race, sexual orientation, etc. After removing the prohibited attributes, the factors with the highest correlation to the outcome are considered
The unaware approach can work only in cases where inequality is not an issue; a highly sterilized environment where the group of individuals that the algorithm classifies between is very homogenic.

New cards

Strengths & Limitations of Individual Fairness: The Unaware Approach

Strengths

Removes protected attributes (e.g., race, gender), making it legally compliant and easy to justify
Simple to implement and aligns with “merit-based” decision-making
Treats individuals consistently based on observable qualifications

Limitations

Other variables (e.g., ZIP code) can act as proxies, so bias still seeps in
Ignores structural inequalities in access to resources and opportunities
Can produce unequal outcomes across groups despite appearing “fair”
Creates a false sense of neutrality (“colorblindness” critique)
Only works well in rare, highly homogeneous or equal environments

New cards

Individual Fairness: The Awareness Approach

Two similar individuals should be treated similarly.

Individual fairness relies entirely on how you define "similarity" between applicants, and you can run the risk of introducing new fairness problems if your similarity metric misses important information. It is hard to determine what is an appropriate metric function to measure the similarity of two inputs.

New cards

Strengths & Limitations of Individual Fairness: The Awareness Approach

Strengths

Promotes consistency: similar applicants are treated similarly
Focuses on individuals rather than group averages, aligning with merit-based decisions
Intuitively fair and easy to justify in principle

Limitations

Depends heavily on how “similarity” is defined—there’s no clear or objective metric
Different reasonable metrics can lead to different (and conflicting) outcomes
Sensitive attributes complicate things—unclear whether or how they should factor into similarity
Can introduce new biases if the similarity function overlooks important context or structural inequality

New cards

Fairness Notions: Hybrid Approaches

Combine group and individual fairness.

Examples:
- Learning Fair Representations approach (abstract transformations of the data points into high-dimensional numeric vectors) that could be used in downstream modeling tasks.
- Individual Risk Scores approach that uses a thresholding policy to treat similarly risky individuals the same way.

New cards

Demographic (or Statistical) Parity

Scenario

AI model classifies patients as high risk (extra screening) or low risk (no action)
Two groups: Group A (80%) and Group B (20%), otherwise medically similar

Demographic (Statistical) Parity Idea

Fairness defined as matching outcomes to population proportions
~80% of “high-risk” flags should go to Group A, ~20% to Group B
Prevents over-favoring one group (e.g., giving disproportionately more screenings to Group B)

Limitation of Demographic Parity

If Group B actually has a higher cancer rate, strict parity becomes harmful
Forces fewer Group B patients to be flagged → missed diagnoses
Forces more Group A patients to be flagged → unnecessary testing

Key Takeaway

Equal proportions ≠ medically optimal outcomes; enforcing parity can reduce overall accuracy and harm patients

New cards

Limitations of Demographic (Statistical) Parity

Can create unfair outcomes: forces unnecessary testing for healthy patients and misses truly sick ones
Ignores intersectionality: fairness for one group may still disadvantage overlapping groups
Reduces model accuracy (“detunes” the algorithm), increasing costs
Doesn’t consider ground truth (actual disease status), only outcome proportions

Key Insight

Better fairness approaches account for ground truth, often by balancing error rates across groups rather than just matching proportions

New cards

Confusion Matrix

New cards

Equality of False Negatives

Enforces constant false-negative rates across groups.

This means that patients – irrespective of their group – have the same probability of being missed.
This measure aims at reducing harm from missed diagnoses.
It does not amend false positives errors

New cards

Equal Opportunity with Fairness

The group that should be favored is the group of individuals who belong to the positive class.

This means that patients – irrespective of their group – have the same probability of being correctly classified as High Risk.
This measure aims at providing equal opportunity to be identified/ selected. Fairness among those who truly need treatment.
It does not amend false negative errors

New cards

Equality as Equalized Odds

Equalized odds requires the same true-positive and false-positive rates across groups.

Equal probability for patients from both groups who need treatment to be correctly detected as needing additional testing, and equal probability for patients from both groups who do not need testing to be selected for unnecessary follow-ups

New cards

Where is Equality as Equalized Odds Used

In a hiring example (gender), the probability of a qualified applicant being hired should be the same for males and females & the probability of an unqualified applicant being hired should be the same for males and females. —> Equalized odds
There is a strong emphasis on predicting the positive outcome correctly (e.g.: correctly identifying who should get a loan as it drives profits for the bank) and minimizing costly False Positives (e.g.: reducing the grant of loans to people who would not be able to pay back) —> Equalized odds
One of the highest levels of algorithmic fairness (but also costly)

New cards

Advantages & Limitations of Demographic (Statistical) Parity

Advantages

Ensures equal outcome rates across groups (e.g., similar % flagged high-risk)
Simple to understand, measure, and implement
Helps detect and prevent systematic disparities or underrepresentation
Aligns with goals of equal access to opportunities/resources

Limitations

Ignores ground truth (e.g., actual disease rates), focusing only on proportions
Can lead to harmful errors: unnecessary interventions or missed true cases
Reduces overall accuracy and can increase operational costs
Struggles with intersectionality (overlapping group identities)
May require treating individuals differently, conflicting with individual fairness

New cards

Advantages & Limitations of Equality of False Negatives

Advantages

Reduces missed true cases (e.g., sick patients not flagged)
Uses ground truth, making it more aligned with real-world outcomes
Prioritizes fairness in high-stakes situations like healthcare
Ensures each group has a similar chance of receiving needed intervention

Limitations

Ignores false positives (may increase unnecessary interventions for some groups)
May require different thresholds by group, raising fairness/legal concerns
Can reduce overall model accuracy
Does not ensure equal overall outcomes or representation across groups

New cards

Advantages & Limitations of Equal Opportunity

Strengths

Ensures individuals in the positive class (e.g., truly sick patients) have the same chance of being correctly identified as high risk
Provides fair access to needed treatment or intervention, regardless of group membership
Focuses fairness on those who truly need help, rather than just overall population proportions

Limitations

Does not address false negative errors (missed cases still occur and may be unequal in other ways)
Only equalizes true positive rates, ignoring other types of errors like false positives
May still result in unequal outcomes across groups in practice
Does not ensure overall parity or eliminate all forms of bias in the model

New cards

Advantages & Limitations of Equality as Equalized Odds

Advantages

Ensures equal true positive rates and false positive rates across groups
Provides fairness for both those who need help (correctly identified) and those who do not (avoiding unnecessary selection)
Reduces systematic bias in both beneficial and harmful errors
Useful in high-stakes settings (e.g., hiring, lending, healthcare) where both types of mistakes matter
Strong, comprehensive fairness standard often considered a high level of algorithmic fairness

Limitations

Can be computationally and operationally complex to enforce
Often leads to tradeoffs with overall accuracy and efficiency
May require different decision thresholds across groups, which can raise ethical or legal concerns
Can be costly for organizations to implement and maintain
May still conflict with other fairness goals (e.g., calibration or individual fairness)

New cards

The Accuracy vs. Fairness Trade-Off

Fairness always comes at a cost: as we put an additional constraint on the
model, we introduce a trade-off with accuracy
For example, when we compare a model that maximizes total revenue, a fairness
constrained model will probably promise less profit.
But sometimes, greater accuracy may lead to greater unfairness

<ul><li><p><span style="color: rgb(0, 0, 0);">Fairness always comes at a cost: as we put an additional constraint on the<br>model, we introduce a trade-off with accuracy</span></p></li><li><p><span style="color: rgb(0, 0, 0);">For example, when we compare a model that maximizes total revenue, a fairness<br>constrained model will probably promise less profit.</span></p></li><li><p><span style="color: rgb(0, 0, 0);">But sometimes, greater accuracy may lead to greater unfairness</span></p></li></ul><p></p>

New cards

Final Takeaways on Using Group & Individual Fairness Together

Fairness metrics usually either emphasize individual or group fairness, but fail to combine both.
Many approaches to group fairness often tackle between-group issues (e.g., between groups of different gender or race), but this can in fact increase the within-group unfairness (between members of the same group)
Reducing the between-group unfairness can exacerbate overall/individual unfairness: the overall unfairness in fact goes up.

New cards

Why is fairness complex and what can we do about it?

Fairness is highly contextual. There is no one-size-fits-all approach and it depends on the stakeholders and the application.
- Finding the right metrics and risk scores for individual assessments can be very challenging and needs to be done on a case-by-case basis.
There are no set answers. A lot of times cost and benefit decisions have to be made.
- There are tradeoffs
- Certain fairness requirements cannot be satisfied simultaneously.
Fairness is not a “measurement” as it implies a straightforward process, but a continuous process.

The implication of “measurement” is, however, precarious as it implies a straightforward process. However, it should be seen as an investigative process that requires detection, explanation, and mitigation. There is no single fairness checkpoint; harmful properties can enter a system under biased data and/or through data science practices and decisions. This triggers the need for strong internal governance, checklists, and monitoring

New cards

What is AI Bias and why is it important?

Occurs when an algorithm produces results that are systematically prejudiced against a specific group(s).
AI algorithms are increasingly used to distribute goods and services.
- information, loans, housing, jobs, interviews, diagnoses, medical treatments, punishments, etc.
Can have a significant impact on opportunities, health, and quality of life.

New cards

What are the sources of AI bias?

Problems with training data

Discrimination in real-world practices that are reflected in the (biased) training data
- E.g., biased hiring practices, loan practices, etc.
- Biased image labeling
Under-sampling/Selection bias: Data not representative of the population
Proxy Bias: using proxies for data that is difficult to access
- E.g., using arrest records to proxy criminal convictions

Problems with testing and how you think about the use case

Coarse-grained model: one-size-fits all (e.g., heart disease presents differently in men vs women)
Benchmark or testing bias
- Benchmarks are biased
- e.g., your algorithm predicts well in terms of true positive rates against a benchmark mortgage dataset. However, the dataset under-sampled black applicants. Therefore, you have a highly accurate mortgage-worthy-white-person predictor.
Objective function bias – goals may unintentionally lead to disparate impacts
among subpopulations
- E.g., lung transplant determinant that aims to maximize years out of the lungs (give it to
  someone who will live the longest, all other things being equal)

New cards

How can we identify and measure bias?

Rely on definitions of fairness and related computational measures of bias based on these definitions.

E.g., equal opportunity definitions of fairness assume equal true positive rates across groups; equal odds definitions of fairness assume equal true positive and equal false positive rates across groups

New cards

What are the major challenges in measuring bias?

Metrics of fairness
Timing of Identifying Bias
Allocative AND Representational Harms
Legal Issues
Choosing Risk Mitigation Strategies

New cards

What are some suggested guidelines in terms of structure that can be helpful for bias?

Metrics of Fairness
- Include individuals with relevant ethical, legal, and business expertise, who can determine which, if any, of the mathematical metrics of fairness are appropriate for a particular use case
Timing of Identifying Bias
- Attempts at identifying and mitigating potential biases of your models should start before training your model and, ideally, before determining what your training data sets should look like and be sourced from
Legal Issues
- Include a lawyer when determining the appropriate risk-
  mitigation technique
Choosing Risk Mitigation Strategies
- Include individuals with relevant expertise for choosing
  appropriate risk-mitigation strategies

New cards

What are representational harms vs. allocative harms and which is easier to identify through computational AI fairness measures?

Distinction between “representational harms” (algorithmically filtered depictions that are discriminatory) vs. “allocative harms” (unfairly assigned opportunities or resources by the algorithm)

Computational measures capture “allocative harms” of unjust distribution of goods and services.
Typically humans are needed to identify many types of representational harms.

Representational Harms

Occur when systems reinforce the subordination of some groups along the line of identity

New cards

What are some bias mitigation strategies?

Input Data
- Get more data in case of under-sampling
- Get better proxies (to mitigate proxy bias)
- Examine if one-size-fits-all model is right for your population
- Alter inputs or adjust their weights: e.g., zip code proxies for race. Remove zip code from data or reduce its weight
- Use bias-corrected synthetic data
Decisions/Output: Adjusting Thresholds for Outputs
- Use a binary yes or no or use thresholds? What should these thresholds be? Should you use different thresholds across different populations?
Identifying bias: Lack of demographic data against which to compare outputs
- Not clear how this can be addressed
- Synthetic data may be an option

New cards

Bias Examples Table

New cards

What is AI explainability?

The ability to understand and explain the outputs of a machine learning (ML) model in human terms.
Enables monitoring and accountability both during production and in the “wild” to ensure fairness and debiasing and mitigate against model drift.
Lack of explanation may lead to ethical, regulatory, or legal risk.

New cards

What are the objectives of AI explainability?

Explain to justify
Explain to control
Explain to Discover
Explain to Improve
Also to ensure fairness, to inform users about what they would need to change to get a favorable outcome, to show respect

<ul><li><p>Explain to justify</p></li><li><p>Explain to control</p></li><li><p>Explain to Discover</p></li><li><p>Explain to Improve</p></li><li><p>Also to ensure fairness, to inform users about what they would need to change to get a favorable outcome, to show respect</p></li></ul><p></p>

New cards

Why is AI explainability important?

Lack of explanation may lead to ethical, regulatory, or legal risk

New cards

Who are the various stakeholders that need explanations?

All system builders
- Who: data scientists, developers
- Why: ensure/improve performance
End user decision makers
- Who: physicians, judges, loan offices, recruiters
- Why: trust/confidence, insights
End consumers
- Who: patients, accused, loan applicants, job applicants
- Why: understanding of factors
Regulatory bodies
- Who: EU (GDPR), NYC Council, US Govt
- Why: ensure fairness for constituents (e.g., compliance to right to explanation by GDPR)

New cards

People Explanations

Why people made the decisions they did.
- E.g., we automated hiring decisions because we received more applications than we could handle; we set the threshold to 3.74 because of the way the outputs were clustered combined with knowledge of our company’s risk preference.

New cards

Machine Explanations

How the model arrived at its outputs given the inputs.

New cards

How do People v. Machine Explanations Differ?

Humans explain in terms of meaning and reasoning, while machines explain in terms of data patterns and computations

Machine explanations differ from people explanations
- Machine explanations are complex
  - # of variables at play, # of relationships, nature of relationships
- Not understandable in the same way human explanations are

New cards

What are the various types of machine explanations?

Global Explanations: What are the rules that transform inputs to outputs? What features are important to the model? How do features interact to produce the output?
Local Explanations: Why did the model yield this particular output given these particular inputs for this particular case? Why did the model come to this conclusion?

New cards

Why not ALWAYS implement Explainable AI?

Machine explanations

Require resources to develop
Tradeoff between accuracy and explainability
- The more data and the more patterns your ML finds, the less likely it is to understand what’s going

New cards

Types of Machine Learning Explanations: Global v. Local

Global Explanations
- Understanding the logic of the model as a whole
- What are the main features that influence the predictions of the model? (e.g., feature importance)? How do these interact to produce the output?
- Useful to data engineers, to managers approving use of the model, regulators
- E.g., SHAP, LOFO, PDP, Permutation FI
Local Explanations
- Understanding the reasons behind a specific prediction
- Why did the model come to this conclusion?
- Useful to decision-makers (e.g., loan officer), end-users, regulators
- E.g., SHAP, LIME

Local = one decision (individual-level explanation)
Global = overall model behavior (system-level explanation)

New cards

Types of Machine Learning Explanations: Intrinsic interpretability vs. Post-hoc interpretability

Intrinsic (or Direct) Interpretability
- Models are interpretable due to their simple structure (e.g., linear models, trees)
Post-hoc Interpretability
- Interpreting a black-box model like a neural network by applying model interpretability methods such as LIME, SHAP, LOFO, etc. after training the model or while in production

Intrinsic = built-in transparency
Post-hoc = added explanation after the fact
Tradeoff between accuracy (and model complexity) and intrinsic interpretability

New cards

Types of Machine Learning Explanations: Model specific vs. model agnostic

Model-specific explanation methods
- Limited to specific model or groups of models (e.g., feature importance assessments work for tree models)
- They depend heavily on the working and capabilities of a specific model
Model-agnostic explanation methods
- Work on any model
- Do not look into the black box of the algorithm but work with input – output pairs (e.g., LIME, SHAP, LOFO, etc.)

Model-specific = tied to one model’s structure (more precise)
Model-agnostic = works on any model (more flexible but approximate)

New cards

Who determines what explanations are needed for a specific use case?

Explanations are not one-size-fits-all—they are defined by the context, risk level, and needs of the people affected by or responsible for the system
You need the right people to determine whether people explanations, global machine explanations, local machine explanations, or all of the above are important for a specific use case

New cards

Why don’t we provide explanations for all AI applications?

We only prioritize explanations when the stakes, risks, or legal/ethical requirements justify the added complexity.

Explanations come with costs, limitations, and tradeoffs, and they aren’t always necessary or useful

New cards

The Importance of Machine Explanations

Helps determine if the rules are fair
Helps monitor if the algorithm is functioning as intended
It informs what one can do to change the outcome
It shows respect
Need to balance the importance of explanations against accuracy and cost/resources

New cards

When Machine Explainability does not Matter

When the model does not directly deal with decisions about how anyone should be treated
- No ethical risk
  - E.g., predicting delivery dates of inventory
When people explanations of why you want to use a black box plus informed consent justifies use
- Accuracy is very important or trading off accuracy for explainability is not prudent
  - Stock market predictions, Identifying cancer in mammograms
  - Trust a black box in such cases if the algorithm performs well - reliably and robustly against benchmarks
Caveat: People explanations + informed consent not sufficient when procedural justice is required. In such cases, algorithmic inscrutability makes it impossible to assess whether someone was treated procedurally fairly or unfairly.

New cards

When Machine Explainability DOES Matter

When expressing respect is ethically required
- E.g., when the person may be harmed in some way by output, “owe” them an explanation
When people need to know how to get better results
- When an opportunity is denied, important to explain why so that they can make changes to improve their chances next time
When people need to know how to approach and make a decision
- Human in the loop by exception: explain why a decision was flagged as an exception
When outputs are strange
- Should we rely on the judgment of an algorithm or a human expert?
- Correlation vs causation
When you need to justify treatment
- Ethical, regulatory, legal
  - 1. Global Explanations: Are the ML rules - what turns inputs into outputs – justifiable?
  - 2. Local Explanations: Was this particular output about this particular person justified?
- The question of whether certain rules are justified (e.g., use parents’ loan repayment history to decide whether to grant one loan) is not a question for data scientists.

New cards

What makes for good explanations?

Objective

Demonstrate respect
Help users make informed decisions
Support fairness, transparency, and trust

Truth

Accurate and faithful to the model
Use simplifications only when appropriate

Usefulness (Ease & Effectiveness)

Clear, not overly detailed or complex
Right level of detail (global vs. local as needed)

Intelligibility

Easy for the intended audience to understand
Matches users’ knowledge and context

Key idea

Good explanations are designed with end users based on their needs and goals

New cards

How can we use explanations to build the appropriate level of trust?

Avoid overreliance (too much trust) and avoid algorithmic aversion (too little trust)

Help users understand when to trust the system and when they should use their own judgment instead

New cards

How can we appropriately calibrate trust?

Explain in the moment and overall
- Explain the system and the output
- Both during onboarding and by tying explanations to actions
Articulate data sources
Explain what is important
- Partial explanations with progressive disclosure if warranted
Account for situational stakes
- Consider the risk of a user trusting a false positive, false negative, or a prediction that is off by X%
- Which of these deserve an explanation so that the user can make an appropriate decision?
Decide if and how to show model confidence to manage AI influence

New cards

What are example-based explanations and how are they useful?

Explain an AI decision by showing similar past cases (examples) instead of technical model details
Help users understand “why this outcome happened” through comparison

Appropriate when it is challenging to explain the reasons behind AI predictions.
These can help users understand why the model behaved the way it did and whether to trust the recommendation.
- Global: The AI shows image examples it tends to make errors on and examples it performs well on.
- Local: To help the user decide whether to trust a “poison oak” classification, the AI displays most-similar images of poison oak as well as images of other leaves.

New cards

What are confidence displays?

Show how certain or uncertain an AI model is about its prediction
Usually expressed as a probability or confidence score (e.g., 92% likely, low confidence)

It can help users calibrate their trust and make better decisions
But it is not always appropriate and can be confusing and misleading
- On average, the AI has 80% confidence in its categorizing of images.
- It is unclear whether a certain level (e.g., 80%) reflects high or low confidence.
Numeric-Confidence level (poison oak - 80%) or categorical (high, medium, low confidence)

New cards

What is privacy?

Privacy is the claim of groups, individuals, and institutions to determine for themselves when, how, and to what extent information about them is communicated to others (Westin 1967)
Privacy is the individual’s control of information concerning their person (U.S. Dept of Justice)
- What information is being collected
- How that information is being used
The right to control who can collect what data about you and what they can use it for.
Extent to which people have knowledge and control over their data without undue pressure.

New cards

What are the three aspects of privacy?

Regulatory Compliance
Ethics
Cybersecurity (integrity & security of data)

New cards

The privacy act of 1974 centers around “notice and consent.” What issues around consent make this not an effective privacy protection regulation?

Impossible to enforce in many AI applications (smart driving cars)
Onerous to properly inform oneself about the privacy policies of organizations
People not only care about how their information will be used, but also about their data being collected.
Desire to change this model by shifting the burden of protecting individual privacy from consumers over to the businesses that collect data: Data Stewardship
- Regulate how companies process data: what they collect and how they use it and share it

New cards

What are the privacy concern issues that are unique to AI and AI privacy issues?

AI is data hungry

Data Persistence
- Data existing longer than the human subjects that created it, driven by low data storage costs.
Data Repurposing
- Data being used beyond their originally imagined purpose
Data Spillovers
- Data collected on people who are not the target of data collection

New cards

What are the four elements that organizations can combine to define their level of privacy?

Transparency
- If you do not know what info is being collected about you, what’s been done with it, what decisions it contributes to, who it’s been shared with or sold to, then you cannot possibly exercise control

Data Control
- Have the ability to collect, edit, or delete information about oneself, and be able to opt in or out of being treated in a certain way
Opt out by default
- Do users automatically opt-in for collection of their information when using a site or is the default opt-out
- Opt-in puts the burden on the user while Opt-out puts the burden on the organization
Full Services
- Orgs may need to adjust the level of services it provides based on amount and type of data a person shares

New cards

The five levels of privacy for organizations

Blindfolded and handcuffed
- People are in the dark (not knowledgeable) and passive with regard to their data and the predictions made about them
Handcuffed
- People are knowledgeable about their data and predictions made about them, but they are passive with regards to what is collected and what is done with their data
Pressured
- People are knowledgeable and they have some degree of control over what data is collected and how it’s used
Slightly Curtailed
- People are knowledgeable and their data has not been collected and used without their consent
Grateful
- Organization provides full services independent of what data the person opts in to provide and consents to be used

New cards

The five levels of privacy for organizations key points

At levels 1 (blindfolded and handcuffed) and 2 (handcuffed)
- Users have no control over their data.
- The judgment that we are providing valuable services and therefore we are justified in collecting and using private data is made by the organization.
At levels 3 (pressured) and 4 (slightly curtailed)
- People can opt out.
- Judgment of whether giving up their privacy is beneficial is made by users.
- But orgs can incentivize people to make the tradeoff by curtailing services.

New cards

Tensions in Applying Existing Privacy Guidelines for AI: Three Pillars of Information Privacy (OECD Guidelines)

Collection Limitation
- Collection of personal information should be limited to only what is necessary; personal information should only be collected by lawful and fair means; and where appropriate, should be collected with the knowledge and/or consent of the individual
- TENSION: AI relies on large amounts of data that individuals are not aware of it being collected (IoT, smartphones, web tracking technology).
Purpose Specification
- The purpose of collecting personal information should be specified to the individual at the time of collection.
- TENSION: Organization may not know how the data will be used in the future by AI.
Use Limitation
- Personal information should only be used or disclosed for the purpose for which it was collected, unless there is consent or legal authority to do otherwise.
- TENSION: Organizations are permitted to use information for a “reasonably expected secondary purpose”. But AI can reveal patterns that are not reasonably expected

New cards

Surveillance Capitalism

Unilaterally claims human experience as free raw material for translation into behavioral data.
Although some of these data are applied to product or service improvement, the rest are declared as a proprietary behavioral surplus, fed into advanced manufacturing processes known as ‘machine intelligence,’ and fabricated into prediction products that anticipate what you will do now, soon, and later

New cards

Attention Economy

“bottleneck of human thought” that limits both what we can perceive in stimulating environments and what we can do.
“A wealth of information creates a poverty of attention,” suggesting that multitasking is a myth

New cards

Intention Economy

Born from “combining (a) hyper-personalized manipulation via LLM-based sycophancy, ingratiation, and emotional infiltration and [from] (b) increasingly detailed categorization of online activity elicited through natural language.
This new dimension of automated persuasion draws on the unique capabilities of LLMs and generative AI more broadly, which intervene not only on what users want, but also, to cite Williams, ‘what they want to want.’”

New cards

Why are surveillance capitalism, attention economy, and intention economy important with respect to AI privacy issues?

These concepts show how AI shifts from just analyzing data to tracking attention and predicting or influencing human behavior, which creates major privacy risks

New cards

How is privacy determined in organizations, and what tradeoffs does it involve?

Privacy level depends on an organization’s business model and goals
Organizations may apply different privacy levels to different products/services
There are key tradeoffs between privacy and other values:
- Privacy vs. safety/security (greater good)
- Privacy vs. innovation
- Privacy vs. accuracy of predictions

New cards

Who should be involved in making determinations about the right level of privacy for a given product or service?

Privacy decisions should not be made by one group alone, they require a collaborative, multi-stakeholder process balancing user rights, legal constraints, and organizational goals

New cards

What are the main components of structure?

Roles and responsibilities
Policies
Processes & procedures

New cards

What is the objective of AI governance?

How to create, scale, and maintain an AI ethical risk governance structure in your organization to systematically and comprehensively identify and manage the ethical, reputational, regulatory, and legal risks of AI

New cards

What is the typical first step in creating AI governance?

Articulating ethical principles for AI development and deployment

New cards

Describe the problems with current ethics statements that prevent them from being helpful

They lump together Content and Structure
- Confusing means and ends, strategies and goals, that is, governance structures and content (ethical values)
They lump together ethical and non-ethical values
- Lumping together ethical, cybersecurity, and engineering (e.g., accuracy) issues into “responsible” AI obfuscates addressing the true AI ethical issues (e.g., bias, privacy violations)
They lump together instrumental and non-instrumental values
They describe overly abstract values
- Values not specific enough, Don’t state “what to do”
- Need AI ethical principles that guide action

New cards

What is the recommended 4-step approach to enable an organization to develop relevant and actionable ethical value statements?

Slides

State your values by thinking about your ethical nightmares
Explain why you value what you do in a way that connects to your organization’s mission or purpose
Connect your values to what you take to be ethically impermissible
Articulate how you will realize your ethical goals or avoid your ethical nightmares

Study Guide

State your ethical values by identifying the organization’s ethical nightmares
Understand why these are important by linking to the organization’s mission
Connect these values to what is ethically impermissible
Articulate how you will realize these values

New cards

What are the advantages of this approach (4-Step) to developing AI ethical values

Defined values and strategies in a way that
- Enables action (helps determine KPIs)
- Connects to what is ethically off-limits
Can perform gap analysis based on the values you have specified to see where your company is and where it needs to be
If you involve stakeholders across the organization, you create awareness and buy-in
By articulating what is ethically impermissible and why, you enable people to think about ethically tough cases
- Explanations for why some things are impermissible and why the organization does X are helpful in discerning the right thing to do
Can be used for branding and public relations

New cards

How do we create an AI Governance Structure and what is the importance of executive leadership and ownership of this structure

Change starts from the very top. “Executives need to speak with their dollars, their time, their resources, that they’re allocating to this,” he says. Otherwise, people working on ethical AI are set up for failure.”

“Successful responsible-AI teams need enough tools, resources, and people to work on problems, but they also need agency, connections across the organization, and the power to enact the changes they're being asked to make”

New cards

Describe the 7-step approach outlined in the chapter

Articulate clear AI ethical standards.
Create broad organizational awareness of AI ethical standards and questions/risks.
Provide product development and procurement teams with the appropriate tools and processes to identify and mitigate AI ethical risk.
Include relevant experts in making these decisions (beyond data scientists).
Assign role-specific responsibilities to create accountability.
Create and track KPIs for the AI ethical risk program.
Have executive ownership of the AI ethical risk program

New cards

What is ethical case law and how do we create it? Why is this important and useful to have in deciding on AI ethical questions? Why should we create this a priori?

Like legal case precedents
- Prior cases where the org faced an ethical dilemma and how it resolved it well
- Prior cases from other similar organizations
- Fictional cases and their resolution
- One way: make statements (e.g., we never sell our data to third parties) and see how much consensus there is. Lack of consensus implies this is an area to think through what our stance should be, and under what conditions

Create these a priori (before encountering an ethical question)
- Develops a skill to tackle tough ethical questions well
- Resolution is not compromised because the right thing to do in the specific situation is painful (e.g., may lead to a loss of profit or loss of bonus, etc.).
Can be used by AI teams in developing products, by experts and senior execs in tackling tough cases, and in AI ethics training across the organization

New cards

Why is it critical to create AI ethical risk organizational awareness?

AI applications are procured across the organization [e.g., HR, marketing, etc.] by people who are not aware of the potential for, and are unable to assess, AI ethical risk

Two false assumptions
- AI ethics are for techies to figure out in terms of technical solutions
- AI ethics are for techies and product teams
AI is procured and used by many departments and units within an organization (e.g., HR, marketing).
- Each needs to be aware of AI ethical risks that may be built into vendors’ solutions

Personnel in these units need to be aware of sources of AI ethical risk and be able to assess it.
- Need for education and upskilling of personnel
- Develop new processes to assess vendor software by appropriately trained personnel

New cards

What do product development teams need to identify and mitigate AI ethical risk?

Product development teams must understand the issues and how to address them in principle.
Product development teams must have concrete tools and processes to enable them to
- Identify the ethical risks
- Mitigate the ethical risks
Product development teams need to engage in “AI ethics by design”.

New cards

Why shouldn’t the identification and mitigation of AI ethical risk be solely the responsibility of data scientists and product developers?

Unwise and unfair to put the burden of identifying and mitigating AI ethical risk on data scientists, engineers, and product designers.
- They lack expertise in ethics
- AI ethical issues can be very complex and can result in reputational, regulatory, and legal risks
- AI ethical risks scale rapidly (e.g., bias amplification)

New cards

Discuss the function, membership, and jurisdiction of an AI ethics committee (AIEC).

Function of an AI Ethics Committee

To play an oversight role in systematically and comprehensively identifying and mitigating the ethical risks of AI products that are developed in-house or that are procured from third-party vendors

What does that look like?

When teams bring a proposal for an AI solution, AIEC is responsible for
- Recommending whether or not to develop or procure the solution
- Confirm that there are no ethical risks pertaining to the solution that would warrant cessation
- Recommend future changes to the proposed solution, that if adopted, would lead to a second review
AIEC should document all cases and record its recommendations
AIEC process should be overseen and audited
This addresses understanding risk before a product is deployed or designed.

Who is part of the AIEC?

AIEC requires diverse expertise:
- Data scientist who understands the technical aspects of the product and understands what is being done and can be done from a technical perspective.
- Product design expert who can shape ethical risk-mitigation strategies in ways that do not undermine the essential functions of the product.
- Ethics-adjacent experts such as lawyers and privacy officers who understand current and potential regulations, anti-discrimination law, and privacy practices.
- Ethicist who has training, knowledge, and experience in understanding and spotting ethical issues, familiarity with important concepts and distinctions that can aid in ethical deliberations, and skill in helping people assess ethical issues objectively
- Subject-matter experts (e.g., if you are deploying an algorithm for hiring, HR)
- At least one member unaffiliated with the organization
All members should have an understanding of the business goals and necessities of the organization.

When should the product teams consult with the AIEC?

Should be consulted before research or product development begins
To identify potential ethical risks
Much cheaper and efficient to make changes in products that do not yet exist

How much power should the AIEC have?

Are product development and procurement teams required to consult with AIEC or is it merely recommended
Are the decisions of the AIEC requirements that must be followed or are they merely recommendations? And if they are required, can a senior executive overrule them?

New cards

What are the risks of not having an AI ethics committee

Increased risk of not identifying AI ethical risks
Risks of identifying AI ethical risks only in deployment, when it is costly to address
- legal, reputational, etc. costs have already been incurred
- more difficult and costly to change the system
Inconsistencies in ethical standards across departments
More opportunities for conflict between short-term career goals, short-term profit etc. and the long-term welfare of the organization.

New cards

Discuss what is involved in creating an accountability structure in the organization

Assign role-specific responsibilities aimed at identifying and mitigating AI ethical risks.
For data collectors, data scientists and engineers, product owners, etc
There should be both incentives to fulfill responsibilities and disincentives for those who do not fulfill responsibilities.
- Failure to fulfill responsibilities must be taken seriously and have consequences.
  - Orgs that turn a blind eye to ethical risks will realize those risks
  - Adhering to ethical practices should NOT put individuals at a competitive disadvantage in terms of bonuses, raises, promotions, etc. compared to those who do not
- Recognize, formally and informally, those who adhere to AI ethical standards
- Make this part of the annual evaluation process and incentives

100

New cards

What two types of KPIs are used to monitor an AI Ethics program, and what do they measure?

(1) Compliance / Adoption KPIs – Measure how well the organization is following ethical standards.
Examples:

% of product proposals approved/rejected by ethics committee
# of ethics violations or disciplinary actions
% of employees trained or aware of standards

(2) Effectiveness / Risk Mitigation KPIs – Measure whether the standards actually reduce ethical risk.
Examples:

% of products meeting privacy commitments
% of systems providing understandable explanations
Metrics related to fairness, privacy, and respect outcomes