Flashcards covering key vocabulary related to AI evaluation and fairness.
Fairness
Treating all individuals and groups equitably, without favoring one group over another.
Bias
Producing systematically prejudiced outcomes.
Traceability
Ability to track and document the decision-making processes, data sources, and changes made to an AI model throughout its lifecycle.
Explainability
Ability of an AI system to provide understandable reasons or explanations for its outputs or decisions.
Responsibility
Assigning accountability for the actions and outcomes of AI systems; this entails mitigating risks such as harm or misuse.
Liability
Legal responsibility assigned to individuals or organizations for the actions, outcomes, or consequences of an AI system.
Gender Bias
NLP models associate certain professions or behaviors with specific genders.
Racial Bias
NLP models underperform on languages or dialects spoken by minority groups.
Cultural Bias
NLP chatbot fails to understand cultural nuances.
Data Transparency
Knowing the sources of data used to train NLP models.
Versioning
Keeping track of changes in an NLP model.
Reproducibility
Allows developers to reproduce results and check for consistency over time.
Explainability Benefit
Helps users trust NLP systems by explaining outputs.
Content Moderation
NLP models used in social media platforms need to avoid spreading misinformation or hate speech.
Ethical Model Development
Developers must ensure that datasets used for training NLP models are ethically sourced and do not contain biases.
Corrective Mechanisms
Implementing processes to fix or update models when biased or unethical behavior is identified.
Adversarial Examples
Inputs that have been deliberately perturbed to mislead a model.
Synonym Substitution
Changing words to synonyms to confuse a model.
Character-level Attacks
Introducing typos to confuse a model.
Paraphrasing
Rephrasing text to confuse a model.
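The three attack types above can be sketched in a few lines. This is a minimal illustration, not a real attack toolkit: the synonym table is a made-up stand-in (real synonym-substitution attacks use word embeddings or WordNet), and the character-level attack here just swaps adjacent letters at random.

```python
import random

def typo_attack(text, rate=0.3, seed=0):
    """Character-level attack: randomly swap adjacent letters to inject typos."""
    rng = random.Random(seed)
    chars = list(text)
    for i in range(len(chars) - 1):
        if chars[i].isalpha() and chars[i + 1].isalpha() and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)

# Hypothetical synonym table for illustration only.
SYNONYMS = {"good": "decent", "movie": "film"}

def synonym_attack(text):
    """Synonym substitution: replace words with meaning-preserving alternatives."""
    return " ".join(SYNONYMS.get(w, w) for w in text.split())

print(synonym_attack("a good movie"))  # -> "a decent film"
```

Both perturbations leave the text readable to a human while shifting the token sequence the model sees, which is exactly what makes adversarial examples hard to defend against.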
Clean Data
Removing duplicates, irrelevant content, and highly noisy data before training or evaluation.
Cohen's Kappa
An inter-annotator agreement statistic that takes into account agreement occurring by chance.
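Cohen's kappa is defined as κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e is the agreement expected by chance from each annotator's label frequencies. A minimal sketch for two annotators:

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(labels_a)
    # Observed agreement: fraction of items both annotators labeled identically.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: sum over labels of the product of marginal frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)

a = ["pos", "pos", "neg", "neg", "pos", "neg"]
b = ["pos", "pos", "neg", "pos", "pos", "neg"]
print(round(cohens_kappa(a, b), 3))  # -> 0.667
```

Here the annotators agree on 5 of 6 items (p_o ≈ 0.833) but would agree on half by chance (p_e = 0.5), so κ ≈ 0.667 rather than 0.833.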
Baseline
A simple model or method used as a point of comparison to evaluate the performance of more advanced models.
Random Baseline
Assigns labels randomly.
Majority Class Baseline
Always predicts the most frequent class in the dataset.
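Both baselines above fit in a few lines; function names here are illustrative, not from any library (scikit-learn offers the same idea as `DummyClassifier`).

```python
import random
from collections import Counter

def random_baseline(classes, inputs, seed=0):
    """Random baseline: assign a uniformly random label to each input."""
    rng = random.Random(seed)
    return [rng.choice(classes) for _ in inputs]

def majority_baseline(train_labels, inputs):
    """Majority class baseline: always predict the most frequent training label."""
    majority = Counter(train_labels).most_common(1)[0][0]
    return [majority] * len(inputs)

train = ["spam", "ham", "ham", "ham", "spam"]
print(majority_baseline(train, ["msg1", "msg2", "msg3"]))  # -> ['ham', 'ham', 'ham']
```

On an imbalanced dataset the majority baseline can look deceptively strong (60% accuracy here just by always saying "ham"), which is exactly why a model must beat it to demonstrate real skill.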
BLEU (Bilingual Evaluation Understudy)
Calculates the overlap of n-grams between the machine-generated translation and the reference translation.
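The n-gram overlap behind BLEU can be sketched as below. This is a simplified single-reference version with clipped n-gram precision and a brevity penalty; production implementations (e.g. NLTK's `sentence_bleu`) add smoothing and support multiple references.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate, reference, max_n=4):
    """Simplified BLEU: geometric mean of clipped n-gram precisions
    times a brevity penalty (single reference, no smoothing)."""
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(ngrams(candidate, n))
        ref = Counter(ngrams(reference, n))
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(count, ref[g]) for g, count in cand.items())
        total = sum(cand.values())
        precisions.append(overlap / total if total else 0.0)
    if min(precisions) == 0:
        return 0.0
    # Brevity penalty discourages overly short candidates.
    bp = min(1.0, math.exp(1 - len(reference) / len(candidate)))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)
```

A candidate identical to its reference scores 1.0; any missing n-gram order drives the unsmoothed score toward 0, which is why smoothing matters for short sentences.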
ROUGE-N
Measures overlap of n-grams.
ROUGE-L
Measures the longest common subsequence (LCS).
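The LCS computation underlying ROUGE-L is a standard dynamic program; a minimal F-measure sketch (real ROUGE implementations add stemming and other normalization):

```python
def lcs_length(a, b):
    """Length of the longest common subsequence via dynamic programming."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)]

def rouge_l_f1(candidate, reference):
    """ROUGE-L F1: harmonic mean of LCS-based precision and recall."""
    lcs = lcs_length(candidate, reference)
    if lcs == 0:
        return 0.0
    precision = lcs / len(candidate)
    recall = lcs / len(reference)
    return 2 * precision * recall / (precision + recall)

print(round(rouge_l_f1("the cat sat".split(), "the cat sat down".split()), 3))
```

Unlike ROUGE-N, the LCS rewards in-order word matches without requiring them to be contiguous, so reorderings hurt the score while insertions between matched words do not break the match.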
MMLU (Massive Multitask Language Understanding)
Tests knowledge and reasoning across 57 academic subjects.
HellaSwag
Tests commonsense reasoning and narrative plausibility.
TruthfulQA
Tests truthfulness, especially in response to tricky or adversarial questions.
BIG-bench (Beyond the Imitation Game)
A giant collection of 200+ tasks to probe creativity, reasoning, and morality.
HLE (Humanity’s Last Exam)
A recent benchmark designed to test whether AI systems have reached expert-level human reasoning.