RAG purpose
Ground LLM outputs in external knowledge to reduce hallucinations
RAG pipeline
Index, retrieve, generate
Index steps
Ingest and clean, chunk, embed, and store (vector DB + metadata)
Ingest and clean
Convert to plain text and clean it so it's readable (extract text, fix broken encodings and headers/footers; capture source URL, last-updated date, and owner as metadata)
Chunk
Segment text into digestible chunks with overlap (~200-500 token units with small overlaps and attached metadata)
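A minimal chunking sketch in Python; whitespace splitting stands in for a real tokenizer, and the chunk size, overlap, and metadata fields are illustrative values:
```python
def chunk_text(text, chunk_size=400, overlap=50):
    """Split text into overlapping chunks of roughly chunk_size tokens.

    Whitespace splitting approximates token counts; overlap must be
    smaller than chunk_size so the window keeps advancing.
    """
    tokens = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        piece = tokens[start:start + chunk_size]
        if piece:
            chunks.append(" ".join(piece))
    return chunks

# Attach metadata to each chunk before embedding (field names are illustrative)
doc = "..."  # full cleaned document text
chunks = [{"text": c, "source": "doc-001"} for c in chunk_text(doc)]
```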
Embed
Compute embeddings for each chunk (numerical representation that captures meaning)
Store
Vector database stores, indexes, and searches embeddings
Retrieval
Encode query, compute similarity scores, and retrieve top-K most relevant chunks to build context
Similarity scores between query vector and document chunks
Cosine similarity or dot product
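A small dense-retrieval sketch with NumPy, assuming the query and chunk embeddings have already been computed by whatever embedding model is in use:
```python
import numpy as np

def cosine_sim(query_vec, chunk_vecs):
    # Cosine similarity = dot product of L2-normalized vectors
    q = query_vec / (np.linalg.norm(query_vec) + 1e-12)
    c = chunk_vecs / (np.linalg.norm(chunk_vecs, axis=-1, keepdims=True) + 1e-12)
    return c @ q

def top_k(query_vec, chunk_vecs, k=5):
    # Score every chunk against the query and return indices of the best k
    scores = cosine_sim(query_vec, chunk_vecs)
    return np.argsort(scores)[::-1][:k]
```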
Sparse approach
Token matches used for keyword queries and exact matches
Dense approach
Semantic vectors used for similarity search and semantic QA
Hybrid approach
Combines sparse and dense signals for improved recall and precision
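A rough hybrid-scoring sketch, assuming per-chunk sparse (lexical) and dense (semantic) scores already exist; the min-max normalization and the 0.5 weight are illustrative choices, not a prescribed recipe:
```python
import numpy as np

def hybrid_scores(sparse_scores, dense_scores, alpha=0.5):
    """Blend lexical and semantic scores per chunk.

    Min-max normalize each signal so they are comparable, then take a
    weighted sum; alpha trades off dense vs. sparse influence.
    """
    def norm(x):
        x = np.asarray(x, dtype=float)
        rng = x.max() - x.min()
        return (x - x.min()) / rng if rng > 0 else np.zeros_like(x)
    return alpha * norm(dense_scores) + (1 - alpha) * norm(sparse_scores)
```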
Precision
If a model predicts a positive outcome, how likely is that prediction to be correct?
Recall
Given all relevant instances, how many did the model actually detect?
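The two definitions as formulas over true positives (TP), false positives (FP), and false negatives (FN):
```python
def precision(tp, fp):
    # Of everything predicted positive, how much was actually positive?
    return tp / (tp + fp) if (tp + fp) else 0.0

def recall(tp, fn):
    # Of everything actually positive, how much did the model find?
    return tp / (tp + fn) if (tp + fn) else 0.0
```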
Generate prompt
Query + retrieved chunks + instructions
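A minimal prompt-assembly sketch; the template wording and the `[doc i]` citation markers are illustrative, not a prescribed format:
```python
def build_prompt(query, retrieved_chunks,
                 instructions="Answer using only the context below. Cite sources."):
    # Number each retrieved chunk so the model can cite it in the answer
    context = "\n\n".join(f"[doc {i + 1}] {c['text']}"
                          for i, c in enumerate(retrieved_chunks))
    return f"{instructions}\n\nContext:\n{context}\n\nQuestion: {query}\nAnswer:"
```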
Hit rate
Fraction of queries for which a correct document appears in the top-K retrieved results
Mean reciprocal rank (MRR)
How early does the first correct answer appear? (1/rank of first relevant result)
Normalized discounted cumulative gain (NDCG)
Weighted rating of the entire ranking, not just the first hit
Exact match (EM) / F1
Exact string match and token-overlap of generated answers against reference answers
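Hit rate and MRR as a small sketch over ranked retrieval results; `ranked_ids` and `relevant_ids` are assumed per-query inputs (retrieved document IDs in rank order, and the set of known-correct IDs):
```python
def hit_rate(ranked_ids, relevant_ids, k=5):
    # 1 if any relevant document appears in the top-k, else 0
    return float(any(doc in relevant_ids for doc in ranked_ids[:k]))

def reciprocal_rank(ranked_ids, relevant_ids):
    # 1 / rank of the first relevant result (0 if none was retrieved)
    for rank, doc in enumerate(ranked_ids, start=1):
        if doc in relevant_ids:
            return 1.0 / rank
    return 0.0

# MRR = mean of reciprocal ranks across queries; hit rate = mean of per-query hits
```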
Pitfalls of oversized chunks
Matches are vague and the context window is stuffed with extra fluff
Pitfalls of chunks that are too small
Passages lose meaning and the model receives fragments without enough surrounding context
Purpose of post-training
Tailor outputs to domain needs (format, tone, policy, tool-use), cheaper than pre-training, works with RAG
Supervised Fine-Tuning (SFT)
Next-token cross-entropy on target responses (mask system/prompt as needed; length-normalize)
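A sketch of the usual label-masking convention for SFT with Hugging Face-style causal LMs, where positions labeled -100 are ignored by the cross-entropy loss; the field names are assumptions about the dataset layout:
```python
IGNORE_INDEX = -100  # cross-entropy skips positions with this label

def build_labels(prompt_ids, response_ids):
    """Train only on the response: mask prompt tokens out of the loss."""
    input_ids = list(prompt_ids) + list(response_ids)
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(response_ids)
    return {"input_ids": input_ids, "labels": labels}
```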
SFT data
Instruction-response pairs; normalize templates, deduplicate, filter unsafe content and personally identifiable information (PII), tag with metadata
Parameter-efficient fine tuning (PEFT)
Low-Rank Adaptation (LoRA) or Quantized LoRA (QLoRA) to train small, reusable modules on top of frozen base weights
RLHF (Reinforcement Learning from Human Feedback)
Reward model + PPO with KL regularization to a reference
RLHF benefits
Aligns outputs with nuanced human preferences (helpfulness, tone, safety) beyond what supervised targets alone capture
RLHF drawbacks
Higher operational complexity
LoRA
Inserts low-rank adapters into attention/MLP layers and trains only the adapters, giving large parameter savings with little quality loss
QLoRA
4-bit quantized base + LoRA adapters to reduce memory further
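A hedged LoRA/QLoRA configuration sketch with the `peft` and `transformers` libraries; the model name, rank, and target module names are illustrative and vary by architecture, argument names can shift across library versions, and 4-bit loading assumes a CUDA setup with bitsandbytes installed:
```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# QLoRA: load the frozen base model in 4-bit to cut memory use
bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)
base = AutoModelForCausalLM.from_pretrained("base-model-name", quantization_config=bnb)

# LoRA: train small low-rank adapters on top of frozen attention projections
lora = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                  target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(base, lora)
model.print_trainable_parameters()  # only the adapter weights are trainable
```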
Hyperparameters to reason about
Epochs, learning rate, warmup steps/ratio, weight decay, effective batch size (per-device batch × gradient accumulation steps)
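The same knobs expressed as Hugging Face `TrainingArguments`; every value below is an illustrative starting point, not a recommendation:
```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    num_train_epochs=3,              # full passes over the data
    learning_rate=2e-4,              # how fast weights move
    warmup_ratio=0.03,               # gentle ramp-up of the learning rate
    weight_decay=0.01,               # small pull toward zero to curb overfitting
    per_device_train_batch_size=4,
    gradient_accumulation_steps=8,   # effective batch = 4 * 8 per device
)
```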
Overfitting
An overly aggressive learning rate or too many epochs leads to repetition or echoing of the training data
SFT deliverables pattern
Saved adapters/tokenizer, prompt template, inference function for product integration
DPO (Direct Preference Optimization)
Logistic loss on log-probability gaps (chosen vs. rejected) with reference correction, beta controls preference strength
DPO data
Prompt with (chosen, rejected) response pairs; uses a frozen reference policy
Reference policy
Compares gaps vs. frozen base/SFT model; beta sweeps are implementation-dependent
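The DPO objective as a per-pair loss sketch in PyTorch; the inputs are assumed to be log-probabilities summed over response tokens for each of the four (policy/reference × chosen/rejected) combinations:
```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """-log sigmoid(beta * (policy gap - reference gap)).

    Pushes the policy to prefer chosen over rejected responses, measured
    relative to the frozen reference; beta controls preference strength.
    """
    policy_gap = policy_chosen_logp - policy_rejected_logp
    ref_gap = ref_chosen_logp - ref_rejected_logp
    return -F.logsigmoid(beta * (policy_gap - ref_gap)).mean()
```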
When to use DPO
Complements SFT for subjective qualities (helpfulness, tone, refusals)
DPO evaluation
Formatting adherence, pairwise win-rate, safety/jailbreak tests, business KPIs
DPO vs. RLHF
Avoids a separate reward model and PPO loop; turns alignment into a direct logistic objective
Typical workflow
SFT, collect preference pairs, DPO fine-tuning, evaluate HHH (helpful, honest, harmless) + task KPIs
Tooling
TRL's 'DPOTrainer' (with beta and other hyperparameters as in SFT) for pairwise preferences
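A hedged usage sketch of TRL's DPOTrainer; argument names (and whether beta lives in DPOConfig) have shifted across TRL versions, and `model`, `tokenizer`, and `preference_dataset` are assumed to be loaded earlier:
```python
from trl import DPOConfig, DPOTrainer

# Dataset rows follow TRL's convention: {"prompt": ..., "chosen": ..., "rejected": ...}
config = DPOConfig(output_dir="dpo-out", beta=0.1,
                   learning_rate=5e-6, num_train_epochs=1)
trainer = DPOTrainer(
    model=model,
    ref_model=None,                 # None: TRL keeps a frozen copy as the reference
    args=config,
    train_dataset=preference_dataset,
    processing_class=tokenizer,     # "tokenizer=" in older TRL versions
)
trainer.train()
```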
AI agents
Goal-directed loops that plan, call tools, write/execute code, and refine using feedback
Vibe coding
Specify the "feel" and constraints of the solution (architecture, style, design rules, performance/latency targets, acceptance criteria) rather than line-by-line instructions, guide with examples and guardrails
Vibe coding workflow
Define vibe, provide scaffolds, add exemplars, set guardrails, iterate, log decisions, lock rules
Define vibe
Goals, constraints, tests, non-goals
Provide scaffolds
Interfaces, stubs, layout, freeze public APIs
Add exemplars
Code/style snippets to follow/avoid
Set guardrails
Tests, types, linters, CI as hard checks
Iterate
Generate, run, review and refine in tight cycles
Log decisions
Record rationale and update criteria
Lock rules
CI/pre-commit to enforce vibe
Limitations of vibe coding
Ambiguity and drift, non-determinism under parallel edits, hallucinated APIs, style inconsistency, security/secret mishandling, missing refactors, dependency/version surprises
Steering vs. Fine-tuning (SFT/DPO)
Fine-tuning makes more durable changes but is slower, coarser (it changes weights), and compute-intensive; steering is instant and reversible at inference
Interpretation
Uses sparse autoencoders (SAEs) to untangle activations into human-named features (e.g., polite tone, numbers, lists), then inspects the residual stream encodings
Interpretability benefits
Enables bias audits, compliance checks, debugging, etc.
Steering
Control model behavior along interpretable axes without retraining
2 common steering methods
SAE feature steering (select a feature and increase/decrease activation) and steering vectors (activation addition adds a learned direction to the hidden state)
Steering benefits
Brand-tone control, toxicity reduction, truthfulness nudges, region/policy alignment, and rapid iteration without costly retraining
Residual stream
The running hidden state that each attention/MLP block reads from and adds its output back into via residual connections
Autoencoder
Compresses and then reconstructs activations, trained to minimize reconstruction loss
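A tiny sparse autoencoder sketch in PyTorch: encode activations into a wider, sparsity-penalized feature space and reconstruct them; the dimensions and the L1 coefficient are illustrative:
```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, d_model=768, d_features=4096):
        super().__init__()
        self.encoder = nn.Linear(d_model, d_features)
        self.decoder = nn.Linear(d_features, d_model)

    def forward(self, activations):
        features = torch.relu(self.encoder(activations))  # sparse, nameable features
        reconstruction = self.decoder(features)
        return reconstruction, features

def sae_loss(activations, reconstruction, features, l1_coeff=1e-3):
    # Reconstruction error + L1 penalty that pushes most features to zero
    mse = torch.mean((reconstruction - activations) ** 2)
    sparsity = l1_coeff * features.abs().mean()
    return mse + sparsity
```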
Benefits of SAE feature steering
Interpretability, transparency, fine-grained control, and knowledge audit
Steering vector in activation engineering
Instead of a trained SAE, derive a direction from contrasting prompts and apply it to the residual stream
Alpha in steering vectors
Controls how hard and in which direction you push along the steering vector
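A rough activation-addition sketch in PyTorch: derive a steering vector from the difference of residual-stream activations on contrasting prompts, then add alpha times that vector during generation via a forward hook. The layer index, the attribute path `model.model.layers[...]`, and the tuple handling are assumptions that depend on the specific architecture:
```python
import torch

def get_residual(model, tokenizer, prompt, layer_idx):
    """Capture one transformer block's residual-stream output for a prompt."""
    captured = {}
    def hook(module, inputs, output):
        # Many HF decoder blocks return a tuple whose first element is the hidden state
        captured["h"] = output[0] if isinstance(output, tuple) else output
    handle = model.model.layers[layer_idx].register_forward_hook(hook)
    with torch.no_grad():
        model(**tokenizer(prompt, return_tensors="pt"))
    handle.remove()
    return captured["h"].mean(dim=1).squeeze(0)  # average over token positions

def make_steering_hook(vector, alpha):
    """Add alpha * vector to the residual stream on every forward pass."""
    def hook(module, inputs, output):
        if isinstance(output, tuple):
            return (output[0] + alpha * vector,) + output[1:]
        return output + alpha * vector
    return hook

# Steering vector = difference between contrasting prompts (e.g., polite vs. rude):
# v = get_residual(model, tok, "polite example ...", 12) - get_residual(model, tok, "rude example ...", 12)
# handle = model.model.layers[12].register_forward_hook(make_steering_hook(v, alpha=4.0))
# ... generate ...; handle.remove() turns steering off again (reversible)
```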
Operational advantages of steering vectors
Instant, reversible control at inference; direct and reproducible effect (less context/wording sensitive than prompting); easier than fine-tuning (no labels, no weight updates, immediate rollback)
Fine-grained control of steering vector
Adjust alpha for strength and flip its sign to reverse behavior; cheaper and more precise than fine-tuning, which needs new hyperparameter runs and data curation for each adjustment
Interpretability and governance of steering vectors
Each vector is a contrast labelable in plain English; simple to review, share, and roll back
Steering vector applications
Customer support tone shift, safety/refusal and overclaiming controls for compliance, and brand voice and regional personalization at scale
Epoch
One full pass through the training dataset (more epochs = more aggressive learning)
Learning rate
How fast the model learns (higher = more aggressive learning)
Warmup steps/ratio
A short startup phase where learning speed gradually increases to avoid sudden shocks (more = more conservative training)
Weight decay
A small penalty that nudges weights toward zero to prevent overfitting (higher = more conservative training)
Effective batch size
The total number of examples used before each update (larger = more aggressive learning)
KL divergence
Used in RLHF to penalize diverging too far from the original model
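A per-token KL-penalty sketch as used in RLHF-style training: penalize the policy for drifting from the reference model's token distribution; the inputs are assumed to be log-probabilities from the two models on the same sampled tokens:
```python
import torch

def kl_penalty(policy_logprobs, ref_logprobs):
    """Approximate per-token KL(policy || reference) on sampled tokens.

    On tokens sampled from the policy, log pi(token) - log ref(token) is a
    standard single-sample estimate of the KL divergence; its mean over the
    sequence is subtracted from the reward during PPO updates.
    """
    return (policy_logprobs - ref_logprobs).mean()
```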
PPO
Proximal Policy Optimization (reinforcement learning algorithm used in RLHF)