AWS Certified AI Practitioner

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/194

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 11:35 PM on 6/15/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

195 Terms

1
New cards

Artificial Intelligence

encompasses the development of intelligent systems capable of performing tasks that typically require human intelligence, such as perception, reasoning, learning, problem-solving, and decision making

2
New cards

Machine Learning

type of artificial intelligence for understanding and building methods that make it possible for machines to learn

3
New cards

Deep learning

subfield of machine learning that uses artificial neural networks to analyze data and learn complex patterns

4
New cards

Generative AI

subset of deep learning because it can adapt models built using deep learning, without retraining or fine tuning

capable of generating new data based on the patterns and structures learned from training data

5
New cards

Neural networks

lots of connected nodes that are organized into layers (input layer, hidden layer(s), and output layer)

when data is inputted, it figures out how to identify patterns by adjusting the connections between its nodes

6
New cards

Computer vision

a field of AI that makes it possible for computers to interpret and understand digital images and videos

ex: image classification, object detection, and image segmentation

7
New cards

Natural language processing (NLP)

branch of AI that deals with the interaction between computers and human languages

ex: text classification, sentiment analysis, machine translation, and language generation

8
New cards

Model

systems built using neural networks, system resources, data, and prompts, all working together to process input and produce outputs

9
New cards

Foundation model

models that are pretrained on internet-scale data

adapt a single FM to perform multiple tasks such as text generation, text summarization, information extraction, image generation, chatbot, and question answering

10
New cards

Algorithm

a set of instructions to be followed in calculations or other operations

11
New cards

Prompts

specific set of inputs to guide LLMs to generate an appropriate output or completion

12
New cards

Inference

process of using a trained model to generate results based on new input data (e.g., turning your prompt into an answer, image, or prediction)

13
New cards

Completion

the output the model produces in response to your prompt; often a continuation or expansion of the given input

14
New cards

Training

the process of feeding an AI model curated data sets to evolve the accuracy of its output

15
New cards

Labeled data

dataset where each instance/example is accompanied by a label or target variable that represents the desired output or classification

16
New cards

Unlabeled data

dataset where the instance/example do not have any associated labels or target variables (only input feature without corresponding output/classification)

17
New cards

Structured data

data that is organized and formatted in a predefined manner, typically in the form of tables or databases with rows and columns

18
New cards

Tabular data

structured data stored in spreadsheets, databases, or CSV files, with rows representing instances and columns representing features or attributes

19
New cards

Time-series data

structured data that consists of sequences of values measured at successive points in time, such as stock prices, sensor readings, or weather data

20
New cards

Semi-structured data

data that can have different attributes or missing attributes, such as a text file that contains JSON

21
New cards

Unstructured data

data that lacks a predefined structure or format, such as text, images, audio, and video

22
New cards

Text data

unstructured data that includes documents, articles, social media posts, and other textual data

23
New cards

Image data

unstructured data that includes digital images, photographs, and video frames

24
New cards

Inferencing

using the information that a model has learned to make predictions or decisions

25
New cards

Batch inferencing

analyzes a large amount of data all at once to provide a set of results

suitable for offline processing when data can be processed in batches. batch transform can support processing times of days therefore highest latency

26
New cards

Real-time inferencing

computer has to make decisions quickly, in response to new information as it comes in

real-time inference offers the lowest latency requirements because of the 60-second processing times

27
New cards

Asynchronous inferencing

can queue incoming requests for inference processing

asynchronous inference provides moderate latency requirements because of the processing times of up to 1 hour

28
New cards

Bias

model is missing important features of the datasets

measured by the difference between the expected predictions of the model and the true values we are trying to predict

29
New cards

Fairness

the equitable and impartial treatment of individuals, or data subjects, by AI systems

30
New cards

Fit

the process of adjusting the parameters of a model to best capture the patterns and relationships in the input data

31
New cards

Large language model (LLM)

powerful models that can understand and generate human-like text

use tokens, embeddings, and vectors to capture complex relationships in language

32
New cards

Tokens

basic units of text that the model processes (e.g. words, phrases, or characters) to provide standardization of input data

33
New cards

Vector

list of numbers representing data (such as text or an image) in a mathematical form the model can work with

34
New cards

Vector database

collection of data stored as mathematical representations

store structured and unstructured data, such as text or images with the vector embeddings

35
New cards

Embeddings

numerical representations of tokens, where each token is assigned a vector that captures its meaning and relationship with other tokens

a numerical vectorized representation of any entity that captures the meaning or semantic relationships of data

36
New cards

In-context learning

giving the model examples or extra information within the same prompt (inside the context window) so it can learn the task or style on the fly without retraining

37
New cards

Supervised learning

training on labeled data

learn a mapping function that can predict the output for new, unseen input data

38
New cards

Unsupervised learning

learn from unlabeled data

discover inherent patterns, structures, or relationships within the input data

39
New cards

Reinforcement learning

give only a performance score and portion of labeled training data

learns from feedback (rewards or penalties) to improve its decision-making over time

40
New cards

Classification model

a supervised learning technique used to assign labels or categories to new, unseen data instances based on a trained model

Use-cases: fraud detection, image classification, customer retention, and diagnostics

41
New cards

Regression model

a supervised learning technique used for predicting continuous or numerical values based on one or more input variable

Use-cases: advertising popularity prediction, weather forecasting, market forecasting, estimating life expectancy, population growth prediction

42
New cards

Clustering

an unsupervised learning technique that groups data into different clusters based on similar features or distances between the data point to better understand the attributes of a specific cluster

Use-cases: customer segmentation, targeted marketing, and recommended systems

43
New cards

Dimensionality reduction

an unsupervised learning technique used to reduce the number of features or dimensions in a dataset while preserving the most important information or patterns

Use-cases: big data visualization, meaningful compression, structure discovery, feature elicitation

44
New cards

Data collection

process of gathering and measuring information on variables of interest

should accurately reflect the diverse perspectives and experiences required for the use case of the AI system

45
New cards

Exploratory data analysis (EDA)

an analysis approach that identifies general patterns in the data

46
New cards

Data pre-processing

preprocess the data to ensure it is accurate, complete, and unbiased

techniques such as data cleaning, normalization, and feature selection can help to eliminate biases in the dataset

47
New cards

Feature engineering

the process of creating, transforming, extracting, and selecting variables from data

48
New cards

Model training

the process of training an ML algorithm with adequate training data to demonstrate correlation between the outcome and the influencing variables

49
New cards

Hyperparameter tuning

the problem of choosing a set of optimal hyperparameters for a learning algorithm

hyperparameter: a parameter whose value is used to control the learning process, which must be configured before the process starts

50
New cards

Model evaluation

the process of using different evaluation metrics to understand a machine learning model's performance, as well as its strengths and weaknesses

51
New cards

Model deployment

the process of putting machine learning models into production

52
New cards

Model monitoring

Ensures the model is maintaining a desired level of performance through early detection and mitigation

53
New cards

MLOps

refers to the practice of operationalizing and streamlining the end-to-end machine learning lifecycle from model development and deployment to monitoring and maintenance

54
New cards

Accuracy

model performance metric that measures the percentage of correct predictions or classifications

55
New cards

Area Under the ROC Curve

represents the probability that the model, if given a randomly chosen positive and negative example, will rank the positive higher than the negative

56
New cards

F1 score

the harmonic mean of precision and recall

57
New cards

Precision

proportion of positive predictions that are actually correct

58
New cards

Recall

proportion of correct sets that are identified as positive

59
New cards

Prompt engineering

focuses on developing, designing, and optimizing prompts to enhance the output of FMs for your needs

selects appropriate words, phrases, sentences, punctuation, and separator characters to effectively use LLMs for a wide variety of applications

60
New cards

Multi-modal models

can process and generate multiple modes of data simultaneously

learn how different modalities like images and text are connected and can influence each other

use-cases: automating video captioning, creating graphics from text instructions, answering questions more intelligently by combining text and visual info

61
New cards

Diffusion models

deep learning architecture system that starts with pure noise/random data, gradually adding more meaningful information to end with a clear and coherent output

62
New cards

Forward diffusion

system gradually introduces a small amount of noise to an input image until only the noise is left over

63
New cards

Reverse diffusion

noisy image is gradually introduced to denoising until a new image is generated

64
New cards

Stable diffusion

does not use the pixel space of the image, uses a reduced definition latent space

65
New cards

Generative adversarial networks (GANs)

a class of generative models that uses two neural networks — a generator that creates synthetic data, and a discriminator that tries to tell real data from fake data. The two compete in a feedback loop until the generator produces data that is indistinguishable from real examples

66
New cards

Variational autoencoders (VAEs)

A type of generative model that learns to encode input data into a compressed latent representation and then decode it back to reconstruct the data. By sampling from the latent space, VAEs can generate new, similar data

67
New cards

Intrinsic analysis

can be applied to models with low complexity or simple relationships between the input variables and the predictions

high model interpretability -> lower model performance

68
New cards

Post hoc analysis

can be applied to simple relationship models and more complex models that capture nonlinear interactions

69
New cards

Foundation model lifecycle

  1. Data selection (FMs required training on massive datasets from diverse sources)

  2. Pre-training (use self-supervised learning - make use of structure to autogenerate labels)

    • Can learn the meaning, context, and relationship of words in the dataset

  3. Optimization

    • Techniques like prompt engineering, retrieval-augmented generation (RAG), and fine-tuning on task-specific data

  4. Evaluation (measured using appropriate metrics and benchmarks)

  5. Deployment

  6. Feedback and continuous improvement (collected from users, domain experts, or other stakeholders to identify areas for improvement, detect potential bias, and inform future iterations of the model)

70
New cards

Advantages of generative AI

Adaptability, responsiveness, simplicity, creativity and exploration, data efficiency, personalization, and scalability

71
New cards

Toxicity

can generate content that is inflammatory, offensive, or inappropriate

72
New cards

Hallucinations

model generates inaccurate responses that are not consistent with the training data

73
New cards

Interpretability

the access into a system so that a human can interpret the model’s output based on the weights and features

74
New cards

Nondeterminism

model might generate different outputs for the same input, which can cause problems in applications where reliability is key

75
New cards

Agents

software components or entities designed to perform specific actions or tasks autonomously or semi-autonomously, based on predefined rules or algorithms

task coordination, reporting and logging, scalability and concurrency, integration and communication

76
New cards

Factors to select appropriate generative AI models

model types, performance requirements, capabilities, constraints, compliance

77
New cards

Business metrics for generative AI

User satisfaction, accuracy, customer lifetime value, average revenue per user, cross-domain performance, conversion rate, and efficiency

78
New cards

Context window

a model property that describes the number of tokens that the model can accept in the context

79
New cards

Temperature

controls the randomness or creativity of the model's output (set between 0 and 1)

Low temperature: Outputs are more conservative, repetitive, and focused on the most likely responses.

High temperature: Outputs are more diverse, creative, and unpredictable, but might be less coherent or relevant

80
New cards

Top K

limits the number of words to the top k most probable words, regardless of their percent probabilities

Low top k: With a low setting, like 10, the model will only consider the 10 most probable words for the next word in the sequence

High top k: With a high top k setting, like 500, the model will consider the 500 most probable words for the next word in the sequence, regardless of their individual probabilities.

81
New cards

Top P

a setting that controls the diversity of the text by limiting the number of words that the model can choose from based on their probabilities (set between 0 and 1)

Low top p (e.g 0.250): model will only consider words that make up the top 25 percent of the total probability distribution (more focused and coherent)

High top p (e.g 0.99): model will consider a broad range of possible words for the next word in the sequence, because it will include words that make up the top 99 percent of the total probability distribution (diverse and creative)

82
New cards

Stop sequences

special tokens or sequences of tokens that signal the model to stop generating further output

particularly useful in tasks where the desired output length is variable or difficult to predict in advance

83
New cards

Retrieval Augmented Generation (RAG)

a natural language processing (NLP) technique that combines the capabilities of retrieval systems and generative language models to produce high-quality and informative text outputs

reduces hallucinations

business applications: Building intelligent question-answering systems, Expanding and enriching existing knowledge bases, Generating high-quality content

84
New cards

Fine tuning

a supervised learning process that involves taking a pre-trained model and adding specific, smaller datasets

85
New cards

Elements of a prompt

Instructions: This is a task for the large language model to do. It provides a task description or instruction for how the model should perform.

Context: This is external information to guide the model.

Input data: This is the input for which you want a response.

Output indicator: This is the output type or format.

86
New cards

Negative prompting

involves providing the model with examples or instructions about what it should not generate or do

used to guide the model away from producing certain types of content or exhibiting specific behaviors

87
New cards

Zero-shot prompting

a technique where a user presents a task to a generative model without providing any examples or explicit training for that specific task

relies on the model's general knowledge and capabilities to understand and carry out the task without any prior exposure, or shots, of similar tasks

88
New cards

Few-shot prompting

technique that involves providing a language model with contextual examples to guide its understanding and expected output for a specific task

supplement the prompt with sample inputs and their corresponding desired outputs, effectively giving the model a few shots or demonstrations to condition it for the requested task.

89
New cards

Chain-of-thought prompting

a technique that divides intricate reasoning tasks into smaller, intermediary steps (use phrase “think step by step” to initiate)

recommended to use CoT prompting when the task requires multiple steps or a series of logical reasoning

90
New cards

Prompt tuning

the actual prompt text is replaced with a continuous embedding backer that is optimized during training

technique helps the prompt to be fine-tuned for a specific task, while keeping the rest of the model parameters frozen

91
New cards

Latent space

the encoded knowledge of language in LLMs or the stored patterns of data that capture relationships and reconstruct the language from the patterns when prompted

92
New cards

Amazon Bedrock Guardrails

provide safety and privacy controls to manage interactions in your generative AI applications

define threshold for content filters for hate, insults, sexual content, or violence

93
New cards

Poisoning

refers to the intentional introduction of malicious or biased data into the training dataset of a model

94
New cards

Hijacking and prompt injection

refer to the technique of influencing the outputs of generative models by embedding specific instructions within the prompts themselves

hijack the model's behavior and make it produce outputs that align with the attacker's intentions, such as generating misinformation or running malicious code

95
New cards

Exposure

refers to the risk of exposing sensitive or confidential information to a generative model during training or inference

96
New cards

Prompt leaking

refers to the unintentional disclosure or leakage of the prompts or inputs (regardless of whether these are protected data or not) used within a model

97
New cards

Jailbreaking

refers to the practice of modifying or circumventing the constraints and safety measures implemented in a generative model or AI assistant to gain unauthorized access or functionality

attempts involve crafting carefully constructed prompts or input sequences that aim to bypass or exploit vulnerabilities in the AI system's filtering mechanisms or constraints. The goal is to "break out" of the intended model limitations

98
New cards

Reinforcement learning from human feedback (RLHF)

an ML technique that incorporates human feedback to help models learn more efficiently

uses a supervised fine-tuning of a language model and a reward model

benefits: enhances AI performance, supplies complex training parameters, increases user satisfaction

99
New cards

Parameter-efficient fine-tuning (PEFT)

a process and set of techniques that freeze or preserve the parameters and weights of the original LLM and fine-tune a small number of task-specific adaptor layers and parameters

100
New cards

Low-Rank Adaptation (LoRA)

a popular PEFT technique that also preserves or freezes the original weights of the foundation model and creates new trainable low-rank matrices into each layer of a transformer architecture