AWS Certified AI Practitioner

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/194

There's no tags or description

Looks like no tags are added yet.

Last updated 11:35 PM on 6/15/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

195 Terms

New cards

Artificial Intelligence

encompasses the development of intelligent systems capable of performing tasks that typically require human intelligence, such as perception, reasoning, learning, problem-solving, and decision making

New cards

Machine Learning

type of artificial intelligence for understanding and building methods that make it possible for machines to learn

New cards

Deep learning

subfield of machine learning that uses artificial neural networks to analyze data and learn complex patterns

New cards

Generative AI

subset of deep learning because it can adapt models built using deep learning, without retraining or fine tuning

capable of generating new data based on the patterns and structures learned from training data

New cards

Neural networks

lots of connected nodes that are organized into layers (input layer, hidden layer(s), and output layer)

when data is inputted, it figures out how to identify patterns by adjusting the connections between its nodes

New cards

Computer vision

a field of AI that makes it possible for computers to interpret and understand digital images and videos

ex: image classification, object detection, and image segmentation

New cards

Natural language processing (NLP)

branch of AI that deals with the interaction between computers and human languages

ex: text classification, sentiment analysis, machine translation, and language generation

New cards

Model

systems built using neural networks, system resources, data, and prompts, all working together to process input and produce outputs

New cards

Foundation model

models that are pretrained on internet-scale data

adapt a single FM to perform multiple tasks such as text generation, text summarization, information extraction, image generation, chatbot, and question answering

New cards

Algorithm

a set of instructions to be followed in calculations or other operations

New cards

Prompts

specific set of inputs to guide LLMs to generate an appropriate output or completion

New cards

Inference

process of using a trained model to generate results based on new input data (e.g., turning your prompt into an answer, image, or prediction)

New cards

Completion

the output the model produces in response to your prompt; often a continuation or expansion of the given input

New cards

Training

the process of feeding an AI model curated data sets to evolve the accuracy of its output

New cards

Labeled data

dataset where each instance/example is accompanied by a label or target variable that represents the desired output or classification

New cards

Unlabeled data

dataset where the instance/example do not have any associated labels or target variables (only input feature without corresponding output/classification)

New cards

Structured data

data that is organized and formatted in a predefined manner, typically in the form of tables or databases with rows and columns

New cards

Tabular data

structured data stored in spreadsheets, databases, or CSV files, with rows representing instances and columns representing features or attributes

New cards

Time-series data

structured data that consists of sequences of values measured at successive points in time, such as stock prices, sensor readings, or weather data

New cards

Semi-structured data

data that can have different attributes or missing attributes, such as a text file that contains JSON

New cards

Unstructured data

data that lacks a predefined structure or format, such as text, images, audio, and video

New cards

Text data

unstructured data that includes documents, articles, social media posts, and other textual data

New cards

Image data

unstructured data that includes digital images, photographs, and video frames

New cards

Inferencing

using the information that a model has learned to make predictions or decisions

New cards

Batch inferencing

analyzes a large amount of data all at once to provide a set of results

suitable for offline processing when data can be processed in batches. batch transform can support processing times of days therefore highest latency

New cards

Real-time inferencing

computer has to make decisions quickly, in response to new information as it comes in

real-time inference offers the lowest latency requirements because of the 60-second processing times

New cards

Asynchronous inferencing

can queue incoming requests for inference processing

asynchronous inference provides moderate latency requirements because of the processing times of up to 1 hour

New cards

Bias

model is missing important features of the datasets

measured by the difference between the expected predictions of the model and the true values we are trying to predict

New cards

Fairness

the equitable and impartial treatment of individuals, or data subjects, by AI systems

New cards

Fit

the process of adjusting the parameters of a model to best capture the patterns and relationships in the input data

New cards

Large language model (LLM)

powerful models that can understand and generate human-like text

use tokens, embeddings, and vectors to capture complex relationships in language

New cards

Tokens

basic units of text that the model processes (e.g. words, phrases, or characters) to provide standardization of input data

New cards

Vector

list of numbers representing data (such as text or an image) in a mathematical form the model can work with

New cards

Vector database

collection of data stored as mathematical representations

store structured and unstructured data, such as text or images with the vector embeddings

New cards

Embeddings

numerical representations of tokens, where each token is assigned a vector that captures its meaning and relationship with other tokens

a numerical vectorized representation of any entity that captures the meaning or semantic relationships of data

New cards

In-context learning

giving the model examples or extra information within the same prompt (inside the context window) so it can learn the task or style on the fly without retraining

New cards

Supervised learning

training on labeled data

learn a mapping function that can predict the output for new, unseen input data

New cards

Unsupervised learning

learn from unlabeled data

discover inherent patterns, structures, or relationships within the input data

New cards

Reinforcement learning

give only a performance score and portion of labeled training data

learns from feedback (rewards or penalties) to improve its decision-making over time

New cards

Classification model

a supervised learning technique used to assign labels or categories to new, unseen data instances based on a trained model

Use-cases: fraud detection, image classification, customer retention, and diagnostics

New cards

Regression model

a supervised learning technique used for predicting continuous or numerical values based on one or more input variable

Use-cases: advertising popularity prediction, weather forecasting, market forecasting, estimating life expectancy, population growth prediction

New cards

Clustering

an unsupervised learning technique that groups data into different clusters based on similar features or distances between the data point to better understand the attributes of a specific cluster

Use-cases: customer segmentation, targeted marketing, and recommended systems

New cards

Dimensionality reduction

an unsupervised learning technique used to reduce the number of features or dimensions in a dataset while preserving the most important information or patterns

Use-cases: big data visualization, meaningful compression, structure discovery, feature elicitation

New cards

Data collection

process of gathering and measuring information on variables of interest

should accurately reflect the diverse perspectives and experiences required for the use case of the AI system

New cards

Exploratory data analysis (EDA)

an analysis approach that identifies general patterns in the data

New cards

Data pre-processing

preprocess the data to ensure it is accurate, complete, and unbiased

techniques such as data cleaning, normalization, and feature selection can help to eliminate biases in the dataset

New cards

Feature engineering

the process of creating, transforming, extracting, and selecting variables from data

New cards

Model training

the process of training an ML algorithm with adequate training data to demonstrate correlation between the outcome and the influencing variables

New cards

Hyperparameter tuning

the problem of choosing a set of optimal hyperparameters for a learning algorithm

hyperparameter: a parameter whose value is used to control the learning process, which must be configured before the process starts

New cards

Model evaluation

the process of using different evaluation metrics to understand a machine learning model's performance, as well as its strengths and weaknesses

New cards

Model deployment

the process of putting machine learning models into production

New cards

Model monitoring

Ensures the model is maintaining a desired level of performance through early detection and mitigation

New cards

MLOps

refers to the practice of operationalizing and streamlining the end-to-end machine learning lifecycle from model development and deployment to monitoring and maintenance

New cards

Accuracy

model performance metric that measures the percentage of correct predictions or classifications

New cards

Area Under the ROC Curve

represents the probability that the model, if given a randomly chosen positive and negative example, will rank the positive higher than the negative

New cards

F1 score

the harmonic mean of precision and recall

New cards

Precision

proportion of positive predictions that are actually correct

New cards

Recall

proportion of correct sets that are identified as positive

New cards

Prompt engineering

focuses on developing, designing, and optimizing prompts to enhance the output of FMs for your needs

selects appropriate words, phrases, sentences, punctuation, and separator characters to effectively use LLMs for a wide variety of applications

New cards

Multi-modal models

can process and generate multiple modes of data simultaneously

learn how different modalities like images and text are connected and can influence each other

use-cases: automating video captioning, creating graphics from text instructions, answering questions more intelligently by combining text and visual info

New cards

Diffusion models

deep learning architecture system that starts with pure noise/random data, gradually adding more meaningful information to end with a clear and coherent output

New cards

Forward diffusion

system gradually introduces a small amount of noise to an input image until only the noise is left over

New cards

Reverse diffusion

noisy image is gradually introduced to denoising until a new image is generated

New cards

Stable diffusion

does not use the pixel space of the image, uses a reduced definition latent space

New cards

Generative adversarial networks (GANs)

a class of generative models that uses two neural networks — a generator that creates synthetic data, and a discriminator that tries to tell real data from fake data. The two compete in a feedback loop until the generator produces data that is indistinguishable from real examples

New cards

Variational autoencoders (VAEs)

A type of generative model that learns to encode input data into a compressed latent representation and then decode it back to reconstruct the data. By sampling from the latent space, VAEs can generate new, similar data

New cards

Intrinsic analysis

can be applied to models with low complexity or simple relationships between the input variables and the predictions

high model interpretability -> lower model performance

New cards

Post hoc analysis

can be applied to simple relationship models and more complex models that capture nonlinear interactions

New cards

Foundation model lifecycle

Data selection (FMs required training on massive datasets from diverse sources)
Pre-training (use self-supervised learning - make use of structure to autogenerate labels)
- Can learn the meaning, context, and relationship of words in the dataset
Optimization
- Techniques like prompt engineering, retrieval-augmented generation (RAG), and fine-tuning on task-specific data
Evaluation (measured using appropriate metrics and benchmarks)
Deployment
Feedback and continuous improvement (collected from users, domain experts, or other stakeholders to identify areas for improvement, detect potential bias, and inform future iterations of the model)

New cards

Advantages of generative AI

Adaptability, responsiveness, simplicity, creativity and exploration, data efficiency, personalization, and scalability

New cards

Toxicity

can generate content that is inflammatory, offensive, or inappropriate

New cards

Hallucinations

model generates inaccurate responses that are not consistent with the training data

New cards

Interpretability

the access into a system so that a human can interpret the model’s output based on the weights and features

New cards

Nondeterminism

model might generate different outputs for the same input, which can cause problems in applications where reliability is key

New cards

Agents

software components or entities designed to perform specific actions or tasks autonomously or semi-autonomously, based on predefined rules or algorithms

task coordination, reporting and logging, scalability and concurrency, integration and communication

New cards

Factors to select appropriate generative AI models

model types, performance requirements, capabilities, constraints, compliance

New cards

Business metrics for generative AI

User satisfaction, accuracy, customer lifetime value, average revenue per user, cross-domain performance, conversion rate, and efficiency

New cards

Context window

a model property that describes the number of tokens that the model can accept in the context

New cards

Temperature

controls the randomness or creativity of the model's output (set between 0 and 1)

Low temperature: Outputs are more conservative, repetitive, and focused on the most likely responses.

High temperature: Outputs are more diverse, creative, and unpredictable, but might be less coherent or relevant

New cards

Top K

limits the number of words to the top k most probable words, regardless of their percent probabilities

Low top k: With a low setting, like 10, the model will only consider the 10 most probable words for the next word in the sequence

High top k: With a high top k setting, like 500, the model will consider the 500 most probable words for the next word in the sequence, regardless of their individual probabilities.

New cards

Top P

a setting that controls the diversity of the text by limiting the number of words that the model can choose from based on their probabilities (set between 0 and 1)

Low top p (e.g 0.250): model will only consider words that make up the top 25 percent of the total probability distribution (more focused and coherent)

High top p (e.g 0.99): model will consider a broad range of possible words for the next word in the sequence, because it will include words that make up the top 99 percent of the total probability distribution (diverse and creative)

New cards

Stop sequences

special tokens or sequences of tokens that signal the model to stop generating further output

particularly useful in tasks where the desired output length is variable or difficult to predict in advance

New cards

Retrieval Augmented Generation (RAG)

a natural language processing (NLP) technique that combines the capabilities of retrieval systems and generative language models to produce high-quality and informative text outputs

reduces hallucinations

business applications: Building intelligent question-answering systems, Expanding and enriching existing knowledge bases, Generating high-quality content

New cards

Fine tuning

a supervised learning process that involves taking a pre-trained model and adding specific, smaller datasets

New cards

Elements of a prompt

Instructions: This is a task for the large language model to do. It provides a task description or instruction for how the model should perform.

Context: This is external information to guide the model.

Input data: This is the input for which you want a response.

Output indicator: This is the output type or format.

New cards

Negative prompting

involves providing the model with examples or instructions about what it should not generate or do

used to guide the model away from producing certain types of content or exhibiting specific behaviors

New cards

Zero-shot prompting

a technique where a user presents a task to a generative model without providing any examples or explicit training for that specific task

relies on the model's general knowledge and capabilities to understand and carry out the task without any prior exposure, or shots, of similar tasks

New cards

Few-shot prompting

technique that involves providing a language model with contextual examples to guide its understanding and expected output for a specific task

supplement the prompt with sample inputs and their corresponding desired outputs, effectively giving the model a few shots or demonstrations to condition it for the requested task.

New cards

Chain-of-thought prompting

a technique that divides intricate reasoning tasks into smaller, intermediary steps (use phrase “think step by step” to initiate)

recommended to use CoT prompting when the task requires multiple steps or a series of logical reasoning

New cards

Prompt tuning

the actual prompt text is replaced with a continuous embedding backer that is optimized during training

technique helps the prompt to be fine-tuned for a specific task, while keeping the rest of the model parameters frozen

New cards

Latent space

the encoded knowledge of language in LLMs or the stored patterns of data that capture relationships and reconstruct the language from the patterns when prompted

New cards

Amazon Bedrock Guardrails

provide safety and privacy controls to manage interactions in your generative AI applications

define threshold for content filters for hate, insults, sexual content, or violence

New cards

Poisoning

refers to the intentional introduction of malicious or biased data into the training dataset of a model

New cards

Hijacking and prompt injection

refer to the technique of influencing the outputs of generative models by embedding specific instructions within the prompts themselves

hijack the model's behavior and make it produce outputs that align with the attacker's intentions, such as generating misinformation or running malicious code

New cards

Exposure

refers to the risk of exposing sensitive or confidential information to a generative model during training or inference

New cards

Prompt leaking

refers to the unintentional disclosure or leakage of the prompts or inputs (regardless of whether these are protected data or not) used within a model

New cards

Jailbreaking

refers to the practice of modifying or circumventing the constraints and safety measures implemented in a generative model or AI assistant to gain unauthorized access or functionality

attempts involve crafting carefully constructed prompts or input sequences that aim to bypass or exploit vulnerabilities in the AI system's filtering mechanisms or constraints. The goal is to "break out" of the intended model limitations

New cards

Reinforcement learning from human feedback (RLHF)

an ML technique that incorporates human feedback to help models learn more efficiently

uses a supervised fine-tuning of a language model and a reward model

benefits: enhances AI performance, supplies complex training parameters, increases user satisfaction

New cards

Parameter-efficient fine-tuning (PEFT)

a process and set of techniques that freeze or preserve the parameters and weights of the original LLM and fine-tune a small number of task-specific adaptor layers and parameters

100

New cards

Low-Rank Adaptation (LoRA)

a popular PEFT technique that also preserves or freezes the original weights of the foundation model and creates new trainable low-rank matrices into each layer of a transformer architecture