CS P3

0.0(0)
studied byStudied by 11 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/61

flashcard set

Earn XP

Description and Tags

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

62 Terms

1
New cards

Latency

The delay between a user's query and the chatbot's response. High latency can negatively impact user experience by making the chatbot seem slow

2
New cards

Linguistic nuances

the subtle differences or variations in meaning, tone, or expression within language

3
New cards

Architecture

underlying structure and components that enable it to understand and generate human

4
New cards

Natural Language Processing (NLP)

The field of AI focused on enabling machines to interpret understand and respond to human language by breaking down and analyzing text through methods like lexical analysis and semantic analysis.

5
New cards

Recurrent neural networks (RNNs)

Neural networks designed to handle sequential data with memory of previous inputs.

Consist of input, hidden, and output layers

Change using backpropagation through time(BPTT) (vanishing gradient problem making it harder to learn long-term dependencies)

6
New cards

Long short-term memory (LSTM)

Type of RNN that overcomes vanishing gradient problem (with specialised units for capturing long-term dependencies)

Uses a three gate mechanism (input gate, forget gate, and output gate) to selectively retain or forget information, allowing them to maintain a long-term dependencies

7
New cards

Transformer neural networks (transformer NNs).

Neural networks using a self-attention mechanism for parallel processing and better handling of long-term dependences

Uses a self attention mechanism to capture relationships between the words in a sequence, enabling better handling of long-term dependence in parallel processing of data

8
New cards

Self-attention mechanism

Mechanism used in transformer neural networks that allows the model to weigh the importance of different words in the input sequence when making predictions

9
New cards

Dataset

A collection of data used to train and evaluate machine learning models. A good dataset is diverse, high-quality, relevant and up-to-date

10
New cards

Synthetic data

Artificially generated data used to supplement real data, covering scenarios that may not be well-represented in the original dataset

11
New cards

Diversity

Inclusion of a wide range of topics, languages, and user intents in the dataset.

12
New cards

Data Cleaning

Removing irrelevant, duplicate, and noisy data to ensure the remaining data is accurate and well-labelled

13
New cards

Data Augmentation

Techniques used to increase the size and diversity of the dataset.

14
New cards

Bias Mitigation

Analyzing the dataset for potential biases and taking steps to address them, such as balancing the representation of different user groups and scenarios.

15
New cards

User Feedback

Incorporating feedback from users to identify and correct inaccuracies, continuously improving the dataset's quality.

16
New cards

Pre-processing

The initial step in data preparation, involving cleaning, transforming, and reducing data to improve its quality and make it suitable for training machine learning models

17
New cards

Ethical challenges

Data privacy and security

Bias and fairness

Accountability and responsibility

Transparency

Misinformation and manipulation

18
New cards

Data Privacy

Ensuring that user data is kept confidential and secure.

19
New cards

Bias

Systematic errors in data or algorithms that lead to unfair outcomes.

20
New cards

Accountability

Responsibility for the actions and decisions made by a system.

21
New cards

Transparency

Clarity and openness about how a system operates and makes decisions.

22
New cards

Misinformation

Incorrect or misleading information.

23
New cards

Improve Data Privacy and Security

Implement encryption and access controls to protect customer data handled by the chatbot.

24
New cards

Improve Bias and Fairness

Use diverse training data and regular bias audits to ensure the chatbot provides fair service to all customers.

25
New cards

Improve Accountability and Responsibility

Establish clear guidelines that define who is responsible for the chatbot's actions and ensuring adherence to ethical standards.

26
New cards

Improve Transparency

Provide explanations for the chatbot's responses and educating customers on how the chatbot works.

27
New cards

Improve Misinformation and Manipulation

Integrate fact-checking mechanisms to ensure the accuracy of the information provided by the chatbot

28
New cards

Critical Path

The shortest sequence of models required to process a query.

29
New cards

Critical path optimisation

A method used to improve response time by identifying and eliminating unnecessary processing steps, helping chatbots respond more quickly and efficiently.

30
New cards

Natural language understanding (NLU)

A component of NLP focused on understanding the user's input by analysing linguistic features and context.

31
New cards

Lexical analysis

1.The process of breaking down text into individual words and sentences, identifying parts of speech, and preparing it for further processing

32
New cards

Syntactic analysis (parsing)

2.Analysing the grammatical structure of a sentence, identifying the relationships between words and phrases

33
New cards

Semantic analysis

3.The process of understanding the meaning of words and sentences, going beyond the surface-level structure to interpret the underlying concepts

34
New cards

Discourse integration

4.A stage in NLP where the meaning of a sentence is integrated with the larger context of the conversation to generate coherent and contextually appropriate responses

35
New cards

Pragmatic analysis

5.Analysing the social, legal, and cultural context of a sentence to understand its intended meaning and implications.

36
New cards

User Satisfaction

A measure of how well a chatbot meets user needs and expectations, influenced by response speed, accuracy, and relevance to the user's emotional tone and query

37
New cards

Attention-retention

The ability of a chatbot to keep users engaged by maintaining relevance and providing timely, helpful responses, preventing users from losing interest

38
New cards

Pattern Recognition

The ability of chatbots to identify recurring language patterns, which helps improve their understanding of different linguistic structures and user intents

39
New cards

Hyperparameters

learning rate and the number of hidden layers

40
New cards

Hyperparameter Tuning

The process of optimizing the parameters that govern the training of a machine learning model (e.g., learning rate, number of layers) to improve its performance.

41
New cards

Backpropagation through time (BPTT)

A variant of the backpropagation algorithm used for training Recurrent Neural Networks (RNNs), where gradients are propagated backward through time to update the weights.

42
New cards

Vanishing Gradient

A problem in training deep neural networks where gradients become very small, making it difficult to update the weights effectively and learn long

43
New cards

Weights

Parameters in a neural network that are adjusted during training to minimize the loss function and improve the model's predictions.

44
New cards

Biases

Systematic errors in data or algorithms that can lead to unfair or discriminatory outcomes

45
New cards

Confirmation bias

A type of bias where data is skewed towards a particular viewpoint or expected outcome, often reinforcing pre

46
New cards

Historical bias

A bias that occurs when training data reflects outdated or historical patterns that may not be relevant to current scenarios, potentially leading to inaccurate predictions

47
New cards

Labelling bias

Occurs when the labels applied to training data are subjective, inaccurate, or incomplete, affecting the model's ability to learn correctly

48
New cards

Linguistic bias

Bias resulting from training data that favors certain dialects, vocabularies, or linguistic styles, potentially disadvantaging users who use different linguistic forms

49
New cards

Sampling bias

Occurs when the training dataset is not representative of the entire population, leading to a model that performs well for certain groups but poorly for others

50
New cards

Selection bias

Bias introduced when the training data is not randomly selected but chosen based on specific criteria, potentially missing important variations

51
New cards

Bag of words

A model used in natural language processing where text is represented as an unordered collection of words, disregarding grammar and word order but keeping track of word frequency

52
New cards

Processing power

the computational capacity of a system to perform tasks efficiently and quickly. crucial for handling the complex algorithms and large datasets required for natural language processing (NLP), machine learning, and generating real-time responses. ensures that a chatbot can function smoothly, provide quick responses, and handle a high volume of queries simultaneously

53
New cards

Central Processing Unit (CPU)

The primary component responsible for executing instructions. While capable, CPUs may struggle with the high parallel processing demands of advanced AI tasks.

54
New cards

Graphical processing units (GPUs)

Specialised hardware designed to accelerate the processing of large

55
New cards

Tensor processing units (TPUs)

Custom hardware developed by Google specifically designed to accelerate machine learning workloads, particularly for deep learning models.

56
New cards

Cloud Computing

Utilizing cloud services (e.g., AWS, Google Cloud, Azure) provides scalable resources that can be adjusted based on demand. This flexibility ensures that processing power can be scaled up or down as needed.

57
New cards

Distributed Computing

Distributing tasks across multiple machines to parallelize processing, reducing latency and improving efficiency.

58
New cards

Large language model (LLM)

Advanced neural networks trained on vast amounts of text data to understand and generate human

59
New cards

Memory cell state

In LSTM networks, the memory cell state represents the information that flows through the network, controlled by input, forget, and output gates to manage long

60
New cards

Loss function

A mathematical function that measures the difference between the predicted output of a model and the actual target output, guiding the optimization process during training.

61
New cards

Self-attention mechanism

Mechanism within neural networks that allows the network to weigh the importance of different input elements when making predictions or decisions

62
New cards

Deep Learning

A subset of machine learning involving neural networks with many layers (deep neural networks) that can learn complex patterns in large datasets.