FBLA - Data Science & AI

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/98

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

99 Terms

1
New cards

Data Science

The field that uses data, stats, and computation to extract insights and support decisions.

2
New cards

Data Conversion

Changing data from one numerical system or format to another (binary ↔ decimal ↔ hex)

3
New cards

Dependence Methods

Statistical techniques showing how variables rely/depend

on each other.

4
New cards

Interdependence Methods

Techniques examining relationships without a dependent variable. (Inter = Without)

5
New cards

Machine Learning Algorithm

A procedure computers use to learn patterns from data.

6
New cards

Learning Function

The rule a machine learning model uses to update itself based on errors.

7
New cards

Training Dataset

Data used to teach a model

8
New cards

Validation Dataset

Data used to fine-tune a model during training.

9
New cards

Test Dataset

Data used to measure final model performance.

10
New cards

Deep Learning

Machine learning using layered neural networks to learn complex patterns.

11
New cards

Structured Data

Data organized in rows and columns (like spreadsheets and SQL tables).

12
New cards

Unstructured Data

Data without a fixed format, such as text, images, audio, or video.

13
New cards

Numeric Data

Data measured or counted using numbers.

14
New cards

Categorical Data

Data grouped by labels or categories.

15
New cards

Binary System

Base-2 number system using 0 and 1.

16
New cards

Decimal System

Base-10 number system using digits 0-9.

17
New cards

Hexadecimal System

Base-16 system using 0-9 and A-F, often used in computing.

18
New cards

Data Sources

Places where data originates/comes from (the source)

(ex. sensors, surveys, transactions, social media).

19
New cards

Data Wrangling

Cleaning and organizing raw data so it can be analyzed. (Wrangling = cleaning/organizing)

20
New cards

Data Transformation

Converting/transforming data into a more useful or usable structure.

21
New cards

Data Science Process

Ask questions → collect → clean → analyze → model → interpret → communicate.

defining the problem, collecting and cleaning data, exploring and analyzing it to find patterns, modeling and evaluating solutions, and finally communicating the results

22
New cards

Probability

The chance an event happens; found by favorable outcomes ÷ total outcomes.

23
New cards

Sample Space

All possible outcomes in a scenario.

24
New cards

Random Variable

A variable whose value depends on random events.

25
New cards

Mean

Average of a dataset; add values and divide by number of values.

26
New cards

Median

Middle value when data is sorted; if even count, average the two middle numbers.

27
New cards

Mode

Value that appears most frequently.

28
New cards

Range

Largest value minus smallest value.

29
New cards

Variance

Measures spread of data from the mean; standard deviation squared.

30
New cards

Standard Deviation

Shows how spread out data is from the mean; square root of variance.

31
New cards

Expected Value

Long-term average outcome of a random variable.

32
New cards

Correlation

How strongly two variables move together (positive, negative, or none).

33
New cards

Regression

Predicting outcomes using relationships in data (line of best fit).

34
New cards

Outlier

Data point far from others that can distort results.

35
New cards

Gaussian (Normal) Distribution

Bell-shaped curve where values cluster around the mean.

36
New cards

Covariance

Shows whether two variables increase or decrease together.

37
New cards

Discrete Variable

A variable with countable values (like # of students).

38
New cards

Continuous Variable

A variable with infinite possible values within a range (height, time).

39
New cards

Data Visualization

Displaying data visually to reveal trends or patterns.

40
New cards

Bar Graph

Used to compare categories.

<p>Used to compare categories.</p>
41
New cards

Histogram

Shows distribution of numerical data.

<p>Shows distribution of numerical data.</p>
42
New cards

Line Graph

Tracks trends or changes over time.

<p>Tracks trends or changes over time.</p>
43
New cards

Scatter Plot

Shows relationships or correlations between two numeric variables.

<p>Shows relationships or correlations between two numeric variables.</p>
44
New cards

Box Plot

Shows spread, quartiles, and outliers in large datasets.

<p>Shows spread, quartiles, and outliers in large datasets.</p>
45
New cards

Pie Chart

Shows parts of a whole as percentages.

<p>Shows parts of a whole as percentages.</p>
46
New cards

Heatmap

Shows intensity or correlations using color patterns.

<p>Shows intensity or correlations using color patterns.</p>
47
New cards

Multivariate Data

Data involving more than two variables.

48
New cards

Linear Regression

Predicts a numeric value using a best-fit line.

49
New cards

Multiple Linear Regression

Predicts outcomes using several variables.

50
New cards

Logistic Regression

Predicts categories (yes/no, pass/fail).

(logical statement, like true/false)

51
New cards

Data Cleaning

Fixing errors, removing duplicates, and handling missing values.

52
New cards

Data Quality Issues

Problems like incomplete data, errors, duplicates, or noise.

53
New cards

K-Means

Clustering algorithm that groups data by similarity.

54
New cards

Decision Tree

Model that predicts outcomes by splitting data into branches.

<p>Model that predicts outcomes by splitting data into branches.</p>
55
New cards

Python

Main programming language for AI and data science due to simplicity and powerful libraries.

56
New cards

NumPy

Library for fast math and array operations.

57
New cards

Pandas

Library for cleaning, analyzing, and organizing data in dataframes.

58
New cards

Matplotlib

Library for visualizing data with graphs.

(mat = math = data) (plot = graph)

59
New cards

Seaborn

Python visualization library built on top of Matplotlib for cleaner graphs.

60
New cards

TensorFlow

Framework for building and training neural networks.

61
New cards

PyTorch

Deep learning framework popular for research and model development.

62
New cards

SQL

Language for storing, querying, and managing data in databases.

63
New cards

Jupyter Notebook

Environment to write code, visualize results, and explain analysis.

64
New cards

Relational Database

Database using tables linked by relationships (keys).

65
New cards

Google Cloud

Platform for running AI models, storing data, and deploying applications.

66
New cards

IBM Cloud

Cloud platform offering AI tools, storage, and data processing services.

67
New cards

R

Programming language used for statistics and data visualization.

68
New cards

Artificial Intelligence

Systems that mimic human intelligence to perform tasks like decision-making and pattern recognition.

69
New cards

Generative AI

AI that creates content such as text, images, and code.

70
New cards

Limitations of Generative AI

Can hallucinate, make false claims, and repeat bias.

71
New cards

Uses of Generative AI

Chatbots, creative tools, support in healthcare, summarization, coding help.

72
New cards

Computer Vision

AI that interprets visual data from images or videos.

73
New cards

Natural Language Processing (NLP)

AI that understands and generates human language.

74
New cards

Speech Recognition

Converts spoken words into text.

75
New cards

Robotics

AI used in machines that interact with the physical world.

<p>AI used in machines that interact with the physical world.</p>
76
New cards

Large Language Models (LLMs)

Neural networks trained on massive text datasets for reasoning and generating responses.

(Ex. Chatgpt)

77
New cards

LLM Capabilities

Summarization, question answering, reasoning, code generation.

78
New cards

Perception

AI converting raw data (images, audio, text) into useful information.

79
New cards

Knowledge Representation

How AI stores facts and relationships internally.

80
New cards

Reasoning

AI using stored knowledge to make decisions and predictions.

81
New cards

Inference

AI reaching conclusions without being explicitly told.

82
New cards

Knowledge Graph

Web of connected facts showing relationships between things.

83
New cards

Symbolic Reasoning

Solving problems using rules and logic.

84
New cards

Predicate Logic

Logical statements used to express facts and relationships.

85
New cards

Bayesian Network

Graph showing probabilistic relationships between variables.

86
New cards

Directed Acyclic Graph (DAG)

Graph used in Bayesian networks that has no loops.

87
New cards

Data Privacy

Protecting personal data from misuse.

<p>Protecting personal data from misuse.</p>
88
New cards

Data Security

Safeguarding data from unauthorized access or attacks.

89
New cards

Ethical AI

Designing AI that is fair, transparent, and does not harm people.

90
New cards

Algorithmic Bias

When AI produces unfair outcomes due to biased data.

91
New cards

Transparency

Explaining how an AI system makes decisions.

<p>Explaining how an AI system makes decisions.</p>
92
New cards

Accountability

Identifying who is responsible for AI outcomes.

93
New cards

Consent

Users must agree before their data is used.

94
New cards

AI Surveillance

AI systems that monitor people, raising privacy concerns.

95
New cards

Security Risks of LLMs

Data leaks, unauthorized training, privacy issues.

96
New cards

LLM Credibility Issues

Hallucinations, misinformation, false confidence.

97
New cards

AI Dilemmas

Ethical problems caused by AI decisions (e.g., self-driving choices).

98
New cards

Logic-Based Reasoning

Uses defined rules to reach conclusions.

99
New cards

Probability-Based Reasoning

Uses likelihood and uncertainty instead of strict rules.