Domain #1

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/73

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:29 AM on 6/16/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

74 Terms

1
New cards

What is a relational database?

A structured database that stores data in tables (rows and columns) with defined relationships using keys (primary/foreign keys). It supports SQL and enforces schema consistency.

2
New cards

What is a non-relational (NoSQL) database?

A flexible database that does not require fixed schemas; stores data in formats like document, key-value, graph, or columnar structures. Optimized for scalability and unstructured/semi-structured data.

3
New cards

What is a .csv file used for?

Stores tabular data in plain text using commas as delimiters; commonly used for data import/export.

4
New cards

What is a .json file used for?

Stores semi-structured data using key-value pairs and nested objects; commonly used in APIs and web applications.

5
New cards

What is a .xlsx file?

Microsoft Excel file used for spreadsheets, calculations, and structured tabular data with formulas.

6
New cards

What is a .txt file?

Plain text file with no formatting; used for logs, notes, or raw data.

7
New cards

What is a .dat file?

Generic data file that can store structured or unstructured data depending on the application.

8
New cards

What is a .jpg file in data contexts?

Binary image file; represents unstructured data used in machine learning or image processing.

9
New cards
10
New cards

What is structured data?

Highly organized data stored in rows/columns with a fixed schema (e.g., SQL tables).

11
New cards

What is semi-structured data?

Data that does not follow a rigid schema but has organizational markers like tags or keys (e.g., JSON, XML).

12
New cards

What is unstructured data?

Data with no predefined format (e.g., images, videos, emails, social media posts).

13
New cards
14
New cards

What is a fact table?

A table that stores measurable metrics (e.g., sales, revenue, quantity) used for analysis.

15
New cards

What is a dimensional table?

A table that stores descriptive attributes (e.g., customer name, product type, date).

16
New cards

What is a schema?

The blueprint of a database defining tables, fields, relationships, and constraints.

17
New cards

What is a slowly changing dimension (SCD)?

A dimension that tracks historical changes in data attributes over time (e.g., customer address changes).

18
New cards

What is a bridge table?

A table used to resolve many-to-many relationships between two entities.

19
New cards

What is nested JSON data?

JSON objects contained within other objects, allowing hierarchical data representation.

20
New cards
21
New cards

What is a string data type?

Text data (char, varchar, nvarchar) used for names, descriptions, and labels.

22
New cards

What is a numeric data type?

Data used for calculations (integers, decimals, floats).

23
New cards

What is a boolean data type?

Logical data type representing True/False values.

24
New cards

What is a datetime data type?

Stores date and time values, often used for timestamps.

25
New cards

What is a NULL value?

Represents missing, unknown, or undefined data.

26
New cards

What is a GUID/UUID?

A globally unique identifier used to uniquely label records across systems.

27
New cards

What is a BLOB?

Binary Large Object used to store large binary files like images or multimedia.

28
New cards

What is a CLOB?

Character Large Object used to store large text data (documents, logs).

29
New cards
30
New cards

What is a database as a data source?

A structured system storing organized data accessible via queries (SQL/NoSQL).

31
New cards

What is an API data source?

A system that allows applications to exchange data using endpoints (often JSON/XML).

32
New cards

What is website data?

Data scraped or collected from web pages (HTML content, tables, metadata).

33
New cards

What are logs as a data source?

System-generated records of events (errors, user activity, system performance).

34
New cards

What is a data warehouse?

Centralized repository for structured, historical data optimized for analytics.

35
New cards

What is a data lake?

Storage system for raw structured, semi-structured, and unstructured data.

36
New cards

What is a data mart?

A subset of a data warehouse focused on a specific business area.

37
New cards

What is a data silo?

Isolated data stored separately and not easily accessible across systems.

38
New cards
39
New cards

What is cloud computing?

Delivery of computing resources (storage, servers, databases) over the internet.

40
New cards

What are the major cloud providers?

AWS, Microsoft Azure, Google Cloud Platform (GCP).

41
New cards

What is a public cloud?

Cloud infrastructure shared across multiple organizations.

42
New cards

What is a private cloud?

Cloud infrastructure dedicated to a single organization.

43
New cards

What is a hybrid cloud?

Combination of public and private cloud environments.

44
New cards

What is object storage?

Storage method that stores data as objects with metadata (e.g., images, backups).

45
New cards

What is block storage?

Storage that splits data into fixed-size blocks for high-performance access.

46
New cards

What is file storage?

Traditional hierarchical storage using folders and files.

47
New cards

What is containerization?

Packaging applications with dependencies into isolated containers (e.g., Docker).

48
New cards
49
New cards

What is an IDE?

Software used to write and run code (e.g., VS Code, RStudio).

50
New cards

What are notebooks used for?

Interactive coding environments for data analysis (e.g., Jupyter Notebook).

51
New cards

What is Tableau / Power BI used for?

Business intelligence tools for dashboards and data visualization.

52
New cards

What is SQL used for?

Querying and managing relational databases.

53
New cards

What is Python used for in data analysis?

Data manipulation, automation, analysis, and machine learning.

54
New cards

What is pandas?

Python library for data manipulation and analysis (DataFrames).

55
New cards
56
New cards

What is generative AI?

AI that creates new content (text, images, code) based on learned patterns.

57
New cards

What is a large language model (LLM)?

AI model trained on large text datasets to understand and generate human language.

58
New cards

What is NLP?

Natural Language Processing—AI that interprets and processes human language.

59
New cards

What is deep learning?

Machine learning using neural networks with multiple layers.

60
New cards

What is RPA?

Robotic Process Automation—automates repetitive business tasks.

61
New cards
62
New cards

What is ETL?

Extract, Transform, Load—data is transformed before loading into a system.

63
New cards

What is ELT?

Extract, Load, Transform—data is loaded first, then transformed in the system.

64
New cards

What is data integration?

Combining data from multiple sources into a unified view.

65
New cards

What is data sampling?

Selecting a subset of data for analysis.

66
New cards

What is data aggregation?

Summarizing data (e.g., sum, average, count).

67
New cards

What is a join in SQL?

Combining tables based on related columns.

68
New cards

What is filtering in data queries?

Selecting rows that meet specific conditions.

69
New cards
70
New cards

What are missing values?

Data fields with no recorded value.

71
New cards

What is duplication in data?

Repeated records that can skew analysis.

72
New cards

What are outliers?

Data points significantly different from the rest.

73
New cards

What is data completeness?

Measure of whether all required data is present.

74
New cards

What is data validation?

Ensuring data meets defined rules and formats.