These flashcards cover key vocabulary and concepts in data management and analysis, including relational databases, big data analytics, ETL processes, data warehousing, and distributed databases.
Relational Databases
Databases that store data in tables of rows and columns, with relationships between tables expressed through keys; this structure supports data normalization.
Big Data Analytics
Analysis of data sets too large or complex for traditional data processing software to handle efficiently.
ETL Process
Extract, Transform, Load; the process of extracting data from source systems, transforming it into a consistent format, and loading it into a data warehouse.
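The three ETL stages can be sketched as below. This is a minimal illustration, not a production pipeline: the CSV content, the `orders` table, and the function names are all assumptions made up for the example, and an in-memory SQLite database stands in for the warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source data standing in for an export from a source system.
SOURCE_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,20.00
3,carol,not_a_number
"""

def extract(text):
    """Extract: read raw rows from the CSV source."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim and lowercase names, parse amounts, skip unparseable rows."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # drop rows whose amount is not numeric
        clean.append((int(row["order_id"]), row["customer"].strip().lower(), amount))
    return clean

def load(conn, rows):
    """Load: write the cleaned rows into the (stand-in) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()

warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_CSV)))
loaded = warehouse.execute(
    "SELECT order_id, customer, amount FROM orders ORDER BY order_id"
).fetchall()
print(loaded)
```

Keeping each stage in its own function makes it possible to test the transformation rules without touching the source or the target.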
SQL
Structured Query Language; the standard language for defining, querying, and modifying data in relational databases.
Normalization
The process of organizing data to minimize redundancy in a relational database.
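As a rough sketch of the idea, the snippet below splits a flat list of order records, where each customer's city is repeated on every row, into a customer lookup and an orders list that refers to customers by key. The record fields and the `normalize` helper are hypothetical, chosen only for illustration.

```python
# Denormalized rows: the customer's city is repeated on every order.
flat_orders = [
    {"order_id": 1, "customer": "alice", "city": "Lisbon", "item": "pen"},
    {"order_id": 2, "customer": "alice", "city": "Lisbon", "item": "ink"},
    {"order_id": 3, "customer": "bob", "city": "Oslo", "item": "pad"},
]

def normalize(rows):
    """Split flat rows into a customers table and an orders table."""
    customers = {}  # customer name -> attributes, stored once per customer
    orders = []     # orders keep only a reference to the customer
    for row in rows:
        customers[row["customer"]] = {"city": row["city"]}
        orders.append(
            {"order_id": row["order_id"], "customer": row["customer"], "item": row["item"]}
        )
    return customers, orders

customers, orders = normalize(flat_orders)
print(customers)
print(orders)
```

After the split, changing a customer's city means updating one entry instead of every order row.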
OLTP
Online Transaction Processing; systems that handle large numbers of short, concurrent transactions such as inserts, updates, and deletes.
OLAP
Online Analytical Processing; systems designed to perform complex queries and analysis on data.
Sharding
The process of horizontally partitioning data across multiple databases (shards), each holding a subset of the rows, to improve performance and scalability.
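One common way to decide which shard holds a record is to hash its key. The sketch below assumes four shards and uses plain dictionaries as stand-ins for separate databases; `NUM_SHARDS`, `shard_for`, `put`, and `get` are illustrative names, not part of any particular system.

```python
import hashlib

NUM_SHARDS = 4  # hypothetical shard count

def shard_for(key: str, num_shards: int = NUM_SHARDS) -> int:
    """Map a key to a shard index using a stable hash."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards

# Each dict stands in for a separate database.
shards = {i: {} for i in range(NUM_SHARDS)}

def put(key, value):
    """Store the value on the shard that owns the key."""
    shards[shard_for(key)][key] = value

def get(key):
    """Read the value from the shard that owns the key."""
    return shards[shard_for(key)].get(key)

for user_id in ("user-1", "user-2", "user-3", "user-4", "user-5"):
    put(user_id, {"id": user_id})

print(get("user-3"))
```

A simple modulo scheme like this remaps most keys when the shard count changes, which is why some systems use consistent hashing instead.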
Data Warehouse
A centralized repository for storing historical data for analysis and reporting.
Replication
The process of copying and maintaining database objects in multiple databases for fault tolerance.
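A minimal sketch of synchronous primary-to-replica replication follows. The `Primary` and `Replica` classes are hypothetical and hold data in memory; real systems also deal with network failures, ordering, and replication lag, none of which is modeled here.

```python
class Replica:
    """Holds a copy of the data and applies changes sent by the primary."""

    def __init__(self):
        self.data = {}

    def apply(self, key, value):
        self.data[key] = value


class Primary:
    """Accepts writes and forwards each one to every replica."""

    def __init__(self, replicas):
        self.data = {}
        self.replicas = replicas

    def write(self, key, value):
        self.data[key] = value
        for replica in self.replicas:
            replica.apply(key, value)  # synchronous copy to each replica


replicas = [Replica(), Replica()]
primary = Primary(replicas)
primary.write("account:1", 100)
primary.write("account:1", 80)

# If the primary is lost, any replica still holds the latest value.
print(replicas[0].data)
```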
Fragmentation
The method of dividing a database into smaller parts that can be stored in different physical locations.
Distributed Database
A database spread across multiple locations, allowing for better availability and performance.
Joining Tables
The process of combining rows from two or more tables based on a related column between them.
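The example below shows an inner join using Python's built-in `sqlite3` module. The `customers` and `orders` tables and their contents are made up for illustration; the order with `customer_id = 3` has no matching customer and so does not appear in the result.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, item TEXT);
INSERT INTO customers VALUES (1, 'alice'), (2, 'bob');
INSERT INTO orders VALUES (10, 1, 'pen'), (11, 1, 'ink'), (12, 3, 'pad');
""")

# Inner join: keep only rows where orders.customer_id matches customers.id.
joined = conn.execute("""
SELECT c.name, o.item
FROM customers AS c
JOIN orders AS o ON o.customer_id = c.id
ORDER BY o.id
""").fetchall()

print(joined)
```

A `LEFT JOIN` from `customers` would also return bob, paired with a NULL item, since he has no orders.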
Data Mart
A subset of a data warehouse focused on a specific subject or department.
CAP Theorem
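A principle stating that a distributed data store can guarantee at most two of three properties at the same time: Consistency, Availability, and Partition Tolerance. In practice, when a network partition occurs, the system must choose between consistency and availability.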
Cloud Computing
The delivery of computing services over the internet, allowing for scalable resources and flexibility.