Data Analytics Week 2

0.0(0)
studied byStudied by 0 people
0.0(0)
linked notesView linked note
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/15

flashcard set

Earn XP

Description and Tags

These flashcards cover key vocabulary and concepts related to data management and analysis, including relational databases, big data analytics, ETL processes, and more.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

16 Terms

1
New cards

Relational Databases

Databases that use a structured format based on tables with rows and columns, allowing for data normalization.

2
New cards

Big Data Analytics

Analysis of large and complex data sets that traditional data processing software cannot deal with.

3
New cards

ETL Process

Extraction, Transformation, and Loading; the process of moving data from source systems into a data warehouse.

4
New cards

SQL

Structured Query Language, used for managing and manipulating relational databases.

5
New cards

Normalization

The process of organizing data to minimize redundancy in a relational database.

6
New cards

OLTP

Online Transaction Processing; systems for managing transaction-oriented applications.

7
New cards

OLAP

Online Analytical Processing; systems designed to perform complex queries and analysis on data.

8
New cards

Sharding

The process of splitting data across multiple databases to improve performance and scalability.

9
New cards

Data Warehouse

A centralized repository for storing historical data for analysis and reporting.

10
New cards

Replication

The process of copying and maintaining database objects in multiple databases for fault tolerance.

11
New cards

Fragmentation

The method of dividing a database into smaller parts that can be stored in different physical locations.

12
New cards

Distributed Database

A database spread across multiple locations, allowing for better availability and performance.

13
New cards

Joining Tables

The process of combining rows from two or more tables based on a related column between them.

14
New cards

Data Mart

A subset of a data warehouse focused on a specific subject or department.

15
New cards

CAP Theorem

A principle that states a distributed data store can only guarantee two of the following three properties: Consistency, Availability, and Partition Tolerance.

16
New cards

Cloud Computing

The delivery of computing services over the internet, allowing for scalable resources and flexibility.