Data Analytics - Week 3

0.0(0)
studied byStudied by 0 people
0.0(0)
linked notesView linked note
full-widthCall with Kai
GameKnowt Play
New
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/14

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts related to distributed database management systems, focusing on relational and NoSQL databases, including essential definitions and characteristics.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

15 Terms

1
New cards

Relational Database Management System (RDBMS)

A database management system that is based on the relational model; it organizes data into tables with predefined schemas.

2
New cards

Analytics Hardware

Technology that scales out horizontally to support big data analysis, often not native to traditional relational databases.

3
New cards

ACID transactions

A set of properties (Atomicity, Consistency, Isolation, Durability) that guarantee reliable processing of database transactions.

4
New cards

Schema on Write

A database schema that requires a predefined structure before data can be written, typically associated with traditional RDBMS.

5
New cards

Schema on Read

Allows for data to be stored without predefined schemas, enabling quick writes and retrieval, common in NoSQL databases.

6
New cards

Distributed Database

A database that consists of data stored in multiple locations, ensuring system availability and redundancy.

7
New cards

Scaling

The capability of a database to expand its resources to accommodate increased load, typically classified as horizontal or vertical scaling.

8
New cards

CAP Theorem

A principle stating that in a distributed data store, it is impossible to simultaneously guarantee all three properties: Consistency, Availability, and Partition Tolerance.

9
New cards

Partition Tolerance

The ability of a system to continue operating despite network partitions that prevent some nodes from communicating.

10
New cards

Horizontal Scaling

Adding more machines to a system or database cluster to handle increased demand; preferable for distributed databases.

11
New cards

Replica Set

A group of MongoDB servers that maintain the same data set, ensuring redundancy and high availability.

12
New cards

MongoDB

A document-oriented NoSQL database that uses a flexible schema, allowing storage of semi-structured data in JSON-like documents.

13
New cards

Sharding

The process of distributing data across multiple machines to ensure efficiency and balance load in a database.

14
New cards

Data Normalization

A process in relational databases of organizing data to reduce redundancy and improve data integrity.

15
New cards

Key-Value Store

A type of NoSQL database that uses a simple key-value pair as the primary data model.