QUIZ Big Data and Data Analytics

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/21

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:39 AM on 5/18/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

22 Terms

1
New cards

Big Data

A massive volume of structured, semi-structured, and unstructured data characterized by high volume, high velocity, and high variety that traditional data processing methods struggle to handle efficiently

2
New cards

Big Data Analytics

The use of advanced analytic techniques against very large, diverse datasets from different sources ranging in size from terabytes to zettabytes

3
New cards

3 V's of Big Data

The three main characteristics of big data: Volume (the amount of data), Velocity (the speed at which data is generated), and Variety (the different types and sources of data)

4
New cards

Structured Data

Data that is organized in a predefined format, typically stored in relational databases and easily searchable

5
New cards

Unstructured Data

Data that does not have a predefined format or organization, such as social media posts, videos, and sensor data

6
New cards

Data Analytics

The process of examining data sets to uncover hidden patterns, correlations, trends, and other useful information to turn raw data into actionable insights

7
New cards

Descriptive Analytics

A type of big data analytics that involves easily readable and interpretable data used to create reports and visualize information such as company profits and sales, answering the question "what happened"

8
New cards

Diagnostics Analytics

A type of big data analytics that helps companies understand why a problem occurred by mining and recovering data to dissect issues and prevent future occurrences, answering the question "why it happened"

9
New cards

Predictive Analytics

A type of big data analytics that examines past and present data using AI, machine learning, and data mining to forecast future trends and outcomes, answering the question "what will happen"

10
New cards

Prescriptive Analytics

A type of big data analytics that provides solutions to problems by relying on AI and machine learning for data-driven risk management, answering the question "what should be done about it"

11
New cards

Batch Processing

A data processing method that examines large data blocks over time, useful when there is a longer turnaround time between collecting and analyzing data

12
New cards

Stream Processing

A data processing method that examines small batches of data at once to shorten the delay between data collection and analysis for quicker decision-making, though more complex and expensive

13
New cards

Data Mining

A big data analysis method that sorts through large datasets to identify patterns, relationships, and anomalies by creating data clusters

14
New cards

Data Lake

A storage system where raw or unstructured data that is too diverse or complex for a warehouse is assigned metadata and stored

15
New cards

Data Warehouse

A storage system for large amounts of data collected from many different sources, typically using predefined schemas

16
New cards

Hadoop

An open-source framework that stores and processes big data sets, capable of handling and analyzing both structured and unstructured data

17
New cards

Spark

An open-source cluster computing framework used for real-time processing and analyzing data

18
New cards

NoSQL Databases

Non-relational data management systems ideal for dealing with raw and unstructured data

19
New cards

Distributed Storage

Databases that can split data across multiple servers and have the capability to identify lost or corrupt data, such as Cassandra

20
New cards

Stream Analytics Tools

Systems that filter, aggregate, and analyze data that might be stored in different platforms and formats, such as Kafka

21
New cards

Data Integration Software

Programs that allow big data to be streamlined across different platforms such as MongoDB, Apache, Hadoop, and Amazon EMR

22
New cards

Dirty Data

Data that contains duplicates, errors, absences, conflicts, and inconsistencies that can obscure and mislead, creating flawed insights