Big Data Concepts and Analysis

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/18

flashcard set

Earn XP

Description and Tags

Flashcards covering key vocabulary and concepts related to Big Data, data usability, analysis methods, and data storage.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

19 Terms

1
New cards

Big Data

Extremely large sets of data that cannot be easily managed or analyzed using traditional tools.

2
New cards

Usable Data

Data that is clean, well-organized, and accessible, easily understood and processed.

3
New cards

Useful Data

Data that is relevant, timely, and appropriate for solving a specific problem or answering a question.

4
New cards

Structured Data

Organized data formatted into rows and columns, such as spreadsheets and databases.

5
New cards

Unstructured Data

Data that has no predefined format, like emails, images, and social media posts.

6
New cards

Data Extraction

The process of pulling specific, meaningful information from raw or unstructured data.

7
New cards

Metadata

Data about data; information that helps in organizing, finding, and understanding stored data.

8
New cards

Data Persistence

The ability of data to be saved and retained over time, even after a program or device is shut off.

9
New cards

PII

Personally Identifiable Information; data that can identify a person, such as name or social security number.

10
New cards

Descriptive Analysis

Analysis that summarizes what happened, with high confidence but low future utility.

11
New cards

Predictive Analysis

Estimates what might happen, with moderate confidence and medium utility.

12
New cards

Prescriptive Analysis

Suggests actions based on data, with lower confidence but high decision-making utility.

13
New cards

Classification

A data mining strategy that assigns data to predefined categories.

14
New cards

Clustering

A data mining strategy that groups similar data points based on features.

15
New cards

Regression

A predictive analysis method that predicts continuous values based on trends.

16
New cards

Model

A simplified representation of a system used to understand complex systems and predict outcomes.

17
New cards

Simulation

Dynamic models that allow testing in virtual environments safely and cost-effectively.

18
New cards

Web Scraping

Method of automatically extracting data from the visual content of websites.

19
New cards

Screen Scraping

A process that captures data from the display output of a computer program.