Big Data

studied byStudied by 2 people
0.0(0)
Get a hint
Hint

Big Data

1 / 30

flashcard set

Earn XP

31 Terms

1

Big Data

Refers to a large amount of data that exceeds the capacity of a single computer, requiring specialized techniques for handling and analysis.

New cards
2

Data

Information in the form of characters, symbols, or numeric values necessary for computer operations, which can be transmitted as electric signals and stored in various devices.

New cards
3

Structured Data

Data organized in a relational database with unique identifiers, typically existing in rows and columns for easy analysis.

New cards
4

Unstructured Data

Data lacking a specific structure or order, making it challenging for analysis but potentially valuable for business intelligence.

New cards
5

Semi-Structured Data

Data with relational values and organization that can be analyzed, such as text marked up with descriptions like XML in a document.

New cards
6

Volume

The total quantity of data stored, which has rapidly increased, collected from various sources like business transactions and social media platforms.

New cards
7

Velocity

Refers to the speed at which data is created and collected, requiring processes and systems to cope with vast amounts of data.

New cards
8

Variety

The breadth of data sources analyzed, including different types of data related to customers, manufacturing processes, and industry.

New cards
9

Variability

Refers to inconsistencies in data that need to be identified for meaningful analytics, influenced by multiple data types and sources.

New cards
10

Veracity

Indicates the quality of data, emphasizing the importance of consistent and correct data for reliable analysis and decision-making.

New cards
11

Value

The most crucial characteristic of big data, highlighting the necessity of deriving value and achieving organizational goals through data analysis.

New cards
12

Data Fusion and Data Integration

Refers to the analysis of data from multiple sources to improve accuracy and results compared to single-source analysis.

New cards
13

Data Mining

Technique to extract useful information from large datasets, identifying trends and patterns for various applications like spam filtering and fraud detection.

New cards
14

Machine Learning

Subset of artificial intelligence using algorithms to make predictions based on large datasets, with models improving over time.

New cards
15

Natural Language Processing (NLP)

Technique using algorithms to analyze human languages, including translation, speech recognition, and question answering.

New cards
16

Statistics

Approach supporting data analysis, where statistical techniques can be applied to both small and large datasets.

New cards
17

Sampling

Process of taking a sample from a dataset to make estimates and predictions about the entire dataset.

New cards
18

Divide and Conquer

Method of dividing a dataset into smaller blocks for easier analysis, with results combined to analyze the whole dataset.

New cards
19

Big Data Visualization

Techniques to present data graphically for better understanding and communication, aiding decision-making processes.

New cards
20

Industry 4.0

Manufacturing concept using smart technologies and big data analysis to maximize production, reduce costs, and customize production based on demand.

New cards
21

Predictive Analytics

Utilizes big data to identify patterns that can predict future events, aiding in decision-making processes.

New cards
22

Big Data Implementation

Requires investment in solutions and hiring experts for data collection, storage, and processing.

New cards
23

Hyper-scale Computing Environments

Utilize dedicated servers, storage, and processing frameworks like Hadoop for big data storage and analysis.

New cards
24

Cloud Servers

Provide flexible storage options, though may impact latency, suitable for backup and scalable needs.

New cards
25

Descriptive Analytics

Involves data aggregation and mining to summarize findings and reveal underlying meanings in large datasets.

New cards
26

Predictive Analytics

Builds models on descriptive data to predict future outcomes based on current data trends.

New cards
27

Prescriptive Analytics

Goes beyond predictive analytics by suggesting multiple courses of action or possible outcomes for a specific goal.

New cards
28

Data Quality

Challenges in big data analysis related to the accuracy and relevance of data, influenced by veracity and data sources.

New cards
29

System Compatibility

Obstacles in data analysis due to the need to integrate data from various systems or processes.

New cards
30

Skills Gaps

Lack of skilled professionals in data analysis, emphasizing the importance of having the right competencies for effective implementation.

New cards
31

GDPR

The EU General Data Protection Regulation ensuring data protection and privacy compliance in data analysis.

New cards

Explore top notes

note Note
studied byStudied by 7 people
... ago
5.0(1)
note Note
studied byStudied by 12 people
... ago
5.0(1)
note Note
studied byStudied by 21 people
... ago
4.0(1)
note Note
studied byStudied by 32 people
... ago
5.0(1)
note Note
studied byStudied by 8 people
... ago
5.0(1)
note Note
studied byStudied by 9 people
... ago
5.0(1)
note Note
studied byStudied by 31 people
... ago
5.0(1)
note Note
studied byStudied by 357 people
... ago
5.0(5)

Explore top flashcards

flashcards Flashcard (24)
studied byStudied by 21 people
... ago
5.0(1)
flashcards Flashcard (51)
studied byStudied by 28 people
... ago
4.0(1)
flashcards Flashcard (198)
studied byStudied by 7 people
... ago
5.0(1)
flashcards Flashcard (34)
studied byStudied by 2 people
... ago
5.0(1)
flashcards Flashcard (39)
studied byStudied by 4 people
... ago
5.0(1)
flashcards Flashcard (61)
studied byStudied by 379 people
... ago
4.6(28)
flashcards Flashcard (116)
studied byStudied by 13 people
... ago
5.0(1)
flashcards Flashcard (65)
studied byStudied by 2352 people
... ago
4.6(14)
robot