chapter 8

studied byStudied by 0 people
0.0(0)
Get a hint
Hint

What is Analytics?

1 / 49

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

50 Terms

1

What is Analytics?

Processes, technologies, frameworks, and algorithms used to extract meaningful, actionable insights from data.

New cards
2

What are the Seven Giants in the context of big data analytics?

Basic Statistics, Generalized N-Body Problems, Linear Algebraic Computations, Graph-Theoretic Computation, Optimization, Integration, Alignment Problems.

New cards
3

Name the Basic Statistics used in analytics.

Mean, Median, Variance, Counts, Top-N, Distinct.

New cards
4

What is Descriptive Analytics?

Analyzes past data and presents it in a summarized form to answer "What happened?"

New cards
5

What types of analytics are Diagnostic, Predictive, and Prescriptive?

They represent different levels of data analysis: Diagnostic: Understand why something happened. Predictive: Forecast what is likely to happen. Prescriptive: Determine what actions to take to achieve desired outcomes.

New cards
6

What are Generalized N-Body Problems in analytics?

Computational tasks involving calculations of distances, kernels, similarity between pairs of points, nearest neighbors, clustering, and kernel SVM.

New cards
7

What are Linear Algebraic Computations used for in analytics?

Performing operations like Linear Algebra, Linear Regression, and Principal Component Analysis (PCA) for Descriptive, Diagnostic, and Predictive Analytics.

New cards
8

Define Graph-Theoretic Computations in analytics.

Computations involving graph search, betweenness, centrality, commute distance, shortest path, and minimum spanning tree for Diagnostic, Predictive, and Prescriptive Analytics.

New cards
9

What is Optimization in the context of analytics?

Techniques like Minimization, Maximization, Linear Programming, Quadratic Programming, and Gradient Descent used in Prescriptive Analytics.

New cards
10

What does Integration refer to in analytics?

Methods such as Bayesian Inference, Expectations, and Markov Chain Monte Carlo used for Predictive and Prescriptive Analytics.

New cards
11

What are Alignment Problems in analytics?

Tasks like matching between data sets (text, images, sequences) and Hidden Markov Models used for Predictive and Prescriptive Analytics.

New cards
12

What are the Types of Analytics?

Descriptive, Diagnostic, Predictive, and Prescriptive Analytics.

New cards
13

What does Descriptive Analytics aim to answer?

"What happened?" by analyzing past data and presenting it in a summarized form using reports and alerts.

New cards
14

How does Diagnostic Analytics differ from Descriptive Analytics?

It seeks to understand "Why did it happen?" by analyzing past data through queries and data mining.

New cards
15

What question does Predictive Analytics aim to answer?

"What is likely to happen?" by predicting future events based on patterns and trained models using forecasts and simulations.

New cards
16

What is the goal of Prescriptive Analytics?

To determine "What can we do to make it happen?" by using predictive analyses to figure out the best course of action through planning and optimization.

New cards
17

How is Big Data defined?

Collections of datasets so large that they are difficult to manage, process, and analyze using traditional means.

New cards
18

According to IBM, how much data is created every day?

2.5 quintillion bytes of data.

New cards
19

What is Big Data Analytics?

The collection, storage, processing, and analysis of massive-scale data, involving steps like data cleansing, munging, processing, and visualization.

New cards
20

Why are special tools needed for Big Data Analytics?

Because of the volume, velocity, and variety of data, making it difficult to store, process, and analyze on a single machine.

New cards
21

List some Big Data Examples.

Data from social networks (text, images, audio, video), Click-stream data from web applications, Machine sensor data from industrial and energy systems, Healthcare data from electronic health records (EHR), Logs from web applications, Stock market data, Transactional data from banking and financial applications.

New cards
22

What are the Five V's of Big Data?

Volume, Velocity, Variety, Veracity, Value.

New cards
23

Define Volume in Big Data characteristics.

The large size of data that requires specialized tools and frameworks for storage, processing, and analysis.

New cards
24

What does Velocity refer to in Big Data?

The speed at which data is generated and needs to be processed, often in real-time.

New cards
25

Explain Variety in Big Data.

The different forms of data, including structured, unstructured, and semi-structured data like text, images, audio, video, and sensor data.

New cards
26

What is Veracity in the context of Big Data?

The accuracy and trustworthiness of the data, which often requires cleaning to remove noise and ensure quality.

New cards
27

What does Value signify in Big Data characteristics?

The usefulness of data for its intended purpose, aiming to extract meaningful insights and benefits.

New cards
28

What are the main steps in the Analytic Flow for Big Data?

Data Collection, Data Preparation, Analysis Types, Analysis Modes, Visualizations.

New cards
29

What activities are involved in Data Preparation?

Cleaning data by fixing corrupt records, handling missing values, removing duplicates, standardizing abbreviations and units, correcting typos, spellings, and formatting.

New cards
30

What are the Analysis Modes in Big Data Analytics?

Batch, real-time, or interactive analysis.

New cards
31

What is the Big Data Stack?

A layered framework consisting of Raw Data Sources, Data Access Connectors, Data Storage, Batch Analytics, Real-Time Analytics, Interactive Querying, and Serving Database/Web & Visualization Frameworks.

New cards
32

What are Raw Data Sources in the Big Data Stack?

The origins from where data is captured.

New cards
33

What are Data Access Connectors?

Tools and frameworks used for collecting data from various sources.

New cards
34

Where is data stored in the Big Data Stack?

In distributed file systems and NoSQL databases.

New cards
35

What is Batch Analytics?

Analyzing data in large chunks or batches, typically not in real-time.

New cards
36

What are examples of Real-Time Analytics tools?

Apache Storm and Spark Streaming.

New cards
37

What does Interactive Querying involve?

Using SQL-like languages to perform queries on data interactively.

New cards
38

What are Serving Database, Web & Visualization Frameworks used for?

To present and visualize the analyzed data for end-users.

New cards
39

Describe the Alpha Pattern in Analytic Patterns.

Batch Analysis used to ingest large amounts of data.

New cards
40

What is the Beta Pattern in Analytic Patterns?

Real-Time Analysis focused on ingesting streaming data.

New cards
41

Explain the Gamma Pattern in Analytic Patterns.

Combines Batch and Real-Time Analysis by ingesting streaming data into the big data stack.

New cards
42

What is the Delta Pattern in Analytic Patterns?

Interactive Querying using source-sink connectors or SQL connectors, followed by using SQL-like languages.

New cards
43

What are Visualizations in the Analytic Flow?

Tools used to present data visually, which can be static, dynamic, or interactive.

New cards
44

Why is Data Cleansing important in Big Data Analytics?

To remove noise and ensure data quality, which is essential for extracting accurate insights.

New cards
45

What types of data does Big Data encompass?

Structured, unstructured, and semi-structured data.

New cards
46

How does Big Data Analytics handle the Variety of data?

By using specialized tools and frameworks that can process different data formats like text, images, audio, video, and sensor data.

New cards
47

What role do NoSQL databases play in the Big Data Stack?

They provide scalable and flexible storage solutions for large and diverse datasets.

New cards
48

What is the primary goal of any Big Data Analytics system?

To extract value from the data by uncovering meaningful insights and supporting decision-making.

New cards
49

How does Real-Time Analytics differ from Batch Analytics?

Real-Time Analytics processes data as it arrives, enabling immediate insights, whereas Batch Analytics processes data in large groups at scheduled intervals.

New cards
50

What are Reports and Alerts used for in Descriptive Analytics?

To present summarized data and notify users of important events or thresholds.

New cards

Explore top notes

note Note
studied byStudied by 746 people
... ago
4.7(16)
note Note
studied byStudied by 5 people
... ago
5.0(1)
note Note
studied byStudied by 19 people
... ago
5.0(2)
note Note
studied byStudied by 58 people
... ago
5.0(1)
note Note
studied byStudied by 20 people
... ago
5.0(2)
note Note
studied byStudied by 17 people
... ago
5.0(1)
note Note
studied byStudied by 27 people
... ago
5.0(2)
note Note
studied byStudied by 62 people
... ago
5.0(1)

Explore top flashcards

flashcards Flashcard (84)
studied byStudied by 9 people
... ago
5.0(1)
flashcards Flashcard (69)
studied byStudied by 36 people
... ago
5.0(1)
flashcards Flashcard (85)
studied byStudied by 3 people
... ago
5.0(1)
flashcards Flashcard (145)
studied byStudied by 5 people
... ago
5.0(1)
flashcards Flashcard (50)
studied byStudied by 68 people
... ago
5.0(1)
flashcards Flashcard (23)
studied byStudied by 27 people
... ago
5.0(1)
flashcards Flashcard (41)
studied byStudied by 8 people
... ago
5.0(1)
flashcards Flashcard (401)
studied byStudied by 34 people
... ago
5.0(1)
robot