Chapter 6 Big Data

studied byStudied by 63 people
5.0(3)
Get a hint
Hint

Big data

1 / 28

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

29 Terms

1

Big data

The term used to describe data collections that are so enormous (terabytes or more) and complex (from sensor data to social media data) that traditional data management software, hardware, and analysis processes are incapable of dealing with them.

New cards
2

data warehouse

A large database that holds business information from many sources in the enterprise, covering all aspects of the company’s processes, products, and customers.

New cards
3

Extract Transform Load (ETL) process

A data handling process that takes data from a variety of sources, edits and transforms it into the format used in the data warehouse, and then loads this data into the warehouse.

New cards
4

data mart

A subset of a data warehouse that is used by small- and medium-sized businesses and departments within large companies to support decision making.

New cards
5

data lake

A “store everything” approach to big data that saves all the data in its raw and unaltered form.

New cards
6

NoSQL database

A way to store and retrieve data that is modeled using some means other than the simple two-dimensional tabular relations used in relational databases.

New cards
7

Hadoop

An open-source software framework including several software modules that provide a means for storing and processing extremely large data sets.

New cards
8

Hadoop Distributed File System (HDFS)

A system used for data storage that divides the data into subsets and distributes the subsets onto different servers for processing.

New cards
9

MapReduce program

A composite program that consists of a Map procedure that performs filtering and sorting and a Reduce method that performs a summary operation.

New cards
10

in-memory database (IMDB)

A database management system that stores the entire database in random access memory (RAM).

New cards
11

Business intelligence (BI)

A wide range of applications, practices, and technologies for the extraction, transformation, integration, visualization, analysis, interpretation, and presentation of data to support improved decision making.

New cards
12

Analytics

The extensive use of data and quantitative analysis to support fact-based decision making within organizations.

New cards
13

data scientist

An individual who combines strong business acumen, a deep understanding of analytics, and a healthy appreciation of the limitations of data, tools, and techniques to deliver real improvements in decision making.

New cards
14

Descriptive analysis

A preliminary data processing stage used to identify patterns in the data and answer questions about who, what, where, when, and to what extent.

New cards
15

Visual analytics

The presentation of data in a pictorial or graphical format.

New cards
16

word cloud

A visual depiction of a set of words that have been grouped together because of the frequency of their occurrence.

New cards
17

conversion funnel

A graphical representation that summarizes the steps a consumer takes in making the decision to buy your product and become a customer.

New cards
18

Regression analysis

A method for determining the relationship between a dependent variable and one or more independent variables.

New cards
19

Predictive analytics

A set of techniques used to analyze current data to identify future probabilities and trends, as well make predictions about the future.

New cards
20

Time series analysis

The use of statistical methods to analyze time series data and determine useful statistics and characteristics about the data.

New cards
21

Data mining

A BI analytics tool used to explore large amounts of data for hidden patterns to predict future trends and behaviors for use in decision making.

New cards
22

Cross-Industry Process for Data Mining (CRISP-DM)

A six-phase structured approach for the planning and execution of a data mining project.

New cards
23

genetic algorithm

An approach to solving problems based on the theory of evolution; uses the concept of survival of the fittest to find approximate solutions to optimization and search problems.

New cards
24

Linear programming

A technique for finding the optimum value (largest or smallest, depending on the problem) of a linear expression (called the objective function) that is calculated based on the value of a set of decision variables that are subject to a set of constraints.

New cards
25

Scenario analysis

A process for predicting future values based on certain potential events.

New cards
26

Monte Carlo simulation

A simulation that enables you to see a spectrum of thousands of possible outcomes, considering not only the many variables involved, but also the range of potential values for each of those variables.

New cards
27

Text analysis

A process for extracting value from large quantities of unstructured text data.

New cards
28

Video analysis

The process of obtaining information or insights from video footage.

New cards
29

Self-service analytics

Training, techniques, and processes that empower end users to work independently to access data from approved sources to perform their own analyses using an endorsed set of tools.

New cards

Explore top notes

note Note
studied byStudied by 4 people
... ago
5.0(1)
note Note
studied byStudied by 7 people
... ago
5.0(1)
note Note
studied byStudied by 115 people
... ago
5.0(5)
note Note
studied byStudied by 25 people
... ago
5.0(1)
note Note
studied byStudied by 20 people
... ago
5.0(1)
note Note
studied byStudied by 16 people
... ago
5.0(1)
note Note
studied byStudied by 8 people
... ago
5.0(1)
note Note
studied byStudied by 2 people
... ago
5.0(1)

Explore top flashcards

flashcards Flashcard (244)
studied byStudied by 3 people
... ago
5.0(1)
flashcards Flashcard (34)
studied byStudied by 5 people
... ago
4.5(2)
flashcards Flashcard (80)
studied byStudied by 1 person
... ago
4.0(1)
flashcards Flashcard (20)
studied byStudied by 2 people
... ago
5.0(1)
flashcards Flashcard (250)
studied byStudied by 16 people
... ago
5.0(1)
flashcards Flashcard (80)
studied byStudied by 7 people
... ago
5.0(1)
flashcards Flashcard (30)
studied byStudied by 12 people
... ago
5.0(1)
flashcards Flashcard (178)
studied byStudied by 46 people
... ago
5.0(2)
robot