Business Intelligence Analytics + Big Data

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/39

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

40 Terms

1
New cards

big data

enormous (terabytes or more), complex (sensor data to social media data), traditional processes incapable of dealing with them

2
New cards

big data uses

improve day-to-day operations, planning, + decision making

3
New cards

key characteristics of big data

volume, velocity, value, variety, veracity

4
New cards

technologies used to manage + process big data

data warehouses, extract transform load process, data marts, data lakes, NoSQL databases, Hadoop, in-memory databases

5
New cards

online transaction processing (OLTP) systems

traditionally used to capture, do not support data analysis required today

6
New cards

data warehouses + data marts

allows organizations to access OLTP data, support decision making more effectively

7
New cards

data warehouse

large database, holds business information from many sources in the enterprise, covers all aspects of the company’s processes, products, + customers

8
New cards

extract transform load (ETL) process

extract data from a variety of sources, edits + transforms data into a data warehouse format, loads data into warehouse

9
New cards

data mart

subset of a data warehouse, used by small/medium sized businesses+ departments within large companies, supports decision making

10
New cards

data lake

takes a ‘store everything’ approach to big data, saves all data in its raw + unaltered form

11
New cards

NoSQL database

data modifed without two-dimensional tabular relations, uses horizontal scaling, does not require a predefined schema or conform to true ACID properties

12
New cards

structures used by NoSQL databases

more flexible than relational database tables, provide improved access speed + redundancy

13
New cards

categories of NoSQL databases

key value (two columns, key + value)

document (store, retrieve, + manage document-oriented information)

graph (well-suited for analyzing interconnection)

column (store data in columns)

14
New cards

Hadoop

open source software framework, includes several software models, stores + processes extremely large data sets

15
New cards

distributed file system (HDFS)

used for data storage, divides the data into subset, distributes teh subset onto different servers for processing

16
New cards

map reduce program

composite program, two components (map produced performs filtering + storing, reduce method performs a summary operation)

limitation → can only perform batch processesi

17
New cards

in-memory database

stores the entire database in RAM, foster access to data, enables the analysis of big data + other challenging data-processing applications

feasibility due to two factors → increase in RAM capacities, corresponding decrease in RAM costs

18
New cards

business intelligence (BI)

wide range of applications, practices, + technologies, extracts, transforms, integrates, visualizes, analyzes, interprets, + presents data, supports improved decision making

19
New cards

analytics

extensive use of data + quantitative analysis, supports fact-based desicion making with organizations

20
New cards

benefits of bi + analytics

detect fraud, improve forecasting, increase sales, optimize operations, reduce costs

21
New cards

data scientist

delivers real improvements in decision making, highly inquisitive person, strong business accumen, underatnds analytics

22
New cards

components for effective analytics + bi

exsistence of a solid data management program (includes governance), creative data scientists, strong commitment to data-driven decision making

23
New cards

descriptive analytics

preliminary data processing stage, identifies data patterns, answers questions

24
New cards

visual analytics

presentation of data pictorially or graphically

25
New cards

word cloud

visual depiction of a set of words (grouped together by frequency)

26
New cards

conversion funnel

graphical representaion (e.g. step summary)

27
New cards

regression analytics

determines the relationship between a dependent variable + one or more independent variables, produces a regression equation

28
New cards

predictive analytics

techniques to analyze current data, identifies future probabilities + trends, makes predictions

29
New cards

time series analysis

uses statistical methods, analyzes time series data, extracts meaningful statistics + characteristics

30
New cards

data mining

explores large amounts of data for hidden patterns, predicts future trends + behavior, used in decision making

techniques → association analysis, neural computing, case-based reasoning

31
New cards

optimination

allocate scare resources to minimize costs + maximize efforts

32
New cards

genetic algorithm

emplyes a natural-selection like process, finds approximate solutions to optimization + search problems

33
New cards

linear programming

finds the optimum value of a linear expression, calculated based on the value of a set of decision variables (variables subject to a set of constraints)

34
New cards

simulation

emulates the dynamic repsonse of a real-world system to various inputs

35
New cards

scenario analysis

predicts future values based on current potential events

36
New cards

monte carlo simulation

provides a spectrum of thousands of possible outcomes, considers variables, + range of potential values

37
New cards

text + video analysis

clean insights + data relevant to decision making

38
New cards

text analysis

process for extracting values from large quantities of unstructured text data

39
New cards

video analysis

process of obtaining information/insights from video footage

40
New cards

self-service analytics

training, techniques, + processes, empower end users to work independently (access data from approved sources, perform own analysis, use an endorsed set of tools)

advantages → get valuable data to end users, accelerates decision making, fact-based decision making, soltutioion to data scientist shortage