Topic 6 (BI) Business Intelligence

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/37

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

38 Terms

1
New cards

VOLUME

the amount of data generated

2
New cards

VERACITY

accuracy and trustworthiness of the data (if not correct or accurate data input garbage in garbage out)

3
New cards

VARIETY

the type of data files generated

structured→ excel files

semi structured→ log files

unstructured→ images

4
New cards

VELOCITY

the speed at which data is generated

5
New cards

VALUE

the community benefit of the information collected

6
New cards

What is a database?

collection of related files containing records on people, places, or things.

7
New cards

(T/F) database where all data is stored but it is not clean

True

8
New cards

data warehouse is

where the data stored is cleaned and usable

9
New cards

characteristics of data warehouses

-Large

-multiple sources

-historical

-cross organizational access and analysis

-supports various types of analyses and reporting

10
New cards

(T/F) data mart is a subset of data warehouse

True

11
New cards

data lake is a

storage of unstructured data

12
New cards

data mining

finds hidden patterns in large data sets

13
New cards

regression analysis

COMMON TECHNIQUE FOR DATA MINING

14
New cards

-text mining:

extract insights from textual data

15
New cards

video analysis is the process of

obtaining information or insights from video footage

16
New cards

data governance encompasses

policies and procedures through which data can be managed as an organizational resource

17
New cards

sources of data for big data

internal

  • documemts

  • emails

external

  • social media

  • public dataset

18
New cards

entity is

Generalized category representing person, place, thing

19
New cards

Attributes are

Specific characteristics of each entity

ex

supplier name, address

20
New cards

Entity-relationship diagram is

Used to clarify table relationships in a relational database

21
New cards

Relational database tables may have:

One-to-one relationship

One-to-many relationship

Many-to-many relationship

  • Requires “join table” or intersection relation that links the two tables to join information

22
New cards

operations of a relational DBMS

select

join

project

23
New cards

select

Creates a subset of all records meeting stated criteria

24
New cards

join

Combines relational tables to present the user with more information than is available from individual tables

25
New cards

project

  • Creates a subset consisting of columns in a table

  • Permits user to create new tables containing only desired information

26
New cards

Data definition: Specify structure of content of database

27
New cards

Data dictionary: Stores definitions of data elements and their characteristics

28
New cards

Querying and reporting

Data manipulation language

Structured query language (S Q L)

Microsoft Access query-building tools

29
New cards

– Report generation:

Examples: SQL Server Reporting Services

30
New cards

Hadoop

Breaks data task into sub-problems and distributes the processing to many inexpensive computer processing nodes

31
New cards

Key services of Hadoop

  • Hadoop Distributed File System (H D F S)

  • MapReduce

  • HBase: NoSQL database

32
New cards

alternative to Hadoop

Apache Spark

Faster than Hadoop for small workloads

33
New cards

Online Analytical Processing (O L A P) supports

multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions

34
New cards

data mining examples

Detect fraud

Improve forecasting

Increase sales

35
New cards

Regression analysis determines

the relationship between a dependent variable and one or more independent variables

36
New cards

text mining examples

Sentiment analysis

customer feedback analysis

social media monitoring

competitor analysis

market research and trend analysis

37
New cards

video analysis can be done using

Computer vision, machine learning, and deep learning

38
New cards

under web mining

Content mining mines content of websites

Structure mining mines website structural elements, such as links

Usage mining mines user interaction data gathered by web servers