L 2: Big Data and Machine Learning Stages and Techniques

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/21

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

22 Terms

1
New cards

to collect data

data collection and preparation

-stored in the company (surveys, interviews (qualitative))

-option button (check constraint, dropdown list)

2
New cards

data preparation

data collection and preparation

-resolve formatting issues

-smoothing inconsistent data

determine equivalent value in a dataset

-normalization

-binning

3
New cards

basic data manipulation and techniques

SQL statements to be used- for converting relational data to tabular data

4
New cards

pivot command

basic data manipulation and techniques

-command to tabular data/summarized

-values must be aggregated

5
New cards

data mining

-determine patterns

-extract hidden patterns from potential useful data in a dataset

6
New cards

supervised machine learning techniques

machine learning

-we know what is to be extracted

-we have target/class/dependent variable

7
New cards

unsupervised machine learning techniques

machine learning

-we don't know what will be the result

-market basket and clustering

8
New cards

composition

data visualization

pie chart

9
New cards

comparison

data visualization

bar chart

10
New cards

distribution

data visualization

scatter plot

11
New cards

relationship

data visualization

one variable connected to another variable

12
New cards

Statistics/AI

Provides methods to analyze data and find patterns (e.g., averages, probabilities).

13
New cards

Machine Learning/Pattern Recognition

Uses algorithms to automatically learn and make predictions or group data.

14
New cards

Database Systems

Stores and organizes large amounts of data for easy access.

15
New cards

Data Mining

the middle ground that combines these fields to discover useful insights from large datasets

16
New cards

statistics and AI

Data Mining is at the intersection of these fields:

It uses ___________ and ___________ for theoretical frameworks and data analysis.

17
New cards

machine learning

Data Mining is at the intersection of these fields:

t incorporates ___________________ for automating pattern discovery and predictions.

18
New cards

database systems

Data Mining is at the intersection of these fields:

It depends on ___________________ for efficient data management and retrieval.

19
New cards

Descriptive Analytics

When an aggregate level of understanding of what is going on in a business is required.

20
New cards

Predictive Analytics

When something about the future needs to be predicted or some missing information needs to be approximated

21
New cards

Diagnostic Analytics

When the reason behind a certain observed phenomenon or characteristic needs to be determined

22
New cards

Prescriptive Analytics

When advice is needed regarding what action to take for the best results