CAP3770 - Data Mining Lecture - Chapter 1

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/27

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards for reviewing key data mining concepts and definitions

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

28 Terms

1
New cards

Data Generation Advances

Enormous data growth in both commercial and scientific databases due to advances in data generation and collection technologies.

2
New cards

Data Gathering

Gather whatever data you can whenever and wherever possible.

3
New cards

Commercial Data Mining Viewpoint

Data mining helps businesses provide better, customized services for an edge in Customer Relationship Management.

4
New cards

Scientific Data Mining Viewpoint

Data mining assists scientists in automated analysis of massive datasets and in hypothesis formation.

5
New cards

Data

The 'Facts'.

6
New cards

Information

Interpretation of Data.

7
New cards

Knowledge

Information That Has Been Given Meaning.

8
New cards

Explicit Knowledge

Identified and Codified Information, Documents, Records, and Files.

9
New cards

Tacit Knowledge

Lives in people and their practices; Experiences, Competence, Commitment, Deeds, and Thoughts.

10
New cards

Data Mining

Non-trivial extraction of implicit, previously unknown and potentially useful information from data; Exploration & analysis, by automatic or semi-automatic means, of large quantities of data to discover meaningful patterns.

11
New cards

Data Cleaning

Process that removes or transforms noise and inconsistent data.

12
New cards

Data Integration

Process where multiple data sources may be combined.

13
New cards

Data Selection

Data relevant to the analysis task are retrieved from the database.

14
New cards

Data Transformation

Data transformed/consolidated into appropriate forms for mining.

15
New cards

Data Mining (Process)

Essential process where intelligent and efficient methods are applied in order to extract patterns.

16
New cards

Pattern Evaluation

Process that identifies the truly interesting patterns representing knowledge based on some interestingness measures.

17
New cards

Knowledge Presentation

Visualization and knowledge representation techniques are used to present the mined knowledge to the user.

18
New cards

Prediction Methods

Use some variables to predict unknown or future values of other variables.

19
New cards

Description Methods

Find human-interpretable patterns that describe the data.

20
New cards

Classification

Find a model for class attribute as a function of the values of other attributes; Predictive Modeling

21
New cards

Data Mining Steps (abridged)

Understanding purpose, obtaining data, cleaning & preprocessing, reducing dimension, determining task, partitioning data, choosing technique, using algorithms, interpreting results, deploying model

22
New cards

Regression

Predict a value of a given continuous valued variable based on the values of other variables, assuming a linear or nonlinear model of dependency.

23
New cards

Clustering

Finding groups of objects such that the objects in a group will be similar (or related) to one another and different from (or unrelated to) the objects in other groups.

24
New cards

Association Rule Discovery

Produce dependency rules which will predict occurrence of an item based on occurrences of other items, given a set of records.

25
New cards

Deviation/Anomaly/Change Detection

Detect significant deviations from normal behavior

26
New cards

Scalability

Ability to increase or decrease in response to change.

27
New cards

High Dimensionality

The number of dimensions are so high that calculations become difficult.

28
New cards

Heterogeneous and Complex Data

Any data with high variability of data types and formats.