CAP 4770 - Lecture 1: Key Data Mining Tasks, Methods, Applications, and Foundational Concepts

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/20

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering the key data mining tasks, methods, applications, and foundational concepts from Lecture 1.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

21 Terms

1
New cards

Data Mining

The process of discovering patterns and knowledge from large data sets.

2
New cards

Prediction Methods

Use some variables to predict unknown or future values of other variables.

3
New cards

Description Methods

Find human-interpretable patterns that describe the data.

4
New cards

Classification

Predictive task: builds a model from labeled training data to assign class labels to new, unseen records based on their attributes.

5
New cards

Clustering

Descriptive task: group data points into clusters where members are similar within clusters and different across clusters.

6
New cards

Association Rule Discovery

Process of discovering rules that predict the occurrence of an item based on occurrences of other items (e.g., Milk → Soda).

7
New cards

Deviation/Anomaly Detection

Detect significant deviations from normal behavior (e.g., fraud detection, intrusions).

8
New cards

Regression

Predict a continuous-valued variable based on other variables using linear or nonlinear models.

9
New cards

Training Set

Subset of data used to build the model.

10
New cards

Test Set

Subset of data used to validate the model's accuracy.

11
New cards

Model

The learned classifier or predictive model produced from training.

12
New cards

Classifier

A model that assigns class labels to unseen records.

13
New cards

Direct Marketing (Classification Application)

Use classification to target customers likely to buy a product, reducing mailing costs.

14
New cards

Fraud Detection (Classification Application)

Predict fraudulent credit card transactions using transaction and account information.

15
New cards

Sky Survey Cataloging (Classification Application)

Predict whether a sky object is a star or galaxy from image features.

16
New cards

Market Segmentation (Clustering Application)

Subdivision of a market into distinct customer groups for targeted marketing.

17
New cards

Document Clustering (Clustering Application)

Group documents into clusters based on important terms to find similarity.

18
New cards

Milk → Soda

An example association rule showing that if Milk occurs, Soda is likely to occur.

19
New cards

Diaper, Milk → Beer

An association rule illustrating multiple antecedents predicting Beer.

20
New cards

Antecedent

In an association rule (e.g., A→B), the 'if-part' (A) represents the condition or set of items that are observed. For example, in the rule 'Milk → Soda', 'Milk' is the antecedent, indicating that if Milk is purchased, Soda is predicted to be bought as well.

21
New cards

Consequent

In an association rule (e.g., A→B), the 'then-part' (B) represents the item(s) that are predicted to occur if the antecedent is present. For example, in the rule 'Diaper, Milk → Beer', 'Beer' is predicted to be bought if 'Diaper' and 'Milk' are purchased.