Data Mining Unit 1: Intro to DM

15 Terms

1

Data

Raw facts gathered through observation, experience, or experiments

2

Information

Meaningful data that has been processed or interpreted

3

Knowledge

Verified, structured information integrated into a process for decision-making

4

Data Mining (DM)

The process of extracting useful and previously unknown patterns from large datasets

5

Data Mining Purpose

Discover patterns to support decisions in fields such as business, healthcare, science, and government

6

Supervised Learning

A data mining approach using labeled data to train models (e.g., classification, regression)

7

Unsupervised Learning

A data mining approach that explores patterns in unlabeled data (e.g., clustering, association)

8

Classification

Predicts categorical outcomes like Yes/No or types of diseases
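
As a rough illustration of predicting a categorical outcome, the sketch below uses a 1-nearest-neighbor rule: a new record gets the label of the most similar labeled record. The function name, the features (temperature, cough flag), and the data are hypothetical and chosen only for this example; nearest-neighbor is one of many classification techniques, not the one the card prescribes.

```python
def nearest_neighbor_label(train, query):
    """Return the label of the training record closest to `query`.

    `train` is a list of (feature_tuple, label) pairs.
    """
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    _, label = min(train, key=lambda pair: squared_distance(pair[0], query))
    return label


# Hypothetical labeled data: (body temperature in Celsius, cough flag) -> "Yes"/"No"
training_data = [
    ((36.6, 0), "No"),
    ((39.2, 1), "Yes"),
    ((38.8, 1), "Yes"),
    ((36.9, 0), "No"),
]

print(nearest_neighbor_label(training_data, (39.0, 1)))
```

The output is always one of the known categories, which is what separates classification from regression.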

9

Regression

Predicts continuous outcomes such as temperature or price
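
A minimal sketch of predicting a continuous value, assuming a simple least-squares line fit with one input variable. The function name and the size/price figures are made up for illustration.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: floor area vs. price (arbitrary units)
sizes = [50, 60, 80, 100]
prices = [150, 180, 240, 300]

slope, intercept = fit_line(sizes, prices)
predicted_price = slope * 70 + intercept  # prediction for an unseen size
print(slope, intercept, predicted_price)
```

Unlike classification, the prediction can be any number on a continuous scale, not just one of a fixed set of labels.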

10

Clustering

Groups data based on similarity; used in market segmentation and document analysis
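
One way to picture grouping by similarity is k-means on one-dimensional data, sketched below. The starting centroids are fixed by hand so the run is repeatable; the function name, the data, and the choice of k-means itself are assumptions made for this example.

```python
def kmeans_1d(values, centroids, iterations=10):
    """Very small k-means for 1-D data with caller-supplied starting centroids."""
    centroids = list(centroids)
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: put each value in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical customer spend values forming two obvious segments
spend = [12, 15, 14, 80, 85, 90]
final_centroids, groups = kmeans_1d(spend, centroids=[12, 90])
print(groups)
```

No labels are supplied; the groups emerge from the distances between values alone.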

11

Association Rule Mining

Discovers relationships between items in transactions, like {Milk} → {Bread}
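
A rule such as {Milk} → {Bread} is usually judged by its support (the share of transactions containing both items) and its confidence (the share of Milk transactions that also contain Bread). These two measures are standard but are not named on the card, and the transactions below are invented, so treat this as an illustrative sketch.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= set(t)) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Support of (antecedent and consequent together) / support of antecedent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical market-basket data
transactions = [
    {"Milk", "Bread"},
    {"Milk", "Bread", "Eggs"},
    {"Milk", "Eggs"},
    {"Bread"},
    {"Milk", "Bread", "Butter"},
]

rule_support = support(transactions, {"Milk", "Bread"})
rule_confidence = confidence(transactions, {"Milk"}, {"Bread"})
print(round(rule_support, 2), round(rule_confidence, 2))
```

Here 3 of 5 baskets contain both items, and 3 of the 4 Milk baskets also contain Bread.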

12

Anomaly Detection

Identifies data that deviates significantly from expected patterns (e.g., fraud detection)
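
A simple way to make "deviates significantly" concrete is to flag values whose z-score exceeds a threshold. The threshold of 2.0, the function name, and the transaction amounts are assumptions for this sketch; real fraud detection systems use far richer methods.

```python
import statistics


def find_anomalies(values, threshold=2.0):
    """Return values lying more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)  # population standard deviation
    if std == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mean) / std > threshold]


# Hypothetical transaction amounts with one suspicious entry
amounts = [100, 102, 98, 101, 99, 100, 500]
print(find_anomalies(amounts))
```

With small samples a single extreme value inflates the standard deviation, so the threshold has to be chosen with the sample size in mind.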

13

KDD Process (Knowledge Discovery in Databases)

  • Selection

  • Preprocessing

  • Transformation

  • Data Mining

  • Evaluation

14

Preprocessing Tasks

  • Data cleaning

  • Data integration

  • Data transformation

  • Data reduction

15

Normalization Methods

  • Min-max scaling

  • Z-score

  • Decimal scaling

  • Log transformation
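
The four methods above can be sketched as small functions. The formulas are the commonly taught ones; the function names, the use of the population standard deviation, the choice of log(1 + x) for the log transformation, and the sample values are assumptions for this example.

```python
import math
import statistics


def min_max_scale(values):
    """Rescale to [0, 1]: (x - min) / (max - min)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(x - lo) / (hi - lo) for x in values]


def z_score(values):
    """Center on the mean and divide by the population standard deviation."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [0.0 for _ in values]
    return [(x - mean) / std for x in values]


def decimal_scale(values):
    """Divide by 10**j, with j the smallest integer giving max(|x|) < 1."""
    max_abs = max(abs(x) for x in values)
    j = 0
    while max_abs / 10 ** j >= 1:
        j += 1
    return [x / 10 ** j for x in values]


def log_transform(values):
    """Apply log(1 + x); assumes non-negative inputs."""
    return [math.log1p(x) for x in values]


# Hypothetical incomes
incomes = [200, 300, 400, 600, 1000]
print(min_max_scale(incomes))
print(decimal_scale(incomes))
```

Min-max and decimal scaling keep the shape of the distribution, z-score expresses each value in standard deviations from the mean, and the log transformation compresses large values more than small ones.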