Decision Tree

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/9

flashcard set

Earn XP

Description and Tags

Flashcards covering key concepts related to classification, decision trees, and their applications in business data mining.

Last updated 4:59 AM on 12/8/25
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

10 Terms

1
New cards

Classification

A supervised method where classes (categories) are pre-defined based on labels, requiring labeled data to train a model.

2
New cards

Decision Tree

A flowchart-like tree structure where each internal node represents an attribute, each branch represents a decision rule, and each leaf node represents an outcome.

3
New cards

Overfitting

A modeling error that occurs when a statistical model describes random error or noise instead of the underlying relationship, leading to poor performance on new data.

4
New cards

Pruning

The process of reducing the size of a decision tree by removing sections that provide little predictive power to improve model stability.

5
New cards

Training Set

A labeled data set used to train a classification model, allowing the algorithm to learn the patterns associated with each category.

6
New cards

True Positive Rate (Sensitivity)

The ratio of correctly predicted positive observations to all actual positives, indicating the model's ability to identify positive instances.

7
New cards

Confusion Matrix

A table used to evaluate the performance of a classification model by displaying true positive, false positive, true negative, and false negative values.

8
New cards

ROC Curve

A graphical representation of a classifier's performance by plotting the true positive rate against the false positive rate at various thresholds.

9
New cards

C4.5

An extension of the ID3 algorithm used for generating decision trees, capable of handling missing values and both categorical and continuous data.

10
New cards

Cross-Validation

A technique for assessing how the results of a statistical analysis will generalize to an independent data set, involving partitioning the data into k subsets.