Decision Tree

0.0(0)

Studied by 0 people

View linked note

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/9

Earn XP

Description and Tags

Flashcards covering key concepts related to classification, decision trees, and their applications in business data mining.

Last updated 4:59 AM on 12/8/25

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

10 Terms

1

New cards

Classification

A supervised method where classes (categories) are pre-defined based on labels, requiring labeled data to train a model.

2

New cards

Decision Tree

A flowchart-like tree structure where each internal node represents an attribute, each branch represents a decision rule, and each leaf node represents an outcome.

3

New cards

Overfitting

A modeling error that occurs when a statistical model describes random error or noise instead of the underlying relationship, leading to poor performance on new data.

4

New cards

Pruning

The process of reducing the size of a decision tree by removing sections that provide little predictive power to improve model stability.

5

New cards

Training Set

A labeled data set used to train a classification model, allowing the algorithm to learn the patterns associated with each category.

6

New cards

True Positive Rate (Sensitivity)

The ratio of correctly predicted positive observations to all actual positives, indicating the model's ability to identify positive instances.

7

New cards

Confusion Matrix

A table used to evaluate the performance of a classification model by displaying true positive, false positive, true negative, and false negative values.

8

New cards

ROC Curve

A graphical representation of a classifier's performance by plotting the true positive rate against the false positive rate at various thresholds.

9

New cards

C4.5

An extension of the ID3 algorithm used for generating decision trees, capable of handling missing values and both categorical and continuous data.

10

New cards

Cross-Validation

A technique for assessing how the results of a statistical analysis will generalize to an independent data set, involving partitioning the data into k subsets.