DT learning
Method for learning discrete-valued target functions in which the function to be learned is represented by a decision tree
Can have continuous or discrete features
Continuous features
Check if the feature is greater than or less than some threshold
The decision boundary is made up of axis-aligned planes
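A minimal sketch of what a single continuous-feature test looks like; the feature index and threshold here are hypothetical, not from the notes:

```python
# Hypothetical axis-aligned test on a continuous feature:
# the node compares one coordinate against a threshold.
def split_on_threshold(x, feature_index=0, threshold=2.5):
    """Route an example to the left or right branch based on a single feature."""
    return "left" if x[feature_index] <= threshold else "right"

print(split_on_threshold((1.0, 7.3)))  # feature 0 is 1.0 <= 2.5, so "left"
```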
Internal nodes
test a feature
Branching
Determined by the feature value
Leaf nodes
outputs, predictions
Classification Tree
Discrete output
Goal with decision trees
We are guaranteed good generalization if we can find a small decision tree that explains the data well
Choosing a good split
We can use the entropy of our training samples to evaluate candidate splits when generating the tree
Entropy formula: H(Y) = -Σ p(y) log2(p(y)), summed over the possible outcomes y
Entropy Rule of Thumb
High Entropy
Uniform like distribution over many outcomes
Flat histogram
Values sampled from it are less predictable
Low Entropy
Distribution is concentrated on only a few outcomes
Histogram is concentrated in a few areas
Values sampled from it are more predictable
If all Negative, entropy = 0
If all positive, entropy = 0
If 50/50 positive and negative, entropy = 1
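A small sketch that checks these rule-of-thumb values; the helper name is illustrative:

```python
import math

def entropy(probabilities):
    """Shannon entropy in bits; 0*log(0) is treated as 0 by convention."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)

print(entropy([1.0]))        # all positive (or all negative) -> 0.0
print(entropy([0.5, 0.5]))   # 50/50 positive and negative   -> 1.0
print(entropy([0.25] * 4))   # uniform over many outcomes    -> 2.0 (high entropy)
```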
Information Gain
The entropy of the output minus the entropy of the output within each branch of the split, weighted by the proportion of examples that falls into that branch: IG = H(Y) - Σ (|branch|/|total|) H(branch)
The higher the IG for a particular split, the more that split is preferred
What is the Information Gain for this split? (Split A)
Steps:
H(Y) = -(5/7)log2(5/7)-(2/7)log2(2/7)
H(Orange|Left) = -(2/2)log2(2/2)-(0/2)log2(0/2)
H(Orange|Right) = -(3/5)log2(3/5)-(2/5)log2(2/5)
I.G = H(Y) - ((2/7)H(Orange|Left) + (5/7)H(Orange|Right))
Explanation: H(Y) is the entropy of the whole set before the split, H(Orange|Left) is the entropy of the left branch, and H(Orange|Right) is the entropy of the right branch. I.G. is H(Y) minus the branch entropies weighted by the fraction of examples in each branch (2/7 on the left, 5/7 on the right).
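As a quick sanity check of the Split A numbers, the same calculation in code (reusing the illustrative entropy helper sketched above):

```python
import math

def entropy(probabilities):
    return -sum(p * math.log2(p) for p in probabilities if p > 0)

h_y     = entropy([5/7, 2/7])   # entropy before the split  ~ 0.863
h_left  = entropy([2/2])        # left branch: 2 of 2 orange -> 0.0
h_right = entropy([3/5, 2/5])   # right branch: 3/5 vs 2/5   -> ~ 0.971
ig = h_y - ((2/7) * h_left + (5/7) * h_right)
print(round(ig, 3))             # ~ 0.17
```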
Decision tree construction algorithm
What makes a good tree?
Not too small: Need to handle subtle distinctions in data
Not too big
Computational efficiency
Avoid Overfitting
We desire small trees with informative nodes near the root
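The notes do not write out the construction procedure itself, but the usual approach is a greedy recursion: split on the highest-IG feature at each node and recurse on each branch. A minimal ID3-style sketch, assuming discrete features and using illustrative helper names:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(examples, labels, feature):
    """IG = H(Y) minus the weighted entropy of each branch of this feature."""
    n = len(labels)
    branches = {}
    for x, y in zip(examples, labels):
        branches.setdefault(x[feature], []).append(y)
    remainder = sum(len(ys) / n * entropy(ys) for ys in branches.values())
    return entropy(labels) - remainder

def build_tree(examples, labels, features):
    """Greedy construction: split on the highest-IG feature, then recurse."""
    if len(set(labels)) == 1 or not features:        # pure node or no features left
        return Counter(labels).most_common(1)[0][0]  # leaf: predict majority label
    best = max(features, key=lambda f: info_gain(examples, labels, f))
    tree = {"feature": best, "children": {}}
    for value in {x[best] for x in examples}:
        subset = [(x, y) for x, y in zip(examples, labels) if x[best] == value]
        xs, ys = zip(*subset)
        tree["children"][value] = build_tree(list(xs), list(ys),
                                             [f for f in features if f != best])
    return tree
```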
Problems
Data becomes exponentially sparser at lower levels of the tree
The bigger the tree, the higher the risk of overfitting
The greedy algorithm is not guaranteed to find the optimal tree
What do we consider overfitting?