Start with:
Data setup
Trees can handle missing values
Note: trees don't make distributional assumptions
With decision tree building we start with:
PROC HPSPLIT
SEED= sets the random seed so the results are reproducible
CLASS: specify the classification (categorical) variables
– Include the DV this time!
• Because trees can handle interval DVs, a categorical DV must be listed in CLASS so it is treated as a classification target
MODEL statement, like before (order matters, yet it doesn't)
– The tree can build differently based on the order of the inputs, but we are looking for decisions, not a single right answer!
GROW: default method is entropy
PRUNE: default method is cost-complexity
– A balance of error rate and simplicity
RULES FILE=: writes the details (splitting rules) of the tree
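Putting those statements together, a minimal sketch might look like the code below. The data set and variable names (train, target_b, gift_amt_last, months_since_last) are hypothetical placeholders; the statements themselves are PROC HPSPLIT's.

/* Minimal classification-tree sketch with PROC HPSPLIT.             */
/* train, target_b, gift_amt_last, months_since_last are             */
/* hypothetical placeholder names.                                   */
proc hpsplit data=train seed=12345;        /* SEED= makes results reproducible */
   class target_b;                         /* the categorical DV goes in CLASS */
   model target_b = gift_amt_last months_since_last;
   grow entropy;                           /* default grow method              */
   prune costcomplexity;                   /* default prune method             */
   rules file='tree_rules.txt';            /* write the tree's rules to a file */
run;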
Looking at the tree….
Node is the node ID number
N represents the number of observations in that node
2 represents the classification, with the % of observations in that node taking the most common level
Below the line is the breakdown of all classifications
Fit
Confusion Matrix to calculate fit
ROC Curve, just like before
Variable importance ranks how much each variable contributes to the tree
Rules of the leaves
(Only shows the leaf nodes)
If the last gift amount is missing or less than $18, they are predicted to give
If the last gift amount is $18 or more, they are predicted not to give this time.
Subtrees (Pruning)
• Selecting an earlier iteration of a tree to avoid overfitting
• Removing branches that have few observations
– Leaf size, Maximum Depth, Method properties
• Test other trees
– Select a different maximum depth
– Change the maximum number of branches
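As a hedged sketch (same hypothetical names as above), testing a smaller tree might look like this:

/* Test an alternative, smaller tree by capping depth and branching. */
/* MAXDEPTH= and MAXBRANCH= are options on the PROC statement.       */
proc hpsplit data=train seed=12345 maxdepth=4 maxbranch=2;
   class target_b;
   model target_b = gift_amt_last months_since_last;
run;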
More options → the possibilities are endless!
ASSIGNMISSING=
MAXBRANCH=
MAXDEPTH=
GROW
PRUNE
PARTITION
SCORE
ASSIGNMISSING=
– BRANCH: create a separate branch for missing values
– NONE: remove from the analysis (default)
– POPULAR: assign to the largest child
– SIMILAR: statistically determine the most similar child
MAXBRANCH=
Maximum number of branches (children) per node
MAXDEPTH=
Maximum number of levels in the tree
GROW
– CHAID: uses a chi-square criterion
PRUNE
– REP: reduced-error pruning
PARTITION
– Splits the data into training and validation for you!
SCORE
– Creates a scored data set, just like regression
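A sketch combining these options (same placeholder names; here the CODE statement is used to write score code, one common way to produce a scored data set from HPSPLIT):

proc hpsplit data=train seed=12345
             assignmissing=branch          /* separate branch for missings   */
             maxbranch=3 maxdepth=5;       /* cap branching and depth        */
   class target_b;
   model target_b = gift_amt_last months_since_last;
   grow chaid;                             /* chi-square grow criterion      */
   prune reducederror;                     /* REP = reduced-error pruning    */
   partition fraction(validate=0.3);       /* SAS builds the partitions      */
   code file='tree_score.sas';             /* DATA step score code           */
run;

/* Score new data (newdata is a placeholder) with the generated code */
data scored;
   set newdata;
   %include 'tree_score.sas';
run;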
How do you determine the best tree?
Use the subtree assessment plot → look for where the training and validation curves diverge
What does the score data set do?
Allows us to evaluate the model
What are the consequences of a decision tree?
Look to the confusion matrix! → Look for similar performance across partitions to judge fit
We can look deeper into the confusion matrix and calculate:
– misclassification rate
– accuracy
– precision
– specificity
– sensitivity / recall
– F1 score (the harmonic mean of precision and recall)
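As a refresher, with TP, FP, TN, and FN denoting the confusion-matrix counts, the standard definitions are:

\[
\begin{aligned}
\text{misclassification rate} &= \frac{FP + FN}{TP + TN + FP + FN}, \qquad
\text{accuracy} = \frac{TP + TN}{TP + TN + FP + FN},\\
\text{precision} &= \frac{TP}{TP + FP}, \qquad
\text{specificity} = \frac{TN}{TN + FP}, \qquad
\text{sensitivity (recall)} = \frac{TP}{TP + FN},\\
F_1 &= \frac{2 \cdot \text{precision} \cdot \text{recall}}{\text{precision} + \text{recall}}.
\end{aligned}
\]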
We also can look at the ROC chart to assess the overall fit of the model
Potential Improvements: Business Understanding
Reevaluate the business question; evaluate the appropriateness of the DV
Potential Improvements: Data Understanding
Consider missingness and the data inclusion criteria
Potential Improvements: Data Prep
Consider outliers, transformations, record selection, modeling assumptions
Potential Improvements: Modeling
Evaluate feature selection, significance, parsimony
Potential Improvements: Evaluation
Evaluate significance, generalizability (via data partitioning), explainability, and whether it is actionable
Potential Improvements: Deployment
Make the model more actionable
How can we build a better tree?
Change the data prep and/or change the model
How do we change the data prep?
Missing & Outliers
How do we change the model?
Feature Inclusion
– Branches
– Depth
– Grow method
– Pruning method
What should we consider with data mining?
Remember, the goal of data mining is to find unexpected patterns
If the tree does not tell you something you don't already know, it is not very insightful
– Can be helpful if trying to prove your argument or gain confidence
If the tree is overfit, it might not be that helpful
– Not generalizable
– Think about baseball pitches!
What about continuous dependent variables?
DTs are easier to interpret when used for classification
BUT they can be used with a continuous DV
It creates a predicted value instead of a classification
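A hedged sketch for an interval DV (gift_amt, months_since_last, and gift_count are placeholder names). Leaving the DV out of the CLASS statement tells HPSPLIT to treat it as continuous, and the OUTPUT statement is assumed here as the way to capture the predictions:

/* Regression-tree sketch: the DV is interval, so it stays out of CLASS. */
proc hpsplit data=train seed=12345;
   model gift_amt = months_since_last gift_count;
   output out=scored;                      /* predicted values, not classes */
run;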