Reduced-Error Pruning
- Split data into training and validation
sets.
- Do until further pruning is harmful:
- Evaluate impact on validation set of pruning
each possible node (plus those below it)
- Greedily remove the one that most improves
validation set accuracy
- Produces smallest version of most accurate subtree.
- Requires that a lot of data be available.
José M. Vidal
.
20 of 25