Classification decision trees are a popular data mining method for classifying a categorical dependent variable based on measurements of one or more predictor variables. Explain how the decision tree algorithm picks the attributes for splitting. Why is tree pruning useful in decision tree induction? What is a drawback of using a separate set of data to evaluate pruning?
Requirements: 1 page, APA format, 2 references.
I have attached the book as well
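
For context on the attribute-selection part of the question, here is a minimal sketch of one common splitting criterion, information gain (the measure used by ID3). Other algorithms use different measures, such as gain ratio in C4.5 or the Gini index in CART. The toy data, the attribute names, and the function names below are assumptions made up for illustration; they are not taken from the book.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Entropy before the split minus the weighted entropy after it."""
    # Group the class labels by the value each row has for this attribute.
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attribute], []).append(label)
    # Weighted average entropy of the partitions produced by the split.
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels, attributes):
    """Pick the attribute whose split yields the highest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, labels, a))


# Hypothetical toy data: "outlook" separates the classes perfectly,
# while "windy" tells us nothing about the class.
rows = [
    {"outlook": "sunny", "windy": "no"},
    {"outlook": "sunny", "windy": "yes"},
    {"outlook": "rainy", "windy": "no"},
    {"outlook": "rainy", "windy": "yes"},
]
labels = ["play", "play", "stay", "stay"]

chosen = best_attribute(rows, labels, ["outlook", "windy"])
print(chosen)
```

In this sketch the tree would split on `outlook` first, because that split leaves each branch with a single class, whereas splitting on `windy` leaves both branches as mixed as the original data.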