Data Classification Based on Decision Trees


This method chooses a subset of the training examples to form an initial decision tree. If the tree does not give the correct answer for all of the objects, a selection of the exceptions is added to the training subset and a new tree is built; the process continues until a tree that correctly classifies the full set is found. The eventual outcome is a tree in which each leaf carries a class name and each interior node specifies an attribute, with a branch corresponding to each possible value of that attribute.
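
To make this iterative loop concrete, here is a minimal sketch in Python, using scikit-learn's DecisionTreeClassifier as the underlying tree learner. The function name windowed_tree, the initial subset size, and the number of exceptions added per round are illustrative assumptions, not values prescribed by the method.

     import numpy as np
     from sklearn.tree import DecisionTreeClassifier

     def windowed_tree(X, y, window_size=100, batch=50, seed=0):
         """Grow a tree on a subset ("window") of the training examples,
         adding misclassified exceptions until the tree is correct on
         the whole training set."""
         rng = np.random.default_rng(seed)
         window = set(rng.choice(len(X), size=min(window_size, len(X)),
                                 replace=False).tolist())
         while True:
             idx = sorted(window)
             tree = DecisionTreeClassifier().fit(X[idx], y[idx])
             # Exceptions: training objects the current tree gets wrong.
             wrong = np.flatnonzero(tree.predict(X) != y)
             exceptions = [i for i in wrong if i not in window]
             if not exceptions:                 # correct on all objects
                 return tree
             window.update(exceptions[:batch])  # add a selection of them

The loop terminates once the tree classifies every training object correctly, which is guaranteed only if no two objects with identical attribute values carry different class labels.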


Most decision-tree classifiers perform classification in two phases:
  1. Tree Building

     An initial decision tree is grown in this phase by repeatedly partitioning the training data, at each step choosing the attribute split that best separates the classes. The process ends when all the examples in each partition belong to one class; a runnable Python sketch of this recursion follows the list.
     MakeTree( Training Data T ) {
       Partition( T );
     }
     Partition( Data S ) {
       if (all points in S are in the same class)
         then return;
       Evaluate splits for each attribute A;
       Use best split found to partition S into S1 and S2;
       Partition( S1 );
       Partition( S2 );
     }

  2. Tree Pruning

     Branches are created even for spurious “noisy” data and statistical fluctuations, and such branches can cause errors when classifying test data. Tree pruning removes these branches from the decision tree by selecting the subtree with the least estimated error rate; a concrete pruning sketch also follows the list.
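
The Partition() pseudocode above can be realized directly. The following is a minimal sketch under stated assumptions: attributes are numeric, splits are binary tests of the form x[a] <= v, and split quality is measured by Gini impurity (one common choice; the pseudocode itself does not fix a criterion).

     import numpy as np

     def gini(y):
         """Gini impurity of a label array."""
         _, counts = np.unique(y, return_counts=True)
         p = counts / counts.sum()
         return 1.0 - np.sum(p * p)

     def best_split(X, y):
         """Evaluate splits for each attribute; return the (attribute,
         threshold) pair with the lowest weighted Gini impurity."""
         best_a, best_v, best_score = None, None, np.inf
         for a in range(X.shape[1]):
             for v in np.unique(X[:, a])[:-1]:   # candidate thresholds
                 left = X[:, a] <= v
                 score = (left.sum() * gini(y[left])
                          + (~left).sum() * gini(y[~left])) / len(y)
                 if score < best_score:
                     best_a, best_v, best_score = a, v, score
         return best_a, best_v

     def partition(X, y, depth=0):
         """Recursive partitioning, mirroring Partition() above."""
         if len(np.unique(y)) == 1:               # all points in one class
             print("  " * depth + f"leaf: class {y[0]}")
             return
         a, v = best_split(X, y)                  # evaluate splits
         print("  " * depth + f"split: x[{a}] <= {v}")
         left = X[:, a] <= v                      # partition S into S1, S2
         partition(X[left], y[left], depth + 1)   # Partition(S1)
         partition(X[~left], y[~left], depth + 1) # Partition(S2)

     # Tiny usage example: four points, two classes.
     X = np.array([[2.0, 1.0], [3.0, 1.5], [6.0, 2.0], [7.0, 2.5]])
     y = np.array([0, 0, 1, 1])
     partition(X, y)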
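For the pruning phase, “selecting the subtree with the least estimated error rate” can be realized in several ways; one common realization is cost-complexity pruning, sketched here with scikit-learn. Estimating each subtree's error rate on a held-out validation set, and the 30% validation fraction, are illustrative assumptions rather than part of the description above.

     from sklearn.model_selection import train_test_split
     from sklearn.tree import DecisionTreeClassifier

     def prune_by_validation(X, y, seed=0):
         """Grow a full tree, enumerate its cost-complexity subtrees,
         and keep the one with the lowest estimated error rate."""
         X_tr, X_val, y_tr, y_val = train_test_split(
             X, y, test_size=0.3, random_state=seed)
         full = DecisionTreeClassifier(random_state=seed).fit(X_tr, y_tr)
         # Each ccp_alpha value corresponds to one pruned subtree.
         alphas = full.cost_complexity_pruning_path(X_tr, y_tr).ccp_alphas
         best_tree, best_err = full, 1.0 - full.score(X_val, y_val)
         for a in alphas:
             t = DecisionTreeClassifier(random_state=seed,
                                        ccp_alpha=a).fit(X_tr, y_tr)
             err = 1.0 - t.score(X_val, y_val)
             if err <= best_err:      # ties go to the smaller subtree
                 best_tree, best_err = t, err
         return best_tree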



      “Wisely and slow; they stumble that run fast.”    
      ― William Shakespeare, Romeo and Juliet