SQL/MX Data Mining Guide

Mining the Data
HP NonStop SQL/MX Data Mining Guide523737-001
4-9
Checking the Model
Figure 4-4 shows the final decision tree for the example business opportunity.
Changing the Process
Classification trees are used to predict or explain responses to categorical dependent
variables. If you had not been able to develop a classification tree with homogeneous
branches with respect to Cust_Left, you could now do any of the following:
Redefine the statement of the business opportunity.
The data analysis process might indicate new directions that offer more interesting
results.
Redefine the goal.
The goal is equal to Y if the customer had a zero balance for a period of 3 months.
This definition might need adjustment.
Add or remove columns in the mining view.
Some columns that do not contribute to the goal can be removed. Also, the initial
analysis might give new insight into columns that could be added.
Change the definition of derived columns.
For example, the average balance for the period of 3 months might be a better
choice than a zero balance for 3 months.
Change the mappings on the encoded columns.
Each iteration of the data mining process gives new insight into the changes you might
make for the next iteration.
Checking the Model
After you develop a model, you can check the model against the mining data.
Figure 4-4. Final Decision Tree
Marital Status
Single
No Yes
0 3
Married
No Yes
2 1
Widow
No Yes
1 1
Divorced
No Yes
1 5
Male
No Yes
0 5
Female
No Yes
1 0
Chldrn=0
No Yes
0 3
Chldrn>0
No Yes
0 0
Prune the tree here because
the remaining branches do not
yield a pattern.