The decision tree is a greedy algorithm that performs recursive binary partitioning of the feature space. Each leaf partition stores a class distribution. At every tree node, the split is chosen greedily from a set of candidate splits so as to maximize the information gain at that node; that is, the chosen split maximizes IG(D, s), the information gain obtained when split s is applied to the dataset D reaching the node.
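As a concrete illustration of the greedy split selection described above, the following sketch computes IG(D, s) for one candidate split, using Gini impurity as the node impurity measure. The function names and the use of NumPy are illustrative choices, not part of any particular library's API.

```python
import numpy as np

def gini(labels):
    # Gini impurity: 1 - sum_i p(i|t)^2, where p(i|t) is the
    # relative frequency of class i among the labels at this node.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def information_gain(parent, left, right):
    # IG(D, s) = Impurity(D) - weighted average impurity of the
    # two child partitions produced by split s.
    n = len(parent)
    return (gini(parent)
            - (len(left) / n) * gini(left)
            - (len(right) / n) * gini(right))

# A split that perfectly separates a 50/50 parent gives the maximum
# possible Gini gain of 0.5; the greedy algorithm would pick it.
parent = [0, 0, 1, 1]
print(information_gain(parent, [0, 0], [1, 1]))  # 0.5
```

The tree-growing algorithm evaluates `information_gain` for every candidate split at a node and keeps the one with the largest value, without ever revisiting the choice, which is what makes the procedure greedy.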
Following are the three popular impurity measures:

- Entropy: Entropy(t) = -Σ_i p(i|t) log2 p(i|t)
- Gini index: Gini(t) = 1 - Σ_i [p(i|t)]²
- Classification error: Error(t) = 1 - max_i p(i|t)

where p(i|t) is the probability (relative frequency) of class i at node t.
Following is a comparison of the above three measures in the case of binary classification, where each measure reaches its maximum at p = 0.5 (a perfectly mixed node) and falls to zero for a pure node.
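A minimal sketch of that comparison: for binary classification every node is summarized by p, the fraction of the positive class, so each measure can be written as a function of p alone. The function names below are illustrative.

```python
import math

def entropy(p):
    # -sum_i p(i|t) log2 p(i|t) over the two classes; 0*log(0) is taken as 0.
    if p in (0.0, 1.0):
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))

def gini(p):
    # 1 - p^2 - (1 - p)^2
    return 1.0 - p ** 2 - (1 - p) ** 2

def misclassification(p):
    # 1 - max(p, 1 - p)
    return 1.0 - max(p, 1 - p)

# All three measures vanish at the pure nodes p = 0 and p = 1
# and peak at the perfectly mixed node p = 0.5.
for p in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(f"p={p:.2f}  entropy={entropy(p):.3f}  "
          f"gini={gini(p):.3f}  error={misclassification(p):.3f}")
```

Plotting these three curves over p in [0, 1] reproduces the usual comparison chart: entropy (peak 1.0) dominates the Gini index (peak 0.5), which in turn dominates the misclassification error (peak 0.5, but linear rather than concave away from the peak).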
QUESTION I: What are the three basic stopping criteria for halting tree growth (pre-pruning)?
QUESTION II: Why is the decision tree called a greedy algorithm?