Run decision trees on Big Data
By Armand Ruiz, posted Wed October 14, 2015 09:25 PM
To close this series of posts on the new algorithms in IBM SPSS Modeler 17.1, today it is the turn of Tree-AS. The Tree-AS node builds CHAID decision trees on data in a distributed environment, using chi-square statistics to identify optimal splits.
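To make the chi-square idea concrete, here is a minimal sketch in plain Python, not Tree-AS code. It computes the Pearson chi-square statistic between each candidate predictor and the target, then picks the predictor with the strongest association. The toy data and the names `region`, `plan` and `churn` are made up for illustration. Real CHAID also merges similar categories and compares adjusted p-values rather than raw statistics; the raw comparison below is only fair because both predictors have the same number of categories.

```python
from collections import Counter

def chi_square(predictor, target):
    """Pearson chi-square statistic for two categorical sequences."""
    n = len(predictor)
    observed = Counter(zip(predictor, target))   # cell counts
    row_totals = Counter(predictor)              # counts per predictor value
    col_totals = Counter(target)                 # counts per target value
    stat = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n     # count expected under independence
            stat += (observed[(r, c)] - expected) ** 2 / expected
    return stat

# Hypothetical toy data: two candidate predictors and a binary target.
region = ["N", "N", "S", "S", "N", "S", "N", "S"]
plan   = ["a", "b", "a", "b", "a", "b", "a", "b"]
churn  = ["yes", "yes", "no", "no", "yes", "no", "yes", "no"]

scores = {
    "region": chi_square(region, churn),
    "plan": chi_square(plan, churn),
}
best = max(scores, key=scores.get)  # predictor most associated with the target
print(scores, best)
```

Here `region` separates the target perfectly, so it gets the larger statistic and would be chosen for the first split.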
The pre-existing tree algorithms (CHAID, QUEST and C&RT) can also be used with Analytic Server, but only through PSM (pass, stream, merge), which creates multi-threaded split models or averaged ensemble models. Tree-AS, by contrast, truly parallelizes the building of a single model.
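The post does not describe how Tree-AS parallelizes internally, so the following is only a sketch of one common way to build a single tree over partitioned data, under that assumption. Each partition counts its own (predictor value, target value) pairs, and the counts are summed into one contingency table from which a single split decision can be made. This differs from an ensemble approach, where each partition would produce its own model. The partitions and the helper names `local_counts` and `merge` are hypothetical.

```python
from collections import Counter
from functools import reduce

def local_counts(rows):
    """Map step: count (predictor value, target value) pairs in one partition."""
    return Counter(rows)

def merge(a, b):
    """Reduce step: cell counts are additive across partitions."""
    return a + b

# Hypothetical data, already split across three workers.
partitions = [
    [("N", "yes"), ("S", "no"), ("N", "yes")],
    [("S", "no"), ("N", "no")],
    [("S", "yes"), ("S", "no")],
]

# One global contingency table, identical to counting all rows in one place.
table = reduce(merge, (local_counts(p) for p in partitions), Counter())
print(table)
```

Because only the small count tables travel between workers, the amount of data exchanged depends on the number of categories rather than the number of records.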
Tree-AS supports:
CHAID or Exhaustive CHAID models
Binary, categorical and numeric targets
PMML and SQL generation
Scoring via the database Scoring Adapters
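As an illustration of what SQL generation for a tree model means in general, the sketch below turns a small tree into a nested SQL CASE expression that a database could evaluate row by row. This is not the SQL that SPSS Modeler produces; the tree structure, the field names `region` and `plan`, and the function `tree_to_sql` are all assumptions made for the example.

```python
def tree_to_sql(node):
    """Render a decision tree (nested dicts) as a nested SQL CASE expression."""
    if "predict" in node:                       # leaf: emit the predicted label
        return "'" + node["predict"] + "'"
    branches = " ".join(
        f"WHEN {node['field']} = '{value}' THEN {tree_to_sql(child)}"
        for value, child in node["children"].items()
    )
    return f"CASE {branches} END"

# Hypothetical two-level tree: split on region, then on plan.
tree = {
    "field": "region",
    "children": {
        "N": {"predict": "yes"},
        "S": {
            "field": "plan",
            "children": {"a": {"predict": "yes"}, "b": {"predict": "no"}},
        },
    },
}

sql = tree_to_sql(tree)
print(sql)
```

Scoring inside the database this way avoids moving records out to a separate scoring engine. A production generator would also need to escape values and handle numeric ranges and missing data, which this sketch leaves out.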
Tree-AS is similar to the existing CHAID node and scales better to large numbers of records, but it does not support all of the same features (for example, there is no Interactive mode).
#Algorithms
#SPSSModeler
Copyright © 2019 IBM Data Science Community. All rights reserved.