Community
IBM Community Home
AIOps & Management
Business Analytics
Business Automation
Cloud Pak for Data
Data Science
DataOps
Hybrid Data Management
IBM Japan
IBM Z & LinuxONE
Integration
Internet of Things
Power Systems
Public Cloud
Network Automation
Security
Storage
Supply Chain
Watson Apps
WebSphere & DevOps
Sign In
Announcements
Blogs
Groups
Discussions
Events
Glossary
Site Content
Libraries
on this day
between these dates
Posted by
Announcements
Blogs
Groups
Discussions
Events
Glossary
Site Content
Libraries
on this day
between these dates
Posted by
Data Science
Watson Studio
Skip to main content (Press Enter).
Sign in
Skip auxiliary navigation (Press Enter).
Data Science
Topic groups
Global Data Science
Decision Optimization
SPSS Modeler
SPSS Statistics
Watson Studio
Data and AI Learning
User groups
Events
Upcoming Events
On Demand Webinars
IBM Expert TV
Virtual Community Events
All IBM Community Events
Participate
Post to Forum
Share a Resource
Blogging on the Community
Connect with Data Science Users
All IBM Community Users
Data Science Elite
Resources
IBM Support
IBM Cloud Support
IBM Champions
Demos
Marketplace
Marketplace
Watson Studio and Machine Learning
Join the conversation.
Join / sign up
Explore Watson Studio
Skip main navigation (Press Enter).
Toggle navigation
Content types
Announcements
Blogs
Groups
Discussions
Events
Glossary
Site Content
Libraries
Date range
on this day
between these dates
Posted by
Feed
News
Group resources
Learn
Support
Data Science Community
Participate
Blogs
Blog Viewer
Watson Studio
View Only
Group Home
Discussion
278
Library
58
Blogs
56
Events
1
Members
705
Back to Blog List
Run decision trees on Big Data
By
Armand Ruiz
posted
Wed October 14, 2015 09:25 PM
Options Dropdown
Mark as Inappropriate
0
Recommend
To close these series of posts about the new algorithms of IBM SPSS Modeler 17.1, today is the turn of Tree-AS. The Tree-AS node can be used with data in a distributed environment to build CHAID decision trees using chi-square statistics to identify optimal splits.
The pre-existing tree algorithms (CHAID, QUEST and C&RT) can also be used in conjunction with Analytic Server but only through PSM (pass, stream, merge) to create multi-threaded split or averaged ensemble models. Tree-AS truly parallelizes the building of a single model
Tree-AS Supports:
CHAID or Exhaustive CHAID models
Binary, Categorical and Numeric targets
PMML and SQL Generation
and is scoreable via the database Scoring Adapters
It is similar to the existing CHAID node, but can scale better to large numbers of records, although doesn’t support all of the same features (e.g. there is no Interactive mode)
#Algorithms
#SPSSModeler
0 comments
4 views
×
Reason for Moderation
Describe the reason this content should be moderated (required)
Permalink
Data Science
Topic groups
Global Data Science
Decision Optimization
SPSS Modeler
SPSS Statistics
Watson Studio
Data and AI Learning
User groups
Events
Upcoming Events
On Demand Webinars
IBM Expert TV
Virtual Community Events
All IBM Community Events
Participate
Post to Forum
Share a Resource
Blogging on the Community
Connect with Data Science Users
All IBM Community Users
Data Science Elite
Resources
IBM Support
IBM Cloud Support
IBM Champions
Demos
Marketplace
Marketplace
Copyright © 2019 IBM Data Science Community. All rights reserved.
Powered by Higher Logic