IBM Data Lifecycle - Integration and Governance Connect with experts and peers to elevate technical expertise, solve problems and share insights. Join / Log in
written by: Lukasz Cmielowski, PhD, Thomas Parnell
In Cloud Pak for Data 4.6, Watson Studio AutoAI is introducing support for large tabular data. Data sets up to 100 GB are consumed using the combination of ensembling and incremental learning. Adoption of BatchedTreeEnsembleClassifier and BatchedTreeEnsembleRegressor from Snap Machine Learning allows for adding “partial_fit()” capabilities (training on batches) to classical algorithms: