Written by: Lukasz Cmielowski, PhD, and Thomas Parnell
In Cloud Pak for Data 4.6, Watson Studio AutoAI introduces support for large tabular data. Data sets of up to 100 GB are handled through a combination of ensembling and incremental learning. Adopting BatchedTreeEnsembleClassifier and BatchedTreeEnsembleRegressor from Snap Machine Learning adds “partial_fit()” capabilities (training on batches) to classical algorithms: