Originally posted by: Jenna Lau-Caruso
With IBM Spectrum Conductor with Spark v2.1.0.1, you can now install your cluster on a shared file system such as IBM Spectrum Scale. By installing your cluster on a shared file system, you can reduce installation and deployment time and contain storage costs and resources used by system services -- all while taking full advantage of the scalability, security, high availability, recovery, and management capabilities provided by the file system.
With IBM Spectrum Conductor with Spark installed on a shared file system, you gain the following efficiencies:
- Save installation time and reduce installation footprint by installing IBM Spectrum Conductor with Spark once for the entire cluster.
- Save deployment time when creating a Spark instance group by deploying Spark and notebook packages just once for the entire Spark instance group.
- Increase application performance by disabling the Spark shuffle service. By default, the Spark shuffle service is disabled on shared file system installations, as data written to a shared file system does not need to be shuffled between hosts. You can optionally enable the shuffle service if configuring your Spark instance group to use spark.local.dir on local disk.
- Reduce the number of SparkCleanup service instances (for periodic cleanup of Spark instance groups) to 1. The instance runs on a single management host, instead of every host in the cluster.
To install IBM Spectrum Conductor with Spark v2.1.0.1 on a shared file system, simply set the SHARED_FS_INSTALL=Y environment variable before running the installation. You only need to run the installer once and all hosts in the cluster will share the same installation. Note that shared file system installation applies only to standalone software packages.
Take the next step: Download v2.1.0.1 from IBM Fix Central and get started with your installation. For complete details, see Installing to a shared file system in the IBM Knowledge Center.
Comment or question? Talk to us in our forum.
#SpectrumComputingGroup