IBM Storage
The online community where IBM Storage users meet, share, discuss, and learn.
How to configure and tune the performance of Spark workloads on an IBM Spectrum Scale Sharing Nothing Cluster
By Archive User, posted Mon November 27, 2017 02:17 AM
The IBM Spectrum Scale Sharing Nothing Cluster performance tuning guide has been posted. Please refer to the link before making the changes below.
Here are the tuning steps.
Step 1: Configure spark.shuffle.file.buffer
By default, this option is configured in $SPARK_HOME/conf/spark-defaults.conf.
To optimize Spark workloads on an IBM Spectrum Scale filesystem, the key tuning value is the 'spark.shuffle.file.buffer' configuration option, which is defined in the Spark configuration file. It must be set to match the block size of the IBM Spectrum Scale filesystem being used.
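As a minimal sketch of the matching rule above, the Python helper below turns a block size in bytes into a line for spark-defaults.conf. The function name shuffle_buffer_line and the 2 MiB example value are hypothetical and not from the original post; the real block size must be read from your own filesystem. The sketch relies on Spark accepting a size value with a "k" (KiB) suffix for this option.

```python
def shuffle_buffer_line(block_size_bytes: int) -> str:
    """Build a spark-defaults.conf line whose buffer equals the given block size.

    Hypothetical helper for illustration; block_size_bytes is assumed to be
    the block size reported for the IBM Spectrum Scale filesystem.
    """
    kib = 1024
    if block_size_bytes <= 0 or block_size_bytes % kib != 0:
        raise ValueError("block size must be a positive multiple of 1024 bytes")
    # Express the buffer in KiB, using the "k" size suffix.
    return f"spark.shuffle.file.buffer {block_size_bytes // kib}k"


# Assumed example value: a filesystem with a 2 MiB block size.
print(shuffle_buffer_line(2 * 1024 * 1024))  # prints "spark.shuffle.file.buffer 2048k"
```

The printed line can be added to $SPARK_HOME/conf/spark-defaults.conf. The helper rejects sizes that are not a whole number of KiB so that the configured buffer never silently differs from the filesystem block size.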
To query the block size of an IBM Spectrum Scale filesystem, run: 'mmlsfs
#cognitivecomputing
#Real-timeanalytics
#Softwaredefinedstorage
#Customerexperienceandengagement
#sparkworkloadtuning
#Data-centricdesign
#Workloadandresourceoptimization
#FPO