IT Automation

 View Only

Leon's WA Waypoints: Proactive System Monitoring in Workload Automation

By Leon Odenbrett posted 11 days ago

  

How many times have you been called in the middle of the night because your systems weren’t functioning correctly, only to find out the issue could have been prevented much earlier? In our fast-paced world, where the pressure to deliver results without mistakes is immense, it takes courage to implement preventive measures for fear they may introduce more issues. This blog post will break down simple yet effective ways to monitor systems without compromising their integrity.

1. Using "optman ls" for Quick System Validation

One of the simplest yet most effective techniques is creating a job that runs an "optman ls" command. Here’s why this is valuable:

  • "Optman ls" connects to your database, ensuring it is functioning properly.
  • It runs a job through Workload Automation (WA), verifying that WA itself is operational.
  • When executed frequently, it can detect system errors before they escalate into major problems.

By automating this check, you gain an early warning system for potential failures, reducing the chances of unexpected downtime.

2. Leveraging Workload Automation Events to Monitor Your System Monitors

A well-functioning monitoring system is crucial, but what happens when your monitoring tool itself fails? In one of my previous roles, we relied on HP OpenView for system performance monitoring. It worked well—until it didn’t. When HP OpenView went down, so did our visibility, leaving us blind to potential issues.

To solve this, I implemented a WA event-based approach to monitor HP OpenView itself. By using WA events to track the availability of the monitoring system, we ensured continuous oversight of our environment. This simple integration eliminated the need for round-the-clock human supervision and prevented critical lapses in system monitoring.

3. Automating Backup and Publishing Job Definitions

Most organizations already create daily backups of their WA environments, but taking it a step further by sharing these backups with stakeholders can significantly improve communication and efficiency. The WA text-based job definitions are easy to read, making it possible for customers to directly communicate their requirements.

For example, instead of lengthy emails or miscommunications, a customer could simply request:

SCHEDULE CPU1#SCHEDULE1

ON RUNCYCLE RC1 "FREQ=DAILY;INTERVAL=1"

   ( AT 1010 )

:

CPU1#JOB1

CPU1#JOB2

 FOLLOWS JOB1

 FOLLOWS CPU2#SCHEDULE2

END

This structured format enables clear, precise job scheduling updates without unnecessary back-and-forth discussions.

Maximizing Efficiency with Minimal Effort

By implementing these three techniques—proactive validation with "optman ls," monitoring your monitoring systems with WA events, and improving transparency through automated backups—you can enhance your WA operations with minimal effort while maximizing effectiveness.

Stay tuned for more insights on distributed systems and Workload Automation in future editions of Leon's WA Waypoints! Have questions or insights of your own? Drop them in the comments, and let’s continue the conversation.

0 comments
6 views

Permalink