IBM i Global

 View Only
Expand all | Collapse all

Monitoring and Alerting Tool for IBMi

  • 1.  Monitoring and Alerting Tool for IBMi

    Posted Tue September 05, 2023 02:45 AM
    Edited by Avinash Chandra Mandavilli Tue September 05, 2023 04:27 AM

    Hello IBM Champions,

    I just wanted to understand which is the better Monitoring and Alerting tool for iSeries in the current market. 

    My current monitoring tool is Nagios, but we feel the current monitoring tool is not intended at times, so planning for a new one for our existing systems. I'm also thinking about Halcyon Monitoring software and "ManageEngine site24x7", Nagios and site24x7 are both plugin based, so I'm still not sure if site24x7 gives accurate results.

    Please suggest.

    My Requirements:

    1.) Monitor all the necessary probes in iSeries, for ex: CPU, Check if backup job failed, Check if IFS folder file count reaches threshold, etc.

    2.) Performance Dashboards, like CPU, DASD, RAM etc.

    3.) Should be visible to see a system performance in a central view

    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 2.  RE: Monitoring and Alerting Tool for IBMi

    Posted Tue September 05, 2023 09:26 PM

    Dear Avinash

    For item 2 and 3, I see that you do not need to buy any tool for these purposes (Dashboard type info is not a good thing to have for IBM i - it is more suitable for Intel servers in general) because IBM i already provides a free great performance report tool named IBM i Performance Data Investigator (PDI).  Please take time to read my articles about using PDI tool to solve real-world performance problems here and you may see how useful PDI tool can be :  https://www.itjungle.com/author/satid-singkorapoom/ 



    ------------------------------
    Chance favors only the prepared mind.
    -- Louis Pasteur
    ------------------------------
    Satid S.
    ------------------------------



  • 3.  RE: Monitoring and Alerting Tool for IBMi

    Posted Wed September 06, 2023 02:50 AM

    Thanks, you Satid . Yes, I do even thought the same after doing the post. and you are right, we could utilize the inbuilt feature PDI. 



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 4.  RE: Monitoring and Alerting Tool for IBMi

    Posted Wed September 06, 2023 02:40 AM

    Hi Avinash ,

    We use CONTROL4I which is a very good tool to monitor your system using NAGIOS.

    They have are a lot of standard checks and you can create you own if needed.

    And, also important , the support is top

    You can have a look at https://www.m81.eu/

    regards

    Paul



    ------------------------------
    Paul Hemeleers
    ------------------------------



  • 5.  RE: Monitoring and Alerting Tool for IBMi

    Posted Wed September 06, 2023 02:56 AM

    Thank you, Paul, for the recommendation.

    We are currently using Nagios, because the alerts are not receiving intended at times. so, I would like to know on the following. 

    1.) Do you see any lags with this tool, I mean does it capture the monitoring at precise time and sending the email alerts?

    2.) Can you pls send me one sample screenshot of "CPU alert" to see how it looks? Incase if you are interested :)

    3.) Currently, we have Nagios and do we need to integrate this with Control4i 



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 6.  RE: Monitoring and Alerting Tool for IBMi

    Posted Wed September 06, 2023 06:45 AM

    Hello Avinash

    There are many products that allow to monitor the IT infrastructure (in fact, any system that has an IP address). Nagios is one of them, but also Icinga, Shinken, Zabbix, PRTG, …)

    Most of them have very few checks for the IBM i.

     

    The product Control for i is a complement to all those monitoring products. It brings about 170 checks specialized for IBM I, like for example:

    • Check CPU (overall or for a subsystem)
    • Check disk space (overall or an iASP)
    • Check subsystem and jobs
    • Check backups
    • Check HA (Mimix, Quick EDH, iTera, …)
    • Check IFS
    • Check web services
    • Check the WRKPRB
    • Etc

    And it's possible to create new checks by using IBM I programs (RPG or CLP for example)

    Control for I has not his own console, it works in the IBM i sending their data to Nagios for example. 

    To answer to your questions:

    1/ If Nagios (or any other tool) is well configured and there is no network issue, the response time for the alerts is very good, without any lags.

    And the alerts are send immediately

     

    3/ Control for I is based on 1 (only 1) plugin to integrate in Nagios, and an agent (library + subsystem) to install on each partition to monitor.

    It's VERY easy to install and configure (less than 1 hour)

     

    And to answer to "ace ace"

    Yes, there are many SQL requests that ca be used to find many information.

    But you still need a product or tool (like Nagios for example) to manage all those checks, and send the alerts.

    And you have to write the programs yourself on Linux side.

    If your job is to work daily on IBM I, you may be more familiar with a command like CTCHKCPU SBS(QINTER) than with a program like check_ibmi_cpu -l QSYS -s QINTER

     



    ------------------------------
    Andreu Rul
    ------------------------------



  • 7.  RE: Monitoring and Alerting Tool for IBMi

    Posted Wed September 06, 2023 09:56 AM

    Different monitoring tools aren't even linux based, so no linux scripting for basic SQL... they usually have a full gui when one can set a simple SQL query-sensor in the web GUI with a query to hit the IBMi partition. Then, based on the result of the query, you can associate alerts, thresholds etc, graph it...

    This enable a very flexible approach (any query one can think about) without using additional persistent agents or software.

    If one wishes or needs of course can use full blown agent based commercial solution.

    I have for example a query that checks that particular combination of subsystems and jobs are running. If not, the sensor faults leveraging the alerting subsystem of the monitoring tool (send SMS, pagers etc.). This uses a simple query and all is set in 5 minutes in a monitoring tool.

    Usually needs are very local and customer specific and leveraging sql ibmi services could provide very straight approch for some cases.



    ------------------------------
    --ft
    ------------------------------



  • 8.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 09:12 AM
    Edited by Avinash Chandra Mandavilli Sun September 10, 2023 10:29 AM

    Hello Ace,

     we don't have a strong SQL/PGM expert person at the moment. so, I was under the impression to go with a Halcyon Monitoring or Robot Monitoring tool.

    BTW, is it possible to achieve the following with CL or SQL or RPG?

    I would like to set up a daily summary email about our IBM I/AS400 system health every night after 3rd shift engineer duty is completed. 

    For ex: The daily summary email report should look like as below. Please help me out with any sample PGM google links or sample scripts. 

    • CPU: within threshold  
    • ASP: within threshold
    • QBATCH SBS: Jobs are below the threshold value; Condition: if QBATCH is equal to 10 above, it should print "Jobs are below the threshold value "
    • Daily Tape Check ( Test_Tape ) : Job completed successfully; Condition: check if Test_Tape job failed or successful.
    • INV_xxx job: Job is active in Subsystem; Condition: Check if Job is active in the subsystem.
    • WRKPRB: No problem records.
    • QSYSOPR: No MSGW's; Conditions:  if any msgw exists = MSGW needing for a reply, if no msgw exists = No MSGW's



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 9.  RE: Monitoring and Alerting Tool for IBMi

    Posted Wed September 06, 2023 04:23 AM

    Besides the native (and good) IBMi tools to monitor and introspect in detail, yes, it is common that one wants to monitor the IBMi partition(s) from a central monitoring tools, with simple indicators and timeseries, historical graphing, quick health checks, all in the context among other monitored system for overview of the infrastructure.

    I personally don't like Nagios and I don't use it (there are other commercial competitors, not so costly, much quicker to set and use, with good alerting and with support etc.), but any monitoring tool with the generic capability to query a SNMP value *and* additionally via a DB query via a standard SQL interface (like ODBC) could obtain any relevant information needed.

    If not in crude SNMP MIB, basically nowadays an IBMi can expose (via builtin SQL services for i or a custom routine) any kind of information.

    And IBMi has SNMP service (at least the system mib I think)... I use to track "simple numbers like nr of jobs, users, cpu over time quickly.

    i.e.

    uptime, CPU, ASP usage, resource usage etc. = you can just use the light SNMP 

    Check if backup failed = just issue an sql query to check the log for the relevant log entry mapping it to what is expected by the monitoring tool

    Check if IFS folder = just issue an sql query to count the file in folder etc.

    Any decent tool that call itself monitoring tool capable of using SQL to track data would do the job.

    Ibmi services on sql:

    https://www.ibm.com/support/pages/ibm-i-services-sql

    SNMP on IBMi

    https://www.ibm.com/support/pages/ibm-i-snmp-overview



    ------------------------------
    --ft
    ------------------------------



  • 10.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 09:35 AM

    Hello Ace,

    Thank you for the inputs. I may not sure if I could achieve all my existing Monitoring parameters through SNMP.

     Also, I've not done this before using SNMP, if you could shed some light on this, it could be helpful. Giving one sample SNMP monitoring example could give me a clear understanding. 



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 11.  RE: Monitoring and Alerting Tool for IBMi

    Posted Thu September 07, 2023 04:17 PM

    We use ICOM400.  It's quite a basic tool but has a small footprint and can send email alerts.  We've had it for years and its cheap.

    Lately I'm writing my own monitoring/alerting using SQL PL and leveraging SQL functions.  Things like "Check if IFS folder file count reaches threshold" are super easy to do with SQL.

    I guess tying it all together into a pretty dashboard is really what you're paying for.



    ------------------------------
    Steven Riedmueller
    Certified IBM i Admin
    Speaker, Mentor, and Advocate
    ------------------------------



  • 12.  RE: Monitoring and Alerting Tool for IBMi

    Posted Fri September 08, 2023 06:30 AM
    Hi ,

    I have included some printscreens from control4i.
    It's 5250 based , so easy to use and stable ��












    I can recommend this product to everyone who need a simple to use monitoring tool together with Nagios or Centreon  (and others).
    Above some examples of basic checks included in the tool but you can create your own checks.

    regards


    Paul HEMELEERS

    I

     






  • 13.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 08:33 AM

    Thank you so much Paul for sharing the sample screenshots. 

    For example:  can we also setup to receive an alert "when QBATCH jobs are greater than or equal to 10 "?



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 14.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 09:13 AM
    Hi Avinash,
    Yes you can develop your own monitoring items. With cl and sql from qsys2, you can get plenty of information.
    Regards

    Paul

    Verzonden vanaf Outlook voor Android





  • 15.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 10:30 AM

    Hello Ace,

    I'm sure the Nagios is well configured, but we feel the alerts are lagging a bit and sometimes we could understand the probes at Nagios side going to hung state, we don't have a strong expert person at the Nagios side at the moment. so, I was under the impression to go with a Halcyon Monitoring or Robot Monitoring tool.

    BTW, is it possible to achieve the following with CL or SQL or RPG ?

    I would like to set up a daily summary email about our IBM I/AS400 system health every night after 3rd shift engineer duty is completed. 

    For ex: The daily summary email report should look like as below. Please help me out with any sample PGM google links or sample scripts. 

    • CPU: within threshold  
    • ASP: within threshold
    • QBATCH SBS: Jobs are below the threshold value; Condition: if QBATCH is equal to 10 above, it should print "Jobs are below the threshold value "
    • Daily Tape Check ( Test_Tape ) : Job completed successfully; Condition: check if Test_Tape job failed or successful.
    • INV_xxx job: Job is active in Subsystem; Condition: Check if Job is active in the subsystem.
    • WRKPRB: No problem records.
    • QSYSOPR: No MSGW's; Conditions:  if any msgw exists = MSGW needing for a reply, if no msgw exists = No MSGW's


    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 16.  RE: Monitoring and Alerting Tool for IBMi

    Posted Mon September 11, 2023 10:07 AM

    If you login to the builtin integrated web application "IBM Navigator for i" to manage the system, you will see that each graph and table presented have a little "SQL" button; this shows the underlying SQL that IBM Navigator is using. This can easiliy copied and pasted or modified and launched externally.

    This just to give an idea of the capabilities of the SQL interface to get some system data (used by IBM itself).

    You can of course build reports using just native tools for some sort of super custom reporting (that go deep into the application and business logic of the application themselves).

    Not exclusively to that, ideally one would have a generic network monitoring tool set to fetch values from the IBMi partition using simple ODBC sensors, all without additional agents.

    A network tool usually monitors a whole infrastructure and have already a good alerting / thresholding system; additionally, one can set dependencies (for example if a switch connected to the ibmi goes down isolating it, it does not spam you with messages regarding the ibmi, because the switch is of course needed for inband monitoring to the ibmi).

    Basically nowadays on modern IBMi, if a monitoring tool supports easy ODBC SQL sensors, if you require high customization, basically you can do everything without agents in a very lean manner and without additional dependencies from what already there is available.

    It is also efficient, doesn't require bunch of xml / json deserializations, additional EXE, agents... etc.

    Additionally, you have the full SQL at your disposal, therefore one can conduct really sophisticated queries on metrics (running averages, OLAP etc.).

    Exposing the system information via SQL was a good choice by IBM... why invent another eventually limited thing / language like in other systems like i.e. WMI... 



    ------------------------------
    --ft
    ------------------------------



  • 17.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 09:22 AM

    Thanks Steven. I will check and read about ICOM400



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 18.  RE: Monitoring and Alerting Tool for IBMi

    Posted Fri September 08, 2023 07:10 AM

    Hi Avinash, 

    Perhaps you're interested in a novel and innovative approach based on Prometheus and Grafana. An exporter designed for IBM i (available on GitHub) enables Prometheus to retrieve any data that can be queried via SQL. This setup permits you to establish alerts within Prometheus or Grafana, while the dashboards you create with Grafana can prove highly beneficial.

    The subsequent phases involve integrating the entire solution with Event-Driven Ansible and the Red Hat Ansible Automation Platform to achieve a fully automated solution.



    ------------------------------
    Nicolae Chirea
    System architect
    Saytel Informatica SL
    Madrid
    +34607868409
    ------------------------------



  • 19.  RE: Monitoring and Alerting Tool for IBMi

    Posted Sun September 10, 2023 09:02 AM

    Thanks Nicolae. I will check and read about Prometheus or Grafana



    ------------------------------
    Avinash Chandra Mandavilli
    ------------------------------



  • 20.  RE: Monitoring and Alerting Tool for IBMi

    Posted Fri September 13, 2024 02:15 AM

    Hi Avinash,

    You may want to look into HelpSystems' products, as they offer several solutions for IBMi, such as ROBOT for monitoring and automating message responses, ROBOT Console to centralize your system on a single dashboard, and a Network Security Tool for establishing controls over IBMi ports. I believe HelpSystems has rebranded and is now called Fortra. Please check on Google and ask them to provide some demos of their products. I have used these solutions and found them to be excellent.



    ------------------------------
    Nadeem Ahmed Farooqui
    ------------------------------



  • 21.  RE: Monitoring and Alerting Tool for IBMi

    Posted Mon September 11, 2023 10:29 AM

    Hi Avinash,

    the way how Nagios works does not really fit to IBM i. This is the reason why we are using an IBM i event monitor which is running natively on IBM i. As a service provider we are monitoring more then 100 LPARs and delivering 24x7 services by that. Following you can find some basic information about that tool. https://www.it-ps.at/en/monitoring-software/ The software is very flexible, and in the case that a fucntion is missing it could be easly added. It is similar to what you might know from BRMS, where you can execute your own commands in control groups.

    For performance monitoring we are using Trinity, a software which is able to monitor the whole infrastructure stack of an IBM i installation without the need of an agent (e.g. Power Server, External Storage, SAN, VIO Server, IBM i, ....). Having all that information in one place it is easily possible to put all the needed metrics of any component just into one chart, and by that identify possible performance bottlenecks within minutes. It is also possible to compare "good" with "bad" days within one chart, or use predictive to identify issues automatically. Besides Performance Monitoring it could also produce automatically an infrastructure documentation on a daily basis, and much more. Following a link for more information: https://trinity.it-ps.at/en/benefits/



    ------------------------------
    Klaus Haderer
    ------------------------------



  • 22.  RE: Monitoring and Alerting Tool for IBMi

    Posted Mon September 11, 2023 10:35 AM

    If you're into Python, you might find my pymonfori site interesting. 

    https://github.com/richardschoen/pymonfori

    Definitely check out the pydircrawltodb.py script which can crawl your entire disk, including libraries in QSYS.LIB.

    If you come up with any issues or interesting script ideas, feel free to create a git issue.



    ------------------------------
    Richard Schoen
    ------------------------------



  • 23.  RE: Monitoring and Alerting Tool for IBMi

    Posted Tue November 07, 2023 09:34 AM

    Hi Avinash

    I have used Fortra's Halcyon Monitoring Solution for many years. I have worked on the Power platform for over 20 years and IBMi since 1995.  I started using Halcyon monitoring as a customer at a large insurance company for six years and then I went to work for Halcyon Software. I am currently the Halcyon Support Team Lead for EMEA and an SME in the product across all platforms (not just IBMi).

    Halcyon software is available in different levels with additional 'bolt-on' tools for MIMIX and IBM MQ monitoring and The software is modular and easy to learn.

    The benefit to using Halcyon over other methods (such as Nagios and writing your own code)

    • Halcyon software is a mature product, it has been around for over 35 years.
    • It is under continual development and we have a development and fix plan.
    • World-class support.
    • Ability to 'grow' the software by moving to a higher level or adding HA-MX and MQ monitoring modules.
    • Works 'out of the box' with minimal configuration - can be installed and operational within 30 minutes.
    • Menu driven, rule based interface making it easy to learn with no coding.
    • Includes example rules and 'templates' for popular IBM i software (QEDD, BRMS, maxava, Power HA, etc) supplied 'out of the box'
    • Resilience 'built in', which means there are controls to 'monitor the monitoring' and alert if it is not working.
    • Centralised console on IBM i Green-screen for all your IBM i systems, or.....
    • If desired, a windows based centralised 'Enterprise Console' allows you to create a 'single pane of glass' / Dashboard for all monitoring (including: IBM i, IBM AIX, IBM VIOS, Linux on power, Linux on x86, Windows and SNMP capable devices)
    • Designed to be scalable, 1 system or 500 systems (or more) Halcyon is used by many Managed Service Providers Worldwide.

    Halcyon Level 2 is a large, comprehensive suit of components that will do everything you are asking for and more.  If you would like more detailed information I can send that to you.  Also, If you like I can set up a free demonstration.

    We are also happy to provide free Supported trial/Proof of concepts so you can test out the software for yourself.

    You can contact me at michael.davison@fortra.com if you wish or connect with me on LinkedIn: https://www.linkedin.com/in/mike-davison-fortra/

    Kind regards
    Mike



    ------------------------------
    Michael Davison
    ------------------------------



  • 24.  RE: Monitoring and Alerting Tool for IBMi

    Posted Tue November 21, 2023 06:04 PM
    Edited by Pavel Hampl Tue November 21, 2023 06:04 PM

    LPAR2RRD, it gets primary data from the HMC (CPU mainly) with the agent option

    https://lpar2rrd.com/as400.php

    free demo: https://demo.lpar2rrd.com/?menu=e7e0176&tab=0

    It does not cover all your needs, it is focused on performance monitoring



    ------------------------------
    Pavel Hampl
    ------------------------------



  • 25.  RE: Monitoring and Alerting Tool for IBMi

    Posted Thu September 12, 2024 10:56 AM
    Dear Avinash,  Control4i is a very good product and solution but you need Nagios or somethings like that.
    In case you want a all in on IBMi system we are testing using the new Kisco Version 8.03 , monitor called iEventMonitor in our control room with IBMi support 24x7. Very useful and keep tabs on all the required KPIs, sent emails, you can reply to MSGW using the exit program provided with the software and have a nice HTTP portal to keep tabs on one or more LPARs within a single point of use.
    You can also integrate with other Kisco solution for security or SIEM integration. If you need more information: https://www.kisco.com/iem/index.html Good day.


    ------------------------------
    Luca Maurizio Verzicco
    IBMi Service and PreSales Manager
    S2E - Solution to Enterprise
    Milano - Italy
    +39 380 6570039
    ------------------------------



  • 26.  RE: Monitoring and Alerting Tool for IBMi

    Posted Thu September 12, 2024 05:06 PM
    You should test the lpar2rrd monitoring solution, I've been using the free version for about 6 years.
    After some digging you can use any metrics you wish.

    ----------------------------------------------------

    Алексей Марков

    Администратор БД

    АШАН ТЕХ

    al.markov@auchan.ru

    Тел.:  +7 (800) 700 58 00 доб. 79224016

    IP:79224016 FMTN: *79224016

    Моб.: +7 (903) 665 13 18




    чт, 12 сент. 2024 г. в 17:56, Luca Maurizio Verzicco via IBM TechXchange Community <Mail@connectedcommunity.org>:
    Dear Avinash,  Control4i is a very good product and solution but you need Nagios or somethings like that.
    In case you want a all in on IBMi system we are testing using the new Kisco Version 8.03 , monitor called iEventMonitor in our control room with IBMi support 24x7.
    Very useful and keep tabs on all the required KPIs, sent emails, you can reply to MSGW using the exit program provided with the software and have a nice HTTP portal to keep tabs on one or more LPARs within a single point of use.
    You can also integrate with other Kisco solution for security or SIEM integration.
    If you need more information: https://www.kisco.com/iem/index.html

    Good day.

    ------------------------------
    Luca Maurizio Verzicco
    IBMi Service and PreSales Manager
    S2E - Solution to Enterprise
    Milano - Italy
    +39 380 6570039
    ------------------------------




    Original Message:
    Sent: 9/12/2024 10:56:00 AM
    From: Luca Maurizio Verzicco
    Subject: RE: Monitoring and Alerting Tool for IBMi

    Dear Avinash,  Control4i is a very good product and solution but you need Nagios or somethings like that.
    In case you want a all in on IBMi system we are testing using the new Kisco Version 8.03 , monitor called iEventMonitor in our control room with IBMi support 24x7. Very useful and keep tabs on all the required KPIs, sent emails, you can reply to MSGW using the exit program provided with the software and have a nice HTTP portal to keep tabs on one or more LPARs within a single point of use.
    You can also integrate with other Kisco solution for security or SIEM integration. If you need more information: https://www.kisco.com/iem/index.html Good day.


    ------------------------------
    Luca Maurizio Verzicco
    IBMi Service and PreSales Manager
    S2E - Solution to Enterprise
    Milano - Italy
    +39 380 6570039
    ------------------------------


  • 27.  RE: Monitoring and Alerting Tool for IBMi

    Posted Mon September 16, 2024 10:06 AM

    Yes, Michael Davison (Fortra/Halcyon) and Luca Maurizio Verzicco (S2E/iEventMonitor ) and Alexey MArkov (Awah/lpar2rrd) are RIGHT ! All these tools are great tools.

    But the main difference (and strength) with "Control for i' (or "Control4i") is that our solution Integrates the IBM i INTO your EXISTING monitoring tool such as Nagios, Shinken, PRTG, SolarWinds, Pandora, Zabbix and many others !

    'Control for i' adds more than 180 plug-and-play controls (NO DEVELOPMENT) . With it, you can lmonitor ALL your IT platforms from ONE enterprise solution ...

    Hundreds of IBM i LPARS are monitored worldwide with 'Control for i'



    ------------------------------
    Bruno Taverne
    M81 Software House
    www.m81.eu
    ------------------------------



  • 28.  RE: Monitoring and Alerting Tool for IBMi

    Posted Fri September 13, 2024 07:09 AM

    Avinash,

    Another worthy mention would be AAG (At A Glance Monitoring) , runs even on a rasberry PI for ease of use and installation

    https://www.shieldadvanced.com/AAG.php

    A good set of monitoring endpoints there and I am sure Chris(the owner of Shield) will be happy to assist you with a custom report to your requirements with his software. 

    Marius



    ------------------------------
    Marius le Roux
    Owner
    MLR Consulting
    ------------------------------