AI: an SRE's best friend
In the past 6 months Instana has been supercharged by AI, leveraging the latest granite models from Watsonx to solve really tough challenges that have taxed SREs for too long. Probably the most used metric for IT incident is MTTR - which describes the mean time to repair an issue - and it has a 1-to-1 relationship with monetary costs. For example, you receive a certain amount of money back depending on how long it takes your cell phone provider to restore service, as defined by the SLA (Service Level Agreement).
It's not uncommon to see SREs tasked with SLOs (Service Level Objectives) that have multiple 9's - e.g. an SLO of 99.999% available means an error budget of only 5.26 minutes per year! Many times a human may not even respond in 5 minutes if an incident occurs in the middle of the night, so how can SREs keep this up? Only with AI & Automation's help.
Starting the journey: AI generated automation with Instana
The latest private and public previews from Instana have explored themes such as probable root cause and incident summarization, which are proving to be key accelerators to the diagnostic and collaborative activities surrounding an IT incident. We'll zoom into two parallel themes that leverage generative AI, essentially answering: what to do and how to do it.
In Instana we help SREs figure out the next steps - which can be deep diagnostics or remediative steps - via manual runbooks. We provide a set of built-in manual runbooks that were generated with Watsonx and then curated by SMEs - e.g. our generated k8s runbooks were carefully tested and edited by k8s experts. These are available as built-in actions within an issue or incident:
If there isn't a suitable built-in action you can switch to the live generation tab and edit the prompt that goes into Watsonx:
Orchestrating workflows with RNA
IBM Rapid Network Automation is a powerful API-driven tool that allows the authoring and execution of automation workflows. The RNA and Instana teams have collaborated to create a workflow that can call any Instana automation from within another workflow in RNA! This workflow has been published to the external automation repository and can be easily downloaded:
#community-stories2