Co-Author : @Arturo Cabre Watson AIOps version 3.2 delivered many key enhancements across the whole platform, in both the frontend and behind the scenes. A major focus was consolidating the backend, data is now easier to process and maintain, and is synced across all user...
"Monitoring tells you when something is wrong. Observability enables you to understand why" @JULIUS WAHIDIN is a member of the AIOps Elite Team, the top technical consultants/architects for anything related to IBM Cloud Pak for Watson AIOps and Instana, and the emerging technologies in the...
Glynn Lunney, NASA and Apollo engineering & operations legend, passed away last Friday. As a Flight Director, responsible for all operational aspects of the space flight, Lunney was a role model for SRE Leadership. Every Site Reliability Engineer, and Incident Commanders in particular, can...
I've just published the latest in my series of articles on SRE lessons from the Lunar Landing... in this article I discuss some of the work done by flight controllers in Mission Control, their difficulties and how we'd approach this problem today. Here's a hint - AIOps, specifically Cloud Pak...
Site reliability engineering (SRE) uses software engineering to automate IT operations tasks - e.g. production system management, change management, incident response, even emergency response - that would otherwise be performed manually by systems administrators (sysadmins). IBM Cloud Pak for...
Join us for the next IBM client webcast in the "Chat with Expert Labs" series: How an SRE or IT Ops Engineer can use Insights from Watson AIOps November 12, 10:00 am ET (60 minutes) Watson AIOps helps our clients address complex IT issues quickly to minimize service...