Blogs

ITSM meets ITOM

Erin McMahon

26 May 2021
4 min read

ITSM meets ITOM

TL; DR: MTTD is the new MTTR

Part two in a multi-part series

How downstream incident prediction and root cause analysis prevent upstream service outages

Eliminating downtime is no longer a secondary priority. It is every bit as essential as providing the right features. The distance between competing digital services is one outage away. Even slow load times or partial outages cost millions per minute for businesses that rely on digital-first relationships with customers.

As a result, everyone in IT is now expected to understand and manage operations. MTTD, mean time to detect, is the new MTTR, mean time to resolve. The best service experience is the one that is never interrupted which means traditional user-facing service management governed by the principles of ITSM is now ceding focus to infrastructure-facing service management governed by the principles of ITOM.

This shift is catalyzing interest within service management teams to embrace operations as a core discipline. The rise of AIOps as an area of expertise is the result of service delivery managers spending increasing portions of their time monitoring infrastructure performance and availability. The leading ITSM vendor, ServiceNow, is experiencing phenomenal growth in automation-related disciplines to help customers achieve their vision of near-zero downtime.

Just as incident and problem management define user-facing service delivery strategies, incident prediction and root cause analysis define infrastructure-facing service delivery strategies. Thankfully, the leader in incident prediction and automated root cause analysis, InsightFinder, has partnered with ServiceNow to ensure availability-related issues are detected upstream before users are impacted downstream.

The partnership between ServiceNow and InsightFinder combines the strength of ServiceNow in workflow automation, discovery, CMDB, and service mapping with the strengths of InsightFinder in anomaly detection, stream processing, and unsupervised machine learning to feed automated insights into incident and problem management workflows. The integration works as follows:

Unsupervised machine learning algorithms in InsightFinder detect anomalies across logs, metrics, traces, and change events.
Detected anomalies are used to predict future incidents with enough lead time for operators to investigate in InsightFinder before users are impacted.
To facilitate investigation, probable root cause analyses are used to determine the likely source of future incidents.
Probable root causes are used to isolate CIs and services likely to be impacted based on service maps maintained by ServiceNow network discovery which feeds relationship data to the CMDB.
Once infrastructure incidents are service-aware, service owners are notified by ITSM workflows in ServiceNow so status updates can be provided to stakeholders.
Incidents and problems are triaged and integrated with change management processes in ServiceNow to ensure approved actions are taken proactively and archived to build change risk profiles.
Change tasks in ServiceNow are used as inputs to InsightFinder machine learning models to improve the accuracy of anomaly detection and identify problems that are caused by change events before business gets impacted.

This cycle of InsightFinder AIOps feeding ServiceNow ITSM yields continuous process improvement. Each iteration through the cycle combines the best of machine learning with human intelligence. Subsequent predicted incidents include more detailed historical records about what happened and what resolved it. Months into integrating AIOps with ITSM, typical customers report previously common outage patterns have been completely eliminated. Those customers convert downtime reduction into business benefits including reduced customer churn, reduced SLA penalties, and increased spend per customer.

Service owners once relegated to front office user support are being empowered to perform like operators thanks to insights about future incidents and probable root causes that previously required large teams of NOC operators and days of rigorous analysis.

The future of IT operations is a combination of automated anomaly detection using unsupervised machine learning plus automated service mapping and problem management. That future is available today from InsightFinder and ServiceNow.

To learn more about the power of InsightFinder and ServiceNow, request a demo to speak to our team.

Contents

Erin McMahon

Published: 26 May 2021
4 min read

Blogs

Accelerate Your Organization’s Path to Zero Downtime with PagerDuty + InsightFinder’s Unified Intelligence Engine

InsightFinder is pleased to announce a two-way integration with PagerDuty. Starting today, PagerDuty users…

Blogs

ITSM meets ITOM

ITSM meets ITOM TL; DR: MTTD is the new MTTR Part two in a…

Predictive anomaly detection helps finance improve trading system performance

Blogs

Using InsightFinder to Anticipate Issues in Critical Finance Applications

In the fast-paced world of finance, IT Operations teams that use predictive anomaly detection…

See how InsightFinder helps your team deliver reliable services across every layer of the stack

Take InsightFinder AI for a no-obligation test drive. We’ll provide you with a detailed report on your outages to uncover what could have been prevented.

ARI

IT Reliability

AI Reliability

Unified Intelligence Engine - UIE

Integrations

Release Notes

ITSM meets ITOM

Related Resources

Accelerate Your Organization’s Path to Zero Downtime with PagerDuty + InsightFinder’s Unified Intelligence Engine

ITSM meets ITOM

Using InsightFinder to Anticipate Issues in Critical Finance Applications

See how InsightFinder helps your team deliver reliable services across every layer of the stack