Beyond the Hype: How Do You Really Put AI to Work for ITOps?
Rubber, meet road:
What's really possible?
AIOps puts machine learning and data analytics to work in a
number of different contexts to enable simpler, faster, and more
efficient IT operations management—without relying on static
thresholds. These include:
In the following pages we’ll explore each of these in a little more depth.
The basics: Getting smarter about APM
Any successful AIOps strategy requires a solid foundation of ITOps best practices—and that begins with application monitoring.
The way we deliver applications is changing. We need a new way to monitor their performance, too. Goodbye to simple,
centralized application delivery infrastructures and standardized endpoints—hello multi-source delivery, hybrid and multi-cloud
environments, and a dizzying diversity of user devices.
As app developers and owners embrace DevOps and agile to speed innovation, it’s no longer enough for you to monitor for availability, errors, and job completion times. Now you’ve got to make sure data is being processed correctly, identify problem-causing code, and diagnose slow application response times.
What does APM need to look like today? Focus on the perspectives that matter most:
Your infrastructure doesn’t drive your business; your applications and services do. By unifying monitoring data across your environment into a service-aware, app-centric view, you can better troubleshoot issues, find root causes, and prevent incidents from recurring.
Customers don’t care about your servers or routers; they care about the quality of service they’re getting. Critical metrics to monitor to ensure a great end user experience include availability, responsiveness, and usability.
Behavioral learning:
Is this issue really an issue?
How can you tell the difference between a routine fluctuation in utilization and a
problem in the making? Behavioral learning puts utilization metrics in the context of
normal activity to separate actual problems from the ups and downs of a typical week.
When you’re relying solely on a static threshold—say, 80 percent CPU utilization—you’ll get alerts
even for spikes that are perfectly understandable, like high CPU usage first thing Monday morning.
With AIOps, machine learning algorithms establish a learned baseline that reflects typical utilization
patterns over the course of the week. For example, the baseline can recognize a recurring Monday-morning CPU spike as normal activity.
You still set a static threshold for each resource, but now you’re only alerted if utilization is both over
the threshold and beyond the learned baseline. Here’s how it plays out.
Fewer false alarms, clearer issue prioritization: AIOps is already making life better for ITOps.
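The combined rule described above (alert only when utilization is both over the static threshold and beyond the learned baseline) can be sketched in a few lines of Python. This is a minimal illustration, not how any particular product computes its baseline: the function names, the (weekday, hour) slots, and the "mean plus two standard deviations" band are all assumptions made for the example.

```python
from statistics import mean, stdev


def learn_baseline(history, band_width=2.0):
    """Learn an upper baseline per time slot from past utilization samples.

    history maps a slot such as ("Mon", 9) to a list of past utilization
    percentages (at least two samples per slot). The band of
    mean + band_width * standard deviation is an assumed, simplified model.
    """
    return {
        slot: mean(samples) + band_width * stdev(samples)
        for slot, samples in history.items()
    }


def should_alert(value, slot, baseline, static_threshold=80.0):
    """Alert only if the value exceeds BOTH the static threshold and the baseline."""
    return value > static_threshold and value > baseline[slot]


# Hypothetical history: busy Monday mornings, quiet Wednesday nights.
history = {
    ("Mon", 9): [85, 88, 90, 87],
    ("Wed", 3): [20, 22, 19, 21],
}
baseline = learn_baseline(history)

monday_spike = should_alert(89, ("Mon", 9), baseline)     # over 80, but normal for Monday
night_spike = should_alert(89, ("Wed", 3), baseline)      # over 80 and far above baseline
night_bump = should_alert(50, ("Wed", 3), baseline)       # above baseline, under threshold
```

With this sample data, the 89 percent reading on Monday morning stays quiet because it sits inside the learned band, while the same reading at 3 a.m. on Wednesday raises an alert.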
Predictive event management:
Your crystal ball
The best time to solve a problem is before it becomes a problem.
Probable cause analysis:
Zeroing in
You can spend all day chasing down the effects of a problem—
or you can go right to its cause and eliminate all those effects
in one blow.
For example, let’s say a server in your environment is running slowly due to
excessive processing time. Probable cause analysis reveals related events
that suggest an issue with the server’s memory. In all likelihood, it’s this
memory issue that’s slowing the server’s response to data requests.
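One simple way to picture the "related events" step in the server example is to collect events on the same host shortly before the symptom and rank them by category. The sketch below is a deliberately naive illustration under stated assumptions: the `Event` fields, the category labels, and the ten-minute window are hypothetical, and real probable cause analysis uses richer correlation than a frequency count.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    host: str
    category: str     # hypothetical labels such as "memory", "disk", "cpu"
    timestamp: float  # seconds on a shared clock


def probable_causes(symptom, events, window_seconds=600):
    """Rank event categories seen on the symptom's host shortly before it.

    Returns (category, count) pairs, most frequent first. Frequency within
    a time window is an assumed stand-in for real correlation logic.
    """
    related = [
        e for e in events
        if e != symptom
        and e.host == symptom.host
        and 0 <= symptom.timestamp - e.timestamp <= window_seconds
    ]
    return Counter(e.category for e in related).most_common()


# Hypothetical event stream around a slow-response symptom on "srv-01".
symptom = Event("srv-01", "slow_response", 1000.0)
events = [
    Event("srv-01", "memory", 700.0),
    Event("srv-01", "memory", 820.0),
    Event("srv-01", "disk", 900.0),
    Event("srv-01", "memory", 950.0),
    Event("srv-02", "memory", 940.0),   # different host, ignored
    Event("srv-01", "cpu", 100.0),      # outside the window, ignored
    symptom,
]
ranking = probable_causes(symptom, events)
```

Here the memory events dominate the window before the slowdown, which mirrors the conclusion in the example: the memory issue is the most likely cause to investigate first.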
Log analytics:
Listening to your log data
The ability of machine learning to learn baselines and detect
anomalies can be applied to log files as well.
In this case, machine learning offers yet another way to discover issues that
need to be addressed. You haven’t started getting complaints from users; you
haven’t crossed any utilization thresholds; you haven’t seen any red lights—
but something’s not right. By investigating and addressing the situation
now, you can keep this anomaly from becoming a real problem that affects
your business.
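To make the idea of a learned baseline for log data concrete, the sketch below counts how many log lines of some kind (for example, error lines) arrive per interval, learns a baseline from an initial training period, and flags later intervals that deviate sharply. The function name, the z-score test, and the limit of 3.0 are assumptions chosen for illustration, not a description of any specific log analytics product.

```python
from statistics import mean, stdev


def anomalous_intervals(counts, training_size, z_limit=3.0):
    """Return indexes of intervals whose log volume deviates from the baseline.

    counts holds the number of matching log lines per interval. The first
    training_size intervals (at least two) form the learned baseline; later
    intervals are flagged when their z-score magnitude exceeds z_limit.
    """
    training = counts[:training_size]
    mu = mean(training)
    sigma = stdev(training)

    flagged = []
    for index in range(training_size, len(counts)):
        deviation = counts[index] - mu
        if sigma > 0:
            is_anomaly = abs(deviation) / sigma > z_limit
        else:
            # A perfectly flat baseline: any change at all is unusual.
            is_anomaly = deviation != 0
        if is_anomaly:
            flagged.append(index)
    return flagged


# Hypothetical error-line counts per minute; the first eight train the baseline.
error_counts = [4, 5, 6, 5, 4, 6, 5, 5, 6, 19, 5]
flagged = anomalous_intervals(error_counts, training_size=8)
```

In this sample, the jump to 19 error lines is flagged even though no utilization threshold has been crossed, which is the kind of early signal the section describes.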
CUSTOMER CASE STUDY
- Fragmented monitoring made it hard to maintain availability, uptime, and performance across over 650 applications
- A reactive, email-based approach to ticket creation hurt responsiveness and MTTR
- Development teams had to monitor their own applications, diverting their focus from innovation
To increase speed, efficiency, and insight, the company created a new centralized digital operations center built on AI-powered TrueSight solutions from BMC.
- A lean, AI-powered process makes it possible to detect and correlate events, take corrective action, automatically generate tickets, and alert the right people
- Approximately one-third of critical tickets are intercepted by the operations center and addressed proactively, a share that continues to rise
Ensuring operational excellence with built-in intelligence
As digital transformation pushes the speed and complexity of ITOps to new levels, artificial intelligence has become more than just a useful innovation; it’s now crucial for survival and success. By building data analytics and machine learning into your ITOM toolset, you can:
- Proactively predict and prevent issues before they impact your business
- Gain deeper insight from activity and events across your complex environment
Most importantly, AIOps helps you deliver the performance and availability your business needs, when it needs it, no matter how complex your environment becomes. That elevates the strategic importance and visibility of ITOps so the business sees you as the heroes you are, as it should be.
TrueSight is an AIOps platform that helps complex and growing enterprises reinvent how IT operations delivers fast, secure, and cost-effective services.
Continue your AIOps education
It’s a great time to be in IT operations. New AI capabilities have
the potential to deliver increased value to the business, with
machine learning and analytics applied to big data to deliver rich,
actionable insights that can transform IT operations.
Analyst Report
Read the report from Enterprise Management
Associates based on a survey of over 300 IT
decision makers: ‘AIOps and IT Analytics
at the Crossroads – What’s real today and
what’s most needed for tomorrow?’
AIOps Video
Watch the short video ‘Elevate IT Operations
with AIOps’ on how the TrueSight AIOps
platform is helping customers reduce event
remediation times, transform ITOps, and drive
digital transformation.
About BMC
BMC helps customers run and reinvent their businesses with open, scalable, and modular solutions to complex IT problems. Bringing
both unmatched experience in optimization and limitless passion for innovation to technologies from mainframe to mobile to cloud
and beyond, BMC helps more than 10,000 customers worldwide reinvent, grow, and build for the future success of their enterprises.