Observability and security have come to the forefront of IT service delivery, a convergence that was long overdue. This was the urgent theme of the 2022 Splunk conference in Las Vegas.
The latest Atlassian outage goes to show that every cloud provider is prone to unplanned downtime sooner or later. While every company strives to achieve that unicorn status of zero downtime, it is almost impossible to achieve that in the face of “Unknown Unknowns.” I analyze it and offer some solutions on how to mitigate that if disaster strikes you.
My latest research report "A CIO's guide to AIOps" is just published. I discuss why AIOps is a savior for digital heavy enterprises in this report and how to do it right including the top use cases that I have seen enterprises use it for.
I was fortunate enough to be invited to attend and speak at the Refresh 2021 conference in Las Vegas earlier this month. This blog is my review of the the conference Refresh 2021.
Most enterprises today are not set up to handle IT-related incidents, or crises, in real time. The classic legacy enterprises are set up to deal with IT incidents in old-fashioned ITIL ways, without considering the cloud, software-as-a-service (SaaS) nuances, or the social media venting by customers. Newer digital-native companies do not put much emphasis on digital incident management. Read this report to understand how digital leaders are changing the game with modern Incident Management systems.
With the COVID-19 pandemic still affecting many areas of the globe, work from home is more of a mainstay than a luxury and digitizing business is more of a survival strategy than an option. Most enterprises are struggling with both of these concepts while digital-native companies are thriving. This report outlines the primary SRE trends that Constellation has observed for 2022 and beyond, based on recent and ongoing conversations with many digital CxO-level executives, SRE practitioners, and incident management team members.
When it comes to crisis and incident management in the cloud/digital era, HOPE IS NOT A STRATEGY! A properly setup Incident Management process should identify the incidents, provide you with Root Cause Analysis (RCA), propose possible fixes, and escalate the issue to the right SRE, DevOps, SME in a matter of minutes.
AIOps is a discipline, set of tools, and set of use cases that can help eliminate such situations and get to the root cause of a problem quickly. At the core, AIOps is expected to identify issues that experienced human IT specialists are able to, but in a time frame that is multitudes shorter than what a human is capable of. Constellation Research identified the following offerings to be pure-play AIOps solutions that provide at least the bare-minimum functionality we define in the selection criteria below.
Unplanned downtime is a nightmare for every IT executive. Long-drawn war rooms drain valuable resources and businesses lose opportunities and risk brand damage. Particularly with many choices and alternatives for any service, reducing churn by providing reliable services is a top agenda for any digital business. Having siloed teams, siloed monitoring/observability tools, multi-cloud operations, hybrid locations, blend of legacy, shortage of skilled IT analysts, and new tools all add to the issue. Constellation evaluates more than 40 solutions categorized in this market. This Constellation ShortList is determined by client inquiries, partner conversations, customer references, vendor selection projects, market share and internal research.
In digital economy, you must move fast to survive. Not in six-month release cycles. But moving with fast release cycles, continuous releases, a mature CI/CD pipeline is only a portion of the solution. If you continue to break your systems at a faster rate but are unable to fix them faster as well, you are setting up for unplanned disasters that will hurt your business sooner than later. I discuss some of the fixes in this blog.