AIOps Has a Data(Ops) Problem
Modern complex systems are easy to develop and deploy but extremely difficult to observe. Their IT Ops data gets very messy. If you have ever worked with modern Ops teams, you will know this. There are multiple issues with data, from collection to processing to storage to getting proper insights at the right time.
Report: Observability deep dive report for Zebrium
Summary I did a deep dive vendor research report on Zebrium which specializes in automatic root cause analysis using machine leaning. Quick summary from the report: Zebrium is an Observability/AIOps platform that uses unsupervised machine learning to auto-detect software problems and automatically find root causes, reducing manual labor and speeding […]
Achieving Reliable Observability Part 1 – Making Cloud-Native Observability More Robust
I was having a conversation with a CxO level customer as part of an AIOps/Observability workshop, and from what I could tell, most are confused about how to properly operationalize cloud-native production environments – especially the monitoring/observability portion. Here is how the conversation went.
What is AIOps? – AI for IT operations explained
Every business now depends on IT. Efficient IT Operations is mandatory for all businesses, especially those operating in a hybrid mode – a mix of existing data centers and multi-cloud locations. As with any business process, IT operations can be augmented with machine learning-based solutions. IT is particularly fertile ground for AI as it is mostly digital, has seemingly endless processes requiring automation and there are gigantic amounts of data to process.
Comprehensive observability is core to future-proofing your IT infrastructure
Observability is an emerging set of practices, platforms, and tools that goes beyond monitoring to provide insight into the internal state of systems by analyzing external outputs. Monitoring has been a core function of IT for decades, but old approaches have become inadequate for a variety of reasons—cloud deployments, agile development methodology, continuous deployments, and new DevOps practices among them.
Report: GigaOm Radar for Cloud Observability
Summary Observability is an emerging set of practices, platforms, and tools that goes beyond monitoring to provide insight into the internal state of systems by analyzing external outputs. It’s a concept that has its roots in 19th century control theory concepts and is rapidly gaining traction today. Of course, monitoring has been […]
AIOps vs Observability vs Monitoring – What Is The Difference? Are You Using The Right One For Your Enterprise?
This article was originally published in Forbes on Feb 2, 2021 In the last few months, I have been analyzing and writing a research report for GigaOm in this space, which is due to be released soon. I looked at about 30+ vendors in this space as part of that […]
Report: Key Criteria for Evaluating Cloud Observability
An Evaluation Guide for Technology Decision Makers by Andy Thurai and David Linthicum Summary The concept of observability has evolved over the years, referring to the ability to monitor the internal states of systems using the externally exhibited characteristics of those systems. It provides the ability to predict the future […]
Getting to Zero Unplanned Downtime with AIOps
Originally published in APMDigest.com on Sep 30, 2020 Most business executives are worried about the competition taking them down. What they don’t realize is, their own IT can do an equal amount of damage. Imagine this: if your rideshare app is constantly down, would you rather wait for it to […]