In Digital Economy, You Should Fail Fast, But Must Also Recover Fast

August 12, 2021
Andy Thurai
AI, AIOps, Blogs, ML, Observability

In Digital Economy, You Should Fail Fast, But Must Also Recover Fast

In digital economy, you must move fast to survive. Not in six-month release cycles. But moving with fast release cycles, continuous releases, a mature CI/CD pipeline is only a portion of the solution. If you continue to break your systems at a faster rate but are unable to fix them faster as well, you are setting up for unplanned disasters that will hurt your business sooner than later. I discuss some of the fixes in this blog.

Report: Data Done Right for AIOps with RDA

Most of the AIOps companies are doing the process right, some use AI and ML properly, but most fail on how to automate data processing, or DataOps, on how to get the right data to AIOps tools at the right time. In this eBook "Data Done Right for AIOps," I discuss this in detail and offer some possible solutions including Robotic Data Automation (RDA).

June 18, 2021
Andy Thurai
AI, AIOps, Blogs, Juniper, ML

What are the criteria for selecting a good AIOps solution? I have the top 5 – do you have more?

What are the criteria for selecting a good AIOps solution? How do you compare and measure the solutions one against another? Especially when there are so many solutions out there all claiming to solve the problem better than the others! In this article, I outline the top 5 criteria that all buyers should keep in mind when considering an AIOps solution. Let me know if you have more.

May 26, 2021
Andy Thurai
AI, AIOps, Blogs, Juniper, ML

What does AIOps mean for the networking world?

The network is the foundation for all applications. With the increase in distributed applications and their hybrid nature, the network has become even more important. Delegating more and more tickets to AI will not only help reduce pressure on support team resources, but also fundamentally shift operations from being focused on reactive troubleshooting to proactive remediation.

May 21, 2021
Andy Thurai
Cloud, Videos, youtube

Edge visibility & architecture chat with Mark Thiele, CEO, Edgevana.

I am very honored to be part of the Edgevana podcast series talking to the legendary Mark Thiele on various edge, AI, AIOps, total observability at edge, and other related topics.

AIOps Has a Data(Ops) Problem

Modern complex systems are easy to develop and deploy but extremely difficult to observe. Their IT Ops data gets very messy. If you have ever worked with modern Ops teams, you will know this. There are multiple issues with data, from collection to processing to storage to getting proper insights at the right time.

Report: Observability deep dive report for Zebrium

Summary I did a deep dive vendor research report on Zebrium which specializes in automatic root cause analysis using machine leaning. Quick summary from the report: Zebrium is an Observability/AIOps platform that uses unsupervised machine learning to auto-detect software problems and automatically find root causes, reducing manual labor and speeding […]

Achieving Reliable Observability Part 1 – Making Cloud-Native Observability More Robust

I was having a conversation with a CxO level customer as part of an AIOps/Observability workshop, and from what I could tell, most are confused about how to properly operationalize cloud-native production environments – especially the monitoring/observability portion. Here is how the conversation went.

March 9, 2021
Andy Thurai
AI, AIOps, Blogs, Cloud

What is AIOps? – AI for IT operations explained

Every business now depends on IT. Efficient IT Operations is mandatory for all businesses, especially those operating in a hybrid mode – a mix of existing data centers and multi-cloud locations. As with any business process, IT operations can be augmented with machine learning-based solutions. IT is particularly fertile ground for AI as it is mostly digital, has seemingly endless processes requiring automation and there are gigantic amounts of data to process.

February 27, 2021
Andy Thurai
AI, Blogs, Observability

Comprehensive observability is core to future-proofing your IT infrastructure

Observability is an emerging set of practices, platforms, and tools that goes beyond monitoring to provide insight into the internal state of systems by analyzing external outputs. Monitoring has been a core function of IT for decades, but old approaches have become inadequate for a variety of reasons—cloud deployments, agile development methodology, continuous deployments, and new DevOps practices among them.