A lot is expected of automation in IT environments in the next few years. By 2024 Gartner predicts IT automation will drive a 20% reduction in unplanned downtime and lower operational costs by 30%. At the same time, the efficiencies generated by IT automation and analytics will allow organizations to refocus 30% of their IT… Continue reading Automating Root Cause Analysis with AIOps
Category: Zebrium
Automating Root Cause Analysis with AIOps
We are excited to announce that ScienceLogic has excelled in the Forrester Wave for AIOps again! We are proud to be named a Strong Performer this year—receiving the highest marks possible in the product vision, execution roadmap, performance, and automation and remediation criterion. Our Chief Product Officer, Michael Nappi, spoke about this great news, our… Continue reading ScienceLogic Celebrates the End of a Stellar Year with Strong Performance in Forrester Wave
ScienceLogic Celebrates the End of a Stellar Year with Strong Performance in Forrester Wave
Our Vice President of Technical Alliances, Erik Rudin, speaks about the importance of partners, automation, and data. Three key takeaways from re:Invent from our perspective: Number one, data. It’s all about data. There’s so much data coming into the system. Amazon has created a lot of new services and processes to help with that. This… Continue reading AWS re:Invent 2022 Recap: What Happened In Vegas …
AWS re:Invent 2022 Recap: What Happened In Vegas …
We believe the future of monitoring, especially for platforms like Kubernetes, is truly autonomous. Cloud-native applications are increasingly distributed, evolving faster, and failing in new ways, making it harder to monitor, troubleshoot and resolve incidents. Traditional approaches such as dashboards, carefully tuned alert rules, and searches through logs are reactive and time intensive, hurting productivity,… Continue reading Anomaly Detection as a Foundation of Autonomous Monitoring
Anomaly Detection as a Foundation of Autonomous Monitoring
Datadog is one of the most popular observability platforms today and offers a rich set of capabilities including monitoring, tracing, log management, as well as machine learning (ML) features that help detect outliers. One of its most interesting feature sets falls under the Watchdog umbrella. Watchdog Root Cause Analysis Watchdog automatically detects outliers in metrics… Continue reading Zebrium RCaaS: A Natural Evolution From Datadog Watchdog Insights Log Anomaly Detection
Zebrium RCaaS: A Natural Evolution From Datadog Watchdog Insights Log Anomaly Detection
There’s a good reason Datadog is one of the most popular monitoring solutions available. The power of the platform is summed up in the tagline, “See inside any stack, any app, at any scale, anywhere” and explained in this chart: “Datadog brings together end-to-end traces, metrics, and logs to make your applications, infrastructure, and third-party… Continue reading Using Datadog For Observability? Speed up Troubleshooting with Zebrium
Using Datadog For Observability? Speed up Troubleshooting with Zebrium
Application monitoring is experiencing a sea change. You can feel it as vendors rush to include the phrase “root cause” in their marketing boilerplate. Common solutions enhance telemetry collection and streamline workflows, but that’s not enough anymore. Autonomous troubleshooting is becoming a critical (but largely absent) capability for meeting SLOs, while at the same time,… Continue reading Observability: It’s Time to Automate the Observer
Observability: It’s Time to Automate the Observer
If you are a New Relic user, you’re likely using New Relic to monitor your environment, detect problems, and troubleshoot them when they occur. But let’s consider exactly what that entails and describe a way to make this entire process much quicker. Imagine that the dashboards used to monitor your application suddenly show a “blip”.… Continue reading Using New Relic For Observability? Speed up Troubleshooting with Zebrium
Using New Relic For Observability? Speed up Troubleshooting with Zebrium
The Elastic Stack (often called ELK) is one of the most popular observability platforms in use today. It lets you collect metrics, traces, and logs and visualize them in one Kibana dashboard. You can set alerts for outliers, drill down into your dashboards and search through your logs. But there are limitations. What happens when… Continue reading Using the Elastic Stack (ELK) For Observability? Here’s How to Speed Up Troubleshooting
Using the Elastic Stack (ELK) For Observability? Here’s How to Speed Up Troubleshooting
The Cisco Technical Assistance Center (TAC) has over 11,000 engineers handling 2.2 million Service Requests (analogous to incidents or support cases) a year. Although 44% of them are resolved in one day or less, many take longer because they involve log analysis to determine the root cause. This not only impacts the time a case remains… Continue reading How Cisco uses Zebrium ML to Analyze Logs for Root Cause