We believe the future of monitoring, especially for platforms like Kubernetes, is truly autonomous. Cloud-native applications are increasingly distributed, evolving faster, and failing in new ways, making it harder to monitor, troubleshoot and resolve incidents. Traditional approaches such as dashboards, carefully tuned alert rules, and searches through logs are reactive and time-intensive, hurting productivity,…
Category: Zebrium
Anomaly Detection as a Foundation of Autonomous Monitoring
This project is a favorite of mine and so I wanted to share a glimpse of what we’ve been up to with OpenAI’s amazing GPT-3 language model. Today I’ll be sharing a couple of straightforward results. There are more advanced avenues we’re exploring for our use of GPT-3, such as fine-tuning (custom pre-training for specific…
Using GPT-3 for plain language incident root cause from logs
The past three months have seen Zebrium reach several major milestones! We moved from beta to production and our platform is now in use by industry-leading customers who rely on Zebrium to keep their production applications running. We were named in the Forbes AI50 list as one of “America’s Most Promising Artificial Intelligence Companies”.…
Zebrium Named a 2020 Gartner Cool Vendor
We often get asked how Zebrium ZELK Stack machine learning (ML) compares to native ML for Elasticsearch. The easiest way to answer this is to see the two technologies side by side. This short (3-minute) video demonstrates what each solution is able to uncover from the exact same log data. No manual training, rules, or…
ZELK vs ELK: Zebrium vs Elastic Machine Learning
Disclosure – I work for Zebrium. Part of our product does what most log managers do: aggregates logs, makes them searchable, allows filtering, provides easy navigation, and lets you build alert rules. So why write this blog? Because in today’s cloud native world (microservices, Kubernetes, distributed apps, rapid deployment, testing in production, etc.) while useful,…
Is Log Management Still the Best Approach?