Operations   Dogfooding  

Automating Collection of Troubleshooting Data with Triggers: a How-To Guide

Everyone wants to be more efficient -- to spend less time on the tedious things, and more time on the things that move the needle....

Product Updates   Monitoring  

Welcome (to) Home

Our latest product update features an intuitive home (landing) page that orients users with a quick, real-time view into what's happening right now in your...

Product Updates  

Honeycomb’s New APM Capabilities Give Engineering & DevOps Superior Production Insights and Faster Incident Resolution

PRESS RELEASE: Read on PRNewswire. Modern Dev and SRE teams gain efficiencies for proactive and collaborative debugging as software updates deploy. SAN FRANCISCO, May 29,...

Software Engineering   Sampling   Instrumentation  

Dynamic Sampling by Example

Last week, Rachel published a guide describing the advantages of dynamic sampling. In it, we discussed varying sample rates to achieve a target collection rate...

Dogfooding   Debugging   Databases  

Stop Your Database From Hating You With This One Weird Trick

Let's not bury the lede here: we use Observability-Driven Development at Honeycomb to identify and prevent DB load issues. Like every online service, we experience...

Sampling   Metrics  

The New Rules of Sampling

One of the most common questions we get at Honeycomb is about how to control costs while still achieving the level of observability needed to...

Software Engineering   Dogfooding  

Anatomy of a Cascading Failure

In Caches Are Good, Except When They Are Bad, we identified four separate problems that combined together to cause a cascading failure in our API...

Tracing   Software Engineering   Dogfooding  

When In Doubt, Add More Spans: A Tale of Tracing and Testing In Production

Recently, Toshok was telling a story about the kind of thing he talks about a lot—improving the performance of some endpoint or page or other....

Software Engineering   Dogfooding  

Incident Review: Caches are Good, Except When They Are Bad

Between Wednesday, April 17th and Friday, April 26th, Honeycomb had four separate periods of downtime affecting the Honeycomb API, resulting in approximately 38 minutes of...

Sampling   Metrics   Events  

Metrics vs Events: A Conversation About Controlling Volume

If I'm used to metrics, how should I think about events in Honeycomb? This question cuts to the heart of how Honeycomb is different from...

Software Engineering   Operations   Dogfooding   Debugging  

A New Bee's First Oncall

I'm Honeycomb's newest engineer, now on my eighth week at Honeycomb. Excitingly, I did my first week of oncall two weeks ago! Almost every engineer...

Tracing   Observability   Dogfooding  

Tracing and Observability for Background Jobs

Illuminating the under-loved with Honeycomb. Most modern web apps end up sprouting some subset of tasks that happen in the “background”, i.e., when a user...
