Blog

Category: Dogfooding

Dogfooding   Operations   Software Engineering  

Never Alone On Call

Does your organization have an on-call rotation? Several members of the Honeycomb engineering team recently hosted a live webcast about why they never feel alone...

Debugging   Dogfooding  

All Together Now: Better Debugging With Multiple Visualizations

"Nines don't matter when users aren't happy" is something you may have heard a time or two from folks here at Honeycomb. We often emphasize...

Dogfooding   Logging   Operations  

Understand Your AWS Cost & Usage with Honeycomb

AWS bills are notoriously complicated, and the Amazon Cost Explorer doesn’t always make it easy to understand exactly where your money is going. When we...

Debugging   Dogfooding   Operations  

Treading in Haunted Graveyards

Part 1: CI/CD for Infrastructure as Code At Honeycomb, we've often discussed the value of making software deployments early and often, and being able to...

Dogfooding   Operations  

Incident Review: You Can't Deploy Binaries That Don't Exist

Between 22:50 and 22:54 UTC on July 9, our capacity to accept traffic to api.honeycomb.io gradually diminished until all incoming requests started to fail. 8...

Dogfooding   Operations  

Automating Collection of Troubleshooting Data with Triggers: a How-To Guide

Everyone wants to be more efficient -- to spend less time on the tedious things, and more time on the things that move the needle....

Databases   Debugging   Dogfooding  

Stop Your Database From Hating You With This One Weird Trick

Let's not bury the lede here: we use Observability-Driven Development at Honeycomb to identify and prevent DB load issues. Like every online service, we experience...

Dogfooding   Software Engineering  

Anatomy of a Cascading Failure

In Caches Are Good, Except When They Are Bad, we identified four separate problems that combined together to cause a cascading failure in our API...

Dogfooding   Software Engineering   Tracing  

When In Doubt, Add More Spans: A Tale of Tracing and Testing In Production

Recently, Toshok was telling a story about the kind of thing he talks about a lot—improving the performance of some endpoint or page or other....

Dogfooding   Software Engineering  

Incident Review: Caches are Good, Except When They Are Bad

Between Wednesday, April 17th and Friday, April 26th, Honeycomb had four separate periods of downtime affecting the Honeycomb API, resulting in approximately 38 minutes of...

Debugging   Dogfooding   Operations   Software Engineering  

A New Bee's First Oncall

I'm Honeycomb's newest engineer, now on my eighth week at Honeycomb. Excitingly, I did my first week of oncall two weeks ago! Almost every engineer...

Dogfooding   Observability   Tracing  

Tracing and Observability for Background Jobs

Illuminating the under-loved with Honeycomb Most modern web apps end up sprouting some subset of tasks that happen in the “background”, i.e., when a user...

Dogfooding  

Support Your Customers More Effectively with Honeycomb

Customer success can be a serious differentiator and competitive advantage for companies today. Everyone wants to ship quality products to their customers faster, and the...

Dogfooding   Observability   Software Engineering  

Heatmaps Make Ops Better

In this blog miniseries, I'd like to talk about how to think about doing data analysis "the Honeycomb way."  Welcome to part 1, where I...

Dogfooding   Observability   Operations  

Postmortem: RDS Clogs & Cache-Refresh Crash Loops

On Thursday, October 4, we experienced a partial API outage from 21:02-21:56 UTC (14:02-14:56 PDT). Despite some remediation work, we saw a similar (though less...

1 2 3 4