The CoPE and Other Teams, Part 1: Introduction & Auto-Instrumentation
The CoPE is made to affect, meaning change, how things work. The disruption it produces is a feature, not a bug. That disruption pushes things...
Top Posts
-
Software Engineering
Dogfooding
Destroy on Friday: The Big Day 🧨 A Chaos Engineering Experiment - Part 2
-
Teams & Collaboration
Software Engineering
Culture
What Makes for a 'Good' Pair Programming Session?
-
Software Engineering
Dogfooding
Deploy on Friday? How About Destroy on Friday! A Chaos Engineering Experiment - Part 1
-
Tracing
Logging
Customer Stories
Confidently Shifting from Logs-Centric to a Unified Trace-First Approach: Ritchie Bros. Journey to Modern Observability
Staffing Up Your CoPE
Getting the right people working in the CoPE is crucial to success because these change agents must limber up the organization and promote the flexibility...
Why Every Engineering Team Should Embrace AWS Graviton4
Two years ago, we shared our experiences with adopting AWS Graviton3 and our enthusiasm for the future of AWS Graviton and Arm. Once again, we're...
Modern Observability in Action at the University of Oxford
The Bennett Institute for Applied Data Science at the University of Oxford is pioneering the better use of data, evidence, and digital tools in healthcare,...
The Hater’s Guide to Dealing with Generative AI
Generative AI is having a bit of a moment—well, maybe more than just a bit. It’s an exciting time to be alive for a lot...
Unlocking Smiles: HappyCo's Observability Success
With a diverse range of applications, HappyCo sought to advance their system investigations with a modern observability solution while embarking on an application refactor project....
Navigating Software Engineering Complexity With Observability
In the not-too-distant past, building software was relatively straightforward. The simplicity of LAMP stacks, Rails, and other well-defined web frameworks provided a stable foundation. Issues...
OpenTelemetry Best Practices #3: Data Prep and Cleansing
Having telemetry is all well and good—amazing, in fact. It’s easy to do: add some OpenTelemetry auto-instrumentation libraries to your stack and they’ll fill your...
Investigating Mysterious Kafka Broker I/O When Using Confluent Tiered Storage
Earlier this year, we upgraded from Confluent Platform 7.0.10 to 7.6.0. While the upgrade went smoothly, there was one thing that was different from previous...
Independent, Involved, Informed, and Informative: The Characteristics of a CoPE
In part one of our CoPE series, we analogized the CoPE with safety departments. David Woods says that those safety departments must be: independent, involved,...
Virtualizing Our Storage Engine
Our storage engine, affectionately known as Retriever, has served us faithfully since the earliest days of Honeycomb. It’s a tool that writes data to disk...
Announcing Honeycomb Support Business Hours in Europe
Earlier this year, Honeycomb announced the launch of data residency in Europe. To meet the growing needs of our customers in the region, we are...
Establishing and Enabling a Center of Production Excellence
Software is in a crisis. This is nothing new. Complex distributed systems are perpetually in a state far from equilibrium, operating in what Richard Cook...