The New Rules of Sampling

By: Rachel Perkins | May 9th, 2019

Metrics Sampling

2 Min. Read

One of the most common questions we get at Honeycomb is about how to control costs while still achieving the level of observability needed to debug, troubleshoot, and understand what is happening in production. Historically, the answer from most vendors has been to aggregate your data–to offer you calculated medians, means, and averages rather than the deep context you gain from having access to the actual events coming from your production environment.

This is exactly what it sounds like–a poor tradeoff for performance. With classic metrics and APM tools, you can never again get back to the raw event source of truth, which means you’ll regret that choice when debugging a complex, distributed system. When you’re working with metrics, the data must be numeric, and any other type of data must be stored as metadata either attached to the datapoints themselves or out-of-band in some way (“tags”, “dimensions”, etc), AKA: more limits on what you can store and retrieve.

Honeycomb’s answer is: Sample your data.

But, you say, sampling means I’m throwing away some (or a lot) of my data. How is that OK? I won’t know what I am not seeing, right?

What if you had more flexibility? What if sampling offered a greater breadth of options than just “send a percentage of my data”?

Find out what’s possible in The New Rules of Sampling.

Don’t forget to share!

Rachel Perkins

Charity Majors | Apr 23, 2025

How Much Should I Be Spending On Observability?

In last week’s piece, we talked about some of the factors that are driving costs up, both good and bad, and about whether your observability bill is (or should be) more of a cost center or an investment. In this piece, I’m going to talk more in depth about cost drivers and levers of control.

Observability Sampling

Irving Popovetsky | Apr 21, 2025

Data Strategy for SREs and Observability Teams

The idea that telemetry data needs to be managed, or needs a strategy, draws a lot of inspiration from the data world (as in, BI and Data Engineering). Your company most likely has a data team that manages the data warehouse(s), data pipelines, data sources, and reporting tools. These teams are also constantly balancing costs with their user and stakeholder needs, usability, data retention, granularity, etc. Sound familiar? That’s because if you’re working on observability data, these teams are at least several years ahead of you in addressing these tradeoffs and considerations—and can teach us quite a lot.

Observability Sampling Software Engineering

Tyler Helmuth | Jan 22, 2025

Tracing Refinery

We recently released Refinery 2.9, which came with great performance improvements. Reading through the release notes, I felt the need to write a piece on this improvement, as it's quite important but easy to overlook: collect loop taking too long. This is the story of how we used distributed tracing to find the slowdown in this loop.

Sampling Tracing

All-in-one Observability

Why Honeycomb

Looking for something?

Our mission

The New Rules of Sampling

Rachel Perkins

Related posts

How Much Should I Be Spending On Observability?

Data Strategy for SREs and Observability Teams

Tracing Refinery

Ready to get started?