Featured Conference Talk
A vendor-neutral day dedicated to the observability community and the unique challenges and benefits of the practice, regardless of specific tool choices. Through sessions led by the community’s most advanced practitioners, you’ll learn about the cutting-edge capabilities and unique outcomes that define observability.
The Business Value of Observability
This panel moves beyond the technical aspects of observabilty to examine the business case for adopting this new way of working. Observability can help teams lower their MTTR, lower change failure rates, and speed up time to delivery in production. How can you make a case to business stakeholders to prioritize observability adoption initiatives? Which outcomes should you expect and what practical hurdles will you hit along the way?
Cloud-Native Observability: Basics to Advanced Forensics
This panel addresses fundamental observability concepts, including cloud-native observability, common misconceptions, technical challenges, and industry standards like OpenTelemetry that make it possible to incorporate vendor neutral solutions.
Landing Observability in the Enterprise
John describes the key challenges teams need to overcome, and provide both specific and actionable advice to overcome those challenges. By listening to this talk. you'll learn techniques that empower you to assess whether the enterprise environment you're in is ready for observability and, if so, how to make the strongest possible case for observability to the right people.
Stepping Our Way Into Resilient Services
Is it possible to discover unknown unknowns proactively with Chaos Engineering? Where exactly is the intersection between intentionally breaking production services and discovering the multitude of ways they could be broken with observability? This is a short presentation that leaves plenty of time to have a real-time discussion with George Miranda, take audience questions, and explore practical steps you can take with your teams as you step your way toward improving service resilience.
The Future of Observability
In 2018, Peter Alvaro joined us at o11ycon to deliver a visionary and thought-provoking talk around the problems we had yet to solve. In retrospect, he ended up painting a picture of the problems that ultimately defined a lot of the work that was done in the observability space for the next three years. In this session, Charity and Christine join Peter for a chat that digs into where observability might be headed for the next three years.
Nora Jones and Charity Majors will share their experiences leading major movements shaping the future of shipping software. Nora Jones is CEO of Jeli, and former engineer at Netflix and Slack will share her research and experience with Chaos Engineering, human factors, and site reliability. Charity Majors is Honeycomb's CTO and co-founder, who pioneered Observability as a software practice for modern teams.
Conditional Distributed Tracing
Distributed tracing is generally a binary affair—it's off or on. Either a trace is sampled or, according to a flag, it's not. Span placement is also assumed to be an "always-on" system where spans are always added if the trace is active. For general availability and service-level objectives, this is usually good enough. But when we encounter problems, we need more. In this talk, I'll show you how to "turn up the dial" with detailed diagnostic spans and span events that are inserted using dynamic conditions.
Driving Observability Adoption Forward as an Internal Champion... During a Pandemic
Josh covers how they planned, strategized, and organized their observability tooling evaluation. Josh also shares how the COVID-19 pandemic impacted that evaluation. Josh covers the engineering outcomes they were aiming for, given Amperity's engineering structure, and the areas of focus that ultimately helped set us up for success.
Killing Your Staging Environment
“But it worked in staging!” is the new “But it works in my machine?” Docker, Kubernetes, and friends have made it easier to create similar environments, but it is still impossible to create environments that are the same as production. So a few years ago, we killed our staging environment and tested *everything* either locally on a developer’s machine or in production. Erwin goes over our lessons learned and what is needed to be able to successfully kill your staging environment. SPOILER WARNING: One of those things is proper observability.
Modernizing Availability Thinking at Babylon
Babylon has a vision to bring affordable and accessible healthcare to the world. A key part of that vision is having a highly-available, global platform for healthcare delivery that is a delight to use for both customers and clinicians. In this talk, Tom will cover their ongoing journey to adopt an alternative (or complementary) SLI & SLO-based approach, the advantages of that approach, what they've learned along the way, and what's left to do.
Testing is Not the Goal
In this talk, Rob introduces strategies to deal effectively with both the risks we can imagine and the realities we cannot. It explores deployments as the beginning of our journey, not the end. Rob shares the techniques and models they developed that enable teams to move quickly without compromising quality. Attend this talk to start unlearning old ideas about testing, development, and operations. A team-wide focus on testability and operability is fundamental to happy, high-performance teams, a successful business, and satisfied customers.
Survival Guide: What I Learned From Putting 200 Developers on Call
We say we want to live in a world where the development team who writes the code also owns that code’s success-or failure-in production. To that end, nothing incentivizes a team to ship better quality software than getting paged about production issues at 2am. But how can that actually be done? In this talk, you’ll learn tips and tricks for easing development teams into on-call rotations, how SRE or Ops facilitates the transition to production code ownership, and why observability was critical to our success when putting developers on call.
OpenTelemetry, Not Just for Production Troubleshooting
In this talk, Michael will show you how to leverage OpenTelemetry to gather and analyze data from production and how to use it in pre-production workflows. Using OTel in pre-production can help developers troubleshoot microservices, validate their changes, and prevent issues before they get to production. By attending this talk, you'll learn how to use OpenTelemetry data to become proactive rather than reactive.
Observability is More Fun With Friends: Stories From OpenTelemetry Collaboration
The modern open source landscape is hard enough, given the (sometimes) conflicting interests of commercial partners, end-users, and project maintainers. It takes a real, intentional effort to build collaborative relationships across these groups in order to make improvements to projects. In this panel, we'll share stories about what's worked from our involvement in OpenTelemetry as maintainers, community representatives, and end-users.
Eyes Wide Open: o11y From Theory to Practice
Peter covers his team's journey including how they did the early integration, included interactions with existing tooling that was available in their organization, and go through several ""aha"" moments they encountered when they've had instrumentation in place that has proven valuable in uncovering problems both in our deployed environment and in their test suite.