Finding the Golden Signals with Prometheus

I was honored to be selected to speak at All Things Open 2020. I wanted to tell the story of architecting Fitbit’s Prometheus and Thanos solution for metrics and alerting, including the many things I learned that I think are important to consider as a company scales out its observability platform. Oddly enough, some of this also applies to handling logs and events at scale. The talk was just uploaded to YouTube.

I also gave this same presentation at the 2020 Devops Experience Virtual Summit. Unfortunately, that version of the video cannot be embedded, but it can be found here.

The slides I used for the ATO version are available here as well. I’m very open to comments and questions, so please feel free to post in the discussion below!

PDF Presentation

