Finding the Golden Signals with Prometheus
I was honored to be selected to speak at All Things Open 2020. I wanted to tell the story of architecting Fitbit’s Prometheus and Thanos solution for metrics and alerting. Including the many things I learned and that I think are important to consider as a company scales out their observability platform. Oddly enough, some of this also applies to handling logs and events at scale too. The talk was just uploaded to YouTube.
The slides I used for the ATO version are available here as well. I’m very open to comments and questions so please feel free to post in the discussion below!