Tracing tools like [[Jaeger]] and [[AWS X-Ray]] lets you trace requests that travel through multiple systems. It is useful for [[Observability]] and can reduce [[Mean time to repair]] by quickly showing where slowdowns occur. Faster to pinpoint sources of latency.
https://logz.io/blog/distributed-tracing-dapper-jaeger/
> Using distributed tracing you can trace the path of a request as it travels across a complex system.
> While there were some distributed tracing solutions before, Google Dapper (2010) design paper is a, or the, cornerstone of distributed tracing. This paper explains how Google developed a production-grade tracing tool, with three key goals behind it.
Open Source Solutions
Zipkin (Twitter)
Jaeger (Uber)
AppDash
Skywalking
Enterprise Solutions
Amazon X-Ray
Google Cloud Trace
[[Datadog]]
Lightstep
New Relic
Most mental models for tracing descend from Google’s Dapper paper. [[OpenTracing]] uses similar nouns and verbs ([[OpenTelemetry]]).
> Distributed tracing, also called distributed request tracing, is a method used to profile and monitor applications, especially those built using a microservices architecture. Distributed tracing helps pinpoint where failures occur and what causes poor performance.