Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Elaborate, document and propose remedies for "the 10k span problem" #80

Open
codefromthecrypt opened this issue Jul 11, 2017 · 3 comments

Comments

@codefromthecrypt
Copy link
Member

Traces that have orders of thousands of spans can be problematic. They can choke the UI (not just ours) and increase the operating costs of a tracing system. There are a number of scenarios which can result in "the 10k span problem", such as broadcast messaging to boundless consumers or buggy traced loops. Some workarounds are easier than others. For example, dropping local spans reported is easier than trying to coordinate message consumers to have them drop.

This issue should clarify the major scenarios, known workarounds and remedies. Hopefully, it can result in at least documentation, and in ideal case in coding practice that defends against this

Here are some breadcrumbs:

@ImFlog
Copy link

ImFlog commented Jul 11, 2017

One of the solution discussed could be to drop spans. The question is where to do It ?

  • on the Zipkin server side (with a rule system for example), this could fix the UI issue
  • on the collector side, this reduces the load but also introduces complexity : How do we know this is a long trace ?

@codefromthecrypt
Copy link
Member Author

codefromthecrypt commented Jul 11, 2017 via email

@codefromthecrypt
Copy link
Member Author

codefromthecrypt commented Jul 11, 2017 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants