Add chain ID attribute #228

lameiraatt · 2023-03-22T11:14:17Z

This OTEP proposes the addition of a chain ID attribute to spans and logs that supports data processing/analysis and can be implemented as part of an extension to the default OTel SDK.

Feedback welcome and appreciated! 😊

linux-foundation-easycla · 2023-03-22T11:14:22Z

The committers listed above are authorized under a signed CLA.

✅ login: lameiraatt / name: Ana Lameira (a110a84, 6502ea6, 74364d9)

cartermp · 2023-04-10T17:00:38Z

text/0228-chain-id.md

+3. Sum.
+
+Without chain ID, we'd have to first create the tree, using span ID and parent span ID information.
+When processing big data sets, with millions of data points, generated by thousands of processes, this can have a considerable performance impact.


> this can have a considerable performance impact.

I'd be curious about the impact based on some non-contrived examples. In my work we definitely have some big traces from some customers, but nothing to the point where this significantly taxed the tracing backend. Is this like a "google has this problem" kinda thing, or do others? It's clear that this can help performance for a tracing backend, but I'm just not clear if this is something that backends will benefit from broadly or only when working with google-sized data.

s/google/some-other-enormous-tech-system-that-requires-bespoke-tooling

"millions of data points" is not necessarily a Google-scale problem. I've seen smaller companies processing tracing volume comparable to hyper-scalers; the difference is that hyper-scalers just have to sample more.

And to me, the challenge with the processing mentioned here is not necessarily the data volume but the expressiveness of the query languages. Aside from TempoDB QL, I am not aware of any QL for traces that can express queries against the graph (you can do one-level parent/child in SQL; anything more becomes too cumbersome to express). The chain ID largely solves this issue by allowing queries against a tabular data set without first aggregating events into traces.

Thank you Yuri. @cartermp, see TraceQL Design Proposal for more details.
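To make the tabular-query point in this thread concrete, here is a minimal sketch that computes the same subtree sum two ways: by rebuilding the tree from span ID and parent span ID, and by filtering rows on a chain ID prefix. The dot-separated chain ID format, the `SPANS` rows, and both function names are assumptions made for illustration; the thread does not specify the encoding the OTEP uses.

```python
from collections import defaultdict

# Hypothetical span rows. The "chain_id" values use an ASSUMED
# dot-separated path format; the OTEP's real encoding may differ.
SPANS = [
    {"span_id": "a", "parent_span_id": None, "chain_id": "1", "duration_ms": 50},
    {"span_id": "b", "parent_span_id": "a", "chain_id": "1.1", "duration_ms": 20},
    {"span_id": "c", "parent_span_id": "a", "chain_id": "1.2", "duration_ms": 25},
    {"span_id": "d", "parent_span_id": "b", "chain_id": "1.1.1", "duration_ms": 5},
    {"span_id": "e", "parent_span_id": "b", "chain_id": "1.1.2", "duration_ms": 7},
]


def subtree_sum_via_tree(spans, root_span_id):
    """Rebuild the parent/child index first, then walk the subtree."""
    children = defaultdict(list)
    by_id = {}
    for span in spans:
        by_id[span["span_id"]] = span
        children[span["parent_span_id"]].append(span["span_id"])

    total = 0
    stack = [root_span_id]
    while stack:
        span_id = stack.pop()
        total += by_id[span_id]["duration_ms"]
        stack.extend(children[span_id])
    return total


def subtree_sum_via_chain_id(spans, root_chain_id):
    """Filter rows by chain ID prefix; no tree is built."""
    prefix = root_chain_id + "."
    return sum(
        span["duration_ms"]
        for span in spans
        if span["chain_id"] == root_chain_id
        or span["chain_id"].startswith(prefix)
    )
```

Under these assumptions, the second function is a plain row filter followed by a sum, which is the kind of query a tabular engine can express without a recursive join.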

bputt-e · 2023-04-21T22:29:12Z

Does this proposal help with linked spans? Let's say I have 500 traces that get 'merged' into a new trace. What would that new trace look like? Would it have any 'historical' knowledge from which to add chain info, or would it be considered a new trace that starts out with nothing?

joe-elliott · 2023-05-26T16:22:42Z

fwiw, Tempo achieves similar functionality by computing the nested set model for the tree upon storage. if this were adopted by OTEL it would make calculating the nested set quite a bit easier.
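As a rough illustration of why a path-like chain ID could simplify the nested set calculation, the sketch below derives left/right bounds directly from chain IDs without consulting parent span IDs. It is not Tempo's implementation; the dot-separated integer format and the function name `nested_set_from_chain_ids` are assumptions for this example, and every ancestor is assumed to be present in the input.

```python
def nested_set_from_chain_ids(chain_ids):
    """Return {chain_id: (left, right)} nested set bounds.

    ASSUMPTION: chain IDs are dot-separated integer paths such as "1.2.1",
    and every ancestor of a given chain ID is also present in the input.
    """
    # Sorting paths as integer tuples yields a pre-order traversal:
    # a parent sorts before its children, siblings sort by index.
    paths = sorted(tuple(int(part) for part in cid.split(".")) for cid in chain_ids)

    bounds = {}
    stack = []  # currently open ancestors, root first
    counter = 0
    for path in paths:
        # Close every open node that is not an ancestor of this path.
        while stack and path[: len(stack[-1])] != stack[-1]:
            closed = stack.pop()
            counter += 1
            bounds[closed][1] = counter
        counter += 1
        bounds[path] = [counter, None]
        stack.append(path)

    # Close whatever is still open once all paths are consumed.
    while stack:
        closed = stack.pop()
        counter += 1
        bounds[closed][1] = counter

    return {".".join(map(str, path)): tuple(b) for path, b in bounds.items()}
```

With these bounds, a span is a descendant of another when its left value falls strictly between the ancestor's left and right values.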

the UUID passed seems to be used to track the root process instance. imo it would be better to make this separate from the concept of the hierarchy. perhaps two different attributes that could be independently controlled, something like:

- `chain.hierarchy`
- `chain.root`

i'm also a bit concerned about traces with extreme depth creating enormous chain ids. perhaps a configuration option for max depth recorded would be worthwhile?
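One way to picture the two suggestions in this comment (separate root and hierarchy attributes, plus a depth cap) is the sketch below. The attribute names `chain.root` and `chain.hierarchy` come from the comment; the dot-separated value format, the `MAX_DEPTH` option, the truncation behavior, and both function names are hypothetical and are not part of the OTEP or any OTel SDK.

```python
import uuid

MAX_DEPTH = 3  # hypothetical configuration option, not an OTel setting


def root_attributes():
    """Attributes for a root span: a fresh root UUID and hierarchy "1"."""
    return {"chain.root": str(uuid.uuid4()), "chain.hierarchy": "1"}


def child_attributes(parent, child_index, max_depth=MAX_DEPTH):
    """Derive a child's attributes from its parent's.

    ASSUMPTION: the hierarchy is a dot-separated path. Once max_depth
    levels are recorded, deeper spans inherit the parent's hierarchy
    unchanged, which bounds the attribute size at the cost of
    collapsing deep descendants onto their ancestor at the cap.
    """
    levels = parent["chain.hierarchy"].split(".")
    if len(levels) < max_depth:
        levels.append(str(child_index))
    return {
        "chain.root": parent["chain.root"],
        "chain.hierarchy": ".".join(levels),
    }
```

Keeping the root identifier in its own attribute means the hierarchy value can be truncated or disabled without losing the ability to group spans by root process instance.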

tedsuo · 2023-07-31T16:24:05Z

@lameiraatt we are cleaning up stale OTEP PRs. If there is no further action at this time, we will close this PR in one week. Feel free to open it again when it is time to pick it back up.

lameiraatt requested a review from a team March 22, 2023 11:14

lameiraatt changed the title ~~OTEP: Add chain ID attribute~~ Add chain ID attribute Mar 27, 2023

lameiraatt added 3 commits March 27, 2023 16:57

Add chain ID attribute

3a8aae9

Update OTEP file name

56f3914

Fix md errors

ad20750

lameiraatt force-pushed the chain-id branch from 74364d9 to ad20750 March 27, 2023 15:57

carlosalberto added the priority:p2 and triaged labels Apr 10, 2023

cartermp reviewed Apr 10, 2023

tedsuo added the stale label ("This issue or PR is stale and will be closed soon unless it is resurrected by the author.") Jul 31, 2023

yurishkuro Apr 10, 2023

electron0zero Apr 13, 2023