Add SDK span processors spec by pavolloffay · Pull Request #205 · open-telemetry/opentelemetry-specification

pavolloffay · 2019-08-07T14:19:24Z

Resolves #155
Resolves #54

A processor that blocks when the queue is full will be added in a separate PR.

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

reyang · 2019-08-07T15:36:11Z

specification/sdk-span-processors.md

@@ -0,0 +1,61 @@
+# Span processor


How to register a Span processor?
Do we allow multiple processors? If so, is there any guarantee on the ordering?

I think we could guarantee the order. I have added this:

> Span processors can be registered via a method on SDK Tracer. The registered processors are invoked in the same order as they were registered.

Thanks @pavolloffay for answering the 2nd question.
Regarding the 1st question, "How to register a Span processor": do you intend to cover it in a separate PR for the Tracer API change? Do we expect both register and unregister, or do we only allow registration during Tracer initialization (in the constructor)?

Currently I have implemented this with only Register, which can be called at any moment.

I have already documented the registration step. Here is a copy, but please comment on the code if something should be fixed.

> Span processors can be registered directly on SDK Tracer. The processors are invoked in the same order as they were registered.
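The ordering guarantee described above can be illustrated with a minimal sketch. All names here (`Tracer`, `register_span_processor`, `RecordingProcessor`, the dict used as a span) are hypothetical and are not taken from the spec or any SDK; the sketch assumes register-only semantics, as mentioned in the thread.

```python
class RecordingProcessor:
    """Hypothetical processor that records each hook invocation."""

    def __init__(self, name, log):
        self.name = name
        self.log = log

    def on_start(self, span):
        self.log.append((self.name, "start", span["name"]))

    def on_end(self, span):
        self.log.append((self.name, "end", span["name"]))


class Tracer:
    """Hypothetical SDK tracer: processors can be registered at any time,
    but never unregistered."""

    def __init__(self):
        self._processors = []

    def register_span_processor(self, processor):
        self._processors.append(processor)

    def start_span(self, name):
        span = {"name": name}
        # Invoke processors in registration order.
        for processor in self._processors:
            processor.on_start(span)
        return span

    def end_span(self, span):
        for processor in self._processors:
            processor.on_end(span)


log = []
tracer = Tracer()
tracer.register_span_processor(RecordingProcessor("first", log))
tracer.register_span_processor(RecordingProcessor("second", log))
tracer.end_span(tracer.start_span("request"))
```

Keeping the processors in a plain list is what gives the ordering guarantee; a set or map keyed by processor would not.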

reyang · 2019-08-07T15:40:18Z

specification/sdk-span-processors.md

+
+* `exporter` - the exporter where the spans are pushed.
+* `maxQueueSize` - the maximum size of the queue. After the size is reached spans are dropped.
+* `scheduledDelayMilllis` - the delay interval between two consecutive exports.


Another option is exportInterval, with actualDelayMilllis = max(0, exportInterval - timeTakenToExport).
I would like to discuss this here and see which one is preferred.

One scenario is "I want to export traces/logs/metrics every 5 seconds" and "I don't expect to export data every 7 seconds due to the fact that export operation itself is using 2 seconds in environment XYZ".

> One scenario is "I want to export traces/logs/metrics every 5 seconds" and "I don't expect to export data every 7 seconds due to the fact that export operation itself is using 2 seconds in environment XYZ".

Export should not be called concurrently; this is from the exporter spec:

> Export() will never be called concurrently for the same exporter instance. Export() can be called again only after the current call returns.
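The difference between the two options can be worked through with the numbers from the scenario above. This is only an arithmetic sketch; the function names are made up for illustration, and it assumes the next export is never started before the current one returns.

```python
def delay_with_scheduled_delay(scheduled_delay_ms, time_taken_ms):
    """Fixed delay between the end of one export and the start of the next."""
    return scheduled_delay_ms


def delay_with_export_interval(export_interval_ms, time_taken_ms):
    """Delay shortened by the time the export took, never negative."""
    return max(0, export_interval_ms - time_taken_ms)


def period_ms(delay_ms, time_taken_ms):
    """Time from the start of one export to the start of the next."""
    return time_taken_ms + delay_ms


# Export takes 2 s, configured value is 5 s.
fixed_period = period_ms(delay_with_scheduled_delay(5000, 2000), 2000)
interval_period = period_ms(delay_with_export_interval(5000, 2000), 2000)

# Export takes longer than the interval: the next export starts
# immediately after the current one returns.
slow_period = period_ms(delay_with_export_interval(5000, 6000), 6000)
```

With a fixed delay the effective period grows with the export duration (7 s here), whereas the interval variant keeps a 5 s period as long as the export finishes within the interval.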

mayurkale22 · 2019-08-07T21:17:38Z

specification/sdk-span-processors.md

+
+**Configurable parameters:**
+
+* `exporter` - the exporter where the spans are pushed.


How to register multiple exporters?

At least in Java there is MultiSpanExporter exporter implementation which exports data to multiple exporters.

@bogdandrutu shall we add MultiSpanExporter to the spec? I think it's an implementation detail.

I agree, let's not be too restrictive about the implementation.
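For readers unfamiliar with the idea, a fan-out exporter of the kind mentioned above might look roughly like this. It is a hypothetical Python sketch, not the Java `MultiSpanExporter`; the `export(spans)` method name and `ListExporter` are assumptions made for illustration.

```python
class ListExporter:
    """Hypothetical exporter that keeps exported spans in memory."""

    def __init__(self):
        self.exported = []

    def export(self, spans):
        self.exported.extend(spans)


class MultiSpanExporter:
    """Hypothetical composite exporter that forwards each batch to
    every wrapped exporter, in the order they were given."""

    def __init__(self, exporters):
        self._exporters = list(exporters)

    def export(self, spans):
        batch = list(spans)  # materialize once so every exporter sees all spans
        for exporter in self._exporters:
            exporter.export(batch)


first, second = ListExporter(), ListExporter()
multi = MultiSpanExporter([first, second])
multi.export(["span-a", "span-b"])
```

Because the composite exposes the same interface as a single exporter, a processor configured with one `exporter` parameter does not need to know how many destinations exist.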

mayurkale22 · 2019-08-07T21:26:20Z

specification/sdk-span-processors.md

+
+## Interface definition
+
+### OnStart(SpanData)


Why SpanData?

The main point is to make clear that the passed object exposes the data of the Span.

tracing-api doc references it https://github.com/open-telemetry/opentelemetry-specification/blob/master/specification/api-tracing.md#spandata. It's a readable span object.

The span processors are invoked only on sampled spans.

Although this is what the exporters defined by the SDK will do, I don't feel it is really a requirement - after all, sampling exists as a hint (as of now).

mayurkale22 · 2019-08-07T21:30:33Z

specification/sdk-span-processors.md

+**Configurable parameters:**
+
+* `exporter` - the exporter where the spans are pushed.
+* `maxQueueSize` - the maximum size of the queue. After the size is reached spans are dropped.


Do we have any default values for these params?

There are defaults in Java https://github.com/open-telemetry/opentelemetry-java/blob/master/sdk/src/main/java/io/opentelemetry/sdk/trace/export/BatchSampledSpansProcessor.java#L92. I think we could use the same across languages.

Does it make sense to add those here?

That seems like the wrong level of abstraction for the spec. Most other defaults and constants aren't documented here unless they're defined in other places, e.g. the W3C header length limits.


c24t · 2019-08-07T21:28:51Z

specification/sdk-span-processors.md

+
+### Simple processor
+
+The implementation of `SpanProcessor` that passes ended span directly to the configured `SpanExporter`.


What's the relationship between processors and exporters? It seems like we're using processors to (1) expose span start and end hooks to vendors and (2) batch spans to send to the exporter. Why use the same component for both?

If processors can't modify the spans that they pass to the exporters, why not call the processors and exporters in parallel?

> If processors can't modify the spans that they pass to the exporters, why not call the processors and exporters in parallel?

From the exporter spec:

> Allow implementing helpers as composable components that use the same chainable Exporter interface. SDK authors are encouraged to implement common functionality such as queuing, batching, tagging, etc. as helpers. This functionality will be applicable regardless of what protocol exporter is used.

The processor can be considered a helper for the exporter. Perhaps we should document this in the first section. The exporter should only export, not batch.

In my mind the model was this:

SDK -> SpanProcessor (an implementation that batches all Spans) -> SpanExporter(List<SpanData/SpanProto>).

Exporting via a SpanExporter is one of the functionalities that we can implement with the SpanProcessor, but, for example, some implementations may provide a SpanProcessor that adds extra attributes onStart.

@bogdandrutu Maybe it would be worth adding a note here, to mention that (custom) SpanProcessors can actually modify the received Span?
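The two roles discussed in this thread (a hook that enriches spans, and a helper that hands ended spans to an exporter) can be sketched side by side. Everything below is hypothetical: the `Span` class, the `thread.id` attribute key, and the assumption that processors receive a mutable span, which the thread had not settled.

```python
import threading


class Span:
    """Hypothetical mutable SDK span."""

    def __init__(self, name):
        self.name = name
        self.attributes = {}


class InMemoryExporter:
    """Hypothetical exporter that only stores what it receives."""

    def __init__(self):
        self.exported = []

    def export(self, spans):
        self.exported.extend(spans)


class ThreadIdProcessor:
    """Adds an extra attribute when the span starts."""

    def on_start(self, span):
        span.attributes["thread.id"] = threading.get_ident()

    def on_end(self, span):
        pass


class SimpleSpanProcessor:
    """Passes each ended span directly to the configured exporter."""

    def __init__(self, exporter):
        self._exporter = exporter

    def on_start(self, span):
        pass

    def on_end(self, span):
        self._exporter.export([span])


exporter = InMemoryExporter()
processors = [ThreadIdProcessor(), SimpleSpanProcessor(exporter)]

span = Span("request")
for processor in processors:
    processor.on_start(span)
for processor in processors:
    processor.on_end(span)
```

In this sketch the exporter neither batches nor enriches; it only receives the list of spans the processor hands it, which matches the SDK -> SpanProcessor -> SpanExporter model described above.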


c24t · 2019-08-07T21:40:39Z

specification/sdk-span-processors.md

+The implementation of the `SpanProcessor` that batches ended spans and pushes them to the configured `SpanExporter`.
+
+First the spans are added to a synchronized queue, then exported to the exporter pipeline in batches.
+When the queue gets half full a preemptive notification is sent to the worker thread to wake up and start a new export cycle.


Does this mean we export faster than the scheduled interval when the queue is half full?

In any case this sounds to me like documentation for the java implementation. What about something like "The implementation is responsible for managing the span queue and sending batches of spans to the exporters."?

Yes, this text is originally from javadoc. We can make it more generic and give the implementors more freedom how the queue is managed and when it is flushed.
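One possible reading of the behaviour under discussion (a bounded queue, dropping when full, a worker that wakes on a timer or when the queue is half full) is sketched below. This is not the Java implementation; the class layout, parameter names in snake case, and the `dropped` counter are assumptions for illustration.

```python
import threading
from collections import deque


class ListExporter:
    """Hypothetical exporter that keeps exported spans in memory."""

    def __init__(self):
        self.exported = []

    def export(self, spans):
        self.exported.extend(spans)


class BatchSpanProcessor:
    """Hypothetical batching processor with a single worker thread."""

    def __init__(self, exporter, max_queue_size=2048, scheduled_delay_s=5.0):
        self._exporter = exporter
        self._max_queue_size = max_queue_size
        self._delay = scheduled_delay_s
        self._queue = deque()
        self._cond = threading.Condition()
        self._stopped = False
        self.dropped = 0
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def on_start(self, span):
        pass

    def on_end(self, span):
        with self._cond:
            if len(self._queue) >= self._max_queue_size:
                self.dropped += 1  # queue is full: drop the span
                return
            self._queue.append(span)
            if len(self._queue) * 2 >= self._max_queue_size:
                self._cond.notify()  # half full: wake the worker early

    def _run(self):
        while True:
            with self._cond:
                half_full = len(self._queue) * 2 >= self._max_queue_size
                if not self._stopped and not half_full:
                    self._cond.wait(timeout=self._delay)
                batch = list(self._queue)
                self._queue.clear()
                stopped = self._stopped
            if batch:
                # Only this thread exports, so export() is never concurrent.
                self._exporter.export(batch)
            if stopped:
                return

    def shutdown(self):
        with self._cond:
            self._stopped = True
            self._cond.notify()
        self._worker.join()


exporter = ListExporter()
processor = BatchSpanProcessor(exporter, max_queue_size=4, scheduled_delay_s=60.0)
for i in range(10):
    processor.on_end(f"span-{i}")
processor.shutdown()
```

How many of the ten spans are exported versus dropped depends on thread scheduling; the sketch only guarantees that every span is either exported or counted as dropped. A more generic spec text would leave choices such as the half-full wake-up to implementors.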

c24t · 2019-08-07T21:41:24Z

specification/sdk-span-processors.md

+
+**Parameters:**
+
+* `SpanData` - a readable span object.


I don't know if it's kosher to change the spec to suit pending RFCs, but open-telemetry/oteps#8 suggests this might have to be changed to "span" soon.

open-telemetry/oteps#8 talks about removing it from the API; it might be moved to the SDK spec instead, no?

But in that case it sounds like we shouldn't reference it from the spec.

Something like SpanData can exist at the SDK level; there needs to be some sort of common concrete type that exporters can use. Having it in the API, though, added more surface area and complexity.

At this moment (in Java) we actually get a low-level interface that returns the (read-only) protobuf representation of a Span. Wondering if that is, well, low-level ;)

bg451 · 2019-08-08T18:54:00Z

We recently discussed batched span processors in the javascript sig and some concerns came up. Batching can have a lot of parameters (backoff, retry, dropping, etc.), so our worry is that trying to provide an out of the box batching solution won't work for some vendors, requiring the user to do more upfront, vendor specific configuration. We decided in our sig meeting to send spans to all exporters on end, and span processors are a place where users/vendors can hook into for extra sauce. I personally don't mind making vendors write more code if it means users will be able to write less.

I'd really like to learn more about the thought process and previous discussion since #155 doesn't have any information.

pavolloffay · 2019-08-09T11:55:41Z

The parameters listed (backoff, retry, dropping, etc.) seem more related to the retry mechanism than to batching. I think the point here is to document something generic enough that it will work across most languages.

> I'd really like to learn more about the thought process and previous discussion since #155 doesn't have any information.

+1 perhaps @bogdandrutu or @SergeyKanzhelev could provide some details of what was discussed before.

carlosalberto · 2019-08-13T14:48:42Z

> We decided in our sig meeting to send spans to all exporters on end, and span processors are a place where users/vendors can hook into for extra sauce.

That makes me wonder whether different languages might actually need to provide different strategies here, depending on the thread/execution model...

tedsuo · 2019-08-13T20:57:42Z

I know that we are trying to document the current SDK behavior. But I really think we need to do more work to define the requirements for the SDK. It's hard to judge a design proposal such as this, without understanding the intended use cases.

bogdandrutu

Overall it looks good, just some more thoughts that I have about this.

bogdandrutu · 2019-08-14T00:49:03Z

specification/sdk-span-processors.md

+
+## Interface definition
+
+### OnStart(SpanData)


To allow performance improvements and lazy initialization of the SpanData, I would suggest we go with a ReadableSpan which may expose some properties directly, plus a toSpanData or toSpanProto method. The reason is that you don't want to construct that object every time unless someone really needs it. Maybe a LazySpanData or something similar achieves the same thing.

Other fields that I have in mind may be information like isOutOfBand added in open-telemetry/oteps#8. So I think we need to expose an interface here that allows us to also expose other data.

Also, I heard that some implementations of the SpanProcessor may want to add attributes or events to the Span itself. So it may be interesting to hear whether the Span itself should be exposed (maybe the SdkSpan, which can have extra capabilities).

I didn't use ReadableSpan because it is not defined anywhere, whereas SpanData is defined in the tracing spec.

I will rename SpanData to just Span. The comment in the parameters will say:

Span - a readable span object.
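The lazy-conversion idea raised in this thread could look roughly like the following. `ReadableSpan`, `to_span_data` and the `conversions` counter are hypothetical names used only for this sketch, and caching the result assumes the span no longer changes after the first conversion (for example, because it has ended).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpanData:
    """Hypothetical immutable snapshot handed to exporters."""
    name: str
    attributes: tuple


class ReadableSpan:
    """Hypothetical read-only view that builds SpanData only on demand."""

    def __init__(self, name, attributes):
        self._name = name
        self._attributes = dict(attributes)
        self._span_data = None
        self.conversions = 0  # counts how often the snapshot was built

    @property
    def name(self):
        # Cheap property exposed directly, no snapshot needed.
        return self._name

    def to_span_data(self):
        if self._span_data is None:
            self.conversions += 1
            self._span_data = SpanData(
                self._name, tuple(sorted(self._attributes.items()))
            )
        return self._span_data


span = ReadableSpan("request", {"http.method": "GET"})
name_only = span.name                    # no snapshot built yet
conversions_before = span.conversions
data = span.to_span_data()
same = span.to_span_data()               # reuses the cached snapshot
```

A processor that only inspects `name` never pays for the conversion, which is the performance argument made above.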

bogdandrutu · 2019-08-14T00:54:01Z

specification/sdk-span-processors.md

+
+### Simple processor
+
+The implementation of `SpanProcessor` that passes ended span directly to the configured `SpanExporter`.


In my mind the model was this:

SDK -> SpanProcessor (an implementation that batches all Spans) -> SpanExporter(List<SpanData/SpanProto>).

Exporting via a SpanExporter is one of the functionalities that we can implement with the SpanProcessor, but, for example, some implementations may provide a SpanProcessor that adds extra attributes onStart.

reyang · 2019-08-14T17:51:22Z

specification/sdk-span-processors.md

+
+First the spans are added to a synchronized queue, then exported to the exporter pipeline in batches.
+The implementation is responsible for managing the span queue and sending batches of spans to the exporters.
+This processor can cause high contention in a very high traffic service.


I would like to understand more about the high contention. Is this because of the queue insertion operation? There are lock-free queue implementations which don't have high contention even under heavy traffic.

Another question - do we expect the queue to guarantee FIFO?

Even a lock-free queue can cause false sharing, which is also very bad and hard to quantify.

Batching is not required to guarantee FIFO, and it can also drop spans under high traffic (based on a config). For an example of this see https://github.com/open-telemetry/opentelemetry-java/blob/master/sdk/src/main/java/io/opentelemetry/sdk/trace/export/BatchSampledSpansProcessor.java

We also execute the export on a different thread.


reyang · 2019-08-15T14:47:15Z

specification/sdk-span-processors.md

+Span processor is an interface which allows hooks for span start and end method invocations.
+The span processors are invoked only on sampled spans. This interface can be used as a helper for span exporter to batch and convert spans see [sdk-exporter-spec](sdk-exporter.md).
+
+Span processors can be registered directly on SDK Tracer. The processors are invoked in the same order as they were registered.


Where is this interface defined?

I would want to see something like Tracer.RegisterSpanProcessor covered in this PR.

I think that should belong to the tracer SDK spec, which we don't have at the moment.

I have created #217

pavolloffay · 2019-08-15T15:36:38Z

@bogdandrutu @reyang @c24t I have updated the PR. The only change was to remove SpanData. I think we should also agree whether to define default values for batching span processor. I think it makes sense to have the same values across languages.

bogdandrutu · 2019-08-15T16:29:19Z

specification/sdk-span-processors.md

+
+**Configurable parameters:**
+
+* `exporter` - the exporter where the spans are pushed.


I agree, let's not be too restrictive about the implementation.


Oberon00 · 2019-09-05T07:48:13Z

specification/sdk-span-processors.md

@@ -0,0 +1,80 @@
+# Span processor
+
+Span processor is an interface which allows hooks for span start and end method invocations.


Maybe an answer:

(a) if I must make a copy of the data, it's a Processor
(b) if I can take a reference to the data, it's an Exporter

In my understanding, it's rather the other way round: a span processor gets a mutable reference to a single Span synchronously. An exporter, on the other hand, may get one or more read-only Spans (or some data transfer object to which the Span was converted) delayed, asynchronously, or never (queue full).

Oberon00 · 2019-09-05T07:55:14Z

specification/sdk-span-processors.md

+
+**Parameters:**
+
+* `Span` - a readable span object.


I would rather like the Span processor to get a mutable reference to the same span object that is started/ended. I imagine a common use-case for span processors would be to attach context information like thread/coroutine ID, etc.
Although it is true that the Processor (at least in OnEnd) should also be able to read the data, which may or may not be possible with a sdk.Span object. Maybe we must specify an additional Span interface with getters?

> Maybe we must specify an additional Span interface with getters?

SpanData could be considered a readable interface; however, it is going to be removed in #215.

I believe we eliminated SpanData from the user-facing API. I could believe it's still meant as a part of the SDK processor API.


pavolloffay · 2019-09-18T14:24:42Z

Can anybody merge this API or pinpoint what should be improved or what is missing?

cc @open-telemetry/specs-approvers @bogdandrutu @yurishkuro @SergeyKanzhelev

* Add SDK span processors spec
* Fixes
* Update review comments
* Small fixes
* Remove span data
* Remove sampled
* Add is recording events
* change style of the links
* Add ascii diagram
* Smaller improvements based on review comments
* Fix typo

Signed-off-by: Pavol Loffay <ploffay@redhat.com>


pavolloffay requested review from AloisReitbauer, SergeyKanzhelev, bogdandrutu, c24t, carlosalberto, iredelmeier, reyang, songy23, tedsuo and yurishkuro as code owners August 7, 2019 14:19

reyang reviewed Aug 7, 2019

mayurkale22 reviewed Aug 7, 2019

songy23 reviewed Aug 7, 2019

c24t reviewed Aug 7, 2019

mayurkale22 mentioned this pull request Aug 7, 2019

feat(tracer): implement span processor open-telemetry/opentelemetry-js#149

Closed

reyang mentioned this pull request Aug 8, 2019

Django middleware: allow autogenerated span name override census-instrumentation/opencensus-python#757

Closed

bogdandrutu reviewed Aug 14, 2019

reyang reviewed Aug 14, 2019

mayurkale22 mentioned this pull request Aug 15, 2019

Zipkin Exporter open-telemetry/opentelemetry-js#192

Merged

reyang reviewed Aug 15, 2019

pavolloffay mentioned this pull request Aug 15, 2019

Document Tracer SDK #217

Closed

pavolloffay force-pushed the sdk-span-processors branch from 6a4b4e2 to 47b4bbb Compare August 15, 2019 15:33

bogdandrutu approved these changes Aug 15, 2019

Oberon00 reviewed Sep 5, 2019

This was referenced Sep 5, 2019

SpanProcessor API for building export pipelines open-telemetry/opentelemetry-go#116

Closed

Propose consolidating into a single implementation open-telemetry/oteps#12

Closed

mayurkale22 mentioned this pull request Sep 6, 2019

feat: add BatchSpanProcessor open-telemetry/opentelemetry-js#238

Merged

pavolloffay added 10 commits September 11, 2019 12:37

Add SDK span processors spec

74704a6

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Fixes

77d777a

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Update review comments

f22ca63

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Small fixes

dc08491

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Remove span data

d2ce27f

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Remove sampled

f6dd498

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Add is recording events

69853ec

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

change style of the links

e4666fe

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Add ascii diagram

4af4aee

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Smaller improvements based on review comments

de3a388

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

pavolloffay force-pushed the sdk-span-processors branch from 9a0d656 to de3a388 Compare September 11, 2019 10:56

mauriciovasquezbernal reviewed Sep 18, 2019

Fix typo

fe6c3db

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Merge branch 'master' into sdk-span-processors

84f95af

yurishkuro merged commit 7d7c3aa into open-telemetry:master Sep 20, 2019

Oberon00 mentioned this pull request Apr 16, 2020

SpanProcessor API - Storing a Span specific state open-telemetry/opentelemetry-java#1105

Open

TuckTuckFloof pushed a commit to TuckTuckFloof/opentelemetry-specification that referenced this pull request Oct 15, 2020

Moving a temporary object prevents copy elision. (open-telemetry#205)

e616636

Caught by: -Wpessimizing-move

trask mentioned this pull request Oct 4, 2023

Does anyone want http.status_code to report 0 instead of being empty? #3253

Closed

carlosalberto pushed a commit to carlosalberto/opentelemetry-specification that referenced this pull request Oct 21, 2024

Context propagation requirements for messaging semantic conventions (o…

aa7c777

…pen-telemetry#205)

carlosalberto pushed a commit to carlosalberto/opentelemetry-specification that referenced this pull request Oct 23, 2024

Context propagation requirements for messaging semantic conventions (o…

59f3a15

…pen-telemetry#205)

jack-berg mentioned this pull request Jan 28, 2025

Fix batch parameter requirements #4388

Closed

