Handle instrumentation scope in the Prometheus conversion spec #2422

dashpole · 2022-03-17T17:06:48Z

Related: #1906

Background

Changes

This adds "short_name" to the instrumentation scope, which would be an optional, single-word name for the scope. If present, it would be used as the metric prefix in Prometheus exporters. In prometheus receivers, the opentelemetry_instrumentation_scope metric would identify prefixes that can be removed from metrics, and used to reconstruct the Instrumentation scope in OTLP.

Alternatives:

Turn the library name into the single-word metric prefix. E.g. goopentelemetryiocontribinstrumentationnethttpotelhttp_http_server_duration
- Downside: This is ugly
Add it as a label to all metrics within the scope: {"instrumentation_scope": "go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp"}
- Downside: This isn't how OpenMetrics is designed to namespace metrics. Removing this later would be breaking for users.

cc @jmacd @Aneurysm9 @jsuereth

MrAlias · 2022-03-22T19:55:46Z

specification/metrics/datamodel.md

+Instrumentation Scope MUST be left unset for metrics scraped from Prometheus
+endpoints.


If the opentelemetry_instrumentation_scope info metric is present that that not be used for the instrumentation scope?

Thats correct. It remains a separate metric stream to transmit that a given instrumentation library was used, but it doesn't allow you to figure out which metrics it applies to. I'm not sure it makes much sense, though, without the "short_name" in instrumentation scope. If we had that, then we could reconstruct the Instrumentation Scope here.

Maybe I should just skip this intermediary specification step, and just propose the "short_name" + round-tripping through prometheus...

Maybe I should just skip this intermediary specification step, and just propose the "short_name" + round-tripping through prometheus...

I'd be interested to hear what other think, but that sounds good to me.

jmacd · 2022-03-23T19:11:30Z

We should be careful to call the OpenMetrics "metric namespace" an equivalent of OTel's instrumentation library. I don't think this is completely true, see below.

Proposal for round-tripping Instrumentation Scope

I would like to see a treatment of instrumentation scope that could preserve the information for an OTel SDK exporting Prometheus data, which means a Prometheus scrape ideally would produce the same metrics as the SDK would have pushed. I think this is possible.

I've studied the OpenMetrics guidance on target info,

https://github.com/OpenObservability/OpenMetrics/blob/main/specification/OpenMetrics.md#supporting-target-metadata-in-both-push-based-and-pull-based-systems

and consider instrumentation scope information to be in a similar category. Suppose we use an OM info metric:

# TYPE otel_scope_info info
# HELP otel_scope_info Instrumentation scope metadata
otel_scope_info{name="some.dns.name/lib",version="0.1",schema_url="standard/url"} 1

To round-trip this correctly with an OTel scraper, an OTel SDK would specifically produce one library of metrics at a time. The OM/OTel translation would infer that metrics are organized by instrumentation scope such that the most recent otel_scope_info is implicitly associated with metrics in a scoped section. OTLP data scraped from other sources would leave instrumentation scope empty, as you propose.

Argument in favor of a first-class namespace concept

OpenMetrics writes about metric namespacing "the aim is to keep to a lightweight informal approach" and has three standard instrument constructor fields: "Namespace", "Subsystem", and "Name". With two optional underscore-separated fields and the potential for "Name" itself to contain underscores, there's no way to parse an OM metric name and infer what is namespace, what is subsystem, and what is name. Given no way to distinguish these, "metric namespace" appears to be a user-level option to ensure that when the same kind of instrumentation is produced by multiple distinct instances of the same instrumentation inside a single process, the user can force them not to aggregate or display in the same queries by adding an additional prefix. This would be done as-needed when the user is aware of conflicts and has the intention to solve them using namespaces, otherwise would not be done to keep metric names short.

To illustrate when metric namespace appears to solve for an OM user, imagine an OTel SDK configured with two clients speaking to two separate Redis instances with different load patterns, different kinds of data, different purposes. You've got two instances of the Redis client, and two instances of the OTel instrumentation.

Can the user have a way to distinguish the two scopes that works for both OM and OTel? IMO the user needs a first-class namespace concept here, which is not what OTel's instrumentation scope is at this time. Adding an additional property or property list to the Instrumentation Scope message type could help, and we could extend the OM<->OTel translation rules, for example, as follows. Suppose the MeterProvider has a new method called MeterProvider.Namespace(string) returning a MeterProvider where each Meter has the associated namespace. OTLP consumers would be advised to recognize this by prefixing metrics names in the section when displayed and queried.

This is what two instrumentation scopes for the same instrumentation library would look like when scraped, using the OM metric namespace:

# TYPE otel_scope_info info
# HELP otel_scope_info Instrumentation scope metadata
otel_scope_info{name="...",version="...",schema_url="...,namespace="firstprefix_"} 1

# TYPE firstprefix_http_requests
# HELP firstprefix_http_requests Number of HTTP requests
firstprefix_http_requests{} 1000

# TYPE otel_scope_info info
# HELP otel_scope_info Instrumentation scope metadata
otel_scope_info{name="...",version="...",schema_url="...,namespace="secondprefix_"} 1

# TYPE secondprefix_http_requests
# HELP secondprefix_http_requests Number of HTTP requests
secondprefix_http_requests{} 1000

dashpole · 2022-03-24T14:09:11Z

Unfortunately, I don't think the "Proposal for round-tripping Instrumentation Scope" complies with the OM spec: MetricFamilies MUST NOT be interleaved.

I haven't actually ever seen Namespace used to distinguish multiple instances of instrumentation--people just seem to use it as a second "subsystem" (e.g. in k8s), but maybe they were just using it wrong :). Since prom doesn't have a "View" concept, all metrics i've seen are fully named (including namespace, subsystem, etc.) by the library itself.

I think this is implied by the OM spec as well: the more public a library is the better namespaced its metric names should be implies that namespacing is performed by the library to keep it from colliding with other libraries.

So I think we are roughly on the same page, but need to figure out if the OM namespace is the equivalent of the instrumentation library, or if it is meant to be more than that.

@brian-brazil, we discussed this briefly at the prom WG yesterday. Can you help clarify the purpose of the OM namespace?

brian-brazil · 2022-03-24T14:19:39Z

Unfortunately, I don't think the "Proposal for round-tripping Instrumentation Scope" complies with the OM spec: MetricFamilies MUST NOT be interleaved.

I see no problems with the sample output. Though name and version are poor choices of label name that are highly likely to clash with something.

Can you help clarify the purpose of the OM namespace?

There's no formal concept of it in OM, as you say namespacing is performed by the library to keep it from colliding with other libraries.

jmacd · 2022-03-24T14:20:09Z

Unfortunately, I don't think the "Proposal for round-tripping Instrumentation Scope" complies with the OM spec: MetricFamilies MUST NOT be interleaved.

I see. A workaround would be to namespace the otel_scope_info section divider itself.

# TYPE firstprefix_otel_scope_info info
# HELP firstprefix_otel_scope_info Instrumentation scope metadata
firstprefix_otel_scope_info{name="...",version="...",schema_url="...,namespace="firstprefix_"} 1

# TYPE firstprefix_http_requests
# HELP firstprefix_http_requests Number of HTTP requests
firstprefix_http_requests{} 1000

# TYPE secondprefix_otel_scope_info info
# HELP secondprefix_otel_scope_info Instrumentation scope metadata
secondprefix_otel_scope_info{name="...",version="...",schema_url="...,namespace="secondprefix_"} 1

# TYPE secondprefix_http_requests
# HELP secondprefix_http_requests Number of HTTP requests
secondprefix_http_requests{} 1000

brian-brazil · 2022-03-24T14:22:17Z

OpenMetrics writes about metric namespacing "the aim is to keep to a lightweight informal approach" and has three standard instrument constructor fields: "Namespace", "Subsystem", and "Name".

OM has no such fields or notiions, that's what some client libraries did historically and I personally now consider it a bad idea. At the least it breaks grepability of metric names.

dashpole · 2022-03-24T15:44:58Z

I updated this to propose the short_name.

jmacd · 2022-03-24T15:48:42Z

This proposal from @bogdandrutu #2307 would give us a nice place to store the new "short_name" concept. I'm eager to see us add a list of arbitrary key:values to the scope, it would give us a way to distinguish two instances of the same instrumentation library which today we cannot do.

tigrannajaryan · 2022-03-24T15:53:34Z

I'm eager to see us add a list of arbitrary key:values to the scope, it would give us a way to distinguish two instances of the same instrumentation library which today we cannot do.

+1 to this. There are other use cases for this key:value list in the scope too, such as differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today.

tigrannajaryan · 2022-03-24T19:23:12Z

Given the comments about key-value lists in the Scope should the addition of ShortName be paused for now and instead we make it a semantic convention to be recorded in the key-value list?

dashpole · 2022-03-24T19:24:48Z

Yes. I'll close this for now, and will reopen after we've introduced the key-value list.

There are a few reasons why Scope attributes are a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](open-telemetry/opentelemetry-specification#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](open-telemetry/opentelemetry-specification#2450).

There are a few reasons why adding Scope attributes are a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](open-telemetry/opentelemetry-specification#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with the other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](open-telemetry/opentelemetry-specification#2450).

There are a few reasons why adding Scope attributes is a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](open-telemetry/opentelemetry-specification#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with the other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](open-telemetry/opentelemetry-specification#2450).

* Introduce Scope Attributes There are a few reasons why adding Scope attributes is a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](open-telemetry/opentelemetry-specification#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with the other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](open-telemetry/opentelemetry-specification#2450).

joaopgrassi · 2022-06-28T13:06:00Z

@tigrannajaryan @dashpole now that we have the scope attributes, should we bring this topic back, and define the semantic convention for scope attribute(s), where short_name would be the first?

tigrannajaryan · 2022-06-28T13:10:20Z

@tigrannajaryan @dashpole now that we have the scope attributes, should we bring this topic back, and define the semantic convention for scope attribute(s), where short_name would be the first?

Yes, feel free to make a proposal (or @dashpole feel free to do it).

We may need changes to semantic convention generator tooling to support scopes.

joaopgrassi · 2022-07-15T10:32:08Z

Unfortunately I don't have the cycles to work on this now, but I quickly tried it out and I think we don't need to change the convention generator.

Here's what I tried:
https://github.com/open-telemetry/opentelemetry-specification/compare/main...dynatrace-oss-contrib:opentelemetry-specification:feature/scope_attributes_semconv?expand=1

We could have a document inside specification/common/scope.md for the common scope attributes (short_name) for ex. Then, if there's scope attributes specific for each signal, those can be defined inside their appropriate folder like: semantic_conventions/metrics/scope-metrics.yaml -> specification/metrics/semantic_conventions/scope-metrics.md.

tigrannajaryan · 2022-07-15T14:12:57Z

@dashpole will you be able to work on this?

dashpole · 2022-07-15T14:17:23Z

Yes, I plan to work on this

joaopgrassi · 2022-07-19T10:56:20Z

Do we have an issue for it? Also, do you folks have an idea on how would we name this? We already have this page that states otel.scope.name and otel.scope.version. Would short_name also be mapped under the same? otel.scope.short_name? Or would it be under a new thing with just scope.short_name?

tigrannajaryan · 2022-07-20T18:11:17Z

Do we have an issue for it? Also, do you folks have an idea on how would we name this? We already have this page that states otel.scope.name and otel.scope.version. Would short_name also be mapped under the same? otel.scope.short_name? Or would it be under a new thing with just scope.short_name?

I don't think anything is decided yet. I would not assume otel.scope.* is the right approach. Should all scope attribute names start with otel.scope? Is there a reason the names cannot start from the root namespace?

joaopgrassi · 2022-07-20T20:12:48Z

Yeah I don't particularly have any answers, but was rather fishing for your ideas. I think we probably should have a new issue to discuss this topic further and lay out a good foundation for other attributes to come. I also don't think we need otel.scope.* as those are a bit "special".

joaopgrassi · 2022-07-21T11:21:22Z

I created an issue #2682 so we have a forum to discuss it. :)

Fixes: #2493 Related: #1906 This is a second attempt at #2422. ## Changes ### Background: Naming Collisions OpenTelemetry encourages the use of semantic conventions to make metric naming similar across instrumentation. For example, if I have two http client libraries in my application, they would each produce a metric named `http.client.duration`, but with different meters (e.g. [otelmux](https://github.com/open-telemetry/opentelemetry-go-contrib/tree/0dd27453a1ce8e433cb632e175a27f28ee83998d/instrumentation/github.com/gorilla/mux/otelmux) vs [otelhttp](https://github.com/open-telemetry/opentelemetry-go-contrib/tree/0dd27453a1ce8e433cb632e175a27f28ee83998d/instrumentation/net/http/otelhttp)). A prometheus exporter which receives both of these metrics would not be able to serve both of those histograms. This would occur anytime a user uses two libraries which produces the same category (e.g. http, database, rpc, etc) of metrics, or if the two libraries just happen to use the same name for a metric. Depending on the language, it may fail to create the Prometheus exporter, or may fail to send some, or all metrics if the same labels keys and values are present in both. ### Desired User Experience As a user, I can use a Prometheus exporter with OpenTelemetry without experiencing strange errors/behavior due to naming collisions, and without having to apply transformations to metric names to work around these, except in rare cases. As a user, I can easily add scope attributes to my metrics in Prometheus by joining with an info-style metric. This is a common pattern in Prometheus: https://grafana.com/blog/2021/08/04/how-to-use-promql-joins-for-more-effective-queries-of-prometheus-metrics-at-scale/. ### Design Add `opentelemetry_scope_name` and `opentelemetry_scope_version` as labels to all metrics. This ensures that if two libraries produce the same metric points, they don't collide because the scope name/version labels will differ. Those labels also serve as "join keys" to be able to add scope attributes to Prometheus metrics. This is accomplished by introducing an `opentelemetry_scope_info` metric containing the same `opentelemetry_scope_name` and `opentelemetry_scope_version` labels, but also including scope attributes. This also enables the collector's Prometheus receiver to reconstruct the original Instrumentation Scope when receiving the metrics.

Fixes: #2493 Related: open-telemetry/opentelemetry-specification#1906 This is a second attempt at open-telemetry/opentelemetry-specification#2422. ## Changes ### Background: Naming Collisions OpenTelemetry encourages the use of semantic conventions to make metric naming similar across instrumentation. For example, if I have two http client libraries in my application, they would each produce a metric named `http.client.duration`, but with different meters (e.g. [otelmux](https://github.com/open-telemetry/opentelemetry-go-contrib/tree/0dd27453a1ce8e433cb632e175a27f28ee83998d/instrumentation/github.com/gorilla/mux/otelmux) vs [otelhttp](https://github.com/open-telemetry/opentelemetry-go-contrib/tree/0dd27453a1ce8e433cb632e175a27f28ee83998d/instrumentation/net/http/otelhttp)). A prometheus exporter which receives both of these metrics would not be able to serve both of those histograms. This would occur anytime a user uses two libraries which produces the same category (e.g. http, database, rpc, etc) of metrics, or if the two libraries just happen to use the same name for a metric. Depending on the language, it may fail to create the Prometheus exporter, or may fail to send some, or all metrics if the same labels keys and values are present in both. ### Desired User Experience As a user, I can use a Prometheus exporter with OpenTelemetry without experiencing strange errors/behavior due to naming collisions, and without having to apply transformations to metric names to work around these, except in rare cases. As a user, I can easily add scope attributes to my metrics in Prometheus by joining with an info-style metric. This is a common pattern in Prometheus: https://grafana.com/blog/2021/08/04/how-to-use-promql-joins-for-more-effective-queries-of-prometheus-metrics-at-scale/. ### Design Add `opentelemetry_scope_name` and `opentelemetry_scope_version` as labels to all metrics. This ensures that if two libraries produce the same metric points, they don't collide because the scope name/version labels will differ. Those labels also serve as "join keys" to be able to add scope attributes to Prometheus metrics. This is accomplished by introducing an `opentelemetry_scope_info` metric containing the same `opentelemetry_scope_name` and `opentelemetry_scope_version` labels, but also including scope attributes. This also enables the collector's Prometheus receiver to reconstruct the original Instrumentation Scope when receiving the metrics.

* Introduce Scope Attributes There are a few reasons why adding Scope attributes is a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](open-telemetry#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with the other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](open-telemetry#2450).

* Introduce Scope Attributes There are a few reasons why adding Scope attributes is a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](open-telemetry/opentelemetry-specification#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with the other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](open-telemetry/opentelemetry-specification#2450).

* Introduce Scope Attributes There are a few reasons why adding Scope attributes is a good idea: - There are 2 known use cases where Scope attributes can solve specific problems: - Add support for [Meter "short_name"](#2422), represented as an attribute of Meter's Scope. - Add support for differentiating the type of data emitted from the scopes that belong to different data domains, e.g. profiling data emitted as log records or client-side data emitted as log records needs to be differentiated so that it can be easily routed and processed differently in the backends. We don't have a good way to handle this today. The type of the data can be recorded as an attribute Logger's Scope. - It makes Scope consistent with the other primary data types: Resource, Span, Metric, LogRecord. See additional [discussion here](#2450).

handle instrumentation scope in prometheus conversions

0a5973f

dashpole force-pushed the prom_instrumentation_scope branch from abb65d2 to 0a5973f Compare March 17, 2022 17:08

dashpole marked this pull request as ready for review March 17, 2022 17:12

dashpole requested review from a team March 17, 2022 17:12

github-actions bot assigned bogdandrutu Mar 17, 2022

MrAlias reviewed Mar 22, 2022

View reviewed changes

switch to short_name proposal

a52dad3

dashpole requested review from a team March 24, 2022 15:44

fix link

0b2def8

dashpole closed this Mar 24, 2022

tigrannajaryan mentioned this pull request Mar 28, 2022

Add Attributes to Instrumentation Scope #2450

Closed

tigrannajaryan mentioned this pull request Apr 25, 2022

Introduce Scope Attributes open-telemetry/oteps#201

Merged

joaopgrassi mentioned this pull request Jul 21, 2022

Semantic conventions for Instrumentation Scope Attributes #2682

Open

This was referenced Jul 28, 2022

Add the short_name scope attribute #2702

Closed

Add Instrumentation Scope and Version as labels in Prometheus #2703

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle instrumentation scope in the Prometheus conversion spec #2422

Handle instrumentation scope in the Prometheus conversion spec #2422

dashpole commented Mar 17, 2022 •

edited

Loading

MrAlias Mar 22, 2022

dashpole Mar 22, 2022 •

edited

Loading

MrAlias Mar 22, 2022

jmacd commented Mar 23, 2022 •

edited

Loading

dashpole commented Mar 24, 2022

brian-brazil commented Mar 24, 2022 •

edited

Loading

jmacd commented Mar 24, 2022

brian-brazil commented Mar 24, 2022

dashpole commented Mar 24, 2022

jmacd commented Mar 24, 2022

tigrannajaryan commented Mar 24, 2022

tigrannajaryan commented Mar 24, 2022

dashpole commented Mar 24, 2022

joaopgrassi commented Jun 28, 2022 •

edited

Loading

tigrannajaryan commented Jun 28, 2022

joaopgrassi commented Jul 15, 2022

tigrannajaryan commented Jul 15, 2022

dashpole commented Jul 15, 2022

joaopgrassi commented Jul 19, 2022 •

edited

Loading

tigrannajaryan commented Jul 20, 2022

joaopgrassi commented Jul 20, 2022

joaopgrassi commented Jul 21, 2022

		Instrumentation Scope MUST be left unset for metrics scraped from Prometheus
		endpoints.

Handle instrumentation scope in the Prometheus conversion spec #2422

Handle instrumentation scope in the Prometheus conversion spec #2422

Conversation

dashpole commented Mar 17, 2022 • edited Loading

Background

Changes

MrAlias Mar 22, 2022

Choose a reason for hiding this comment

dashpole Mar 22, 2022 • edited Loading

Choose a reason for hiding this comment

MrAlias Mar 22, 2022

Choose a reason for hiding this comment

jmacd commented Mar 23, 2022 • edited Loading

Proposal for round-tripping Instrumentation Scope

Argument in favor of a first-class namespace concept

dashpole commented Mar 24, 2022

brian-brazil commented Mar 24, 2022 • edited Loading

jmacd commented Mar 24, 2022

brian-brazil commented Mar 24, 2022

dashpole commented Mar 24, 2022

jmacd commented Mar 24, 2022

tigrannajaryan commented Mar 24, 2022

tigrannajaryan commented Mar 24, 2022

dashpole commented Mar 24, 2022

joaopgrassi commented Jun 28, 2022 • edited Loading

tigrannajaryan commented Jun 28, 2022

joaopgrassi commented Jul 15, 2022

tigrannajaryan commented Jul 15, 2022

dashpole commented Jul 15, 2022

joaopgrassi commented Jul 19, 2022 • edited Loading

tigrannajaryan commented Jul 20, 2022

joaopgrassi commented Jul 20, 2022

joaopgrassi commented Jul 21, 2022

dashpole commented Mar 17, 2022 •

edited

Loading

dashpole Mar 22, 2022 •

edited

Loading

jmacd commented Mar 23, 2022 •

edited

Loading

brian-brazil commented Mar 24, 2022 •

edited

Loading

joaopgrassi commented Jun 28, 2022 •

edited

Loading

joaopgrassi commented Jul 19, 2022 •

edited

Loading