Metrics-generator should filter out spans based upon attributes #1482

yvrhdn · 2022-06-09T17:07:20Z

Is your feature request related to a problem? Please describe.
If the metrics-generator is enabled for a tenant, it will generate metrics for all traces that are ingested. In some cases it would be nice if we could only generate metrics for a subset of spans.

Example:

don't collect metrics for traces that don't have the attribute http.code
only generate metrics for spans from service XXX

Describe the solution you'd like
It should be possible to configure filters that include/exclude spans based upon their attributes.

Describe alternatives you've considered
There is no alternative AFAIK. You could drop the metrics when remote writing them, but that is definitely not ideal.

Additional context

The text was updated successfully, but these errors were encountered:

faustodavid · 2022-07-15T15:54:50Z

What do you think of having filters a bit similar to the OpenTelemetry Collector but just for the metrics processors?

https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/internal/coreinternal/processor/filterspan/filterspan.go

It will allow multiple includes or excludes filters that can be defined using regexp or strict match type.

metrics_generator:
  processor:
    span_metrics:
      dimensions:
        - http.method
        - http.host
      span_filters:
        include:
          - match_type: regexp
            attributes:
              - key: http.method
                value: "GET|POST|PUT|DELETE|HEAD|OPTIONS|TRACE"
        exclude:
         - match_type: strict
           attributes:
            - key: http.host
              value: "some-host-name"

yvrhdn · 2022-07-18T10:35:53Z

Yeah, that looks like a good reference! We won't be able to vendor the OTel processor directly, but we can mirror the structure of the configuration.

For a first implementation I'd like to keep it as simple as possible (just focus on the minimal features required to make this useful) so our initial implementation might be more limited.

09jvilla · 2022-11-04T18:45:17Z

Does this help decrease the count of series produced by the metrics generator and therefore does it have cost management applications? Or is it more about just getting metrics about the services or cases you actually care about without having those numbers skewed by the other ones?

ie-pham · 2022-11-04T19:25:25Z

If the filters are applied in a way that would reduce cardinality in the metrics produced by the Metrics Generator then potentially there could be a cost reduction.

09jvilla · 2022-11-04T19:27:29Z

Based on your response, though, it sounds like that is not the core use case or purpose of doing this? Its more like a nice secondary benefit?

github-actions · 2023-01-04T00:03:12Z

This issue has been automatically marked as stale because it has not had any activity in the past 60 days.
The next time this stale check runs, the stale label will be removed if there is new activity. The issue will be closed after 15 days if there is no new activity.
Please apply keepalive label to exempt this Issue.

NickAdolf · 2023-01-25T11:16:52Z

This filtering is both to declutter metrics that won't actually be used, but also a cost savings for us.

As we look to broaden our OTel usage, including Faro the metrics generator is required to really harness the value but it will put extreme pressure on active series counts. This is the biggest impact for us, as a Grafana Cloud customer. The previously proposed solution would more than suffice for my particular use case.

lmickh · 2023-02-09T18:47:55Z

I think there is a benefit to filtering out the traces before then get sent to the metrics generator, but that might be out of the scope of this issue.

I have a use-case where we receive ~5TB trace data per day, but only want to generate metrics on a specific list of services. It would be great if the distributors could just not send that to the metrics generator and reduce the daily network costs in addition to the cpu/memory consumption of the metrics generator.

This could probably be done in my use-case by splitting Tempo tenants, but then I've got to push that logic down to the otel collector and deal with different services using different tenants even though they are part of the same stack that telemetry data gets referenced across.

It would be nice if the distributor could use a limit set of dimensions to filter with even if it was only tenant and an exact match on the service.name attribute.

zalegrala · 2023-04-07T16:14:20Z

I've got a PR up to approach solving this issue. Review/testing welcome. I'd like to get this running somewhere and put it through the paces, but right now I have reasonable indication this is doing what we've want here based on the tests.

To the notion of filtering at the distributor, I think this is a great idea, but is more complicated due to the nature of the how the distributor works. The best approach to that might be to filter at the head using the agent or similar approach, so the distributor never receives the spans. But perhaps you are wanting to only filter out metrics generation and not the actual ingest of the spans. It might be worth a separate issue about how we could think about how we could approach that issue.

It came up during review with the team, that it might also be nice to filter based on TraceQL, given the amount of effort we're putting into the language. If we could tie into that area of the code, we may be able to re-use some filtering capabilities there, but I suspect this will be a follow-up issue.

zalegrala · 2023-04-07T16:16:47Z

Note, any testers will want to read the docs that are also included in #2274.

yvrhdn added the component/metrics-generator label Jun 9, 2022

yvrhdn changed the title ~~Exclude spans from metrics-generator~~ Metrics-generator should filter out spans based upon attributes Jun 27, 2022

faustodavid mentioned this issue Jun 28, 2022

[Metrics-generator] Add option to include dimensions from span events #1508

Open

joe-elliott added this to the Next milestone Jul 12, 2022

cristiangsp added this to Tempo squad Jul 13, 2022

yvrhdn assigned ie-pham Oct 11, 2022

cristiangsp moved this to Todo in Tempo squad Nov 2, 2022

github-actions bot added the stale Used for stale issues / PRs label Jan 4, 2023

joe-elliott added enhancement New feature or request keepalive Label to exempt Issues / PRs from stale workflow and removed stale Used for stale issues / PRs labels Jan 4, 2023

zalegrala moved this from Todo to Next in Tempo squad Mar 1, 2023

zalegrala self-assigned this Mar 1, 2023

zalegrala moved this from Next to In Progress in Tempo squad Mar 28, 2023

zalegrala mentioned this issue Apr 7, 2023

[metrics-generator] filter out spans based on policy #2274

Merged

3 tasks

zalegrala moved this from In Progress to In Review in Tempo squad Apr 27, 2023

zalegrala closed this as completed in #2274 May 2, 2023

github-project-automation bot moved this from In Review to Done in Tempo squad May 2, 2023

ie-pham moved this from Done to Archived in Tempo squad May 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metrics-generator should filter out spans based upon attributes #1482

Metrics-generator should filter out spans based upon attributes #1482

yvrhdn commented Jun 9, 2022

faustodavid commented Jul 15, 2022

yvrhdn commented Jul 18, 2022

09jvilla commented Nov 4, 2022 •

edited

Loading

ie-pham commented Nov 4, 2022

09jvilla commented Nov 4, 2022

github-actions bot commented Jan 4, 2023

NickAdolf commented Jan 25, 2023

lmickh commented Feb 9, 2023

zalegrala commented Apr 7, 2023

zalegrala commented Apr 7, 2023

Metrics-generator should filter out spans based upon attributes #1482

Metrics-generator should filter out spans based upon attributes #1482

Comments

yvrhdn commented Jun 9, 2022

faustodavid commented Jul 15, 2022

yvrhdn commented Jul 18, 2022

09jvilla commented Nov 4, 2022 • edited Loading

ie-pham commented Nov 4, 2022

09jvilla commented Nov 4, 2022

github-actions bot commented Jan 4, 2023

NickAdolf commented Jan 25, 2023

lmickh commented Feb 9, 2023

zalegrala commented Apr 7, 2023

zalegrala commented Apr 7, 2023

09jvilla commented Nov 4, 2022 •

edited

Loading