
[WIP] OTEL / Prom metrics benchmark #5676

Draft

yurishkuro wants to merge 23 commits into main
Conversation

yurishkuro (Member) commented Jun 24, 2024

Note that the benchmarks are designed to minimize the per-call cost of counter increments by using "bound" instruments. Jaeger code always caches the instruments, even when some of the labels are dynamic (e.g., counts of received spans labeled with the emitting service name).
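For context, a bound-counter benchmark along these lines can be sketched with the OTel-Go SDK roughly as follows (illustrative only; the setup and names here are assumptions, not the exact benchmark code in this PR):

```go
// Hypothetical sketch of a "bound instrument" benchmark: the counter and its
// attribute set are created once, outside the hot loop, so each iteration
// measures only the cost of the Add call itself.
package metrics_test

import (
	"context"
	"testing"

	"go.opentelemetry.io/otel/attribute"
	"go.opentelemetry.io/otel/metric"
	sdkmetric "go.opentelemetry.io/otel/sdk/metric"
)

func BenchmarkOTELBoundCounter(b *testing.B) {
	reader := sdkmetric.NewManualReader()
	provider := sdkmetric.NewMeterProvider(sdkmetric.WithReader(reader))
	meter := provider.Meter("benchmark")

	counter, err := meter.Int64Counter("test_counter")
	if err != nil {
		b.Fatal(err)
	}

	// "Bind" the attributes once, outside the hot loop.
	attrSet := attribute.NewSet(attribute.String("tag1", "value1"))
	opt := metric.WithAttributeSet(attrSet)
	ctx := context.Background()

	b.ReportAllocs()
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		counter.Add(ctx, 1, opt)
	}
}
```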

```
$ go test -benchmem -benchtime=2s -bench=Benchmark ./internal/metrics/
goos: darwin
goarch: arm64
pkg: github.com/jaegertracing/jaeger/internal/metrics
BenchmarkPrometheusCounter-10       	342003924	         6.984 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounter-10             	33299455	        71.73 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounterWithLabel-10    	12442818	       190.6 ns/op	      16 B/op	       1 allocs/op
PASS
ok  	github.com/jaegertracing/jaeger/internal/metrics	8.415s
```

Wise-Wizard and others added 21 commits June 19, 2024 22:44
Signed-off-by: Wise-Wizard <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>
Co-authored-by: Yuri Shkuro <[email protected]>
Signed-off-by: Saransh Shankar <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>
tested via
```
$ go test -benchmem -benchtime=5s -bench=Benchmark ./internal/metrics/
```

before:
```
BenchmarkPrometheusCounter-10           856818336                6.875 ns/op           0 B/op          0 allocs/op
BenchmarkOTELCounter-10                 146044255               40.92 ns/op           32 B/op          2 allocs/op
```

after:
```
BenchmarkPrometheusCounter-10           855046669                6.924 ns/op           0 B/op          0 allocs/op
BenchmarkOTELCounter-10                 293330721               21.05 ns/op           16 B/op          1 allocs/op
```

Signed-off-by: Yuri Shkuro <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>
Signed-off-by: Wise-Wizard <[email protected]>

codecov bot commented Jun 24, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.36%. Comparing base (afdd311) to head (c8735f6).
Report is 3 commits behind head on main.

Additional details and impacted files
```
@@           Coverage Diff           @@
##             main    #5676   +/-   ##
=======================================
  Coverage   96.36%   96.36%           
=======================================
  Files         329      329           
  Lines       16060    16060           
=======================================
  Hits        15477    15477           
  Misses        405      405           
  Partials      178      178           
```
| Flag | Coverage Δ |
|------|------------|
| badger_v1 | 8.04% <ø> (ø) |
| badger_v2 | 1.92% <ø> (ø) |
| cassandra-3.x-v1 | 16.60% <ø> (ø) |
| cassandra-3.x-v2 | 1.84% <ø> (ø) |
| cassandra-4.x-v1 | 16.60% <ø> (ø) |
| cassandra-4.x-v2 | 1.84% <ø> (ø) |
| elasticsearch-7.x-v1 | 18.88% <ø> (ø) |
| elasticsearch-8.x-v1 | 19.08% <ø> (ø) |
| elasticsearch-8.x-v2 | 19.08% <ø> (ø) |
| grpc_v1 | 9.47% <ø> (+0.01%) ⬆️ |
| grpc_v2 | 7.49% <ø> (ø) |
| kafka | 9.76% <ø> (ø) |
| opensearch-1.x-v1 | 18.93% <ø> (+0.01%) ⬆️ |
| opensearch-2.x-v1 | 18.92% <ø> (-0.02%) ⬇️ |
| opensearch-2.x-v2 | 18.92% <ø> (ø) |
| unittests | 94.22% <ø> (ø) |

Flags with carried forward coverage won't be shown.

☔ View full report in Codecov by Sentry.

@yurishkuro (Member, Author)

Signed-off-by: Yuri Shkuro <[email protected]>
Comment on lines +59 to +61

```go
attrSet := attribute.NewSet(attribute.String("tag1", "value1"))
attrOpt := metric.WithAttributeSet(attrSet)
```
jmacd commented:

Curious how you feel about these two lines, and why you've excluded them from the benchmark? Ergonomically speaking, would you cache the attrOpt value after computing it? If so, you'd be better off with bound instruments. If not, you should measure it. I ask because the introduction of functional options adds a bunch of allocations, so unless you compute and re-use the []Option slice, you've either got an ergonomics problem or a performance problem. We stopped using the OTel-Go API as a result of this and installed a more-efficient functional-option-free bypass. lightstep/otel-launcher-go#446
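For illustration (assuming the standard OTel-Go metric API; this is not code from the PR), the two call patterns being contrasted look roughly like this:

```go
// Illustrative sketch of the two call patterns discussed above.
package metrics_example

import (
	"context"

	"go.opentelemetry.io/otel/attribute"
	"go.opentelemetry.io/otel/metric"
)

// Per-call construction: ergonomic, but builds a new attribute set and
// option on every increment, i.e. allocations on the hot path.
func incPerCall(ctx context.Context, c metric.Int64Counter) {
	c.Add(ctx, 1, metric.WithAttributes(attribute.String("tag1", "value1")))
}

// Cached option ("bound" style): the AddOption is computed once and reused,
// moving that cost off the hot path.
var cachedOpt = metric.WithAttributeSet(
	attribute.NewSet(attribute.String("tag1", "value1")))

func incCached(ctx context.Context, c metric.Int64Counter) {
	c.Add(ctx, 1, cachedOpt)
}
```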

yurishkuro (Member, Author) replied:

> Ergonomically speaking, would you cache the attrOpt value after computing it?

@jmacd yes, that's exactly what Jaeger is doing, we always used "bound" instruments.

```go
type otelCounter struct {
	counter  metric.Int64Counter
	fixedCtx context.Context
	option   metric.AddOption
}

func (c *otelCounter) Inc(value int64) {
	c.counter.Add(c.fixedCtx, value, c.option)
}
```

But this does not completely avoid allocations. It's not the passing of vararg options that's causing the allocations; it's something deeper in the implementation.

Signed-off-by: Yuri Shkuro <[email protected]>