.With() function repeating keys/values #622

Open
henriquechehad opened this issue Aug 15, 2018 · 4 comments

Comments

@henriquechehad
The .With() function repeats the same key/value pair for every With call. Should it overwrite the previous values, or should there be a function to remove specific keys/values previously set?


package main

import "go.uber.org/zap"

func main() {
	logger, _ := zap.NewProduction()
	defer logger.Sync()
	sugar := logger.Sugar()

	sugar = sugar.With("test", "value")
	sugar.Info("message 01")

	// prints:
	// {"level":"info","ts":1534362222.643982,"caller":"tmp/main.go:11","msg":"message 01","test":"value"}

	sugar = sugar.With("test", "value")
	sugar.Info("message 02")

	// prints duplicated "test" "value"
	// {"level":"info","ts":1534362222.644039,"caller":"tmp/main.go:14","msg":"message 02","test":"value","test":"value"}
}
@henriquechehad henriquechehad changed the title .With() function repeating key/values .With() function repeating keys/values Aug 15, 2018
@akshayjshah
Contributor

This is as designed, and is documented in NewJSONEncoder. It's technically allowable by the JSON specification, and all deserialization code that I'm aware of (including Go's standard library) preserves only the last value.

Supporting a last-writer-wins policy as you suggest is remarkably difficult in zap (and similar projects): zap is fast because it encodes fields as they're added, without maintaining an intermediate representation (commonly a map[string]interface{}). It's technically possible, and could even be made reasonably fast in the case where there are no duplicates, but we haven't run into many situations where it's valuable. Usually we hit this when two developers are mistakenly stomping on each other's log data, so keeping both values in the output makes debugging easier.

In short, this is functioning as designed. If you're interested in a fairly complex PR, I can guide you through how we might implement this feature and benchmark the performance impact.
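In the meantime, a caller-side workaround is to deduplicate the key/value pairs yourself before handing them to With. The dedupPairs helper below is a hypothetical sketch (not part of zap's API) that applies last-writer-wins to the loosely typed pair list SugaredLogger.With accepts:

```go
package main

import "fmt"

// dedupPairs keeps only the last value for each string key in a flat
// key/value list like the one SugaredLogger.With takes. Illustrative
// caller-side workaround; not part of zap.
func dedupPairs(pairs []interface{}) []interface{} {
	last := map[string]int{} // key -> index of that key in out
	out := make([]interface{}, 0, len(pairs))
	for i := 0; i+1 < len(pairs); i += 2 {
		key, ok := pairs[i].(string)
		if !ok {
			// Non-string keys: pass through untouched.
			out = append(out, pairs[i], pairs[i+1])
			continue
		}
		if j, seen := last[key]; seen {
			out[j+1] = pairs[i+1] // last writer wins
			continue
		}
		out = append(out, key, pairs[i+1])
		last[key] = len(out) - 2
	}
	return out
}

func main() {
	fmt.Println(dedupPairs([]interface{}{"test", "value", "test", "value2"}))
	// [test value2]
}
```

You would then call sugar.With(dedupPairs(fields)...) instead of sugar.With(fields...); this only helps within a single With call site, since zap still can't see keys baked into earlier clones.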

@geoah

geoah commented Mar 3, 2021

Just a note:

This seems to be an issue when sending such logs to GCP's Stackdriver, which takes the duplicated fields and concatenates their values.

{
	"trace": "foo",
	"trace": "foo"
}

will be presented in stackdriver as

{
	"trace": "foofoo"
}

@akshayjshah
Contributor

Huh! I'm a little surprised by this, but perhaps Google's also trying to avoid dropping the duplicate data. I'm no longer at Uber, so the current maintainers have the final say on whether to make any changes to the current duplicate-handling code.

If this is particularly inconvenient for you, you can wrap zap's JSON encoder and fix it yourself. To keep performance reasonably good, you could use a Bloom filter:

  • The JSON encoder can keep track of all keys seen so far in a uint64 Bloom filter.
  • If it sees a key that's possibly a duplicate, it can unmarshal the JSON accumulated so far into a map[string]json.RawMessage, overwrite the existing data (if any), and re-serialize it.

That would keep serialization reasonably fast and zero-allocation when there are no duplicates. I haven't thought through how you'd handle duplicates in nested objects (created by zap.Namespace).
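A minimal sketch of the Bloom-filter piece of that idea. The bloom64 type and its hashing scheme below are assumptions for illustration, not zap code; a real encoder wrapper would consult maybeSeen before each AddString and fall back to the re-serialize path on a hit:

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// bloom64 is a tiny Bloom filter backed by a single uint64, cheap enough
// to copy when an encoder is cloned.
type bloom64 uint64

// bits derives two bit positions (0..63) from the FNV-64a hash of the key.
func bits(key string) [2]uint {
	h := fnv.New64a()
	h.Write([]byte(key))
	sum := h.Sum64()
	return [2]uint{uint(sum % 64), uint((sum >> 6) % 64)}
}

// maybeSeen reports whether key was possibly added before (false positives
// are possible, false negatives are not), then records the key.
func (b *bloom64) maybeSeen(key string) bool {
	var mask bloom64
	for _, bit := range bits(key) {
		mask |= 1 << bit
	}
	seen := *b&mask == mask
	*b |= mask
	return seen
}

func main() {
	var b bloom64
	fmt.Println(b.maybeSeen("test")) // false: first sighting, fast path
	fmt.Println(b.maybeSeen("test")) // true: possible duplicate, take the slow dedup path
}
```

With only 64 bits the false-positive rate climbs quickly as distinct keys accumulate, but a false positive just means one unnecessary trip through the slow re-serialization path, so correctness is preserved.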

@pohly

pohly commented Nov 21, 2024

This is as designed, and is documented in NewJSONEncoder. It's technically allowable by the JSON specification,

According to the RFC:

The names within an object SHOULD be unique. ... When the names within an object are not unique, the behavior of software that receives such an object is unpredictable.

zap is relying on readers implementing a "last one wins" approach, but that is not required by the spec.

You are right that SHOULD is not MUST, so it's not wrong to write JSON with duplicates - it's just not guaranteed to be interoperable.
