
Issues with cross-process concurrency #20

Closed
sheerun opened this issue Jul 23, 2014 · 10 comments

@sheerun commented Jul 23, 2014

This is connected to sindresorhus/insight#22.

Configstore doesn't lock the config file before writing to it. This can result in invalid YAML when configstore is instantiated in multiple processes concurrently.

If configstore encounters invalid YAML, it wipes the store, which is what causes sindresorhus/insight#22.

The solution is to use a file lock around every filesystem call (mkdir, write, read).

@sindresorhus (Owner)

When I created this I assumed the filesystem did atomic writes, but clearly it doesn't. It seems weird that fs can't handle this for you.

@dfreeman

I spent a little time looking at this today. Unfortunately, it looks like the only way to incorporate a file lock would be to make the entire configstore API asynchronous.

Given that this is already a fairly small library, a change like that would essentially amount to a complete rewrite. There's also something to be said for the simplicity of the synchronous interface for use cases that aren't concerned with concurrent access.

Do you have any thoughts on what the best path forward on this might be? I'm considering the possibility of just forking/building an asynchronous concurrent-access-safe version of configstore, but wanted to get your insights first.

@sindresorhus (Owner)

I don't see why it would have to be async. Can you elaborate?

@dfreeman

If the lock is already held by another process when we go to read or write the config file, the two options are to either give up entirely or wait some amount of time and check the lock again.

In the "just give up" option, clients of the library have to be prepared to handle a "lock already taken" exception potentially being raised by any configstore operation. They then have to implement their own logic to either manually retry after some period of time or otherwise handle the failure. This avoids data corruption, but pushes the problem of actually dealing with concurrent use on to the users of the library.

For the "automatically retry" option, we don't want to just hammer the filesystem in a loop checking the lock, so we need to wait some period of time between checks (proper-lockfile does this with nice exponential backoff). The only way I can think of to do this synchronously would be to roll our own file lock implementation that uses fibers or some other native extension to do a synchronous wait. Both the complexity of building a home-grown file lock and introducing the need for native bindings feel like points against that idea.

There could definitely be another alternative I'm not seeing, but the only other one I've thought of is an asynchronous interface.

@sindresorhus (Owner)

Another alternative could be to use atomic file writing and just let the last concurrent write win. One concurrent write will overwrite another, but that might not be such a big problem?

// @SBoudrias @sheerun

@SBoudrias (Contributor)

I think having the last concurrent write win is good enough.

About the automatic-retry point: the naive implementation is to block the process with a loop: while (now < t300msLater) {}. This blocks the process, but that's kind of the point of a synchronous API. It would be way easier than starting to play with fibers.
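
A minimal sketch of that idea, assuming proper-lockfile's lockSync (which throws synchronously if the lock is already held); the helper name and retry count are made up:

```js
const lockfile = require('proper-lockfile');

// Synchronous lock with naive retry: spin for ~300ms between attempts.
// The busy-wait pegs a CPU core, but needs no fibers or native bindings.
function lockSyncWithRetry(file, attempts) {
  for (let i = 0; i < attempts; i++) {
    try {
      return lockfile.lockSync(file); // returns a release function
    } catch (err) {
      const until = Date.now() + 300;
      while (Date.now() < until) {} // busy-wait, then try again
    }
  }
  throw new Error('Could not acquire lock on ' + file);
}
```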

@sheerun (Author) commented Jul 26, 2015

I have no idea how to implement "last concurrent write win" behavior :) With node-proper-lockfile it's probably feasible to implement "first concurrent write win", because subsequent calls to lockSync would throw an error.

@sindresorhus (Owner)

@sheerun We can use atomic file writing: https://github.com/iarna/write-file-atomic. It works by writing to a temporary file first, then renaming it over the actual file when done. We would then get "last concurrent write win" for free.
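
Roughly like this (configPath and config are illustrative names, not configstore's actual internals):

```js
const writeFileAtomic = require('write-file-atomic');

// write-file-atomic writes to a temp file and then rename()s it over the
// target, so a reader never sees a half-written file — the last
// concurrent writer wins with an intact file.
writeFileAtomic.sync(configPath, JSON.stringify(config, undefined, '\t'));
```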

@sheerun (Author) commented Jul 26, 2015

That's certainly better than corrupted configuration files :)

@dfreeman

@SBoudrias Blocking the process is the point of a synchronous API, but using a busy loop that pegs one CPU core seems less than ideal.

@sindresorhus Atomic writes definitely solve the corruption problem, and like you mentioned, it's easy to drop in a package to handle that. If you decide just avoiding total corruption is good enough for this project, then 👍.

In casual testing, though, the race condition I hit more often was caused by the read-mutate-write sequences in set and del: competing processes end up clobbering one another's changes when those steps interleave, so you get silent data loss. It's not as dire as straight-up corrupting the file, but the window in which it can happen is much bigger, so ¯\_(ツ)_/¯
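
For anyone who wants to see it, here's a hypothetical repro (not configstore's code) of that lost update. Run it in two terminals within a couple of seconds of each other; whichever process writes last discards the other's key, because its in-memory copy predates the other write:

```js
const fs = require('fs');

const file = '/tmp/configstore-race.json';

// read — this opens the lost-update window
let config = {};
try {
  config = JSON.parse(fs.readFileSync(file, 'utf8'));
} catch (err) {}

// mutate our own key
config['key-' + process.pid] = true;

// artificially widen the window so two manual runs reliably overlap
setTimeout(() => {
  // write — discards anything the other process wrote since our read
  fs.writeFileSync(file, JSON.stringify(config));
}, 2000);
```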
