-
Notifications
You must be signed in to change notification settings - Fork 207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How CLS should work with promises #64
Comments
The important note here is that all of these stages of the promise lifecycle are important. To me, there are 5 points in the lifecycle of any call which produces an async task, including promises, which one might want to link context between in different ways:
Depending on the style of API (callbacks vs promises) points 2 and 3 may switch places. For repeatable actions (like Rather than picking one source of context, I'd rather see a way to allow the source to be configurable. That could be by allowing the user to define app-wide how a given context tree should be nested, or it could be done by producing a traversable structure that can be used to walk back through the tree to where one expects a particular bit of data to be. It may also be possible to combine the two somehow, with a hinting API for the call sites to individually describe how their child tasks should navigate the tree. It's a complicated problem which I feel CLS may be ill-equipped to solve in its current state. |
I would love to hear @ofrobots's thoughts. |
Followup comment from conversation in #66. Promises present an interesting pattern to async programming in JavaScript, but they are quite distinct from similarly named synchronization primitives in other languages. In all honesty, I didn't "get" the point of promises until I came across a few different articles by notable authors:
The first thing to note, as pointed out by @domenic is that promises, for better or worse, are functional in nature, not imperative. He explains it like this (emphasis his own):
The second important point covered by both authors is that a promise, once resolved or rejected, is immutable. The state of that promise may never change. It follows from this logic that whatever state exists in the CLS context at the time of a promise's resolution is a part of that promise's immutable state. Given this establishment of what a promise is and the example above under the section titled Difference between the three, I'm confident that "convention 1" is the only one of the three that fits with the goals of promises. The reasons these authors (and others like them) are so passionate about educating developers on promises, are the same reasons held by everyone in this thread; to establish a single interpretation of "safe" asynchronous programming in JavaScript. The safety I refer to is the guarantee of immutability, and ability to reason about asynchronous code. The latter two conventions would make it very difficult to reason about the state of a promise, given its CLS context is not captured at resolution time. Even if they are attempts to resolve the context-loss problem... But the closing statement on "convention 1" above describes the proper solution better than I could:
There are also a few example use cases provided above. I won't go into detail on any of these. I might propose, however, that promises could be the wrong tool for approaching those problems. You don't want to implement a non-terminating consumer for a message queue with promises, for example (something I foolishly tried the other day while sleep deprived). The functional nature of promises does not work correctly with the consumer pattern. You're better off with an event listener pattern! (But you can still use promises within your event handlers.) FWIW, my two cents. 👍 |
Agreed, convention 1 feels most natural to me too. |
Convention 3 seems way more correct. That is what we are doing for zones. |
This seems very unrelated to promises, by the way. It's more about the semantics of CLS and similar abstractions. At least for zones, the semantics are that no matter what mechanism you use to schedule your code later---setTimeout, promises, events---you want to have the context inside the scheduled code be the same as the context that performed the scheduling. This is essential to the nature of asynchronous context propagation in all cases I've seen so far. It also is the only thing that makes sense with async-await: // context is c1
await promise;
// context must still be c1 Under convention 1 or 2, context could in some cases no longer be c1, which is clearly wrong. Perhaps an analogy with thread-local storage would make it clearer. Let's say that you had the ability to resolve a promise from a background thread. (In fact, this is done all over the place in the web platform.) Should code in the foreground thread that reacts to that promise suddenly be using the thread-local storage of the background thread? Clearly not. My understanding is that convention 2 would make it always use the background thread context, whereas convention 1 would make it do so under certain weird circumstances. Again, in all cases context propagation should not depend on the scheduling mechanism, but should preserve the invariant that the context propagates from the scheduling code to the code that is scheduled. (My understanding of domains is that failing to enforce this invariant is one of their bigger failures; they do something weird for event emitters where event emitters carry a domain with them, which overrides the zone in which the event handler is called, instead of allowing the domain to propagate from the scheduling code to the event handler code.) |
@parasyte By the way, the link to @domenic's article is broken in your comment. Could you edit the link please? It appears to be this article you were referring to: https://blog.domenic.me/youre-missing-the-point-of-promises/ |
+1 to the comments from Domenic. I would like to add that convention 3 leads to better module interoperability. I like to conceptualize what CLS is trying to do by considering async stack traces. Imagine I am using module that returns me a promise: const cowsComeHome = require('cows').comeHome(); // a promise of cows coming home
function foo() {
cowsComeHome.then(() => {
// what's my context?
});
} From my code's point of view, the async stack trace ought to have |
@trevnorris Do you have any opinion on this? Looks like async-listener uses convention 1 in its patching of |
Patching Promise was introduced by @hayes. @trevnorris's core API preceded landing ES Promises in V8 (and the concomitant challenge of handling the new microtask queue). |
@overlookmotel Fixed. Thanks for the heads up. |
I'm on my phone in a car, so please excuse any typos. The most compelling reason we had for choosing 1 over 3 was for libraries doing some sort of monitoring or introspection (async stack traces, New Relic, etc). In those cases, the thing that holds up a promise from Option 1 provides causality at a fairly low level, option 3 seems to be much closer to capturing user intent. function login(id) {
dB.first({id}).from('users').then(setLastLogin)
} With option 1, setLastLogin was called as a result of getting the user from the database, with option 3 it was called because login was called. In the context of a library used for state management in applications (as opposed to monitoring tools) I do mostly agree with @domenic and async functions are a very compelling argument for using option 3. Basically with option 1 callbacks are continually nested inside eachother, while with option 3, the "stack" is reset for each then. option 3 is most inline with async/await, and when compared to how a sync equivalent would work, the call stack would be reset between each call the await/then. I still think option 1 makes sense in some contexts, especially monitoring, but I don't think that is the majority use case for CLS. |
I have to agree that option 3 is required for async-await. I argue that the coroutine-style async paradigm provided by await has vastly different use cases from promises; The former is good at hiding the complexity of callbacks entirely, whilst maintaining context in a traditional call stack model; The latter is useful for transforming the context via middleware. |
I disagree strongly that async/await and promises are meant to be used differently. My article linked previously ("You're Missing the Point of Promises") tries to make this clear. |
This module was originally created for the monitoring purpose, and it's even called continuation-local-storage, implying continued availability of context data into the future. Maybe there needs to be another module called context-local-storage, or something like that, which implements the third style? |
I disagree that anything but 3 correctly gives correct continual availabilityof context data into the future. |
Unless I'm misunderstanding your thinking, the third approach would mean that data added within the processing of a yielded promise would not be available in the context that did the yielding, after the yield completes. This is exactly what is needed for the monitoring use. It's less of a compartmentalized context tree and more of a sequence of transformations over time to one greater context. In most application performance monitoring cases, the "greater context" is an entire request. |
Not exactly, contexts are nested, contexts that are active before a yielded But that's kinda besides the point, option 3 is definitely wrong for This makes the definition of the 5 events you mentioned earlier pretty Personally I've been very frustrated with the behavior of both option 1 and On Sun, Jun 19, 2016, 7:22 PM Stephen Belanger [email protected]
|
Maybe it'd make sense to store references to the state snapshot at each of the transition points in the async task lifetime, allowing the relevant snapshot to be accessed manually, if you want to look at something different than what the default nesting resolution produces? |
It's an interesting idea, I'm not sure what that would look like. It seems On Sun, Jun 19, 2016, 9:22 PM Stephen Belanger [email protected]
|
That could be possible too--adding a strategy option to the namespace constructor. |
All my application is written in promises chain adn some callback style. |
async-listener is important module for APM tools and for long stack traces. Promises make the concept of a long-stack-trace ambiguous – as you can conceive the `then` callback as a continuation of either the resolution context or the context that queued the `then` callback ([more details][2]). async-listener defaults to the resolution context. This is the wrong default for how we are using promises here, resuling in APM tools like Stackdriver Trace losing context. We work-around the problem by not using the promise after it has resolved. [1]: https://www.npmjs.com/package/async-listener [2]: othiym23/node-continuation-local-storage#64
[async-listener][1] is important module for APM tools and for long stack traces. Promises make the concept of a long-stack-trace ambiguous – as you can conceive the `then` callback as a continuation of either the resolution context or the context that queued the `then` callback ([more details][2]). async-listener defaults to the resolution context. This is the wrong default for how we are using promises here, resuling in APM tools like Stackdriver Trace losing context. We work-around the problem by not using the promise after it has resolved. [1]: https://www.npmjs.com/package/async-listener [2]: othiym23/node-continuation-local-storage#64
[async-listener][1] is important module for APM tools and for long stack traces. Promises make the concept of a long-stack-trace ambiguous – as you can conceive the `then` callback as a continuation of either the resolution context or the context that queued the `then` callback ([more details][2]). async-listener defaults to the resolution context. This is the wrong default for how we are using promises here, resuling in APM tools like Stackdriver Trace losing context. We work-around the problem by not using the promise after it has resolved. [1]: https://www.npmjs.com/package/async-listener [2]: othiym23/node-continuation-local-storage#64
What's the status of this question? Is this still unresolved? |
@reggi Not resolved, but I suppose now pretty irrelevant. I went for convention 3 in my re-write of cls-bluebird. But from my understanding, async_hooks follows convention 1. This implementation of CLS is likely to be retired in favour of a new implementation of CLS based on async_hooks, so I guess convention 1 has won the day. |
Just finished watching this video by Thomas Watson on Instrumenting Node.js in Production, in the video Thomas goes deep into how the Given that |
I would love to see CLS brought up to date with modern versions of Node. However, somebody needs do the work necessary to add (and debug, which is the hard part) that functionality, and it's probably not going to be me. My interests are elsewhere at the moment, and I really don't have the resources necessary to deal with the tricky and technical work required. I'm happy to review PRs and hand out maintainer bits to motivated contributors, but that's about the limits of what I think I can do. Not all of the work is on the CLS side, either. If you're interested, you should take a look at the results in nodejs/benchmarking#181 – right now, the combination of |
With AsyncWrap taking shape, it feels like it could be a good time to determine how CLS (or a CLS-like module that uses AsyncWrap) should work with promises.
After delving into how promises and CLS interact, I've come to the conclusion that there are 3 different ways in which a CLS/Promise shim can work.
All three approaches have their own logic, and they're all incompatible with each other.
It's not clear which is the "correct" way. The behaviour of native promises with CLS (through the shim provided by async-listener) follows one convention, cls-q and cls-bluebird follow another.
The 3 conventions
Here are the 3 different approaches:
Convention 1: Callback style
This is the behavior of native JS Promises.
The context at the end of the last
.then()
callback is maintained for the next.then()
callback. Where and when the.then()
is added to the promise chain is irrelevant.For CLS purposes, the following are treated the same:
i.e. promises are essentially sugar for callbacks, rather than a distinct syntax with different behavior.
If the code inside a
.then()
callback loses CLS context (due to using a queue or similar), then the shim would NOT correct this.On the positive side, it allows a CLS context to be created within a
.then()
callback and the rest of the promise chain that follows runs within that context. This could be useful e.g. for middleware.Convention 2: Follow promise chain
CLS context is set at the time of the promise's creation. Any promises which chain on from another promise inherit the same context.
This is the same as (1) except:
.then()
callback loses context, context is restored for the next.then()
callback.then()
callback, it is NOT maintained for the next.then()
in the promise chainConvention 3: Listener attachment context
CLS context for execution of
.then()
callback is defined at time.then()
is called. This is not necessarily the same context as the previous promise in the chain.Similarly to (2), if a
.then()
callback loses context, this doesn't affect context for the next.then()
in the chain.This appears to be the convention followed by cls-q and cls-bluebird.
Difference between the three
The following code demonstrates the difference between the 3 conventions. It will log "This Promise implementation follows convention X", where X depends on which approach the promise shim takes.
NB With native JS promises you get "This Promise implementation follows convention 1". With cls-q or cls-bluebird you get "This Promise implementation follows convention 3".
Which way is best?
I think this is debatable. It depends on how you conceptualize promises and the control flow they represent.
Convention 1 is the simplest and isn't opinionated about what a promise control flow represents.
Native JS Promises follow this convention, so there's an argument other promise shims should follow the same convention to avoid confusion.
This doesn't cover the common use case of patching where a library like
redis
loses CLS context within it. However, there's a strong separation of concerns argument that a shim for a promise library should just shim the promise library. If another library loses CLS context, then that library should be shimmed. i.e. solve the problem thatredis
loses context withcls-redis
notcls-bluebird
!Convention 2 conceptualizes a promise chain as a set of connected actions.
Imagine multiple tasks running in parallel, each composed of multiple steps e.g. read a file, transform it, write it out again. Each task run is represented by a promise chain.
Now if you want to add an extra step to each of the tasks (e.g. notify a server when task is done), you'd add an extra
.then()
to the end of the promise chain for each task. You would expect each callback to run in the CLS context for that task.Convention 3 conceptualizes a promise chain as a queue.
Imagine a resource which can only be accessed by one process at a time. The queue for access is represented by a promise. When a process finishes accessing the resource, it resolves the promise and the next in the queue (next promise in the chain) then starts up. If you want access to the resource, you add a
.then()
to the promise chain.If a running task (e.g. serving an HTTP request), gets the resource and then continues on with other things, you would expect promises chained on after accessing the resource to execute in the CLS context of the task, NOT the context of the preceding item in the resource queue.
Conclusion
I'm not pushing for one convention over another. I just thought it'd be useful to lay out what I think are the 3 different choices and their implications.
What I do suggest is that if there's some consensus on which convention is best, this be set out in a set of tests, so everyone can be sure that the cls-promise implementation they're using is compliant.
It would also clear up what's a bit of an ambiguity - there's been some confusion for example here: TimBeyer/cls-bluebird#1 (comment).
I've made a start on a test suite here: https://github.com/overlookmotel/cls-bluebird-test
Anyone have any thoughts on this?
The text was updated successfully, but these errors were encountered: