improvement: Add cancellation support for long running queries #1271

rpallares · 2024-07-02T08:34:35Z

Hi,

I open this PR to support cancellation for long running queries.
It's inpired from #423.

The root problem is related to PageSize feature that imply synchornous queries.
This force us to not use it and expose directly IAsyncEnumerable.
The counterpart is clients are able to totally dump our database, and if not wanted, it's still nice to cancel thé request.
That issue force to not apply Security guidance for ASP.NET Core Web API OData suggestions to not sacrifice performance.

To reduce a bit risks a bit cancellation must be supported to at least reduce ressource consumption when the client timeout or cancel on long running queries.

The PageSize issue should be also fixed.

rpallares · 2024-07-02T08:35:41Z

I'm not sure how I could test that but I'm open to suggestions

rpallares · 2024-07-02T08:36:41Z

@microsoft-github-policy-service agree

habbes · 2024-07-04T12:32:31Z

@rpallares could you create a separate issue describing the PageSize issue in detail?

rpallares · 2024-07-04T19:45:09Z

@rpallares could you create a separate issue describing the PageSize issue in detail?

Hi, I'm note sure it's necessary to create a new one because all these existing issues already details that comportement:

There's probably more related as #423 seems to come from very old issue and even older pr #324

I also saw references inside https://github.com/OData/WebApi

Returning an IQueryable from EF core 6 generates synchronous database calls WebApi#2598
ODataQueryOptions forces materialization of the IQueryable when PageSize is set WebApi#886

If any brealing change is required, odata team should really consider add it in the next major version.

Thanks

habbes · 2024-08-01T14:45:38Z

@rpallares

I'm not sure how I could test that but I'm open to suggestions

Here's a suggestion of how you could test this. The general idea is to pass a cancellation token for which you have the CancellationTokenSource, then when you run the operation you want to abort, cancel the token and assert that an OperationCanceledException was thrown (assuming you're handling the cancellation by throwing that exception).

Look at the ODataResourceSetSerializerTests for examples of tests targeting the ODataResourceSetSerializer.

First, I think it will be easier to create unit tests if the ODataSerializerContext.CancellationToken property is settable. You can set it to the Request.HttpContext.RequestAborted token in the constructor, or if it's unset, but I think it would be easier not to fully couple it to the request.

Create a test async enumerable data source that will generate the sample data to serialize and will handle the cancellation token that will be passed by the serializer:

private static async IAsyncEnumerable<Address> GetAsyncAddressesDataSource(
    [EnumeratorCancellation] CancellationTolen cancellationToken = default)
{
    for (int i = 0; i < 1000; i++)
    {
        yield return new Address { City = "City" };
        await Task.Yield();
        cancellationToken.ThrowIfCancellationRequested();
    }
}

Ideally the enumeration will be cancelled before it runs through all those iterations. If we abort the request before we start serializing, then we're guaranteed the enumeration will be cancelled (if cancellation token is propagated correctly). If we cancel the token concurrently with serialization, it's possible it might run through the whole loop before cancellation is triggered even if the code is correct. So this could be a flaky test if we do it that way, but the Task.Yield() and the large number of iterations aim to avoid that. We could also use a while (true) but that would hang the test instead of failing if cancellation is not triggered.

Now we'll need a way to create trigger the cancellation of the token from the test. One way to do this would be to create an instance of CancellationTokenSource then pass its token to the ODataSerializerContext.Token property. However, in your PR, this property is read-only, and gets the token from the request. We can easily get around this limitation by making the ODataSerializerContext.Token property settable. So we can override the token in the test. That would work fine, but would leave a gap in the test, in that we won't be verifying that it's the cancellation of the HttpRequest the serialization to cancel. Maybe we could test the follow of the request's cancellation token to the serializer context separately to close this gap.

Alternatively, we could inject an override the cancellation token in the Request.HttpContext. This is a bit more involved, but I think it's worth it.

First, we use the RequestFactory.Create() helper to create a new request. We could simply pass this request to the serializer context in the test. And our test code would look like so:

var request = RequestFactory.Create();

// ...
var addresses = GetAsyncAddressesDataSource();
var type = typeof(IAsyncEnumerable<Address>);
ODataSerializationContext writeContext = new ODataSerializationContext
{
    Model = model,
    Type = type ,
    Request = request
};

// in this example we abort before we run serialization
writeContext.Request.HttpContext.Abort();
await Assert.ThrowAsync<OperationCanceledException>(() => serializer.WriteObjectAsync(addresses, type, writer, writeContext);

In the example above we abort the request before serialization starts, so we're guaranteed the enumeration will be cancelled if the code is implemented correctly. We could also abort after the serialization has started, which I think is more representative of real-world use cases. While technically flaky, I think that's unlikely to manifest in practice given the large number of iterations in the loop:

var serializationTask = serializer.WriteObjectAsync(addresses, type, writer, writeContext);
request.HttpContext.Abort();
await Assert.ThrowsAsync<OperationCanceledException>(() => serializationTask);

This test will fail because this test HttpContext doesn't actually abort anything. We have to override this.
Here's the current implementation of RequestFactory.Create:

public static HttpRequest Create()
{
    HttpContext context = new DefaultHttpContext();
    return context.Request;
}

I've looked at the implementation of DefaultHttpContext and found that you can customize its behaviour by injecting custom "features". To override the cancellation token, we can customize the IHttpRequestLifetimeFeature by implementing our own like so:

private class TestRequestLifeTimeFeature : IHttpRequestLifetimeFeature
{
    private CancellationTokenSource _cts = new CancellationTokenSource();

    public CancellationToken RequestAborted { get => _cts.Token; set => throw new NotImplementedException(); }

    public void Abort()
    {
        _cts.Cancel();
    }
}

This could be an inner helper class inside RequestFactory.

And finally, we can update the request factory to inject this feature:

public static HttpRequest Create()
{
    FeatureCollection features = new();
    features.Set<IHttpRequestLifetimeFeature>(new TestHttpRequestLifetimeFeature());
    HttpContext context = new DefaultHttpContext(features);
    return context.Request;
}

WanjohiSammy · 2024-08-08T13:32:39Z

@rpallares

Please add the tests like the ones suggested by @habbes above #1271 (comment)

rpallares · 2024-08-10T08:00:55Z

@rpallares

Please add the tests like the ones suggested by @habbes above #1271 (comment)

Hi, yes. @habbes gave great very detailled suggestions.
Sadly I don't have time to do it quickly.
Don't hesitate to propose pr either. I really believe the two issues pointes here are critical

rpallares · 2024-08-28T12:14:09Z

I changed a bit against @habbes suggested to be able to correctly dispose the CancellationTokenSource.
I believe that WE should refactor a bit some of RequestFactory Create method to share the base Create method that now take an optional cancellationToken. But it imply much more changes not used yet, only for testing. So I left as is.
I also target the main branch, a cherry pick will be welcome for the 8.x branch (no conflict expected).

rpallares · 2024-08-28T12:20:26Z

@rpallares could you create a separate issue describing the PageSize issue in detail?

Hi, I'm note sure it's necessary to create a new one because all these existing issues already details that comportement:
* [PageSize attribute causes significant drop in throughput #847](https://github.com/OData/AspNetCoreOData/issues/847)

* [TruncatedCollection does not implement IAsyncEnumerable #253](https://github.com/OData/AspNetCoreOData/issues/253)

* [Cannot avoid using synchronous calls in OData with entity framework #1194](https://github.com/OData/AspNetCoreOData/issues/1194)
There's probably more related as #423 seems to come from very old issue and even older pr #324

I also saw references inside https://github.com/OData/WebApi
* [Returning an IQueryable from EF core 6 generates synchronous database calls WebApi#2598](https://github.com/OData/WebApi/issues/2598)

* [ODataQueryOptions forces materialization of the IQueryable when PageSize is set WebApi#886](https://github.com/OData/WebApi/issues/886)
If any brealing change is required, odata team should really consider add it in the next major version.

Thanks

@habbes , any official news regarding the pagesize topic?
I know you were pretty busy with the major release, but this issue IS really killing scaling performances despite it's a great feature.

…ncellation-support

habbes · 2024-09-05T13:15:30Z

I changed a bit against @habbes suggested to be able to correctly dispose the CancellationTokenSource. I believe that WE should refactor a bit some of RequestFactory Create method to share the base Create method that now take an optional cancellationToken. But it imply much more changes not used yet, only for testing. So I left as is. I also target the main branch, a cherry pick will be welcome for the 8.x branch (no conflict expected).

The change looks good to me, simpler and requires less changes.

habbes · 2024-09-05T14:11:29Z

@rpallares I agree that the page size issue is longstanding issue that we should finally solve. We are looking to invest in more perf improvements to the library and this is one that should definitely be part of that. I can't give an ETA yet, but I'm bringing this to the attention of the team. We also want to get rid of places where synchronous I/O is forced even when the data source is async, and this is one of such bottlenecks.

gathogojr · 2024-09-26T19:14:08Z

Thank you @rpallares for your contribution

Co-authored-by: Rafael Pallares <[email protected]>

improvement: Add cancellation support for long running queries

867adff

ElizabethOkerio requested review from ElizabethOkerio, xuzhg, habbes, marabooy and WanjohiSammy and removed request for ElizabethOkerio August 1, 2024 12:44

Rafael Pallares added 3 commits August 26, 2024 14:42

Merge remote-tracking branch 'origin/main' into add-cancellation-support

784d3c9

add tests

db44bc0

reduce times, add more visible delay on generation

25f2a89

rpallares changed the base branch from release-8.x to main August 27, 2024 14:38

super minor missing configureawait

bc17ed1

Merge remote-tracking branch 'OData/AspNetCoreOData/main' into add-ca…

833e992

…ncellation-support

habbes approved these changes Sep 5, 2024

View reviewed changes

WanjohiSammy approved these changes Sep 12, 2024

View reviewed changes

gathogojr approved these changes Sep 26, 2024

View reviewed changes

gathogojr merged commit e9e9cc4 into OData:main Sep 26, 2024
4 checks passed

rpallares mentioned this pull request Oct 3, 2024

How to Configure a Default PageSize? #243

Open

anasik pushed a commit to anasik/AspNetCoreOData that referenced this pull request Oct 28, 2024

Add cancellation support for long running queries (OData#1271)

300b327

Co-authored-by: Rafael Pallares <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improvement: Add cancellation support for long running queries #1271

improvement: Add cancellation support for long running queries #1271

rpallares commented Jul 2, 2024 •

edited

Loading

rpallares commented Jul 2, 2024

rpallares commented Jul 2, 2024

habbes commented Jul 4, 2024

rpallares commented Jul 4, 2024

habbes commented Aug 1, 2024 •

edited

Loading

WanjohiSammy commented Aug 8, 2024

rpallares commented Aug 10, 2024

rpallares commented Aug 28, 2024

rpallares commented Aug 28, 2024

habbes commented Sep 5, 2024

habbes commented Sep 5, 2024

gathogojr commented Sep 26, 2024

improvement: Add cancellation support for long running queries #1271

improvement: Add cancellation support for long running queries #1271

Conversation

rpallares commented Jul 2, 2024 • edited Loading

rpallares commented Jul 2, 2024

rpallares commented Jul 2, 2024

habbes commented Jul 4, 2024

rpallares commented Jul 4, 2024

habbes commented Aug 1, 2024 • edited Loading

WanjohiSammy commented Aug 8, 2024

rpallares commented Aug 10, 2024

rpallares commented Aug 28, 2024

rpallares commented Aug 28, 2024

habbes commented Sep 5, 2024

habbes commented Sep 5, 2024

gathogojr commented Sep 26, 2024

rpallares commented Jul 2, 2024 •

edited

Loading

habbes commented Aug 1, 2024 •

edited

Loading