feat: implement spec-compliant body mixins #1694

KhafraDev · 2022-10-10T17:27:29Z

Improves performance of .text(), .arrayBuffer(), .json(), and .blob() by 60%.

The next step is to introduce a synchronous FormData parser - similar to what every other runtime has. It doesn't make sense to asynchronously parse the FormData if all of the bytes are in memory.

If a library needs asynchronous parsing:

const response = new Response(fd, { headers: [['content-type', 'multipart/formdata']] })

for await (const chunk of response.body) {
  // write chunk to busboy, for example
}

As mentioned in the issue implementing .formData, it's very inefficient and should never be used on the server.

Firefox - https://github.com/mozilla/gecko-dev/blob/7f3ff3f4d34e7d234da4f5f5b345d2add7f30e95/dom/base/BodyUtil.cpp#L67
Chromium - https://source.chromium.org/chromium/chromium/src/+/main:third_party/blink/renderer/core/fetch/multipart_parser.cc;l=1;bpv=1;bpt=0
Deno - https://github.com/denoland/deno/blob/0cd05d737729b4cfab1d5e22077b3b9ad4ed5e30/ext/fetch/21_formdata.js#L490
Webkit - https://github.com/WebKit/WebKit/blob/f0350d6575c366d884d98f9937e77fe499b93398/Source/WebCore/Modules/fetch/FetchBodyConsumer.cpp#L129

codecov-commenter · 2022-10-10T17:31:30Z

Codecov Report

Base: 93.98% // Head: 94.05% // Increases project coverage by +0.06% 🎉

Coverage data is based on head (149a6ec) compared to base (eead2b8).
Patch coverage: 98.21% of modified lines in pull request are covered.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1694      +/-   ##
==========================================
+ Coverage   93.98%   94.05%   +0.06%     
==========================================
  Files          53       53              
  Lines        4907     4912       +5     
==========================================
+ Hits         4612     4620       +8     
+ Misses        295      292       -3

Impacted Files	Coverage Δ
lib/fetch/body.js	`95.79% <98.18%> (-0.40%)`	⬇️
lib/fetch/dataURL.js	`84.90% <100.00%> (+2.62%)`	⬆️
lib/core/connect.js	`98.27% <0.00%> (-1.73%)`	⬇️
lib/fetch/index.js	`83.64% <0.00%> (+0.18%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

KhafraDev · 2022-10-11T17:23:23Z

I will make a follow up PR for the formData parsing, since it's much harder than everything else.

jimmywarting · 2022-10-12T21:14:27Z

this feels like a worse solution to first consume the hole body first.
an async parser feels like it would be more memory/GC friendlier. You do not always have all the data in the memory.

a .formData() dose not have to be inefficient if blob's could be created with a file system to back them up.

mcollina

lgtm

KhafraDev · 2022-10-13T00:25:23Z

@jimmywarting

this feels like a worse solution to first consume the hole body first.

You do not always have all the data in the memory.

According to the spec, you do have to consume the whole body. See: https://fetch.spec.whatwg.org/#ref-for-fully-reading-body-as-promise%E2%91%A0

With this assumption (that the whole body is in memory), the only logical outcome is that, similar to every other body mixin, the parsing happens synchronously. Every other platform has similarly came up with this solution.

repsac-by · 2022-10-13T07:12:56Z

According to the spec, you do have to consume the whole body. See: https://fetch.spec.whatwg.org/#ref-for-fully-reading-body-as-promise%E2%91%A0

As far as I understand the specification specifies how to give the result, but not how we process it under the hood.

A simple example showing memory consumption when reading the whole body before decoding instead of decoding by chunks.

const { randomFillSync } = require('crypto');
const { ReadableStream } = require('node:stream/web');
const { Request } = require('.');

function stream(length) {
	const buffer = Buffer.alloc(32 * 1024);
	const encoder = new TextEncoder();
	let size = 0;
	return new ReadableStream({
		pull(ctr) {
			const data = encoder.encode(randomFillSync(buffer).toString('base64'));
			ctr.enqueue(data);
			if ((size += data.length) > length) ctr.close();
		}
	})
}

void async function() {
	const request = new Request('http://localhost', {
		method: 'POST',
		duplex: 'half',
		body: stream(256 * 1024 * 1024),
	});

	await request.text();
	console.log(`RSS: ${(process.memoryUsage.rss() / 1024 / 1024).toFixed(2)} MB`);
}();

before

RSS: 365.86 MB

after

RSS: 601.42 MB

* feat: implement spec-compliant body mixins * fix: skip tests on v16.8

feat: implement spec-compliant body mixins

149a6ec

fix: skip tests on v16.8

7799653

ronag approved these changes Oct 10, 2022

View reviewed changes

KhafraDev requested a review from mcollina October 11, 2022 17:22

mcollina approved these changes Oct 12, 2022

View reviewed changes

KhafraDev merged commit 23fbc08 into nodejs:main Oct 13, 2022

KhafraDev deleted the fix-body-mixins branch October 13, 2022 01:12

metcoder95 pushed a commit to metcoder95/undici that referenced this pull request Dec 26, 2022

feat: implement spec-compliant body mixins (nodejs#1694)

4937381

* feat: implement spec-compliant body mixins * fix: skip tests on v16.8

KhafraDev mentioned this pull request Feb 25, 2023

feat: implement 0 dependency streaming multipart/form-data parser #1851

Closed

9 tasks

crysmags pushed a commit to crysmags/undici that referenced this pull request Feb 27, 2024

feat: implement spec-compliant body mixins (nodejs#1694)

fafbd49

* feat: implement spec-compliant body mixins * fix: skip tests on v16.8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement spec-compliant body mixins #1694

feat: implement spec-compliant body mixins #1694

KhafraDev commented Oct 10, 2022 •

edited

Loading

codecov-commenter commented Oct 10, 2022 •

edited

Loading

KhafraDev commented Oct 11, 2022

jimmywarting commented Oct 12, 2022

mcollina left a comment

KhafraDev commented Oct 13, 2022 •

edited

Loading

repsac-by commented Oct 13, 2022

feat: implement spec-compliant body mixins #1694

feat: implement spec-compliant body mixins #1694

Conversation

KhafraDev commented Oct 10, 2022 • edited Loading

codecov-commenter commented Oct 10, 2022 • edited Loading

Codecov Report

KhafraDev commented Oct 11, 2022

jimmywarting commented Oct 12, 2022

mcollina left a comment

Choose a reason for hiding this comment

KhafraDev commented Oct 13, 2022 • edited Loading

repsac-by commented Oct 13, 2022

before

after

KhafraDev commented Oct 10, 2022 •

edited

Loading

codecov-commenter commented Oct 10, 2022 •

edited

Loading

KhafraDev commented Oct 13, 2022 •

edited

Loading