feat(rt): add InputStream adapter for ByteStream #945

aajtodd · 2023-08-31T19:35:38Z

Issue #

Description of changes

Adds a conversion to go from ByteStream to java.io.InputStream

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

lauzadis · 2023-08-31T20:03:59Z

runtime/runtime-core/jvm/src/aws/smithy/kotlin/runtime/io/SdkByteReadChannelJVM.kt

+public fun SdkByteReadChannel.toInputStream(): InputStream = InputAdapter(this)
+
+private const val DEFAULT_READ_BYTES = 8192L
+private class InputAdapter(private val ch: SdkByteReadChannel) : InputStream() {


Should we support mark/reset? I think it's not supported by default (markSupported() defaults to false). It can probably be additive work if requested.

I think I'd vote to make it additive based on customer feedback/requests. Anyone feel different?

Sounds good to me.
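For reference, a minimal sketch of what the default means for callers, using only java.io. `PlainStream` is a hypothetical stand-in for the adapter (it is not the code in this PR). It shows that a plain `InputStream` subclass reports `markSupported() == false`, and that a caller who needs mark/reset can likely get it today by wrapping the stream in a `BufferedInputStream`.

```kotlin
import java.io.BufferedInputStream
import java.io.ByteArrayInputStream
import java.io.InputStream

// Hypothetical stand-in for the adapter: an InputStream subclass that does
// not override markSupported(), so it inherits the default (false).
class PlainStream(private val inner: InputStream) : InputStream() {
    override fun read(): Int = inner.read()
}

fun demoMarkReset(): String {
    val plain = PlainStream(ByteArrayInputStream("hello".toByteArray()))
    check(!plain.markSupported()) // default InputStream behavior

    // BufferedInputStream adds mark/reset on top of any InputStream.
    val buffered = BufferedInputStream(plain)
    check(buffered.markSupported())

    buffered.mark(16)                    // remember the current position
    val first = buffered.read().toChar() // consume one byte
    buffered.reset()                     // rewind to the mark
    val again = buffered.read().toChar() // the same byte is read again
    return "$first$again"
}
```

Under these assumptions, wrapping on the caller side keeps the adapter itself simple, which fits the decision to treat native mark/reset support as additive work.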

ianbotsf · 2023-08-31T19:56:09Z

runtime/runtime-core/jvm/src/aws/smithy/kotlin/runtime/io/SdkByteReadChannelJVM.kt

+    }
+    override fun read(b: ByteArray, off: Int, len: Int): Int {


Nit: Missing line break

ianbotsf · 2023-08-31T20:00:07Z

runtime/runtime-core/jvm/src/aws/smithy/kotlin/runtime/io/SdkByteReadChannelJVM.kt

+    private fun readBlocking(): Long =
+        runBlocking {
+            ch.read(buffer, DEFAULT_READ_BYTES)
+        }


Question: Looks like we only ever read DEFAULT_READ_BYTES (8K) bytes at a time from the channel, even when the read call may have had a len higher than that. Given that runBlocking may be expensive, is it worth passing the requested length through to the channel read? Or making the chunk size configurable?

Why do you feel that runBlocking is expensive?

The len argument is documented as

len - the maximum number of bytes to read.

We could change it, but I chose 8K because that's the underlying segment size used by okio. I'm not sure this will matter all that much in practice, but if you disagree I'm happy to change it.
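To illustrate the `len` contract discussed here, a small sketch using only java.io. `ChunkLimitedStream` and `readUpTo` are hypothetical names, not part of this PR. The stream caps each call at a fixed chunk size, as an adapter reading 8K at a time might, and the helper shows the loop a caller needs when it wants exactly `len` bytes.

```kotlin
import java.io.ByteArrayInputStream
import java.io.InputStream

// Hypothetical stream that never returns more than `chunk` bytes per call,
// mimicking an adapter that pulls from its source in fixed-size chunks.
class ChunkLimitedStream(
    private val inner: InputStream,
    private val chunk: Int,
) : InputStream() {
    override fun read(): Int = inner.read()

    // `len` is only a maximum, so returning fewer bytes is within contract.
    override fun read(b: ByteArray, off: Int, len: Int): Int =
        inner.read(b, off, minOf(len, chunk))
}

// Keep reading until `len` bytes have arrived or the stream ends.
// Returns the number of bytes actually read.
fun readUpTo(stream: InputStream, dest: ByteArray, len: Int): Int {
    var total = 0
    while (total < len) {
        val n = stream.read(dest, total, len - total)
        if (n < 0) break // end of stream
        total += n
    }
    return total
}
```

With a 20,000-byte source and an 8192-byte cap, a single `read` call for 20,000 bytes returns 8192, while `readUpTo` fills the whole buffer over three calls. The trade-off raised in the question is the number of calls, not correctness.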

ianbotsf · 2023-08-31T20:03:09Z

runtime/runtime-core/jvm/src/aws/smithy/kotlin/runtime/io/SdkByteReadChannelJVM.kt

+public fun SdkByteReadChannel.toInputStream(): InputStream = InputAdapter(this)
+
+private const val DEFAULT_READ_BYTES = 8192L
+private class InputAdapter(private val ch: SdkByteReadChannel) : InputStream() {


Question: This adapter works via runBlocking on every read. That will obviously work, but it seems like it may be slow. Would it be better to spin up a background coroutine that handles reads and lives for the lifetime of the adapter?

InputStream.read is a blocking call. runBlocking is designed to bridge blocking and non-blocking worlds (it does so by blocking the current thread). Spinning up a coroutine will result in tying up two threads and/or more context switches to accomplish the same thing.

As far as I understand (could be wrong, ofc) runBlocking starts a new coroutine and blocks the current thread. Doing that on every 8KB chunk sounds slower than starting a new coroutine once. Sure, it'll still block on every read call but the cost of setting up a new coroutine was already paid.

TL;DR I'm not at all worried about the overhead of runBlocking or starting a new coroutine.

runBlocking does start a new coroutine and block the current thread. This is fine, though: coroutines are extremely lightweight. The only benchmarking I can find is here, which would suggest a 140-byte allocation and 100 nanoseconds.

Introducing a background coroutine will likely introduce additional context switches and a more complicated implementation to synchronize the coroutine and blocking read call. It also needs a CoroutineScope to launch into which would also mean taking an additional parameter and dealing with cancellation, error propagation, etc.
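A rough sketch of the per-read `runBlocking` bridge described in this thread. It assumes the kotlinx.coroutines library is on the classpath. `SuspendingSource`, `ArraySource`, and `BlockingAdapter` are hypothetical names that stand in for `SdkByteReadChannel` and the adapter; this is not the implementation merged in the PR.

```kotlin
import java.io.InputStream
import kotlinx.coroutines.delay
import kotlinx.coroutines.runBlocking

// Hypothetical suspending source, standing in for SdkByteReadChannel.
interface SuspendingSource {
    // Reads up to `len` bytes into `dest` at `off`; returns -1 at end of data.
    suspend fun read(dest: ByteArray, off: Int, len: Int): Int
}

// In-memory source that suspends briefly to simulate waiting for data.
class ArraySource(private val data: ByteArray) : SuspendingSource {
    private var pos = 0

    override suspend fun read(dest: ByteArray, off: Int, len: Int): Int {
        delay(1)
        if (pos >= data.size) return -1
        val n = minOf(len, data.size - pos)
        data.copyInto(dest, off, pos, pos + n)
        pos += n
        return n
    }
}

// Each blocking read runs one short coroutine on the calling thread.
// No CoroutineScope parameter or background job is needed.
class BlockingAdapter(private val src: SuspendingSource) : InputStream() {
    override fun read(): Int {
        val one = ByteArray(1)
        val n = read(one, 0, 1)
        return if (n < 0) -1 else one[0].toInt() and 0xff
    }

    override fun read(b: ByteArray, off: Int, len: Int): Int {
        if (len == 0) return 0
        return runBlocking { src.read(b, off, len) }
    }
}
```

In this shape the adapter has no lifecycle of its own to manage. A background-coroutine variant would additionally need a scope, a handoff mechanism between the coroutine and the blocking caller, and explicit cancellation and error propagation, which are the costs named above.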

ianbotsf · 2023-08-31T20:05:33Z

runtime/runtime-core/jvm/test/aws/smithy/kotlin/runtime/content/ByteStreamInputStreamTest.kt

+class ByteStreamBufferInputStreamTest : ByteStreamInputStreamTest(ByteStreamFactory.BYTE_ARRAY)
+class ByteStreamSourceStreamInputStreamTest : ByteStreamInputStreamTest(ByteStreamFactory.SDK_SOURCE)
+class ByteStreamChannelSourceInputStreamTest : ByteStreamInputStreamTest(ByteStreamFactory.SDK_CHANNEL)


Comment: Nice abstraction.

sonarqubecloud · 2023-09-05T15:33:04Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

No Coverage information
0.0% Duplication

feat(rt): add InputStream adapter for ByteStream

66ecf79

aajtodd requested a review from a team as a code owner August 31, 2023 19:35

dump api

051f113

lauzadis approved these changes Aug 31, 2023

ianbotsf approved these changes Aug 31, 2023

aajtodd added 2 commits September 1, 2023 12:17

fix jdk9 only usage

bf14ade

stlyle

3c3033c

aajtodd merged commit 762d583 into main Sep 5, 2023

aajtodd deleted the feat-istream-adapter branch September 5, 2023 15:59
