
Extra disk space usage when scheduling multiple large background uploads to S3 #5022

Closed
tspop opened this issue Oct 30, 2023 · 2 comments
Labels: pending-community-response (Issue is pending response from the issue requestor), question (General question)

Comments

tspop commented Oct 30, 2023

Let's say I want to schedule background uploads for 5 large files, each being 1GB in size.

Will the S3 SDK divide these files into smaller chunks before initiating the upload, potentially utilizing an additional 5GB of storage space?
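
For reference, I schedule the uploads roughly like this (a minimal sketch using Amplify Swift's Storage.uploadFile; the keys and file URLs are placeholders):

```swift
import Amplify

// Schedule uploads for 5 local files of ~1 GB each.
// Keys and file URLs are placeholders for illustration.
func scheduleUploads(fileURLs: [URL]) async throws {
    for (index, fileURL) in fileURLs.enumerated() {
        let uploadTask = Amplify.Storage.uploadFile(
            key: "large-file-\(index)",
            local: fileURL
        )
        let uploadedKey = try await uploadTask.value
        print("Finished uploading \(uploadedKey)")
    }
}
```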

phantumcode (Contributor) commented Oct 30, 2023

@tspop Thanks for submitting your question. When uploading files larger than 5 MB, Amplify does rely on temporary local caching: it splits the file being uploaded into smaller 5 MB chunks, so additional disk space is required to temporarily store the chunked parts while they are uploaded.
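
As a rough back-of-the-envelope illustration of the extra space involved, assuming a 5 MB part size and the example above of five 1 GB files:

```swift
let partSize: Int64 = 5 * 1024 * 1024      // 5 MB per part
let fileSize: Int64 = 1024 * 1024 * 1024   // 1 GB per file
let fileCount: Int64 = 5

let partsPerFile = (fileSize + partSize - 1) / partSize // ceiling division: 205 parts
let extraBytes = fileSize * fileCount                   // ~5 GB of temporary part files
print("parts per file: \(partsPerFile), temporary space: \(extraBytes) bytes")
```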

phantumcode added the question and pending-community-response labels on Oct 30, 2023

zamzamfp commented Sep 17, 2024

Hi @phantumcode, we are facing an issue with the AWS iOS SDK when using multipart upload for large files (e.g., 10 GB). The SDK chunks the entire file into smaller parts at the beginning, before the upload even starts. This behaviour requires the device to have enough additional storage space (e.g., an extra 10 GB) to accommodate the chunked copies. It also significantly delays the upload, as the chunking can take up to a minute to complete, with no clear indication that it is happening unless debug logging is enabled.

Questions:

  1. Is there a technical reason the chunking process has to occur entirely before the upload starts, rather than chunking parts on the fly (i.e., chunk the first parts, upload them, and then proceed with the rest, as sketched after this list)?
  2. Are there any plans to change this behavior in the future to improve efficiency?
  3. Is there a way to track the progress of the chunking process so we can communicate it in the UI?
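
To illustrate question 1, here is a rough sketch of what on-the-fly chunking could look like (hypothetical code using FileHandle; `uploadPart` is a placeholder for the S3 UploadPart call, not an actual SDK API, and the progress callback also covers question 3):

```swift
import Foundation

let partSize = 5 * 1024 * 1024 // 5 MB per part

// Hypothetical sketch: read and upload one part at a time instead of
// chunking the whole file up front. `uploadPart` stands in for the real
// S3 UploadPart call and is not part of the SDK's public API.
func uploadInParts(fileURL: URL,
                   uploadPart: (Data, Int) async throws -> Void,
                   onProgress: (Double) -> Void) async throws {
    let fileSize = (try FileManager.default
        .attributesOfItem(atPath: fileURL.path)[.size] as? NSNumber)?.intValue ?? 0
    let handle = try FileHandle(forReadingFrom: fileURL)
    defer { try? handle.close() }

    var partNumber = 1
    var bytesUploaded = 0
    // Each part is uploaded as soon as it is read, so at most one
    // part-sized buffer lives in memory and no full copy hits the disk.
    while let part = try handle.read(upToCount: partSize), !part.isEmpty {
        try await uploadPart(part, partNumber)
        bytesUploaded += part.count
        partNumber += 1
        onProgress(Double(bytesUploaded) / Double(fileSize)) // question 3: progress
    }
}
```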
