
Memory leak on predictions #6
Closed · mishushakov opened this issue Dec 8, 2020 · 5 comments

Comments

@mishushakov
Contributor

I have noticed that in order to properly finish training, you need a lot of free memory to run the prediction. If you try to save the tensor as a file, the resulting file is going to take up gigabytes.

In my case (#3) you're basically getting 65 MB of data for 242 KB of audio (a 26,346% increase).
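For scale, a rough back-of-envelope (assuming 16-bit mono PCM and an input_size of 100; both assumed, not taken from the issue) lands in the same ballpark:

```python
wav_bytes = 242 * 1024               # ~242 KB WAV on disk
samples = wav_bytes // 2             # int16 PCM -> ~124k samples (header ignored)
input_size = 100                     # assumed window length
expanded = samples * input_size * 4  # one float32 window per sample
print(expanded / 1e6)                # ~49.6 MB -- same order as the observed 65 MB
```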

@GuitarML
Owner

GuitarML commented Dec 8, 2020

Yes, it's doing the same preprocessing that it does before training. Each predicted sample is determined by the previous input_size samples. So for a model with input size 100 and a wav file of 44100 samples (1 second), it creates an array of size (44100, 100), but 99% of those samples are redundant, so there's definitely a better way of handling that. I started a custom dataLoader class which takes a small batch of data, preprocesses/trains on it, then frees up the memory for the next batch. I'm having trouble getting it to train properly though, so I'll share it in case someone wants to try to fix it. The split_data param is basically a workaround because I couldn't get that class working yet.
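A minimal sketch of the windowing described above (illustrative, not the repo's actual code):

```python
import numpy as np

input_size = 100
audio = np.random.randn(44100).astype(np.float32)   # 1 second at 44.1 kHz

# One window of the previous input_size samples per predicted sample:
windows = np.stack([audio[i:i + input_size]
                    for i in range(len(audio) - input_size)])
print(windows.shape)   # (44000, 100): ~100x the raw audio in memory,
                       # even though adjacent rows share 99 of their 100 samples
```

Since the windows overlap almost entirely, something like numpy's np.lib.stride_tricks.sliding_window_view(audio, input_size) would produce the same rows as a zero-copy view, which is one way to avoid the duplication until a downstream op forces a copy.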

@mishushakov
Contributor Author

mishushakov commented Dec 29, 2020

Hey there & happy holidays!
Thanks for the explanation.

Maybe using the TensorFlow audio preparation module (TensorFlow I/O) would resolve the problem?
https://www.tensorflow.org/io/tutorials/audio

@GuitarML
Owner

GuitarML commented Jan 2, 2021

Happy Holidays! That does look helpful; I can probably use something in there. I don't see anything in particular that solves this data preparation problem, though. I think the solution is still getting the custom data loader to work. In the meantime, the split_data param will allow for training with limited RAM.

I've made some progress on a plugin for the LSTM models, I'm really excited for what can be done with that. Lots of good things coming for 2021!

@mishushakov
Contributor Author

Thanks for sharing! I'd love to help out (where I can) after the holidays.

This particular part caught my attention:

The content of the audio clip will only be read as needed, either by converting AudioIOTensor to Tensor through to_tensor(), or through slicing. Slicing is especially useful when only a small portion of a large audio clip is needed.

As far as I understand, the data will be lazy-loaded; however, I wasn't entirely sure if this is what we need.
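For reference, a minimal sketch of that slicing, based on the AudioIOTensor API from the tutorial (the file name is a placeholder):

```python
import tensorflow as tf
import tensorflow_io as tfio

# Nothing is read from disk yet -- AudioIOTensor is lazy:
audio = tfio.audio.AudioIOTensor('input.wav')   # placeholder path
print(audio.shape, audio.rate, audio.dtype)

# Only this range is actually read; the rest stays on disk:
chunk = audio[0:44100]
chunk = tf.cast(chunk, tf.float32) / 32768.0    # assuming int16 PCM
```

So it does avoid loading the whole file at once, but by itself it doesn't remove the window duplication during preprocessing.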

On a side note, Google's Tone Transfer looks very promising: https://sites.research.google/tonetransfer/
I believe they use the technology to create voices for Google Assistant.

Thanks, and let's hope 2021 will be nothing like 2020!

@GuitarML
Owner

GuitarML commented Feb 5, 2021

Update: the Colab notebook has been updated to fix the out-of-memory issue by using a Sequence class to load the data one batch at a time. It also uses MSE for the loss calculation to alleviate issues with the error-to-signal loss with pre-emphasis filter. I'm conducting more tests on the choice of loss function before rolling out this change to the Python scripts.
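For anyone landing here later, the general shape of that fix is a keras.utils.Sequence that builds each batch's windows on the fly (a simplified sketch, not the notebook's exact code; names and defaults are illustrative):

```python
import numpy as np
from tensorflow import keras

class WindowedAudio(keras.utils.Sequence):
    """Builds (batch_size, input_size, 1) windows per batch, so the full
    (n_samples, input_size) array never exists in memory at once."""

    def __init__(self, x, y, input_size=100, batch_size=4096):
        super().__init__()
        self.x = x.astype(np.float32)    # raw 1-D input audio, stored once
        self.y = y.astype(np.float32)    # raw 1-D target audio
        self.input_size = input_size
        self.batch_size = batch_size
        self.n = len(x) - input_size     # number of usable windows

    def __len__(self):
        return int(np.ceil(self.n / self.batch_size))

    def __getitem__(self, idx):
        start = idx * self.batch_size
        stop = min(start + self.batch_size, self.n)
        # Only this batch's windows are materialized:
        xb = np.stack([self.x[i:i + self.input_size]
                       for i in range(start, stop)])
        yb = self.y[start + self.input_size:stop + self.input_size]
        return xb[..., np.newaxis], yb

# model.fit(WindowedAudio(x_train, y_train), epochs=...)  # model compiled with loss='mse'
```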
