Training with custom data #23

wanshun123 · 2018-11-19T23:55:09Z

Curious whether others have achieved reasonable results training on custom data. I've tried training the model on data from https://github.com/aomv/voiceloop-in-the-wild-experiments/tree/master/data/donald-trump/data, which consists of audio clips a few seconds long with transcriptions, roughly a couple of hours in total. I made a metadata.csv file in the same format as the LJSpeech dataset.
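For anyone preparing their own data the same way, here is a minimal sketch of writing a metadata.csv in the LJSpeech layout, where each line is `id|transcription|normalized transcription` and the id matches a wav file name without its extension. The function name `write_ljspeech_metadata` and the clip ids are hypothetical, and repeating the text in the third field is an assumption made because no separate normalizer is used here. Which field this repo's preprocessing actually reads is not confirmed in this thread.

```python
from pathlib import Path


def write_ljspeech_metadata(entries, out_dir):
    """Write metadata.csv with one 'id|text|normalized text' line per clip.

    entries: iterable of (clip_id, transcription) pairs (hypothetical input shape).
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / "metadata.csv"
    with open(path, "w", encoding="utf-8", newline="") as f:
        for clip_id, text in entries:
            # The pipe is the field separator, so strip it from the text
            # and collapse any repeated whitespace.
            clean = " ".join(text.replace("|", " ").split())
            # Assumption: no separate normalization step, so the third
            # field simply repeats the cleaned transcription.
            f.write(f"{clip_id}|{clean}|{clean}\n")
    return path
```

A quick check that every line splits into exactly three fields usually catches stray pipes or line breaks in the transcriptions before training starts.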

Although I've trained for several hours with a steadily decreasing loss, the graph suggests the model is not learning properly. After several attempts I've also failed to generate intelligible audio, at least without using a reference audio.

syang1993 · 2018-11-26T08:29:13Z

@wanshun123 Hi, I can't open the data link, so I can't check the quality of the data. I have tried different datasets before and found that it works.

Also, the attention mechanism used in this repo is a very basic one, which is not very good at generating long sentences.

iamanigeeit · 2021-06-15T16:17:23Z

@wanshun123 Did you train using use_gst=False? I have the same issue when use_gst=False but not when True.

@syang1993 In my case the audio seems intelligible, although the quality is not good. I am using the Emotional Speech Dataset from https://hltsingapore.github.io/ESD/download.html

The English data shows similar attention "collapse". The Chinese data is ok.
