How does deepspeech discriminate between speech music #46

JRMeyer · 2021-03-08T00:26:06Z

>>> vivek.mangipudi13
[December 19, 2017, 6:42am]

Say I'm recording a radio DJ, and the resulting audio file contains:
music --- some music --- speech/voice --- music --- speech --- speech ---
speech --- hold music --- end of audio
(Here speech and music are assumed to be non-overlapping or only
marginally overlapping.)

Q1. How do I ignore the non-speech and extract only the speech portions
of the audio? I.e., I want my final audio to contain only the speech
portions.

Q2. How does DeepSpeech currently handle music when doing speech
recognition?

Q3. Is there any pre-trained model for detecting the onset of speech or
the speech portions of an audio file?

[This is an archived TTS discussion thread from discourse.mozilla.org/t/how-does-deepspeech-discriminate-between-speech-music]

JRMeyer · 2021-03-08T00:26:09Z

>>> reuben
[December 19, 2017, 9:14am]

1. Use a Voice Activity Detection (VAD) tool.

2. It doesn't. Transcription results for music will not make sense.

3. There are several available VAD tools. The WebRTC project has one,
for example. There's a topic here on Discourse that mentions other
tools, but I don't remember where it is.
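
To make point 1 concrete, here is a minimal sketch of frame-level filtering, assuming 16-bit mono PCM input. The helper names (`split_frames`, `keep_speech`, `webrtc_classifier`, `energy_classifier`) are made up for illustration and are not part of DeepSpeech. `webrtc_classifier` assumes the third-party `webrtcvad` Python package, which accepts 10, 20, or 30 ms frames at 8, 16, 32, or 48 kHz. `energy_classifier` is only a crude loudness stand-in so the sketch runs without that package; it is not a real VAD and loud music would pass it.

```python
import struct
from typing import Callable, List

# A classifier takes (frame_bytes, sample_rate) and returns True for speech.
FrameClassifier = Callable[[bytes, int], bool]


def split_frames(pcm: bytes, sample_rate: int, frame_ms: int = 30) -> List[bytes]:
    """Split 16-bit mono PCM into fixed-length frames; a trailing partial frame is dropped."""
    frame_bytes = sample_rate * frame_ms // 1000 * 2  # 2 bytes per sample
    return [pcm[i:i + frame_bytes]
            for i in range(0, len(pcm) - frame_bytes + 1, frame_bytes)]


def keep_speech(pcm: bytes, sample_rate: int, is_speech: FrameClassifier,
                frame_ms: int = 30) -> bytes:
    """Concatenate only the frames that the classifier flags as speech."""
    frames = split_frames(pcm, sample_rate, frame_ms)
    return b"".join(f for f in frames if is_speech(f, sample_rate))


def webrtc_classifier(aggressiveness: int = 2) -> FrameClassifier:
    """Classifier backed by WebRTC VAD (assumes `pip install webrtcvad`)."""
    import webrtcvad  # imported lazily so the rest of the sketch has no dependency
    vad = webrtcvad.Vad(aggressiveness)  # 0 = least aggressive, 3 = most
    return vad.is_speech


def energy_classifier(threshold: float = 500.0) -> FrameClassifier:
    """Hypothetical stand-in: flags frames whose mean absolute amplitude exceeds a threshold."""
    def is_loud(frame: bytes, sample_rate: int) -> bool:
        samples = struct.unpack("<%dh" % (len(frame) // 2), frame)
        return sum(abs(s) for s in samples) / len(samples) > threshold
    return is_loud
```

With `webrtcvad` installed, something like `keep_speech(pcm, 16000, webrtc_classifier(3))` would return the retained audio, which could then be passed to the recognizer. How well any given VAD rejects music (as opposed to silence) is something you'd have to check on your own recordings.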

[Archived Post]

JRMeyer · 2021-03-08T00:26:12Z

>>> yv001
[December 19, 2017, 9:41am]

Some VAD tools are mentioned in the topic "Longer audio files with Deep
Speech".

[Archived Post]
