>>> vivek.mangipudi13
[December 19, 2017, 6:42am]
Say I'm recording a radio DJ, and the resulting audio file contains:
music --- some music --- speech/voice --- music --- speech --- speech ---
speech --- hold music --- end of audio
(Here speech and music are assumed to be non-overlapping or only
marginally overlapping.)
Q1. How do I ignore the non-speech and extract only the speech portions
of the audio? I.e., I want my final audio to contain only the speech
portions.
Q2. How does DeepSpeech currently handle music when doing
speech recognition?
Q3. Is there any pre-trained model for detecting the onset of speech, or
the portions of speech, in an audio file?
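To make Q1 concrete: once the speech/music boundaries are known, the extraction step itself is simple. Below is a minimal sketch, assuming the segment labels already come from some upstream detector (which is what Q3 asks about). The `keep_speech` function, the `Segment` tuple, and the `"speech"`/`"music"` labels are hypothetical names chosen for illustration; they are not part of DeepSpeech.

```python
from typing import List, Tuple

# Hypothetical segment format: (start_sec, end_sec, label)
Segment = Tuple[float, float, str]


def keep_speech(samples: List[int], sample_rate: int,
                segments: List[Segment]) -> List[int]:
    """Concatenate only the samples inside segments labeled 'speech'."""
    out: List[int] = []
    for start, end, label in segments:
        if label != "speech":
            continue  # drop music / hold music
        first = int(round(start * sample_rate))
        last = int(round(end * sample_rate))
        out.extend(samples[first:last])
    return out


# Toy data: one sample per second, mirroring the music/speech layout above.
samples = list(range(10))
segments = [
    (0, 2, "music"),
    (2, 5, "speech"),
    (5, 7, "music"),
    (7, 9, "speech"),
    (9, 10, "music"),
]
speech_only = keep_speech(samples, 1, segments)
```

The hard part is producing `segments` in the first place; this sketch only covers the cutting and joining that follows.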
[This is an archived TTS discussion thread from discourse.mozilla.org/t/how-does-deepspeech-discriminate-between-speech-music]