Replies: 5 comments
-
>>> othiele |
Beta Was this translation helpful? Give feedback.
-
>>> othiele |
Beta Was this translation helpful? Give feedback.
-
>>> Clockworker |
Beta Was this translation helpful? Give feedback.
-
>>> Clockworker |
Beta Was this translation helpful? Give feedback.
-
>>> othiele |
Beta Was this translation helpful? Give feedback.
-
>>> Clockworker
[January 14, 2021, 8:00pm]
Hi,
We are students currently doing their Bachelor's Degree and during
Software Testing classes our task was to test an open-source project. A
few months ago I asked here what could have use some testing. I was glad
to receive a message and we've started working. Trying our best, we have
made an analysis of practical usage of hot-word feature and DeepSpeech
default model's accuracy, so you determine it's practical quality for
many scenarios.
Full report:
deepspeech_test_report.pdf
(426,0 KB)
Short summary for 250 different audio files with taggings:
scientific words was the most accurate. (95.6%)
there were more files for male speech in this combination or input
files may have been just a little bit harder for a model to
understand. (94.0%)
and 82.5% common speech) where lecture speech of non-accented
speaker is above 94.6%
in our data set for those that contained at least one of them), note
that if we used more proper nouns then this accuracy difference
could have been higher. It all depends on number of those words;
however, this serves as a proof that in fact the difference is real
and should be considered.
male common speech (87.3% female, 91.4% male).
males speaking in common voice and with scientific words accuracy
was: 86.4%, while the same tag combination but without scientific
words achieved 91.4% accuracy. That gives 5% drop.
As for hot-words feature:
behavior. Probably because it doesn't appear in word detection
mechanism and is not modified.
everything else that comes after that word, because of letter
splitting bug. Example: 'okay google'.
but be careful of this word to appear as a splitted one: 'another'
- slash > 'an other' or as a word of similar sound: 'gold' - slash > 'god'.
add a very small priority and it could work.
the given hot-word cause no change if the audio doesn't include the
sound of that word.
hot-words were detected.
Any opinions would be appreciated.
We are really glad that we've made this far and it was an honor for us
to anyhow support this great project
[How can I know what boost value to give for a particular hot
word
[This is an archived TTS discussion thread from discourse.mozilla.org/t/practical-tests-of-hot-word-feature-and-default-models-accuracy]
Beta Was this translation helpful? Give feedback.
All reactions