>>> KieranGill
[December 11, 2020, 8:08pm]
I want to confirm that I understand the DeepSpeech source correctly. My
objective is to modify the source to get word- and character-level
confidence.
The candidate transcript responses only seem to return confidence at
the transcript level. This decode function seems to only aggregate
scores instead of including word/character-level confidence. However,
I am having a little trouble understanding the scope of a prefix. Is a
prefix meant to be synonymous with a candidate transcript, or is its
scope supposed to be just a few words?
Also, the LM's scorer function scores max_order words at a time,
correct? So in order to get word-level confidence when using an LM,
max_order would have to be set to 1, right?
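To make concrete what I mean by word- and character-level confidence, here is a minimal sketch. It uses greedy (best-path) CTC decoding with no LM, not the beam search in the DeepSpeech decoder, and everything in it is my own assumption for illustration: the names ALPHABET, greedy_ctc_with_confidence and word_confidences are hypothetical, and so are the definitions of character confidence (highest frame probability in the run that emitted the character) and word confidence (lowest character confidence in the word).

```python
from typing import List, Tuple

# Hypothetical alphabet for illustration only; the CTC blank is the last class.
ALPHABET = [" ", "a", "b", "c"]
BLANK = len(ALPHABET)


def greedy_ctc_with_confidence(probs: List[List[float]]) -> List[Tuple[str, float]]:
    """Best-path CTC decode returning (character, confidence) pairs.

    probs[t][k] is the softmax probability of class k at time step t.
    Assumed definition: a character's confidence is the highest probability
    among the consecutive frames that emitted it.
    """
    chars: List[Tuple[str, float]] = []
    prev = BLANK
    for frame in probs:
        best = max(range(len(frame)), key=frame.__getitem__)
        p = frame[best]
        if best == BLANK:
            # A blank ends the current run, so the next symbol starts fresh.
            prev = BLANK
            continue
        if best == prev:
            # Same symbol repeated without a blank: merge into the last run.
            ch, conf = chars[-1]
            chars[-1] = (ch, max(conf, p))
        else:
            chars.append((ALPHABET[best], p))
        prev = best
    return chars


def word_confidences(chars: List[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Split on spaces; assumed word confidence = min character confidence."""
    words: List[Tuple[str, float]] = []
    current, confs = "", []
    for ch, conf in chars:
        if ch == " ":
            if current:
                words.append((current, min(confs)))
            current, confs = "", []
        else:
            current += ch
            confs.append(conf)
    if current:
        words.append((current, min(confs)))
    return words
```

What I am after is the equivalent of these per-character and per-word values, but coming out of the beam search decoder with the LM scorer enabled.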
[This is an archived TTS discussion thread from discourse.mozilla.org/t/extending-ctc-decoder-to-output-word-and-character-level-confidence]