Replies: 3 comments
-
>>> othiele |
Beta Was this translation helpful? Give feedback.
-
>>> othiele |
Beta Was this translation helpful? Give feedback.
-
>>> Clockworker |
Beta Was this translation helpful? Give feedback.
-
[
[[Chagari_Vamsi_Reddy
[January 15, 2021, 8:57am]
Hi,
How can I know what boost value to give for a particular hot word. slash
Example, let's say i want to give willson as a hotword, what would be
the boost value for that?. Any help would be greatly appreciated.
More info: slash
I used client.py (native_client/python/client.py) to test the Deepspeech
by providing an audio file along with the hot word plus a boost value as
well. Deep speech couldn't able to decode the audio correctly, Audio
file is 3 seconds which has 'Fiona walker' as the transcription. I tried
with the various boost values (Ex: 0, 1, 2, -0.5, -1, 0.2) and there is
no impact on the result/outcome, The output is same in both the cases (
with or without the hotword).
Question: slash
I am wondering how to know what boost value to give for a particular
word?, how do I calculate that at run time so that I can give the
correct boost value, which would help decoder to provide accurate
results?.
How I ran Deepspeech on the setup: slash
python3 client.py slash --model deepspeech_data/deepspeech-0.9.3-models.pbmm
slash --scorer deepspeech_data/deepspeech-0.9.3-models.scorer slash --audio
test.wav slash --hot_words 'fiona walker:1.5'
output: slash
DeepSpeech: v0.9.3-0-gf2e9c85 slash
2021-01-14 21:21:51.170266: I
tensorflow/core/platform/cpu_feature_guard.cc:142 slash ] This TensorFlow
binary is optimized with oneAPI Deep Neural Network Library (oneDNN)to
use the following CPU instructions in performance-critical operations:
AVX2 FMA slash
To enable them in other operations, rebuild TensorFlow with the
appropriate compiler flags. slash
2021-01-14 21:21:51.171547: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcuda.so.1 slash
2021-01-14 21:21:51.194164: I
tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:982 slash ] successful
NUMA node read from SysFS had negative value (-1), but there must be at
least one NUMA node, so returning NUMA node zero slash
2021-01-14 21:21:51.194496: I
tensorflow/core/common_runtime/gpu/gpu_device.cc:1716 slash ] Found device 0
with properties: slash
pciBusID: 0000:01:00.0 name: GeForce RTX 2080 SUPER computeCapability:
7.5 slash
coreClock: 1.815GHz coreCount: 48 deviceMemorySize: 7.79GiB
deviceMemoryBandwidth: 462.00GiB/s slash
2021-01-14 21:21:51.194507: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcudart.so.10.1 slash
2021-01-14 21:21:51.195499: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcublas.so.10 slash
2021-01-14 21:21:51.196457: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcufft.so.10 slash
2021-01-14 21:21:51.196619: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcurand.so.10 slash
2021-01-14 21:21:51.197642: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcusolver.so.10 slash
2021-01-14 21:21:51.198195: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcusparse.so.10 slash
2021-01-14 21:21:51.200299: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcudnn.so.7 slash
2021-01-14 21:21:51.200360: I
tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:982 slash ] successful
NUMA node read from SysFS had negative value (-1), but there must be at
least one NUMA node, so returning NUMA node zero slash
2021-01-14 21:21:51.200660: I
tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:982 slash ] successful
NUMA node read from SysFS had negative value (-1), but there must be at
least one NUMA node, so returning NUMA node zero slash
2021-01-14 21:21:51.200908: I
tensorflow/core/common_runtime/gpu/gpu_device.cc:1858 slash ] Adding visible
gpu devices: 0 slash
2021-01-14 21:21:51.397930: I
tensorflow/core/common_runtime/gpu/gpu_device.cc:1257 slash ] Device
interconnect StreamExecutor with strength 1 edge matrix: slash
2021-01-14 21:21:51.397966: I
tensorflow/core/common_runtime/gpu/gpu_device.cc:1263 slash ] 0 slash
2021-01-14 21:21:51.397970: I
tensorflow/core/common_runtime/gpu/gpu_device.cc:1276 slash ] 0: N slash
2021-01-14 21:21:51.398103: I
tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:982 slash ] successful
NUMA node read from SysFS had negative value (-1), but there must be at
least one NUMA node, so returning NUMA node zero slash
2021-01-14 21:21:51.398386: I
tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:982 slash ] successful
NUMA node read from SysFS had negative value (-1), but there must be at
least one NUMA node, so returning NUMA node zero slash
2021-01-14 21:21:51.398640: I
tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:982 slash ] successful
NUMA node read from SysFS had negative value (-1), but there must be at
least one NUMA node, so returning NUMA node zero slash
2021-01-14 21:21:51.398889: I
tensorflow/core/common_runtime/gpu/gpu_device.cc:1402 slash ] Created
TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with
7033 MB memory) - slash > physical GPU (device: 0, name: GeForce RTX 2080
SUPER, pci bus id: 0000:01:00.0, compute capability: 7.5) slash
Loaded model in 0.291s. slash
Loading scorer from files deepspeech-0.9.3-models.scorer slash
Loaded scorer in 0.000138s. slash
Adding hot-words slash
Running inference. slash
2021-01-14 21:21:51.481202: I
tensorflow/stream_executor/platform/default/dso_loader.cc:48 slash ]
Successfully opened dynamic library libcublas.so.10 slash
frontier slash
Inference took 0.443s for 2.950s audio file.
Code related info: slash
I see that the boost value is been used in the 'void
DecoderState::next(const double slash *probs, int time_dim, int class_dim)'
function to improve the score for the corresponding word slash
piece of code: score = ( ext_scorer slash _- slash >get_log_cond_prob(ngram, bos) +
hot_boost ) slash DEEPSPEECH.cdx deepspeech.commands DEEPSPEECH.pages DEEPSPEECH.warc.gz discourse.mozilla.org html-to-markdown.sh shell-conver-html-to-split-posts.sh sorted-deepspeech-posts ext_scorer slash _- slash >alpha;
Thank you
[This is an archived TTS discussion thread from discourse.mozilla.org/t/how-can-i-know-what-boost-value-to-give-for-a-particular-hot-word]
Beta Was this translation helpful? Give feedback.
All reactions