Replies: 3 comments
-
>>> bem0302 |
Beta Was this translation helpful? Give feedback.
-
>>> eggonlea |
Beta Was this translation helpful? Give feedback.
-
>>> bem0302 |
Beta Was this translation helpful? Give feedback.
-
>>> bem0302
[April 30, 2019, 3:25pm]
Hi guys, when creating language model with KenLM and I have know that
KenLM use the N-grams model. So I have 2 questions for this:
1. When I build an slash *.arpa file from a text.txt file. Did all the
sentences in the text.txt need to have the length from 3 to 5 words
to get the best LM? Because my text is about 12000 sentences and
more than 80% of them have length about 8-15.
2. I'm using this command to build the slash *.arpa file:
./lmplz --text text.txt --arpa text2.arpa --o 5
. Did I need tochange the value of the last param (currently 5) to some other value
like 3 or 4 based on my data as above ?
[This is an archived TTS discussion thread from discourse.mozilla.org/t/create-language-model-with-kenlm]
Beta Was this translation helpful? Give feedback.
All reactions