merge_unicharsets - Simple tool to merge two or more unicharsets.
merge_unicharsets(1) is a simple tool to merge two or more unicharsets. It could be used to create a combined unicharset for a script-level engine, like the new Latin or Devanagari.
- unicharset-in-1
-
(Input) The name of the first unicharset file to be merged.
- unicharset-in-n
-
(Input) The name of the nth unicharset file to be merged.
- unicharset-out
-
(Output) The name of the merged unicharset file.
Main web site: https://github.com/tesseract-ocr
Information on training tesseract LSTM: https://tesseract-ocr.github.io/tessdoc/TrainingTesseract-4.00.html