This tool generates binary dictionaries with (.dict) file extension & can be covertable to (.jet) file extension. Using Python, Perl, Sqlite and Java.
- Open the project in Pycharm or any other IDE.
- Browse for "check_frequency_generate_db.py" Open it and in the last line main method is taking filepath as input: Put file path of your "raw_data.txt" file and run it.
- It Generates a sqlite database "EnglishDictionary.sqlite", Open it in Sqlite Browser and export the data as ".csv" file (while exporting put "|" as the separater instead of ",").
- When a csv file is generated, open it inside any text editor lets say in sublime text and replace “ | ” with “ , f=”.
- Save as file in different extension (rawdata.csv to rawdata.combined).
- After that, Open the new (.combined) file in any text editor, Now remove the first line ("word, freq") & Add a header e.g: dictionary=main:en,locale=en,description=English,date=1542212847,version=01.
- Run the following:
java -jar dicttool_aosp.jar makedict -s <filename.combined> -d <filename.dict>
This project is licensed under the MIT License license.
Adnan Chohan
Email: [email protected]