We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我看你代码里面,词长设定最大是4。我觉得这样有点问题。古汉语里面最大词长设置成2或者3更合适。可能有一些称呼是三音节,比如“秦穆公”。其他的词基本上都是单音节的,双音节的占一定的比例,但是不多。最大词长设置成4的话,情况就是分出来的四音节的都不是词。
The text was updated successfully, but these errors were encountered:
你好!非常理解你的考量。本项目之所以选择四字词为分词上限,是因为:
Sorry, something went wrong.
No branches or pull requests
我看你代码里面,词长设定最大是4。我觉得这样有点问题。古汉语里面最大词长设置成2或者3更合适。可能有一些称呼是三音节,比如“秦穆公”。其他的词基本上都是单音节的,双音节的占一定的比例,但是不多。最大词长设置成4的话,情况就是分出来的四音节的都不是词。
The text was updated successfully, but these errors were encountered: