Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lemmatizer creation and deployment with tagger #407

Closed
kirdin opened this issue Aug 28, 2018 · 2 comments
Closed

Lemmatizer creation and deployment with tagger #407

kirdin opened this issue Aug 28, 2018 · 2 comments
Assignees

Comments

@kirdin
Copy link

kirdin commented Aug 28, 2018

I think that the researchers who need morphological analysis in their tasks may be split up into two groups:

  1. those who need only lemmatizer (for dimension reduction etc.);
  2. those who need both lemmatizer and tagger.

To the best of my knowledge, there are really few applied tasks which require tagging only.

While those people who need both lemmatizer and tagger may not use your tagger due to the fact that except for your tagger they need a lemmatizer; in the mean time, all of the (good) lemmatizers for Russian (pymorphy2, mystem, rnnmorph) support tagging as well.

As a consequence, most researchers might choose another tagger due to lemmatization absence in yours.

Besides, as soon as your tagger seems to use pymorphy2 features, you could take pymorphy2 lemmas for the beginning (perhaps it is computationally easy), and then change pymorphy2 lemmatizer for your own.

@my-master
Copy link
Contributor

Hi! There is already lemmatizer for Russian language in the library, based on pymorphy2.
We will consider combining tags and lemmas in a single class soon.

@IgnatovFedor
Copy link
Collaborator

Closed due to the removal of the morphotagger from the library.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants