grammophone/Grammophone.EnnounInference

Grammophone.EnnounInference

This .NET library is a part-of-speech tagging and lemmatization system built on a two-layer approach. The first layer proposes possible features for each word form in a sentence. The second layer takes into account the context of the word forms in the sentence and elects the most probable features, which then become the keys for estimating the lemmata corresponding to the forms.
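
The two-layer idea can be illustrated with a small, language-neutral sketch. The Python below is not the library's API and does not reflect its actual models: the candidate table, the bigram scores, the lemma table and the names `CANDIDATES`, `TRANSITION`, `LEMMATA`, `elect_tags` and `lemmatize` are all hypothetical, and a Viterbi-style search over toy scores stands in for whatever the second layer really uses. It only shows the flow: propose candidates per form, elect one per form using context, then use the elected tag as the key for the lemma.

```python
from typing import Dict, List, Tuple

# Layer 1 (toy stand-in): candidate tags proposed for each word form.
CANDIDATES: Dict[str, List[str]] = {
    "the": ["DET"],
    "old": ["ADJ", "NOUN"],
    "man": ["NOUN", "VERB"],
    "leaves": ["VERB", "NOUN"],
}

# Layer 2 (toy stand-in): log-scores for adjacent tag pairs (higher is better).
TRANSITION: Dict[Tuple[str, str], float] = {
    ("<s>", "DET"): -0.5,
    ("DET", "ADJ"): -1.0,
    ("DET", "NOUN"): -0.7,
    ("ADJ", "NOUN"): -0.3,
    ("NOUN", "VERB"): -0.5,
    ("NOUN", "NOUN"): -2.5,
    ("VERB", "NOUN"): -1.5,
    ("VERB", "VERB"): -4.0,
}
DEFAULT_SCORE = -5.0  # assumed score for tag pairs missing from the table

# Lemma lookup keyed by (form, elected tag); unknown keys fall back to the form.
LEMMATA: Dict[Tuple[str, str], str] = {
    ("leaves", "VERB"): "leave",
    ("leaves", "NOUN"): "leaf",
}


def elect_tags(forms: List[str]) -> List[str]:
    """Pick one candidate tag per form by a Viterbi-style search over the sentence."""
    # best maps a tag to (score, path) of the best sequence ending in that tag.
    best: Dict[str, Tuple[float, List[str]]] = {"<s>": (0.0, [])}
    for form in forms:
        step: Dict[str, Tuple[float, List[str]]] = {}
        for tag in CANDIDATES.get(form, ["UNK"]):
            score, path = max(
                (
                    (prev_score + TRANSITION.get((prev_tag, tag), DEFAULT_SCORE), prev_path)
                    for prev_tag, (prev_score, prev_path) in best.items()
                ),
                key=lambda item: item[0],
            )
            step[tag] = (score, path + [tag])
        best = step
    return max(best.values(), key=lambda item: item[0])[1]


def lemmatize(forms: List[str]) -> List[Tuple[str, str, str]]:
    """Return (form, elected tag, lemma) triples for a sentence."""
    tags = elect_tags(forms)
    return [(form, tag, LEMMATA.get((form, tag), form)) for form, tag in zip(forms, tags)]


result = lemmatize(["the", "old", "man", "leaves"])
```

In this toy sentence the form "leaves" has two candidate tags with different lemmata, so the lemma cannot be chosen until the context-aware layer has elected a tag.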

The library is language-agnostic: it depends on providers honoring the contract enforced by Grammophone.LanguageModel to define the available languages and their corresponding training sets. Nevertheless, it is designed to deal with the computational challenges posed by languages with rich morphology, and it is currently used on Ancient Greek texts.

The central object type of the library is the InferenceResource. Each instance is associated with a LanguageProvider defined via the Grammophone.LanguageModel system, and performs inference on a sentence through the SentenceClassifier exposed by its property of the same name. The available InferenceResource instances are registered in the static property InferenceEnvironment.InferenceResources. They can be trained and saved using the corresponding methods. Training uses the data sets defined in the static properties InferenceEnvironment.Setup.TrainingSets and InferenceEnvironment.Setup.ValidationSets. Previously trained and saved instances can be loaded using the Load method of the InferenceResourceProvider instances registered in the static property InferenceEnvironment.Setup.InferenceResourceProviders.
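
As a rough picture of how these pieces relate, the following Python sketch models the train, save, load and register lifecycle with toy stand-ins. It is only an analogy under stated assumptions, not the library's C# API: apart from the type names borrowed from the text above, every member here (`language_key`, `model`, `train`, `save`, `load`, the `STORAGE` dictionary and the environment attributes) is hypothetical, and "training" merely memorizes form-tag pairs.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Hypothetical persistent store; the real library's storage is not described here.
STORAGE: Dict[str, Dict[str, str]] = {}


@dataclass
class InferenceResource:
    """Toy stand-in for a per-language resource holding trained state."""
    language_key: str  # stands in for the associated LanguageProvider
    model: Dict[str, str] = field(default_factory=dict)

    def train(self, training_set: List[Tuple[str, str]]) -> None:
        # Assumption: "training" just memorizes form -> tag pairs.
        self.model = dict(training_set)

    def save(self) -> None:
        # Persist a copy of the trained state under the language key.
        STORAGE[self.language_key] = dict(self.model)


@dataclass
class InferenceResourceProvider:
    """Toy stand-in for an object that loads a previously saved resource."""
    language_key: str

    def load(self) -> InferenceResource:
        return InferenceResource(self.language_key, dict(STORAGE[self.language_key]))


class InferenceEnvironment:
    """Toy stand-in for the static registries mentioned in the text."""
    training_sets: Dict[str, List[Tuple[str, str]]] = {}
    resource_providers: Dict[str, InferenceResourceProvider] = {}
    inference_resources: Dict[str, InferenceResource] = {}


# Train and save once...
InferenceEnvironment.training_sets["grc"] = [("logos", "NOUN"), ("legei", "VERB")]
fresh = InferenceResource("grc")
fresh.train(InferenceEnvironment.training_sets["grc"])
fresh.save()

# ...then, in a later session, load through the registered provider.
InferenceEnvironment.resource_providers["grc"] = InferenceResourceProvider("grc")
loaded = InferenceEnvironment.resource_providers["grc"].load()
InferenceEnvironment.inference_resources["grc"] = loaded
```

The point of the split is that training is expensive and happens rarely, while loading a saved resource through a provider is the normal path for applications that only need inference.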

The above is summarized in the following UML diagram.

[Figure: Inference setup (UML diagram)]

This project relies on the following projects being in sibling directories:
