Grammophone.LanguageModel

This library abstracts a language so that it can be consumed by Grammophone.EnnounInference, a part-of-speech tagging and lemmatization framework.

It defines the abstract class LanguageProvider, a contract for supplying the resources needed to process text in a given language. It is the root object through which the various aspects of the language are exposed. First, it should provide the language's grammar model, as shown in the accompanying UML diagram.

LanguageProvider derivations should also provide a SentenceBreaker implementation, which instructs the system how to separate sentences and the words within them, and a Syllabizer implementation. The Syllabizer converts words to the syllabic representation that the system requires in order to facilitate machine learning of grammatical features, and it also provides distance metrics between syllables. These are shown in the accompanying UML diagram.

This library also defines the contract for the training sources of a language. As shown in the accompanying diagram, all kinds of sources derive from TrainingSource<T>, where T is the type of item in the stream of training data. For example, TaggedWordTrainingSource is a TrainingSource<TaggedWordForm> and SentenceTrainingSource is a TrainingSource<TaggedSentence>. Training sources can be combined via CompositeTrainingSource<T> and automatically sliced by NFoldTrainingSource<T> to support n-fold validation.
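The idea of combining sources and slicing them into folds can be illustrated with a minimal, self-contained sketch. It does not use the library's actual API, whose members are not listed in this README: ISketchSource<T>, ListSource<T>, CompositeSketchSource<T>, NFoldSketchSource<T> and FoldMode are hypothetical stand-ins invented for illustration, and the round-robin assignment of items to folds is an assumption, not necessarily how NFoldTrainingSource<T> slices its data.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical stand-in for a training source: anything that streams items of type T.
// This is NOT the library's TrainingSource<T>.
public interface ISketchSource<T>
{
    IEnumerable<T> GetItems();
}

// A source backed by an in-memory list, used here only to feed the demo.
public sealed class ListSource<T> : ISketchSource<T>
{
    private readonly IReadOnlyList<T> items;

    public ListSource(params T[] items) { this.items = items; }

    public IEnumerable<T> GetItems() => items;
}

// Concatenates the streams of several sources, in the order given.
public sealed class CompositeSketchSource<T> : ISketchSource<T>
{
    private readonly IReadOnlyList<ISketchSource<T>> sources;

    public CompositeSketchSource(params ISketchSource<T>[] sources) { this.sources = sources; }

    public IEnumerable<T> GetItems() => sources.SelectMany(s => s.GetItems());
}

public enum FoldMode { Training, Validation }

// Exposes one fold of an underlying source. Assumed policy: the item at
// position i belongs to the validation part when i % foldCount == foldIndex,
// and to the training part otherwise.
public sealed class NFoldSketchSource<T> : ISketchSource<T>
{
    private readonly ISketchSource<T> inner;
    private readonly int foldCount;
    private readonly int foldIndex;
    private readonly FoldMode mode;

    public NFoldSketchSource(ISketchSource<T> inner, int foldCount, int foldIndex, FoldMode mode)
    {
        if (inner == null) throw new ArgumentNullException(nameof(inner));
        if (foldCount < 2) throw new ArgumentOutOfRangeException(nameof(foldCount));
        if (foldIndex < 0 || foldIndex >= foldCount) throw new ArgumentOutOfRangeException(nameof(foldIndex));

        this.inner = inner;
        this.foldCount = foldCount;
        this.foldIndex = foldIndex;
        this.mode = mode;
    }

    public IEnumerable<T> GetItems() =>
        inner.GetItems().Where((item, i) =>
            (i % foldCount == foldIndex) == (mode == FoldMode.Validation));
}

public static class Program
{
    public static void Main()
    {
        // Two small sources combined into one stream: a, b, c, d, e.
        var combined = new CompositeSketchSource<string>(
            new ListSource<string>("a", "b", "c"),
            new ListSource<string>("d", "e"));

        // Fold 0 of 3: positions 0 and 3 are held out for validation.
        var training = new NFoldSketchSource<string>(combined, 3, 0, FoldMode.Training);
        var validation = new NFoldSketchSource<string>(combined, 3, 0, FoldMode.Validation);

        Console.WriteLine(string.Join(", ", training.GetItems()));   // prints "b, c, e"
        Console.WriteLine(string.Join(", ", validation.GetItems())); // prints "a, d"
    }
}
```

Because the fold wrapper is itself a source in this sketch, it can be passed anywhere a plain source is accepted, which is presumably the benefit of having the real slicing and composite types derive from the same TrainingSource<T> base.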

This project depends on the following projects residing in sibling directories:
