Releases: uhh-lt/sensegram
Phrases support and simpler FAISS installation
- Adding the possibility to specify a set of phrases from a dictionary which will be identified in the input corpus, e.g. target entities, locations, or terms of the domain of interest (using the `-phrases` key); see the sketch below.
- FAISS can now be installed using conda in a much simpler and platform-independent way.
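A minimal sketch of how such a phrase dictionary could be prepared and passed to the pipeline. The one-phrase-per-line format, the `phrases.txt` file name, and the `train.py` entry point are assumptions for illustration; only the `-phrases` key itself comes from the release notes.

```python
# Hypothetical sketch of preparing a phrase dictionary for the new `-phrases` option.
# The one-phrase-per-line format, the file name, and the train.py entry point are
# assumptions; only the `-phrases` key itself comes from the release notes.
from pathlib import Path

phrases = [
    "new york",            # target entities / locations
    "machine learning",    # terms of the domain of interest
    "word sense induction",
]

Path("phrases.txt").write_text("\n".join(phrases) + "\n", encoding="utf-8")

# Hypothetical invocation of the training pipeline with the phrase dictionary:
#   python train.py corpus.txt -phrases phrases.txt
```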
Adding new pre-computed models
- Adding new pre-computed models for English, German, and Russian
- Moving the old pre-computed models to the new format
- Adding documentation on how to convert pre-trained word embeddings to sense embeddings
- Updating the Quick Start examples and usage examples
- Adding an easier user interface for the word sense disambiguation class (see the sketch below)
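To accompany the updated Quick Start and the simplified disambiguation interface, here is a minimal, hedged sketch of loading a pre-computed model and disambiguating a word in context. The module, class, and parameter names (`sensegram.SenseGram`, `wsd.WSD`, the file paths) are assumptions about the API rather than guaranteed signatures.

```python
# Minimal sketch of loading pre-computed sense vectors and disambiguating a word in context.
# The names sensegram.SenseGram, wsd.WSD, and the model paths are assumptions about the API.
import sensegram
from wsd import WSD
from gensim.models import KeyedVectors

sense_vectors_fpath = "model/wiki.senses.w2v"  # hypothetical path to a pre-computed sense model
word_vectors_fpath = "model/wiki.words.w2v"    # hypothetical path to the matching word vectors

# Load the sense embeddings and the underlying word embeddings.
sv = sensegram.SenseGram.load_word2vec_format(sense_vectors_fpath, binary=False)
wv = KeyedVectors.load_word2vec_format(word_vectors_fpath, binary=False)

# Inspect the induced senses of a word, then disambiguate it in a sentence.
print(sv.get_senses("table"))
wsd_model = WSD(sv, wv, window=5)
print(wsd_model.disambiguate("They bought a table and two chairs", "table"))
```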
Rewriting all modules of the project in Python and porting it to Python 3
- Removing the dependency on the binary C word2vec implementation, replacing it with the gensim word2vec implementation
- Removing the dependency on the Java implementation of the Chinese Whispers algorithm and word sense induction: now using the chinese_whispers module from PyPI and performing word sense induction inside the project
- Migrating to Python 3
- Replacing the old, slower implementation of the word graph computation with a much faster one based on the Facebook FAISS library; a sketch of the FAISS-based graph construction and Chinese Whispers clustering follows this list
- Addressing consistency issues with the command line arguments
- Reimplementing the word pooling mechanism
- Adding makefiles for easy installation and for training test models
- Adding an IPython notebook with a QuickStart guide
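To make the FAISS-based word graph construction and the chinese_whispers-based sense induction concrete, here is a rough sketch. The random stand-in vectors, the value of k, and the toy vocabulary are illustrative assumptions, not the project's exact pipeline.

```python
# Rough sketch: build a word similarity graph with FAISS nearest-neighbour search,
# then cluster it with Chinese Whispers so each cluster corresponds to one induced sense.
# The random stand-in vectors, k, and the toy vocabulary are illustrative assumptions.
import faiss
import networkx as nx
import numpy as np
from chinese_whispers import chinese_whispers, aggregate_clusters

words = ["table", "chair", "desk", "row", "column", "spreadsheet"]
dim = 100
vectors = np.random.rand(len(words), dim).astype("float32")  # stand-in for real word vectors
faiss.normalize_L2(vectors)        # normalise so that inner product equals cosine similarity

index = faiss.IndexFlatIP(dim)     # exact inner-product index
index.add(vectors)
k = 3
sims, ids = index.search(vectors, k + 1)  # +1 because each word is its own nearest neighbour

# Build the similarity graph, skipping self-loops.
graph = nx.Graph()
for i, word in enumerate(words):
    for sim, j in zip(sims[i], ids[i]):
        if j != i:
            graph.add_edge(word, words[j], weight=float(sim))

# Cluster the graph in place; aggregate_clusters groups nodes by the assigned labels.
chinese_whispers(graph, weighting="top", iterations=20)
for label, cluster in aggregate_clusters(graph).items():
    print(label, sorted(cluster))
```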
New stable release
This release introduces a makefile for installing the dependencies and several refactorings of the vector pooling stage (illustrated in the sketch below).
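A hedged illustration of the vector pooling stage: the sketch below averages the word vectors of one sense cluster into a single sense vector, optionally weighting each cluster member by its similarity to the target word. The weighting scheme and all names are assumptions, not the project's exact implementation.

```python
# Rough sketch of sense-vector pooling: a sense vector is the (optionally weighted) average
# of the word vectors of the cluster members. Weighting scheme and names are assumptions.
import numpy as np

def pool_sense_vector(cluster_words, word_vectors, similarities=None):
    """Pool the vectors of one sense cluster into a single sense vector."""
    vectors = np.stack([word_vectors[w] for w in cluster_words])
    if similarities is None:
        return vectors.mean(axis=0)  # unweighted mean pooling
    weights = np.asarray([similarities[w] for w in cluster_words], dtype=np.float32)
    return (vectors * weights[:, None]).sum(axis=0) / weights.sum()  # similarity-weighted pooling

# Toy usage with random vectors standing in for real word embeddings.
rng = np.random.default_rng(0)
word_vectors = {w: rng.random(100).astype(np.float32) for w in ["desk", "chair", "furniture"]}
similarities = {"desk": 0.8, "chair": 0.7, "furniture": 0.6}
sense_vector = pool_sense_vector(["desk", "chair", "furniture"], word_vectors, similarities)
print(sense_vector.shape)
```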
The version published in the original paper
Pelevina et al. (2016). Making sense of word embeddings.