Encoder-decoder frameworks have been used for Statistical Machine Translation, Image Captioning, ASR, Conversation Modeling, Formula Generation and many other applications. Seq2seq is one encoder-decoder implementation in which the encoder is some kind of RNN and its memories (sometimes known as the "thought vector") are transferred over to the decoder, also an RNN, essentially making the decoder a conditional language model. The code here is written with text in mind, and it supports multiple types of RNNs, including GRUs and LSTMs, as well as stacked layers. Global attention is optional.
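As a rough illustration of this setup, here is a minimal sketch in PyTorch (not the code in this repository): two lookup tables feed stacked encoder and decoder RNNs (GRU or LSTM), and the encoder's final state, the "thought vector", conditions the decoder. Global attention is omitted, and all class names, argument names, and default sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, emb_size=256, hidden_size=512,
                 num_layers=2, cell="lstm", dropout=0.3):
        super().__init__()
        rnn = nn.LSTM if cell == "lstm" else nn.GRU            # GRU or LSTM cells
        self.src_emb = nn.Embedding(src_vocab, emb_size)       # source lookup table
        self.tgt_emb = nn.Embedding(tgt_vocab, emb_size)       # target lookup table
        self.encoder = rnn(emb_size, hidden_size, num_layers,
                           batch_first=True, dropout=dropout)  # stacked encoder RNN
        self.decoder = rnn(emb_size, hidden_size, num_layers,
                           batch_first=True, dropout=dropout)  # stacked decoder RNN
        self.proj = nn.Linear(hidden_size, tgt_vocab)          # scores over target vocabulary

    def forward(self, src, tgt_in):
        # Encode the source phrase; the final hidden state is the "thought vector".
        _, thought = self.encoder(self.src_emb(src))
        # The decoder is a language model conditioned on that state.
        dec_out, _ = self.decoder(self.tgt_emb(tgt_in), thought)
        return self.proj(dec_out)                              # (batch, tgt_len, tgt_vocab)
```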
seq2seq: phrase in, phrase out. A two-lookup-table implementation; both the input and the output are temporal (sequence) vectors.
This code implements seq2seq with mini-batching (as in the other examples) using adagrad, adadelta, sgd, or adam. It supports two vocabularies (source and target) and accepts word2vec pre-trained models as input, filling in embeddings for words that are attested in the dataset but not found in the pre-trained model. It uses dropout for regularization.
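One possible way to do the fill-in is sketched below; it assumes the pre-trained word2vec vectors have already been loaded into a plain dict mapping word to vector, and the random initialization for missing words is an assumption for illustration, not necessarily what this code does.

```python
import numpy as np
import torch
import torch.nn as nn

def build_embedding(vocab, pretrained, emb_size, scale=0.1):
    # `vocab` is a hypothetical list of words, `pretrained` a dict word -> numpy vector.
    weights = np.random.uniform(-scale, scale, (len(vocab), emb_size))  # random fill-in
    for i, word in enumerate(vocab):
        vec = pretrained.get(word)
        if vec is not None:
            weights[i] = vec  # copy the pre-trained vector when one exists
    # Keep the table trainable so the filled-in rows can be learned.
    return nn.Embedding.from_pretrained(torch.tensor(weights, dtype=torch.float),
                                        freeze=False)
```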
For any reasonably sized dataset, this really needs to run on a GPU.
The loss that is optimized is the total loss divided by the total number of non-masked tokens in the mini-batch (a token-level loss).
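For concreteness, a minimal sketch of a masked, token-level loss of this kind, assuming a hypothetical PAD_ID padding index and PyTorch-style (batch, length, vocab) logits:

```python
import torch
import torch.nn.functional as F

PAD_ID = 0  # assumed padding/mask index

def token_level_loss(logits, targets):
    # Sum cross-entropy over the whole mini-batch, ignoring padding positions.
    total = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                            targets.reshape(-1),
                            ignore_index=PAD_ID, reduction="sum")
    n_tokens = (targets != PAD_ID).sum()   # number of non-masked tokens
    return total / n_tokens                # token-level loss
```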
The loss reported every nsteps mini-batches is the total loss divided by the total number of non-masked tokens in the last nsteps mini-batches. The perplexity is e raised to this loss.
The epoch loss is the total loss divided by the total number of non-masked tokens in the whole epoch. Again, the perplexity is e raised to this loss.
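The two reports differ only in what is accumulated before dividing: the per-nsteps pair of sums is reset after each report, while the epoch pair keeps running over the whole epoch. A small sketch, with illustrative variable names:

```python
import math

def report(loss_sum, token_sum):
    avg = loss_sum / token_sum   # loss per non-masked token
    ppl = math.exp(avg)          # perplexity = e raised to that loss
    return avg, ppl
```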