Skip to content

Commit

Permalink
Added Relational Network Paper
Browse files Browse the repository at this point in the history
  • Loading branch information
shagunsodhani committed Sep 23, 2017
1 parent f9ff24d commit 79afbf1
Show file tree
Hide file tree
Showing 86 changed files with 69 additions and 98 deletions.
Empty file modified .gitignore
100644 → 100755
Empty file.
1 change: 1 addition & 0 deletions README.md
100644 → 100755
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,7 @@ I am trying a new initiative - a-paper-a-week. This repository will hold all tho

## List of papers

* [Refining Source Representations with Relation Networks for Neural Machine Translation](https://shagunsodhani.in/papers-I-read/Refining-Source-Representations-with-Relation-Networks-for-Neural-Machine-Translation)
* [Pointer Networks](https://shagunsodhani.in/papers-I-read/Pointer-Networks)
* [Learning to Compute Word Embeddings On the Fly](https://shagunsodhani.in/papers-I-read/Learning-to-Compute-Word-Embeddings-On-the-Fly)
* [R-NET - Machine Reading Comprehension with Self-matching Networks](https://shagunsodhani.in/papers-I-read/R-NET-Machine-Reading-Comprehension-with-Self-matching-Networks)
Expand Down
Empty file modified assets/BatchNormalization/eq1.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/BatchNormalization/eq2.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/FewThingsAboutML/BiasVarianceDiagram.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/RNTN/MVRNN.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/RNTN/P1RNTN.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/RNTN/P2RNTN.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/RNTN/ParseTreeMVRNN.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/RNTN/RNN.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified assets/RNTN/RNNModels.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified site/404.html
100644 → 100755
Empty file.
Empty file modified site/LICENSE.md
100644 → 100755
Empty file.
Empty file modified site/README.md
100644 → 100755
Empty file.
Empty file modified site/_config.yml
100644 → 100755
Empty file.
Empty file modified site/_includes/comments.html
100644 → 100755
Empty file.
Empty file modified site/_includes/google_analytics.html
100644 → 100755
Empty file.
Empty file modified site/_includes/head.html
100644 → 100755
Empty file.
Empty file modified site/_includes/sidebar.html
100644 → 100755
Empty file.
Empty file modified site/_layouts/default.html
100644 → 100755
Empty file.
Empty file modified site/_layouts/page.html
100644 → 100755
Empty file.
Empty file modified site/_layouts/post.html
100644 → 100755
Empty file.
Empty file modified site/_posts/2017-04-27-VQA Visual Question Answering.md
100644 → 100755
Empty file.
Empty file.
Empty file modified site/_posts/2017-05-07-Conditional Similarity Networks.md
100644 → 100755
Empty file.
Empty file.
Empty file modified site/_posts/2017-05-23-Neural Module Networks.md
100644 → 100755
Empty file.
Empty file.
Empty file.
Empty file.
Empty file modified site/_posts/2017-07-01-One Model To Learn Them All.md
100644 → 100755
Empty file.

This file was deleted.

Empty file.
Empty file.
Empty file.
Empty file.
Empty file modified site/_posts/2017-08-27-Pointer Networks.md
100644 → 100755
Empty file.
Original file line number Diff line number Diff line change
@@ -0,0 +1,67 @@
---
layout: post
title: Refining Source Representations with Relation Networks for Neural Machine Translation
comments: True
excerpt:
tags: ['2017', 'Relational Network', 'Representation Learning', AI, NLP, NMT]
---

## Introduction

* The paper introduces Relation Network (RN) that refines the encoding representation of the given source document (or sentence).
* This refined source representation can then be used in Neural Machine Translation (NMT) systems to counter the problem of RNNs forgetting old information.
* [Link to the paper](https://arxiv.org/abs/1709.03980)

## Limitations of existing NMT models

* The RNN encoder-decoder architecture is the standard choice for NMT systems. But the RNNs are prone to forgetting old information.
* In NMT models, the attention is modeled in the unit of words while the use of phrases (instead of words) would be a better choice.
* While NMT systems might be able to capture certain relationships between words, they are not explicitly designed to capture such information.

## Contributions of the paper

* Learn the relationship between the source words using the context (neighboring words).
* Relation Networks (RNs) build pairwise relations between source words using the representations generated by the RNNs. The RN would sit between the encoder and the attention layer of the encoder-decoder framework thereby keeping the main architecture unaffected.

## Relation Network

* Neural network which is desgined for relational reasoning.
* Given a set of inputs * O = o<sub>1</sub>, ..., o<sub>n</sub> *, RN is formed as a composition of inputs:
RN(O) = f(sum(g(o<sub>i</sub>, o<sub>j</sub>))), f and g are functions used to learn the relations (feed forward networks)
* *g* learns how the objects are related hence the name "relation".
* **Components**:
* CNN Layer
* Extract information from the words surrounding the given word (context).
* The final output of this layer is the sequence of vectors for different kernel width.

* Graph Propagation (GP) Layer
* Connect all the words with each other in the form of a graph.
* Each output vector from the CNN corresponds to a node in the graph and there is an edge between all possible pair of nodes.
* The information flows between the nodes of the graph in a message passing sort of fashion (graph propagation) to obtain a new set of vectors for each node.

* Multi-Layer Perceptron (MLP) Layer
* The representation from the GP Layer is fed to the MLP layer.
* The layer uses residual connections from previous layers in form of concatenation.

## Datasets

* IWSLT Data - 44K sentences from tourism and travel domain.
* NIST Data - 1M Chinese-English parallel sentence pairs.

## Models

* MOSES - Open source translation system - http://www.statmt.org/moses/
* NMT - Attention based NMT
* NMT+ - NMT with improved decoder
* TRANSFORMER - Google's new NMT
* RNMT+ - Relation Network integrated with NMT+

## Evaluation Metric

* case-insensitive 4-gram BLEU score

## Observations

* As sentences become larger (more than 50 words), RNMT clearly outperforms other baselines.
* Qualitative evaluation shows that RNMT+ model captures the word alignment better than the NMT+ models.
* Similarly, NMT+ system tends to miss some information from the source sentence (more so for longer sentences). While both CNNs and RNNs are weak at capturing long-term dependency, using the relation layer mitigates this issue to some extent.
2 changes: 1 addition & 1 deletion site/_site
Submodule _site updated from 694ba0 to ff5079
Empty file modified site/archieve.md
100644 → 100755
Empty file.
Empty file modified site/atom.xml
100644 → 100755
Empty file.
Empty file modified site/index.html
100644 → 100755
Empty file.
Empty file modified site/index.html.1
100644 → 100755
Empty file.
Empty file modified site/public/apple-touch-icon-precomposed.png
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified site/public/css/lanyon.css
100644 → 100755
Empty file.
Empty file modified site/public/css/poole.css
100644 → 100755
Empty file.
Empty file modified site/public/css/style.css
100644 → 100755
Empty file.
Empty file modified site/public/css/syntax.css
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/HELP-US-OUT.txt
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/css/font-awesome.css
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/css/font-awesome.min.css
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/fonts/FontAwesome.otf
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/fonts/fontawesome-webfont.eot
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/fonts/fontawesome-webfont.svg
100644 → 100755
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Empty file modified site/public/font-awesome-4.7.0/fonts/fontawesome-webfont.ttf
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/fonts/fontawesome-webfont.woff
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/fonts/fontawesome-webfont.woff2
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/animated.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/bordered-pulled.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/core.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/fixed-width.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/font-awesome.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/icons.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/larger.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/list.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/mixins.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/path.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/rotated-flipped.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/screen-reader.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/stacked.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/less/variables.less
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_animated.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_bordered-pulled.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_core.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_fixed-width.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_icons.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_larger.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_list.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_mixins.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_path.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_rotated-flipped.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_screen-reader.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_stacked.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/_variables.scss
100644 → 100755
Empty file.
Empty file modified site/public/font-awesome-4.7.0/scss/font-awesome.scss
100644 → 100755
Empty file.
Empty file modified site/tags.md
100644 → 100755
Empty file.

0 comments on commit 79afbf1

Please sign in to comment.