LiyingCheng95/ArgumentPairExtraction
Code Reference

pytorch_lstmcrf

Requirements

  • Python >= 3.6 and PyTorch >= 0.4.1
  • AllenNLP package (if you use ELMo)

If you use conda:

git clone https://github.com/allanj/pytorch_lstmcrf.git

conda create -n pt_lstmcrf python=3.7
conda activate pt_lstmcrf
# check https://pytorch.org for the version suitable for your machine
conda install pytorch=1.3.0 torchvision cudatoolkit=10.0 -c pytorch -n pt_lstmcrf
pip install tqdm
pip install termcolor
pip install overrides
pip install allennlp

Usage

  1. Put the GloVe embedding file (glove.6B.100d.txt) under the data directory. (You can also use ELMo/BERT/Flair; check below.) Note that if the embedding file does not exist, the embeddings are simply initialized randomly.
  2. Simply run the following command; you should obtain results comparable to the benchmark above.
    python trainer.py
    To use your first GPU device (cuda:0) and train a model on your own dataset with ELMo embeddings:
    python trainer.py --device cuda:0 --dataset YourData --context_emb elmo --model_folder saved_models
    
Training with your own data
  1. Create a folder YourData under the data directory.
  2. Put the train.txt, dev.txt and test.txt files under this directory (make sure the format is compatible, i.e. the first column is words and the last column is tags; see the sketch after this list). If you have a different format, simply modify the reader in config/reader.py.
  3. Change the dataset argument to YourData when you run trainer.py.
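
For reference, a minimal sketch of the expected file format (the sentence and the B/I/O tag set are hypothetical; use whatever tags your task defines):

We B-claim
thank I-claim
the I-claim
reviewer I-claim
. O

Each line holds one token and its tag separated by whitespace, and sentences are separated by a blank line.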

Data Preparation

The preprocessed RR dataset is saved in ./data. For more details regarding the dataset, please refer to the RR (Review-Rebuttal) paper cited below.

Data Processing

To process the data, we use bert-as-service to obtain embeddings for all tokens [x_0, x_1, ..., x_{T-1}] in each sentence.

Install

pip install bert-serving-server  # server
pip install bert-serving-client  # client, independent of `bert-serving-server`

Download a pre-trained BERT model

Download a pre-trained English BERT model, then uncompress the zip file into some folder, say /tmp/english_L-12_H-768_A-12/.

Start the BERT service

bert-serving-start -model_dir /tmp/english_L-12_H-768_A-12/ -max_seq_len NONE -pooling_strategy NONE

Setting -pooling_strategy NONE makes the service return one vector per token rather than a single pooled sentence vector, which is what the token-level embeddings above require; -max_seq_len NONE removes the fixed sequence-length limit.

Use Client to Get Sentence Encodings

Run ../data_processing/dataProcessing.py.

Now you will get vec_train.pkl, vec_dev.pkl, vec_test.pkl.
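
For reference, the core client call inside dataProcessing.py looks roughly like the sketch below (the sentences and the output file name are illustrative; BertClient comes from the bert-serving-client package installed above):

import pickle

from bert_serving.client import BertClient

# Connect to the bert-serving-start instance launched above (localhost by default).
bc = BertClient()

# Hypothetical input: sentences from one review or rebuttal passage.
sentences = ["We thank the reviewer .", "We have revised the paper ."]

# With -pooling_strategy NONE, encode() returns one vector per token,
# i.e. a (num_sentences, max_seq_len, 768) array for a BERT-base model.
vecs = bc.encode(sentences)

# Persist the token embeddings, in the same spirit as vec_train.pkl etc.
with open("vec_train.pkl", "wb") as f:
    pickle.dump(vecs, f)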

Citation

@inproceedings{cheng2020ape,
  title={APE: Argument Pair Extraction from Peer Review and Rebuttal via Multi-task Learning},
  author={Cheng, Liying and Bing, Lidong and Qian, Yu and Lu, Wei and Si, Luo},
  booktitle={Proceedings of EMNLP},
  year={2020}
}
