fixes for validation engine and for using torchrun #22

jgmf-amazon · 2022-06-22T19:19:11Z

Issue #21

Description of changes:
There are two fixes given:

Code was added to the eval_preds function in training_utils.py per the suggestion of @bozheng-hit .With the current main branch, the validation engine is not functioning correctly, because subwords after the first subword are being handled by the convert_to_bio function. Instead, we want to merge/ignore the subsequent subwords and use only the prediction from the first subword. This change sets subsequent subwords to -100.
The train, test, and HPO scripts were modified to accept environmental variables for local_rank, allowing them to support torchrun.

I tested the changes with the xlmr_base example configs, both with train.py and test.py.

Once these changes are approved, I'll update the README. Thanks.

fixes for validation engine and for using torchrun

085111d

jgmf-amazon mentioned this pull request Jun 22, 2022

Can anyone reproduce the baseline results reported in the paper with the current version of codes? #21

Closed

cperiz approved these changes Jun 22, 2022

View reviewed changes

jgmf-amazon merged commit 3932705 into main Jun 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixes for validation engine and for using torchrun #22

fixes for validation engine and for using torchrun #22

jgmf-amazon commented Jun 22, 2022

fixes for validation engine and for using torchrun #22

fixes for validation engine and for using torchrun #22

Conversation

jgmf-amazon commented Jun 22, 2022