Modified evaluation to use seqeval package #14
Merged
Issue 13
This PR fixes a bug found in issue 13:
The problem was that evaluation only considered tokens in the predicted utterance that aligned with slotted tokens in the ground truth, and the F1 calculation was wrong. Exact match accuracy was also affected.
To fix the bug, the seqeval package is now used, which follows conlleval conventions by default. To use seqeval, BIO tagging was implemented, and exact match accuracy was updated to compare the BIO-tagged sequences. Some new test cases were added as well, and all tests pass. The paper preprint and the eval.ai leaderboards will be fixed by 6/17.
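For reference, a minimal pure-Python sketch of what the conlleval-style metric computes over BIO tags (mirroring seqeval's default behavior): spans are extracted as (label, start, end) triples, and F1 is micro-averaged over exact span matches. The tag labels and utterances below are invented for illustration, not taken from this repository.

```python
def extract_spans(tags):
    """Collect (label, start, end) spans from one BIO tag sequence."""
    spans, start, label = [], None, None
    for i, tag in enumerate(tags + ["O"]):  # "O" sentinel flushes the last span
        # A span ends at an O tag, a new B- tag, or an I- tag with a new label.
        if tag == "O" or tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != label
        ):
            if label is not None:
                spans.append((label, start, i))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        elif tag.startswith("I-") and label is None:
            # conlleval treats a stray I- tag as starting a new span
            start, label = i, tag[2:]
    return spans

def span_f1(y_true, y_pred):
    """Micro-averaged F1 over exact span matches across all sequences."""
    tp = fp = fn = 0
    for true_tags, pred_tags in zip(y_true, y_pred):
        true_spans = set(extract_spans(true_tags))
        pred_spans = set(extract_spans(pred_tags))
        tp += len(true_spans & pred_spans)
        fp += len(pred_spans - true_spans)
        fn += len(true_spans - pred_spans)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Example: the prediction recovers the "time" span but misses "loc",
# so precision = 1/1, recall = 1/2, F1 = 2/3.
y_true = [["O", "B-time", "I-time", "O"], ["B-loc", "O"]]
y_pred = [["O", "B-time", "I-time", "O"], ["O", "O"]]
print(span_f1(y_true, y_pred))  # → 0.666...
```

In practice the PR delegates this to seqeval rather than reimplementing it; the sketch only shows why a predicted span counts as correct when its label and exact boundaries both match the ground truth.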