You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In some cases the sentence a preprocessor is presented with is, after preprocessing, empty or obviously incorrect. For example, in the English data set there are validated sentences of the form
"<html lang=""en"">"
which obviously is the result of some bug or bugs in the Common Voice code.
Upon preprocessing such a sentence would be empty or obviously incorrect and should not be included in any validated data set.
The result of this issue should be some means of allowing preprocessors to reject sentences and move them from the valid data set to the invalid data set.
The text was updated successfully, but these errors were encountered:
kdavis-mozilla
changed the title
Allow preprocessors to reject sentences
Allow language specific preprocessors to reject sentences
Dec 13, 2018
In some cases the sentence a preprocessor is presented with is, after preprocessing, empty or obviously incorrect. For example, in the English data set there are validated sentences of the form
which obviously is the result of some bug or bugs in the Common Voice code.
Upon preprocessing such a sentence would be empty or obviously incorrect and should not be included in any validated data set.
The result of this issue should be some means of allowing preprocessors to reject sentences and move them from the valid data set to the invalid data set.
The text was updated successfully, but these errors were encountered: