Consider changing name "valid.csv" and "invalid.csv" #83

JRMeyer · 2019-02-04T20:21:20Z

Given that "dev" and "valid" are used interchangeably in the machine learning world to refer to the held-out dataset which is used for early stopping, I think having a valid.csv as well as a dev.csv will lead to confusion.

Given that "invalid.csv" also exists, and the filenames should make obvious that "valid.csv" and "invalid.csv" are complementary sets, I suggest the following names:

valid.csv	invalid.csv
validated.csv	invalidated.csv
accurate.csv	inaccurate.csv
correct.csv	incorrect.csv
confirmed.csv	uncomfirmed.csv
verified.csv	unverified.csv
validated_transcripts.csv	invalidated_transcripts.csv
accurate_transcripts.csv	inaccurate_transcripts.csv
correct_transcripts.csv	incorrect_transcripts.csv
confirmed_transcripts.csv	unconfirmed_transcripts.csv
verified_transcripts.csv	unverified_transcripts.csv

The text was updated successfully, but these errors were encountered:

kdavis-mozilla · 2019-02-05T09:14:01Z

What about keeping it simple with validated.csv and invalidated.csv?

Fixed #83 renamed output tsv's

JRMeyer assigned kdavis-mozilla Feb 4, 2019

JRMeyer closed this as completed in d73df91 Feb 8, 2019

JRMeyer added a commit that referenced this issue Feb 8, 2019

Merge pull request #84 from mozilla/issue83

1d9be5e

Fixed #83 renamed output tsv's

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider changing name "valid.csv" and "invalid.csv" #83

Consider changing name "valid.csv" and "invalid.csv" #83

JRMeyer commented Feb 4, 2019

kdavis-mozilla commented Feb 5, 2019

Consider changing name "valid.csv" and "invalid.csv" #83

Consider changing name "valid.csv" and "invalid.csv" #83

Comments

JRMeyer commented Feb 4, 2019

kdavis-mozilla commented Feb 5, 2019