Commit

feat: openai results
lukasellinger committed Aug 19, 2024
1 parent 479cd1b commit 1873942
Showing 135 changed files with 36,189 additions and 4,338 deletions.
4 changes: 3 additions & 1 deletion .gitignore
@@ -245,4 +245,6 @@ fabric.properties

 /wiki-pages

-*.safetensors
+*.safetensors
+
+dataset/jan
1,685 changes: 0 additions & 1,685 deletions data/evaluation/fever_base.jsonl

This file was deleted.

1,685 changes: 0 additions & 1,685 deletions data/evaluation/fever_finetuned.jsonl

This file was deleted.

Large diffs are not rendered by default.

168 changes: 0 additions & 168 deletions data/evaluation/german_dpr-claim_verification_base.jsonl

This file was deleted.

336 changes: 0 additions & 336 deletions data/evaluation/german_dpr-claim_verification_finetuned.jsonl

This file was deleted.

2 changes: 1 addition & 1 deletion general_utils/reader.py
@@ -43,7 +43,7 @@ def process(self, file):

     def _write(self, file, lines):
         for line in lines:
-            json.dump(line, file)
+            json.dump(line, file, ensure_ascii=False)
             file.write('\n')
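
The effect of the `ensure_ascii=False` change above can be illustrated with a minimal, standalone sketch. It is not taken from the repository: the `write_jsonl` helper and the sample record are hypothetical, and in-memory buffers stand in for the real output file.

```python
import io
import json


def write_jsonl(file, lines, ensure_ascii):
    """Hypothetical stand-in for the _write method: one JSON object per line."""
    for line in lines:
        json.dump(line, file, ensure_ascii=ensure_ascii)
        file.write('\n')


# Illustrative record with a German umlaut (not from the actual datasets).
record = {"claim": "Käse ist ein Milchprodukt."}

escaped = io.StringIO()
write_jsonl(escaped, [record], ensure_ascii=True)    # json.dump default

readable = io.StringIO()
write_jsonl(readable, [record], ensure_ascii=False)  # behaviour after this commit

print(escaped.getvalue(), end="")   # {"claim": "K\u00e4se ist ein Milchprodukt."}
print(readable.getvalue(), end="")  # {"claim": "Käse ist ein Milchprodukt."}

# Both forms decode to the same object; only the on-disk text differs.
assert json.loads(escaped.getvalue()) == json.loads(readable.getvalue()) == record
```

When writing non-ASCII output to a real file, opening it with an explicit `encoding='utf-8'` is likely advisable, since the platform default encoding may differ.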


532 changes: 243 additions & 289 deletions notebooks/factscore.ipynb

Large diffs are not rendered by default.

1,064 changes: 896 additions & 168 deletions notebooks/openai_evaluation.ipynb

Large diffs are not rendered by default.

11 changes: 6 additions & 5 deletions notebooks/pipeline_evaluation.ipynb
@@ -152,10 +152,6 @@
 " 'dataset': load_dataset('lukasellinger/german_wiktionary-claim_verification-mini', split='test'),\n",
 " 'lang': 'de'\n",
 " },\n",
-" #'german-claim_verification': {\n",
-" # 'dataset': load_dataset('lukasellinger/german-claim_verification', split='test'),\n",
-" # 'lang': 'de'\n",
-" #},\n",
 " 'squad-claim_verification': {\n",
 " 'dataset': load_dataset('lukasellinger/squad-claim_verification', split='test'),\n",
 " 'lang': 'en'\n",
@@ -164,7 +160,12 @@
 " #'german_wiktionary-claim_verification-large': {\n",
 " # 'dataset': load_dataset('lukasellinger/german_wiktionary-claim_verification-large', split='test'),\n",
 " # 'lang': 'de'\n",
-" #}\n",
+" #},\n",
+" # outdated\n",
+" #'german-claim_verification': {\n",
+" # 'dataset': load_dataset('lukasellinger/german-claim_verification', split='test'),\n",
+" # 'lang': 'de'\n",
+" #},\n",
 "}"
 ],
 "id": "26dff9eae25d4da9",
