Add global mmlu lite sensitivity cards #1568

eliyahabba · 2025-02-02T17:55:49Z

feat: add Global-MMLU-Lite CS/CA task cards

Add two task cards for the Global-MMLU-Lite dataset:

- CS card for culturally sensitive questions
- CA card for culturally agnostic questions

Both cards include:

- Support for 14 languages
- Multiple choice QA format
- Topic mapping and preprocessing steps


elronbandel

Is the filtering lambda the only difference between the three Global-MMLU files?
If so, can you keep them in one Python file and loop over the lambdas:
for func in [None, "lambda x: x['cultural_sensitivity_label'] == 'CA'", ...
Also, can you run make pre-commit before committing to fix the code style? (After you run it once, it will keep applying to your code before each new commit.)
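
A minimal sketch of the loop suggested above, in plain Python rather than the project's own card API. The `FILTERS` table, the `build_subsets` helper, and the sample rows are hypothetical; the 'CS' label value is assumed by analogy with the 'CA' value quoted in the comment, and real callables stand in for the string lambdas so the example runs on its own.

```python
# Hypothetical sketch: one loop over filter functions instead of one file per filter.
# None means "no filtering"; the 'CS' value is an assumption mirroring 'CA'.
FILTERS = {
    "all": None,
    "ca": lambda x: x["cultural_sensitivity_label"] == "CA",
    "cs": lambda x: x["cultural_sensitivity_label"] == "CS",
}


def build_subsets(rows):
    """Apply every filter in FILTERS to the same rows and collect the results."""
    subsets = {}
    for name, func in FILTERS.items():
        # Keep every row when there is no filter, otherwise only the matching rows.
        subsets[name] = [row for row in rows if func is None or func(row)]
    return subsets


# Made-up rows, only to show the shape of the data.
rows = [
    {"question": "q1", "cultural_sensitivity_label": "CA"},
    {"question": "q2", "cultural_sensitivity_label": "CS"},
    {"question": "q3", "cultural_sensitivity_label": "CA"},
]
subsets = build_subsets(rows)
```

The only thing that varies between iterations is the filter, which is the point of the suggestion: shared setup is written once.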

eliyahabba · 2025-02-02T18:49:27Z

Not exactly. I merged the two cultural_sensitivity_label files into one, but they differ from the global_mmlu file in two important ways:

- Different datasets: Global-MMLU-Lite vs. Global-MMLU.
- Different processing: the Global-MMLU-Lite files create one card per language covering all subjects, while the Global-MMLU file creates a separate card for each language-subject combination.
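
To illustrate the difference in card granularity, here is a rough sketch. The language codes, subject names, and card-name patterns are all hypothetical placeholders, not the names used in the PR.

```python
from itertools import product

# Hypothetical placeholders: the real language and subject lists are longer.
languages = ["en", "fr", "he"]
subjects = ["anatomy", "law"]
labels = ["ca", "cs"]

# Global-MMLU-Lite style: one card per (label, language), covering all subjects.
lite_cards = [
    f"global_mmlu_lite_{label}.{lang}" for label, lang in product(labels, languages)
]

# Global-MMLU style: one card per (language, subject) combination.
full_cards = [
    f"global_mmlu.{lang}.{subject}" for lang, subject in product(languages, subjects)
]
```

With the Lite layout the number of cards grows with the number of languages only, while the Global-MMLU layout grows with languages times subjects.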

eliyahabba added 6 commits February 2, 2025 19:52

added cards

2c5cd69

reformat files

52d0022

added cards

fa191df

reformat files

2bc697a

elronbandel requested changes Feb 2, 2025


eliyahabba added 2 commits February 2, 2025 20:47

merged files

74f5378

merged files

88d4b5f

merged files

ae51897

eliyahabba requested a review from elronbandel February 2, 2025 19:11

elronbandel approved these changes Feb 2, 2025


Merge branch 'main' into add-global-mmlu-lite-sensitivity-cards

dd6d8a7

elronbandel merged commit f9f9c5d into main Feb 2, 2025
16 of 18 checks passed

elronbandel deleted the add-global-mmlu-lite-sensitivity-cards branch February 2, 2025 20:24
