Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Add P-MMEval #1714

Merged
merged 7 commits into from
Nov 27, 2024
Merged

[Feature] Add P-MMEval #1714

merged 7 commits into from
Nov 27, 2024

Conversation

wanyu2018umac
Copy link
Contributor

@wanyu2018umac wanyu2018umac commented Nov 25, 2024

Motivation

This PR introduces the implementation of P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs (see paper link). The P-MMEval benchmark delivers support for evaluating LLMs on multilingual capabilities with examples in 10 languages.

Modification

  • Configs:
    • Add files in configs/datasets/PMMEval for evaluation support. For each subset in P-MMEval (i.e., flores, humaneval-xl, mgsm, mhellaswag, mifeval, mlogiqa, mmmlu, and xnli), each dataset python file is created.
    • Add files in configs/summarizers and configs/summarizers/groups for summarizing the evaluation results on P-MMEval.
  • Datasets
    • Add files in datasets supporting the loading and evaluation for each subset.

Checklist

Before PR:

  • Pre-commit or other linting tools are used to fix the potential lint issues.
  • Bug fixes are fully covered by unit tests, the case that causes the bug should be added in the unit tests.
  • The modification is covered by complete unit tests. If not, please add more unit test to ensure the correctness.
  • The documentation has been modified accordingly, like docstring or example tutorials.

After PR:

  • If the modification has potential influence on downstream or other related projects, this PR should be tested with those projects.
  • CLA has been signed and all committers have signed the CLA in this PR.

@wanyu2018umac wanyu2018umac changed the title [Update] Add P-MMEval [Feature] Add P-MMEval Nov 25, 2024
@liushz liushz merged commit 90efcf2 into open-compass:main Nov 27, 2024
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants