Skip to content

Releases: corentin-ryr/MultiMedEval

MultiMedEval 0.1.1

18 Mar 16:31
0f7cffe
Compare
Choose a tag to compare

New features:

  • Dynamic datasets: use any dataset formatted correctly and apply the metrics of a task "family" (QA, VQA, Report Comparison, Image Classification, and NLI). This feature adds more flexibility to MultiMedEval.
  • Added Diff-VQA [Paper] to the list of supported tasks.
  • Updated RadCliQ to reflect more closely the results in the [Paper]

In addition to the new features, we added a suite of unit tests and corrected some bugs.

MultiMedEval 0.1

15 Feb 15:41
Compare
Choose a tag to compare
v0.1.0

Removed BnB from benchmarking code