Releases: corentin-ryr/MultiMedEval
Releases · corentin-ryr/MultiMedEval
MultiMedEval 0.1.1
New features:
- Dynamic datasets: use any dataset formatted correctly and apply the metrics of a task "family" (QA, VQA, Report Comparison, Image Classification, and NLI). This feature adds more flexibility to MultiMedEval.
- Added Diff-VQA [Paper] to the list of supported tasks.
- Updated RadCliQ to reflect more closely the results in the [Paper]
In addition to the new features, we added a suite of unit tests and corrected some bugs.
MultiMedEval 0.1
v0.1.0 Removed BnB from benchmarking code