@WildEval

WildEval Team

Popular repositories

  1. ZeroEval (Public)

    Forked from allenai/WildBench

    A simple unified framework for evaluating LLMs

    HTML · 149 stars · 20 forks

  2. wildeval.github.io (Public)

    Ruby

  3. WildBench (Public)

    Forked from allenai/WildBench

    Benchmarking LLMs with Challenging Tasks from Real Users

    Python


People

This organization has no public members.
