feat: support custom task runner #2407

zhyncs · 2024-12-08T18:27:57Z

Motivation

Usually, when we run experiments (benchmark or evaluation), we need to do it multiple times, manually starting the server cmd and client cmd each time and collecting results. This work is very procedural and tedious, so it can be semi-automated using scripts. cc @shanyu-sys @yichuan520030910320

Here is a simple example. ref https://github.com/sgl-project/sglang/actions/runs/12224157302

Modifications

Checklist

Format your code according to the Contributor Guide.
Add unit tests as outlined in the Contributor Guide.
Update documentation as needed, including docstrings or example tutorials.

This reverts commit 87bf2bb.

zhyncs · 2024-12-08T18:31:09Z

ref https://github.com/sgl-project/sglang/actions/runs/12224251746

zhyncs · 2024-12-08T18:44:13Z

TODO(zhyncs):
Make different config.yml files as CI input choices, so we can just write yml and manually trigger the execution of a specific config.
In theory, any group and any server cmd and client cmd can be used. The only things that need to be handled are the dependency installation for the server and the output formatting for the client.
For example, lm_eval, evalplus, etc., and we may want to run different configurations each time. This configurability gives us great flexibility without the need to set up a development environment ourselves.

zhyncs · 2024-12-08T18:46:14Z

In some cases, we only need to display, in other cases, we need to set a threshold for comparison.

zhyncs added 4 commits December 9, 2024 02:14

upd

e45583a

upd

34ce9d3

test

87bf2bb

Revert "test"

12a5efe

This reverts commit 87bf2bb.

zhyncs requested review from merrymercy and Ying1123 as code owners December 8, 2024 18:27

zhyncs merged commit 0f8eb15 into main Dec 8, 2024
3 of 15 checks passed

zhyncs deleted the zhyncs/test branch December 8, 2024 18:29

zhyncs mentioned this pull request Dec 9, 2024

Add InfiniteBench for long context benchmarking #2421

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support custom task runner #2407

feat: support custom task runner #2407

zhyncs commented Dec 8, 2024

zhyncs commented Dec 8, 2024

zhyncs commented Dec 8, 2024

zhyncs commented Dec 8, 2024

feat: support custom task runner #2407

feat: support custom task runner #2407

Conversation

zhyncs commented Dec 8, 2024

Motivation

Modifications

Checklist

zhyncs commented Dec 8, 2024

zhyncs commented Dec 8, 2024

zhyncs commented Dec 8, 2024