
Improve overall user experience of model service #1748

Open
1 of 4 tasks
kyujin-cho opened this issue Nov 29, 2023 · 0 comments
kyujin-cho commented Nov 29, 2023

Main idea

Since the birth of Backend.AI Model Service, the main concern with the feature has been that it is too hard to use for the majority of users who want to serve their own model. To overcome this problem, we decided to add several new features to both Core and WebUI that will potentially enhance the overall experience of the Model Service feature.

  • Core: New "Dry Run" API
    This new API should validate the whole lifecycle of an actual inference session. Its request schema will be identical to that of the model service creation API. The implementation should first read model-definition.yml, create a new inference session accordingly but without binding a routing and endpoint, wait until the model server loads, perform a health check (if one is defined in the model definition), and finally terminate the created session. The API should report progress for the whole flow to the caller via SSE.
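The step-by-step flow above maps naturally onto a stream of SSE progress events, one per lifecycle stage. Below is a minimal sketch of what that event stream could look like; all names here (`dry_run_events`, `STAGES`) are hypothetical illustrations, not actual Backend.AI APIs.

```python
import json
from typing import Iterator

# Hypothetical dry-run lifecycle stages, mirroring the steps described above.
STAGES = [
    "read-model-definition",
    "create-session",
    "wait-for-model-server",
    "health-check",
    "terminate-session",
]


def dry_run_events(model_definition: dict) -> Iterator[str]:
    """Yield SSE-formatted progress events for each dry-run stage.

    A real implementation would perform the actual work at each stage
    (e.g. spawning the inference session); here every stage is simulated
    as succeeding immediately so the event framing is the focus.
    """
    for stage in STAGES:
        # Skip the health check when the model definition does not declare one.
        if stage == "health-check" and not model_definition.get("health_check"):
            continue
        payload = {"stage": stage, "status": "ok"}
        # SSE wire format: "event:" and "data:" lines, blank-line terminated.
        yield f"event: progress\ndata: {json.dumps(payload)}\n\n"


if __name__ == "__main__":
    events = list(dry_run_events({"health_check": {"path": "/health"}}))
    print(len(events))  # 5 stages, health check included
```

Streaming one self-describing event per stage lets the WebUI render live progress and pinpoint exactly which step of the model definition failed, instead of returning a single pass/fail result at the end.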

Alternative ideas

No response

Anything else?

No response

Tasks

  1. comp:client comp:manager size:XL
    kyujin-cho
@kyujin-cho kyujin-cho added the type:feature Add new features label Nov 29, 2023
@achimnol achimnol removed the type:feature Add new features label Oct 18, 2024
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants