Add more support for intel Gaudi accelerators #2357

YangQun1 · 2024-12-05T03:28:49Z

Motivation

We already have initial support for Intel Gaudi accelerators in sglang.
This PR aims to add more supports (like hpu memory capacity getter and some minor changes to use generic torch device module apis) to make the offline_batch_inference.py example run e2e successfully on x1 or x2 Gaudi2 cards.

# x1
python examples/runtime/engine/offline_batch_inference.py --device hpu --model-path meta-llama/Meta-Llama-3.1-8B-Instruct

# x2
python examples/runtime/engine/offline_batch_inference.py --device hpu --tp-size=2 --model-path meta-llama/Meta-Llama-3.1-8B-Instruct

Modifications

Checklist

Format your code according to the Contributor Guide.
Add unit tests as outlined in the Contributor Guide.
Update documentation as needed, including docstrings or example tutorials.

YangQun1 · 2024-12-05T10:03:23Z

Hi @merrymercy , could you help to take a review?

merrymercy

This looks good!

YangQun1 marked this pull request as ready for review December 5, 2024 06:57

YangQun1 requested review from merrymercy, Ying1123, hnyls2002, zhyncs, ispobock and ByronHsu as code owners December 5, 2024 06:57

YangQun1 force-pushed the dev/enable-hpu branch from 976cb1e to 0ee1f26 Compare December 5, 2024 09:22

YangQun1 added 4 commits December 5, 2024 17:22

Support HPU device

140b949

fix

22fbef8

fix format

ef521ce

always cast probs_idx to int32 in pytorch sampler

32c3ed8

YangQun1 force-pushed the dev/enable-hpu branch from 0ee1f26 to 32c3ed8 Compare December 5, 2024 09:22

merrymercy approved these changes Dec 6, 2024

View reviewed changes

merrymercy enabled auto-merge (squash) December 6, 2024 09:15

merrymercy disabled auto-merge December 6, 2024 09:15

Merge branch 'main' into dev/enable-hpu

c21b13c

merrymercy merged commit 37ee906 into sgl-project:main Dec 6, 2024
0 of 14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more support for intel Gaudi accelerators #2357

Add more support for intel Gaudi accelerators #2357

YangQun1 commented Dec 5, 2024 •

edited

Loading

YangQun1 commented Dec 5, 2024

merrymercy left a comment

Add more support for intel Gaudi accelerators #2357

Add more support for intel Gaudi accelerators #2357

Conversation

YangQun1 commented Dec 5, 2024 • edited Loading

Motivation

Modifications

Checklist

YangQun1 commented Dec 5, 2024

merrymercy left a comment

Choose a reason for hiding this comment

YangQun1 commented Dec 5, 2024 •

edited

Loading