
_is_bf16_available judgment supports npu #5193

Merged: 1 commit merged on Aug 19, 2024
Conversation

Ricardo-L-C (Contributor)

What does this PR do?

transformers.utils.is_torch_bf16_gpu_available currently only checks GPU/CUDA availability, and transformers does not provide a comparable interface for NPU. As a result, llama-factory incorrectly concludes that an NPU device (such as the Ascend 910B) does not support bf16. In practice, when running inference without specifying infer_dtype, it silently falls back to fp16 instead of bf16.
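A minimal sketch of the kind of NPU-aware check this PR introduces (not necessarily the exact merged code). It assumes torch_npu is installed so that the torch.npu namespace, including torch.npu.is_bf16_supported(), is available; the transformers helpers is_torch_bf16_gpu_available and is_torch_npu_available are real utilities in transformers.utils.

```python
import torch
from transformers.utils import is_torch_bf16_gpu_available, is_torch_npu_available


def _is_bf16_available() -> bool:
    try:
        if is_torch_npu_available():
            # On Ascend NPUs (e.g. 910B), ask the NPU backend directly
            # (assumes torch_npu exposes torch.npu.is_bf16_supported()).
            return torch.npu.is_bf16_supported()
        # Otherwise fall back to the stock GPU/CUDA check from transformers.
        return is_torch_bf16_gpu_available()
    except Exception:
        # If neither backend can answer, report False so that inference
        # falls back to fp16 rather than crashing.
        return False
```

With a check like this, inference on an Ascend 910B without an explicit infer_dtype can pick bf16 instead of silently downgrading to fp16.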


hiyouga self-requested a review on August 19, 2024, 15:32
hiyouga (Owner) left a comment


LGTM

hiyouga merged commit 5d5bfc8 into hiyouga:main on Aug 19, 2024
1 check passed
hiyouga added the "solved" label (This problem has been already solved) on Aug 19, 2024