
set autoround format as default to unify CPU/HPU/CUDA #205

Merged: 10 commits merged into main on Aug 6, 2024

Conversation

wenhuach21 (Contributor)

No description provided.

wenhuach21 requested review from WeiweiZhang1 and n1ck-guo, then removed the request for n1ck-guo (August 5, 2024 07:19)
if "hpu" in backend and model.dtype != torch.bfloat16:
    logger.info("We suggest setting `torch_dtype=torch.bfloat16` for better efficiency with AutoRound.")
model = model.to(torch.bfloat16)
elif model.dtype != torch.float16:
Collaborator
This cast should apply only to GPU; better to add a guard such as `"cpu" not in backend`.
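The reviewer's suggestion can be sketched as a small helper that decides the target dtype per backend, with the float16 branch guarded so CPU models are left untouched. This is an illustrative sketch, not the PR's actual code; `select_dtype` and the string-based dtypes are hypothetical stand-ins for the `torch`-based logic in the diff.

```python
def select_dtype(backend: str, current_dtype: str) -> str:
    """Pick the dtype a model should be cast to for a given backend.

    Illustrative sketch of the reviewed logic: HPU prefers bfloat16,
    CUDA prefers float16, and CPU keeps its current dtype -- the guard
    the reviewer asked for.
    """
    if "hpu" in backend and current_dtype != "bfloat16":
        return "bfloat16"      # HPU runs most efficiently in bfloat16
    if "cpu" not in backend and current_dtype != "float16":
        return "float16"       # cast only on GPU-like backends
    return current_dtype       # CPU (or already-correct dtype): no cast


print(select_dtype("hpu", "float32"))   # bfloat16
print(select_dtype("cuda", "float32"))  # float16
print(select_dtype("cpu", "float32"))   # float32
```

Without the `"cpu" not in backend` guard, the `elif` branch in the diff would also cast CPU models to float16, which is what the review comment flags.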

wenhuach21 merged commit 1e75afd into main on Aug 6, 2024 (10 checks passed)
wenhuach21 deleted the default_autoround branch on August 6, 2024 02:41