
fix torch-npu dependency #4561

Merged
merged 8 commits on Jun 27, 2024

Conversation

hashstone

What does this PR do?

Fixes a torch-npu dependency conflict like the following:

#0 11.87 Collecting triton==2.1.0 (from torch==2.1.0->llamafactory==0.8.3.dev0)
#0 11.90   Downloading triton-2.1.0-0-cp310-cp310-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.3 kB)
#0 11.90 INFO: pip is looking at multiple versions of torch-npu to determine which version is compatible with other requirements. This could take a while.
#0 11.90 ERROR: Cannot install llamafactory and llamafactory[metrics,torch-npu]==0.8.3.dev0 because these package versions have conflicting dependencies.
#0 11.90 
#0 11.90 The conflict is caused by:
#0 11.90     llamafactory[metrics,torch-npu] 0.8.3.dev0 depends on torch==2.1.0; extra == "torch-npu"
#0 11.90     torch-npu 2.1.0.post3 depends on torch==2.1.0+cpu
#0 11.90 
#0 11.90 To fix this you could try to:
#0 11.90 1. loosen the range of package versions you've specified
#0 11.90 2. remove package versions to allow pip to attempt to solve the dependency conflict
#0 11.90 
#0 11.90 ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts


@hiyouga hiyouga added the pending This problem is yet to be addressed label Jun 26, 2024
@hiyouga
Owner

hiyouga commented Jun 26, 2024

cc @MengqingCao

@MengqingCao
Contributor

@hashstone Thanks for the fix! But the installation of torch-npu differs by architecture. Maybe adding an extra "torch-npu-x86" to setup.py and choosing to install "torch-npu" or "torch-npu-x86" depending on the architecture in the Dockerfile would be a solution. @hiyouga What do you think?
https://pypi.org/project/torch-npu/2.1.0.post3/
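The architecture split proposed above could be sketched in setup.py roughly like this. This is a hypothetical sketch: the extra names come from this thread, but the exact dependency lists are assumptions inferred from the conflict log, not the code that was merged.

```python
# Hypothetical sketch of architecture-specific extras, inferred from the
# pip conflict log above -- NOT the exact lists merged in the PR.
EXTRAS_REQUIRE = {
    # aarch64: the torch-npu wheel is compatible with the plain torch wheel.
    "torch-npu": ["torch==2.1.0", "torch-npu==2.1.0.post3"],
    # x86_64: torch-npu 2.1.0.post3 pins torch==2.1.0+cpu, which comes from
    # the PyTorch CPU wheel index rather than plain PyPI.
    "torch-npu-x86": ["torch==2.1.0+cpu", "torch-npu==2.1.0.post3"],
}

# In setup.py this dict would be passed as:
#   setup(..., extras_require=EXTRAS_REQUIRE)
```

With two separate extras, pip only ever sees one torch pin per install, so the torch==2.1.0 vs torch==2.1.0+cpu clash from the log cannot arise.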

@hashstone
Author

@MengqingCao I have modified it as you proposed; please review.

@@ -1,28 +1,35 @@
 # Use the Ubuntu 22.04 image with CANN 8.0.rc1
 # More versions can be found at https://hub.docker.com/r/cosdt/cann/tags
-FROM cosdt/cann:8.0.rc1-910b-ubuntu22.04
+FROM --platform=$TARGETPLATFORM cosdt/cann:8.0.rc1-910b-ubuntu22.04
Contributor
This can be done automatically by Docker; let's use uname -i to get the architecture info.


 # Copy the rest of the application into the image
 COPY . /app

 # Install the LLaMA Factory
-RUN EXTRA_PACKAGES="torch-npu,metrics"; \
+RUN EXTRA_PACKAGES="metrics"; \
+if [ "$TARGETPLATFORM" == "linux/arm64" ]; then \
Contributor
if [ "$(uname -i)" == "aarch64" ]; then \
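The reviewer's uname -i suggestion can be sketched as a small shell helper. This is illustrative only: the function name pick_extras is hypothetical, and the extras strings follow this thread rather than the final merged Dockerfile.

```shell
# Hypothetical helper: map a machine architecture string to the extras set.
# uname -i typically prints aarch64 on arm64 hosts and x86_64 on amd64 hosts
# (on some distros it can print "unknown" -- verify before relying on it).
pick_extras() {
    if [ "$1" = "aarch64" ]; then
        echo "torch-npu,metrics"
    else
        echo "torch-npu-x86,metrics"
    fi
}

EXTRA_PACKAGES="$(pick_extras "$(uname -i)")"
echo "$EXTRA_PACKAGES"
```

Because Docker executes RUN steps under the target architecture (via QEMU emulation for cross-builds), uname -i reflects the image being built without needing ARG TARGETPLATFORM to be declared in the Dockerfile.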

Owner

@hiyouga hiyouga left a comment


We made some modifications, and this PR can be merged.

@hiyouga hiyouga merged commit a6bf74c into hiyouga:main Jun 27, 2024
@hiyouga hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Jun 27, 2024
@hiyouga
Owner

hiyouga commented Jun 27, 2024

Hi @hashstone , if possible, could you please help us verify whether the problem has been fixed or not, using the current version: https://github.com/hiyouga/LLaMA-Factory/blob/main/docker/docker-npu/Dockerfile ? thanks!

@hashstone
Author

> Hi @hashstone, if possible, could you please help us verify whether the problem has been fixed or not, using the current version: https://github.com/hiyouga/LLaMA-Factory/blob/main/docker/docker-npu/Dockerfile ? thanks!

In my local build environment, cross-building via docker build --platform linux/arm64 or docker build --platform linux/amd64 passes.

@hiyouga
Owner

hiyouga commented Jun 28, 2024

> Hi @hashstone, if possible, could you please help us verify whether the problem has been fixed or not, using the current version: https://github.com/hiyouga/LLaMA-Factory/blob/main/docker/docker-npu/Dockerfile ? thanks!

> In my local build environment, cross-building via docker build --platform linux/arm64 or docker build --platform linux/amd64 passes.

Great news, thanks for the verification

Labels
solved This problem has been already solved
3 participants