Installing NVIDIA GPU Operator - Does Bottlerocket come with container toolkit? #3967
-
We do include the NVIDIA container toolkit in Bottlerocket. The GPU Operator tries to do more than you might expect, which is why we don't recommend using it with Bottlerocket. I don't recall whether there is a flag you can pass to the GPU Operator to prevent it from modifying containerd's configuration. What's your use case for the GPU Operator?
-
I'm also interested in using the GPU Operator. My use case is time-sharing GPUs and controlling how many GPU resources each physical GPU is shared out as. I was able to disable the driver and toolkit installation by following NVIDIA's guide, but the GPU Operator does not become healthy. This is from the nvidia-operator-validator/driver-validation pod/container:
If there is a way to control the number of GPU resources on a per-node basis without using the GPU Operator, I'd be interested.
-
I am trying to install the NVIDIA GPU Operator. Bottlerocket does not recommend using it, and I wonder whether that is because of the configuration settings we have to pass. I know we have to add a flag to disable the NVIDIA driver, but what about the container toolkit?
More info:
toolkit.enabled | By default, the Operator deploys the NVIDIA Container Toolkit (nvidia-docker2 stack) as a container on the system. Set this value to false when using the Operator on systems with pre-installed NVIDIA runtimes.
I am using the EKS Bottlerocket NVIDIA AMI.
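For anyone following along, here is a rough sketch of how the two overrides discussed in this thread could be assembled into a `helm install` command. It only builds and prints the command; it does not run Helm. `toolkit.enabled` comes from the documentation quoted above. `driver.enabled` is assumed to be the driver flag mentioned in the post, and the release name, namespace, and chart reference (`gpu-operator`, `nvidia/gpu-operator`) are placeholders rather than anything confirmed here.

```python
import shlex

# Chart values to override.
# toolkit.enabled is quoted from the documentation above; driver.enabled is
# ASSUMED to be the "disable the NVIDIA driver" flag mentioned in the post.
OVERRIDES = {
    "driver.enabled": False,
    "toolkit.enabled": False,
}


def helm_install_command(release, chart, namespace, overrides):
    """Build (but do not run) a `helm install` command as a list of arguments."""
    args = [
        "helm", "install", release, chart,
        "--namespace", namespace, "--create-namespace",
    ]
    for key, value in overrides.items():
        # Helm expects lowercase true/false for boolean values.
        rendered = str(value).lower() if isinstance(value, bool) else str(value)
        args += ["--set", f"{key}={rendered}"]
    return args


if __name__ == "__main__":
    # Release name, chart reference, and namespace are hypothetical placeholders.
    cmd = helm_install_command(
        "gpu-operator", "nvidia/gpu-operator", "gpu-operator", OVERRIDES
    )
    print(shlex.join(cmd))
```

Note that, per the reply above, disabling both components was not enough on its own for the operator to report healthy on Bottlerocket, so treat this as a starting point for experimentation rather than a supported setup.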