Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable gpuCI (Jenkins/legacy) webhooks #404

Closed
fjetter opened this issue Nov 13, 2024 · 3 comments
Closed

Disable gpuCI (Jenkins/legacy) webhooks #404

fjetter opened this issue Nov 13, 2024 · 3 comments

Comments

@fjetter
Copy link
Member

fjetter commented Nov 13, 2024

gpuCI is currently not very reliable and it's been a source of friction. Some of this is being discussed here and here. The summary is that the current infrastructure is not reliable and the conversation about new infrastructure is only moving along slowly.

Therefore, I am proposing to disable the existing gpuCI webhooks this Friday, November 15th.

This does not mean that gpuCI is gone forever. There's a proposal to introduce a new gpuCI GH actions runner here #348 There is still some work to be done to align on the final version of this but I'm open to continue this conversation.

cc @dask/gpu

@rjzamora
Copy link
Member

rjzamora commented Nov 13, 2024

I am proposing to disable the existing gpuCI webhooks this Friday, November 15th.

Thanks for the heads up Florian.

Yes - I think this is fine. I'm sorry for not suggesting it earlier in the week. The legacy/Jenkins infrastructure is having serious issues, and we are in the process of standing up something within RAPIDS to run the GPU tests in dask/dask and dask/distributed until a suitable GHA gpuCI solution is available.

Related Note: The @pytest.mark.gpu marker will still be very important to RAPIDS when gpuCI isn't running :)

This does not mean that gpuCI is gone forever. There's a proposal to introduce a new gpuCI GH actions runner here #348 There is still some work to be done to align on the final version of this but I'm open to continue this conversation.

Great - Thanks for being open to this. Having (reliable) gpu-CI is very helpful.

@fjetter
Copy link
Member Author

fjetter commented Nov 15, 2024

this is done

@charlesbluca
Copy link
Member

Had I not been on PTO in the short period I could've objected to this, I would've asked if we could've waited until after some kind of makeshift solution was put in place to do this.

But now that it's done, and I'm assuming there is no going back, I would suggest we go a step further than what @jrbourbeau is doing in dask/dask#11545 dask/distributed#8945 and just outright remove all the gpuCI infra files from the respective repos, as it is unlikely these will be used for whatever replacement we put together and we likely want to decouple that replacement as much as possible from the Dask/Distributed repos to avoid the frictions we saw with Jenkins

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants