Skip to content

Struggling to use cuspatial with a large dataset and small GPU memory #1116

Answered by voycey
voycey asked this question in Q&A
Discussion options

You must be logged in to vote

Ok just to update this with something that is working: (I appreciate the help everyone 🙂 - hopefully my pain helps others trying to get a PoC working):
With a bit of help from ChatGPT I understood some of the limitations, primarily dask and building out the cudf Dataframe needs to be done in each worker for both the polygon and the points.

This meant I also had to wrap the function in a separate function so that the serialization using pickle would work internally too (It didn't like doing this with cudf in it directly I think).
Also creating the interleaved column INSIDE the function that was being mapped was obvious in hindsight as it would only work on partitioned data (which I think …

Replies: 5 comments 16 replies

Comment options

You must be logged in to vote
8 replies
@thomcom
Comment options

thomcom May 3, 2023
Collaborator

@isVoid
Comment options

isVoid May 3, 2023
Collaborator

@harrism
Comment options

harrism May 3, 2023
Collaborator

@harrism
Comment options

harrism May 3, 2023
Collaborator

@voycey
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
6 replies
@voycey
Comment options

@harrism
Comment options

harrism May 10, 2023
Collaborator

@voycey
Comment options

@harrism
Comment options

harrism May 10, 2023
Collaborator

@voycey
Comment options

Comment options

You must be logged in to vote
1 reply
@harrism
Comment options

harrism May 10, 2023
Collaborator

Answer selected by jarmak-nv
Comment options

You must be logged in to vote
1 reply
@voycey
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants