fix stream handling in multi-op device tasks #421

evaleev · 2023-09-25T15:25:53Z

resolves #420 and #422

- initial support for multiple devices - introduced device::Stream - sticky handling of streams should allow device tasks with multiple ops (e.g., scale+permute) to work correctly

This reverts commit b4b5997.

…ed with quiet=false

evaleev · 2023-09-27T17:27:16Z

currently um_expressions_suite/*cont* and librett_suite/librett_gpu_mem, e.g.

and

https://gitlab.com/ValeevGroup/tiledarray/-/jobs/5177464289#L1687

…me, not CUDA runtime

…, but in task body so that streams are per-task, not per thread in case a task recursively executes other tasks by doing Future::get(dowork=true)

…upport + MADNESS tag to pull in m-a-d-n-e-s-s/madness#501

…ic decisions and need for locking

This was linked to issues Sep 25, 2023

stream assignment to device tasks should be sticky #420

Closed

support for multiple compute devices / MPI rank #422

Closed

evaleev force-pushed the 420-stream-assignment-to-device-tasks-should-be-sticky branch from 1393bc3 to b4b5997 Compare September 26, 2023 19:58

evaleev added 8 commits September 27, 2023 11:00

relax deviceEnv::current_device_id to support multiple devices per rank

f7b7b42

jumbo "multidevice" bundle

0b2254b

- initial support for multiple devices - introduced device::Stream - sticky handling of streams should allow device tasks with multiple ops (e.g., scale+permute) to work correctly

UM tensor/expression unit tests build for HIP also

23ba34e

decudaify header guard in device_task_fn.h

1d59a36

run unit tests with multiple device streams

75c39be

try reverting "run unit tests with multiple device streams"

7bf0463

This reverts commit b4b5997.

can probe whether TA was initialized to be quiet

957e5cc

device initialization informational messages only logged if initializ…

5ea446b

…ed with quiet=false

evaleev force-pushed the 420-stream-assignment-to-device-tasks-should-be-sticky branch from ca59376 to 5ea446b Compare September 27, 2023 15:35

evaleev added 2 commits September 27, 2023 18:42

convert librett unit tests to use default stream provided by TA runti…

fdaf8bc

…me, not CUDA runtime

stream to use for syncing by madness tasks is no longer stored in TLS…

37ee448

…, but in task body so that streams are per-task, not per thread in case a task recursively executes other tasks by doing Future::get(dowork=true)

evaleev force-pushed the 420-stream-assignment-to-device-tasks-should-be-sticky branch from de1c368 to 2134056 Compare September 28, 2023 18:14

bump Umpire tag to bump up to its latest commit that provides C++20 s…

dfa3f76

…upport + MADNESS tag to pull in m-a-d-n-e-s-s/madness#501

evaleev force-pushed the 420-stream-assignment-to-device-tasks-should-be-sticky branch from 2134056 to dfa3f76 Compare September 28, 2023 18:25

evaleev added 2 commits September 28, 2023 17:31

ReduceTask choose device stream in round-robin fashion to avoid dynam…

089dcc6

…ic decisions and need for locking

clone(DistArray) supports device-based arrays

4b79b5a

evaleev force-pushed the 420-stream-assignment-to-device-tasks-should-be-sticky branch from c612b71 to 4b79b5a Compare September 28, 2023 23:47

evaleev merged commit 3ce7fdc into master Sep 29, 2023
8 checks passed

This was referenced Sep 29, 2023

Replace suffix cuda by suffix gpu in directory and file names #387

Closed

cuTT issue with callback #271

Closed

evaleev deleted the 420-stream-assignment-to-device-tasks-should-be-sticky branch September 22, 2024 12:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix stream handling in multi-op device tasks #421

fix stream handling in multi-op device tasks #421

evaleev commented Sep 25, 2023 •

edited

Loading

evaleev commented Sep 27, 2023 •

edited

Loading

fix stream handling in multi-op device tasks #421

fix stream handling in multi-op device tasks #421

Conversation

evaleev commented Sep 25, 2023 • edited Loading

evaleev commented Sep 27, 2023 • edited Loading

evaleev commented Sep 25, 2023 •

edited

Loading

evaleev commented Sep 27, 2023 •

edited

Loading