Use deepspeed.comm instead of torch.distributed #6799
Job | Run time |
---|---|
5m 3s | |
5m 3s |