-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multi-gpu support #5
Comments
Thanks for this question! I think multi-GPU works just like single-GPU. Since our search on CIFAR takes a few hours, we did not consider multi-GPU training. However, during our recent work that generalizes P-DARTS in searching on ImageNet directly, we did use 8 GPUs for acceleration. @chenxin061 more experiences to share? |
To search with multiple GPUs, you need to change a few lines in train_search.py.
|
I added code Traceback (most recent call last): |
@zihaozhang9 to fix that |
I did several thing .. model = nn.DataParallel(model) then but still I can not train_search.py on multi gpu, it will still try to overwhelm single gpu then out of memory .... What is wrong here ... ? I am using pytorch1.0.0 python3.6 and get 4 by print(torch.cuda.device_count()) If I use model = nn.DataParallel(model) |
The new version of our code now supports multi-GPU search! |
I see your modification, I did the same to support multi gpu ,what is more , class MixedOp(nn.Module): shall change to class MixedOp(nn.Module): other wise the error that I encountered will still happen... |
@chenxin061 Thanks for sharing your code! Can you confirm whether you used 8 V100 GPUs with 16 GB of memory per card or 8 V100 GPUs with 32 GB memory per card? Thanks! |
@davidrpugh The search code is tested on two P100 GPUs and the evaluating code is tested on 8 V100 with 16GB memory. |
@chenxin061 Thanks! I suspected as much for the V100s. Didn't realize that you used 2 P100s. I was able to complete the search process using CIFAR-10 or CIFAR-100 using a single P100 with 16 GB in between 7-8 hours (as advertised in the paper and README). |
How to use Multi-gpu to search?
The text was updated successfully, but these errors were encountered: