Issues: microsoft/Tutel
Issues list
How expert parameters are distributed in the cluster when using the Tutel framework?
#251 opened Oct 30, 2024 by luuck
How to load 32-experts Swin-transformer-moe on a 2-GPU machine.
#248 opened Oct 27, 2024 by ywxsuperstar
How to convert checkpoint files that adapt to different distributed world sizes
#246 opened Aug 27, 2024 by swjtulinxi
[Question] Why use datatype ncclInt8 in nccl_all_to_all_scatter_async.
#220 opened Dec 18, 2023 by cicirori
How to implement Fairseq-MoE training checkpoint like Swin-MoE?
#219 opened Nov 10, 2023 by withinmiaov
Non-surface function utilities only work for contiguous input data
#218 opened Nov 6, 2023 by lyd126
ImportError: cannot import name 'tutel_custom_kernel' from 'tutel.impls.jit_compiler'
environmental issue
#198 opened Mar 30, 2023 by zhaojiancheng007
tutel/jit_kernels/sparse.py torch.float16 There is a bug in the calculation: the cuda calculation result is inconsistent with the CPU calculation result and the array is out of bounds
invalid
#196 opened Mar 8, 2023 by WsqRichards1
How the experts' gradients are handled under data parallelism?
#192 opened Dec 26, 2022 by yzs981130
[installation errors] fatal error: nccl.h: No such file or directory
#189 opened Oct 19, 2022 by qianyuzqy