Make FloatToFloat16 conversion 75x faster using SVE2 instructions (#3… #7613
fbgemm_gpu_ci_cuda.yml
on: push
Matrix: build_artifact
Matrix: test_and_publish_artifact
Artifacts
Produced during runtime
Name | Size
---|---
fbgemm_gpu_nightly_cuda_x86_clang_py3.10_cu11.8.0.whl | 237 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.10_cu12.4.1.whl | 454 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.10_cu12.6.3.whl | 451 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.11_cu11.8.0.whl | 237 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.11_cu12.4.1.whl | 454 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.11_cu12.6.3.whl | 451 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.12_cu11.8.0.whl | 237 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.12_cu12.4.1.whl | 454 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.12_cu12.6.3.whl | 451 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.13_cu11.8.0.whl | 237 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.13_cu12.4.1.whl | 454 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.13_cu12.6.3.whl | 451 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.9_cu11.8.0.whl | 237 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.9_cu12.4.1.whl | 454 MB
fbgemm_gpu_nightly_cuda_x86_clang_py3.9_cu12.6.3.whl | 451 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.10_cu11.8.0.whl | 234 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.10_cu12.4.1.whl | 452 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.10_cu12.6.3.whl | 449 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.11_cu11.8.0.whl | 234 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.11_cu12.4.1.whl | 452 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.11_cu12.6.3.whl | 449 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.12_cu11.8.0.whl | 234 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.12_cu12.4.1.whl | 452 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.12_cu12.6.3.whl | 449 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.13_cu11.8.0.whl | 234 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.13_cu12.4.1.whl | 452 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.13_cu12.6.3.whl | 449 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.9_cu11.8.0.whl | 234 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.9_cu12.4.1.whl | 452 MB
fbgemm_gpu_nightly_cuda_x86_gcc_py3.9_cu12.6.3.whl | 449 MB