Make FloatToFloat16 conversion 75x faster using SVE2 instructions #7596

Triggered via pull request January 29, 2025 02:26
Status Success
Total duration 1h 42m 38s
Artifacts 20

fbgemm_gpu_ci_rocm.yml

on: pull_request
Matrix: build_artifact
Matrix: test_and_publish_artifact

Artifacts

Produced during runtime
Name                                                      Size
fbgemm_gpu_nightly_rocm_x86_clang_py3.10_rocm6.2.4.whl    127 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.10_rocm6.3.whl      122 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.11_rocm6.2.4.whl    127 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.11_rocm6.3.whl      122 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.12_rocm6.2.4.whl    127 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.12_rocm6.3.whl      122 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.13_rocm6.2.4.whl    127 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.13_rocm6.3.whl      122 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.9_rocm6.2.4.whl     127 MB
fbgemm_gpu_nightly_rocm_x86_clang_py3.9_rocm6.3.whl       122 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.10_rocm6.2.4.whl      127 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.10_rocm6.3.whl        122 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.11_rocm6.2.4.whl      127 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.11_rocm6.3.whl        122 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.12_rocm6.2.4.whl      127 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.12_rocm6.3.whl        122 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.13_rocm6.2.4.whl      127 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.13_rocm6.3.whl        122 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.9_rocm6.2.4.whl       127 MB
fbgemm_gpu_nightly_rocm_x86_gcc_py3.9_rocm6.3.whl         122 MB