Add optional zero_start_index_M argument to triton fp8 rowwise quantization #8852
Triggered via pull request
January 28, 2025 23:01
Status
Success
Total duration
1h 47m 47s
Artifacts
2
build_wheels_linux_aarch64.yml
on: pull_request
generate-matrix
/
generate
4s
Matrix: build
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
pytorch_FBGEMM__3.9_cpu_aarch64
|
2.69 MB |
|
pytorch_FBGEMM__3.9_cu126_aarch64
|
446 MB |
|