Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Arm SVE CGEMM / ZGEMM Natural Kernels #542

Merged
merged 14 commits into from
Oct 9, 2021

Conversation

xrq-phys
Copy link
Collaborator

These new kernels (2vx10) yield a peak efficiency of around 95% on GW4 Isambard.
zgemm_ukr
As GW4 Isambard is observed to be working at a frequency a bit lower than 1.8GHz, these numbers could be a little higher on SC Fugaku (which is very crowded at the moment due to end-of-term job accounting).

Here's also a comparison between nat and 1m:
cgemm_nat
zgemm_nat

The new kernels should compile also with Clang.

@xrq-phys
Copy link
Collaborator Author

In fact I haven't considered writing ZGEMM kernels until I realized there's no gemmt1m. 🤣

But anyway, the kernel's out and it's better than 1m now. :D

@xrq-phys
Copy link
Collaborator Author

Data from Fugaku is out.
Here's a preview:
preview_zgemm_st
preview_zgemm_1s
preview_zgemm_2s

@devinamatthews
Copy link
Member

Excellent!

@devinamatthews
Copy link
Member

@xrq-phys have the cgemm/zgemm kernels gotten the beta == 0 treatment?

@xrq-phys
Copy link
Collaborator Author

xrq-phys commented Oct 5, 2021

@devinamatthews Not yet. Working on *beta == 0 on #533 at the moment.

@devinamatthews devinamatthews marked this pull request as draft October 5, 2021 04:47
@devinamatthews
Copy link
Member

Converted to draft while working on beta == 0. @xrq-phys please convert back to a regular PR when it's ready for merging.

@xrq-phys
Copy link
Collaborator Author

xrq-phys commented Oct 8, 2021

Fixed *beta == 0 check.

Pushing rebased for convenience.

@xrq-phys xrq-phys marked this pull request as ready for review October 8, 2021 03:36
@devinamatthews devinamatthews merged commit 408906f into flame:master Oct 9, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants