Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add --pooling in TBE nbit_cpu benchmark #2200

Closed
wants to merge 2 commits into from

Conversation

sryap
Copy link
Contributor

@sryap sryap commented Dec 8, 2023

Summary: As title

Differential Revision: D51963691

Sarunya Pumma added 2 commits December 7, 2023 16:09
Summary:

Instead of using the ref implementation for sequence embedding on CPU,
this diff directs TBE to invoke the AVX implementation of pooled TBE
by forcing pooling factors of 1 (i.e., passing `at::arange(index_size +
1)`
as offfsets).  The performance gained from using the AVX
implementation offsets the overhead incurs in creating the new
offsets.

Reviewed By: jspark1105, YazhiGao

Differential Revision: D51918878
Summary: As title

Differential Revision: D51963691
Copy link

netlify bot commented Dec 8, 2023

Deploy Preview for pytorch-fbgemm-docs canceled.

Name Link
🔨 Latest commit 091d42a
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/65725eb2ec8cef00087f0c51

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D51963691

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 8724d89.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants