Skip to content

Commit

Permalink
Optimize block-sparse forward by up to 15% with pipelining (facebookr…
Browse files Browse the repository at this point in the history
  • Loading branch information
fmassa authored Mar 29, 2023
1 parent 5b7c501 commit 01cffe2
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -197,7 +197,7 @@ def block_sparse_attention(
BLOCK_DMODEL=Lk,
IS_CAUSAL=causal,
num_warps=num_warps,
num_stages=1, # TODO: needs to have num_stages=1 otherwise IMA
num_stages=2,
)

return o, lse
Expand Down

0 comments on commit 01cffe2

Please sign in to comment.