
Keras wrapper for blocksparse layers #18

Open · wants to merge 4 commits into master

Conversation

@ThomasHagebols commented Nov 30, 2018

As requested in #14 and following up on an email with @scott-gray

Added a Keras blocksparse layer with an example Jupyter notebook. The notebook trains a simple network on CIFAR-10 to a test accuracy of 54% (60% as of the latest commit).

As of now it does not support saving the model. Maybe someone else knows how to fix that? (Fixed in the last commit.)

It also doesn't support eager execution. I'll follow up on that and create an issue later, since it seems to be a problem in the code outside of this commit.

I put the code in the examples directory. If you think it's a good addition, I could also move it into the blocksparse module directory.
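For reference, here is a simplified sketch of the general shape such a wrapper can take, written against the BlocksparseMatMul usage from the README. The layer name `SparseDense`, its constructor arguments, and the random layout initialization are illustrative, not a verbatim copy of the code in this PR:

```python
import numpy as np
from tensorflow import keras
from blocksparse.matmul import BlocksparseMatMul

class SparseDense(keras.layers.Layer):
    """Dense layer with a block-sparse weight matrix (hypothetical example)."""

    def __init__(self, units, block_size=32, density=0.5, **kwargs):
        super(SparseDense, self).__init__(**kwargs)
        self.units = units
        self.block_size = block_size
        self.density = density

    def build(self, input_shape):
        in_dim = int(input_shape[-1])
        # Random block layout: 1 = block present, 0 = block absent.
        self.layout = (np.random.rand(in_dim // self.block_size,
                                      self.units // self.block_size)
                       < self.density).astype(np.int32)
        self.bsmm = BlocksparseMatMul(self.layout, block_size=self.block_size)
        # BlocksparseMatMul exposes the packed weight shape as w_shape.
        self.w = self.add_weight(name="w",
                                 shape=self.bsmm.w_shape,
                                 initializer="glorot_uniform",
                                 trainable=True)
        super(SparseDense, self).build(input_shape)

    def call(self, inputs):
        # y = x @ W, where W is stored as packed non-zero blocks only.
        return self.bsmm(inputs, self.w)

    def compute_output_shape(self, input_shape):
        return (input_shape[0], self.units)
```

Stacking a few of these in a `keras.Sequential` model is enough to reproduce the kind of simple network the notebook trains.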

@ThomasHagebols (Author)

I made some new commits that add the functionality to save a model.

I also updated the Jupyter notebook and trained a (really) simple MLP to 60% accuracy on CIFAR-10.
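The standard Keras mechanism for this is to serialize the layer's constructor arguments in `get_config` and supply the class via `custom_objects` at load time. A sketch, assuming the hypothetical `SparseDense` layer above:

```python
# Added to the hypothetical SparseDense layer from the earlier sketch:

    def get_config(self):
        # Serialize constructor arguments so keras.models.load_model
        # can rebuild the layer from the saved file.
        config = super(SparseDense, self).get_config()
        config.update({"units": self.units,
                       "block_size": self.block_size,
                       "density": self.density})
        return config

# The custom class then has to be supplied explicitly at load time:
# model.save("sparse_mlp.h5")
# model = keras.models.load_model("sparse_mlp.h5",
#                                 custom_objects={"SparseDense": SparseDense})
```

Note that with a random layout generated in `build()`, a faithful reload also needs the layout itself serialized (or a fixed seed); otherwise the restored packed weights won't line up with a freshly regenerated sparsity pattern.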

@scott-gray (Contributor)

Sorry for not replying earlier. I've been doing a lot of development on blocksparse-related things (mostly related to learned sparsity), though I don't think I've made any interface-breaking changes for you. For image data I was planning on providing some separable conv kernels to go with the bs_matmul ops. That way you can sparsify just the feature conjunctions (the 1d_conv ops), where the spatial conjunctions are already sparse. But you should still be able to simulate local spatial sparsity with a carefully constructed pure bs_matmul layout.

Anyway, I should be able to get to this request soon.
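To illustrate the last point: one way to get local spatial sparsity out of a pure bs_matmul layout is a band-diagonal block layout, where each block row only connects to nearby block columns. This is just a sketch of the idea, not necessarily the exact construction you'd want:

```python
import numpy as np

def band_layout(n_blocks, bandwidth=1):
    """Band-diagonal block layout: block row i connects only to block
    columns within `bandwidth` of i, mimicking local spatial connectivity."""
    layout = np.zeros((n_blocks, n_blocks), dtype=np.int32)
    for i in range(n_blocks):
        lo = max(0, i - bandwidth)
        hi = min(n_blocks, i + bandwidth + 1)
        layout[i, lo:hi] = 1
    return layout

# e.g. a 128x128-block weight where each block row sees 3 neighbouring columns;
# the result can be fed straight into BlocksparseMatMul(layout, block_size=32).
layout = band_layout(128, bandwidth=1)
```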

@ThomasHagebols (Author)

Cool, I noticed some recent commits. Looking forward to playing around with the new features. Depthwise separable convs are an interesting approach to reducing parameters. I saw that in MobileNet (v1) 75% of the parameters are in the 1x1 convolutions, so sparsifying those might bring significant improvements. :D

Haha, if it breaks I'll find out eventually. Maybe I should write tests to check for issues at a later stage.

I saw that you added a prune method to BlocksparseMatMul. That was meant for removing blocks, right? I couldn't really figure out how to use it. I think it would be interesting to add the possibility of learned sparsity to the wrapper. You said before that adding blocks is hard from a memory-management point of view. Would this also be the case if you remove some blocks and add others (with the constraint that the number of blocks removed >= the number of blocks added)? Depending on the complexity, I think it would be cool if I could add this functionality to the Keras wrapper.
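To make the idea concrete, here is a hypothetical sketch of the kind of layout update I mean, done outside the kernels with simple magnitude pruning. The helper and its inputs are made up for illustration, not the actual prune API:

```python
import numpy as np

def swap_blocks(layout, block_norms, n_swap):
    """Hypothetical layout update: drop the n_swap active blocks with the
    smallest norm and activate n_swap currently-empty blocks at random,
    keeping the total block count (and packed weight size) constant.
    block_norms: per-block weight norms, same shape as layout."""
    new_layout = layout.copy()
    flat_layout = layout.ravel()
    flat_norms = block_norms.ravel()
    active = np.flatnonzero(flat_layout == 1)
    empty = np.flatnonzero(flat_layout == 0)
    # Drop the weakest active blocks...
    weakest = active[np.argsort(flat_norms[active])[:n_swap]]
    new_layout.flat[weakest] = 0
    # ...and activate an equal number of previously-empty blocks.
    grown = np.random.choice(empty, size=n_swap, replace=False)
    new_layout.flat[grown] = 1
    return new_layout
```

A new BlocksparseMatMul would then be built from `new_layout` and the surviving packed blocks copied across; since blocks removed >= blocks added, the packed weight buffer never needs to grow.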
