-
Notifications
You must be signed in to change notification settings - Fork 246
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ELECTRA
backbone implementation in keras
#1291
Conversation
/gcbrun |
One minor point, make sure to format your code with the format script. |
Thanks for the pr! Not sure what is going on with serialization on tests either, but will try to find some time to debug soon. |
Ah figured this out. You are seeing failures in the serialization tests, which basically check that You need to make sure all the parameters you return from Once that is fixed up, you should be able to save and test saving. (though may be other issues on top) |
thanks I made the changes accordingly and tests in |
I tried to work on saving of model in h5 format but it still fails with the error |
I don't think that would be. I'm not exactly sure why we are getting the error. But we do want this is h5 format and not the tf format. I'll try to run the colab here soon and help debug! |
Thanks! Very helpful and this is indeed a puzzling error. Not sure what exactly broke here. @nkovela1 will take on that issue. |
any updates on this, can we get this merged so I can add tokenizer in the next PR |
@pranavvp16 thanks! And good to know. Let's pull this in. Very sorry about the delay here. It's been a hectic time for the team with the Keras 3 release last week. But it is out! @nkovela1 would still be good to look at that |
/gcbrun |
Thank you!! |
I almost have successfully implemented the
electra_backbone
in keras, but there are 2 problems remaining to solve1 - When
test_backbone_basics
is run it throws errorReversibleEmbbeding
is not json serialisable but at the same time tests from BERT pass even though it also implements the ReversibleEmbedding class.2- The second problem is although outputs of
hf_model
andkeras_model
match, I'm unable to save the keras model successfully.notebookplease can someone guide me to fix these above problems , #1281