Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Albert #203

Closed
wants to merge 58 commits into from
Closed

Conversation

tejasvaidhyadev
Copy link
Member

@tejasvaidhyadev tejasvaidhyadev commented Mar 31, 2020

Hi everyone
I am adding ALBERT [WIP]
Currently only raw code is given in PR.
Dependencies - Transformers.jl , WordTokenizer.jl

I am not exporting any function.I am still in middle of deciding what is the best way to use it.
But i am adding some important codes which is used for conversion of pretrained checkpoints and in Demo file below

Roadmap

  • SentencePiece - containing wordpiece as well as unigram model(python Wrapper (for now) as well as julia implementation (under development))
  • tfckpt2bsonforalbert.jl - for conversion of Tensorflow checkpoint to BSON weights
  • albert transformer - It is not completed but is based on transformers.jl transformer
  • model file - for now is kept inside ALBERT folder but it just the general wrapping structure to load ALBERT pretrain weight
  • APIs - alberttokenizer , albertmasklm , albertforsequenceclassification etc.
  • our own hosted Pretrain model manage by datadeps.jl
  • Documentation, test and Tutorial
  • code and APIs for fine tuning and Data loading
    apart from above refactoring and cleaning of code is remaining

Important links

Pretrained weights link .

  • The pretrained weigths are converted from tensorflow check point released by google-research.
  • The code for conversion is given in tfckpt2bsonforalbert.jl
  • Currently Pretrained weight for Version-1 is given soon I will release it for version-2

For detail refer - link

Demo - link

PS
All the suggestions are welcome

@tejasvaidhyadev
Copy link
Member Author

tejasvaidhyadev commented Apr 2, 2020

Sorry for closing PR before
Commit history of git is now updated

News

Updated Demo

  • Contatins demo of embedding from wordpiece and sentencepiece

  • Demo of conversion of Tensorflow checkpoint to bson file(as desire by Julia flux) - link

@tejasvaidhyadev
Copy link
Member Author

tejasvaidhyadev commented Apr 18, 2020

Pretrained weights

Version 2 of ALBERT converted Bson is released
It doesn't contain 30k-clean.model file (by sentencepiece)

@tejasvaidhyadev
Copy link
Member Author

tejasvaidhyadev commented Apr 23, 2020

@aviks
any suggestion on the roadmap mentioned above.
i am also thinking of adding Tutorial folder (containing ipynb of tutorials)

@tejasvaidhyadev
Copy link
Member Author

added Sentencepiece unigram support

@tejasvaidhyadev
Copy link
Member Author

completed trainable Albert structure.

@tejasvaidhyadev
Copy link
Member Author

fine-tuning Training Tutorial (it's not supported GPU so far)-
here

@tejasvaidhyadev
Copy link
Member Author

tejasvaidhyadev commented Jul 18, 2020

The above code is pretty messy and not yet refractor (for the experiment)
we can drop Sentencepiece as soon as PR of ALBERT is merged
Apart from that pretrain.jl is ready and can drop tfck2bsonforalbert.jl in next push
I will refractor code within next 1 week

@aviks
Copy link
Member

aviks commented Nov 1, 2020

Hi @tejasvaidhyadev can you move this PR to TextModels now please?

@tejasvaidhyadev
Copy link
Member Author

tejasvaidhyadev commented Nov 2, 2020

Hi @tejasvaidhyadev can you move this PR to TextModels now please?
Hi @aviks,

Is it okay, if I will do it the coming weekend?
I have exams this week

@aviks
Copy link
Member

aviks commented Nov 2, 2020

I will do it the coming weekend?

Yes, of course, only whenever you have time.

@rssdev10 rssdev10 closed this Oct 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants