-
Notifications
You must be signed in to change notification settings - Fork 197
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update greedy_causal_lm.cpp to read EOS Token #315
Conversation
Made changes to accommodate the dynamic EOS Token
could you please also update beam search sample? EOS token is specified in parameters for beam search |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@anzr299 thx for the PR. Left some comments. Please also check why compilation fails
getting the rt_info from the tokenizer instead of the LLM.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am having some trouble building it locally and hence I continued pushing to check the build. I would require some assistance with building it. |
Yes, sure. Could you please share some details, what kind of errors do you face? When you follow this steps https://github.com/ilya-lavrenov/openvino.genai/tree/ct-beam-search/text_generation/causal_lm/cpp#linuxmacos when the problem appears? During configure or compile stage?
|
@pavel-esir I have made the changes and tested it locally, can you please review it now? |
Made changes according to the review. This is the latest commit.
Fixed the error Related to the tokenizer read_model
@ilya-lavrenov @pavel-esir I have pushed the latest changes, I have fixed the error which appeared earlier regarding reading the model which was due to it being called before the path was set. Also, I have updated beam_causal_lm to include reading the EOS token from the model runtime info. |
Added comments and organised the code better
Changed Beam Search header to accommodate eos token in parameter and removed the comment regarding eos token not implemented
Added EOS token into the parameters
@ilya-lavrenov @pavel-esir I have pushed the latest changes and fixed the problem, could you please review it now? |
@ilya-lavrenov @apaniukov @pavel-esir The specific check that fails is because the model Qwen-7B-Chat's runtime information does not contain an EOS token. Hence the program responds with an error as expected saying that the EOS token is not found in the runtime information. |
Please, update OpenVINO tokenizers submodule https://github.com/openvinotoolkit/openvino.genai/tree/master/thirdparty to catch up latest commit openvinotoolkit/openvino_tokenizers#109 which fixes Qwen-7B-Chat |
I am still updating the tokenizers submodule. |
@ilya-lavrenov I have updated the tokenizer submodule to the latest commit. |
@ilya-lavrenov I have just merged changes from the main repo. I think the error regarding building openvino tokenizer only for speculative decoding and stable diffusions could be fixed with the latest commit. |
@ilya-lavrenov I am looking deeper into this issue. The tokenizer submodule wasn't updated and hence the error regarding the specific header file added in this commit: "openvinotoolkit/openvino_tokenizers@35d7dcb" appears which was added after the old submodule update |
@ilya-lavrenov Is there anything I can assist with in regards to the tokenizer issue? |
Hi @anzr299 |
Done |
thank you for the contribution! |
Details: Made*changes to accommodate the dynamic EOS Token
Tickets: #277 132861