-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Does the repo suport qwen-vl? #728
Comments
Hi, we do have an internal implementation supporting qwen-vl with TensorRT-LLM, it has not been published onto the github yet. Let me sync with the engineers working on that and go back to you later. June |
@juney-nvidia I am also working on qwen-vl-chat. To my best understanding on TensorRT-LLM, the decoder session currently does not support embedding as an input, so I cannot modify the current qwen model to adapt to qwen-vl-chat model. I will be glad to use an official release of qwen-vl-chat. |
I am looking forward to deploy Qwen-vl-chat with trt-llm. |
same strong demands here . |
the same to me |
There might be what you're looking for here |
Thanks! |
How to build qwen-vl using TensorRT-LLM?
The text was updated successfully, but these errors were encountered: