[Question] Running custom Encoder Decoder model #2491

AvivSham · 2024-11-24T14:03:06Z

Hi All,
Thank you for your amazing work.
We have an encoder decoder model we want to run using TensorRT-LLM. We made an architectural modification by pooling the encoder's output dim using stacked MLP layers.
What is the recommended way of modifying the code to support the new architecture? We assume that we need to change the code to convert the model (to a static computation graph) and run it.

Please advice,

hello-11 · 2024-11-25T07:59:02Z

@AvivSham you can follow this guide.

hello-11 added question Further information is requested triaged Issue has been triaged by maintainers labels Nov 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Running custom Encoder Decoder model #2491

[Question] Running custom Encoder Decoder model #2491

AvivSham commented Nov 24, 2024

hello-11 commented Nov 25, 2024

[Question] Running custom Encoder Decoder model #2491

[Question] Running custom Encoder Decoder model #2491

Comments

AvivSham commented Nov 24, 2024

hello-11 commented Nov 25, 2024