ModelRunnerCpp does not transfer SamplingConfig Tensor fields correctly #1183
Hello team,

the tensorrt_llm.runtime.SamplingConfig defines multiple fields as either scalar or torch.Tensor, for example random_seed, top_k or top_p. Inside ModelRunnerCpp these fields are all assumed to be scalar and are wrapped into lists with a single item, see here. Thus, when using ModelRunnerCpp it is not possible to set independent values for each batch entry; trying to do so fails with an error.

Looking at the C++ SamplingConfig code, it should be supported to provide a list with one entry per batch item. Accordingly, it would be necessary to cast torch.Tensors to lists; a sketch of this conversion follows below.

Additionally, the parameters top_p_decay, top_p_min and top_p_reset_ids are scalar in tensorrt_llm.runtime.SamplingConfig, but are supposed to be vectors of length batch size in the C++ implementation of the config.

It would be great if you could have a look at this and possibly fix it. Thank you!
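For illustration, the requested conversion could look roughly like the sketch below. The helper name to_per_batch_list and the length check are assumptions, not the actual ModelRunnerCpp code; the point is the torch.Tensor → tolist() cast alongside the existing single-item wrapping of scalars.

```python
import torch

def to_per_batch_list(value, batch_size):
    """Normalize a sampling field to the list form the C++ runtime expects.

    Hypothetical helper for illustration only: scalars keep the existing
    single-item wrapping, tensors are converted with tolist() and
    length-checked against the batch size.
    """
    if value is None:
        return None
    if isinstance(value, torch.Tensor):
        values = value.flatten().tolist()  # e.g. tensor([1, 2, 3, 4]) -> [1, 2, 3, 4]
        if len(values) != batch_size:
            raise ValueError(f"expected {batch_size} entries, got {len(values)}")
        return values
    return [value]  # scalar: the single-item wrapping ModelRunnerCpp already does

# One independent seed per batch entry, plus a scalar top_k:
seeds = to_per_batch_list(torch.tensor([1, 2, 3, 4]), batch_size=4)  # [1, 2, 3, 4]
top_k = to_per_batch_list(5, batch_size=4)                           # [5]
```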
Comments

@Funatiq, could you please take a look at this? I believe that everything should be supported for this on the C++ side. Only …

Same issue.

The random_seed should be allowed to be set to values such as [1, 2, 3, 4] when batch_size is greater than 1. I need to fall back to a Python session to make it work, but I would prefer to use a C++ session.
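For illustration, the usage this commenter is after might look like the sketch below. The end_id/pad_id values are placeholders, and setting the fields as attributes after construction is an assumption about the API rather than documented usage:

```python
import torch
from tensorrt_llm.runtime import SamplingConfig

# Desired behaviour: one independent seed per batch entry (batch_size = 4).
sampling_config = SamplingConfig(end_id=2, pad_id=2)  # placeholder token ids
sampling_config.random_seed = torch.tensor([1, 2, 3, 4], dtype=torch.int64)
sampling_config.temperature = 0.8  # scalars should keep working as before
```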
The fix will be included in the next push to main (ETA Mar 19).

@Funatiq @kaiyux Thank you for processing this so quickly. I took a look at it today, but I still cannot pass a random_seed tensor; the corresponding conversion is missing the tolist(). For the other fields of SamplingConfig (temperature, frequency_penalty, ...) the tolist() is there. Could you please take a look? Thank you.

You're right, we somehow missed that.