Add topk arg to return topk items and scores at inference step #678
Conversation
Documentation preview: https://nvidia-merlin.github.io/Transformers4Rec/review/pr-678
outputs[name] = task(
    body_outputs, targets=targets, training=training, testing=testing, **kwargs
)
if isinstance(task, NextItemPredictionTask):
Since the NextItemPredictionTask is always something that is created by user code, an alternative to passing top_k to the forward method could be accepting it in the constructor (__init__ method). That way we wouldn't need to have this condition here.
Good point. Adding it to the NextItemPredictionTask constructor might be easier (as I show below), but the reason we did not add this to the constructor is that there would be no way to change the top_k value at the inference step once the model has been trained with that top_k arg. So to make it flexible, we decided to add it to the forward method of NextItemPredictionTask and to the constructor of the Model class.
head = tr.Head(
    body,
    tr.NextItemPredictionTask(weight_tying=True, metrics=metrics, top_k=20),
    inputs=inputs,
)
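For contrast, here is a minimal self-contained sketch of the flexible variant described above. It uses a toy stand-in module (not the real Transformers4Rec task) that accepts top_k in forward, so the same trained weights can serve a different k on every inference call:

import torch

class ToyNextItemTask(torch.nn.Module):
    # Toy stand-in for NextItemPredictionTask; illustrative only.
    def __init__(self, hidden: int = 8, num_items: int = 1000):
        super().__init__()
        self.proj = torch.nn.Linear(hidden, num_items)

    def forward(self, x: torch.Tensor, top_k: int = -1):
        scores = self.proj(x)  # (batch, num_items) relevance scores
        if top_k == -1:        # -1 convention: return all item scores
            return scores
        return torch.topk(scores, k=top_k)  # (scores, item ids) of the k best

task = ToyNextItemTask()
x = torch.randn(4, 8)
all_scores = task(x)             # full score matrix
scores, ids = task(x, top_k=20)  # same weights, k chosen at inference time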
@sararb wanna add something else?
@@ -342,7 +344,11 @@ def forward(self, inputs: torch.Tensor, targets=None, training=False, testing=False):
    # Compute predictions probs
    x, _ = self.pre(x)  # type: ignore

    return x
    if top_k == -1:
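The snippet is truncated at the new condition. A hedged guess at how the added branch might continue (plain PyTorch, not the PR's verbatim code, and the helper name below is hypothetical):

import torch

def _trim_predictions(x: torch.Tensor, top_k: int = -1):
    # -1 keeps the old behaviour and returns every item's score;
    # otherwise torch.topk trims to the k best scores and their item ids.
    if top_k == -1:
        return x
    item_scores, item_ids = torch.topk(x, k=top_k)
    return item_scores, item_ids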
Does top_k need to be a numeric value? Would it work with None as the default value, for example?
I think -1 is a common convention, but we could set it to None instead. @sararb, will setting it to None create any issue?
You're right, it is just a convention. Setting it to None won't create any issue.
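For completeness, the None-default variant being discussed would look about the same (assumed signature, not the merged code):

from typing import Optional
import torch

def select_top_k(scores: torch.Tensor, top_k: Optional[int] = None):
    # None plays the role -1 plays above: return the full score matrix.
    if top_k is None:
        return scores
    return torch.topk(scores, k=top_k)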
rerun tests
1 similar comment
rerun tests
This PR adds functionality for returning the top-k most relevant item ids (those with the highest scores) from Triton Inference Server for the NextItemPredictionTask.
Current blocker:
The code is designed to return top_k item ids (int64 dtype), but model.output_schema returns next_item as float32 dtype, which causes an error from Triton. Shall we change the code base so that model.output_schema matches the expected output and output dtype from Triton? Or shall we return the top_k item id scores instead of the item ids?
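A quick plain-PyTorch illustration of where the dtype split comes from (nothing Transformers4Rec-specific): torch.topk returns float32 values and int64 indices, so a schema that declares a single float32 output cannot describe the item ids:

import torch

scores = torch.randn(2, 100)               # model logits: float32
values, indices = torch.topk(scores, k=5)
print(values.dtype)   # torch.float32, the item scores
print(indices.dtype)  # torch.int64, the item ids that clash with a float32 schema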
Status update:
After modifying the model.output_schema, we can now return two outputs (item_scores, item_ids) from Triton.
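For reference, a hedged sketch of consuming the two outputs on the client side. The tritonclient calls are standard, but the model name, input name, and shapes below are placeholders, and the output names item_ids/item_scores are assumed from the description above:

import numpy as np
import tritonclient.grpc as grpcclient

client = grpcclient.InferenceServerClient(url="localhost:8001")

# Placeholder input; real feature names, shapes, and dtypes come from the model schema.
item_seq = grpcclient.InferInput("item_id-list", [1, 20], "INT64")
item_seq.set_data_from_numpy(np.zeros((1, 20), dtype=np.int64))

requested = [
    grpcclient.InferRequestedOutput("item_ids"),
    grpcclient.InferRequestedOutput("item_scores"),
]
result = client.infer(model_name="t4r_next_item", inputs=[item_seq], outputs=requested)
item_ids = result.as_numpy("item_ids")        # int64 top-k item ids
item_scores = result.as_numpy("item_scores")  # float32 matching scores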
Remaining tasks: