JIT scripting is broken #246
Comments
Ah, torch script is annoyingly fragile.. This part is being completely rewritten by @fmassa, but in the meantime we can probably just remove the assert, it's more of a failsafe really.
Thanks for the issue and repro steps @jramapuram, super helpful!
Indeed; got to love when you have to be verbose about stuff like this: `w, h = tensor.shape[-2:]` (not jit scriptable) vs. `w, h = tensor.shape[-2], tensor.shape[-1]` (jit scriptable). I mean I get why, but it doesn't hurt any less :)
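For anyone hitting the same thing, here is a minimal sketch of the difference (a hypothetical `LastTwoDims` module, not xformers code): TorchScript treats `tensor.shape` as a `List[int]`, and unpacking a slice of it fails because the list length is not statically known, while indexing each dimension explicitly scripts fine.

```python
import torch

class LastTwoDims(torch.nn.Module):
    def forward(self, x: torch.Tensor) -> int:
        # `h, w = x.shape[-2:]` fails under torch.jit.script: shape slicing
        # returns a List[int] whose length TorchScript cannot prove is 2.
        # Indexing the dimensions one by one is accepted.
        h, w = x.shape[-2], x.shape[-1]
        return h * w

scripted = torch.jit.script(LastTwoDims())
print(scripted(torch.randn(2, 3, 4, 5)))  # 20
```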
Thanks @blefaudeux!
Not sure this is resolved, sorry! 😬

```
RuntimeError: Error inferring type for mask: None:
builtin cannot be used as a value:
  File "xformers/components/attention/attention_mask.py", line 128
    def __add__(self, other):
        return AttentionMask(self.values + other.values, is_causal=False)
                                           ~~~~~~~~~~~~ <--- HERE
'AttentionMask.__add__' is being compiled since it was called from '__torch__.xformers.components.attention.attention_mask.AttentionMask'
```
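For context, a minimal sketch of what this kind of error usually means (a hypothetical `Mask` class, not the actual `AttentionMask`): TorchScript assumes un-annotated parameters are Tensors, so `other.values` resolves to the builtin Tensor method `values()` rather than an attribute, and compilation fails with "builtin cannot be used as a value". Annotating the parameter with the class type avoids it.

```python
import torch

@torch.jit.script
class Mask:
    def __init__(self, values: torch.Tensor):
        self.values = values

    # With a bare `other`, TorchScript infers Tensor, and `other.values`
    # becomes the builtin Tensor.values() method, triggering the error above.
    # The explicit "Mask" annotation lets the attribute resolve correctly.
    def __add__(self, other: "Mask") -> "Mask":
        return Mask(self.values + other.values)

@torch.jit.script
def combine(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    return (Mask(a) + Mask(b)).values

print(combine(torch.zeros(2, 2), torch.ones(2, 2)))
```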
Ok, sorry about that. I'll write this down as a unit test, I should have done that from the beginning..
Note that this was discussed previously in #168.
https://github.com/facebookresearch/xformers/tree/jit_with_test covers the attention mask part (suggested by @erip), but torchscript dies on a lot of the flexible constructs which are not easy to get rid of while keeping things inter-compatible (even **kwargs is a no-go, for instance, and there are a lot of these in the wrappers). I had forgotten about that initially, but I think that's right: each of the xformers components can mostly be made torchscriptable (except for the newer dispatch bits being worked on by @fmassa), but the programmatic constructs cannot easily be, and torchscript is on the way out anyway. Thoughts?
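A minimal sketch of the **kwargs limitation mentioned above (a hypothetical `Wrapper` module, not actual xformers code): TorchScript requires statically typed signatures, so a `forward()` taking variable keyword arguments is rejected at compile time.

```python
import torch

class Wrapper(torch.nn.Module):
    def __init__(self, inner: torch.nn.Module):
        super().__init__()
        self.inner = inner

    # Variadic **kwargs cannot be expressed in TorchScript's type system.
    def forward(self, x: torch.Tensor, **kwargs):
        return self.inner(x, **kwargs)

try:
    torch.jit.script(Wrapper(torch.nn.Identity()))
except Exception as err:  # expected: a frontend error about variable arguments
    print(type(err).__name__, err)
```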
I would stick with my initial comment from #168 (comment), that it might be preferable to stay away from torchscript support. Happy to reconsider this decision, but I think the days of torchscript might be numbered in favor of other approaches that work directly on Python.
Only slightly related, but is there something we (non-meta'ers) can read about the sunsetting of torchscript? |
@fmassa: correct me if I'm wrong here, but JIT scripting != module packaging / deployment? I get that most folks use jit-scripting for this use case, but aren't there model optimizations (e.g. inlining for loops, etc.) that also take place with jit-scripting that aren't touched with
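As a small illustration of the scripting-vs-packaging distinction raised above (a sketch, not tied to xformers): torch.jit.script captures the whole computation, control flow included, as an IR graph that later optimization passes can operate on, which plain packaging/serialization of Python modules does not do.

```python
import torch

@torch.jit.script
def running_sum(x: torch.Tensor, n: int) -> torch.Tensor:
    # The loop is captured as part of the TorchScript graph, so the JIT can
    # apply passes (inlining, fusion, ...) that never happen in eager mode.
    out = torch.zeros_like(x)
    for _ in range(n):
        out = out + x
    return out

print(running_sum.graph)  # inspect the captured IR
```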
🐛 Bug
JIT scripting xformers (running commit 357545a) breaks with the following error:
To Reproduce
Blocks config:
Python code to repro: