
Skip LayerNorm layer weight quantization during conversion #85

Open
siddhawan opened this issue Aug 29, 2023 · 0 comments
Comments

@siddhawan

I am trying to convert my TorchScript module to TensorRT using torch_tensorrt.compile. Is there an argument to skip the layers that produce warnings during conversion?
This is the warning it gives:

WARNING: [Torch-TensorRT TorchScript Conversion Context] - Running layernorm after self-attention in FP16 may cause overflow. Exporting the model to the latest available ONNX opset (later than opset 17) to use the INormalizationLayer, or forcing layernorm layers to run in FP32 precision can help with preserving accuracy.

Note that I am not exporting to ONNX on my end, since I am using torch_tensorrt directly.
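For reference, here is a minimal sketch of the kind of compile call I am using (the module path, input shape, and precision set are placeholders, not my exact values):

```python
import torch
import torch_tensorrt

# Hypothetical TorchScript module and input shape, for illustration only.
model = torch.jit.load("model.ts").eval().cuda()

trt_module = torch_tensorrt.compile(
    model,
    inputs=[torch_tensorrt.Input((1, 128, 768), dtype=torch.half)],
    # Compiling with FP16 enabled is what triggers the layernorm overflow
    # warning; I assume keeping torch.float32 in this set would let TensorRT
    # fall back to FP32 kernels for precision-sensitive layers, but I am not
    # sure this is the intended way to handle it.
    enabled_precisions={torch.half, torch.float32},
)
```

What I would like to know is whether there is an argument here (or elsewhere in the compile spec) to keep the layernorm layers in FP32, or to skip converting them entirely, as the warning suggests.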
