Torchscript Example for BERT

hikushalhere · January 27, 2022, 6:24am

I am looking at the example for torchscripting BERT-like models here: Exporting 🤗 Transformers Models. I have a basic question about the dummy inputs being passed for tracing which don’t make obvious sense to me.

The input passed is a list containing token_ids and segment_ids (or token_type_ids) which torchscript will unpack. Now, BertModel.forward() expects input_ids and attention_mask as the first and second arguments respectively. So, how why is segment_ids being passed as the second argument for both tracing and later on for inference with the loaded torchscripted model? Does it somehow work because of the flag torchscript=True that’s passed when instantiating the model? If so, how does it work?

nielsr · January 27, 2022, 8:34am

cc’ing @lewtun here

hikushalhere · February 1, 2022, 12:35am

@lewtun any insights here?

lewtun · February 4, 2022, 10:29am

Hey @hikushalhere thanks for raising this issue! This looks like an error in the guide and the only reason the code runs is because the tensor used for segment_ids is similar to what attention_mask should be.

The torchscript=True flag is used to ensure the model outputs are tuples instead of ModelOutput (which causes JIT errors).

Would you like to open a PR to fix the guide?

Topic		Replies	Views
Get "RuntimeError: forward() Expected a value of type 'Tensor' for argument 'input_ids' but instead found type 'list'" while loading Torchscript models Converting from Transformers Beginners	1	2637	February 3, 2022
Converting TF-Bert to Torch using conversion script works, but Beginners	4	768	July 23, 2021
Torchscript vector Input 🤗Transformers	0	281	November 8, 2020
Language generation with torchscript model? 🤗Transformers	6	2551	November 20, 2021
List object has no attribute 'size' error Beginners	0	1331	October 27, 2021

Torchscript Example for BERT

Related topics