
In HuggingFace, every time I call a pipeline() object, I get a warning:

`"Setting `pad_token_id` to `eos_token_id`:{eos_token_id} for open-end generation."

How do I suppress this warning without suppressing all logging warnings? I want other warnings, but I don't want this one.

Rylan Schaeffer

2 Answers


The warning is emitted for any text-generation task run with HuggingFace Transformers. This is explained here, and you can see the code here. You can avoid the warning by manually setting the pad_token_id to the eos_token_id.

That is, when you call

model.generate(**encoded_input)

just change it to

model.generate(**encoded_input, pad_token_id=tokenizer.eos_token_id)

and that will get rid of the warning. However, I haven't found a way to set this directly from the pipeline interface. I'm guessing you could pass some arguments to the ArgumentHandler, but I haven't tried it.
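
For reference, here is a minimal, self-contained sketch of the generate-level fix described above; the gpt2 checkpoint and the 'test test' prompt are only placeholders for illustration:

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('gpt2')  # placeholder model
model = AutoModelForCausalLM.from_pretrained('gpt2')

encoded_input = tokenizer('test test', return_tensors='pt')

# Passing pad_token_id explicitly keeps the "open-end generation" warning from being logged.
output = model.generate(**encoded_input, pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(output[0], skip_special_tokens=True))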

Jacobo Azcona

For a text-generation pipeline, you need to set the pad_token_id in the generator call to suppress the warning:

from transformers import pipeline

generator = pipeline('text-generation', model='gpt2')
# Passing pad_token_id explicitly suppresses the open-end generation warning.
sample = generator('test test', pad_token_id=generator.tokenizer.eos_token_id)
chicxulub
  • What do you mean by "to suppress the output"? @chicxulub – Yassin Sameh Jun 25 '23 at 11:51
  • I meant not having the generator print out the warning mentioned in the question: "Setting `pad_token_id` to `eos_token_id`:{eos_token_id} for open-end generation." I'll edit the answer to clarify that – chicxulub Jun 25 '23 at 21:49