Hi, thanks for your great work!
My question is about these two lines:
if tokenizer.eos_token_id not in tokenizer.encode(response, add_special_tokens=False):
and
if tokenizer.eos_token_id not in tokenizer.encode(response, add_special_tokens=False):
In the first line, "response" comes from "all_responses".
But in the second line, "response" comes from "query".
If a response exceeds the maximum generation length, it is truncated.
In that case, none of the "response" entries in "all_responses" contain "<|end▁of▁sentence|>".
But the second "response" will contain "<|end▁of▁sentence|>".
So the result of the second check will be inconsistent with the first.
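
To make the concern concrete, here is a minimal sketch of how the same condition can give different answers for the same generation. It does not use the repository's code: `ToyTokenizer`, `lacks_eos`, and the two example strings are hypothetical stand-ins, and the assumption that the text taken from "query" carries the EOS marker while the entry in "all_responses" does not is taken from the description above.

```python
EOS = "<|end▁of▁sentence|>"


class ToyTokenizer:
    """Hypothetical stand-in for a real tokenizer: whitespace split, EOS has id 1."""

    eos_token_id = 1

    def __init__(self):
        self._vocab = {EOS: self.eos_token_id}

    def encode(self, text, add_special_tokens=False):
        # Separate the EOS marker from neighbouring text, then split on whitespace.
        pieces = text.replace(EOS, f" {EOS} ").split()
        # Unknown pieces get fresh ids starting at 2, so they never collide with EOS.
        return [self._vocab.setdefault(p, len(self._vocab) + 1) for p in pieces]


def lacks_eos(tokenizer, response):
    # Same condition as the two lines quoted in this issue.
    return tokenizer.eos_token_id not in tokenizer.encode(response, add_special_tokens=False)


tokenizer = ToyTokenizer()

# Assumption: the entry in "all_responses" is plain text without the EOS marker.
from_all_responses = "The answer is 42"
# Assumption: the text recovered from "query" still ends with the EOS marker.
from_query = "The answer is 42" + EOS

first_check = lacks_eos(tokenizer, from_all_responses)
second_check = lacks_eos(tokenizer, from_query)
print(first_check, second_check)
```

Under these assumptions the first check reports a missing EOS and the second does not, even though both strings describe the same generation.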