Replies: 1 comment
-
@qixuema hi Xueqi yes, i do believe that is a bug 🐛 , thank you for catching it! the latest commit should be more sound |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello Phil,
I have been exploring the code in the repository and came across a line that piqued my curiosity regarding the calculation of
code_lens
. Specifically, the line is:As I understand it, this line appears to calculate the number of non-padding elements in
codes
. However, I noticed that right after this,code_lens
is used to set theeos_token_id
.Based on the above calculation method, it seems to add the
eos_token_id
at the very beginning of thecodes
sequence, rather than at the end. This seems a bit counterintuitive to me, as typically,eos_token_id
is added at the end of a sequence.Since I'm not very familiar with this part of the functionality, I'm curious about the rationale and logic behind this particular calculation method. Is there a specific reason or scenario for this approach, or am I misunderstanding the code?
Looking forward to your clarification, and thank you for your contributions to this project!
Best regards,
Xueqi
Beta Was this translation helpful? Give feedback.
All reactions