Open
Description
Thanks for the great work!
What is `generate_mode` in the `forward()` function of your modified `Qwen2_5_VLForConditionalGeneration`? To my understanding, shouldn't it be set to `True` in `prepare_inputs_for_generation()`, under the `if cache_position[0] != 0:` branch? Otherwise, the model seems to recompute the latent embeddings at every step of generation. Or am I missing something here?
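To make the question concrete, here is a minimal toy sketch of the behavior I would expect. It is not the repository's code: `ToyModel`, `greedy_generate`, and `latent_computations` are hypothetical names, `cache_position` is a plain list rather than a tensor, and it assumes that `generate_mode=True` means "skip the latent-embedding pass", which is exactly the part I am unsure about.

```python
# Toy stand-in, NOT the actual Qwen2_5_VLForConditionalGeneration code.
# Assumption: generate_mode=True tells forward() to skip the latent pass.

class ToyModel:
    def __init__(self):
        # Counts how many times the (expensive) latent embeddings are computed.
        self.latent_computations = 0

    def forward(self, input_ids, generate_mode=False):
        if not generate_mode:
            # Prefill path: latent embeddings are computed here.
            self.latent_computations += 1
        # Dummy "next token": last input token plus one.
        return input_ids[-1] + 1

    def prepare_inputs_for_generation(self, input_ids, cache_position):
        model_inputs = {"input_ids": input_ids, "generate_mode": False}
        if cache_position[0] != 0:
            # Decoding step: the prompt is already cached, so only the newest
            # token is fed and the latent pass is skipped.
            model_inputs["input_ids"] = input_ids[-1:]
            model_inputs["generate_mode"] = True
        return model_inputs


def greedy_generate(model, prompt, max_new_tokens):
    """Hypothetical generation loop mimicking prefill followed by decoding."""
    tokens = list(prompt)
    for step in range(max_new_tokens):
        # First step covers the whole prompt; later steps only the new token.
        start = 0 if step == 0 else len(tokens) - 1
        cache_position = list(range(start, len(tokens)))
        inputs = model.prepare_inputs_for_generation(tokens, cache_position)
        tokens.append(model.forward(**inputs))
    return tokens


model = ToyModel()
output = greedy_generate(model, [1, 2, 3], max_new_tokens=4)
```

In this sketch the latent pass runs only once, during prefill. Without the `generate_mode = True` assignment in the `cache_position[0] != 0` branch, the counter would increase on every decoding step, which is the behavior I think I am seeing.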
Thanks a lot!