gemma 27b slow on v2

I am noticing that gemma 27b is significantly slower on the v2 backend. I don't observe such a large slowdown with other models. 

Here are three sets of runs:

https://wandb.ai/nvidia/nemo-rl?nw=drf8mhln88

* green: v2
* blue, red: v1

<img width="399" height="259" alt="Image" src="https://github.com/user-attachments/assets/bd35afaf-6a25-4939-874f-c2968282cc7a" />

There is some in-run variance in perf (~8%), but the diff from v1 is much larger 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gemma 27b slow on v2 #432

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

gemma 27b slow on v2 #432

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions