Since version 1.94, when the context window overflows with SWA and FastForwarding enabled, I get an error (see the screenshot) instead of prompt processing, as happened in older versions (v1.93.2).
Model: gemma-3-12b-it-q4_0_s, with all context memory and layers in video memory, and all ffn_up and ffn_gate tensors overridden to CPU RAM. Other settings are standard.