You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Now,sparse decode mla kernel can achieve 350T flops yet on Blackwell,do we have some plan to opt it?Now we are opt it and achieve 500T flops yet, and still is working to 1000Tflops in near future. If we both work for this, can we have some possibility to work together to opt it to 1000T flops.