fix(parallel.py): fix norm and moe gate gradient reduce check #1416
