rfcs: extend layer norm to support root mean square normalization #3147

mzhukova · 2025-04-23T19:54:52Z

Proposal to extend the current Layer Normalization primitive to support RMS normalization via a new flag.

Read here: https://github.com/uxlfoundation/oneDNN/blob/rfcs/rfcs/20251004-rms-norm/README.md
Implementation: #3068
JIRA: https://jira.devtools.intel.com/browse/MFDNN-13287

// Note: Current proposal is aligned with keras RMS normalization, but not Layer Normalization with rms_scaling due to keras-team/keras#21234.
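For readers less familiar with the distinction, here is a minimal reference sketch of the two normalizations over a single feature vector, in plain Python. It is not the oneDNN implementation: the function names are illustrative, and placing epsilon inside the square root is an assumption, not something stated in this thread.

```python
import math

def layer_norm(x, eps=1e-5):
    """Layer normalization: subtract the mean, divide by the standard deviation."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n
    inv_std = 1.0 / math.sqrt(var + eps)
    return [(v - mean) * inv_std for v in x]

def rms_norm(x, eps=1e-5):
    """RMS normalization: no mean subtraction, divide by the root mean square."""
    n = len(x)
    mean_sq = sum(v * v for v in x) / n
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms for v in x]
```

The only difference is that RMS normalization skips the mean, which is why it can plausibly be exposed as a flag on the existing primitive instead of a separate one.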

mzhukova · 2025-04-24T00:01:24Z

also inviting @pengzhao-intel and @gaurides to take a look

rfcs/20251004-rms-norm/README.md

mzhukova · 2025-04-26T00:19:56Z

Confirmed with @pengzhao-intel and @gaurides that this option is preferred:

(Preferred) Omit mean in the output:
Pros: Cleaner design; avoids an unnecessary memory requirement.
Cons: Requires users to adjust their code to handle the absence of the mean for RMSNorm compared to LayerNorm.
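To make the trade-off concrete, here is a rough sketch of what "omit mean in the output" could look like from the caller's side. The function `norm_forward_training` and its dictionary of outputs are hypothetical and only mimic the shape of a training-mode forward pass. The thread does not say what is stored in the variance slot for RMS normalization, so the sketch assumes it holds the mean of squares.

```python
import math

def norm_forward_training(x, rms=False, eps=1e-5):
    """Hypothetical forward pass returning the result plus saved statistics."""
    n = len(x)
    # With RMS normalization the mean is treated as zero and never computed.
    mean = 0.0 if rms else sum(x) / n
    variance = sum((v - mean) ** 2 for v in x) / n
    inv = 1.0 / math.sqrt(variance + eps)
    outputs = {"dst": [(v - mean) * inv for v in x], "variance": variance}
    if not rms:
        # Only the LayerNorm path allocates and returns the mean.
        outputs["mean"] = mean
    return outputs

ln = norm_forward_training([1.0, 2.0, 3.0])
rms = norm_forward_training([1.0, 2.0, 3.0], rms=True)
assert "mean" in ln and "mean" not in rms
```

Caller code that unconditionally reads the mean would need a branch for the RMS case, which is the cost listed under Cons.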

mgouicem

Regarding open questions:

I would vote for dnnl_use_zero_mean or dnnl_no_mean, as this is closer to how we split dnnl_use_scale and dnnl_use_shift.
I would vote for omitting mean, but it might be worth checking with PyTorch team what the expectation is.
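The naming argument can be illustrated with a small sketch of independent, composable bit flags. Everything here is hypothetical: `NormFlags`, its bit values, and `normalize` are stand-ins, and `NO_MEAN` only mirrors one of the candidate names above. None of it reflects oneDNN's actual enum values or final API.

```python
import math
from enum import IntFlag

class NormFlags(IntFlag):
    # Hypothetical bit values, not oneDNN's actual ones.
    NONE = 0
    USE_SCALE = 1
    USE_SHIFT = 2
    NO_MEAN = 4  # stand-in for a dnnl_no_mean-style flag

def normalize(x, flags, scale=None, shift=None, eps=1e-5):
    """Normalize one feature vector; each flag toggles one independent step."""
    n = len(x)
    mean = 0.0 if flags & NormFlags.NO_MEAN else sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n
    inv = 1.0 / math.sqrt(var + eps)
    out = [(v - mean) * inv for v in x]
    if flags & NormFlags.USE_SCALE:
        out = [o * g for o, g in zip(out, scale)]
    if flags & NormFlags.USE_SHIFT:
        out = [o + b for o, b in zip(out, shift)]
    return out

# RMS normalization with a learned scale and no shift.
y = normalize([3.0, 4.0], NormFlags.NO_MEAN | NormFlags.USE_SCALE, scale=[2.0, 2.0])
```

A flag named after the step it disables composes with scale and shift in the same way those two compose with each other.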

rfcs/20251004-rms-norm/README.md

mzhukova · 2025-05-02T21:32:24Z

Hi @dzarukin, @mgouicem and @gaurides, thank you for your feedback!
I believe I've addressed all current comments. Could you please re-review, close any open threads, and either approve or let me know if anything else is needed?

rfcs/20251004-rms-norm/README.md

gaurides

LGTM

first draft of RFC

cc65a8a

mzhukova self-assigned this Apr 23, 2025

github-actions bot added the RFC (A design document) label Apr 23, 2025

mzhukova mentioned this pull request Apr 23, 2025

add Root Mean Square (RMS) normalization support to Layer Normalization primitive #3068

Merged

mzhukova requested a review from a team April 23, 2025 19:58

dzarukin reviewed Apr 25, 2025

rfcs/20251004-rms-norm/README.md (resolved)

rfcs/20251004-rms-norm/README.md (outdated, resolved)

rfcs/20251004-rms-norm/README.md (outdated, resolved)

mzhukova added 2 commits April 25, 2025 16:10

rfcs: update benchdnn flag name proposal (and minor cleanup)

e5bfa6c

rfcs: minor rewording to avoid confusion

6b403a0

mzhukova requested a review from dzarukin April 26, 2025 00:20

mgouicem reviewed Apr 28, 2025

rfcs/20251004-rms-norm/README.md (outdated, resolved)

rfcs/20251004-rms-norm/README.md (outdated, resolved)

gaurides reviewed Apr 28, 2025

rfcs/20251004-rms-norm/README.md (outdated, resolved)

rfcs: fixed link in the intro section

d779fae

mzhukova requested a review from mgouicem May 2, 2025 00:36

rfcs: various updates due to discussions

4fb9ee2

mzhukova force-pushed the mzhukova/rfcs/20251004-rms-norm branch from 791544a to 4fb9ee2 Compare May 2, 2025 21:20

mzhukova requested a review from gaurides May 2, 2025 21:30

mgouicem reviewed May 6, 2025

rfcs/20251004-rms-norm/README.md (outdated, resolved)

mgouicem approved these changes May 6, 2025

rfcs: adding minor clarification

92e9f5d

gaurides approved these changes May 6, 2025

rfcs: minor clarification wrt keras lnorm with rms_scaling

b739951

mzhukova requested review from dzarukin and removed request for dzarukin May 7, 2025 00:20

dzarukin approved these changes May 7, 2025

rfcs: closing on OQ

aa51a27

mzhukova merged commit e1d1e4b into rfcs May 7, 2025
1 check passed

mzhukova deleted the mzhukova/rfcs/20251004-rms-norm branch May 7, 2025 00:26
