Fix Gemma RMSNorm +1 offset missing on --checkpoint path by psiddh · Pull Request #19901 · pytorch/executorch

psiddh · 2026-05-30T16:02:52Z

The --checkpoint code path skipped the Gemma-specific RMSNorm weight adjustment (weight + 1). Gemma stores norm weights as deviations from 1 and computes (1 + w) * x, but ExecuTorch's RMSNorm computes w * x. The HF download path applied the +1 offset correctly, but passing a converted checkpoint via --checkpoint silently produced garbage output from all 36+ norm layers, regardless of quantization recipe.

##Test Plan

PASS: Gemma 1 2B and Gemma 3 1B run on S23 HTP at 23.5 and 48.6 tok/s after fixing the RMSNorm +1 offset
FAIL: Gemma 2 2B crashes because its attention soft-capping tanh op is unsupported on V73.(Need to test on S25)

pytorch-bot · 2026-05-30T16:02:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19901

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit dce6eca with merge base ec31735 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2026-05-30T16:03:01Z

The committers listed above are authorized under a signed CLA.

✅ login: psiddh / name: Siddartha Pothapragada (bae4e37)

github-actions · 2026-05-30T16:03:41Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copilot

Pull request overview

Fixes incorrect Gemma model behavior when supplying a pre-converted checkpoint via --checkpoint by ensuring Gemma RMSNorm weights are offset by +1 (to match Gemma’s (1 + w) * x convention) on that code path as well.

Changes:

Apply Gemma RMSNorm +1 weight offset when loading weights from --checkpoint.
Keep Gemma model handling consistent between HF-download and --checkpoint loading paths.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

The `--checkpoint` code path skipped the Gemma-specific RMSNorm weight adjustment (`weight + 1`). Gemma stores norm weights as deviations from 1 and computes `(1 + w) * x`, but ExecuTorch's RMSNorm computes `w * x`. The HF download path applied the +1 offset correctly, but passing a converted checkpoint via `--checkpoint` silently produced garbage output from all 36+ norm layers, regardless of quantization recipe.

Copilot AI review requested due to automatic review settings May 30, 2026 16:02

psiddh requested a review from abhinaykukkadapu as a code owner May 30, 2026 16:02

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 30, 2026

Copilot started reviewing on behalf of psiddh May 30, 2026 16:03 View session

psiddh requested review from chenweng-quic, haowhsu-quic, shewu-quic and winskuo-quic May 30, 2026 16:03

Copilot AI reviewed May 30, 2026

View reviewed changes

Comment thread examples/qualcomm/oss_scripts/llama/wrappers/llm_wrappers.py

psiddh force-pushed the main branch from c322788 to bae4e37 Compare May 31, 2026 00:24

Merge branch 'main' into main

dce6eca

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Gemma RMSNorm +1 offset missing on --checkpoint path#19901

Fix Gemma RMSNorm +1 offset missing on --checkpoint path#19901
psiddh wants to merge 2 commits into
pytorch:mainfrom
psiddh:main

psiddh commented May 30, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented May 30, 2026 •

edited

Loading

Uh oh!

linux-foundation-easycla Bot commented May 30, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

psiddh commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19901

✅ No Failures

Uh oh!

linux-foundation-easycla Bot commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 30, 2026

This PR needs a release notes: label

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

psiddh commented May 30, 2026 •

edited

Loading

pytorch-bot Bot commented May 30, 2026 •

edited

Loading

linux-foundation-easycla Bot commented May 30, 2026 •

edited

Loading

This PR needs a `release notes:` label