Added evaluation script for qualcomm LlamaModel #11663

rohansjoshi · 2025-06-14T00:49:33Z

Summary:
Script for evaluating models which follow qualcomm's LlamaModel definition, on lm eval harness tasks such as WikiText

Results for WikiText evaluation task:

Model Name	max_seq_len	word_perplexity
Llama 1B Instruct	128	34.82890030691187
Llama 1B Instruct	512	22.919538703371582

Differential Revision: D76634688

pytorch-bot · 2025-06-14T00:49:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11663

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 74603d2 with merge base 8cfa858 ():

NEW FAILURES - The following jobs have failed:

Lint / lintrunner / linux-job (gh)
RuntimeError: Command docker exec -t d1bb9c330570b9dad7e199ffefc6b067aa64cbe9b38b8569657f9369f81a7a23 /exec failed with exit code 127
pull / unittest-editable / macos / macos-job (gh)
backends/xnnpack/test/ops/test_conv1d.py::TestConv1d::test_qs8_conv1d_batchnorm_seq

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-06-14T00:50:01Z

This pull request was exported from Phabricator. Differential Revision: D76634688

github-actions · 2025-06-14T00:50:34Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Summary: Pull Request resolved: pytorch#11663 Script for evaluating models which follow qualcomm's LlamaModel definition, on lm eval harness tasks such as WikiText Results for WikiText evaluation task: | Model Name | max_seq_len | word_perplexity |----------|----------|----------| | Llama 1B Instruct | 128 | 34.82890030691187 | | Llama 1B Instruct | 512 | 22.919538703371582 | Differential Revision: D76634688

facebook-github-bot · 2025-06-14T00:56:17Z

This pull request was exported from Phabricator. Differential Revision: D76634688

cccclai

Thank you for adding the eval scripts. @shewu-quic @haowhsu-quic fyi we're adding eval scripts here and trying to improve the accuracy for ptq

rohansjoshi requested a review from cccclai as a code owner June 14, 2025 00:49

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 14, 2025

facebook-github-bot added the fb-exported label Jun 14, 2025

rohansjoshi force-pushed the export-D76634688 branch from 246afd1 to 74603d2 Compare June 14, 2025 00:56

cccclai approved these changes Jun 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added evaluation script for qualcomm LlamaModel #11663

Added evaluation script for qualcomm LlamaModel #11663

Uh oh!

rohansjoshi commented Jun 14, 2025

Uh oh!

pytorch-bot bot commented Jun 14, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

github-actions bot commented Jun 14, 2025

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

cccclai left a comment •

edited

Loading

Uh oh!

Uh oh!

Added evaluation script for qualcomm LlamaModel #11663

Are you sure you want to change the base?

Added evaluation script for qualcomm LlamaModel #11663

Uh oh!

Conversation

rohansjoshi commented Jun 14, 2025

Uh oh!

pytorch-bot bot commented Jun 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11663

❌ 2 New Failures

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

github-actions bot commented Jun 14, 2025

This PR needs a release notes: label

Uh oh!

facebook-github-bot commented Jun 14, 2025

Uh oh!

cccclai left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pytorch-bot bot commented Jun 14, 2025 •

edited

Loading

This PR needs a `release notes:` label

cccclai left a comment •

edited

Loading