# Bump torchao + add unit tests for torchao kernels #9396
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/9396. Note: links to docs will display an error until the docs builds have been completed.

As of commit ad95321 with merge base 7a2a300: ❌ 1 new failure, 1 cancelled job (please retry), 2 pending.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
Thanks 🙏🏻
-d fp32

# Test run
./cmake-out/examples/models/llama/llama_main --model_path=$MODEL_OUT --tokenizer_path=$TOKENIZER --prompt="Once upon a time,"
Should we do a simple non-brittle sanity check? e.g. output length > 0 or something
I think the main concern is whether it runs. If it runs, it will produce output.
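For concreteness, here is a minimal sketch of the kind of non-brittle check discussed above, written in Python for illustration. It assumes the MODEL_OUT and TOKENIZER environment variables from the quoted test command are set; how such a check would be wired into the CI job is an assumption, not part of this PR.

```python
import os
import subprocess

# Paths mirror the quoted test command; MODEL_OUT and TOKENIZER are assumed
# to be exported by the surrounding CI script.
cmd = [
    "./cmake-out/examples/models/llama/llama_main",
    f"--model_path={os.environ['MODEL_OUT']}",
    f"--tokenizer_path={os.environ['TOKENIZER']}",
    "--prompt=Once upon a time,",
]
result = subprocess.run(cmd, capture_output=True, text=True, check=True)

# Non-brittle sanity check: only require that the run produced some output.
assert len(result.stdout.strip()) > 0, "llama_main produced no output"
print(f"Sanity check passed: {len(result.stdout)} characters of output.")
```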
examples/models/llama/README.md (Outdated)

@@ -380,6 +380,79 @@ Please refer to [this tutorial](https://pytorch.org/executorch/main/llm/llama-de

### Android

Please refer to [this tutorial](https://pytorch.org/executorch/main/llm/llama-demo-android.html) for full instructions on building the Android LLAMA Demo App.

## Running with low-bit kernels

We now give instructions for quantizing and running your model with low-bit kernels. These kernels are still experimental and require development on an Arm-based Mac. Also note that low-bit quantization often requires QAT (quantization-aware training) to give good quality results.
Might be worth saying that these don't work with dynamic shapes yet
Added
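For readers unfamiliar with torchao quantization, the sketch below illustrates the general quantize_ pattern that the low-bit kernels build on, using torchao's int8 dynamic activation / int4 weight config. The model, shapes, and group size are placeholders, and the sub-4-bit experimental configs and export flags documented by this PR are not reproduced here.

```python
import torch
from torchao.quantization.quant_api import (
    quantize_,
    int8_dynamic_activation_int4_weight,
)

# Placeholder model: any torch.nn.Module containing Linear layers would do.
model = torch.nn.Sequential(
    torch.nn.Linear(4096, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 4096),
)

# Quantize in place: int8 dynamic activations with group-wise int4 weights.
# group_size=32 is an illustrative choice, not a value taken from this PR.
quantize_(model, int8_dynamic_activation_int4_weight(group_size=32))

# The quantized model can then be exported and lowered with ExecuTorch as usual.
with torch.no_grad():
    out = model(torch.randn(1, 4096))
print(out.shape)
```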
### Summary

This PR bumps the torchao pin and adds unit tests and documentation for the low-bit torchao kernels.

### Test plan

New CI test.