[ExecuTorch][WebGPU] test: document bicol decode-GEMV coverage in q4gsw configs#20598
[ExecuTorch][WebGPU] test: document bicol decode-GEMV coverage in q4gsw configs#20598JulianCloudNTH wants to merge 1 commit into
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20598
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit e2806c4 with merge base b331ebd ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
Stack from ghstack (oldest at bottom):
The existing M==1 q4gsw configs (q/kv/gate/down_proj) exercise the new bicol 2-column decode GEMV via the handler's M==1 routing, each reading its own per-column scale across many K-groups. Adds comments to the config tables documenting this and the N % 8 == 0 op constraint -- odd-N is un-exportable through torchao's scale layout, so there is no odd-N config and bicol's
has1odd-N guard is defensive (mirrors coop4's general-N robustness). No new configs.Differential Revision: D110017691