test: sync qwen3 samples to pypto main #799
Code Review
This pull request updates the Qwen3 decode PTO kernels for both A3 and A5 targets. The generic @qwen3_decode_incore_* function names have been remapped to descriptive kernel names (such as @q_proj, @gate_proj, and @softmax) to align with the current pypto-lib/main raw kernel names. Additionally, minor adjustments were made to tile buffer allocation addresses and constants in qwen3_decode_incore_5.pto and rope_kv_cache.pto, and the README files were updated accordingly. There are no review comments provided, and I have no additional feedback on these changes.
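A quick way to confirm the rename described above is to scan the refreshed samples for leftover generic symbols. The following is a minimal sketch, not part of the PR: it assumes the .pto files are plain text and that any stale kernel appears as a symbol of the form @qwen3_decode_incore_<suffix>. The helper names find_stale_kernel_names and scan_directory are hypothetical.

```python
import re
from pathlib import Path

# Assumption: a stale kernel is any symbol written as @qwen3_decode_incore_<suffix>.
# The leading "@" keeps file names such as qwen3_decode_incore_5.pto from matching.
STALE_NAME = re.compile(r"@qwen3_decode_incore_\w+")


def find_stale_kernel_names(pto_text: str) -> list[str]:
    """Return the sorted, de-duplicated generic kernel symbols found in pto_text."""
    return sorted(set(STALE_NAME.findall(pto_text)))


def scan_directory(sample_dir: str) -> dict[str, list[str]]:
    """Map each .pto file name in sample_dir to its stale symbols (hypothetical helper)."""
    results: dict[str, list[str]] = {}
    directory = Path(sample_dir)
    if not directory.is_dir():
        return results
    for path in sorted(directory.glob("*.pto")):
        # Assumption: .pto payloads are UTF-8 text.
        stale = find_stale_kernel_names(path.read_text(encoding="utf-8"))
        if stale:
            results[path.name] = stale
    return results
```

An empty result from scan_directory("test/samples/Qwen3DecodeA3") would suggest that no generic function names remain in that directory.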
Codex Review
This comment is updated automatically by the review bot.
Summary: Review failed at stage Findings. No structured findings were generated because the review process failed early.
Summary
- Sync test/samples/Qwen3DecodeA3 and test/samples/Qwen3DecodeA5 to the current pypto-lib/main models/qwen3/32b/qwen3_32b_decode.py output at ccbdc4fa5cafd1eda7784c9585f9dc876791778b
- .pto payloads from the latest upstream raw kernel names
Validation
- a2a3 and a5 from pypto-lib/main
- .pto files with local ptoas using --pto-level=level3 with --pto-arch=a3 and --pto-arch=a5
Scope note
- DeepseekV4DecodeA3/A5
- pypto-lib/main DeepSeek V4 decode flow now exports a much finer-grained kernel set than the 7 vendored PTOAS sample cases, so syncing it cleanly needs a separate sample/golden/board-validation migration instead of a straight file refresh
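For anyone repeating the ptoas check from the Validation notes, the sketch below shows one way it could be scripted. Only the flags --pto-level=level3, --pto-arch=a3 and --pto-arch=a5 come from this PR. The rest is assumed: that ptoas takes the input file as a positional argument, that the .pto files sit directly in each sample directory, and that Qwen3DecodeA3 pairs with a3 and Qwen3DecodeA5 with a5. Check these against the actual CLI before relying on it.

```python
import subprocess
from pathlib import Path

# Assumption: each sample directory is checked with the matching --pto-arch value.
SAMPLE_ARCHES = {
    "test/samples/Qwen3DecodeA3": "a3",
    "test/samples/Qwen3DecodeA5": "a5",
}


def build_ptoas_command(pto_file, arch, ptoas="ptoas"):
    """Build one ptoas invocation; the positional input file is an assumption."""
    return [ptoas, "--pto-level=level3", f"--pto-arch={arch}", str(pto_file)]


def validate_samples(repo_root=".", dry_run=True):
    """Collect (and, unless dry_run is set, execute) one command per .pto file."""
    commands = []
    for sample_dir, arch in SAMPLE_ARCHES.items():
        directory = Path(repo_root) / sample_dir
        if not directory.is_dir():
            continue  # skip sample sets that are not checked out
        for pto_file in sorted(directory.glob("*.pto")):
            cmd = build_ptoas_command(pto_file, arch)
            commands.append(cmd)
            if not dry_run:
                # check=True raises CalledProcessError on a non-zero exit code.
                subprocess.run(cmd, check=True)
    return commands
```

Calling validate_samples() with the default dry_run=True only returns the command lists, so they can be inspected before anything is run.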