Pull requests: SemiAnalysisAI/InferenceX

Pull requests list

Update DSV4 GB300 Dynamo vLLM Recipes
#2010 opened Jul 3, 2026 by hjjq (Collaborator)
[WIP] Update Minimax M3 FP4 B200 Eagle
Labels: full-sweep-enabled
#2007 opened Jul 3, 2026 by wzhao18 (Collaborator)
Update Minimax M3 FP4 B300 Eagle
Labels: full-sweep-enabled
#2006 opened Jul 3, 2026 by wzhao18 (Collaborator)
[AMD] MiniMax-M3 FP4/FP8 MI355X ATOMESH (disagg): refactor config & add MTP recipes
Labels: AMD, evals-only (Suppress throughput and run only eval jobs; combine with all-evals to expand selection), full-sweep-enabled
#2000 opened Jul 3, 2026 by seungrokj (Collaborator)
8 tasks
[WIP] Test Kimi 2.5 B300 Agg
Labels: full-sweep-enabled
#1998 opened Jul 3, 2026 by wzhao18 (Collaborator)
chore(deps): bump the github-actions group across 1 directory with 3 updates
Labels: dependencies (Pull requests that update a dependency file), github_actions (Pull requests that update GitHub Actions code)
#1995 opened Jul 3, 2026 by dependabot (Bot)
Update Minimax M3 B300 FP4 vllm
Labels: full-sweep-enabled
#1994 opened Jul 2, 2026 by wzhao18 (Collaborator)
[WIP] [do not merge] Add MiniMax-M3 FP4 B200 Dynamo-vLLM disagg config
Labels: full-sweep-fail-fast-no-canary (Full sweep, no canary gate; first failure in a matrix cancels that matrix)
#1982 opened Jul 2, 2026 by jasonlizhengjian (Collaborator)
test the GB300 cluster after the node patch
Labels: full-sweep-enabled
#1961 opened Jun 30, 2026 by richardhuo-nv (Collaborator)