Skip to content

Pull requests: ModelTC/LightLLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support GLM-5.2 (glm_moe_dsa, DeepSeek-V3.2-style DSA MoE)
#1370 opened Jun 25, 2026 by sufubao Collaborator Loading…
Make moe align fused deterministic
#1369 opened Jun 25, 2026 by hiworldwzj Collaborator Loading…
feat: fuse add and rmsnorm
#1368 opened Jun 24, 2026 by blueswhen Collaborator Loading…
feat: opt fa3 and flashinfer
#1367 opened Jun 23, 2026 by blueswhen Collaborator Loading…
feat(quant): support tensorwise fp8 w8a8 (--quant_type fp8w8a8-pt)
#1366 opened Jun 22, 2026 by sufubao Collaborator Loading…
fix stream fc for qwen3_coder
#1364 opened Jun 18, 2026 by shihaobai Collaborator Loading…
Support default chat template kwargs
#1363 opened Jun 18, 2026 by sufubao Collaborator Loading…
support ds4
#1355 opened Jun 15, 2026 by WANDY666 Contributor Loading…
feat: add gguf support
#1354 opened Jun 15, 2026 by zhangts20 Loading…
feat(qwen3_5_mtp): Qwen3.5 / Qwen3.5-MoE MTP speculative decoding
#1338 opened Jun 9, 2026 by sufubao Collaborator Loading…
feat: update disk cache params and benchmark_multiturn.py
#1333 opened Jun 8, 2026 by blueswhen Collaborator Loading…
Fa4 support
#1327 opened Jun 2, 2026 by blueswhen Collaborator Loading…
add in-process URL pool caching
#1325 opened Jun 1, 2026 by Owleye4 Contributor Loading…
update cpu cache load use async way.
#1318 opened May 25, 2026 by hiworldwzj Collaborator Loading…
support mtp for gemma4
#1316 opened May 22, 2026 by WANDY666 Contributor Loading…
feat(RL): add RL support for verl
#1298 opened May 8, 2026 by shihaobai Collaborator Loading…
import flashqla to speedup gdn prefill
#1295 opened May 8, 2026 by WANDY666 Contributor Loading…
import flashqla and support cudagraph for gdn
#1292 opened May 6, 2026 by WANDY666 Contributor Loading…
Logging colorization + access middleware cleanup + windowed cache stats
#1289 opened May 6, 2026 by sufubao Collaborator Loading…
6 tasks done
[Feature] Add support for Neo++
#1274 opened Apr 17, 2026 by XHPlus Contributor Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.