-
Notifications
You must be signed in to change notification settings - Fork 197
Pull requests: alibaba/rtp-llm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(stream): avoid self-deadlock in waitLoadCacheDone via reportError…
#1045
opened May 27, 2026 by
ZhihanYan
Collaborator
Loading…
docs: fix typos ('comming soon', 'not suport')
#1044
opened May 27, 2026 by
daqiege
Loading…
1 task done
fix(openai): streaming spec compliance — SSE chunk split, content preservation, min_new_tokens enforcement
#1040
opened May 26, 2026 by
aslanxie
Loading…
fix(stream): wake nextOutput() waiter when stream is flipped to Error
#1036
opened May 24, 2026 by
ZhihanYan
Collaborator
Loading…
fix(rocm): preserve column-major layout in GDN qkvz+ba fusion to fix swizzle core dump
#1030
opened May 22, 2026 by
chengshu-lcc
Collaborator
Loading…
feat(kimi-linear): enable tool-call protocol and add e2e smoke
#1029
opened May 22, 2026 by
theNiemand
Collaborator
Loading…
fix(dash_sc): thinking parameter propagation fixes (max_new_think_tokens, FINISH_REASON_LENGTH, 0-budget)
#1028
opened May 21, 2026 by
jianglan89
Collaborator
Loading…
feat: migrate flashinfer renorm kernel
#1027
opened May 21, 2026 by
Vinkle-hzt
Collaborator
Loading…
feat(ci_gate): delegate fork PR review reruns to workflow_run helper and add pre gate
#1025
opened May 21, 2026 by
guoj14
Collaborator
Loading…
feat: add ROCm aiter custom and quick allreduce support
#1010
opened May 18, 2026 by
chengshu-lcc
Collaborator
Loading…
feat: update rtp-kernel for w4a8-opt and sm103a
#999
opened May 13, 2026 by
Bruce-Lee-LY
Collaborator
Loading…
feat: Complete P2PConnector implementation for high-performance PD Disaggregation
#997
opened May 13, 2026 by
ZhihanYan
Collaborator
Loading…
test: add server_args for server_test
#994
opened May 12, 2026 by
zhangjianning-zjn
Collaborator
Loading…
feat(flexlb): add configurable group routing policy
#988
opened May 10, 2026 by
jianglan89
Collaborator
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.