Pull requests: InternLM/lmdeploy

support disaggregated weight update
#4638 opened May 29, 2026 by irexyc (Collaborator)
Add raw chat completion logprob output (label: improvement)
#4637 opened May 28, 2026 by lvhan028 (Collaborator)
update gated delta rule state layout (label: improvement)
#4636 opened May 28, 2026 by grimoire (Collaborator)
TEST: Improve tool test
#4632 opened May 28, 2026 by littlegy (Contributor)
[WIP] Interleave long-context prefill chunks with decode
#4631 opened May 28, 2026 by grimoire (Collaborator), Draft
1 task
[ci] refactor testcoverage config
#4630 opened May 28, 2026 by zhulinJulia24 (Collaborator)
modify save model in lite module (label: improvement)
#4624 opened May 26, 2026 by 43758726 (Contributor)
Validate final chat response structure (label: planned feature)
#4621 opened May 26, 2026 by lvhan028 (Collaborator)
Refactor prefix caching
#4618 opened May 24, 2026 by grimoire (Collaborator)
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy
3 of 4 tasks
Support dp for qwen35 mtp
#4611 opened May 21, 2026 by RunningLeon (Collaborator)
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer (Collaborator)
1 of 4 tasks
support qwen3.5(vit) inference in turbomind backend (label: enhancement)
#4602 opened May 20, 2026 by irexyc (Collaborator)
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 (Contributor)
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon (Collaborator)
Refactor proxy server (label: improvement)
#4596 opened May 18, 2026 by lvhan028 (Collaborator), Draft
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy (Contributor)
docs(advance): add Add a New Speculative Decoding Method guide (label: documentation)
#4589 opened May 17, 2026 by SuperMarioYL
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen (Collaborator), Draft
Add OpenAI Responses-compatible endpoint (label: enhancement)
#4582 opened May 13, 2026 by CUHKSZzxy (Collaborator)
[security] fix(proxy): require auth for node management
#4579 opened May 11, 2026 by Hinotoi-agent
5 of 9 tasks
feat: configure cudagraph capture batch sizes
#4573 opened May 8, 2026 by CUHKSZzxy (Collaborator)