Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix cp inference Bug:P0
#4619 opened May 25, 2026 by irexyc Collaborator Loading…
[WIP] Refactor prefix caching
#4618 opened May 24, 2026 by grimoire Collaborator Draft
fix(turbomind): map Intern-S1 HF checkpoint keys Bug:P1
#4617 opened May 23, 2026 by lvhan028 Collaborator Loading…
Improve health endpoint improvement
#4615 opened May 23, 2026 by lvhan028 Collaborator Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
[WIP]: Support mtp + dp
#4611 opened May 21, 2026 by RunningLeon Collaborator Loading…
fix memleak when input contain large image data Bug:P1
#4610 opened May 21, 2026 by grimoire Collaborator Loading…
TEST: update video test
#4606 opened May 21, 2026 by littlegy Contributor Loading…
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Remove state init improvement
#4604 opened May 20, 2026 by grimoire Collaborator Loading…
fix(vl): reduce multimodal feature memory use Bug:P1
#4603 opened May 20, 2026 by CUHKSZzxy Collaborator Loading…
support qwen3.5(vit) inference in turbomind backend enhancement New feature or request
#4602 opened May 20, 2026 by irexyc Collaborator Loading…
[ci] add k4v2 testcase and fix some fail cases
#4601 opened May 20, 2026 by zhulinJulia24 Collaborator Loading…
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Collaborator Draft
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
Refactor proxy server improvement
#4596 opened May 18, 2026 by lvhan028 Collaborator Draft
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy Contributor Loading…
docs(advance): add Add a New Speculative Decoding Method guide documentation Improvements or additions to documentation
#4589 opened May 17, 2026 by SuperMarioYL Loading…
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen Collaborator Draft
Add OpenAI Responses-compatible endpoint enhancement New feature or request
#4582 opened May 13, 2026 by CUHKSZzxy Collaborator Loading…
[security] fix(proxy): require auth for node management
#4579 opened May 11, 2026 by Hinotoi-agent Loading…
5 of 9 tasks
feat: configure cudagraph capture batch sizes
#4573 opened May 8, 2026 by CUHKSZzxy Collaborator Draft
ProTip! What’s not been updated in a month: updated:<2026-04-25.