-
Notifications
You must be signed in to change notification settings - Fork 701
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(turbomind): memory allocator, object cache, and scheduler integration
enhancement
New feature or request
#4717
opened Jun 29, 2026 by
lzhangzz
Collaborator
Loading…
feat(serve): add --generation-config CLI for server sampling defaults
improvement
#4708
opened Jun 25, 2026 by
lvhan028
Collaborator
Loading…
feat: share multimodal hash helpers
enhancement
New feature or request
#4704
opened Jun 24, 2026 by
CUHKSZzxy
Collaborator
Loading…
Optimize BaseResponseParser streaming and add parser benchmark
improvement
#4697
opened Jun 22, 2026 by
lvhan028
Collaborator
Loading…
fix: gate multimodal preprocessing concurrency
#4687
opened Jun 17, 2026 by
CUHKSZzxy
Collaborator
Loading…
fix: parse multimodal tool messages
Bug:P1
#4680
opened Jun 16, 2026 by
CUHKSZzxy
Collaborator
Loading…
refactor(proxy): split monolithic proxy into modular serve/proxy package
improvement
#4647
opened Jun 4, 2026 by
lvhan028
Collaborator
Loading…
feat: add multimodal and preemption metrics
#4640
opened Jun 1, 2026 by
CUHKSZzxy
Collaborator
Loading…
modify save model in lite module
improvement
#4624
opened May 26, 2026 by
43758726
Contributor
Loading…
feat(turbomind): support priority schedule policy
#4614
opened May 22, 2026 by
4mengy
Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
improvement
#4605
opened May 21, 2026 by
windreamer
Collaborator
Loading…
1 of 4 tasks
[WIP]: Support reuse routed experts on eviction
#4599
opened May 19, 2026 by
RunningLeon
Collaborator
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-28.