Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Docs] Fix a bullet list in usage/security.md documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#19358 opened Jun 9, 2025 by windsonsea Loading…
[Bugfix][Core] Prevent token lengths exceeding max_model_len in V0 ready ONLY add when PR is ready to merge/full CI is needed
#19348 opened Jun 9, 2025 by 22quinn Loading…
4 tasks done
[Frontend]Opt beam search
#19347 opened Jun 9, 2025 by zhanggzh Loading…
qwen optimze
#19345 opened Jun 9, 2025 by momo609 Loading…
4 tasks
[CI] Add mteb testing for rerank models ci/build
#19344 opened Jun 9, 2025 by noooop Loading…
[Kernel] Raise verbose error and consolidate num_heads/num_kv_heads divisibility check tpu Related to Google TPUs v1
#19339 opened Jun 9, 2025 by 22quinn Loading…
4 tasks done
Add GLM4.1V model (Draft) documentation Improvements or additions to documentation frontend multi-modality Related to multi-modality (#4194)
#19331 opened Jun 8, 2025 by zRzRzRzRzRzRzR Loading…
[V1] [P/D] Add Support for KV Load Failure Recovery documentation Improvements or additions to documentation v1
#19330 opened Jun 8, 2025 by sdavidbd Loading…
[v1] Support mamba2 v1
#19327 opened Jun 8, 2025 by heheda12345 Loading…
3 tasks done
[full_graph] Fix query_start_loc padding ready ONLY add when PR is ready to merge/full CI is needed v1
#19321 opened Jun 7, 2025 by yinghai Loading…
3 tasks done
v0.9.1
[v1] Add fp32 support to v1 engine through flex attn ready ONLY add when PR is ready to merge/full CI is needed v1
#19319 opened Jun 7, 2025 by Isotr0py Loading…
3 tasks done
[Bugfix] Fix auto dtype casting for BatchFeature ready ONLY add when PR is ready to merge/full CI is needed
#19316 opened Jun 7, 2025 by Isotr0py Loading…
2 of 3 tasks
[Fix] Remove unused opentelemetry-semantic-conventions-ai dependency ci/build documentation Improvements or additions to documentation
#19313 opened Jun 7, 2025 by conroy-cheers Loading…
[V1] Reuse V0's memory_profiling util for gpu worker memory profiling ready ONLY add when PR is ready to merge/full CI is needed v1
#19312 opened Jun 7, 2025 by yeqcharlotte Loading…
3 tasks
[Misc]: refactor: ParallelConfig init func
#19310 opened Jun 7, 2025 by googs1025 Loading…
3 tasks
Use xla flag to improve the quantized model performance ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#19303 opened Jun 6, 2025 by vanbasten23 Loading…
3 tasks done
ProTip! Adding no:label will show everything without a label.