Pull requests: vllm-project/vllm
[Docs] Fix a bullet list in usage/security.md
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#19358
opened Jun 9, 2025 by
windsonsea
[Frontend] Add tqdm_leave_pbar to control progress bar visibility
frontend
#19357
opened Jun 9, 2025 by
reidliu41
4 tasks
[CI][Structured Output] Refactor test_struct_output_generate.py to make the code cleaner
v1
#19354
opened Jun 9, 2025 by
shen-shanshan
1 of 4 tasks
[Bugfix][Core] Prevent token lengths exceeding max_model_len in V0
ready
ONLY add when PR is ready to merge/full CI is needed
#19348
opened Jun 9, 2025 by
22quinn
4 tasks done
[P/D][Bugfix]: Fix the metadata corruption issue in Nixl when TP > 1.
#19341
opened Jun 9, 2025 by
chaunceyjiang
[Kernel] Raise verbose error and consolidate num_heads/num_kv_heads divisibility check
tpu
Related to Google TPUs
v1
#19339
opened Jun 9, 2025 by
22quinn
4 tasks done
[WIP][Wheel Size] Only build FA2 8.0+PTX
ci/build
#19336
opened Jun 9, 2025 by
LucasWilkinson
Add GLM4.1V model (Draft)
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
#19331
opened Jun 8, 2025 by
zRzRzRzRzRzRzR
[V1] [P/D] Add Support for KV Load Failure Recovery
documentation
Improvements or additions to documentation
v1
#19330
opened Jun 8, 2025 by
sdavidbd
[v1] Add fp32 support to v1 engine through flex attn
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19319
opened Jun 7, 2025 by
Isotr0py
3 tasks done
[Bugfix] Fix auto dtype casting for BatchFeature
ready
ONLY add when PR is ready to merge/full CI is needed
#19316
opened Jun 7, 2025 by
Isotr0py
2 of 3 tasks
[Fix] Remove unused opentelemetry-semantic-conventions-ai dependency
ci/build
documentation
Improvements or additions to documentation
#19313
opened Jun 7, 2025 by
conroy-cheers
[V1] Reuse V0's memory_profiling util for gpu worker memory profiling
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19312
opened Jun 7, 2025 by
yeqcharlotte
3 tasks
Use xla flag to improve the quantized model performance
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
v1
#19303
opened Jun 6, 2025 by
vanbasten23
3 tasks done
Add optional token-level progress bar to LLM.beam_search using tqdm
frontend
#19301
opened Jun 6, 2025 by
NekoMimiUnagi
3 tasks done