Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Model] Refactor Ovis2 to support original tokenizer documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#17537 opened May 1, 2025 by Isotr0py Review required
[CI/Build] Remove awscli dependency ci/build ready ONLY add when PR is ready to merge/full CI is needed
#17532 opened May 1, 2025 by DarkLight1337 Review required
Move the last arguments in arg_utils.py to be in their final groups ready ONLY add when PR is ready to merge/full CI is needed
#17531 opened May 1, 2025 by hmellor Loading…
[FEAT][ROCm]: Support AITER MLA on V1 Engine ci/build rocm Related to AMD ROCm v1
#17523 opened May 1, 2025 by vllmellm Loading…
[prototype] prioritized block soft pinning/evictions documentation Improvements or additions to documentation frontend v1
#17520 opened May 1, 2025 by simon-mo Draft
[V1] Add num_cached_tokens stats for request output ready ONLY add when PR is ready to merge/full CI is needed v1
#17519 opened May 1, 2025 by simon-mo Loading…
[BugFix] Qwen3 tool calling failed using qwen3 reasoning parser. documentation Improvements or additions to documentation frontend tool-calling
#17506 opened Apr 30, 2025 by Xu-Wenqing Loading…
[V1][Spec Decode] Apply torch.compile & cudagraph to EAGLE3 ready ONLY add when PR is ready to merge/full CI is needed v1
#17504 opened Apr 30, 2025 by zixi-qi Loading…
[Chore] Ignore Ruff warning on E501
#17502 opened Apr 30, 2025 by aarnphm Loading…
[Model] Add GraniteMoeHybrid 4.0 model
#17497 opened Apr 30, 2025 by s3woz Loading…
[TPU] Add kernel test for moe_pallas ci/build ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs
#17496 opened Apr 30, 2025 by mgoin Loading…
Fix arg checking for GGUF/Quark/GPTQMarlin quantized MoE methods bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed
#17491 opened Apr 30, 2025 by mgoin Loading…
Add full API docs and improve the UX of navigating them ci/build documentation Improvements or additions to documentation frontend multi-modality Related to multi-modality (#4194) v1
#17485 opened Apr 30, 2025 by hmellor Loading…
[v1] Pass BlockTable and KVCacheSpec to AttentionMetadataBuilders tpu Related to Google TPUs v1
#17483 opened Apr 30, 2025 by heheda12345 Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.