Actions: EmbeddedLLM/vllm

pre-commit

143 workflow runs

[Misc] small update (#20462)
pre-commit #143: Commit a7bab0c pushed by tjtanaa
July 4, 2025 05:48 7m 39s main
MLA
pre-commit #142: Pull request #70 synchronize by vllmellm
July 3, 2025 14:51 4m 55s deepseek00-mla
MLA
pre-commit #141: Pull request #70 opened by tjtanaa
July 3, 2025 12:32 5m 1s deepseek00-mla
[V1] Only print cudagraph tqdm on rank 0 with is_global_first_rank
pre-commit #140: Commit be250bb pushed by kliuae
July 1, 2025 06:11 6m 21s main
[Docs] Fix 1-2-3 list in v1/prefix_caching.md (#20243)
pre-commit #139: Commit 3ee56e2 pushed by tjtanaa
June 30, 2025 12:10 6m 14s main
[Bugfix] Skip loading extra parameters for modelopt Qwen3 MoE model (…
pre-commit #138: Commit f5dfa07 pushed by vllmellm
June 30, 2025 09:27 6m 30s main
[CI Fix] Try fixing eagle e2e test OOM by reducing block allocation (…
pre-commit #137: Commit 7b1895e pushed by tjtanaa
June 29, 2025 13:42 6m 11s main
[CI/Build] Allow hermetic builds (#18064)
pre-commit #136: Commit 3c545c0 pushed by tjtanaa
June 28, 2025 05:09 6m 12s main
[Bugfix] Fix flaky failure when getting DP ports (#20151)
pre-commit #135: Commit 4ab3ac2 pushed by vllmellm
June 27, 2025 09:00 7m 38s main
[CI Failure] Fix OOM with test_oot_registration_embedding (#20144)
pre-commit #134: Commit 71799fd pushed by vllmellm
June 27, 2025 03:41 6m 14s main
[Bugfix] Fix Mistral tool-parser regex for nested JSON (#20093)
pre-commit #133: Commit 754b00e pushed by tjtanaa
June 26, 2025 02:41 7m 18s main
Fix: Check the type of params to be a Sequence not list. (#19910)
pre-commit #132: Commit 8ca81bb pushed by tjtanaa
June 21, 2025 02:43 6m 7s main
[Bugfix][Ray] Set the cuda context eagerly in the ray worker (#19583)
pre-commit #131: Commit 5e666f7 pushed by vllmellm
June 20, 2025 07:05 6m 26s main
[Doc] Update V1 user guide for embedding models (#19842)
pre-commit #130: Commit 6f68c49 pushed by tjtanaa
June 19, 2025 15:41 6m 20s main
[Frontend] Add optional token-level progress bar to LLM.beam_search
pre-commit #129: Commit 466166d pushed by tjtanaa
June 19, 2025 08:03 7m 8s main
Use xla flag to improve the quantized model performance (#19303)
pre-commit #128: Commit 9af6d22 pushed by vllmellm
June 10, 2025 03:40 6m 11s main
[Frontend] Remove unreachable code from llm.py (#19288)
pre-commit #127: Commit 8335667 pushed by vllmellm
June 9, 2025 04:19 6m 11s main
[Bugfix] Fix EAGLE vocab embedding construction for Llama 70B (#19033)
pre-commit #126: Commit 3465b87 pushed by vllmellm
June 6, 2025 02:20 6m 56s main
[TPU] Skip hanging tests (#19115)
pre-commit #125: Commit 8e972d9 pushed by vllmellm
June 4, 2025 09:04 6m 12s main
[Bugfix] Fix FA3 full cuda graph correctness (#19106)
pre-commit #124: Commit b124e10 pushed by vllmellm
June 4, 2025 06:30 6m 2s main
[Bugfix][Nixl] Fix DP Metadata Handshake (#19008)
pre-commit #123: Commit b9f61e1 pushed by vllmellm
June 2, 2025 05:29 6m 2s main
[Bugfix] Fix for issue 17396 (#18773)
pre-commit #122: Commit f2c3f66 pushed by tjtanaa
May 31, 2025 13:53 6m 3s main
improve the robustness of parsing vlms config in AutoRound (#18894)
pre-commit #121: Commit 3de3ead pushed by vllmellm
May 30, 2025 03:02 5m 54s main
Fixes a dead link in nightly benchmark readme (#18856)
pre-commit #120: Commit fd7bb88 pushed by tjtanaa
May 29, 2025 07:33 7m 9s main
[V1] fix torch profiling for V1 offline scenarios (#18445)
pre-commit #119: Commit 774c5fd pushed by vllmellm
May 28, 2025 04:53 6m 41s main