Pull requests: vllm-project/vllm
Pull requests list
[BugFix] Fix non detected failing tests
ready
ONLY add when PR is ready to merge/full CI is needed
#30277
opened Dec 8, 2025 by
ilmarkov
5 tasks
[ROCM][CI] Fix AMD Examples Test Group
ci/build
documentation
Improvements or additions to documentation
rocm
Related to AMD ROCm
#30276
opened Dec 8, 2025 by
Concurrensee
[NIXL] refine decoder side post process for heterogeneous BlockSize and kv_layout
kv-connector
v1
#30275
opened Dec 8, 2025 by
xuechendi
5 tasks
[AMD] Amd/deepseek aiter fusions
deepseek
Related to DeepSeek models
needs-rebase
rocm
Related to AMD ROCm
v1
[Bugfix] Temporarily disable group quant rms norm fusion
#30273
opened Dec 8, 2025 by
ElizaWszola
[CI/Build] Use spawn subprocess for ROCm
documentation
Improvements or additions to documentation
rocm
Related to AMD ROCm
#30272
opened Dec 8, 2025 by
rjrock
3 of 5 tasks
[ROCm][CI][Bugfix] Multi-Modal Model Support Fixes and Attention Backend Improvements
ci/build
multi-modality
Related to multi-modality (#4194)
qwen
Related to Qwen models
rocm
Related to AMD ROCm
#30270
opened Dec 8, 2025 by
AndreasKaratzas
[Bugfix] Fix DeepGEMM after #29546
ready
ONLY add when PR is ready to merge/full CI is needed
#30267
opened Dec 8, 2025 by
zhewenl
[Frontend] Fixes anthropic streaming message_start usage nesting
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#30266
opened Dec 8, 2025 by
bbartels
5 tasks
Multiple Hybrid KV Cache Coordinator
v1
#30263
opened Dec 8, 2025 by
roikoren755
3 of 5 tasks
Support TP which is not divided for NVFP4 kernels (flashinfer-cutlass) by adding dynamic padding
nvidia
#30260
opened Dec 8, 2025 by
danielafrimi
[Feature]: OpenTelemetry Metrics Support
v1
#30258
opened Dec 8, 2025 by
mladjan-gadzic
Draft
3 of 5 tasks
[bugfix][quantization] Fix fp8 per_tensor scale shape
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
v1
#30257
opened Dec 8, 2025 by
haoyangli-amd
[ROCm] Use aiter.topk_sigmoid in llama4
llama
Related to Llama models
rocm
Related to AMD ROCm
#30255
opened Dec 8, 2025 by
tpopp
gptq marlin quantization support for fused moe with lora
#30254
opened Dec 8, 2025 by
Bhanu068
3 of 5 tasks
fix: DeepSeek-V3.2 DeepGEMM RuntimeError
deepseek
Related to DeepSeek models
#30251
opened Dec 8, 2025 by
KeeProMise
5 tasks
[gpt-oss] Add model_identity to system message retrieval for harmony chat template
frontend
gpt-oss
Related to GPT-OSS models
#30247
opened Dec 8, 2025 by
lyuwen
5 tasks
[Feature] skip language model in Encoder
qwen
Related to Qwen models
#30242
opened Dec 8, 2025 by
Bounty-hunter
5 tasks
[Bugfix] fix streaming final output for non harmony
frontend
gpt-oss
Related to GPT-OSS models
#30237
opened Dec 8, 2025 by
penfree
Bump actions/stale from 10.1.0 to 10.1.1
ci/build
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#30234
opened Dec 8, 2025 by
dependabot
bot
Bump actions/checkout from 6.0.0 to 6.0.1
ci/build
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#30233
opened Dec 8, 2025 by
dependabot
bot