Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Misc] Add Gemma2 GGUF support
#12186 opened Jan 18, 2025 by Isotr0py Draft
[Kernel] add triton fused moe kernel for gptq/awq
#12185 opened Jan 18, 2025 by jinzhen-lin Loading…
[Misc] Add BNB support to GLM4-V model ready ONLY add when PR is ready to merge/full CI is needed
#12184 opened Jan 18, 2025 by Isotr0py Loading…
[torch.compile] store inductor compiled Python file ready ONLY add when PR is ready to merge/full CI is needed
#12182 opened Jan 18, 2025 by youkaichao Loading…
[WIP][Hardware][CPU] testing branch for mlperf ci/build documentation Improvements or additions to documentation needs-rebase
#12141 opened Jan 17, 2025 by bigPYJ1151 Draft
[V1] Add V1 support of Qwen2-VL documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#12128 opened Jan 16, 2025 by ywang96 Loading…
[Misc] Update to Transformers 4.48 ci/build ready ONLY add when PR is ready to merge/full CI is needed
#12120 opened Jan 16, 2025 by tlrmchlsmth Loading…
benchmark_serving support --served-model-name param
#12109 opened Jan 16, 2025 by gujingit Loading…
Use CUDA 12.4 as default for release and nightly wheels ci/build documentation Improvements or additions to documentation
#12098 opened Jan 15, 2025 by mgoin Loading…
[V1] Add notes on test_async_engine.py::test_abort
#12081 opened Jan 15, 2025 by heheda12345 Loading…
[Bugfix] Fix num_heads value for simple connector when tp enabled ready ONLY add when PR is ready to merge/full CI is needed
#12074 opened Jan 15, 2025 by ShangmingCai Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.