Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Refactor ci/build needs-rebase tpuRelated to Google TPUsv1
#17950 openedMay 10, 2025by yarongmu-google Draft
[Bugfix] Add revision to transformers.Auto*.from_pretrained processors readyONLY add when PR is ready to merge/full CI is needed
#17948 openedMay 10, 2025by xinli-centml Loading…
[v1] Support multiple KV cache groups in GPU model runner tpuRelated to Google TPUsv1
#17945 openedMay 10, 2025by heheda12345 Loading…
[doc] list the hf downloaded models documentationImprovements or additions to documentation
#17940 openedMay 10, 2025by reidliu41 Loading…
[WIP] Fix Misleading Error Messages
#17938 openedMay 10, 2025by mengbingrock Loading…
[doc] update lora doc documentationImprovements or additions to documentationreadyONLY add when PR is ready to merge/full CI is needed
#17936 openedMay 10, 2025by reidliu41 Loading…
[Bugfix] Avoid repeatedly creating dummy data during engine startup multi-modalityRelated to multi-modality (#4194)v1
#17935 openedMay 10, 2025by DarkLight1337 Loading…
[BugFix] Set default random seed to 0 for V1
#17929 openedMay 10, 2025by WoosukKwon Loading…
[Frontend] [Core] Add Tensorizer support for LoRA adapter serialization and deserialization documentationImprovements or additions to documentation
#17926 openedMay 9, 2025by sangstar Loading…
[WIP][Misc] Add Ray Prometheus logger to V1 v1
#17925 openedMay 9, 2025by eicherseiji Loading…
WIP: fix_llama4_tool_call documentationImprovements or additions to documentationfrontend tool-calling
#17917 openedMay 9, 2025by wukaixingxp Draft
[Bugfix][V1] Only get input embeddings w/ multi-modal models if first PP readyONLY add when PR is ready to merge/full CI is neededv1
#17916 openedMay 9, 2025by jinhuang12 Loading…
[Misc] Add compressed-tensors NVFP4A16 emulation support quantization readyONLY add when PR is ready to merge/full CI is needed
#17914 openedMay 9, 2025by dsikka Loading…
[BugFix][AMD] Compatible for AITER lib after 04/20
#17912 openedMay 9, 2025by qli88 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.