-
-
Notifications
You must be signed in to change notification settings - Fork 7.1k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Fix mistral model tests
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#17181
opened Apr 25, 2025 by
DarkLight1337
Loading…
[Bugfix] gemma[2,3] interleaved attention when sliding window is disabled
#17180
opened Apr 25, 2025 by
heheda12345
Loading…
[Bugfix] support local dataset path in benchmark_serving
#17179
opened Apr 25, 2025 by
wubai
Loading…
Add option "--expand-tools-even-if-tool-choice-none"
frontend
#17177
opened Apr 25, 2025 by
okdshin
Loading…
[CI] Add mteb testing to test the accuracy of the embedding model
ci/build
#17175
opened Apr 25, 2025 by
noooop
Loading…
[Bugfix] Modifications to error handling of multiple vllm api endpoints
frontend
#17165
opened Apr 25, 2025 by
tunglinwood
Loading…
[Hardware][Power] Enable compressed tensor W8A8 INT8 quantization for POWER
ci/build
#17153
opened Apr 25, 2025 by
Akashcodes732
Loading…
[Misc] Add gemma3 chat template with pythonic-style function calling
documentation
Improvements or additions to documentation
tool-calling
#17149
opened Apr 25, 2025 by
philipchung
Loading…
Add xLAM tool parser support
documentation
Improvements or additions to documentation
frontend
tool-calling
#17148
opened Apr 25, 2025 by
zuxin666
Loading…
[Frontend] Enforce user input key args to reduce chance of large performance degradation
documentation
Improvements or additions to documentation
frontend
#17145
opened Apr 24, 2025 by
Chenyaaang
Loading…
[Kernel] FP8 quantization fused into V1 Flash Attention
#17143
opened Apr 24, 2025 by
gshtras
Loading…
[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env
#17142
opened Apr 24, 2025 by
jamesjwu
Loading…
[ROCm][FP8][Kernel] FP8 quantization fused into Custom Paged Attention
#17139
opened Apr 24, 2025 by
gshtras
Loading…
[V1][Spec Decode] Make eagle compatible with prefix caching.
v1
#17137
opened Apr 24, 2025 by
LiuXiaoxuanPKU
Loading…
[Misc] Only import amdsmi and _rocm_C on rocm platform
#17136
opened Apr 24, 2025 by
angusYuhao
Loading…
[Docs] Update structured output doc for V1
documentation
Improvements or additions to documentation
structured-output
#17135
opened Apr 24, 2025 by
russellb
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.