[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument #17677

varun-sundar-rabindranath · 2025-05-05T21:26:39Z

LoRA Triton kernels fail to compile on non-CUDA GPUs because the maxnreg argument is recognized only on CUDA platforms.

Fix: The maxnreg argument is unused, so it is safe to retire it completely.

FIX #16676
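To make the failure mode concrete, here is a minimal sketch in plain Python. It assumes that the launch options were forwarded to the kernel unconditionally and that a non-CUDA backend rejects option names it does not know. The helper names (`launch_kwargs_before`, `launch_kwargs_after`, `unsupported_options`) and the `NON_CUDA_OPTIONS` set are hypothetical and are not taken from vLLM or Triton.

```python
# Illustrative sketch only: every name below is hypothetical, not vLLM's code.


def launch_kwargs_before(num_warps: int, num_stages: int) -> dict:
    """Launch options that always carry maxnreg (assumed pre-fix shape)."""
    return {"num_warps": num_warps, "num_stages": num_stages, "maxnreg": None}


def launch_kwargs_after(num_warps: int, num_stages: int) -> dict:
    """Launch options with the unused maxnreg entry retired."""
    return {"num_warps": num_warps, "num_stages": num_stages}


# Assumed stand-in for a backend that validates option names strictly.
NON_CUDA_OPTIONS = {"num_warps", "num_stages"}


def unsupported_options(kwargs: dict, supported: set) -> list:
    """Return the option names the backend would not recognize."""
    return sorted(name for name in kwargs if name not in supported)


if __name__ == "__main__":
    print(unsupported_options(launch_kwargs_before(4, 2), NON_CUDA_OPTIONS))
    print(unsupported_options(launch_kwargs_after(4, 2), NON_CUDA_OPTIONS))
```

Dropping the key entirely, rather than guarding it with a platform check, matches the reasoning in the description: the value was never used, so there is nothing to preserve on CUDA either.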

github-actions · 2025-05-05T21:26:49Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, they only run fastcheck CI, a small and essential subset of CI tests that quickly catches errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

mergify · 2025-05-05T21:27:13Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @varun-sundar-rabindranath.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

robertgshaw2-redhat · 2025-05-05T21:36:54Z

Thanks Varun!

Syncing midstream NM fork to Upstream tag of [v0.8.5.post1](https://github.com/vllm-project/vllm/tree/v0.8.5.post1) + cherry pick of vllm-project@be633fb needed for benchmarks + [CP](neuralmagic/nm-vllm-ent@1fe447d) for compressed tensor bump + [CP](vllm-project#17677) for lora on AMD + [CP](vllm-project#17315) for llama4 w/ pure dense layers

```
commit 31c73ba (HEAD -> upstream-v0.8.5, nm-fork/upstream-v0.8.5)
Author: Chauncey <chaunceyjiang@gmail.com>
Date:   Wed Apr 30 15:11:04 2025 +0800

    [Bugfix] Fix AttributeError: 'State' object has no attribute 'engine_client' (vllm-project#17434)

    Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

commit f8db0bd
Author: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Date:   Fri May 2 14:01:38 2025 -0400

    [BugFix][Attention] Fix sliding window attention in V1 giving incorrect results (vllm-project#17574)

    Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>

commit e335c34
Author: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Date:   Fri May 2 04:07:03 2025 -0400

    [BugFix] Fix Memory Leak (vllm-project#17567)

    Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>

commit cc463fe
Merge: 1e358ff ba41cc9
Author: Selbi Nuryyeva <selbi@redhat.com>
Date:   Tue Apr 29 12:34:57 2025 -0400

    Merge branch 'tag-upstream-v0.8.5' into upstream-v0.8.5

commit ba41cc9 (tag: v0.8.5, tag-upstream-v0.8.5)
Author: Michael Goin <mgoin64@gmail.com>
Date:   Mon Apr 28 16:20:24 2025 -0600

    [Model] Add tuned triton fused_moe configs for Qwen3Moe (vllm-project#17328)

    Signed-off-by: mgoin <mgoin64@gmail.com>

commit dcbac4c
Author: Simon Mo <simon.mo@hey.com>
Date:   Mon Apr 28 14:12:01 2025 -0700

    [Model] Qwen3 Dense FP8 Compat Fixes (vllm-project#17318)

    Signed-off-by: simon-mo <xmo@berkeley.edu>

[...]
```

Commands

```
git fetch upstream
git checkout -b upstream-v0.8.5
git merge upstream/v0.8.5
git cherry-pick be633fb
```

TEST PLAN

accept sync: https://github.com/neuralmagic/nm-cicd/actions/runs/14841223552

related PR in cicd: neuralmagic/nm-cicd#99

release workflow: https://github.com/neuralmagic/nm-cicd/actions/runs/14845693864

mergify bot added the needs-rebase label May 5, 2025

retire unused maxnreg lora arg

8c5ecb5

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com>

varun-sundar-rabindranath force-pushed the varun/retire-maxnreg branch from a68945d to 8c5ecb5 on May 5, 2025 21:30

mergify bot removed the needs-rebase label May 5, 2025

varun-sundar-rabindranath mentioned this pull request May 5, 2025

[Bugfix] Adding maxnreg to lora expand/shrink kernel definition #17671

Closed

robertgshaw2-redhat approved these changes May 5, 2025

simon-mo merged commit 90bd2ae into vllm-project:main May 6, 2025
27 of 29 checks passed

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 6, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-project#17677)

e335865

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 6, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-project#17677)

28d6b3c

NickLucche mentioned this pull request May 6, 2025

[Misc] Add Next Edit Prediction (NEP) datasets support in benchmark_serving.py #16839

Merged

1 task

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-project#17677)

a589edc

Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

dtrifiro pushed a commit to red-hat-data-services/vllm that referenced this pull request May 13, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-project#17677)

278cc0f

mawong-amd pushed a commit to ROCm/vllm that referenced this pull request May 14, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-project#17677)

a5bae91

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged
