-
Notifications
You must be signed in to change notification settings - Fork 665
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feature] Add support of new W4A4_LAOS_DYNAMIC quantization method
module:quantization
#5143
opened Dec 17, 2025 by
maxmgrdv
Loading…
fix: use batch_matmul_transpose operator in MLA _v_up_proj for better performance
#5142
opened Dec 17, 2025 by
LICO1314
Loading…
4 tasks
[Perf] Autotune and cache causal_conv1d_fwd launch parameters
module:ops
#5133
opened Dec 17, 2025 by
kiscad
Loading…
[feat][mm]optimize encoder cache by operating with embedding
#5132
opened Dec 17, 2025 by
HF-001
Loading…
[BugFix] Add top_p,top_k in EAGLE e2e
module:tests
ready
read for review
ready-for-test
start test by label for PR
#5131
opened Dec 17, 2025 by
zhaomingyu13
Loading…
[2/N] Remove Pangu Related Code
module:quantization
module:tests
#5130
opened Dec 17, 2025 by
Pr0Wh1teGivee
Loading…
Qwen3-Next:Update the gpu-memory-utilization parameter to 0.7
documentation
Improvements or additions to documentation
#5129
opened Dec 17, 2025 by
ming1212
Loading…
[Doc] Add a perf tune section
documentation
Improvements or additions to documentation
#5127
opened Dec 17, 2025 by
Potabk
Loading…
Add Qwen3-VL-235B tutorials
documentation
Improvements or additions to documentation
#5126
opened Dec 17, 2025 by
luluxiu520
Loading…
[BugFix]Fix incorrect get_current_vllm_config
merge-conflicts
ready
read for review
ready-for-test
start test by label for PR
#5121
opened Dec 17, 2025 by
Angazenn
Loading…
enable npugraph_ex
module:core
module:tests
ready
read for review
ready-for-test
start test by label for PR
#5120
opened Dec 17, 2025 by
panchao-hub
Loading…
ci test
module:ops
ready
read for review
ready-for-test
start test by label for PR
#5119
opened Dec 17, 2025 by
Trunrain
Loading…
[Feat] Support to use fullgraph with eagle
module:core
module:tests
#5118
opened Dec 17, 2025 by
anon189Ty
Loading…
[Feat] Adapt FlashComm2 with PCP
ready
read for review
ready-for-test
start test by label for PR
#5114
opened Dec 17, 2025 by
dsxsteven
Loading…
[Bugfix] install trition for test_custom_op
ci/build
#5112
opened Dec 17, 2025 by
zhangxinyuehfad
Loading…
[test] add w4a8 accuracy case
ci/build
module:tests
#5110
opened Dec 17, 2025 by
ck-hw-1018
Loading…
fixed fused alltoall execute all reduce
module:ops
ready
read for review
ready-for-test
start test by label for PR
#5109
opened Dec 17, 2025 by
AlvisGong
Loading…
CI test
ready
read for review
ready-for-test
start test by label for PR
#5104
opened Dec 16, 2025 by
hust17yixuan
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.