Skip to content

Actions: sgl-project/sglang

VLLM Dependency Test

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
14,882 workflow runs
14,882 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Refactor mem_pool_host deallocation to main thread
VLLM Dependency Test #18823: Pull request #9302 synchronize by pansicheng
August 29, 2025 02:01 In progress pansicheng:converge-free
August 29, 2025 02:01 In progress
enable llama3.1-8B on xpu
VLLM Dependency Test #18822: Pull request #9434 synchronize by huaiyuzh
August 29, 2025 01:59 Action required huaiyuzh:huaiyuzh/enable_llama3.1
August 29, 2025 01:59 Action required
[CPU] Fix issues when running llama3.2-11B vision model with image tasks
VLLM Dependency Test #18821: Pull request #8666 synchronize by Alcanderian
August 29, 2025 01:58 Action required blzheng:beilei/llama3.2_11b_vision
August 29, 2025 01:58 Action required
[feat] Add P/D attention select for draft model
VLLM Dependency Test #18818: Pull request #9755 synchronize by zhyncs
August 29, 2025 01:36 Action required Ximingwang-09:pd_attention_select_for_draft
August 29, 2025 01:36 Action required
bugfix(hicache): Move exists check before key suffixing (#9749)
VLLM Dependency Test #18817: Commit 38cd5fb pushed by zhyncs
August 29, 2025 01:29 6m 9s main
August 29, 2025 01:29 6m 9s
[HiCache] change the default policy to write through (#9772)
VLLM Dependency Test #18816: Commit 001f519 pushed by zhyncs
August 29, 2025 01:28 1m 26s main
August 29, 2025 01:28 1m 26s
Optimized deepseek-v3/r1 model performance on mxfp4 run
VLLM Dependency Test #18815: Pull request #9671 synchronize by kkHuang-amd
August 29, 2025 01:02 Action required HaiShaw:dsv3_opt_0822
August 29, 2025 01:02 Action required
[HiCache] resolve conflict between chunked-prefill and hicache hit count
VLLM Dependency Test #18813: Pull request #9776 opened by xiezhq-hermann
August 29, 2025 00:50 7m 16s xiezhq-hicache-chunk
August 29, 2025 00:50 7m 16s
[Draft] replace ebnf with jsonschema for tool calls
VLLM Dependency Test #18812: Pull request #9767 synchronize by TJ5
August 29, 2025 00:34 Action required TJ5:tool-call-jsonschema
August 29, 2025 00:34 Action required
Optimized deepseek-v3/r1 model performance on mxfp4 run
VLLM Dependency Test #18811: Pull request #9671 synchronize by kkHuang-amd
August 29, 2025 00:16 Action required HaiShaw:dsv3_opt_0822
August 29, 2025 00:16 Action required
[MODEL] Apertus and XIELU
VLLM Dependency Test #18810: Pull request #9774 synchronize by EduardDurech
August 29, 2025 00:10 Action required swiss-ai:model/apertus
August 29, 2025 00:10 Action required
[Bug Fix] Resolve CUDA illegal memory access issue in speculative decoding
VLLM Dependency Test #18809: Pull request #9687 synchronize by zhaochenyang20
August 29, 2025 00:09 7m 0s ryang-max:fix_eagle
August 29, 2025 00:09 7m 0s
[Feature] Support custom set kv buffer kernel in srt
VLLM Dependency Test #18808: Pull request #9775 opened by DarkSharpness
August 29, 2025 00:08 6m 16s DarkSharpness:set_kv
August 29, 2025 00:08 6m 16s
[MODEL] Apertus and XIELU
VLLM Dependency Test #18806: Pull request #9774 opened by EduardDurech
August 29, 2025 00:02 Action required swiss-ai:model/apertus
August 29, 2025 00:02 Action required
fix: fix MLA for ShardedModelLoader/RemoteModelLoader (#6287)
VLLM Dependency Test #18805: Commit 9f81d74 pushed by zhyncs
August 28, 2025 23:10 5m 49s main
August 28, 2025 23:10 5m 49s
feat(draft_model): support draft_model for RemoteModelLoader (#6407)
VLLM Dependency Test #18804: Commit a38c149 pushed by zhyncs
August 28, 2025 23:09 17s main
August 28, 2025 23:09 17s
[Feature] Support NPUGraph for DeepSeek on Ascend NPU (#9355)
VLLM Dependency Test #18803: Commit 74dd424 pushed by zhyncs
August 28, 2025 23:06 3m 44s main
August 28, 2025 23:06 3m 44s
[HiCache] change the default policy to write through
VLLM Dependency Test #18802: Pull request #9772 opened by xiezhq-hermann
August 28, 2025 23:02 6m 3s xiezhq-hicache-default
August 28, 2025 23:02 6m 3s
feat: add tuned fused moe config for GLM-4.5-Air-FP8 tp = 4 on B200 (…
VLLM Dependency Test #18801: Commit dc20c22 pushed by zhyncs
August 28, 2025 23:00 5m 41s main
August 28, 2025 23:00 5m 41s
Support overlap scheduling for speculative decoding
VLLM Dependency Test #18800: Pull request #9588 synchronize by thecodingwizard
August 28, 2025 22:32 5m 52s modal-labs:eagle-overlap
August 28, 2025 22:32 5m 52s
[AMD] Support Hierarchical Caching on AMD GPUs (#8236)
VLLM Dependency Test #18799: Commit 711390a pushed by zhyncs
August 28, 2025 22:27 5m 59s main
August 28, 2025 22:27 5m 59s