Skip to content

Actions: sgl-project/sglang

PR Test

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
24,555 workflow runs
24,555 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Optimized deepseek-v3/r1 model performance on mxfp4 run
PR Test #29427: Pull request #9671 synchronize by kkHuang-amd
August 29, 2025 01:02 Action required HaiShaw:dsv3_opt_0822
August 29, 2025 01:02 Action required
[Draft] replace ebnf with jsonschema for tool calls
PR Test #29424: Pull request #9767 synchronize by TJ5
August 29, 2025 00:34 Action required TJ5:tool-call-jsonschema
August 29, 2025 00:34 Action required
Optimize prefill performance on cpu backend (#8750)
PR Test #29423: Commit 5ad296b pushed by zhyncs
August 29, 2025 00:21 8s main
August 29, 2025 00:21 8s
Optimized deepseek-v3/r1 model performance on mxfp4 run
PR Test #29422: Pull request #9671 synchronize by kkHuang-amd
August 29, 2025 00:16 Action required HaiShaw:dsv3_opt_0822
August 29, 2025 00:16 Action required
[MODEL] Apertus and XIELU
PR Test #29421: Pull request #9774 synchronize by EduardDurech
August 29, 2025 00:10 Action required swiss-ai:model/apertus
August 29, 2025 00:10 Action required
[Bug Fix] Resolve CUDA illegal memory access issue in speculative decoding
PR Test #29420: Pull request #9687 synchronize by zhaochenyang20
August 29, 2025 00:09 1h 3m 46s ryang-max:fix_eagle
August 29, 2025 00:09 1h 3m 46s
[Feature] Support custom set kv buffer kernel in srt
PR Test #29419: Pull request #9775 opened by DarkSharpness
August 29, 2025 00:08 1h 3m 2s DarkSharpness:set_kv
August 29, 2025 00:08 1h 3m 2s
[MODEL] Apertus and XIELU
PR Test #29417: Pull request #9774 opened by EduardDurech
August 29, 2025 00:02 Action required swiss-ai:model/apertus
August 29, 2025 00:02 Action required
ROCm 7.0 update
PR Test #29416: Pull request #9757 synchronize by sogalin
August 28, 2025 23:45 Action required sogalin:rocm/update-rocm7.0
August 28, 2025 23:45 Action required
Surface DeepGEMM commit in README.md
PR Test #29415: Pull request #9773 opened by eicherseiji
August 28, 2025 23:40 Action required eicherseiji:improve-deepgemm-docs
August 28, 2025 23:40 Action required
fix: fix MLA for ShardedModelLoader/RemoteModelLoader (#6287)
PR Test #29414: Commit 9f81d74 pushed by zhyncs
August 28, 2025 23:10 53m 56s main
August 28, 2025 23:10 53m 56s
feat(draft_model): support draft_model for RemoteModelLoader (#6407)
PR Test #29413: Commit a38c149 pushed by zhyncs
August 28, 2025 23:09 17s main
August 28, 2025 23:09 17s
[Feature] Support NPUGraph for DeepSeek on Ascend NPU (#9355)
PR Test #29412: Commit 74dd424 pushed by zhyncs
August 28, 2025 23:06 3m 44s main
August 28, 2025 23:06 3m 44s
[HiCache] change the default policy to write through
PR Test #29411: Pull request #9772 opened by xiezhq-hermann
August 28, 2025 23:02 In progress xiezhq-hicache-default
August 28, 2025 23:02 In progress
feat: add tuned fused moe config for GLM-4.5-Air-FP8 tp = 4 on B200 (…
PR Test #29410: Commit dc20c22 pushed by zhyncs
August 28, 2025 23:00 6m 16s main
August 28, 2025 23:00 6m 16s
Support overlap scheduling for speculative decoding
PR Test #29409: Pull request #9588 synchronize by thecodingwizard
August 28, 2025 22:32 49m 23s modal-labs:eagle-overlap
August 28, 2025 22:32 49m 23s
[AMD] Support Hierarchical Caching on AMD GPUs (#8236)
PR Test #29408: Commit 711390a pushed by zhyncs
August 28, 2025 22:27 33m 41s main
August 28, 2025 22:27 33m 41s
Support overlap scheduling for speculative decoding
PR Test #29406: Pull request #9588 synchronize by thecodingwizard
August 28, 2025 21:59 26m 35s modal-labs:eagle-overlap
August 28, 2025 21:59 26m 35s
[docs / oneliner] update mmmu docs instruction
PR Test #29405: Pull request #9768 synchronize by vincentzed
August 28, 2025 21:56 Action required bzhng-development:upd-mmmu-docs
August 28, 2025 21:56 Action required