[GCU] Enable gcu CI #3190
Conversation
Force-pushed from 800027d to 069474d
Thanks for your contribution!
Force-pushed from 03f8018 to 00a565f
LGTM
Force-pushed from f75d8eb to 5d9aa0c
@@ -675,7 +675,7 @@ def initialize_attn_backend(self) -> None:
            )
            self.share_inputs["decoder_batch_ids"] = paddle.full([int(decode_max_tile_size)], 0, dtype="int32")
            self.share_inputs["decoder_tile_ids_per_batch"] = paddle.full([int(decode_max_tile_size)], 0, dtype="int32")
-           self.share_inputs["decoder_num_blocks_cpu"] = paddle.full([1], 0, dtype="int32").pin_memory()
Why was this removed outright? After the removal it becomes a GPU tensor. If pinned memory is not used, shouldn't `.cpu()` at least be added?
This tensor is ultimately consumed by the `get_block_shape_and_split_kv_block` kernel.
Done
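To illustrate the placement question in this thread, here is a minimal sketch using a hypothetical `FakeTensor` stand-in (the real code uses `paddle.Tensor`, whose `.pin_memory()` and `.cpu()` methods move data to page-locked and pageable host memory, respectively; the class and `full` helper below are illustrative only, not PR code):

```python
class FakeTensor:
    """Hypothetical stand-in that only tracks where a tensor lives."""

    def __init__(self, data, place="gpu"):
        self.data = list(data)
        self.place = place  # "gpu", "cpu", or "pinned"

    def pin_memory(self):
        # Page-locked host memory: host-resident, fast async H2D/D2H copies.
        return FakeTensor(self.data, place="pinned")

    def cpu(self):
        # Pageable host memory: host-resident, but copies may be slower.
        return FakeTensor(self.data, place="cpu")


def full(shape, fill, place="gpu"):
    """Mimics paddle.full: allocates on the default (accelerator) device."""
    return FakeTensor([fill] * shape[0], place=place)


# The review point: dropping .pin_memory() leaves the tensor on the
# accelerator, but a kernel that reads it on the host needs it to be at
# least .cpu() if pinned memory is not used.
t_default = full([1], 0)               # lives on the accelerator
t_pinned = full([1], 0).pin_memory()   # page-locked host memory
t_cpu = full([1], 0).cpu()             # pageable host memory
```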
Force-pushed from cce50d4 to 8395c3a
-        self.seq_lens_this_time_buffer[:num_running_requests].copy_(
-            self.share_inputs["seq_lens_this_time"][:num_running_requests], False
-        )
+        self.seq_lens_this_time_buffer.copy_(self.share_inputs["seq_lens_this_time"], False)
What is the reason for this change?
- This change mainly goes with the modification at line 300, so that the complete data is updated:
  self.share_inputs["seq_lens_this_time"] = self.seq_lens_this_time_buffer
- Why `real_bsz` is not adopted on GCU for now: the AttentionBackend and the pre-/post-processing ops (`update_inputs_gcu`, `set_value_by_flags_and_idx_gcu`, etc.) perform some operations based on the shape of `seq_lens_this_time`, so they would need a unified rework.
- Possible impacts of the `real_bsz` change on GCU:
  - For the scheduling system: does it need to guarantee that the `num_running_requests` requests scheduled in this round are placed at the front of the whole task list?
  - For custom ops and other places that use `seq_lens_this_time`: the constraints have changed and need to be audited and fixed?
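The two copy strategies being compared can be sketched as follows. This is a hedged illustration only: plain Python lists stand in for paddle tensors, and the function names are invented for clarity, though `num_running_requests` and the buffer names mirror the PR:

```python
def copy_partial(buffer, src, num_running_requests):
    """Slice-copy path: only refresh the slots of the requests scheduled
    this round (the real_bsz-style behaviour the PR moved away from)."""
    buffer[:num_running_requests] = src[:num_running_requests]
    return buffer


def copy_full(buffer, src):
    """Full-copy path chosen on GCU: refresh the whole buffer so ops that
    depend on the full shape of seq_lens_this_time see consistent data."""
    buffer[:] = src
    return buffer
```

With the partial copy, slots past `num_running_requests` keep stale values, which is only safe if every consumer respects the same front-packed layout; the full copy avoids that constraint at the cost of copying unused slots.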
Force-pushed from 81eef32 to f4d1206
Force-pushed from f4d1206 to c10d2b5
Enable GCU CI