Skip to content

【Inference Optimize】Optimize the repeated calculation part in the DSK attention layer and optimize the performance under mixed batch. #2061

【Inference Optimize】Optimize the repeated calculation part in the DSK attention layer and optimize the performance under mixed batch.

【Inference Optimize】Optimize the repeated calculation part in the DSK attention layer and optimize the performance under mixed batch. #2061

build

succeeded Aug 11, 2025 in 3s