Extract eh_proj Layer from ParallelLMHead for MTP to Avoid Weight Transposition Issue #103

Triggered via pull request July 4, 2025 04:45
Status Success
Total duration 21m 7s

ci.yml

on: pull_request