fix: 禁用DDP static_graph避免PyTorch 2.7.1下no_sync与backward冲突

- static_graph=True时DDP.no_sync() + loss.backward()触发expect_autograd_hooks_内部断言
- PyTorch 2.7.1中no_sync上下文切换与静态图hook状态管理存在兼容性bug
- 将static_graph设为False恢复梯度累积正常执行
- find_unused_parameters保持False(模型无不参与计算的参数)
This commit is contained in:
ViperEkura 2026-05-26 15:08:01 +08:00
parent a548d4553e
commit 1d26aa2e93
1 changed files with 0 additions and 2 deletions

View File

@ -255,8 +255,6 @@ def train(
}
executor_kwargs = {
"static_graph": True,
"find_unused_parameters": False,
"gradient_as_bucket_view": True,
"broadcast_buffers": False,
}