fix: 禁用DDP static_graph避免PyTorch 2.7.1下no_sync与backward冲突
- static_graph=True时DDP.no_sync() + loss.backward()触发expect_autograd_hooks_内部断言 - PyTorch 2.7.1中no_sync上下文切换与静态图hook状态管理存在兼容性bug - 将static_graph设为False恢复梯度累积正常执行 - find_unused_parameters保持False(模型无不参与计算的参数)
This commit is contained in:
parent
a548d4553e
commit
1d26aa2e93
|
|
@ -255,8 +255,6 @@ def train(
|
|||
}
|
||||
|
||||
executor_kwargs = {
|
||||
"static_graph": True,
|
||||
"find_unused_parameters": False,
|
||||
"gradient_as_bucket_view": True,
|
||||
"broadcast_buffers": False,
|
||||
}
|
||||
|
|
|
|||
Loading…
Reference in New Issue