AstrAI

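ViperEkura f521a30b22 fix: FSDP optimizer ordering, temperature division by zero, silent scheduler death, ref model device - executor: hardcode use_orig_params to True so FSDP does not replace Parameter objects - strategy: move DPO/GRPO ref models to device after creation - sample: TemperatureStrategy clamps at 1e-8; engine validation changed to >0 - scheduler: do not re-raise exceptions, to avoid the daemon dying silently; stop() sends callbacks to waiting tasks		2026-05-29 21:57:44 +08:00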
__init__.py	feat: add Muon optimizer	2026-05-17 16:44:03 +08:00
metric_util.py	feat: add validation loop during training	2026-05-17 16:12:42 +08:00
optim.py	perf: Muon step now batches with torch._foreach_* and drops redundant bf16 conversions in the NS iteration	2026-05-23 19:50:12 +08:00
schedule.py	refactor: remove -> None return annotations, split FSDP parameters, add mmap dataset storage	2026-05-28 13:57:06 +08:00
strategy.py	fix: FSDP optimizer ordering, temperature division by zero, silent scheduler death, ref model device	2026-05-29 21:57:44 +08:00
train_callback.py	fix: state_dict gathering in parallel training and training/inference concurrency defects	2026-05-29 21:12:52 +08:00
train_context.py	fix: state_dict gathering in parallel training and training/inference concurrency defects	2026-05-29 21:12:52 +08:00
trainer.py	refactor: rework checkpoint loading, pass paths instead of objects	2026-05-27 20:15:29 +08:00