AstrAI/tests
ViperEkura 2c2697390d feat: 新增 GradientCheckpointingCallback
- TrainConfig.gradient_checkpointing_modules 指定模块类型
- apply 递归遍历,兼容 DDP,不硬编码模型结构
- modules=None 时静默跳过,零开销
2026-05-17 18:21:05 +08:00
..
data refactor: checkpoint 按 HF 方式存独立 .pt 文件,callback 接管恢复 2026-05-16 18:29:04 +08:00
inference fix: 修复推理引擎 batch decode 中多项正确性与并发问题 2026-05-14 21:31:39 +08:00
module refactor: Transformer更名为AutoRegressiveLM并新增EmbeddingEncoder 2026-05-17 15:29:20 +08:00
parallel feat: 增加server, 并且修改测试单元 2026-04-02 15:05:07 +08:00
trainer feat: 新增 GradientCheckpointingCallback 2026-05-17 18:21:05 +08:00
conftest.py refactor: Transformer更名为AutoRegressiveLM并新增EmbeddingEncoder 2026-05-17 15:29:20 +08:00