AstrAI/astrai/dataset
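ViperEkura a548d4553e fix: restore optimizer/scheduler state and the sampler's remaining length when resuming training from a checkpoint
- Use Checkpoint.load() instead of manually loading model.safetensors, so that optimizer/scheduler state is restored
- TrainContextBuilder restores the optimizer and scheduler state_dict from checkpoint.extra
- ResumableDistributedSampler.__len__ returns the number of remaining samples rather than the total
- Clear state_dict before training so that mp.spawn does not pickle a 7 GB object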
2026-05-26 13:50:25 +08:00
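
The sampler change in the commit message above can be illustrated with a minimal, torch-free sketch. This is not the repository's ResumableDistributedSampler: the class name `ResumableSamplerSketch`, its constructor parameters, and the `start_index` field are all hypothetical, and shuffling is left out for brevity. It only shows the idea that `__len__` should agree with what `__iter__` will still yield after a resume.

```python
import math


class ResumableSamplerSketch:
    """Hypothetical stand-in for a resumable distributed sampler (no torch)."""

    def __init__(self, dataset_len, num_replicas, rank, start_index=0):
        self.dataset_len = dataset_len
        self.num_replicas = num_replicas
        self.rank = rank
        # Per-rank sample count for one full epoch, padded so all ranks match.
        self.num_samples = math.ceil(dataset_len / num_replicas)
        self.total_size = self.num_samples * num_replicas
        # Assumption: number of per-rank samples consumed before the checkpoint.
        self.start_index = start_index

    def __iter__(self):
        # Wrap around the dataset so the index list divides evenly across ranks.
        indices = [i % self.dataset_len for i in range(self.total_size)]
        # Each rank takes every num_replicas-th index, starting at its rank.
        per_rank = indices[self.rank : self.total_size : self.num_replicas]
        # Skip what was already consumed before the checkpoint.
        return iter(per_rank[self.start_index :])

    def __len__(self):
        # Remaining samples for this rank, not the full-epoch count.
        return max(self.num_samples - self.start_index, 0)


sampler = ResumableSamplerSketch(dataset_len=10, num_replicas=2, rank=0, start_index=3)
remaining = list(sampler)
```

If `__len__` returned the full-epoch count instead, anything that derives steps per epoch from the sampler length (a progress bar, for example) would likely overcount after a resume.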
..
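__init__.py refactor: switch Storage to the factory pattern; hook server reload into uvicorn 2026-05-16 17:00:26 +08:00
dataset.py feat: validate required fields when loading a dataset 2026-05-17 11:50:38 +08:00
sampler.py fix: restore optimizer/scheduler state and the sampler's remaining length when resuming training from a checkpoint 2026-05-26 13:50:25 +08:00
storage.py refactor: switch Storage to the factory pattern; hook server reload into uvicorn 2026-05-16 17:00:26 +08:00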