AstrAI/astrai
ViperEkura 10ebd7211f feat: add Muon optimizer
- 2D parameters are updated with Newton-Schulz orthogonalization + Nesterov momentum
- 1D parameters are updated with AdamW
- Supports lr/momentum/weight_decay/ns_steps configuration
2026-05-17 16:44:03 +08:00
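
The update rule summarized in the commit message above could be sketched roughly as follows. This is a minimal pure-Python illustration, not the code in the trainer directory. The function names (`newton_schulz`, `muon_step`, `adamw_step`), the default hyperparameter values, and the use of decoupled weight decay are all assumptions. The commit does not say which Newton-Schulz variant or coefficients it uses, so the classic cubic iteration X <- 1.5 X - 0.5 X X^T X is substituted here.

```python
import math

def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def newton_schulz(g, ns_steps=5, eps=1e-7):
    """Approximately orthogonalize a 2D matrix (assumed cubic iteration)."""
    # Frobenius normalization keeps every singular value at or below 1,
    # which is inside the convergence region of the iteration.
    norm = math.sqrt(sum(v * v for row in g for v in row)) + eps
    x = [[v / norm for v in row] for row in g]
    tall = len(x) > len(x[0])
    if tall:  # work on the wide orientation so x @ x^T is the smaller Gram matrix
        x = transpose(x)
    for _ in range(ns_steps):
        xxt_x = matmul(matmul(x, transpose(x)), x)
        x = [[1.5 * xv - 0.5 * yv for xv, yv in zip(xr, yr)]
             for xr, yr in zip(x, xxt_x)]
    return transpose(x) if tall else x

def muon_step(param, grad, buf, lr=0.02, momentum=0.95, weight_decay=0.0, ns_steps=5):
    """One update for a 2D parameter; returns (new_param, new_buf)."""
    # Momentum buffer accumulates gradients.
    buf = [[momentum * b + g for b, g in zip(br, gr)] for br, gr in zip(buf, grad)]
    # Nesterov look-ahead: gradient plus momentum times the updated buffer.
    update = [[g + momentum * b for g, b in zip(gr, br)] for gr, br in zip(grad, buf)]
    ortho = newton_schulz(update, ns_steps)
    decay = 1.0 - lr * weight_decay  # decoupled weight decay (assumption)
    new_param = [[decay * p - lr * o for p, o in zip(pr, orow)]
                 for pr, orow in zip(param, ortho)]
    return new_param, buf

def adamw_step(param, grad, m, v, t, lr=1e-3, betas=(0.9, 0.999), eps=1e-8,
               weight_decay=0.0):
    """One AdamW update for a 1D parameter at step t (1-based)."""
    b1, b2 = betas
    m = [b1 * mi + (1 - b1) * g for mi, g in zip(m, grad)]
    v = [b2 * vi + (1 - b2) * g * g for vi, g in zip(v, grad)]
    bc1, bc2 = 1 - b1 ** t, 1 - b2 ** t  # bias corrections
    decay = 1.0 - lr * weight_decay
    new_param = [decay * p - lr * (mi / bc1) / (math.sqrt(vi / bc2) + eps)
                 for p, mi, vi in zip(param, m, v)]
    return new_param, m, v
```

In a sketch like this, a caller would route each parameter by its number of dimensions: matrices go through `muon_step`, while vectors such as biases and norm scales go through `adamw_step`. Five iterations give only an approximate orthogonalization, so raising `ns_steps` trades compute for accuracy.
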
..
config feat: add validation loop to training 2026-05-17 16:12:42 +08:00
dataset feat: validate required fields when loading datasets 2026-05-17 11:50:38 +08:00
inference fix: remove redundant request parameter and improve tokenizer robustness 2026-05-17 12:52:18 +08:00
model refactor: rename Transformer to AutoRegressiveLM and add EmbeddingEncoder 2026-05-17 15:29:20 +08:00
parallel refactor: improve parallel training configuration and launch management 2026-05-17 12:33:10 +08:00
tokenize fix: remove redundant request parameter and improve tokenizer robustness 2026-05-17 12:52:18 +08:00
trainer feat: add Muon optimizer 2026-05-17 16:44:03 +08:00
__init__.py refactor: rename Transformer to AutoRegressiveLM and add EmbeddingEncoder 2026-05-17 15:29:20 +08:00
factory.py refactor: filter factory kwargs and clean up component parameters 2026-05-16 16:47:41 +08:00
serialization.py refactor: unify Config serialization under the BaseConfig base class 2026-05-16 22:06:39 +08:00