ViperEkura
0 Followers
·
0 Following
Joined on
2026-04-02
Repositories
8
Projects
Packages
Public Activity
Starred Repositories
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-30 21:41:23 +08:00
1c2ff05a6d
docs : fix inconsistencies between docs and code found in three rounds of in-depth verification
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-30 21:04:53 +08:00
31ae2deeba
refactor : BaseConfig provides from_json/to_json; nested configs are deserialized automatically
69207e2c57
refactor : rework the preprocessing pipeline around declarative JSON configuration
138c5bcc08
feat : add JSONL preprocessing pipeline
a923e0a23a
fix : fix the data source and dependencies of the MMLU evaluation script
f521a30b22
fix : FSDP optimizer ordering, temperature division by zero, scheduler dying silently, ref model device
Compare 7 commits »
ViperEkura
pushed to
main
at
ViperEkura/SKILLS
2026-05-30 15:35:03 +08:00
b50e4cd5d8
feat: bundle CDN assets locally to eliminate network timeout
ViperEkura
pushed to
main
at
ViperEkura/SKILLS
2026-05-30 15:31:42 +08:00
ViperEkura
pushed to
main
at
ViperEkura/SKILLS
2026-05-30 15:02:58 +08:00
b50e4cd5d8
feat: bundle CDN assets locally to eliminate network timeout
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-28 21:02:31 +08:00
b37c3d000c
docs : sync documentation with the actual code
6031020e37
feat : load_json/load_safetensors support broadcast for distributed loading across nodes
c424dfc293
feat : checkpoints support saving config.json
3a28e52e98
fix : start_epoch/start_batch are set by user arguments and no longer overridden by the checkpoint
e371908b54
fix : unwrap DDP/FSDP when saving a checkpoint to avoid the module. prefix
Compare 7 commits »
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-28 14:38:31 +08:00
0a708fff24
docs : update architecture docs and storage comments to match the Store refactor
6e150ea6d0
refactor : rework the Storage layer into Store, remove the intermediate Fetcher layer, support multi-segment data and explicit lengths
cb8dcb97ea
refactor : remove -> None return annotations, split FSDP parameters, add mmap dataset storage
2d5dc93b3d
fix : correct type annotations and unify CLI argument naming
4145d35e3c
refactor: rework checkpoint loading, passing paths instead of objects
Compare 9 commits »
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-26 16:47:42 +08:00
65ab69543b
refactor: unify the serialization layer, eliminating scattered I/O paths
1d26aa2e93
fix: disable DDP static_graph to avoid a conflict between no_sync and backward under PyTorch 2.7.1
a548d4553e
fix: restore optimizer/scheduler state and the sampler's remaining length when resuming training from a checkpoint
dd1b39f435
fix: ProgressBar writes to stdout by default
94d6e713e9
test: add unit test coverage for the inference protocol layer
Compare 6 commits »
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-25 21:22:18 +08:00
737585a32a
feat: add NTK-Aware RoPE scaling support
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-25 21:20:19 +08:00
a304e16ff0
feat: add NTK-Aware RoPE scaling support
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-25 20:15:38 +08:00
a4688021bf
feat: add LoRA fine-tuning module
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-25 20:11:39 +08:00
432145a798
feat: add LoRA fine-tuning module
7df6eb9211
feat: add FSDP parallel backend
Compare 2 commits »
ViperEkura
pushed to
main
at
ViperEkura/video-promo
2026-05-25 19:19:59 +08:00
9de0bad3d4
fix transformer: GQA text overflow, heatmap sizing, auto-regressive pos labels
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-25 16:05:45 +08:00
82a3f2626f
docs: update docs to match the code (Executor/training loop/parameters)
ViperEkura
pushed to
main
at
ViperEkura/llmEval
2026-05-24 22:36:42 +08:00
ac814e5c52
add all project source files
ViperEkura
created branch
main
in
ViperEkura/llmEval
2026-05-24 22:36:19 +08:00
ViperEkura
pushed to
main
at
ViperEkura/llmEval
2026-05-24 22:36:19 +08:00
d8b83a175b
first commit
ViperEkura
created repository
ViperEkura/llmEval
2026-05-24 22:35:50 +08:00
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-24 20:55:28 +08:00
7fa69572c0
fix: write test logs to a temporary directory to avoid leftover files
ViperEkura
pushed to
main
at
ViperEkura/AstrAI
2026-05-24 20:49:07 +08:00
3ab4f237e5
refactor: rework the training backend into the Executor pattern