This website requires JavaScript.
Explore
Help
Register
Sign In
ViperEkura
/
AstrAI
Watch
1
Star
0
Fork
You've already forked AstrAI
0
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
951df8155c
AstrAI
/
astrai
/
inference
History
ViperEkura
951df8155c
perf: gather 向量化
2026-05-10 21:01:03 +08:00
..
__init__.py
fix: 修复 CLI 参数缺失/重复、device_ids 越界、generate 参数名不一致、scheduler 时序、非流式截断等 bug
2026-05-09 14:36:42 +08:00
cache.py
perf: gather 向量化
2026-05-10 21:01:03 +08:00
engine.py
feat: _generate_streaming 支持 batch 模式
2026-05-10 17:42:20 +08:00
sampling.py
refactor: 分页 KV cache 替换固定 slot,删除 PrefixCache 及相关死代码
2026-05-08 20:44:05 +08:00
scheduler.py
fix: max_seq_len 检查改为仅 prompt 超限发 STOP,max_tokens 超出部分 clamp
2026-05-10 20:17:47 +08:00
server.py
feat: GRPO CLI 接入 + on-policy,OpenAI API top_k 参数化,补充训练参数表
2026-05-09 12:22:33 +08:00