ViperEkura/AstrAI
AstrAI/astrai/inference at f81e2b4a73
ViperEkura
f81e2b4a73
feat: OpenAI-compatible chat completion API (streaming + non-streaming + usage)
2026-05-08 21:54:55 +08:00
__init__.py
refactor: unify the sampling path into Strategy + batch tensor, remove apply_sampling_strategies
2026-05-08 19:07:14 +08:00
cache.py
refactor: replace fixed slots with a paged KV cache, remove PrefixCache and related dead code
2026-05-08 20:44:05 +08:00
engine.py
refactor: replace fixed slots with a paged KV cache, remove PrefixCache and related dead code
2026-05-08 20:44:05 +08:00
sampling.py
refactor: replace fixed slots with a paged KV cache, remove PrefixCache and related dead code
2026-05-08 20:44:05 +08:00
scheduler.py
fix: reduce scheduler latency by removing the 5 ms sleep from the decode path and fixing the refill task-loss bug
2026-05-08 21:13:52 +08:00
server.py
feat: OpenAI-compatible chat completion API (streaming + non-streaming + usage)
2026-05-08 21:54:55 +08:00