AstrAI/astrai/inference
ViperEkura 6269bacfc3 refactor: decode 按页分桶批处理,position_ids 改为 per-task 构建 2026-05-14 14:22:11 +08:00
..
__init__.py refactor: TaskManager 剥离页管理,STOP 移至 task.py 2026-05-11 14:04:31 +08:00
cache.py refactor: 位置编码改用 position_ids [B,S],简化 attention mask 构建 2026-05-14 13:26:31 +08:00
engine.py chore: 解耦 Executor/Scheduler/TaskManager,修复 stop 页泄漏,移除 ServerState 全局单例 2026-05-12 13:47:55 +08:00
executor.py refactor: decode 按页分桶批处理,position_ids 改为 per-task 构建 2026-05-14 14:22:11 +08:00
sample.py refactor: TaskManager 剥离页管理,STOP 移至 task.py 2026-05-11 14:04:31 +08:00
scheduler.py refactor: decode 按页分桶批处理,position_ids 改为 per-task 构建 2026-05-14 14:22:11 +08:00
server.py chore: 解耦 Executor/Scheduler/TaskManager,修复 stop 页泄漏,移除 ServerState 全局单例 2026-05-12 13:47:55 +08:00
task.py chore: 解耦 Executor/Scheduler/TaskManager,修复 stop 页泄漏,移除 ServerState 全局单例 2026-05-12 13:47:55 +08:00