AstrAI/tests/inference
ViperEkura c0effc9f5b refactor: 位置编码改用 position_ids [B,S],简化 attention mask 构建
- RotaryEmbedding/CacheView 接受 position_ids 替代 start_pos

- process_attention_mask 用 position_ids >= arange 做逐位置 causal

- 训练/无 KV cache 时 position_ids=None 内部自动处理

- 移除 executor/benchmark 中冗余的 input_mask 构造
2026-05-14 13:26:31 +08:00
..
conftest.py chore: 解耦 Executor/Scheduler/TaskManager,修复 stop 页泄漏,移除 ServerState 全局单例 2026-05-12 13:47:55 +08:00
test_cache.py refactor: 位置编码改用 position_ids [B,S],简化 attention mask 构建 2026-05-14 13:26:31 +08:00
test_engine.py chore: 解耦 Executor/Scheduler/TaskManager,修复 stop 页泄漏,移除 ServerState 全局单例 2026-05-12 13:47:55 +08:00
test_sample.py test: inference 模块补全单元测试,cache/sample/engine/task 2026-05-12 12:17:57 +08:00
test_scheduler.py style: 重命名 test_scheduler_concurrency 为 test_scheduler 2026-05-12 12:24:36 +08:00
test_server.py chore: 解耦 Executor/Scheduler/TaskManager,修复 stop 页泄漏,移除 ServerState 全局单例 2026-05-12 13:47:55 +08:00
test_task.py test: inference 模块补全单元测试,cache/sample/engine/task 2026-05-12 12:17:57 +08:00