7 Commits

Author SHA1 Message Date
ViperEkura 6b0a1dbb5e refactor: redesign batching FSM as queue pipeline with dynamic task states
- Replace 4 vertical system-phase boxes with 3 horizontal lanes
  (PENDING queue / RUNNING batch / FINISHED done) for accurate
  request lifecycle per scheduler.py:197-200
- System phases (Refill, Prefill, Decode, Cleanup) shown as
  transition labels between lanes
- Tokens placed below lanes with dynamic state badge + cumulative
  token count, updated each tick via ReplacementTransform
- Fix prefix_cache collective FadeOut using self.mobjects sweep
- Remove weight=BOLD across all scenes to prevent text drift
- Adjust GQA y-coordinates for subtitle clearance
2026-05-07 17:56:17 +08:00
ViperEkura c05a432e45 chore: use Times New Roman across all scenes, widen Transformer block to 2.4 for 'Transformer Block × 24' label 2026-05-07 14:59:27 +08:00
ViperEkura 57abefa47f fix: shift GQA layout down 0.4 to avoid title-input overlap 2026-05-07 14:51:31 +08:00
ViperEkura 0018868ee3 refactor: transformer — heatmap two-phase scores+mask, auto-regressive full I/O pipeline with Emb, RMS Norm, LM Head, distribution 2026-05-07 14:00:52 +08:00
ViperEkura e7d736a3b0 fix: bottom spacing, remove specs card, full formula first with no ellipsis in steps 2026-05-07 09:28:07 +08:00
ViperEkura eeaf0a5a16 feat: SDPA formula breakdown + attention score heatmap with per-cell causal mask 2026-05-07 00:58:46 +08:00
ViperEkura c03abd31fe add project source files 2026-05-06 21:16:57 +08:00