- Remove FSM 4-state cycle, tick animations, and Refill/Prefill/Decode arrows
- Show PENDING/ACTIVE/FINISHED lanes with Refill->Cleanup flow
- Add FSM state row (Refill->Prefill->Decode->Cleanup)
- Show a single Prefill state label in the ACTIVE lane
- architecture: drop '(1B params)' from title
- Replace 4 vertical system-phase boxes with 3 horizontal lanes
(PENDING queue / RUNNING batch / FINISHED done) to match the
request lifecycle in scheduler.py:197-200
- Show system phases (Refill, Prefill, Decode, Cleanup) as
transition labels between lanes
- Place tokens below lanes with a dynamic state badge and cumulative
token count, updated each tick via ReplacementTransform
- Fix prefix_cache collective FadeOut by sweeping self.mobjects
- Remove weight=BOLD across all scenes to prevent text drift
- Adjust GQA y-coordinates for subtitle clearance