6b0a1dbb5e
refactor: redesign batching FSM as queue pipeline with dynamic task states
c05a432e45
chore: use Times New Roman across all scenes, widen Transformer block to 2.4 for \'Transformer Block × 24\' label
57abefa47f
fix: shift GQA layout down 0.4 to avoid title-input overlap
6b26ec33ab
chore: switch .gitignore to whitelist mechanism (only .py, .md, .gitignore)
0018868ee3
refactor: transformer — heatmap two-phase scores+mask, auto-regressive full I/O pipeline with Emb, RMS Norm, LM Head, distribution
e7d736a3b0
fix: bottom spacing, remove specs card, full formula first with no ellipsis in steps
eeaf0a5a16
feat: SDPA formula breakdown + attention score heatmap with per-cell causal mask
ba100c19d1
refactor: Gantt chart — same 5 requests both sides, wait+run bars, batch boxes connect GPU↔reqs
b2ff243447
fix: call align_to AFTER move_to so bars follow rect positions
f6ccefb34e
fix: align throughput bars by same-x positioning instead of arrange Left-edge
ce2a7c165a
fix: throughput bars — proportional widths (1.0x vs 3.4x), proper GrowFromEdge anim
42c1d5f700
fix: throughput comparison layout — centered rows with proper Left-Right alignment
4ed16a70b4
fix: new requests enter at Prefill stage, not Refill
fc68fc9107
rewrite continuous_batching as state-machine with batch token rotation, in/out flow, bitmask anim, and throughput bars
a7a79eef96
fix: architecture layout and rewrite continuous_batching with dynamic pipeline animation