ideas

Prefill = memory capacity bound; decode = memory bandwidth bound

First seen 2026-05-24 · mentioned 1 times

Appearances