Tweet

Compute should be fluid.

Delegate to the lower level the task to run with your constraints then it happens wherever whenever it runs best, being CPU, GPU, NPU, FPGA, etc locally or remotely. https://x.com/lamchester/status/1853882650186203540

(original)