📄️ Device Coarse Architecture
Before introducing the ND kernel, we must take a look at how our devices work.
📄️ Work Distribution
So far we have simply called parallel_for and let the runtime distribute the work for us. Now that we want more control, we need to understand how that work is distributed.