Liste des Groupes | Revenir à c arch |
On Wed, 18 Sep 2024 16:23:01 +0000, MitchAlsup1 wrote:That is why you use the CPU for human interactions and bandwidth
>On the other hand, and this is where the deprecation of the CPUs come>
in, The engines consuming the data are bandwidth machines {GPUs and
Inference engines} which are quite insensitive to latency (they are not
not latency bound machines like CPUs).
>
When doing GPUs, a memory access taking 400 cycles would hardly degrade
the overall GPU performance--while it would KILL any typical CPU
architecture.
But if it’s supposed to be for “interactive” use, it’s still going to
take
those 400 memory-cycle times to return a response.
Les messages affichés proviennent d'usenet.