Hello everyone,

I'm quite curious about the host-to-GPU bandwidth people are getting, especially with NVIDIA C1060 or Fermi on Intel hosts with two 5520 chipsets. Using bandwidthTest from the CUDA SDK and averaging the results over all cores and GPUs (we have an S1070), I'm getting 3672 MB/s host to device and 3023 MB/s device to host with memory=pageable. With memory=pinned the numbers increase to 5499 MB/s and 5291 MB/s respectively, which look okay to me.
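For anyone who wants to run a comparable sweep, here is a rough sketch of how the per-core, per-GPU runs could be scripted. It is not necessarily how the numbers above were collected. It assumes Linux with taskset available and a bandwidthTest binary in the current directory. The helper names build_command and sweep are made up for illustration, and the core and GPU ranges depend on your host.

```python
import itertools


def build_command(core, gpu, memory, binary="./bandwidthTest"):
    """Return an argv list that pins one bandwidthTest run to one CPU core."""
    if memory not in ("pageable", "pinned"):
        raise ValueError(f"unknown memory mode: {memory}")
    return [
        "taskset", "-c", str(core),  # restrict the process to a single core
        binary,                      # assumed location of the SDK binary
        f"--device={gpu}",           # GPU index as seen by the CUDA runtime
        f"--memory={memory}",        # pageable or pinned host buffers
    ]


def sweep(cores, gpus, memories=("pageable", "pinned")):
    """Return one command per (core, gpu, memory) combination."""
    return [
        build_command(core, gpu, memory)
        for core, gpu, memory in itertools.product(cores, gpus, memories)
    ]
```

Each returned list can be handed to subprocess.run(cmd, check=True), for example over sweep(range(8), range(4)). Those ranges are placeholders, not a description of any particular machine. Pinning by core rather than by NUMA node keeps the sketch simple. On a two-chipset host it may also be worth binding memory to the matching node.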
On a two-chipset host, 1) there is obviously an asymmetry, resulting in low and high numbers depending on affinity, and, worryingly, 2) pinned bandwidth is a bit too low.

memory=pageable
  host to device: 3702/3716
  device to host: 2880/1807
memory=pinned
  host to device: 5751/4709
  device to host: 3264/1873

If you happen to have numbers for ATI GPUs and/or AMD-based hosts, please post them too.

Thanks,
Igor
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf