Greg Lindahl wrote:
> On Tue, Dec 02, 2008 at 10:24:15AM -0500, Prentice Bisbal wrote:
> 
>> #warn: counter VL15Dropped = 476        (threshold 100) lid 1 port 1
>> Error check on lid 1 (aurora HCA-1) port 1:  FAILED
> 
> IB is blissfully fading from my brain, but I think this refers to
> control packets being dropped due to resource limits on the recipient.
> That takes talent if you're using a Mellanox HCA, as pretty much all
> of the VL15 packets are interpreted by the processor in the HCA.
> 
> -- greg
> 
> 

Just my luck. I'm using Cisco HCAs, which are really Mellanox HCAs:

# lspci | grep Infini
0b:00.0 InfiniBand: Mellanox Technologies MT25208 InfiniHost III Ex
(Tavor compatibility mode) (rev 20)

Fortunately, Gilad from Mellanox has offered me some assistance off-list.

-- 
Prentice
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to