> Subject: Re: [PATCH v2 net-next 0/7] dpaa2-eth: add support for Rx traffic
> classes
> 
> On Wed, 20 May 2020 15:10:42 +0000 Ioana Ciornei wrote:
> > DPAA2 has frame queues per each Rx traffic class and the decision from
> > which queue to pull frames from is made by the HW based on the queue
> > priority within a channel (there is one channel per each CPU).
> 
> IOW you're reading the descriptor for the device memory/iomem address and
> the HW will return the next descriptor based on configured priority?

That's the general idea but the decision is not made on a frame by frame bases
but rather on a dequeue operation which can, at a maximum, return
16 frame descriptors at a time.

> Presumably strict priority?

Only the two highest traffic classes are in strict priority, while the other 6 
TCs
form two priority tiers - medium(4 TCs) and low (last two TCs).

> 
> > If this should be modeled in software, then I assume there should be a
> > NAPI instance for each traffic class and the stack should know in
> > which order to call the poll() callbacks so that the priority is respected.
> 
> Right, something like that. But IMHO not needed if HW can serve the right
> descriptor upon poll.

After thinking this through I don't actually believe that multiple NAPI 
instances
would solve this in any circumstance at all:

- If you have hardware prioritization with full scheduling on dequeue then job 
on the
driver side is already done.
- If you only have hardware assist for prioritization (ie hardware gives you 
multiple
rings but doesn't tell you from which one to dequeue) then you can still use a 
single
NAPI instance just fine and pick the highest priority non-empty ring on-the-fly 
basically.

What I am having trouble understanding is how the fully software implementation
of this possible new Rx qdisc should work. Somehow the skb->priority should be 
taken
into account when the skb is passing though the stack (ie a higher priority skb 
should
surpass another previously received skb even if the latter one was received 
first, but
its priority queue is congested).

I don't have a very deep understanding of the stack but I am thinking that the
enqueue_to_backlog()/process_backlog() area could be a candidate place for 
sorting out
bottlenecks. In case we do that I don't see why a qdisc would be necessary at 
all and not
have everybody benefit from prioritization based on skb->priority.

Ioana

Reply via email to