> -----Original Message-----
> From: Horia Geanta
> Sent: Tuesday, March 5, 2019 10:14 PM
> To: Vakul Garg <vakul.g...@nxp.com>; linux-crypto@vger.kernel.org
> Cc: Aymen Sghaier <aymen.sgha...@nxp.com>;
> herb...@gondor.apana.org.au; da...@davemloft.net
> Subject: Re: [PATCH] crypto: caam/jr - optimize job ring enqueue and
> dequeue operations
> 
> On 3/5/2019 9:00 AM, Vakul Garg wrote:
> > Instead of reading job ring's occupancy registers for every req/rsp
> > enqueued/dequeued respectively, we read these registers once and store
> > them in memory. After completing a job enqueue/dequeue, we decrement
> > these values. When these values become zero, we refresh the snapshot
> > of job ring's occupancy registers. This eliminates need of expensive
> > device register read operations for every job enqueued and dequeued
> > and hence makes caam_jr_enqueue() and caam_jr_dequeue() faster.
> >
> How expensive?
> Please share the case you benchmarked and performance improvement you
> noticed.

The performance of kernel ipsec improved by about 6% on ls1028.

> 
> Somewhat related: it seems that after commit a0ca6ca022ac ("crypto: caam
> - one tasklet per job ring") the "outlock" spinlock could be removed, this
> being a good candidate for further improvement.
> 
Yes, I remember I discussed it before. 
There are other inefficiencies as well.
Will submit patches. 

> > Signed-off-by: Vakul Garg <vakul.g...@nxp.com>
> > ---
> >  drivers/crypto/caam/intern.h |  1 +
> >  drivers/crypto/caam/jr.c     | 12 ++++++++++--
> >  2 files changed, 11 insertions(+), 2 deletions(-)
> >
> > diff --git a/drivers/crypto/caam/intern.h
> > b/drivers/crypto/caam/intern.h index 5869ad58d497..b6d96e2ecf4c
> 100644
> > --- a/drivers/crypto/caam/intern.h
> > +++ b/drivers/crypto/caam/intern.h
> > @@ -59,6 +59,7 @@ struct caam_drv_private_jr {
> >     int out_ring_read_index;        /* Output index "tail" */
> >     int tail;                       /* entinfo (s/w ring) tail index */
> >     struct jr_outentry *outring;    /* Base of output ring, DMA-safe */
> > +   u32 inpring_avail;              /* Number of free entries in i/p
> ring*/
> Locality: this should be near the other enqueue-related structure members.
> 
> Nitpick: use "input" instead of "i/p".
> 

Sending v2. 

> Thanks,
> Horia

Reply via email to