On 2019/06/26 08:59, Manish Chopra wrote: > > -----Original Message----- > > From: Benjamin Poirier <bpoir...@suse.com> > > Sent: Monday, June 17, 2019 1:19 PM > > To: Manish Chopra <mani...@marvell.com>; GR-Linux-NIC-Dev <GR-Linux- > > nic-...@marvell.com>; netdev@vger.kernel.org > > Subject: [PATCH net-next 01/16] qlge: Remove irq_cnt > > > > qlge uses an irq enable/disable refcounting scheme that is: > > * poorly implemented > > Uses a spin_lock to protect accesses to the irq_cnt atomic variable > > * buggy > > Breaks when there is not a 1:1 sequence of irq - napi_poll, such as > > when using SO_BUSY_POLL. > > * unnecessary > > The purpose or irq_cnt is to reduce irq control writes when > > multiple work items result from one irq: the irq is re-enabled > > after all work is done. > > Analysis of the irq handler shows that there is only one case where > > there might be two workers scheduled at once, and those have > > separate irq masking bits. > > I believe you are talking about here for asic error recovery worker and MPI > worker. > Which separate IRQ masking bits are you referring here ?
INTR_EN with intr_dis_mask for completion interrupts INTR_MASK bit INTR_MASK_PI for mpi interrupts > > static int ql_validate_flash(struct ql_adapter *qdev, u32 size, const char > > *str) > > @@ -2500,21 +2451,22 @@ static irqreturn_t qlge_isr(int irq, void *dev_id) > > u32 var; > > int work_done = 0; > > > > - spin_lock(&qdev->hw_lock); > > - if (atomic_read(&qdev->intr_context[0].irq_cnt)) { > > - netif_printk(qdev, intr, KERN_DEBUG, qdev->ndev, > > - "Shared Interrupt, Not ours!\n"); > > - spin_unlock(&qdev->hw_lock); > > - return IRQ_NONE; > > - } > > - spin_unlock(&qdev->hw_lock); > > + /* Experience shows that when using INTx interrupts, the device does > > + * not always auto-mask the interrupt. > > + * When using MSI mode, the interrupt must be explicitly disabled > > + * (even though it is auto-masked), otherwise a later command to > > + * enable it is not effective. > > + */ > > + if (!test_bit(QL_MSIX_ENABLED, &qdev->flags)) > > + ql_disable_completion_interrupt(qdev, 0); > > Current code is disabling completion interrupt in case of MSI-X zeroth vector. > This change will break that behavior. Shouldn't zeroth vector in case of > MSI-X also disable completion interrupt ? > If not, please explain why ? In msix mode there's no need to explicitly disable completion interrupts, they are reliably auto-masked, according to my observations. I tested this on two QLE8142 adapters. Do you have reason to believe this might not always be the case? > > > > > - var = ql_disable_completion_interrupt(qdev, intr_context->intr); > > + var = ql_read32(qdev, STS); > > > > /* > > * Check for fatal error. > > */ > > if (var & STS_FE) { > > + ql_disable_completion_interrupt(qdev, 0); > > Why need to do it again here ? if before this it can disable completion > interrupt for INT-X case and MSI-X zeroth vector case ? I couldn't test this code path, so I preserved the original behavior. > > > ql_queue_asic_error(qdev); > > netdev_err(qdev->ndev, "Got fatal error, STS = %x.\n", var); > > var = ql_read32(qdev, ERR_STS); > > @@ -2534,7 +2486,6 @@ static irqreturn_t qlge_isr(int irq, void *dev_id) > > */ > > netif_err(qdev, intr, qdev->ndev, > > "Got MPI processor interrupt.\n"); > > - ql_disable_completion_interrupt(qdev, intr_context->intr); > > Why disable interrupt is not required here ? The interrupt source _is_ masked: ql_write32(qdev, INTR_MASK, (INTR_MASK_PI << 16)); > While in case of Fatal error case above ql_disable_completion_interrupt() is > being called ? > Also, in case of MSI-X zeroth vector it will not disable completion interrupt > but at the end, it will end of qlge_isr() enabling completion interrupt. > Seems like disabling and enabling might not be in sync in case of MSI-X > zeroth vector. I guess you forgot to consider that completion interrupts are auto-masked in msix mode.