Benjamin LaHaise <[EMAIL PROTECTED]> wrote: > > Hello Andrew et al, > > The patch below adds a fast path that avoids the atomic dec and test > operation and spinlock acquire/release on page free. This is especially > important to the network stack which uses put_page() to free user > buffers. Removing these atomic ops helps improve netperf on the P4 > from ~8126Mbit/s to ~8199Mbit/s (although that number fluctuates quite a > bit with some runs getting 8243Mbit/s). There are probably better > workloads to see an improvement from this on, but removing 3 atomics and > an irq save/restore is good. >
Am a bit surprised at those numbers. > diff --git a/mm/swap.c b/mm/swap.c > index cce3dda..d6934cf 100644 > --- a/mm/swap.c > +++ b/mm/swap.c > @@ -49,7 +49,10 @@ void put_page(struct page *page) > { > if (unlikely(PageCompound(page))) > put_compound_page(page); > - else if (put_page_testzero(page)) > + else if (page_count(page) == 1 && !PageLRU(page)) { > + set_page_count(page, 0); > + free_hot_page(page); > + } else if (put_page_testzero(page)) > __page_cache_release(page); Because userspace has to do peculiar things to get its pages taken off the LRU. What exactly was that application doing? The patch adds slight overhead to the common case while providing improvement to what I suspect is a very uncommon case? - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html