calloc too aggressive

Nathan Sidwell Fri, 17 Nov 2017 11:01:50 -0800

On 11/17/2017 01:37 PM, Jeff Law wrote:

ISTM the better way to drive this is to query the branch probabilities.
It'd probably be simpler too.  Is there some reason that's not a good
solution?


(a) I'd have to learn how to do that

(b) in the case where the condition is just a null check,ma.cc.046t.profile_estimate considers the memset reachable 53.47% ofthe time (see defect 83023)

when the condition is 'ptr && some_bool' we think it reachable 33% ofthe time.

It's not clear to me what a sensible threshold might be. I suppose morerealistic probabilities are 99.99% in the first case and 50% in thesecond case?

(c) the performance skew is presumably proportional to the sizeparameter. small size is probably swamped by the allocation effortitself. A large size, the memset cost might start to dominate.Profiling shows that it is the kernel burning this in flushing the tlbduring a syscall.

My guess is that the useage pattern repeatedly allocates and frees alarge chunk of uninitialized memory. That ends up not being syscally atall. With the change to use calloc, each of those allocations turns outto be a large TLB churn getting read-as-zero anonymous pages. Andpossibly similar churn returning freed pages to the OS.


nathan

--
Nathan Sidwell

Re: [PR tree-optimization/83022] malloc/memset->calloc too aggressive

Reply via email to