David Miller wrote:
From: Eric Dumazet <[EMAIL PROTECTED]>
Date: Tue, 17 Oct 2006 14:58:37 +0200

The goal of reqsk_queue_alloc() is to use a power-of-two size for the
whole listen_sock structure, to avoid wasting memory for large backlogs,
meaning the hash table size nr_table_entries is no longer a power of
two. (Hence one AND with (nr_table_entries - 1) must be replaced by
MODULO nr_table_entries.)

Modulus can be very expensive for some small/slow cpus.  Please round
down to a power-of-2 instead of up if you think the wastage really
matters.

Thanks.

I am not sure I understand your point. Rounding up or down still needs the modulus; only the size changes, by a factor of two. I gather you want me to remove the modulus, but that is unrelated to rounding.
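To make the distinction concrete, here is a minimal sketch of the two ways of turning a hash value into a bucket index. The helper names (is_power_of_two, bucket_by_mask, bucket_by_mod) are hypothetical and are not the kernel's own functions; only nr_table_entries echoes the field discussed in this thread.

```c
#include <assert.h>
#include <stdint.h>

/* Returns 1 if n is a nonzero power of two, 0 otherwise. */
static int is_power_of_two(uint32_t n)
{
	return n != 0 && (n & (n - 1)) == 0;
}

/* Bucket index by masking: only valid when nr_table_entries is a
 * power of two, because the mask must be a run of low-order 1 bits. */
static uint32_t bucket_by_mask(uint32_t hash, uint32_t nr_table_entries)
{
	assert(is_power_of_two(nr_table_entries));
	return hash & (nr_table_entries - 1);
}

/* Bucket index by modulus: valid for any nonzero table size,
 * at the cost of an integer division. */
static uint32_t bucket_by_mod(uint32_t hash, uint32_t nr_table_entries)
{
	assert(nr_table_entries != 0);
	return hash % nr_table_entries;
}
```

For a power-of-two table both helpers give the same index; for a size such as 16380 only the modulus variant stays within the table while still reaching every slot.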

A 66 MHz 486 can perform 1,000,000 divisions per second. Is that a 'slow' CPU?

If we stay with a power of two, say 2^X hash slots taking (2^X)*sizeof(void*) bytes, the extra fields added by struct listen_sock will *need* the same amount of memory again, because kmalloc() rounds allocations up to the next power of two. That basically wastes half of the RAM taken by the struct listen_sock allocation, unless we add yet another pointer to the hash table and do two kmalloc() calls: one for a pure power-of-two hash table, one for struct listen_sock. If we keep the current scheme, the current maximum kmalloc() size of 131072 bytes would limit us to 65536 bytes for the hash table itself, so 8192 slots on 64-bit platforms. I was expecting to use a hash size of 16380 slots instead.
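The arithmetic above can be checked with a small sketch. It assumes an allocator that rounds every request up to the next power of two, 8-byte pointers, and a 32-byte header in front of the table. The 32-byte figure is an assumption, picked because it reproduces the 16380 slots mentioned above; the function names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Round n up to the next power of two (n itself if already one). */
static size_t roundup_pow2(size_t n)
{
	size_t p = 1;

	while (p < n)
		p <<= 1;
	return p;
}

/* Bytes consumed when header and table share a single allocation,
 * assuming the allocator rounds up to a power of two. */
static size_t single_alloc_bytes(size_t header, size_t slots, size_t ptr_size)
{
	return roundup_pow2(header + slots * ptr_size);
}

/* Largest slot count that fits next to the header under max_alloc. */
static size_t max_slots(size_t max_alloc, size_t header, size_t ptr_size)
{
	assert(header <= max_alloc && ptr_size != 0);
	return (max_alloc - header) / ptr_size;
}
```

Under these assumptions, single_alloc_bytes(32, 8192, 8) is 131072 although only 65568 bytes are used, while max_slots(131072, 32, 8) is 16380, filling the same 131072-byte block almost completely.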

The modulus is done in two places:

inet_csk_search_req(): called from tcp_v4_err()/dccp_v4_err() only after checks. Such events are rather infrequent.

tcp_v4_hnd_req(): called from tcp_v4_do_rcv() for the TCP_LISTEN state. Such events are rather infrequent, especially on machines driven by small/slow CPUs...

inet_csk_reqsk_queue_hash_add(): called from tcp_v4_conn_request() when a new connection attempt is stored in the hash table.

Under normal conditions, that is two modulus operations per new TCP/DCCP session establishment. In a DoS situation, I doubt the extra cycles will make any difference.

So... which do you prefer:

1) Keep the modulus
2) Allocate two blocks of RAM (power-of-two hash size, but one extra indirection)
3) Waste nearly half of the RAM, because one block is allocated with a power-of-two hash size
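The difference between the one-block and two-block layouts can be sketched in userspace C as follows. All struct and function names are hypothetical stand-ins, calloc() replaces kmalloc(), and the header is reduced to a single field; this is not the actual struct listen_sock.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for the non-table fields of struct listen_sock. */
struct lsock_hdr {
	unsigned int nr_table_entries;
};

/* Options 1 and 3: header and hash table live in one block. */
struct lsock_inline {
	struct lsock_hdr hdr;
	void *table[];		/* flexible array member, no indirection */
};

/* Option 2: table in its own block, reached through an extra pointer. */
struct lsock_split {
	struct lsock_hdr hdr;
	void **table;
};

static struct lsock_inline *lsock_inline_alloc(unsigned int n)
{
	struct lsock_inline *l;

	assert(n != 0);
	l = calloc(1, sizeof(*l) + (size_t)n * sizeof(void *));
	if (l)
		l->hdr.nr_table_entries = n;
	return l;
}

static struct lsock_split *lsock_split_alloc(unsigned int n)
{
	struct lsock_split *l;

	assert(n != 0);
	l = calloc(1, sizeof(*l));
	if (!l)
		return NULL;
	l->table = calloc(n, sizeof(void *));	/* second allocation */
	if (!l->table) {
		free(l);
		return NULL;
	}
	l->hdr.nr_table_entries = n;
	return l;
}

static void lsock_split_free(struct lsock_split *l)
{
	if (l) {
		free(l->table);
		free(l);
	}
}
```

With the inline layout, any table size can share the block with the header (option 1), or a power-of-two table pushes the block into the next size class (option 3). With the split layout, the table block can be an exact power of two, at the price of a second allocation and one more pointer dereference per lookup.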

Thank you

Eric