Re: Output packet processing (was stretch ACKs, etc.)

Mark Butler Sun, 26 Mar 2006 16:16:21 -0800

Andi Kleen wrote:

> On Saturday 25 March 2006 23:32, Mark Butler wrote:
>> A true firewall should never need to do anything but drop packets and reset connections. Changes to the way packets are routed should be done at the routing layer, using the flow information from the transport layer.
> The real world doesn't work this way.

Agreed that there are other uses for "filtering" that are not firewalls in the normal sense of the word, but rather transparent proxies and other odd applications. So the way to do this would be to run through the NF output chain at dst entry assignment time, asking each entry to return negative if it drops everything in the flow, 0 if it is a no-op for the flow, and positive if it needs to be called for every packet in the flow. If any entry returns negative, then an appropriate error would be returned to the transport layer, so that it could immediately cancel the connection / path as appropriate.

If all entries return zero, then we know that the NF chain does not need to be traversed for packets in that flow. If some of them return positive, then one could either operate as usual, or (preferably) construct a list of just the ones that may be applicable and use those.
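To make the three-way return convention concrete, here is a rough userspace sketch in C. Nothing in it is existing kernel or netfilter API: struct flow_key, struct chain_entry, flow_check_fn, classify_flow() and the three sample checks are all hypothetical names made up for illustration. The caller supplies an array with room for one pointer per chain entry.

```c
#include <stddef.h>

/* Hypothetical flow description; a stand-in, not the kernel's flowi. */
struct flow_key {
    unsigned int saddr, daddr;
    unsigned short sport, dport;
    unsigned char proto;
};

/* Returns <0 if the entry drops every packet of the flow, 0 if it is
 * a no-op for the flow, >0 if it must see every packet of the flow. */
typedef int (*flow_check_fn)(const struct flow_key *key);

/* Hypothetical chain entry carrying only its per-flow check. */
struct chain_entry {
    flow_check_fn flow_check;
};

/* Walk the chain once, at flow setup time.
 * Returns <0 as soon as any entry rejects the flow, 0 if the chain can
 * be skipped entirely, otherwise the number of entries written to
 * active[] (which must have room for n pointers). */
static int classify_flow(const struct chain_entry *chain, size_t n,
                         const struct flow_key *key,
                         const struct chain_entry **active)
{
    int count = 0;
    size_t i;

    for (i = 0; i < n; i++) {
        int r = chain[i].flow_check(key);

        if (r < 0)
            return r;                    /* report error to transport */
        if (r > 0)
            active[count++] = &chain[i]; /* keep only relevant entries */
    }
    return count;
}

/* Sample checks, purely illustrative. */
static int check_noop(const struct flow_key *key)
{
    (void)key;
    return 0;
}

static int check_mangle_udp(const struct flow_key *key)
{
    return key->proto == 17 ? 1 : 0;     /* wants every UDP packet */
}

static int check_drop_telnet(const struct flow_key *key)
{
    return (key->proto == 6 && key->dport == 23) ? -1 : 0;
}
```

With a chain made of these three entries, a TCP flow to port 80 would come back as 0 (bypass the chain), a UDP flow would come back with a one-element list, and a telnet flow would be refused up front.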

A positive return value would consist of a bitmask indicating the types of transformations the entry applies. Possibly flags for:


  Examines packet only (no side effects)
  Drops packets
  Generates additional packets

  Changes layer 2 hardware type
  Changes layer 2 interface
  Changes layer 2 address
  Changes layer 2 control information
  Adds    layer 2 encapsulation
  Removes layer 2 encapsulation

  Changes layer 3 protocol
  Changes layer 3 routing
  Changes layer 3 address
  Changes layer 3 control information
  Adds    layer 3 encapsulation
  Removes layer 3 encapsulation

  Changes layer 4 protocol
  Changes layer 4 routing
  Changes layer 4 addressing (ports)
  Changes layer 4 control information
  Adds    layer 4 encapsulation
  Removes layer 4 encapsulation

  Changes higher layer protocol
  Changes higher layer routing
  Changes higher layer addressing
  Changes higher layer control information
  Changes higher layer encoding
  Adds    higher layer encapsulation
  Removes higher layer encapsulation
  Other higher layer changes

The positive return bitmask could be OR-ed together and returned to the transport layer, which could then use cleared bits to know whether it was safe to make certain types of assumptions - e.g. that packets from an IPoIB flow were going to use an IB interface, or that packets from a loopback flow were actually going to use the loopback interface.
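A small C sketch of how the OR-ed mask might be consumed. The flag names and bit positions are invented for illustration and cover only a few items from the list above; which bits actually have to be clear before the interface can be trusted is an assumption here, not something settled.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical flags for a subset of the transformation types. */
enum flow_xform {
    FX_EXAMINES_ONLY = 1u << 0,
    FX_DROPS         = 1u << 1,
    FX_GENERATES     = 1u << 2,
    FX_L2_INTERFACE  = 1u << 3,
    FX_L3_ROUTING    = 1u << 4,
    FX_L3_ADDRESS    = 1u << 5,
    FX_L4_PORTS      = 1u << 6
};

/* OR together the masks returned by the entries that apply to a flow. */
static uint32_t combine_masks(const uint32_t *masks, size_t n)
{
    uint32_t all = 0;
    size_t i;

    for (i = 0; i < n; i++)
        all |= masks[i];
    return all;
}

/* Assumption: the output interface stays fixed as long as no entry
 * changes the layer 2 interface or the layer 3 routing. */
static int interface_is_stable(uint32_t combined)
{
    return (combined & (FX_L2_INTERFACE | FX_L3_ROUTING)) == 0;
}
```

A loopback or IPoIB fast path would then test interface_is_stable() once when the flow is set up, rather than inspecting each packet.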

Of course, if netfilter changes were made, the relevant dst entries would need to be marked obsolete and the process repeated. It would also be helpful if the entry flow check functions returned the amount of headroom that entry requires.
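One cheap way to get the "mark obsolete and repeat" behaviour is a generation counter: every rule change bumps a global number, and a cached per-flow result is trusted only while its stored number still matches. This is a substitute technique chosen for brevity, not necessarily how dst entries would be invalidated in practice. All names below (ruleset_gen, struct cached_verdict, and the helpers) are hypothetical, and the sketch ignores locking.

```c
#include <stdint.h>

/* Bumped on every rule change (hypothetical global, no locking). */
static unsigned int ruleset_gen;

/* Per-flow result cached alongside the route. */
struct cached_verdict {
    unsigned int gen;       /* ruleset_gen when this was computed */
    uint32_t mask;          /* OR-ed transformation bitmask */
    unsigned int headroom;  /* bytes of headroom the entries asked for */
};

/* Called whenever the rule set is modified. */
static void ruleset_changed(void)
{
    ruleset_gen++;
}

/* Record a freshly computed result for the flow. */
static void verdict_store(struct cached_verdict *cv, uint32_t mask,
                          unsigned int headroom)
{
    cv->gen = ruleset_gen;
    cv->mask = mask;
    cv->headroom = headroom;
}

/* Nonzero while the cached result may still be used; once it returns
 * zero the flow check has to be run again. */
static int verdict_is_current(const struct cached_verdict *cv)
{
    return cv->gen == ruleset_gen;
}
```

The output path would call verdict_is_current() before trusting the cached mask or headroom, and rerun the flow check when it returns zero.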

>> The flowi structure already contains all that information for routing purposes. No reason why it could not be used to do early netfilter reduction as well. Right?
> netfilter is unfortunately too powerful for that. It can do many complex
> dynamic decisions per packet that are impossible to cache or predict.

Dynamic decisions are fine as long as there is a way to know in advance what flows they apply to, so unaffected flows can use the fast path.

> In theory you could try to build such a fast path for some simple filtering that implements a subset of full netfilter, but nobody has attempted to do so so far.

I hate to say this, but one other application besides transport output path optimization would be for (horrors!) TOE / RNIC / iSCSI drivers to check a flow for netfilter applicability and revert back to the standard kernel output processing if appropriate, as well as detecting when it is necessary to inject duplicate packets back into the kernel for read-only filters to examine, say when Ethereal is active.


- Mark





