[DNSOP] Re: New Version Notification for draft-tdj-dnsop-associated-prefixes-for-domains-00.txt

Tommy Jensen Sun, 13 Jul 2025 13:44:33 -0700

On 7/8/25 08:32, Ben Schwartz wrote:

On Jul 7, 2025, at 8:13 PM, Tommy Jensen <[email protected]> wrote:
Comments in-line. Meta note: it seems your concerns are all focusedon the way allow/block traffic enforcement consumes this information.Do you have any concerns with the ability to discover associationsfor logging and audit purposes?
For logging and audit purposes the standard practice is to use PTRrecords, which are more precise than CIDRS and have clearer semantics.

Tying into what I've said further down about : this implies JIT queriesat every connection. For enforcement, this introduces unnecessarylatency at connection time wherever this is introduced versus theability to independently refresh the candidate CIDRs that can JIT beconsulted as a local cache. For auditing and logging, it's potentially alot of traffic messages per app/service experience that could insteadhave been 1-2 transactions depending on how many unique IP addresses arecontacted.

Another benefit: CIDRs can specify arbitrary length prefixes, whereasPTR requires multiple entries when prefixes do not align withoctet/nibble boundaries.

On 7/7/25 15:58, Ben Schwartz wrote:
Thanks for the explanation.
I have serious concerns about proposals that would encourageblocking/unblocking IP addresses based on previous DNS activity. Ifyour network's firewall behavior depends on the history of DNSqueries, this creates an extreme form of stateful protocolossification that prevents IP from working correctly. It's like NATbut worse, because the stateful behaviors at the IP layer depend onhistory from "outside" of IP.
The comparison with NAT seems like a weird apples-to-mangoes comparison.
They’re both situations where the apparent behavior of the IP layer isinconsistent, and depends on prior history. This breaks theend-to-end IP model, complicates debugging, etc. I could have said“stateful firewall” instead.
...
Anyway, how is this different from enforcement of IP allow/blockbased on other dynamic, non-IP logic such as which process itoriginated from or what the current time is, both common practicestoday that are "history from 'outside' of IP"?
Neither of those are “history”. They are information contemporaneouswith the access decision.


Ok, for those two examples that's fair.

When I create an IP addressed based firewall rule, I did so based onsome assumption such as a reputation lookup (is this ASN trustworthy?).That is history, unless someone out there is re-consulting ASNreputation JIT. Whether the reasoning for making an allow/block/routedecision is based on the believed-to-be association of the IP addresswith a domain name, or an ASN number, or a company identity prior to amerger, or a service being active versus deprecated, or, or, or... isnot in any way "like NAT but worse" including those that are alsoprevious API calls like the DNS such as ASN reputation or ownership. Thefirewall, the IP layer, they don't care about any of that includingdomain name mappings. They may be operating on false assumptions, whichare the responsibility of the entity plumbing the firewall rules, butthat has always been the case.

It also messes with DNS (which is a lookup protocol,/not/ asignaling protocol). For example, it creates perverse interactionswith DNS stub caching, which one might have to/disable/ in order togenerate the DNS query activity that will cause a block to be lifted.
Implementations may end up doing this, sure, but it isn't inevitable.I know my previous employer's implementation of DNS-basedallowlisting has a long-term approval time period that well exceedsmost TTL values, because in real life, everyone continues usingresolutions for as long as possible. Breaking connectivity justbecause a cache was cleared (for any reason) was deemed unacceptable.In other words: this is up to the implementation of the enforcement.
This seems like a great example of how this is going to fail in nasty,confusing ways. QUIC connections will happily live for days, forexample, but this mechanism means that _sometimes_ those connectionsare going to fail because of an invisible timeout. Those failureswill probably exhibit blackhole behavior, resulting in an outage (ofprobably at least 30 seconds) while the connection attempts to getthrough, eventually gives up, and falls back to an application-levelreconnect (which may also be user-visible).

No matter the mechanism used, any attempt to validate a policy decisionpoint's opinion on the access rights of a given destination are more andmore encouraged to be time bound. I'm not buying "momentary need toreconnect" once per ones of days as an argument against anything incommon networking scenarios. How would a policy decision point decide torevoke permission to access a given network segment if a node in thatsegment is known to be compromised if it's afraid it might break along-running connection? Just because a connection was trusted athandshake time does not mean it remains trusted for its lifetime by anoperator of any segment of the threat model.

Network operators that want to limit network activity to allowed DNSdomains should use a domain-based transport proxy such as HTTPCONNECT, so policies can be imposed/before/ DNS resolution, and eachdata flow is explicitly tied to its domain.
Two things: (1) you are assuming the deployer is a *network* operatorand not a *device* operator. What about when there is no network"edge" to manage?
On-device enforcement seems like it doesn’t need this mechanism. Appscan be identified reliably, and can ship their own network accesspolicy signed by the publisher, etc.

That assumes the apps in question can do this, or will in any timelyfashion. Not all network or device/endpoint operators have that luxuryin their dependencies. A common scenario that complicates IPv6 migrationis the inability to update apps for many years at a time fornon-networking reasons. Also, this focuses on the endpoint operator(which I did push for) but now rules out network operators. Both havename lookups in common, hence this draft's suggestion to give an optionfor operators without control over some subset of the end-to-endarchitecture other than attempts at TLS termination and the net negativethat introduces.

Another point for app manifests (which btw are a good idea I agree with,especially for endpoints managed by the app dev): what about endpointsthe app doesn't manage? When an app developed by Foo Enterprises offersdata backup integration with DropBox, OneDrive, or whatever else, is FooEnterprises supposed to push app updates when they divine that DropBoxor Microsoft changes their associated CIDRs? That would be ideal, butnot in line with real-world expectations, similar to saying all uses ofremote IP addresses must be A/AAAA values for a domain name.

...
It is absolutely true that trust in an endpoint, defined by anyidentifier, has risks. A firewall rule that blocks specific IPaddresses isn't perfect when an allowed IP address will proxy trafficto those same IP addresses. I do not see how this draft introduces anew paradigm in that regard.
AFAICT this draft is only relevant in cases where
1. An application bootstraps via DNS
2. The application then begins to communicate with other IP addresseswithout resolving them from names.3. These address literals haven’t been communicated to the firewall inadvance.4. The firewall does not normally allow client-initiated access tothese IPs.
5. The firewall wants this application to have access to unrecognized IPs.
6. The application’s traffic is not identified in any other way.
If our only examples of this usage pattern are cases (like TURN) wherethe policy is not an effective security measure, then it doesn’t makesense to build standards to support it.

Your step 3 brings me back to this draft. Communicating these IPaddresses to the firewall, wherever it resides, is simplified if theCIDRs for a name (the thing consistent access policy is referencing) canbe looked up rather than regularly scraped manually from a hodge podgeof sources. This is a very real customer story I witnessed repeatedly atmy previous employer. Why can't the firewall be the DNS client in thisdraft's flow, and collaborating with the DNS resolver used by managedendpoints for consistency (not getting different query results betweenendpoints, which happens for lots of reasons)? This draft is aboutdistribution of information

...
As for assigning DNS names to <whatever>... yes, I would very muchlike that, but this requires the same operator to control the managedendpoints *and* all services they connect to. That isn't reflectiveof reality, where everyone has dependencies on many third parties whocan define their endpoints by domain name or IP addresses.
Third-party IP address literal dependencies are rare in clientapplications. When they do exist, there is often no guarantee thatthey will stay within a particular CIDR. For example, consumer VPNoperators often distribute server IPs as IP literals, but theygenerally do not promise that those IPs will fall in any particular range.
...
Even though there are services which operate this way, many do limitand actively communicate their CIDR dependencies.
Could you point to some examples of services that fit this pattern_and_ require IP-literal-based communication with these endpoints fromenterprise clients?

The majority of use cases I directly learned about are under NDA, butone easy example that isn't is WhatsApp. Predictable domain name lookupsfollowed by contact to hard-coded IP addresses. Why? I don't know, andas a network or endpoint operator with no control over the app'sdevelopment, why should I care? The point of the draft is to give amechanism to associate IP addresses with names in a standard way thatavoids per-vendor manual documentation of these mappings *without*having to worry about the long right tail of weird things apps do withnetworking. The draft is *not* attempting to say there aren'talternative approaches, many fo which you've iterated, just that this isa viable alternative in less-than-ideal situations where lack of controlover apps/endpoints/services leads to a lack of options.


—Ben

_______________________________________________
DNSOP mailing list -- [email protected]
To unsubscribe send an email to [email protected]

[DNSOP] Re: New Version Notification for draft-tdj-dnsop-associated-prefixes-for-domains-00.txt

Reply via email to