DNS Push Notifications (rfc-editor.org)
210 points by fauria on June 24, 2020 | 52 comments



> The DNS Long-Lived Queries (LLQ) mechanism [RFC8764] is an existing deployed solution to provide asynchronous change notifications; it was used by Apple's Back to My Mac [RFC6281] service introduced in Mac OS X 10.5 Leopard in 2007. Back to My Mac was designed in an era when the data center operations staff asserted that it was impossible for a server to handle large numbers of TCP connections, even if those connections carried very little traffic and spent most of their time idle. Consequently, LLQ was defined as a UDP-based protocol, effectively replicating much of TCP's connection state management logic in user space and creating its own imitation of existing TCP features like flow control, reliability, and the three-way handshake.

I can't help but hear the skepticism in "asserted"!

Also, fun that Apple want to move a service from UDP to TCP at a time when Google are trying to move another service from TCP to UDP.


It’s still tricky to handle more than, say, a million or so TCP connections per box, but moving the connections to UDP with a TCP-like state machine in user space solves none of those issues.


It depends on how TCP-like you end up being. In TCP, all outgoing data is stored for retransmission until acked; in this application, a server wouldn't need its responses acked or stored (if the client doesn't get the response, it will resend the request), and push messages could be regenerated if not acked, rather than stored and retransmitted as-is (a rough sketch of this follows below).

That reduction in required storage could be significant. User space connection tracking might also use less memory than kernel connection tracking, depending on how many non-essential things each implementation tracks. Colocation of the connection tracking data and the application level data might be beneficial as well.

I didn't read the prior RFCs to see what kind of hoops they jumped through to make the connections long-lived, though. My guess is that NAT session timeouts are longer for TCP than for UDP, and more networks disallow UDP than TCP, so TCP is probably a good idea from that standpoint.
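
Roughly what I mean by regenerating rather than storing, as a hedged sketch (all names are hypothetical, not from the RFC):

    import time

    # TCP buffers every unacked byte for replay; here the server keeps only
    # the latest answer per subscription and rebuilds the push on timeout.
    class Subscription:
        def __init__(self, name):
            self.name = name
            self.latest = None      # current answer for this name
            self.acked = True
            self.last_sent = 0.0

        def on_change(self, rrset):
            self.latest = rrset     # overwrite; no queue of stale messages
            self.acked = False

        def maybe_resend(self, send, timeout=2.0):
            # Regenerate the push from current state instead of replaying a
            # stored packet; intermediate updates the client missed are
            # never resent individually.
            if not self.acked and time.monotonic() - self.last_sent > timeout:
                send(self.name, self.latest)
                self.last_sent = time.monotonic()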


yeah really, aren't you just moving the buffers that track the TCP connection states from the kernel to some program in user space? how does that solve anything?


Memory management techniques in user space are much more diverse; you can reuse RAM more efficiently.


Does anyone know what mechanism is used between Route 53 and Google DNS? When I update a record in Route 53, there seems to be 0 delay in the updated records being present in 8.8.8.8, even if I've recently requested the old value. I've been imagining that they had set up some sort of "cache invalidate" message that AWS could send Google, but I haven't done any investigation.


So I just ran a test, with a secret domain name I haven't told anyone:

    $ (for x in `seq 1 500`; do echo -n "$x  "; host -v MYSECRETDOMAIN. 8.8.8.8 2>&1 |grep SOA; done) > data    
    $ awk '($3 == 899) { print }' data | wc -l
    34
That suggests there are about 34 machines near London. Neat!

I repeated the test with:

    $ (for x in `seq 1 500`; do echo -n "$x  "; host -v MYSECRETDOMAIN. 8.8.4.4 2>&1 |grep SOA; done) > data2
    $ awk '($3 == 899) { print }' data2 | wc -l                                                         
    3
which suggests the two IP addresses share cache infrastructure (at least near London!): most of the backends had already cached the record during the first test, so only a few responses came from a fresh cache.

You can try this sort of thing with other providers to try and map out their internal infrastructure. (1.1.1.1/cloudflare has around 22 machines near me; quad9 has 16; opendns also has 16; verisign has 31; etc).

One thing I tried was making an ad that recorded the cookie ID in the impression tracker, so I could record the connecting IP addresses in our DNS server. I could then target users who have a particular network provider (or use a particular DNS provider), which could be useful if I want a large number of users who (effectively) ignore DNS caches.


8.8.8.8 is a bunch of servers, and you may be getting a different server to service your request than the one that cached your recently requested old value. Make several requests and observe the TTL: you may notice that it jumps around as you get different servers, especially if you space your requests a few seconds apart.


Specifically, 8.8.8.8 is an anycast address.

https://en.wikipedia.org/wiki/Anycast


The named DNS server includes the ability to send notify messages to other master and slave servers (https://www.zytrax.com/books/dns/ch7/xfer.html#notify) when changes are made to records.

Perhaps Route 53 and Google have set up a system to notify each other when a record changes, so they can then request a transfer and have near-zero propagation delay.

Note that this notify system is not the same as that proposed in the RFC. This notify system is configured by the admin and is meant to keep secondary servers updated with changes in the primary server, not for general notification of changes to anyone who is interested.
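
For what it's worth, you can poke at this yourself. Here's a hedged sketch of sending a NOTIFY with dnspython; the zone name and the secondary's address are placeholders:

    import dns.message, dns.opcode, dns.query, dns.rdatatype

    # Build a NOTIFY (RFC 1996): an SOA query with the opcode changed.
    notify = dns.message.make_query("example.com.", dns.rdatatype.SOA)
    notify.set_opcode(dns.opcode.NOTIFY)

    # The secondary acknowledges, then checks the SOA serial and requests
    # an AXFR/IXFR if the zone changed. 192.0.2.53 is a placeholder IP.
    response = dns.query.udp(notify, "192.0.2.53", timeout=5)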



also https://1.1.1.1/purge-cache/ for Cloudflare.


I haven't noticed this with Route 53 (but I also haven't used it for a year). My experience was that even Amazon's resolvers didn't update with my changes immediately. So if I added a record, then immediately did "dig @8.8.8.8 my.new.domain", I would get NXDOMAIN cached for the time on my SOA record. To avoid that wait, I had to be careful to not try resolving it for a couple minutes, so that the NXDOMAIN wouldn't be cached.

Maybe things have changed, which would be nice. Nowadays I'm on DigitalOcean and they don't let you control the TTL on the SOA record, so you have to be even more careful than with Amazon. Very annoying.


You could query your authoritative name server(s) first and only query other resolvers once you see your new record. Shouldn't be too hard to automate that using something like dnspython.
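
A hedged sketch of that approach with dnspython (the zone and record names are placeholders):

    import time
    import dns.resolver

    def wait_for_authoritative(name, zone, rdtype="A", interval=5):
        # Look up the zone's authoritative name servers.
        ns_ips = []
        for ns in dns.resolver.resolve(zone, "NS"):
            for a in dns.resolver.resolve(ns.target, "A"):
                ns_ips.append(a.address)

        auth = dns.resolver.Resolver(configure=False)
        auth.nameservers = ns_ips

        # Poll the authoritative servers until the new record shows up.
        while True:
            try:
                return auth.resolve(name, rdtype)
            except (dns.resolver.NXDOMAIN, dns.resolver.NoAnswer):
                time.sleep(interval)

    wait_for_authoritative("new.example.com.", "example.com.")
    # Only now query public resolvers, so they never cache the NXDOMAIN.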


If the authoritative is using Anycast (which they should) then you run into similar problems that you get with Anycast resolvers, which is to say you might get different results depending on distribution of data. Due to this we currently provide an API for checking distribution of zone changes with our authoritative name servers: (https://developer.dnsimple.com/v2/zones/#checkZoneDistributi...).


I don't know, but it may be something like this.

https://datatracker.ietf.org/doc/html/draft-wkumari-dnsop-ha...


Thanks for the replies on this. It looks like most of this was luck of the draw and not notifications from AWS to Google. I had a change to make today and made 100 requests to 8.8.8.8 beforehand; after I made the change, it still hadn't updated a couple of minutes later. In Denver, it looks like we have 15 different servers answering the 8.8.8.8 requests. I then hit the "clear cache" URL someone mentioned here, and it updated quickly after that. Great discussion, thanks everyone!


How "popular" is the record you're updating? If it isn't commonly requested, it might just not be in the Google DNS resolvers' caches.


HTML version with a TOC sidebar for easier navigation: https://www.rfc-editor.org/rfc/rfc8765.html


I can instantly think of a gazillion ways this could be abused :D


This is what I immediately thought too. I don't think even NTP servers screamed this much potential for abuse by the time the standards were being drafted. However, I imagine this uses TCP and not UDP (I haven't read the whole RFC yet), which mitigates some of the attacks.


Go on....


DNS rebinding attacks are probably one of them.


Browsers will need to update their same-origin policy so that a change in IP address blocks requests, since the same name may now point at a different site.


DNS rebinding attacks also work in non-browser environments, e.g. as SSRF attacks.


This would mean that long-lived single page web apps would need to be hard-refreshed every once in a while when, through no fault of the app developer, all the IP addresses that their domain name resolves to have rotated.


The DNS already supports NOTIFY, which is a push notification for updates to a zone. (This is something set up by the operators of the authoritative servers, typically for mirrors/secondaries, so that they know when to request a zone transfer.) The alternative, polling, requests the SOA RR for a zone and compares serial numbers.

Didn't give it a detailed read, but this looks like a more granular proposal, an example given being printer discovery.
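
For the polling side, a minimal dnspython sketch (placeholder zone name):

    import dns.resolver

    def soa_serial(zone):
        # The SOA serial increments whenever the zone content changes.
        return dns.resolver.resolve(zone, "SOA")[0].serial

    last = soa_serial("example.com.")
    # ...later: if soa_serial("example.com.") != last, fetch the zone again.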


A question for anyone with more knowledge: does this circumvent the need for a TTL on DNS records?


1. SOA and SRV records used for discovery have to specify TTL; Section 6.1: https://www.rfc-editor.org/rfc/rfc8765.html#section-6.1

2. A standard nonzero TTL is otherwise not very meaningful as long as subscription is active. Section 6.3.1 explains the TTL field of a PUSH message clearly: https://www.rfc-editor.org/rfc/rfc8765.html#section-6.3.1

But this mechanism does not replace normal DNS, only supplements it, so no, you probably still need to set TTL.


Unless there is another mechanism for a record to die, it'll still be important to have a TTL.

TTLs could be kept quite long, though, since they'd only be used when push updates are not occurring.


They'd also be used by any clients that don't understand DNS push notifications. That includes a lot of networking hardware, industrial hardware, medical devices, etc.


Then can anyone explain the differences between push notifications and just using a record until the TTL expires? Thanks!


TTL tells servers and clients in the wild how long to hold on to a query result. You'll want to set this very high if you expect a nuclear war soon. There is no push notification for this.

Push notifications occur when the primary is HUP'd or restarted, telling the secondaries to pull fresh zones so that everybody is known to have the same serial. After this, the secondaries poll the primary every 'refresh' seconds to check for a newer copy of the zone.


After reading the spec, it looks like there are a lot of differences... this RFC essentially gives complete control of the resource record set lifetime to the DNS server. This would require major changes on the DNS client side.

Take a look at: https://www.rfc-editor.org/rfc/rfc8765.html#name-push-messag...

That section covers most of the RRset removal notifications.


I would be interested to know of any implementations of these protocol extensions outside Apple.


This getting released during WWDC does not feel like a coincidence. I wonder what Apple will be using it for, it’ll probably be mentioned in a session sometime this week.


Just speculating here, but they still have a Back-to-my-Mac-like feature: The HomePod allows for remote access to things in your home WiFi, e.g. IoT devices.

As IP addresses of home routers change often enough, this might be a use case for DNS push. Access has to work across carrier-grade NAT though, so they might still need more than DNS.


What’s the difference between this and server-sent events in HTTP/2? You don’t need a live persistent connection to issue events?


Well, you can listen on UDP (which DNS uses anyway).


The more I think about this the less it makes sense to me.

My understanding of UDP is that it's meant for high-volume, lossy traffic where late data is pointless or harmful (like video frames: if a frame is late, better to toss it than keep it). But I kind of want my notifications even if they're late, so I think I'm missing something in my understanding. I was thinking that if you can stream using DNS and cut out something like Kafka, that might be a big deal, but on second thought it doesn't make sense, because DNS is more about service discovery than piping load; you want an alias to a server that does the heavy lifting.

Brain messy today.


This proposal requires TCP.


Love that this is now an RFC. The benefits of caching with the benefits of zero caching.


Haven’t read the whole proposal, but some concerns that stand out to me are: 1 - impact on DNS system performance; and 2 - what infrastructure does this proposal rely upon?


I see this is defined in terms of DNS over TLS over TCP.

I don't see any mention of DNS over HTTPS or DNS over QUIC.

Could this work with either of those?


Does this RFC just sherlock DNS Spy?


As a methodology, push is always better than polling.

Change my mind.


You need both for a robust system. Simple example: On system startup, a system should poll for the current config. It should keep polling every few minutes to verify that it hasn't missed any pushes.

But it should also register for push notifications of config changes so it can get them faster.

It should also renew its subscription if it finds that it missed an update during polling.
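
Something like this hedged sketch, where fetch_config, subscribe_changes, and apply_config are hypothetical callbacks:

    import time

    POLL_INTERVAL = 300  # seconds; safety net for missed pushes

    def run(fetch_config, subscribe_changes, apply_config):
        state = {"version": None}

        def on_push(config):
            state["version"] = config["version"]
            apply_config(config)

        # Register for push notifications of config changes.
        subscribe_changes(on_push)

        while True:
            config = fetch_config()        # poll for the current config
            if config["version"] != state["version"]:
                # We missed a push: apply the update and re-subscribe.
                on_push(config)
                subscribe_changes(on_push)
            time.sleep(POLL_INTERVAL)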


I mostly agree with you, but having spent a good part of my career dealing with data integration patterns, I'm all too familiar with the problem of missed messages from the consumer (or failed-to-send messages from the producer) that results in something downstream being in an incoherent state. The simplest fix for that incoherent state is often to also poll periodically, or something similar.


Sounds like it would share solutions with the “two generals problem”.


Push requires more engineering effort to build the server-side of the system. Poll is easier to implement for consumers, and probably also easier to scale for most engineering teams (scaling a central DB is a well understood engineering problem).

I agree that push is better overall, but… tradeoffs


In the real world: agreed on there being tradeoffs. But as a methodology?


Push can cause thundering herds while TTL expiration should occur uniformly over time.



