"Well, sir, if you look in our terms of service, it says quite clearly that if most of your traffic goes to a set of services that we have determined are most likely VPNs because we control most of the end-traffic on the internet, well then we can disconnect you. I know, sir, machine learning is wonderful as I'm sure you've read on Hacker News that you visit regularly. What's that? You swear it isn't VPN traffic? Ok, can you provide any evidence to that effect? You can't? Ok, well, you have been disconnected sir, and if you like, you can pay a $300 fee to get reconnected and rejoin our new VPN plan at $150 per month. If you want to use our VPN plan that will be $30 per month on top of all the other services you will need to select"
I suspect you can. You can certainly analyze the timing and lifetime's to identify VPN's vs HTTPS (at the expense of a few websocket false positives). And I suspect you could even characterize entropy.
Even if this were true, it would be very easy for an ISP to insist that their own certificates be trusted as part of the terms of usage. This would give the ISP data access.