If you're going to allow dotted IPs you should really allow 32-bit IPs too, e.g....

tshaddox · on June 22, 2014

Also IPv6 URLs, like http://[1080:0:0:0:8:800:200C:417A]/index.html

http://www.ietf.org/rfc/rfc2732.txt
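For anyone who wants to see what accepting the bracketed form involves, here is a minimal sketch using only Python's standard library. The helper name `ipv6_host` is made up for illustration; it is not taken from the regex under discussion or from any RFC.

```python
from urllib.parse import urlsplit
import ipaddress


def ipv6_host(url):
    """Return the IPv6 address of a bracketed-literal URL, or None.

    Hypothetical helper: it only reports whether the host part is an
    IPv6 literal and does not validate the rest of the URL.
    """
    try:
        host = urlsplit(url).hostname  # brackets are stripped here
        if host is None:
            return None
        addr = ipaddress.ip_address(host)  # ValueError for non-IP hosts
    except ValueError:
        return None
    return addr if addr.version == 6 else None


print(ipv6_host("http://[1080:0:0:0:8:800:200C:417A]/index.html"))
print(ipv6_host("http://example.com/"))
```

The first call prints the address in its compressed form and the second prints `None`, because an ordinary host name is not an IP literal.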

gamegoblin · on June 22, 2014

Since I just finished implementing a toy HTTP/1.1 server, I must throw in my newfound knowledge that 2732 has been updated with Zone IDs:

http://tools.ietf.org/html/rfc6874

ds · on June 22, 2014

Yep.

I should mention that 99.9% of domains will fall into a standard form (handle.domain or ip.ip.ip.ip).

So you are definitely more likely to let a user enter a bad URL they did not intend, because it happens to validate, than to let an uncommon domain actually be used.

As such, a much simpler regex would likely 'make more people happy' than one that is 100% correct to the tech spec.
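A rough sketch of what such a simpler check might look like, in Python. The pattern and the names `SIMPLE_URL` and `looks_like_url` are assumptions made up for illustration; this is not the regex from the article, and it deliberately accepts only the two common host forms.

```python
import re

# Illustrative pattern only: http(s), then either handle.domain or
# ip.ip.ip.ip, then an optional port and an optional path.
SIMPLE_URL = re.compile(
    r"https?://"
    r"(?:"
    r"(?:[a-z0-9](?:[a-z0-9-]*[a-z0-9])?\.)+[a-z]{2,}"  # handle.domain
    r"|"
    r"(?:\d{1,3}\.){3}\d{1,3}"  # ip.ip.ip.ip (octet range is not checked)
    r")"
    r"(?::\d{1,5})?"  # optional port
    r"(?:/\S*)?",  # optional path, no whitespace
    re.IGNORECASE,
)


def looks_like_url(text):
    """Return True if text matches the simplified pattern in full."""
    return SIMPLE_URL.fullmatch(text) is not None


print(looks_like_url("http://example.com/index.html"))
print(looks_like_url("http://192.168.0.1:8080/"))
print(looks_like_url("http://[1080:0:0:0:8:800:200C:417A]/index.html"))
```

The trade-off is the one described above: bracketed IPv6 hosts, bare integers, and single-label hosts such as `localhost` are all rejected, and an out-of-range dotted address still slips through.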

zAy0LfpBZLC8mAC · on June 23, 2014

None of those is a URI, so a URI validator most certainly should not accept them. Just because browsers tend to understand them as a matter of historical accident does not mean they are valid URIs, just as the tag soup that browsers also tend to understand isn't valid HTML.

mathias · on June 23, 2014

Exactly. The goal was to come up with a good regular expression to validate URLs as user input. There’s no way I’d want to allow alternate IP address notations.

to3m · on June 23, 2014

Yes... I've found the relevant RFC now. How disappointing! I like 32-bit hex IP addresses.
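For reference, the dotted and 32-bit notations mentioned in this thread name the same number, which a few lines of Python can show. The function name `ipv4_forms` and the address `192.168.0.1` are arbitrary choices for illustration, not examples taken from the comments above.

```python
import ipaddress


def ipv4_forms(dotted):
    """Return the dotted, decimal, and hex spellings of one IPv4 address."""
    n = int(ipaddress.IPv4Address(dotted))  # the address as a 32-bit integer
    return {"dotted": dotted, "decimal": str(n), "hex": f"0x{n:08X}"}


print(ipv4_forms("192.168.0.1"))
```

Going the other way, `ipaddress.IPv4Address(3232235521)` gives back `192.168.0.1`.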