Hacker News new | past | comments | ask | show | jobs | submit login

That wikipedia article is actually terrible, and so a great example of the confusion.

> refers to any or all network hosts on the Internet that no-one can reach.[1] According to some estimates, only 0.03% of the web is searchable, hence leaving almost 100.00% of all data being dark Internet

Apparently the wikipedia authors think that "network hosts that no-one can reach" is the same thing as "not searchable". I mean, a network host that no-one can reach is obviously not searchable (if really no-one can reach it, is it a 'network host' at all?), but a lot more is too, and of course it depends on what mechanisms you are using to search. Those estimates that "only 0.3% of the web is searchable" are surely not saying that 99.7% of the internet is "network hosts no-one can reach".

The wikipedia article goes on:

> Failures within the allocation of Internet resources due to the Internet's chaotic tendencies of growth and decay are a leading cause of dark address formation. One form of dark address is military sites on the archaic MILNET. These government networks are sometimes as old as the original ARPANET, and have simply not been incorporated into the Internet's evolving architecture.

I am not sure what they are talking about, and introduce a new term without explaining what it means, "dark address", what? If they really don't have routable IP addresses, are they part of 'the internet' at all, let alone 99.7% of the internet?

Hosts 'not incorporated into the internets evolving architecture' seems to be yet another thing again, although perhaps it's a subset of "network hosts that no-one can reach", but is an entirely different thing from Tor hidden services, and probably not a part of "websites not searchable [by Google?]" because they probably aren't "websites" at all, and arguably aren't on "the internet" at all if they were "not incorporated into the Internet's evolving architecture", whatever that means.

Really, the entire wikipedia article makes almost no sense from a technical perspective.

My non-technical friends hear various things about the 'dark net', and conflating different vague definitions, tell me they heard that the vast majority of the internet (nearly 100% according to wikipedia!) actually consists of pedophilia that you need special technical measures to access, or something like that.

Um.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: