Hacker News new | past | comments | ask | show | jobs | submit login

> I know the government has viewed certain companies as "too big to fail," because of the negative financial impact we would experience if they were to do so.

> I wonder if there needs to be a cultural equivalent, where a repository of digital or physical artifacts is "too big to fail" because of the negative cultural impact their failure would have.

If there is, the Internet Archive isn't it.

Yeah, it's super important in certain weird technology and library subcultures, but few people outside of them would even notice it if disappeared. If they shut down, the story wouldn't even be able to muscle it's way onto the front page of the New York Times: it would loose to whatever the latest Trump drama is and the Nth repetition of the standard mass shooting media package.

> In the absence of that, anyone who sees high cultural value to the preservation of these digital artifacts should, counterintuitively, not treat the Internet Archive as having some special status, but should treat it as a liability. It has become the custodian of too much, and too much is on the line if it fails.

> Rather than trying to constantly shore up the IA so it can't fail financially, we should be looking at ways to preserve those artifacts redundantly, so that even if the IA fails, it's not a calamity.

Yeah, especially since the Internet Archive (as an organization) has proven itself to be irresponsible.

Lots of people want to turn the publishers into the villains, for ideological reasons as well as a bias towards the Internet Archive, but the it's the IA that fucked up here. They imperiled their core mission for some unnecessary grandstanding. They either need to fire whatever lawyers OK'd the "Emergency Library" or the leaders that refused to listen to sane legal advice telling them not to do it.




> certain weird technology and library subcultures, but few people outside of them would even notice it if disappeared

It's pretty much indispensible to anybody who's a researcher.

It's very frequent that you're tracking down citations to webpages that don't exist anymore, and the IA is the only way to find sources.

Not to mention that it's also often the only way to quickly get access to non-bestseller books that are more than a couple of decades old, which is also commonly needed for research purposes. Many of these books are only otherwise available in the country's largest research libraries. (Google has copies too, but nobody can view them.)

It's not weird or a subculture unless you think those labels apply to researchers. And there are a lot of researchers out there, across academics, non-fiction authors, and journalists.


I do wonder if we're rapidly entering a world where finding sources, doing research, etc is going to only be interesting to weird subcultures and everyone else will be satisfied with whatever the first Google result says, or a confident bot's fabrication.


It was always this way. Replace Google with your neighbor who owns an encyclopedia, or the local priest. Caring about research, truth, and sources has always been something that only a minority cares about.


> the local priest

Or any other authority figure. My favorite exchange that I personally witnessed during early COVID was this one:

A, conspiratorially: You should hear what the dentist has to say about the COVID vaccine.

B: Why, does it affect my teeth?


It gets better/worse: much of dentistry is itself not evidence-based [0].

[0] an informative article and the American College of Dentists response:

https://www.theatlantic.com/magazine/archive/2019/05/the-tro...

https://www.acd.org/position-papers/the-college-responds-to-...


> It's not weird or a subculture unless you think those labels apply to researchers. And there are a lot of researchers out there, across academics, non-fiction authors, and journalists.

I do think the label applies to researchers. And except for the case of "tracking down citations to webpages that don't exist anymore," researchers can continue to do all those things using traditional methods (which may be a harder, but still easy enough, especially if it's your job).


In fairness, a lot of content created over the past 25 years or so only exists in ephemeral digital form. Some is captured in sources that libraries subscribe to but a lot of it isn't. (Of course, libraries also subscribe to content that was never on the open web.)

It's also the case that pre-web, a lot of that sort of content was pretty much lost to time. Per a discussion, a little while back, relatively little from the BBS era is preserved because it was mostly a distributed group of hobbyists.


> researchers can continue to do all those things using traditional methods

No they can't, as I stated. Citations that point to URL's that no longer exist require IA. There's no other tool currently.


What is the traditional method for finding a webpage that does not exist anymore?


The Internet Archive.


> but few people outside of them would even notice it if disappeared.

Um... I don't get the impression that you are speaking from any sort of authority

Look up any of their stats and tell me that's librarians and "weird technologists".

Small example: 600,000 new users per month are niche technologists?

https://archive.org/about/stats.php

More ppl in every town prob use IA than their local library. It's important (albeit overly central, but they are working on resolving that via IPFS and other technologies)


Internet Archive is likely the only memory of the digital world. They may or may not have made a mistake during the covid pandemic by extending their book lending program, but in the age of IA where many text and images are going to lack any source of truth, they may be one of the very few ways to document modern history.


I'm not saying it's not valuable, I'm saying not that many people would even notice, let alone care, if it disappeared.

You may personally be one of the people who cares about the IA, or know a lot of people who care, but if you go to a shopping mall six months after the IA shuts down and ask people about it at random, you'd find that level of knowledge and caring is unusual. It's not the kind of too-big-to-fail that the government would take interest in.


Sorry, but this is a reductionist argument. There are any number of things that could disappear without mall visitors knowing about it (NOAA, Earth's magnetic field, etc) but that doesn't mean they are not important for researchers, students, and others all over the world.


You're missing the point. The question isn't "is it important for someone," it's "is it too big to fail" (i.e. does the government think it's sooo important that it must swoop in and save it).

If the Internet Archive shuts down and its archive lost, the economy will keep humming, masses of people won't lose their jobs let alone be inconvenienced, etc. Sure, some paper about Geocities culture cira 1997 won't get written, some researcher won't be able to access some old dead link, and I won't be able to access the download the PDF manual for some old product from a company that went out-of-business. Life will go on with almost no disruption, and no one will lose an election because they failed to act to save it.


> and no one will lose an election because they failed to act to save it.

At the very least in a just world, I think the ability to reference historical data from the web probably would influence some elections.

In the unjust world we actually live in, access to historical data can be used as a tool (or weapon!) by journalists, pacs, candidates, etc. to find strengths and weaknesses, and to influence elections.

"back in 2026, the candidate enacted XYZ, as proven by [1][2][3] (all ia links). Ten years later this ended in <great victory|terrible disaster> therefore you <should|shouldn't> vote for them"

... Also, eg. Wikipedia often uses IA links for references to deal with link rot, which happens a lot more than you'd think. I won't say WP would shut down completely, but it's effectiveness would definitely be degraded.

Same probably for a lot of professions and jobs that require research. (including eg. secretaries, special librarians, political assistants, etc...)

Just because you personally can't imagine the impact on society, doesn't mean it doesn't have an impact on society.


I agree with tablespoon that the demise of IA would not even be a footnote for the commons. So what if an archive of the internet goes away? Most people aren't going to care, and that's the brutal and apathetic reality.


I agree that many people might not directly care if IA goes away. But obviously the destruction of such a large amount of knowledge would not be without consequences (including to courts themselves)


If earth’s magnetic field disappeared then the solar wind would begin to strip away the atmosphere. It’d be the beginning of the extinction of all life on the surface of the planet. It would also interfere with communications and knock out power grids everywhere. People would be bombarded with radiation and begin to get cancer at ever increasing rates. We’d also be able to see the aurorae at lower latitudes.

People would notice!


Just on the defense of NOAA, I think most people would notice if they couldn’t get a weather forecast or couldn’t fly a plane in stormy weather. NOAA is a major collector of weather data, and provides half of the world’s GCM initial conditions … the European models would get worse with a huge data hole.

This is just more libertarian bullshit that entire government departments could disappear without anyone noticing. But if NOAA disappeared it would wreck the economy. What a terrible example.


Eh, if all nuclear research would disappear overnight less than 0.001% of the world population would be impacted. I’m still inclined to say that a repository of that knowledge is too big to fail, simply because the knowledge is so important.

Not that that’s necessarily true for the internet archive, but…


Stop giving a shit about what "many people" care about. "Many people" are dumb, hardly aware of the world that isn't directly in front of them. Their opinion here is worthless, doubly so because of how easily they can be misled.


> If they shut down, the story wouldn't even be able to muscle it's way onto the front page of the New York Times

What do New York Times reporters use to check web history when researching for their stories? Do they just make stuff up?


I think they mostly pull trending stories off Twitter.


I can't disagree about the irresponsibility of the IA when it comes to the Emergency Library stuff. But I for one would be pretty heartbroken to lose access to the absolutely enormous catalog of live shows from a ton of bands that IA hosts completely legally.


> Yeah, especially since the Internet Archive (as an organization) has proven itself to be irresponsible.

I think the archive made a bad decision. I do not think it is generally irresponsible.


The Internet Archive is one of the easiest-to-access source of warez right now, among other things.

Either they have a grand plan for the targets they are painting on themselves, or they are bloody irresponsible idiots; I'm inclined to bet on the latter given no further contexts.


https://archive.org/about/dmca.php they have a legal exemption


Hmmm. When they decided to do something so boneheaded, and so obviously going-to-seriously-piss-off a bunch of famously deep pocketed litigious publishers, they crossed a whole bunch of lines.

Not just a "this was a bad decision" line, but past a few others as well. eg "reckless behaviour", "wilful disregard", and probably more. :(

Hopefully some kind of Hail Mary saves the day. But at the _very_ least the senior people at the IA who reviewed and green-lit this program should be moved to less senior roles.

They clearly needed an adult in charge to have said "No, don't do this".

The whole program fails even the most cursory risk assessment, and should have been obvious that if it went badly (and likely would) they'd risk killing their whole organisation. :(


> Hmmm. When they decided to do something so boneheaded, and so obviously going-to-seriously-piss-off a bunch of famously deep pocketed litigious publishers, they crossed a whole bunch of lines.

But if no-one pushes against the encroaching draconian copyright laws, then they will continue to get worse. The IA has a philosophy that is counter to a lot of entrenched mainstream organisation, and this move was aligned with their philosophy, if not necessarily their best interests.

It was bold, and we're talking about it, and even testing it in court. So it went against them, which is not surprising, but it's better than if they'd just been meek, battened down the hatches, and not do anything at all.

I applaud them.

And if anyone in this thread is using the IA and hasn't donated to them recently, it's a good time to do it.


> But if no-one pushes against the encroaching draconian copyright laws, then they will continue to get worse.

While that's true, having our custodian of archiving the internet bet the farm on stupid shit like this is outstandingly irresponsible. :(

So yeah, applaud places which do it all you want. And I'll be right there with you in a lot of cases.

But this? This was far past just stupid. :(


> Yeah, it's super important in certain weird technology and library subcultures, but few people outside of them would even notice it if disappeared.

A service can be important even if few people use it directly. The service can have downstream effects that are beneficial to a lot of people because the people who do use it are creating and disseminating other content that filters its way down.

An analogy would be the US National Weather Service (NOAA). Few people look directly at an NOAA feed, but it's used by news channels, apps, airlines, scientists, etc. and becomes content and services that most people have benefited from.

A robust archive lowers the cost/time of doing research. It enables fact checking and investigation, particularly of an historical or obscure nature. It services the long tail of less frequently accessed content that many of us will, at some time, want access to. Basically all the reasons a research library is useful.


> If they shut down, the story wouldn't even be able to muscle it's way onto the front page of the New York Times: it would loose to whatever the latest Trump drama is and the Nth repetition of the standard mass shooting media package.

That is not a good metric. If the louvre burned, that would make headlines everywhere, but it would be nowhere near as disasterous as if the internet archive was destroyed.


Just because most people may not have heard of it, it doesn't mean it's not critically important.

I'm sure most people in the US hadn't heard of many of the banks involved in the 2008 financial crisis, but many of them, after that fact, might agree that they were indeed too big to fail.


> Lots of people want to turn the publishers into the villains

I mean, they are the villains here. They’re sueing the IA over something that is less than a footnote in their balance book.

They’re purely doing this for the chilling effect it will have on other people that might be impertinent enough to try and share their books with others.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: