Campaign for a new HTTP Error code: 451 - Blocked for legal reasons

vilya · on Aug 15, 2013

Seems like a good idea, but the wrong way to achieve it. The right way, as I understand it, would be to write it up as an RFC and submit it to the IETF; and to contribute code for it to some of the popular web servers (apache, nginx, etc). The site doesn't make any mention of either of those things.

Edit: oops, I was wrong. There is an RFC and it's linked from http://www.451unavailable.org/what-is-error-451/

orenbarzilai · on Aug 15, 2013

Since the sites blocked on ISP level code contribution to apache/nginx or other web servers is irrelevant.

drdaeman · on Aug 15, 2013

Disclaimer: I work for an ISP and every ISP in Russia is legally required to do the censorship (and mirror all traffic to FSB black boxes, but that's another story). I'm not partucilarly happy with the situation, but can't do anything about that.

Nginx is totally relevant as many ISPs including our use GNU/Linux boxes running Nginx as a highly performant transparent proxy (there are TPROXY patches for Nginx) to dive into HTTP traffic and do URL filtering (obviously, after initial crude IP-based filtering). Costs less than those fancy Cisco solutions, and it's not like we're willing to spend additional money on something that downgrades the service.

Also, there are cases where actual sites are legally forced to remove resources. Well, not really forced, but it's just a sort request too many sites can't really decline. You either comply and remove a single page (blocking for Russian visitors only seems sufficient), or get blocked on ISP level and since many ISPs (including several giant ones) just blacklist a whole IP address, that means your site becomes completely unavailable.

kintamanimatt · on Aug 15, 2013

> I work for an ISP and every ISP in Russia is legally required to do the censorship (and mirror all traffic to FSB black boxes, but that's another story).

I hate the censorship but I like the fact you're not gagged and can talk about the fact traffic is being mirrored.

gngeal · on Aug 15, 2013

OR, he's not allowed to, and he's doing it anyway.

drdaeman · on Aug 15, 2013

Nope, it's codified in the law, in a legalese, but right in the open.

The sad thing is, practically nobody cared about that, for years. The dissatisfaction became visible only when government granted themselves an ability not only sniff on others communications (which is obviously invisible to end user) but also actively censor them.

Osiris · on Aug 15, 2013

Given that Russia appears to be engaged in even broader surveillance and monitoring than the U.S., I find it odd that Snowden would be granted temporary asylum there for speaking out about a similar program in the U.S. It makes the whole situation look more like a political game.

marcosdumay · on Aug 15, 2013

If a russian speaking about Russia surveilance asked for asylum in the US, do you expect the US to deny or grant it?

jlgreco · on Aug 15, 2013

He shouldn't be required to fight that battle as well; he's done more than we could possibly ask of somebody already. Russia is providing him with a certain degree of safety, he shouldn't be obligated to reject that.

hnha · on Aug 15, 2013

could you post some authoritative proof/references for the black boxes?

drdaeman · on Aug 15, 2013

Authorative: http://minsvyaz.ru/common/upload/prikaz_16-01-2008_N6.pdf (sorry, the document's in Russian and I can't find any translation, nor skilled enough to do that myself) - I'm not a lawyer, but in my understanding (as I was explained) this decree contains requirements to networks that ISPs must conform to (otherwise they can't get the license and provide services), and it states (in thick legalese) that all subscriber-generated traffic must be mirrored to operational search activities control ("пункт управления ОРМ"), which is usually (but maybe not universally) a black box sitting in a rack.

Non-authorative reference: http://en.wikipedia.org/wiki/SORM#SORM-2

From what I've heard, SORM-2 hardware is a secured 1U *nix-based server (peer was not sure whenever it was BSD or GNU/Linux variant), running some kind of sniffer (probably pcap-based) software with some FSB's in-house tools. They are supposed to be dormant for the most of time, but nobody except FSB knows what they're actually doing (and they don't have to report when they're doing a lawful intercetion).

ra · on Aug 15, 2013

You can think of this as a blueprint for snoopworthy governments around the world including all US or Russian allies.

Think about that! (and submit a patch to support 451 in your favourite web server / framework).

hnha · on Aug 16, 2013

thank you very much!

mike_esspe · on Aug 15, 2013

Requirement to install them is called SORM-2:

https://en.wikipedia.org/wiki/SORM#SORM-2

hnha · on Aug 16, 2013

cheers!

masklinn · on Aug 15, 2013

DMCA requests could be served as 451 as well, it's not just ISP-based blocking.

diminish · on Aug 15, 2013

Then to make things easier, some people need an http verb called patch_legal, to take down pages for dmca and other legal reasons.

merlincorey · on Aug 15, 2013

I once worked somewhere where some resources could not be displayed to all clients. We chose to (ab)use HTTP 409 Conflict.

> 10.4.10 409 Conflict

> The request could not be completed due to a conflict with the current state of the resource. This code is only allowed in situations where it is expected that the user might be able to resolve the conflict and resubmit the request. The response body SHOULD include enough

> information for the user to recognize the source of the conflict. Ideally, the response entity would include enough information for the user or user agent to fix the problem; however, that might not be possible and is not required.

> Conflicts are most likely to occur in response to a PUT request. For example, if versioning were being used and the entity being PUT included changes to a resource which conflict with those made by an earlier (third-party) request, the server might use the 409 response to indicate that it can't complete the request. In this case, the response entity would likely contain a list of the differences between the two versions in a format defined by the response Content-Type.

edent · on Aug 15, 2013

Discussion on HN from when the IETF Draft was created https://news.ycombinator.com/item?id=4099751

My blog post which helped inspire it http://shkspr.mobi/blog/2012/06/there-is-no-http-code-for-ce...

Simultaneously glad and disgusted that there is a campaign around this.

chrisfarms · on Aug 15, 2013

Is this saying:

"As a web user I want our ISPs/governments to give us a nice error page so we understand what is going on when they DNS block or seize websites"

Or is it saying:

"As a web-master, when have to take down content due to legal proceedings I want a nice HTTP code to return"

They give example of the first (Virgin Media), but that takes down an entire domain, so it's kind of irrelevant if the correct HTTP code is returned, it's not like that is going to be resolved quickly. 503 would be the correct code here.

The second might be useful to spiders (who might want to back-off spidering so often for a while), but then wouldn't you just want to show your users a 404 with a nice reason why the content has gone.

nwh · on Aug 15, 2013

I understood it as the latter. Something that a webmaster can display to show a DMCA or geographical legal issue.

marcosdumay · on Aug 15, 2013

It's good for both cases, isn't it? It's usefull for informing your users what is happenning.

> but then wouldn't you just want to show your users a 404 with a nice reason why the content has gone.

A 451 with a nice reason why the content was gone is better.

yxhuvud · on Aug 15, 2013

I don't get it? There are already drafts for it, which were created close in time to when Bradbury died.

http://tools.ietf.org/html/draft-tbray-http-legally-restrict...

jvdh · on Aug 15, 2013

A draft is not an implementation. And an implementation in browsers is not an implementation in providers or webservers.

This is a very good way to make people aware of legal blocking, and to make sure that it is adopted.

DharmaSoldat · on Aug 15, 2013

Surprised no one has made any reference to Farenheit 451.

Allow me to be the first.

Thankfully websites are not flammable.

unethical_ban · on Aug 15, 2013

The error code for censorship is the reference. I guess not everyone would recognize that, but I immediately noticed.

walt74 · on Aug 16, 2013

I'm also surprised (this Error Code was first mentioned, as far as I know, when Ray Bradbury died)… but I like the subtle Element of not Mentioning it.

mcv · on Aug 15, 2013

For some reason, this article is blocked from my work due to proxy avoidance.

Somehow seems fitting.

corobo · on Aug 15, 2013

Surely this should be within the 5xx range of status codes? I get there's a reference to be had using 451 but this is more of a server error than client.

dlitz · on Aug 15, 2013

No, I'd say it's a 4xx error. Have a look at https://en.wikipedia.org/wiki/List_of_HTTP_status_codes

Usually, 5xx means that the client could retry the request at a later time, and have it succeed. 4xx means the client should expect the request to fail forever unless something is changed.

Some HTTP clients (not browsers, but other things) take advantage of this by showing the user an error dialog on a 4xx error, but just retry at a later time on a 5xx error.

matsemann · on Aug 15, 2013

Is the 451 a reference to something?

jvdh · on Aug 15, 2013

Yes, Fahrenheit 451 by Ray Bradbury.

A dystopian novel about a world where all books are banned and fire squads roam around to burn them.

corobo · on Aug 15, 2013

Going by something someone has put on wikipedia "A reference to the 1953 dystopian novel Fahrenheit 451, where books are outlawed".

On that note it looks like Microsoft have already dibs'd 451 for an Exchange ActiveSync redirect anyway.

dagw · on Aug 15, 2013

Almost certainly Ray Bradbury's book Fahrenheit 451

nailer · on Aug 15, 2013

I was about to write a +1, but then I realised:

- 4xx isn't appropriate as it's not a client error. - 5xx isn't appropriate because it's not a server error either.

corobo · on Aug 15, 2013

"The 4xx class of status code is intended for cases in which the client seems to have erred"

vs

"Response status codes beginning with the digit '5' indicate cases in which the server is aware that it has encountered an error or is otherwise incapable of performing the request."

There's always that quote people chuck around a lot about censorship being an error so the internet routes around it. By that definition the server knows it has errored so it should be a 5xx response.

[Nerdit: Fixed the quote marks]

drsintoma · on Aug 15, 2013

I'd consider it as much of an error as a 404 in this context.

1) Link works: 200

2) Take down notice arrives

Now:

3) Link not found: 404

Then:

3) Link not available: 451

Terretta · on Aug 15, 2013

I'd argue it's a client error. Whoever's hosting the info is happy to host it, while the client needs to go vote out the politician that censored it.

mpyne · on Aug 15, 2013

What if the client is not from a nation that is causing the server to refuse to serve the content? It's very much a server error.

dragonwriter · on Aug 16, 2013

I think it is a 4xx client error, and not necessarily one that needs a new status code. It seems to me that it is a fairly simple case of 403 Forbidden:

  The server understood the request, but is refusing to 
  fulfill it. Authorization will not help and the request 
  SHOULD NOT be repeated. If the request method was not HEAD 
  and the server wishes to make public why the request has 
  not been fulfilled, it SHOULD describe the reason for the 
  refusal in the entity. If the server does not wish to make 
  this information available to the client, the status code 
  404 (Not Found) can be used instead.

nailer · on Aug 16, 2013

From http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html

"10.4 Client Error 4xx

The 4xx class of status code is intended for cases in which the client seems to have erred. "

The client has not erred by requested a document that exists and which the server can technically provide (separately, the server has not erred by refusing to provide a document to the client which the exists and which access control would allow the client to have, because a government is threatening the server operator in some manner).

dragonwriter · on Aug 16, 2013

> The client has not erred by requested a document that exists and which the server can technically provide

The client has erred in requesting a document which the server is legally forbidden to provide to that client. As specified for 403, the server understands the request and refuses to fulfill it.

Admittedly, a hypothetical 6xx Third-Party Interference series of error codes might be useful for these kind of cases (and some instances currently handled by 503.)

nailer · on Aug 16, 2013

>> The client has erred

> The client has not erred... the server understands the request and refuses to fulfill it.

As I see it, the server is not refusing anything. +1 6xx Third-Party Interference.

Let's just agree to disagree.

venus · on Aug 15, 2013

Well, the cool thing about HTTP error codes is that you don't need a campaign or get permission from the W3C, you can just start using them if you want.

nathan_long · on Aug 15, 2013

Yes, but that's true of other standards. Eg, I can start putting <haiku> tags in my HTML if I want.

The issue is whether anyone else will expect this tag or code and do anything meaningful with it.

ToastyMallows · on Aug 15, 2013

I get what you mean but I don't think this is the same thing. HTML has a DTD, something that people conform to when writing, so your <haiku> tag would not follow that guideline, whereas companies like Twitter can implement their own error codes as they see fit. See error code 420, Enhance Your Calm.[0]

[0]: https://en.wikipedia.org/wiki/List_of_HTTP_status_codes

kijeda · on Aug 15, 2013

Well, sure, just like I can create my invented domain name like hacker.news and use my own IP addresse 1.2.3.4 without being allocated them. The Internet interoperates by everyone agreeing to follow the same agreed conventions, but there is no rule that says you have to. The registry for HTTP status codes has no 420 code and could be assigned for a different purpose in the future: http://www.iana.org/assignments/http-status-codes

nathan_long · on Aug 16, 2013

Yes, they can respond with 420. However, my browser does not have a predetermined response for that. Whereas if they respond with 200 or 302 or 404, all browsers know what to do; that's what it means to be a "standard".

dragonwriter · on Aug 16, 2013

> HTML has a DTD

Tangentially to your point, that's only true of the 4.x and earlier versions of HTML which are SGML applications, WHATWG HTML / W3C HTML5 is not SGML-based and does not have a DTD.

gngeal · on Aug 15, 2013

Eg, I can start putting <haiku> tags in my HTML if I want. The issue is whether anyone else will expect this tag or code and do anything meaningful with it.

You mean like this? https://dvcs.w3.org/hg/webcomponents/raw-file/tip/spec/custo...

saraid216 · on Aug 15, 2013

Honestly, people who believe in this strongly enough should just start using them and provide themselves as examples of good use cases. That doesn't mean it shouldn't be campaigned for to get more people to use it.

krajzeg · on Aug 15, 2013

There is a lot of discussion below on whether 451 is the right error code and how to implement it properly, but I'm missing one thing - what's the benefit of doing it as a status code at all?

If you're going to say that it raises censorship awareness - Internet protocols are intended as useful technical standards for programs to communicate, not vehicles for political goals.

What is the technical benefit of failing with a different error code? Is there need for client software to react differently to a 451 and a 403? The status code is not intended for the human user. If we want to raise awareness, than we already have means to do that - a 403 with a descriptive page citing the reasons. Many websites already do that when complying to DMCA takedowns.

eatitraw · on Aug 15, 2013

Some sites in Russia actually return 451:

curl -I http://lurkmore.to/Конопля

saraid216 · on Aug 15, 2013

I have no idea what this is or what 451 means in this context, but I'm upvoting this as a legitimate technical reason not to use 451.

nawitus · on Aug 15, 2013

Technically speaking 403 can be returned if it's blocked for legal reasons, but obviously it would be nice to know why it is forbidden.

txutxu · on Aug 15, 2013

There is some reasoning about this on: http://www.451unavailable.org/why-do-we-need-error-451

Still, imho, 4XX could be a response for a given URL, but when "a website is blocked" (from the text of the previous URL), we should go to 5XX, as in 503 - service unavailable.

eli · on Aug 15, 2013

5xx does not really make sense for a single page removed due to DMCA.

txutxu · on Aug 15, 2013

Yes, I don't know if two codes (one for URL and one for the entire site) could make sense in this case.

orenbarzilai · on Aug 15, 2013

does it really matter if it's 5xx or 4xx?

zackkitzmiller · on Aug 15, 2013

Yes. 400 responses are because of client error. 500 responses are for server error.

jvdh · on Aug 15, 2013

Legal restrictions on content are almost always issues on the client side, otherwise they should not be made available at all.

Consider for example the blocking of sites in the UK, or the blocking of the PirateBay website in the Netherlands. Those are all very limited audiences where the location of the client is causing a legal reason why the content can not be displayed.

gpvos · on Aug 15, 2013

541, then?

jvdh · on Aug 15, 2013

I bet you they are really happy you submitted this smack in the middle of the holiday of their main employee/volunteer :-).

Messages to their volunteer address get a vacation message that they're away until September 1st.

nilved · on Aug 15, 2013

We should absolutely not be standardizing censorship.

Zenst · on Aug 15, 2013

In this form I totally agree. Reason being that by haveing a block at the web server level in essence places the blocking to be done by the hosting site and blocking based upon location and content. This places the onus of censorship upon the host, which they can already code for if they want.

Not sure placeing the onus of censorship into the hands of the host and not the goverment with there IP/DNS blocks or however they impose such blocks (China has a nice firewall for outside China sites and I dare to think of how they block a website inside China though can bet it is just as effective).

That all said the posiblility to volantarly do the blocking in a way that the powers that be will accept and with that allow there country's owners to see parts of yoru site that are legal and not the illegal parts for them. Well that would possibily have uses and opens your site up still instead of a blanket ban.

  Piratebay has legal torrents, yet they are blocked as some form of descrimnation blanket ban.  So it does have it's possibilities, albiet a dangerous path that should not be tread lightly.

alexchamberlain · on Aug 15, 2013

I got Access Denied at work...

mrab · on Aug 15, 2013

Is this really necessary? How about 456 - unavailable because someone spilled coffee on our backend server? Or 467 - unavailable because garden gnomes invaded our offices?

javindo · on Aug 15, 2013

I can see some reasoning behind this, but the reasoning is that the emphasis of the problem is "people are angry at the site because something is blocked so let's show an error code reflecting the real reason." Using 451 would take the emphasis away from the site and onto the legal oppressor.

On the other hand, why not inverse all inaccessible content to legal oppressors? Change the default meaning of 403 for example to "Access denied for permissive or legal reasons".

gargron · on Aug 15, 2013

Change the default meaning of 403? I don't think that's a good idea. When a user needs to be logged in to do something and they aren't, you show them 403. When access is restricted to people outside a network, they see 403. It'll be hard to force a new behaviour onto the existing web, easier to add a new HTTP code.

RyanMcGreal · on Aug 15, 2013

My understanding is that the server should respond with 401 Unauthorized when someone is attempting to access a resource that requires authentication. What is the case for using 403 instead?

gargron · on Aug 15, 2013

OK, 401 makes more sense in that context. But another 403 case would be "the authorized user lacks permission to open resource."

gpvos · on Aug 15, 2013

When they have authenticated (logged on), but they still do not have access to that particular resource (but may have access to others).

p_papageorgiou · on Aug 15, 2013

The whole point of this is wrong... The internet is free, content shouldn't be blocked for legal reasons

lsh · on Aug 15, 2013

The internet is _not_ free, obviously, because content has been blocked for legal reasons in certain regions of the world.

I agree with your sentiment though - the internet _should_ be free.

nextw33k · on Aug 15, 2013

The whole point of this is right, its highlighting the government breaking the Internet.

Content shouldn't be blocked, the Internet should be free and open. However without letting people know what's happening they will not complain.

Of course what the browser should display is:

451: The Right Honourable Claire Perry MP has deemed that you shouldn't be able to look at this site.

p_papageorgiou · on Aug 15, 2013

I have no clue about US politics but this sounds cool!

alan_cx · on Aug 15, 2013

In case you didn't know, MP Clair Perry is a British Member of Parliament, sadly. She is one of those people who know better, than us "plebs".

IMHO, she is ignorant(1) and irrational(1), and there for scared witless of the internet, so she seeks to control it. I don't think she knows how wrong she is, there for I don't see her as evil, as such. However, it does seem our vote whore of a Prime Minister listens to her every word, hence the attempts to block porn and make people opt in to avoid the blocks, since her position ties in to an awful lot of right wing voters, who are equally, if not more, ignorant and irrational. They are the kind of people who genuinely believed Rock and Roll was the work of the devil, and that black people are a different species.

(1)I use these words for their real meaning and not the insult, judgement, or political value. I honestly think many people literally do not understand the technical issues, and act illogically as a result. I believe this is a lot to do with the traditional media, who politicians rely on, issuing scare stories about the internet because their business were and still are threatened by it. My "evil" in this is those who spread the lies, ie, the media. I hate to admit this, but in many ways I see the likes of Clair Perry as well intentioned victims and mules of the media.

gpvos · on Aug 15, 2013

Yet it is, because the courts have a different opinion on this, and they decide what happens. So it's better to make it clear to the end user when it happens, so at least they are not kept in the dark, or are basically lied to, about the reason why they cannot see the document.

r00fus · on Aug 15, 2013

should != is

And the distance between them is measured as response code 451

taopao · on Aug 16, 2013

452 Blocked for reasons which may or may not be legal but I probably shouldn't say either way

dlitz · on Aug 15, 2013

How does this interact with the effort to encourage the use of HTTPS everywhere?

Osiris · on Aug 15, 2013

If the 451 code is returned by the web server because the site got a DMCA request, then HTTPS doesn't matter because the destination web server already decrypted the session to find the request that you're making before returning the code.

Inetgate · on Aug 16, 2013

Unfortunately, that page return 200 OK. Why does not it return 451 status code?

bugsense · on Aug 15, 2013

That would be great when you have to block access to OFAC listed countries

aaronkrolik · on Aug 15, 2013

Tried to open this link at work. Blocked for legal reasons :)

hawleyal · on Aug 15, 2013

That is a dumb error code.

404 already exists.

ceejayoz · on Aug 15, 2013

404 is for "not found". 451 would be for "found, but we can't show it to you".

dragonwriter · on Aug 16, 2013

> 451 would be for "found, but we can't show it to you".

"The server understood the request, but is refusing to fulfill it" is already 403.