For HN: Open access to scientific papers, a summary of the state of play

_delirium · on Oct 30, 2011

What do you think about independent action from the editorial side? The number is still fairly small, but there have been several successful instances of a commercial journal's editorial board resigning en masse and setting up an open-access journal intended to replace it.

A few examples:

http://www.sigir.org/forum/F2001/sigirFall01Letters.html

http://www.math.columbia.edu/~woit/wordpress/?p=442

http://www.math.columbia.edu/~woit/wordpress/?p=581

JMLR, at least, has gone on to successfully eclipse the journal, Machine Learning, that it was intended to replace (about 3x the impact factor).

I do notice that the examples I find are all in computer science and mathematics, and the new journals have basically zero budgets (and like it that way) and don't charge authors any fees. Is this because in CS/math a common expectation is that the author can use LaTeX, produce their own figures, and submit a print-ready PDF, whereas in other fields authors expect significant formatting work to be done by the journal?

michael_nielsen · on Oct 30, 2011

At the moment the impact of these actions seems significantly smaller than policy. But perhaps it's a useful complement, showing policy-makers that many faculty support open access.

As regards your CS/math question, it's a good one, and I don't know enough about the cost structure of the journals in question to answer. I suspect from conversations with journal editors that it's not a whole lot cheaper to produce CS/math journals than other types, but that's just a general impression, not a certainty.

_delirium · on Oct 30, 2011

I suspect from conversations with journal editors that it's not a whole lot cheaper to produce CS/math journals than other types, but that's just a general impression, not a certainty.

There seem to at least be some that are much cheaper, in the sense of having budgets literally approaching $0. From some brief chats with JMLR editors, they're in that camp: they run on donated server space from MIT and volunteer editors. No staff, no office, no recurring expenses, yet they're still one of the top journals in CS. That seems like a significantly different cost structure from something like PLoS ONE, perhaps closer to the arXiv model turned into a journal.

chaosprophet · on Oct 30, 2011

I would suggest that you put this up as a blog post, and submit the link here. It would make for a much better reading experience, especially since your post is quite link-heavy.

michael_nielsen · on Oct 30, 2011

As a blog post:

http://michaelnielsen.org/blog/open-access-a-short-summary/

I've now edited the header of the HN submission to point to this URL.

Thanks for the suggestion --- I should probably have done that initially. My thinking was that I didn't want to put it on my blog, since it's meant specificially for HN. But with a little framing, that issue takes care of itself. And, as you say, the links work a lot better in a blog post.

RK · on Oct 30, 2011

The Scientific American link in the blog post seems broken.

Also, off topic, I never got the email I signed up for about the release of your new book. Did you ever send those?

michael_nielsen · on Oct 30, 2011

Thanks for pointing this out - I'll fix it.

(On the email: yes, I sent them. Don't know what happened to yours, sorry about that. But the book is out, and available at places like Amazon.)

jessriedel · on Oct 30, 2011

What prevents other fields from doing the same thing that the physics community has done?: Nearly all published physics articles since 1992 have been released freely to the public on the ArXiv.

The only thing I can think of is that physicist are more likely than most other academics to be proficient with LaTeX, and therefore to have a presentable (though often not perfect) version of their paper before an editor ever touches it.

_csoz · on Oct 30, 2011

Or, arxiv could just add life sciences section. AFAIK everyone prepares appropriate preprints, so that shouldn't be an issue, it's just that the attitude is missing, and, while many scientists now make their preprints available, there's no coordinated movement or repository.

jessriedel · on Oct 31, 2011

Oh, I think the ArXiv would gladly do this if even a little momentum built. They have added mathematics and finance in the past.

rwl · on Oct 30, 2011

Do you have any sense of the state of open access publishing outside the sciences? I am a graduate student in the humanities, and it sounds like many of these mechanisms aren't designed to extend beyond the sciences. Most humanities research is not done with support from NIH grants, for example---at least, not directly. Humanities faculty with smaller research budgets are less likely to voluntarily pay a fee to make their work open access through Elsevier and Springer. And I don't know of open office humanities journals on the level of e.g. PLoS One (though of course that doesn't mean they don't exist!).

So, it sounds like the only mechanisms you've mentioned that might also apply to the humanities are institutional policies like Princeton's, or the Federal Public Research Access Act. Princeton and other top tier research universities are in a unique position of leverage over journals, because few journals can afford not to accept research from faculty at these institutions, but most researchers are not working at institutions with that kind of leverage.

Does the Federal Public Research Access Act have language that extends to humanities research? A lot of humanities research, I think, is indirectly funded through public funds, since universities take a cut of incoming grants for the sciences, and then redistribute that money to humanities departments. The right language in this bill could therefore extend open access requirements to humanities research by requiring that whatever research the money ends up supporting be published open access. Do you have any idea whether language like this is in the bill?

(I also worry that, if it isn't, even science researchers could start using the "university cut loophole": the university takes the entire incoming grant and redistributes it to science researchers without open access requirements attached. Is there any danger of this?)

rwl · on Oct 30, 2011

For those interested, there is a ton of great information on Peter Suber's page about open access in the humanities:

http://www.earlham.edu/~peters/writing/apa.htm

But I don't see any suggestions there for extending open access requirements to the humanities by bootstrapping off open access requirements for publicly-funded science research via grant money that ends up in humanities departments.

michael_nielsen · on Oct 30, 2011

I don't have a lot to add. It's not an issue I've spent a lot of time on (not because it's not important, just because of limits on time).

jcr · on Oct 31, 2011

> In 2009, Elsevier made a profit of 1.1 billion dollars on revenue of 3.2 billion dollars.

...

> the American Chemical Society made a profit of 40 million dollars on > revnues [SIC] of 340 million dollars.

Please pardon my ignorance on the topic, but there's something at work here that I simply don't grasp; Why does it cost so much to produce a journal? (i.e. revenue minus profit?)

I'm a bit nervous to ask the above question here since it would potentially lead to yet another useless political debate on whether or not "top execs" are worth what they're paid. If we could skip that part of the discussion, it would be appreciated. I'm mostly interested in what the real costs are, not whether or not they are justified.

thwest · on Oct 30, 2011

Have you seen much pressure for data driven science to reproduce code+data alongside the pdf?

michael_nielsen · on Oct 30, 2011

There's more and more pressure for that kind of thing, but it's still early days in most fields. For an example of a forward-thinking policy, consider the Wellcome Trust: http://www.wellcome.ac.uk/About-us/Policy/Policy-and-positio...

john_horton · on Oct 30, 2011

I think working on the funding agencies (i.e., getting them to adopt sensible policies) is the right strategy. For the individual researcher, the incentives for code & data sharing are pretty limited right now, at least until the culture changes.

On the technical side of code/data sharing--I think one obstacle (in least in the fields I'm familiar with) is that may researchers put together papers in a way that makes reproducibility needlessly hard. If you do all your stats in something like Stata or SPSS, then paste tables/figures into an MS Word document (which is passed around among colleagues), finding your own errors is hard enough---never mind some third party trying to re-produce your results. If instead, you use tools like Sweave & script the data analysis & paper assembly process (ideally with version control), reproduction/sharing becomes much simpler.

timsally · on Oct 30, 2011

I like Matt Might's take on that particular issue: http://matt.might.net/articles/crapl/

An open source license for academics has additional needs: (1) it should require that source and modifications used to validate scientific claims be released with those claims; and (2) more importantly, it should absolve authors of shame, embarrassment and ridicule for ugly code.

jessriedel · on Oct 30, 2011

I think the difference is that, from the researcher's standpoint, there is no downside other than cost for having your article pdf's open access. (In fact, there is a slight boost; if your work is more easily accessible, you're more likely to get cited.) On the other hand, releasing your data and code requires quite a bit of effort in curating it to keep from exposing yourself to additional criticism (warranted or not). On the net for society, it's probably extremely beneficial to have researchers do this, but right now they won't because they don't have any incentive.

curtrice · on Oct 30, 2011

Thanks for this good resource -- and the link to your book, which looks very promising. Two comments, or rather references to ongoing work. First, we have a conference focused on OA every fall here at the University of Tromsø (in English), which has really become a big event. Maybe some of your readers will join us here in a few weeks! http://www.ub.uit.no/baser/ocs/index.php/Munin/MC6 Second, we have a national organization in Norway, Current Research System in Norway (cristin.no) (which i chair the board of), with responsibility for (i) documenting research activity (especially publication), (ii) negotiating national licenses, and (iii) pushing forward on OA work. We're relatively newly into it, but it could be promising. I blog now and then about OA stuff, too: http://curtrice.wordpress.com/category/open-access/

hollerith · on Oct 30, 2011

Thanks for your work on open access, Michael. Now allow me to pick at a nit:

If you put the same text on 2 different web sites, both instances have lower PageRank than a single instance would.

Also, it is less than optimal to put URLs where they will not be turned into clickable links.

Vivtek · on Oct 30, 2011

Is there nothing cool that Peter Suber hasn't gotten into at some point?

_csoz · on Oct 30, 2011

Anyone can suggest web apps and services related to papers, finding preprints, summaries, Q&A etc etc? I 'll start with http://pubcentral.net/

tylerneylon · on Oct 30, 2011

How easy would it be for an open access startup to be profitable?

I'd like to see the cause supported and this sounds like a fun way -- but it's not clear to me how that would work as a business.

Thrymr · on Oct 30, 2011

I think it would be very, very hard. You would need to provide something that existing for-profits don't do (open access), while also competing with non-profits that have the same goal.

The problem is that scientists who care about open access are not likely to move their work to another for-profit enterprise. There are plenty of non-profit entities, from universities to professional societies to PLoS to the government, who have an interest in making research open, and don't need a profit (though they do need operating revenue). It's a tough sell to make your for-profit business the one that a scientist would submit papers to, or review papers for, when they could choose either an established for-profit with higher impact, or a non-profit.

michael_nielsen · on Oct 30, 2011

I'm not sure it's easy for many startups in any space to become profitable.

However, it is possible.

One open access startup that has met this challenge and come to profitability is the Public Library of Science: http://blogs.plos.org/plos/2011/07/2010-plos-progress-update... They are now making a great deal of money, which is presumably why organizations such as Nature Publishing Group are looking at replicating PLoS One.

There is a lot of online discussion of business models for open access startups. Here's one useful guide: http://www.arl.org/sparc/publisher/incomemodels/

Thrymr · on Oct 30, 2011

PLoS is a non-profit: http://www.plos.org/about/what-is-plos/

michael_nielsen · on Oct 30, 2011

My understanding is that they are making a great deal of money, which they reinvest in the organization. The linked post has a link to a document which includes their balance sheet. You are, of course, correct that they're not-for-profit.

impendia · on Oct 30, 2011

Unfortunately I think that academicians (of which I am one) are very, very conservative in this regard. The problem is that they have no personal incentives, and powerful disincentives, to cut costs.

If it were otherwise, how the hell would Elsevier still exist?