Libtiff goes offline (garymcgath.com)
110 points by hk__2 on Sept 12, 2016 | 114 comments



The code is now being hosted at http://www.simplesystems.org/libtiff/ with a mirror site at http://libtiff.maptools.org/ .


There is also a github mirror at https://github.com/vadz/libtiff .


The nice thing is that every libtiff tarball contains a copy of the web site. So at https://rawgit.com/vadz/libtiff/master/html/index.html you can view a partially functional site (the images don't work very well).



NO. That is an old, hijacked domain. Read the story on Wikipedia: https://en.wikipedia.org/wiki/LibTIFF .


Oh, very annoying that remotesensing.org links to libtiff.org then!


Yes. The real libtiff people are trying to resolve that with the domain owner; he seems to be at least a little bit more cooperative than the libtiff.org one.


libtiff.org is a major release behind.


Seems like we've had organizations that house ideas for the public good for centuries now.

Why not involve libraries in the effort? They could keep a "master" copy, and github.com, gitlab.org, SF.com, or whoever else comes next can host development versions.



They only do archiving. Hosting projects that are under development is not in their mission. https://www.linuxfoundation.org/ looks like a better match.

There is a need for an organization that is willing to host established but small and unfunded projects like libtiff, libpng, zlib, etc., and give them some minimal organizational backing in the interest of continuity (so no github). Somewhat similar to, but more general than, what the Network Time Foundation is doing for ntp and related projects.


I'm not saying you should do it or that it's the best option, but putting your code under the umbrella of something like the FSF or the Apache Foundation comes with this sort of benefit.


A potentially brilliant idea. I wish I had a clue how to follow up on it. Perhaps someone else here might!


The Internet Archive would be the perfect group of folks to archive code.


Indeed. Updates on GitHub (which can fire a webhook) should trigger an archive operation.

It has been on my list to integrate https://github.com/joeyh/github-backup with https://github.com/ArchiveTeam/ArchiveBot.


I love the Internet Archive, so it would be great if some kind of archiving function could be worked out. But if it's on GitHub now, it should be exported to Zenodo today, so there's at least one reference version around for the Internet Archive to back up.


Thankful for libtiff's role in the PSP exploit community many years ago. :)


also one of the original iPhone 1 jailbreaks


To my knowledge this was the original iPhone jailbreak (JailbreakMe). I was on IRC whilst the author was seeking donations before releasing it ;) Ran it on my trusty iPod Touch 1g only minutes after it was released.

This was for iOS 1.1 though, not iOS 1.0, which was not encrypted and was never really locked down. It was Apple's choice to release iOS 1.0 without the security enabled that made further jailbreaking efforts much more straightforward, because there was extensive knowledge of the file system.

This was all well before there was an App Store, and when Apple's public position was that they'd never allow third-party software on their platform ;)


A bit tangential, but I just noticed that in the Arch Linux repos, `libtiff` is version 4.0.6, but there's another package called `libtiff4`, which is version 3.9.7. I don't suppose anybody here might know the rationale behind this bizarre version/naming paradigm?


Isn't the number in the library file name a binary incompatibility counter and not necessarily related to the software version number?

http://www.faqs.org/docs/Linux-HOWTO/Program-Library-HOWTO.h...


Ah, okay, I somehow didn't realize that the trailing number referred to the number in the library file. Thanks!


As chrisseaton said, version numbers aren't necessarily ABI numbers.

Arch `libtiff` provides libtiff ABI 5, while `libtiff4` provides ABI 4.


And we have yet another core infrastructure project that isn't supported and doesn't get funding.

The real issue is that a lot of people rely on this and nobody is willing to put anything back toward it.

Apparently we have learned nothing from the OpenSSL debacle.


Why would the libtiff.org owner not want anything to do with the project?


RemoteSensing seems to be back up, with a link to http://libtiff.org as its purported new home.

However, looking at the mailing list, it seems this domain is not controlled by the developers.


We've had this before, when libtiff.org was hijacked, which it still is: it contains an old version of the libtiff website, with ads.


Because @libtiff is already taken on GitHub, I think libtiff.gitlab.io is okay enough.


It doesn't seem to have been actively used in the past few years, though; isn't there a process one can use to claim unused GitHub user/org names? Might be worth talking to GitHub staff about...


There is definitely a process, and they should apply for it. Hopefully the account is a free one and not hosting active private repos.


LibTiff.info and LibTiff.io point to one of the available mirrors.


Seems like it's time to host it on a DVCS service so the problem won't be so disruptive again.


Remember sunsite.unc.edu? Sourceforge? Both were the gonna-be-around-forever source code distribution points of their day.


> Remember sunsite.unc.edu?

That's a throwback. I looked it up and apparently UNC's Sunsite turned into http://www.ibiblio.org/about/

There are a couple of other Sunsites still up. This one looks like it has not been updated since 1997: http://sunsite.ubc.ca/


Nah, sunsite is a noob, along with Slackware. When you wanted to download boot/root, Tamu, or SLS you went to tsx-11.mit.edu or ftp.funet.fi to get it.


Which is why a DVCS is important: the history goes with every copy.


A history goes with every copy. You still need a protocol for determining the history.


If you have a maintainer who signs their commits, you trust their signatures and call it a day. Go with the latest commit and that tells you the history that they have cryptographically claimed is "correct".


One problem I see there is that, with git, you still have to take care that you keep around all the commits that at any given point in time were declared "correct".

For example, if a commit is amended and force pushed out to remove a backdoor from a popular package, a site interested in the history of the library will want to ensure that the faulty commit stays around.

I'm not even sure that is 100% possible, but it, at least, will take some careful git configuration.


You never rewrite published history with git. It's fine to force push signed git commits to a random branch that are going to be rewritten -- that's just how development works. But when you merge something, it stays that way. Touching history in release branches is something that will always end in tears (all of the tagging breaks, everyone's clone will complain when they update it, etc). Just bad news all around.

As for a site which cares about archiving, they can almost certainly add a tag for every commit they slurp up (making sure it doesn't get hit by the git gc). Though, I'm fairly sure that you can just configure your git instance to never garbage collect.


> You still need a protocol for determining the history.

Is this a real problem or a theoretical problem?


Such problems are always theoretical, until they suddenly become real, and you're happy that someone years ago thought of that possibility and left behind a way to solve them...


And many, many more problems are theoretical and remain theoretical forever.


Sourceforge still exists. sunsite.unc.edu merged into ibiblio.


Yeah. It's still on CVS. May as well throw it up on GitHub like every other open source project. That doesn't have to be its only home on the web, but at least it would always be available there. The main open source project I work on (Pywikibot) isn't primarily hosted on GitHub, but it is mirrored there.


The GitHub mirror is at https://github.com/vadz/libtiff .


   /me cringes in fear of a left-pad moment -- what distros can't be built now?
Only half joking. ;)


For many distros, I'm sure they host the binaries (and source) on their own mirrors, so the issue isn't immediately threatening.

An example for my distro, Gentoo[0]:

[0] http://gentoo.osuosl.org/distfiles/tiff-4.0.6.tar.gz


Any distro that complies with the GPL will generalise its requirements to everything else, and thus host original sources for every single package. :-)


But not necessarily the source history + issues, plus obscure facts that were only pointed out once on a mailing list by someone who has since disappeared.


With openSUSE, we store a copy of the source code in our own repos in OBS[1]. And all users can download the source RPMs.

[1]: https://build.opensuse.org/


Tiff? That is a format I haven't heard of in a long time.

Edit: to everyone replying: TIFF is awful. We have lossless compression now (read: PNG), which is at least a billion times better. We shouldn't use it for anything in this day and age. Hell, just DEFLATE your TIFF and call it a new format. It will be better than TIFF.


TIFF is one of the most open and flexible file formats out there.

PNG offers only a tiny subset of what's possible with TIFF. Note that TIFF supports multiple compression formats, lossless and lossy, multiple sample formats and sample counts, very flexible organisation of data layout etc. PNG offers a small number of sample formats, fixed layout and compression, and none of the higher-level TIFF features.

PNG is simple and easy to use. But "better" is subjective and by most objective measures it's inferior to TIFF.
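
To make that concrete, here's roughly what that flexibility looks like through libtiff's C API: a single-channel 32-bit floating-point image with Deflate compression, which PNG has no way to express. This is only a sketch; the file name and dimensions are made up, and error handling is minimal.

    #include <stdint.h>
    #include <stdlib.h>
    #include <tiffio.h>

    int main(void)
    {
        const uint32_t width = 256, height = 256;        /* arbitrary example size */
        TIFF *tif = TIFFOpen("float_example.tif", "w");  /* hypothetical file name */
        if (!tif) return 1;

        /* Baseline geometry plus the bits PNG cannot express:
           one 32-bit IEEE float sample per pixel, Deflate-compressed. */
        TIFFSetField(tif, TIFFTAG_IMAGEWIDTH, width);
        TIFFSetField(tif, TIFFTAG_IMAGELENGTH, height);
        TIFFSetField(tif, TIFFTAG_SAMPLESPERPIXEL, 1);
        TIFFSetField(tif, TIFFTAG_BITSPERSAMPLE, 32);
        TIFFSetField(tif, TIFFTAG_SAMPLEFORMAT, SAMPLEFORMAT_IEEEFP);
        TIFFSetField(tif, TIFFTAG_PHOTOMETRIC, PHOTOMETRIC_MINISBLACK);
        TIFFSetField(tif, TIFFTAG_PLANARCONFIG, PLANARCONFIG_CONTIG);
        TIFFSetField(tif, TIFFTAG_COMPRESSION, COMPRESSION_ADOBE_DEFLATE);
        TIFFSetField(tif, TIFFTAG_ROWSPERSTRIP, TIFFDefaultStripSize(tif, 0));

        float *row = malloc(width * sizeof(float));
        for (uint32_t y = 0; y < height; y++) {
            for (uint32_t x = 0; x < width; x++)
                row[x] = (float)x / (float)width;        /* dummy gradient data */
            TIFFWriteScanline(tif, row, y, 0);
        }

        free(row);
        TIFFClose(tif);
        return 0;
    }

Change the sample format, bit depth, or compression tag and the rest stays the same; that's the flexibility being described above.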


TIFF is typically used as an intermediate uncompressed memory format for almost everything in images. Your PNG library is probably decompressing PNG images into TIFF images internally. Your printer probably translates whatever is given to it into a TIFF image to print, and your scanner probably uses TIFF as a raw format to translate into something else. Even camera RAW images are probably using TIFF in some way or another.


> Even camera RAW images are probably using TIFF in some way or another

https://en.wikipedia.org/wiki/Digital_Negative DNG is based on the TIFF/EP standard format, and mandates significant use of metadata.


> Hell, just DEFLATE your TIFF and call it a new format

You don't have to call it a new format; TIFF has supported lossless compression for decades now: https://en.wikipedia.org/wiki/TIFF#TIFF_Compression_Tag

DNG is also based on TIFF, so I can imagine people might use libtiff to read DNG headers.
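
If it helps, the Compression tag is literally just one field per image, and you can ask libtiff at runtime which codecs it was built with. A rough sketch (the list of schemes below is only a sample):

    #include <stdio.h>
    #include <stdint.h>
    #include <tiffio.h>

    int main(void)
    {
        /* A few of the compression schemes a TIFF can declare; the tag just
           names the codec, lossless or lossy, and libtiff dispatches on it. */
        struct { uint16_t id; const char *name; } schemes[] = {
            { COMPRESSION_NONE,          "uncompressed"   },
            { COMPRESSION_LZW,           "LZW"            },
            { COMPRESSION_ADOBE_DEFLATE, "Deflate (zlib)" },
            { COMPRESSION_PACKBITS,      "PackBits"       },
            { COMPRESSION_JPEG,          "JPEG"           },
        };
        for (size_t i = 0; i < sizeof(schemes) / sizeof(schemes[0]); i++)
            printf("%-15s %s\n", schemes[i].name,
                   TIFFIsCODECConfigured(schemes[i].id) ? "available"
                                                        : "not built in");
        return 0;
    }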


Tiff is very useful because it lets you do almost anything. A tiff image is just several arrays of numbers; you decide how many bits, signed, unsigned, float, int, etc.

This is very helpful for processing which treats each pixel as a sample of the scene, such as computational microscopy or remote sensing. I've always wondered if that had something to do with why it was hosted at remotesensing.org.


See my edit. Also, when you are doing image processing, operating on scanlines is usually the least performant way of doing things, hence why we have texture compression formats (even lossless ones) which do block encodings. TIFF has no reason for existing, in my opinion.


TIFF supports many useful features for processing very large images. Tiled compression, storing channels in contiguous hyperplanes or on a single image plane, multiple levels of detail, custom compression codecs (lossless or lossy), sparse images, arbitrary bit widths, and permits storing arbitrary metadata with the data.

Users of libtiff often treat it like PNG/JPEG, but good Tiff viewers can leverage this functionality. Typically few batteries are included.

Tiff is kind of a cross breed between structured portable formats (HDF5/NetCDF) and imagery.


I don't doubt that you can have a library that provides all those features. All those features can exist for any lossless image format. My point is that as far as comparing image formats goes, the capabilities of the frontend library aren't a useful metric for evaluating the format itself, which, in my opinion, can optimize for the following traits:

1. On-disk size

2. Compression/decompression speed

3. Access speed (for use in image analysis algorithms, editing, compositing, etc)

4. GPU friendliness (GPUs operate in warps on small blocks of contiguous pixels in image space)

TIFF doesn't really optimize for any of these (not even #2 since in-memory decompression can be faster than paging uncompressed data from disk)


TIFF is the standard format for scientific imaging. It's the container format used by well over 80% of all imaging formats out there.

It's being slowly replaced by HDF5 for some applications.

But all of the features you mention above are tunable and are in practice perfectly fine with TIFF containers.


The big problem with highly tuned formats is that they make a lot of choices for the user. This works well when the format is used exactly as designed, but that is often not the case. With a tif I can pick exactly the compression, bit depth, color mode, tiling etc. that I need for the specific data.


> All those features can exist for any lossless image format.

Yes, but for most image formats they aren't defined. They are for TIFF.

The problem, of course, is that if you use an exotic TIFF feature, very few TIFF readers will understand you.


TIFF can operate with scanlines, strips or tiles. Big images, such as 200k x 200k digital pathology slide scans, are stored in TIFF as e.g. 512x512 tiles, individually compressed and transparently accessible to the viewer. It is perfectly capable of dealing with images of vast sizes.
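
Roughly what that looks like through libtiff's tiled API, for anyone who hasn't used it (a sketch; the file name is made up and error handling is minimal):

    #include <stdio.h>
    #include <stdint.h>
    #include <tiffio.h>

    int main(void)
    {
        TIFF *tif = TIFFOpen("slide_scan.tif", "r");   /* hypothetical tiled scan */
        if (!tif) return 1;
        if (!TIFFIsTiled(tif)) { TIFFClose(tif); return 1; }

        uint32_t tw = 0, th = 0;
        TIFFGetField(tif, TIFFTAG_TILEWIDTH, &tw);
        TIFFGetField(tif, TIFFTAG_TILELENGTH, &th);

        /* Buffer for one decoded tile; only that tile is read and
           decompressed, never the whole multi-gigapixel image. */
        tdata_t buf = _TIFFmalloc(TIFFTileSize(tif));

        /* The x/y arguments are pixel coordinates; libtiff fetches the
           tile containing that pixel, here the top-left one. */
        if (TIFFReadTile(tif, buf, 0, 0, 0, 0) != -1)
            printf("decoded a %u x %u tile\n", (unsigned)tw, (unsigned)th);

        _TIFFfree(buf);
        TIFFClose(tif);
        return 0;
    }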


I don't know of any other application-independent image format with good support for layers. Seriously. (Even Tiff doesn't really have good support for layers.)

.PSD is tightly tied to Photoshop's internals, and .xcf is tightly tied to Gimp's internals. (The Gimp core developers explicitly recommend against using .xcf as an interchange format, even though the format has a fairly complete publicly available spec.) I've seen .gif images with multiple frames used as images with layers, but that forces every layer to have the same resolution, and of course limits you to an 8-bit color depth.

Supposedly the Gimp and Krita devs are collaborating on a new interchange format that will support things like layers, but I haven't heard any news about that in years.


What happened with the OpenRaster format?


It's doing fine. It's the native format for MyPaint, Krita supports it, Scribus supports it. Gimp's support is outdated, but that's because Gimp gets releases so rarely.


It was kind of a rhetorical question. I looked it up after I sent the comment to confirm what you just said. I was answering to the last paragraph of the parent comment.


Oh, very cool, it looks like they are making good progress!

Unfortunately, it looks like there's no support for it in Photoshop, and I doubt that's likely to change.


TIFF is still used in the Apple ecosystems. It got a mini-rebirth recently when Apple started doing retina displays. TIFF has the ability to store multiple images, which Apple has latched onto to allow different images to be used for different resolution screens.
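
For anyone curious, those variants are just multiple TIFF directories (IFDs) in one file, and enumerating them with libtiff is a short loop. A sketch with a made-up file name:

    #include <stdio.h>
    #include <stdint.h>
    #include <tiffio.h>

    int main(void)
    {
        TIFF *tif = TIFFOpen("icon.tiff", "r");   /* hypothetical multi-image file */
        if (!tif) return 1;

        /* Each directory (IFD) is a complete image; a 1x and a 2x ("retina")
           variant can simply sit in the same file as separate directories. */
        do {
            uint32_t w = 0, h = 0;
            TIFFGetField(tif, TIFFTAG_IMAGEWIDTH, &w);
            TIFFGetField(tif, TIFFTAG_IMAGELENGTH, &h);
            printf("directory %u: %u x %u\n",
                   (unsigned)TIFFCurrentDirectory(tif), (unsigned)w, (unsigned)h);
        } while (TIFFReadDirectory(tif));   /* returns 0 after the last image */

        TIFFClose(tif);
        return 0;
    }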


It's popular in mapping because it can be georeferenced. https://trac.osgeo.org/geotiff/


Still run into it with things like patents. It was the go-to for a long time for scanned imagery.


Can we have multichannel 16-bit PNGs now? Can PNGs hold 3D images? If so, awesome: I can drop TIFF at last.

Not all images are taken by cameras using 3 channels.


> Can we have multichannel 16-bit PNGs now?

The PNG format supports them (up to 4 channels); however, not all apps know what to do with them.


It's commonly used as an archival format since it's not lossy like JPEG.


JPEG is commonly used in its lossy format, but there's lossless JPEG[1] as well.

[1] https://en.wikipedia.org/wiki/Lossless_JPEG


Application support is pretty huge, though: that's what effectively turned JPEG 2000 into a niche format, since the standard was patent- and license-encumbered for the first decade and change, and the vendors didn't make interoperability a priority because they assumed the technical merits would force everyone to adopt it in the end. When you're talking about lossless, however, it's often in the context of archival storage, and people get spooked after they encounter files which they have trouble opening with perfect fidelity.

It's a shame since the compression technology was impressive and j2k would also have made a great progressive image format had browsers supported it.


TIFF has selectable (and pluggable) compression. While it's standard to use deflate compression, you can also use jpeg2000 or any other algorithm of your choice if you want lossy compression.


See my edit


TIFF is like a container format. It supports lossless compression. Read up.


TIFF is still used a lot in photography.


No matter how old or outdated a technology is, there will always be someone still using it somewhere.

I work for a company that makes workers' compensation software. I don't know of anything in our product that actually generates new TIFFs, but we've got plenty of old ones bumping around that were either uploaded by users or imported from other systems. I end up interacting with TIFFs one way or another every other month or so.


As far as I know, PNG does not support CMYK or spot colors.

Nor clipping paths.

For print, there is not really an "open" alternative that is as good as TIFF.


IIRC, it's used a lot for black-and-white images without a lot of photographic detail (schematics, etc.)


Digital negative storage for photography

Texture data for some videogames

Portable format for layered composites (a la Photoshop)


TIFF was used (probably still is) to store data from gas detectors. It wasn't called TIFF, but it was based on the format plus custom tags to store extra data and image layers. As a format it is super-flexible.
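
That's the private-tag mechanism: tag numbers in the 65000+ range are set aside for exactly this kind of vendor extension, and libtiff lets you register your own. A rough sketch of the usual wiring -- the tag number, tag name, and file name below are invented for illustration, and the TIFFFieldInfo layout has shifted a little between libtiff versions, so treat it as a sketch rather than gospel:

    #include <tiffio.h>

    #define TIFFTAG_SENSORNOTES 65001   /* hypothetical private tag (65000+ range) */

    static TIFFExtendProc parent_extender = NULL;

    /* Describe the custom tag so libtiff reads/writes it like any standard field. */
    static const TIFFFieldInfo custom_fields[] = {
        { TIFFTAG_SENSORNOTES, -1, -1, TIFF_ASCII, FIELD_CUSTOM, 1, 0, "SensorNotes" },
    };

    static void register_custom_tags(TIFF *tif)
    {
        TIFFMergeFieldInfo(tif, custom_fields,
                           sizeof(custom_fields) / sizeof(custom_fields[0]));
        if (parent_extender)
            (*parent_extender)(tif);    /* chain to any previously installed extender */
    }

    int main(void)
    {
        parent_extender = TIFFSetTagExtender(register_custom_tags);  /* before TIFFOpen */

        TIFF *tif = TIFFOpen("detector.tif", "w");   /* hypothetical output file */
        if (!tif) return 1;

        /* Minimal 1x1 8-bit image so the directory is valid. */
        unsigned char pixel = 0;
        TIFFSetField(tif, TIFFTAG_IMAGEWIDTH, 1);
        TIFFSetField(tif, TIFFTAG_IMAGELENGTH, 1);
        TIFFSetField(tif, TIFFTAG_SAMPLESPERPIXEL, 1);
        TIFFSetField(tif, TIFFTAG_BITSPERSAMPLE, 8);
        TIFFSetField(tif, TIFFTAG_PHOTOMETRIC, PHOTOMETRIC_MINISBLACK);

        /* The custom payload rides along in the same file as a normal tag. */
        TIFFSetField(tif, TIFFTAG_SENSORNOTES, "methane channel, sampled at 2 Hz");

        TIFFWriteScanline(tif, &pixel, 0, 0);
        TIFFClose(tif);
        return 0;
    }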


And yet, TIFF images still exist.


For important projects like this (libjpeg, libpng and others) it would make sense if there were some sort of place to get all of them, apart from mirrors.

What am I saying, that place[0] exists. Developers shouldn't have to shoulder the burden of hosting their code if they don't want to or don't wish to weather the expense. It certainly seems in this case they couldn't pay for self-hosting (or friend-hosting or whatever this is). I do suppose it is difficult to move development to a new system, though (CVS to git).

[0] https://github.com/


I'm no Stallman, but I'm deeply uncomfortable with how readily everyone is accepting (and encouraging) GitHub as the complete overlord of open-source software. As already pointed out, Sourceforge is a cautionary tale of how these services can go very wrong due to business issues. There's also a much larger philosophical issue with basing the open-source economy on a proprietary platform with no particular intent of open-sourcing its core software.

Yes, GH does a lot of things right, including generally making it easy to export data from their apps in a reasonably vendor-neutral form. But what happens when GH runs into financial trouble like SF did? Do we want just about every project out there to have to struggle to do something with their GitHub issues?

There are issues with having open-source projects maintain their own infrastructure, but I think it's the right thing to do wherever possible. It makes them truly independent in a way that a GitHub repo can never be.


Git is decentralized. GitHub provides free hosting for Git repositories. It's not like if GitHub dies tomorrow, you would lose all your source code.


You might lose a lot of metadata around it. The comment to which you're responding mentions issues, which along with pull requests would be a particular area of concern.


Would it be any different than if, say, a Redmine instance got lost? Still, the coupling to GH is a concern.


It ameliorates the problem by making sure you have your repo and history, yes; but you do lose the means of distribution (the host), which is a loss and, I think, is really what happened here with libtiff.


Github is a lot easier to migrate out of, come that day.


That place existed in the past, too: https://sf.net

The question is, can we trust an entity like that to remain trustworthy through decades? Turns out SF.net wasn't trustworthy through all of that time, even if it was in the beginning, and it seems to be sincerely trying to be trustworthy again. So, if we put all of our eggs in one basket, we'd better be really confident in that basket. We could have lost every OSS project website, rather than just one, if an entity that we trust today becomes untrustworthy tomorrow.


GitHub will rot some day as well. (I think we're already seeing it start to decline.)

This is simply a permanent problem. Archive.org and, to a lesser extent, IPFS are viable solutions for archiving, but _contingency_ plans are, I think, the missing component here. Pray for the best but prepare for the worst.


You know, come to think of it, Archive.org is sorta kinda the right place to store these things. If they offered source code control, they would get both the source and the path to the latest version all at once, which could be an invaluable historical record.


A git HTTP archive can be crawled and archived in principle. They might well be doing this already with github and elsewhere just through the normal course of their operation.




Pertains only to Issues, not the git repo itself.


I don't think we're seeing a GitHub decline. Yes, they've had some internal strife not all that connected to the business, but Google just moved a lot of its OSS there, and Microsoft just moved a bunch of stuff there. Git repos take a single command-line statement to move, but if GitHub is slowing at all, it's only because there isn't much left to hoover up.

Gitlab et al are growing, and now AWS has its own integrated solution, but I don't see Github going anywhere for a very, very long time. (Barring catastrophic happenings, of course.)


The issues and pull requests associated with a repository on GitHub (and often referenced in commit messages) do not move along with the code though.


There is a new project, Software Heritage [0], that specifically addresses the problem of long-term code availability.

[0] https://www.softwareheritage.org/


I won't use github for my own projects because I don't agree with their politics. (If you're ok with them, good for you. I won't stop you.)

Mandating github for everybody is a Bad Idea.


What do you disagree with?

I am receptive to this kind of argument, I am honestly curious for specifics.


Presumably the fact that they take down repositories that disagree with their politics (or the politics of other governments). They also don't appear to put up any fight against DMCA requests, and they mandate that all users run proprietary JavaScript. LibreJS makes the site mostly work, though, so it's not that bad. The biggest problem is their policies, which are not pro-free-software (no matter what they might say).


That's irrelevant, really. Just that standardizing on a particular business for all repositories will never please everyone.

See if there's a better solution.


GitHub had a feminism mini-scandal a couple of years ago: http://www.theverge.com/2014/3/15/5512462/github-developer-l...


That place should not be a company which is known for not fighting against DMCA requests, and which requires that users run proprietary JavaScript. The best place would be the Internet Archive, hopefully running an FSF-approved Git front-end (GitLab or GNU Savannah -- Gogs is probably also fine, but they haven't reviewed it). Nothing else is acceptable if you actually want to make sure that code will truly outlive us.


> What am I saying, that place[0] exists

One day, this will have only existed. Until then, they should host the repository on Google Code.


Google Code is dead.


I'm assuming that's their point.




