
Interesting find! I wonder what a good safeguard against this would be. I feel like just backing up your data would offer something - but a file could silently change and become corrupted in the backup too.


Hm. You could make two backups, checksum each file, and save the checksums. Then you could regularly compare the file contents with the initial checksums; if there is a mismatch, copy from the other backup.
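
A minimal sketch of that scheme in Python; the mount points, manifest location, and the choice of SHA-256 are just placeholders for whatever your setup actually uses:

    import hashlib
    import json
    import shutil
    from pathlib import Path

    # Assumed locations -- adjust for your own backups.
    PRIMARY = Path("/mnt/backup1")
    SECONDARY = Path("/mnt/backup2")
    MANIFEST = Path("/mnt/backup1-checksums.json")

    def sha256(path: Path) -> str:
        h = hashlib.sha256()
        with path.open("rb") as f:
            for chunk in iter(lambda: f.read(1 << 20), b""):
                h.update(chunk)
        return h.hexdigest()

    def build_manifest() -> None:
        # Run once, right after making the backups, while both copies are known good.
        manifest = {str(p.relative_to(PRIMARY)): sha256(p)
                    for p in PRIMARY.rglob("*") if p.is_file()}
        MANIFEST.write_text(json.dumps(manifest, indent=2))

    def verify_and_repair() -> None:
        manifest = json.loads(MANIFEST.read_text())
        for rel, expected in manifest.items():
            primary, secondary = PRIMARY / rel, SECONDARY / rel
            if sha256(primary) == expected:
                continue
            # The primary copy no longer matches the checksum recorded at
            # backup time; restore it from the second backup if that copy
            # is still good.
            if sha256(secondary) == expected:
                shutil.copy2(secondary, primary)
            else:
                print(f"both copies of {rel} are bad; restore by hand")

    if __name__ == "__main__":
        verify_and_repair()

Build the manifest once when the backups are created, then run the verify step on a schedule (cron or similar).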


Git-annex is nice for this; the fsck command will check the file against a checksum and request a new copy from another node automatically if the check fails.
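
Roughly what that flow looks like if you drive it from Python, with the re-fetch made explicit (the repository path here is an assumption; `git annex fsck` and `git annex get` are the real commands):

    import subprocess
    from pathlib import Path

    REPO = Path("/srv/annex")  # assumed path to an existing git-annex repository

    def check_and_refetch(relpath: str) -> None:
        # `git annex fsck` re-hashes the file and exits nonzero if the content
        # no longer matches the checksum embedded in its key.
        fsck = subprocess.run(["git", "annex", "fsck", relpath], cwd=REPO)
        if fsck.returncode != 0:
            # fsck sets the bad copy aside; pull a fresh one from whichever
            # remote still has it.
            subprocess.run(["git", "annex", "get", relpath], cwd=REPO, check=True)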


You could use ZFS; then the file cannot silently become corrupted.


Yes it can. ZFS will only notice the next time you scrub or read the sector.


Sure, but your ZFS pool will have redundancy, and ZFS will know which block was corrupted. This lets it recover from the error.


If the corruption occurred on disk, yes. If it occurred in memory, then ZFS will write multiple incorrect copies to disk.
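
Concretely (a toy illustration, not how ZFS literally stages writes; sha256 stands in for whatever checksum the pool uses):

    import hashlib

    block = bytearray(b"the data you meant to write")

    # A bit flips in non-ECC RAM *before* the filesystem checksums the block.
    block[0] ^= 0x01

    # The checksum is computed over the already-corrupted buffer...
    stored_checksum = hashlib.sha256(block).hexdigest()

    # ...so every mirror gets the bad bytes plus a checksum that matches them.
    # A later scrub recomputes the hash, sees a match, and reports the block
    # as healthy, even though it never held the intended data.
    assert hashlib.sha256(block).hexdigest() == stored_checksum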


This is why ECC is important. Many, many people pooh-pooh the idea that it's needed, but by not having it you leave a single vital part of the data path unprotected. RAM and disk are cheap; losing your data is not. The risk simply isn't worth taking to save literally a few dollars.


That's no argument against ZFS, or backups, or any other form of redundancy. Only the insane would buy computers without ECC.



