Posted November 24, 2012
I use an external hard drive for backup of my files (games, music, photos, documents, etc).
Now, since I've being doing this for some years (and yes, I do manual backups via copy and paste) I just wanted to make sure that I hadn't missed any files out and that they were all up to date. As such I decided the best way would be to carry out a full MD5 digest on every file within my backed up folders. The nifty little FCIV utility from Microsoft does the job just fine, especially as it can output to XML file.
This evening I carried out the check and the results are as follows:
11871 files checked
3 files found to be missing (and so I've backed these up)
7 files with MD5 discrepancy
Of the 7 files with a discrepancy, 2 were out of date as my local copies had a later modified date, so those were backed up again.
This leaves 5 files. 4 of these were Excel spreadsheets and just to do a test I opened one and closed it again (no save) and rechecked the file again. On doing this the MD5 hash for that file changed! Is this normal? Is it some property associated to the file like last opened or something? Anyway, just to make me happy I backed up my local copies of these 4 spreadsheets, leaving just 1 more discrepancy to check. This is where it gets strange.
The last file is an old file I've not used for 9 years, so it was simply copied to my current machine, it has never been used on it. Similarly it was just copied to my external drive for backup. It is actually a RiscOS Acorn Archimedes file that I used to use via emulator. Anyway, I opened both my local copy and the backup copy in Notepad++ in order to do a file compare between the two. It came back saying that they were identical. As such, I ask the question, is it possible for two identical files to have different MD5 digest? And if not, why am I getting a discrepancy with this file?
Now, since I've being doing this for some years (and yes, I do manual backups via copy and paste) I just wanted to make sure that I hadn't missed any files out and that they were all up to date. As such I decided the best way would be to carry out a full MD5 digest on every file within my backed up folders. The nifty little FCIV utility from Microsoft does the job just fine, especially as it can output to XML file.
This evening I carried out the check and the results are as follows:
11871 files checked
3 files found to be missing (and so I've backed these up)
7 files with MD5 discrepancy
Of the 7 files with a discrepancy, 2 were out of date as my local copies had a later modified date, so those were backed up again.
This leaves 5 files. 4 of these were Excel spreadsheets and just to do a test I opened one and closed it again (no save) and rechecked the file again. On doing this the MD5 hash for that file changed! Is this normal? Is it some property associated to the file like last opened or something? Anyway, just to make me happy I backed up my local copies of these 4 spreadsheets, leaving just 1 more discrepancy to check. This is where it gets strange.
The last file is an old file I've not used for 9 years, so it was simply copied to my current machine, it has never been used on it. Similarly it was just copied to my external drive for backup. It is actually a RiscOS Acorn Archimedes file that I used to use via emulator. Anyway, I opened both my local copy and the backup copy in Notepad++ in order to do a file compare between the two. It came back saying that they were identical. As such, I ask the question, is it possible for two identical files to have different MD5 digest? And if not, why am I getting a discrepancy with this file?
This question / problem has been solved by Pidgeot