So, if the names are not the same. How would you then know that file_1 is equal to file_2?
If you do a md5sum on all the files and create something like a hash (eg. file_1: md5sum output), then compare the md5sum. If the md5sum are the same, then the files are the same? But what the chances that two files have the same md5sum but in reality they are not the same.
| [reply] |
MD5 hashes are typically 32 hexadecimal digits, so theoretically the chances of a collision are one in 3216, i.e. 1,208,925,819,614,629,174,706,176.
| [reply] |
You made two errors.
32 hexadecimal digits is 1632 numbers. MD5 hashes are 128 bits in size. 16 hex digits are used to represent the hash since 1632 is equal to 2128.
The chances of a collision cannot be ascertained since you haven't shown that every hash is equally likely to be generated.
| [reply] |
some of the files would be the same. The names may change, the files size etc would stay the same. So the only varible chaing would be the name.
| [reply] |
| [reply] |