Fuzzy hashing

Fuzzy hashing, also known as similarity hashing,[1] is a technique for detecting data that is similar, but not exactly the same, as other data. This is in contrast to cryptographic hash functions, which are designed to have significantly different hashes for even minor differences. Fuzzy hashing has been used to identify malware[2][3] and has potential for other applications, like data loss prevention and detecting multiple versions of code.[4][5]

  1. ^ Cite error: The named reference NIST.SP.800-168 was invoked but never defined (see the help page).
  2. ^ Cite error: The named reference Beyond Precision and Recall: Understanding Uses (and Misuses) of Similarity Hashes in Binary Analysis was invoked but never defined (see the help page).
  3. ^ Cite error: The named reference Forensic Malware Analysis: The Value of Fuzzy Hashing Algorithms in Identifying Similarities was invoked but never defined (see the help page).
  4. ^ Cite error: The named reference ssdeep was invoked but never defined (see the help page).
  5. ^ Cite error: The named reference tlsh was invoked but never defined (see the help page).

© MMXXIII Rich X Search. We shall prevail. All rights reserved. Rich X Search