MinHash
MinHash for Jaccard similarity estimation - used in duplicate detection and recommendation systems
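As a rough illustration of how MinHash estimates Jaccard similarity, the sketch below builds a signature by taking the minimum hash value of a set under each of several hash functions, then counts the fraction of signature positions where two sets agree. Everything here is an assumption made for illustration and not this page's actual implementation: the function names (`shingles`, `minhash_signature`, `estimate_jaccard`), the word 3-gram shingling, the choice of 128 hash functions, and the use of salted BLAKE2b as the hash family.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of word k-grams (shingles) in text (hypothetical helper)."""
    words = text.lower().split()
    if len(words) < k:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def _hash(seed, item):
    """64-bit hash of item under the seed-th function (assumed: salted BLAKE2b)."""
    digest = hashlib.blake2b(
        item.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(items, num_hashes=128):
    """For each hash function, keep the minimum hash value over the set."""
    if not items:
        raise ValueError("cannot build a MinHash signature for an empty set")
    return [min(_hash(seed, x) for x in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Fraction of positions where the two signatures agree."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def exact_jaccard(a, b):
    """Exact Jaccard similarity |A ∩ B| / |A ∪ B|, for comparison."""
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    doc_a = shingles("the quick brown fox jumps over the lazy dog")
    doc_b = shingles("the quick brown fox leaps over the lazy dog")
    sig_a = minhash_signature(doc_a)
    sig_b = minhash_signature(doc_b)
    print("exact:   ", exact_jaccard(doc_a, doc_b))
    print("estimate:", estimate_jaccard(sig_a, sig_b))
```

For a single hash function, the probability that two sets share the same minimum equals their Jaccard similarity, so averaging over more hash functions tightens the estimate at the cost of a longer signature.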
Algorithm Info
Similarity
Related Algorithms
SimHash
Locality-sensitive SimHash for document similarity - finds near-duplicate content efficiently
b-bit MinHash
Compact b-bit MinHash - memory-efficient similarity detection for large-scale applications
SuperMinHash
Enhanced SuperMinHash (2017) - improved accuracy over traditional MinHash for similarity search
Nilsimsa
Nilsimsa hash for spam detection - specialized locality-sensitive hash for email and text analysis
I-Match
I-Match lexicon-based similarity algorithm - customizable duplicate detection using word dictionaries
