Recent posts

LSH with Jaccard Index

The MinHash algorithm can be used to detect near-duplicate documents. It works by calculating multiple hashes for different sections of a document...
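To make the idea concrete, here is a minimal sketch of MinHash in Python. It is not the post's own implementation: the function names (`shingles`, `minhash_signature`, `estimate_jaccard`, `jaccard`) are hypothetical, and it assumes the "sections" are overlapping word shingles and that the hash family is built by seeding BLAKE2b from `hashlib`. The fraction of signature positions on which two documents agree approximates the Jaccard index of their shingle sets.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word windows) in text."""
    words = text.lower().split()
    if len(words) < k:
        # Assumption: a text shorter than k words becomes a single shingle.
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def _seeded_hash(seed, item):
    """Deterministic 64-bit hash of item; each seed acts as a separate hash function."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=128):
    """For each seeded hash function, keep the minimum hash over all items."""
    if not items:
        raise ValueError("cannot build a signature for an empty set")
    return [min(_seeded_hash(seed, item) for item in items)
            for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Estimate the Jaccard index as the fraction of matching signature positions."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def jaccard(a, b):
    """Exact Jaccard index of two sets, for comparison with the estimate."""
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    doc_a = shingles("the quick brown fox jumps over the lazy dog", k=2)
    doc_b = shingles("the quick brown fox leaps over the lazy dog", k=2)
    sig_a = minhash_signature(doc_a)
    sig_b = minhash_signature(doc_b)
    print("exact:   ", jaccard(doc_a, doc_b))
    print("estimate:", estimate_jaccard(sig_a, sig_b))
```

More hash functions give a tighter estimate at the cost of a longer signature. For large collections, a library with a vectorized hash family would likely be a better fit than this per-item loop.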