资 源 简 介
This project have a Java-implementations of "Similarity Join" algorithm as follows
・ MPJoin 1
・ PPJoin、PPJoin+ 2
Reference
1 Ribeiro.Leonardo A、Harder.Theo : Efficient Set Similarity Joins Using Min-prefixes (2009)
2 Xiao.Chuan、Wang.Wei、Lin.Xuemin、Xu.Jeffrey Yu : Efficient Similarity Joins for Near Duplicate Detection (2008)