From: Pairwise document similarity measure based on present term set
Document Sets    #Documents    #Near-Duplicate Documents
WebKB_NDD        1000          50
R8_NDD