Skip to main content

Table 6 Properties of document sets used in near-duplicates detection experiment

From: Pairwise document similarity measure based on present term set

Document Sets

#Documents

#Near-Duplicate Documents

WebKB_NDD

1000

50

R8_NDD

1000

50