Skip to main content

Table 3 Properties of real-world document collections

From: Pairwise document similarity measure based on present term set

Document collection

#Documents

#Involved terms (vector dimension)

#Categories

#Test documents

#Train documents

WebKB

4199

7772

4

1396

2803

R8

7674

17,387

8

2189

5485