Skip to main content

Table 1 A summary of three datasets used for the evaluation

From: Contextual topic discovery using unsupervised keyphrase extraction and hierarchical semantic graph model

Type

Dataset

No. Documents

No. Tokens per doc

No. Sections

Short text

Semeval2017

493

176

493

WWW

675

152

675

Long text

Wiki20

20

4977

251