Skip to main content

Table 1 Summary of assumptions and costs of discrete text categorisation [30]

From: Smart literature review: a practical topic modelling approach to exploratory literature review

 

Method

Reading

Human coding

Dictionaries

Supervised learning

Topic model

A. Assumptions

 Categories are known

No

Yes

Yes

Yes

No

 Category nesting. If any, is known

No

Yes

Yes

Yes

No

 Relevant text features are known

No

No

Yes

Yes

Yes

 Mapping is known

No

No

Yes

No

No

 Coding can be automated

No

No

Yes

Yes

Yes

B. Costs

 Preanalysis costs

  Person-hours spent conceptualizing

Low

High

High

High

Low

  Level of substantive knowledge

Moderate/high

High

High

High

Low

 Analysis costs

  Person hours spent per text

High

High

Low

Low

Low

  Level of substantive knowledge

Moderate/high

Moderate

Low

Low

Low

 Postanalysis costs

  Person-hours spent interpreting

High

Low

Low

Low

Moderate

  Level of substantive knowledge

High

High

High

High

High