Fig. 3
From: Hypert: hypernymy-aware BERT with Hearst pattern exploitation for hypernym discovery

Overview of the further pretraining method. (Left) Sentence extraction uses extended Hearst patterns to retrieve sentences. (Right) Masked language modeling exploits each extracted subtask corpus and creates a Hypert model for each subtask.
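
The left-hand step of the figure, keeping only sentences that contain a Hearst pattern, can be illustrated with a minimal sketch. The pattern list below is a small illustrative subset of classic Hearst patterns, not the extended set used in the article, and the function name `extract_hearst_sentences` is hypothetical.

```python
import re

# Illustrative subset of classic Hearst patterns (assumption: the article's
# extended pattern set is larger and is not reproduced here).
HEARST_PATTERNS = [
    r"\bsuch as\b",
    r"\bincluding\b",
    r"\bespecially\b",
    r"\band other\b",
    r"\bor other\b",
]

# Compile once so repeated calls do not recompile the patterns.
_COMPILED = [re.compile(p, re.IGNORECASE) for p in HEARST_PATTERNS]


def extract_hearst_sentences(sentences):
    """Return the sentences that match at least one pattern, in input order."""
    return [s for s in sentences if any(p.search(s) for p in _COMPILED)]


corpus = [
    "Fruits such as apples and pears are rich in fiber.",
    "The sky was clear this morning.",
    "Dogs, cats and other pets need regular care.",
]
selected = extract_hearst_sentences(corpus)
```

Under the same assumptions, the retained sentences would form the corpus on which the right-hand step, masked language modeling, further pretrains the model for a given subtask.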