Fig. 9: Architecture of the caption generation model
From: Bilingual video captioning model for enhanced video retrieval