Fig. 8: Architecture of the training model [45]
From: Bilingual video captioning model for enhanced video retrieval