Bilingual video captioning model for enhanced video retrieval

Journal of Big Data

Table 6 Dataset specifications

Dataset split	# of videos	# of captions	# of captions used (English experiment)	# of captions used (Arabic experiment)
Training	1200	49,142	47,472	34,477
Validation	100	3990	3870	2836
Testing	670	27,695	27,695	27,695
Total	1970	80,827	79,037	65,008