Skip to main content

Table 2 Performance results of the proposed image captioning method for different mother wavelets. Here BM denotes the baseline method

From: Image caption generation using Visual Attention Prediction and Contextual Spatial Relation Extraction

Mother wavelet

Flickr8K

Flickr30K

MSCOCO

 

B@4

CD

B@4

CD

B@4

CD

BM

24.43

58.31

23.68

57.89

35.78

118.02

db1

25.77

59.37

24.87

58.91

36.57

119.84

db4

25.86

59.56

25.01

59.14

36.82

119.95

bior1.5

26.34

60.58

25.30

60.13

37.14

120.41

bior2.4

26.18

60.52

25.32

60.02

37.01

120.16

bior3.5

26.04

60.19

25.03

59.84

36.89

120.03

bior5.5

25.85

59.92

24.84

59.77

36.77

119.98

Coif2

25.96

59.77

24.97

59.52

36.79

119.64

Coif5

26.08

59.63

24.82

59.03

36.62

119.58

Sym2

25.81

59.68

24.73

58.78

36.81

119.80

Sym4

24.97

59.72

24.61

58.65

36.73

119.63

  1. The highest values for each of the metrics are given in bold