From: Task-agnostic representation learning of multimodal twitter data for downstream applications
Method | Recall@1 (%) | Recall@5 (%) | Recall@10 (%) | Median rank |
---|---|---|---|---|
VSE++ | 0.02 | 0.1 | 0.18 | 2492 |
VSE++ with hashtags as text | 0.02 | 0.08 | 0.16 | 2481 |
Proposed method | 17.4 | 25.04 | 28.2 | 308 |
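
For readers unfamiliar with the metrics in this table, the following is a minimal sketch of how Recall@K and median rank are conventionally computed in retrieval evaluation. It assumes each query has exactly one ground-truth item and that its 1-based rank in the retrieved list is already known. The function name `retrieval_metrics` and the example ranks are hypothetical and are not taken from the paper, whose exact evaluation protocol may differ.

```python
from statistics import median


def retrieval_metrics(ranks, ks=(1, 5, 10)):
    """Compute Recall@K (in percent) and the median rank.

    ranks: 1-based rank of the ground-truth item for each query
           (assumption: exactly one correct item per query).
    """
    if not ranks:
        raise ValueError("ranks must be non-empty")
    n = len(ranks)
    # Recall@K: share of queries whose correct item is in the top K results.
    recall = {k: 100.0 * sum(r <= k for r in ranks) / n for k in ks}
    # Median rank: middle value of the ground-truth ranks (lower is better).
    return recall, median(ranks)


# Hypothetical ranks for five queries, for illustration only.
example_ranks = [1, 3, 7, 12, 400]
recall_at_k, median_rank = retrieval_metrics(example_ranks)
print(recall_at_k, median_rank)
```

Under this definition, higher Recall@K and lower median rank both indicate better retrieval, which is how the rows above should be read.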