Review of Recent Deep Learning Based Methods for Image-Text Retrieval - Institut d'Électronique et des Technologies du numéRique - UMR CNRS 6164
Conference paper, Year: 2020

Review of Recent Deep Learning Based Methods for Image-Text Retrieval

Abstract

Cross-modal retrieval has drawn much attention in recent years, as the diversity and quantity of data have exploded with the popularity of mobile devices and social media. Extracting relevant information efficiently from large-scale multi-modal data is becoming a crucial problem in information retrieval. Cross-modal retrieval aims to retrieve relevant information across different modalities. In this paper, we highlight key points of recent deep-learning-based cross-modal retrieval approaches, especially in the image-text retrieval context, and classify them into four categories according to their embedding methods. Evaluations of state-of-the-art cross-modal retrieval methods on two benchmark datasets are presented at the end of the paper.
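As a rough illustration of what retrieving across modalities means, the sketch below ranks items of one modality against a query from the other by cosine similarity in a shared embedding space. It is a minimal sketch under stated assumptions, not one of the methods reviewed in the paper: the abstract does not describe the four embedding categories in detail, so the three-dimensional vectors, the item names, and the helper functions `cosine` and `rank_by_similarity` are all hypothetical. In a real system the vectors would come from trained image and text encoders.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_by_similarity(query, candidates):
    """Return candidate ids sorted from most to least similar to the query."""
    scores = {cid: cosine(query, vec) for cid, vec in candidates.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical embeddings, made up for illustration only.
# Both modalities are assumed to live in the same 3-dimensional space.
image_embeddings = {
    "img_dog": [0.9, 0.1, 0.0],
    "img_car": [0.0, 0.2, 0.9],
}
text_embeddings = {
    "a dog running on grass": [0.8, 0.3, 0.1],
    "a red car on a road": [0.1, 0.1, 0.95],
}

# Text-to-image retrieval: a caption is the query, images are the candidates.
text_to_image = rank_by_similarity(
    text_embeddings["a dog running on grass"], image_embeddings
)

# Image-to-text retrieval: an image is the query, captions are the candidates.
image_to_text = rank_by_similarity(image_embeddings["img_car"], text_embeddings)

print(text_to_image)
print(image_to_text)
```

The same ranking function serves both directions because both modalities are assumed to share one space; how that space is learned is what distinguishes the approaches the paper surveys.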
Main file
IEEE_CS_Latex_MIPR_2020_Camera_ready_final.pdf (2.57 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02480975, version 1 (03-09-2020)

Cite

Jianan Chen, Lu Zhang, Cong Bai, Kidiyo Kpalma. Review of Recent Deep Learning Based Methods for Image-Text Retrieval. IEEE 3rd International Conference on Multimedia Information Processing and Retrieval, Aug 2020, Shenzhen, China. ⟨10.1109/MIPR49039.2020.00042⟩. ⟨hal-02480975⟩