You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was very surprised because the core of this framework is very similar to news2meme. They were much clever in extracting tags from the images, but the relationship is only on a word-level (even when phrases contain multiple words). I wonder how this framework would perform if we would include the subspace representation.
7. bibtex
@inproceedings{parcalabescu2020exploring,
title={Exploring Phrase Grounding without Training: Contextualisation and Extension to Text-Based Image Retrieval},
author={Parcalabescu, Letitia and Frank, Anette},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
pages={962--963},
year={2020}
}