Publication

The BreakingNews dataset

Conference Article

Conference

Workshop on Vision and Language (VL)

Edition

6th

Pages

38-39

Doc link

http://www.aclweb.org/anthology/W17-2005

Authors

A. Ramisa, F. Yan, F. Moreno-Noguer and K. Mikolajczyk

Abstract

We present BreakingNews, a novel dataset with approximately 100K news articles including images, text and captions, and enriched with heterogeneous meta-data (e.g. GPS coordinates and popularity metrics). The tenuous connection between the images and text in news data is appropriate to take work at the intersection of Computer Vision and Natural Language Processing to the next step, hence we hope this dataset will help spur progress in the field.

Categories

Computer vision

Author keywords

Vision and Language

Scientific reference

A. Ramisa, F. Yan, F. Moreno-Noguer and K. Mikolajczyk. The BreakingNews dataset, 6th Workshop on Vision and Language, 2017, Valencia, pp. 38-39.