Publication

Multimodal news article analysis

Conference Article

Conference

International Joint Conference on Artificial Intelligence (IJCAI)

Edition

26th

Pages

5136-5140

Doc link

https://doi.org/10.24963/ijcai.2017/737

File

Download the digital copy of the doc pdf document

Abstract

The intersection of Computer Vision and Natural Language Processing has been a hot topic of research in recent years, with results that were unthinkable only a few years ago. In view of this progress, we want to highlight online news articles as a potential next step for this area of research. The rich interrelations of text, tags, images or videos, as well as a vast corpus of general knowledge are an exciting benchmark for high-capacity models such as the deep neural networks. In this paper we present a series of tasks and baseline approaches to leverage corpus such as the BreakingNews dataset.

Categories

computer vision.

Author keywords

deep learning; information retrieval; vision and perception

Scientific reference

A. Ramisa. Multimodal news article analysis, 26th International Joint Conference on Artificial Intelligence, 2017, Melbourne, Australia, pp. 5136-5140.