From image to language and back again

被引:0
|
作者
Belz, A. [1 ]
Berg, T. L. [2 ]
Yu, L. [2 ]
机构
[1] Univ Brighton, Comp Engn & Math, Lewes Rd, Brighton BN2 4GJ, E Sussex, England
[2] Univ N Carolina, Comp Sci, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
GENERATION; NETWORKS; MODELS;
D O I
10.1017/S1351324918000086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Work in computer vision and natural language processing involving images and text has been experiencing explosive growth over the past decade, with a particular boost coming from the neural network revolution. The present volume brings together five research articles from several different corners of the area: multilingual multimodal image description (Frank et al.), multimodal machine translation (Madhyastha et al., Frank et al.), image caption generation (Madhyastha et al., Tanti et al.), visual scene understanding (Silberer et al.), and multimodal learning of high-level attributes (Sorodoc et al.). In this article, we touch upon all of these topics as we review work involving images and text under the three main headings of image description (Section 2), visually grounded referring expression generation (REG) and comprehension (Section 3), and visual question answering (VQA) (Section 4).
引用
收藏
页码:325 / 362
页数:38
相关论文
共 50 条
  • [1] From Language Comprehension to Action Understanding and Back Again
    Tremblay, Pascale
    Small, Steven L.
    [J]. CEREBRAL CORTEX, 2011, 21 (05) : 1166 - 1177
  • [2] From participatory sense-making to language: there and back again
    Elena Clare Cuffari
    Ezequiel Di Paolo
    Hanne De Jaegher
    [J]. Phenomenology and the Cognitive Sciences, 2015, 14 : 1089 - 1125
  • [3] From participatory sense-making to language: there and back again
    Cuffari, Elena Clare
    Paolo, Ezequiel Di
    Jaegher, Hanne De
    [J]. PHENOMENOLOGY AND THE COGNITIVE SCIENCES, 2015, 14 (04) : 1089 - 1125
  • [4] Teaching and learning language arts: From campus to classroom and back again
    Heller, Mary F.
    Wood, Naomi J.
    Shawgo, Mary
    [J]. JOURNAL OF EDUCATIONAL RESEARCH, 2007, 100 (04): : 226 - 234
  • [5] From There and Back Again
    Bohart, Arthur C.
    [J]. JOURNAL OF CLINICAL PSYCHOLOGY, 2015, 71 (11) : 1060 - 1069
  • [6] From Notions to Models and Back Again, Again
    Sonenberg, Liz
    [J]. AGENTS IN PRINCIPLE, AGENTS IN PRACTICE, 2011, 7047 : 3 - 3
  • [7] OTTO-AND-STEIN - FROM WORD TO IMAGE AND BACK AGAIN - ZIEGLER,UE
    BAINES, P
    [J]. DESIGN, 1992, (519): : 59 - 59
  • [8] From Eve to the virgin and back again: The image of woman in contemporary (religious) film
    ApostolosCappadona, D
    [J]. NEW IMAGE OF RELIGIOUS FILM, 1997, : 111 - 127
  • [9] From A to Z and Back Again
    Hannah, Dawn
    [J]. VISUAL COMMUNICATION QUARTERLY, 2008, 15 (1-2) : 113 - 120
  • [10] 'FROM A TO B AND BACK AGAIN'
    HOFMANN, M
    [J]. POETRY REVIEW, 1994, 84 (01): : 48 - 48