COMPUTATIONAL MODELS FOR INTEGRATING LINGUISTIC AND VISUAL INFORMATION - A SURVEY

被引:22
|
作者
SRIHARI, RK [1 ]
机构
[1] SUNY BUFFALO,DEPT COMP SCI,BUFFALO,NY 14228
关键词
NATURAL LANGUAGE UNDERSTANDING; COMPUTER VISION; DIAGRAM UNDERSTANDING; SPATIAL REASONING; MULTIMEDIA;
D O I
10.1007/BF00849725
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper surveys research in developing computational models for integrating linguistic and visual information. It begins with a discussion of systems which have been actually implemented and continues with computationally motivated theories of human cognition. Since existing research spans several disciplines (e.g., natural language understanding, computer vision, knowledge representation), as well as several application areas, an important contribution of this paper is to categorize existing research based on inputs and objectives. Finally, some key issues related to integrating information from two such diverse sources are outlined and related to existing research. Throughout, the key issue addressed is the correspondence problem, namely how to associate visual events with words and vice versa.
引用
收藏
页码:349 / 369
页数:21
相关论文
共 50 条
  • [1] Computational models for integrating linguistic and visual information: a survey
    Srihari, Rohini K.
    Artificial Intelligence Review, 1994, 8 (5-6) : 349 - 369
  • [2] Cues to Generality: Integrating Linguistic and Visual Information When Generalizing Biological Information
    Menendez, David
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 2023, 115 (08) : 1110 - 1124
  • [3] Computational Models of Human Visual Attention and Their Implementations: A Survey
    Kimura, Akisato
    Yonetani, Ryo
    Hirayama, Takatsugu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03) : 562 - 578
  • [4] Editorial: Integrating Visual System Mechanisms, Computational Models and Algorithms/Technologies
    Spitzer, Hedva
    Otazu, Xavier
    Hel-Or, Hagit
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 7
  • [5] RETRACTED ARTICLE: From the human visual system to the computational models of visual attention: a survey
    Sílvio Filipe
    Luís A. Alexandre
    Artificial Intelligence Review, 2015, 43 : 601 - 601
  • [6] Integrating Visual Information Overtime
    Boynton, Geoffrey M.
    Bjorn, H. -Wallander
    I-PERCEPTION, 2015, 6 (06):
  • [7] Computational models of visual attention
    Tsotsos, John K.
    Eckstein, Miguel P.
    Landy, Michael S.
    VISION RESEARCH, 2015, 116 : 93 - 94
  • [8] Computational linguistic models and language technologies for croatian
    Basic, Bojana Dalbelo
    Dovedan, Zdravko
    Raffaelli, Ida
    Seljan, Sanja
    Tadic, Marko
    PROCEEDINGS OF THE ITI 2007 29TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2007, : 521 - +
  • [9] A Comparative Analysis of Symbolic Linguistic Computational Models
    Rodriguez, R. M.
    Martinez, L.
    Espinilla, M.
    PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, : 108 - 113
  • [10] Integrating ontological and linguistic knowledge for conceptual information extraction
    Basili, R
    Vindigni, M
    Zanzotto, FM
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 175 - 181