COMPUTATIONAL MODELS FOR INTEGRATING LINGUISTIC AND VISUAL INFORMATION - A SURVEY

被引:22
|
作者
SRIHARI, RK [1 ]
机构
[1] SUNY BUFFALO,DEPT COMP SCI,BUFFALO,NY 14228
关键词
NATURAL LANGUAGE UNDERSTANDING; COMPUTER VISION; DIAGRAM UNDERSTANDING; SPATIAL REASONING; MULTIMEDIA;
D O I
10.1007/BF00849725
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper surveys research in developing computational models for integrating linguistic and visual information. It begins with a discussion of systems which have been actually implemented and continues with computationally motivated theories of human cognition. Since existing research spans several disciplines (e.g., natural language understanding, computer vision, knowledge representation), as well as several application areas, an important contribution of this paper is to categorize existing research based on inputs and objectives. Finally, some key issues related to integrating information from two such diverse sources are outlined and related to existing research. Throughout, the key issue addressed is the correspondence problem, namely how to associate visual events with words and vice versa.
引用
收藏
页码:349 / 369
页数:21
相关论文
共 50 条
  • [21] Computational models of cortical visual processing
    Heeger, DJ
    Simoncelli, EP
    Movshon, JA
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (02) : 623 - 627
  • [22] Computational models of cortical visual processing
    Heeger, D. J.
    Simoncelli, E. P.
    Movshon, J. A.
    Physical Review B: Condensed Matter, 53 (11):
  • [23] Graphical models for integrating syllabic information
    Bartels, Chris D.
    Bilmes, Jeff A.
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 685 - 697
  • [24] Integrating Visual and Tactile Information in the Perirhinal Cortex
    Holdstock, J. S.
    Hocking, J.
    Notley, P.
    Devlin, J. T.
    Price, C. J.
    CEREBRAL CORTEX, 2009, 19 (12) : 2993 - 3000
  • [25] INTEGRATING VISUAL INFORMATION FROM SUCCESSIVE FIXATIONS
    JONIDES, J
    IRWIN, DE
    YANTIS, S
    SCIENCE, 1982, 215 (4529) : 192 - 194
  • [26] Computational Intelligence for Information Security: A Survey
    Wang, Ruili
    Ji, Wanting
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (05): : 616 - 629
  • [27] The role of general cognitive skills in integrating visual and linguistic information during sentence comprehension: individual differences across the lifespan
    Hintz, Florian
    Voeten, Cesko C.
    Dobo, Dorottya
    Lukics, Krisztina Sara
    Lukacs, Agnes
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [28] Integrating shallow and linguistic techniques for information extraction from text
    Ciravegna, F
    Cancedda, N
    TOPICS IN ARTIFICIAL INTELLIGENCE, 1995, 992 : 127 - 138
  • [29] Suspect face retrieval using visual and linguistic information
    Jalal, Anand Singh
    Sharma, Dilip Kumar
    Sikander, Bilal
    VISUAL COMPUTER, 2023, 39 (07): : 2609 - 2635
  • [30] THE PROFESSIONS MODELS OF INFORMATION - A COGNITIVE LINGUISTIC ANALYSIS
    GREEN, R
    JOURNAL OF DOCUMENTATION, 1991, 47 (02) : 130 - 148