COMPUTATIONAL MODELS FOR INTEGRATING LINGUISTIC AND VISUAL INFORMATION - A SURVEY

Cited by: 22
Author
SRIHARI, RK [1]
Affiliation
[1] SUNY BUFFALO,DEPT COMP SCI,BUFFALO,NY 14228
Keywords
NATURAL LANGUAGE UNDERSTANDING; COMPUTER VISION; DIAGRAM UNDERSTANDING; SPATIAL REASONING; MULTIMEDIA;
DOI
10.1007/BF00849725
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper surveys research in developing computational models for integrating linguistic and visual information. It begins with a discussion of systems which have been actually implemented and continues with computationally motivated theories of human cognition. Since existing research spans several disciplines (e.g., natural language understanding, computer vision, knowledge representation), as well as several application areas, an important contribution of this paper is to categorize existing research based on inputs and objectives. Finally, some key issues related to integrating information from two such diverse sources are outlined and related to existing research. Throughout, the key issue addressed is the correspondence problem, namely how to associate visual events with words and vice versa.
Pages: 349-369
Number of pages: 21