Dynamically structuring, updating and interrelating representations of visual and linguistic discourse context

被引:18
|
作者
Kelleher, J [1 ]
Costello, F
van Genabith, J
机构
[1] Deutsch Forsch Zentrum Kunstl Intelligenz, Saarbrucken, Germany
[2] Univ Coll Dublin, Dublin 2, Ireland
[3] Dublin City Univ, Dublin 9, Ireland
关键词
visual salience; reference resolution; generating referring expressions; discourse context; cross-modal representations; synthetic vision;
D O I
10.1016/j.artint.2005.04.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fundamental claim of this paper is that salience - both visual and linguistic - is an important overarching semantic category structuring visually situated discourse. Based on this we argue that computer systems attempting to model the evolving context of a visually situated discourse should integrate models of visual and linguistic salience within their natural language processing (NLP) framework. The paper highlights the importance of dynamically updating and interrelating visual and linguistic discourse context representations. To support our approach, we have developed a real-time, natural language virtual reality (NLVR) system (called LIVE, for Linguistic Interaction with Virtual Environments) that implements an NLP framework based on both visual and linguistic salience. Within this framework saliency information underpins two of the core subtasks of NLP: reference resolution and the generation of referring expressions. We describe the theoretical basis and architecture of the LIVE NLP framework and present extensive evaluation results comparing the system's performance with that of human participants in a number of experiments. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:62 / 102
页数:41
相关论文
共 50 条
  • [31] The Development Discourse during Socialist Romania in Visual Representations of the Urban Area
    Ilovan, Oana-Ramona
    [J]. JOURNAL OF URBAN HISTORY, 2022, 48 (04) : 861 - 895
  • [32] Attention-based Integration of Visual Context in Place Representations
    Soyer, Cagatay
    [J]. COGNITIVE PROCESSING, 2021, 22 (SUPPL 1) : 36 - 36
  • [33] The Influence of Visual Representations and Context on Mathematical Word Problem Solving
    Cankoy, Osman
    Ozder, Hasan
    [J]. PAMUKKALE UNIVERSITESI EGITIM FAKULTESI DERGISI-PAMUKKALE UNIVERSITY JOURNAL OF EDUCATION, 2011, (30): : 91 - 100
  • [34] Health movement in French in a minority linguistic context: the representations of players on the future of services
    Bouchard, Louise
    [J]. CANADIAN REVIEW OF SOCIOLOGY-REVUE CANADIENNE DE SOCIOLOGIE, 2011, 48 (02): : 203 - 215
  • [35] The visual basis of linguistic meaning and its implications for critical discourse studies: Integrating cognitive linguistic and multimodal methods
    Hart, Christopher
    [J]. DISCOURSE & SOCIETY, 2016, 27 (03) : 335 - 350
  • [37] Recommending Themes for Ad Creative Design via Visual-Linguistic Representations
    Zhou, Yichao
    Mishra, Shaunak
    Verma, Manisha
    Bhamidipati, Narayan
    Wang, Wei
    [J]. WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2521 - 2527
  • [38] Bridging between emojis and kaomojis by learning their representations from linguistic and visual information
    Kwon, Jingun
    Kobayashi, Naoki
    Kamigaito, Hidetaka
    Takamura, Hiroya
    Okumura, Manabu
    [J]. Proceedings - 2019 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2019, 2019, : 116 - 123
  • [39] Bridging Between Emojis and Kaomojis by Learning Their Representations from Linguistic and Visual Information
    Kwon, Jingun
    Kobayashi, Naoki
    Kamigaito, Hidetaka
    Takamura, Hiroya
    Okumura, Manabu
    [J]. 2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 116 - 123
  • [40] Language matters: representations of 'heart failure' in English discourse - a large-scale linguistic study
    Demmen, Jane
    Hartshorne-Evans, Nick
    Semino, Elena
    Sankaranarayanan, Rajiv
    [J]. OPEN HEART, 2022, 9 (01):