Conceptual representations between video signals and natural language descriptions

被引:16
|
作者
Arens, M. [1 ]
Gerber, R. [1 ]
Nagel, H. -H. [1 ]
机构
[1] Univ Karlsruhe TH, Fak Informat, Inst Algorithmen & Kognit Syst, D-76128 Karlsruhe, Germany
关键词
cognitive vision; knowledge representation;
D O I
10.1016/j.imavis.2005.07.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An artificial cognitive vision system associates video signals with conceptual descriptions of the depicted time-varying scene. This linkage is mediated by knowledge representation formalisms. An experimental implementation of such an approach yielded initial results for the conceptual description of videos recorded at innercity traffic scenes, see [M. Haag, H.-H. Nagel, Incremental recognition of traffic situations from video image sequences, Image and Vision Computing 18 (2) (2000) 137-153]. Accumulating experience with this system approach and its extension for the generation of natural language texts from videos caused us to redesign the overall computer vision system as well as the knowledge representation formalisms utilised within that system. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 66
页数:14
相关论文
共 50 条
  • [1] Translating Video Content to Natural Language Descriptions
    Rohrbach, Marcus
    Qiu, Wei
    Titov, Ivan
    Thater, Stefan
    Pinkal, Manfred
    Schiele, Bernt
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 433 - 440
  • [2] Video Event Understanding using Natural Language Descriptions
    Ramanathan, Vignesh
    Liang, Percy
    Li Fei-Fei
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 905 - 912
  • [3] A framework for creating natural language descriptions of video streams
    Khan, Muhammad Usman Ghani
    Al Harbi, Nouf
    Gotoh, Yoshihiko
    [J]. INFORMATION SCIENCES, 2015, 303 : 61 - 82
  • [4] Natural language descriptions of human Behavior from video sequences
    Tena, Carles Fernandez
    Baiget, Pau
    Roca, Xavier
    Gonzalez, Jordi
    [J]. KI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4667 : 279 - +
  • [5] Analyzing the Gap Between Workflows and their Natural Language Descriptions
    Groth, Paul
    Gil, Yolanda
    [J]. 2009 IEEE CONGRESS ON SERVICES (SERVICES-1 2009), VOLS 1 AND 2, 2009, : 299 - 305
  • [6] Summarizing Conceptual Descriptions using Knowledge Representations
    Harispe, Sebastien
    Montmain, Jacky
    Medjkoune, Massissilia
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [7] Learning Spatial-Semantic Representations from Natural Language Descriptions and Scene Classifications
    Hemachandra, Sachithra
    Walter, Matthew R.
    Tellex, Stefanie
    Teller, Seth
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 2623 - 2630
  • [8] Learning Unified Video-Language Representations via Joint Modeling and Contrastive Learning for Natural Language Video Localization
    Cui, Chenhao
    Liang, Xinnian
    Wu, Shuangzhi
    Li, Zhoujun
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [9] From Sensory Signals to Modality-Independent Conceptual Representations: A Probabilistic Language of Thought Approach
    Erdogan, Goker
    Yildirim, Ilker
    Jacobs, Robert A.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (11)
  • [10] Vision signals and the language of vision descriptions in the prophets
    Carver, Daniel E.
    [J]. JOURNAL FOR THE STUDY OF THE OLD TESTAMENT, 2021, 45 (03) : 371 - 387