Conceptual representations between video signals and natural language descriptions

Cited by: 16
Authors
Arens, M. [1]
Gerber, R. [1]
Nagel, H.-H. [1]
Affiliation
[1] Univ Karlsruhe TH, Fak Informat, Inst Algorithmen & Kognit Syst, D-76128 Karlsruhe, Germany
Keywords
cognitive vision; knowledge representation
DOI
10.1016/j.imavis.2005.07.026
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
An artificial cognitive vision system associates video signals with conceptual descriptions of the depicted time-varying scene. This linkage is mediated by knowledge representation formalisms. An experimental implementation of such an approach yielded initial results for the conceptual description of videos recorded at inner-city traffic scenes, see [M. Haag, H.-H. Nagel, Incremental recognition of traffic situations from video image sequences, Image and Vision Computing 18 (2) (2000) 137-153]. Accumulating experience with this system approach and its extension for the generation of natural language texts from videos caused us to redesign the overall computer vision system as well as the knowledge representation formalisms utilised within that system. © 2006 Elsevier B.V. All rights reserved.
Pages: 53-66
Number of pages: 14